Saganaki22/ComfyUI-Step_Audio_EditX_TTS

ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice cloning and audio editing with emotion, style, speed control, and more.

41
/ 100
Emerging

This tool helps creative professionals and content creators generate natural-sounding speech in any voice from just a short audio sample. You provide a text script and a brief voice recording, and it produces new audio spoken in that cloned voice. It also allows you to modify existing audio to change emotion, style, speed, or add effects, making it ideal for podcasters, animators, game developers, or marketers.

Use this if you need to create consistent voiceovers for long-form content, generate character voices for media, or modify audio recordings to express different emotions or styles.

Not ideal if you require editing audio segments longer than 30 seconds for style or emotion, as these need to be manually split first.

voice-cloning audio-production content-creation media-localization game-development
No Package No Dependents
Maintenance 6 / 25
Adoption 8 / 25
Maturity 13 / 25
Community 14 / 25

How are scores calculated?

Stars

57

Forks

8

Language

Python

License

Apache-2.0

Last pushed

Dec 04, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Saganaki22/ComfyUI-Step_Audio_EditX_TTS"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.