Bai-YT/ConsistencyTTA

ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

23
/ 100
Experimental

This project helps audio creators and content producers generate high-quality audio clips from simple text descriptions. You input a text prompt like "Food sizzling with some knocking and banging followed by a dog barking" and get a 10-second audio file. This tool is ideal for sound designers, game developers, or marketers who need unique sound effects quickly without extensive audio engineering.

No commits in the last 6 months.

Use this if you need to rapidly create specific sound effects or ambient audio from text descriptions with significantly reduced generation time.

Not ideal if you require precise musical composition or human-like speech synthesis, as this tool focuses on sound effects and environmental audio.

sound-design content-creation game-audio text-to-audio media-production
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

39

Forks

Language

Python

License

MIT

Last pushed

Nov 20, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Bai-YT/ConsistencyTTA"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.