haidog-yaqub/EzAudio

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

46 / 100

Emerging

This project helps sound designers, content creators, and educators quickly generate high-quality audio from text descriptions. You provide a text prompt describing the desired sound, and it outputs a corresponding audio file. It also supports editing existing audio, such as replacing a section according to a new text prompt, and generating audio that matches a reference.

330 stars.

Use this if you need to create realistic sound effects, background audio, or short audio clips from text without recording or extensive sound design software.

Not ideal if you require precise musical composition, human speech generation, or extremely long-form audio content.

sound design, content creation, audio production, media editing, education resources

No Package, No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 14 / 25


Stars: 330

Forks:

Language: Python

License: MIT

Higher-rated alternatives

open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄:...

jdh-algo/JoyVASA

Diffusion-based Portrait and Animal Animation

404-Repo/404-gen-blender-add-on

Blender add-on for 404-GEN 3D generator running on Bittensor

linzhiqiu/t2v_metrics

Evaluating text-to-image/video/3D models with VQAScore

TIGER-AI-Lab/AnyV2V

Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
