Text-to-Audio/Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

/ 100

Emerging

This project helps you create high-fidelity audio clips from written descriptions. You provide a text prompt describing the sound you want, and it generates an audio file. This is ideal for sound designers, content creators, or educators who need custom sound effects or ambient audio for various projects.

669 stars. No commits in the last 6 months.

Use this if you need to generate specific sound effects or background audio from text descriptions for creative projects or multimedia content.

Not ideal if you need to generate human speech or voices, as its use for that purpose without consent is prohibited.

sound-design content-creation audio-generation multimedia sound-effects

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

669

Forks

Language

Python

License

MIT

Higher-rated alternatives

PrunaAI/pruna

Pruna is a model optimization framework built for developers, enabling you to deliver faster,...

bytedance/LatentSync

Taming Stable Diffusion for Lip Sync!

haoheliu/AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

teticio/audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...

ivanvovk/WaveGrad

Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.

Explore Diffusion Models

All categories Trending Diffusion directory Insights