Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
This project helps you create high-fidelity audio clips from written descriptions. You provide a text prompt describing the sound you want, and it generates an audio file. This is ideal for sound designers, content creators, or educators who need custom sound effects or ambient audio for various projects.
669 stars. No commits in the last 6 months.
Use this if you need to generate specific sound effects or background audio from text descriptions for creative projects or multimedia content.
Not ideal if you need to generate human speech or voices, as its use for that purpose without consent is prohibited.
Stars
669
Forks
92
Language
Python
License
MIT
Category
Last pushed
May 22, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Text-to-Audio/Make-An-Audio"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrunaAI/pruna
Pruna is a model optimization framework built for developers, enabling you to deliver faster,...
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.