Ploscha/Awesome-Audio-Generation
Awesome-Audio-Generation is a collection of resources for Text-to-Audio Generation, focusing on ambient sound and music. 🎵 Explore foundational models and contribute your findings to help grow this GitHub community! 🐙
This is a curated collection of papers, code, and resources for Text-to-Audio (TTA) Generation, particularly ambient sound and music. It helps researchers and developers find the latest advancements, methodologies, and practical implementations to convert text descriptions into audio. This resource is for anyone working on creating synthetic audio from textual input, from academic study to building tools.
Use this if you are a researcher or developer looking for a centralized hub of information, models, and code for generating audio from text.
Not ideal if you are an end-user simply looking for a ready-to-use application to generate specific audio files without engaging with research papers or code.
Stars: 12
Forks: not listed
Language: not listed
License: not listed
Category: not listed
Last pushed: Mar 18, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Ploscha/Awesome-Audio-Generation"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
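The same request can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions: the path layout (category, then owner, then repository) is inferred from the single curl URL above, the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the page does not document the response format, so the JSON decoding is a guess.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path is <category>/<owner>/<repo>, as inferred
    from the one example URL shown on this page.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository without an API key.

    Assumption: the endpoint returns a JSON body; the page does not say so.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("voice-ai", "Ploscha", "Awesome-Audio-Generation")
# print(data)
```

Since keyless access is limited to 100 requests/day, it is worth caching results locally rather than calling `fetch_quality` repeatedly for the same repository.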
Higher-rated alternatives
Spr-Aachen/Easy-Voice-Toolkit
A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
PrzemyslawSwiderski/python-gradle-plugin
Gradle plugin to run Python projects.
alphacep/awesome-russian-speech
Russian speech technology links
ftyers/commonvoice-utils
Linguistic processing for Common Voice
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech