EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
This resource provides a comprehensive guide to using large AI models for various audio tasks. It collects papers and open-source implementations, covering everything from transcribing speech to generating music and translating languages. Researchers, audio engineers, and developers working with audio data can use this to understand the latest advancements and find practical tools.
726 stars.
Use this if you are a researcher or practitioner in audio AI looking for the latest developments, benchmarks, and open-source implementations of large audio models for tasks like speech recognition, synthesis, music generation, or translation.
Not ideal if you are looking for an off-the-shelf application to directly process your audio without any technical implementation.
Stars: 726
Forks: 48
Language: —
License: —
Category:
Last pushed: Oct 16, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/EmulationAI/awesome-large-audio-models"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
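The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which the page does not state. How an API key is passed is not documented here, so the sketch covers only the keyless request.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay inside it.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. The response schema is not
    documented on this page, so the decoded object is returned unchanged.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("EmulationAI", "awesome-large-audio-models")` performs a real network request, so each call counts against the daily quota.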
Higher-rated alternatives
chrisliu298/awesome-llm-unlearning
A resource repository for machine unlearning in large language models
worldbench/awesome-vla-for-ad
🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
zjukg/KG-MM-Survey
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
worldbench/awesome-spatial-intelligence
🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems