EmulationAI/awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Score: 39 / 100 (Emerging)

This resource is a curated list of papers and open-source implementations that apply large AI models to audio tasks, ranging from speech transcription to music generation and language translation. Researchers, audio engineers, and developers working with audio data can use it to follow recent advances and find practical tools.

726 stars.

Use this if you are a researcher or practitioner in audio AI looking for the latest developments, benchmarks, and open-source implementations of large audio models for tasks like speech recognition, synthesis, music generation, or translation.

Not ideal if you want an off-the-shelf application that processes audio directly, with no implementation work on your part.

Tags: audio-processing, speech-recognition, music-generation, speech-synthesis, language-translation
No License · No Package · No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 15 / 25
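
The page does not state how the overall score is derived, but the four sub-scores listed above are consistent with a plain sum. A minimal sketch of that arithmetic, assuming the total is simply the four category scores added together (the variable names are illustrative only):

```python
# Sub-scores as listed on the page, each out of 25.
sub_scores = {
    "Maintenance": 6,
    "Adoption": 10,
    "Maturity": 8,
    "Community": 15,
}
MAX_PER_CATEGORY = 25

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(sub_scores.values())
max_total = MAX_PER_CATEGORY * len(sub_scores)

print(f"{total} / {max_total}")
```

Under that assumption the result matches the 39 / 100 shown at the top of the listing.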


Stars: 726
Forks: 48
Language: not listed
License: none
Last pushed: Oct 16, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/EmulationAI/awesome-large-audio-models"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
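
The same request can be made from a script. The sketch below uses only the Python standard library and rebuilds the URL from the curl command above. The function names are hypothetical, and it assumes the endpoint returns JSON, which the page does not confirm; the response fields are not documented here, so none are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality endpoint URL for one repository (hypothetical helper)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data without an API key.

    Assumption: the response body is JSON. The keyless tier is limited
    to 100 requests/day, so avoid calling this in a tight loop.
    """
    request = urllib.request.Request(
        build_quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("llm-tools", "EmulationAI", "awesome-large-audio-models")` issues the same GET request as the curl command. Caching the result locally is a reasonable way to stay within the daily limit.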