EmulationAI/awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

/ 100

Emerging

This resource provides a comprehensive guide to using large AI models for various audio tasks. It collects papers and open-source implementations, covering everything from transcribing speech to generating music and translating languages. Researchers, audio engineers, and developers working with audio data can use this to understand the latest advancements and find practical tools.

726 stars.

Use this if you are a researcher or practitioner in audio AI looking for the latest developments, benchmarks, and open-source implementations of large audio models for tasks like speech recognition, synthesis, music generation, or translation.

Not ideal if you are looking for an off-the-shelf application to directly process your audio without any technical implementation.

audio-processing speech-recognition music-generation speech-synthesis language-translation

No License, No Package, No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 15 / 25


Stars: 726

Forks

Language: —

License: —

Higher-rated alternatives

chrisliu298/awesome-llm-unlearning

A resource repository for machine unlearning in large language models

worldbench/awesome-vla-for-ad

🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

zjukg/KG-MM-Survey

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

worldbench/awesome-spatial-intelligence

🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems
