inclusionAI/Ming-UniAudio

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

/ 100

Emerging

This project helps audio content creators and developers work with spoken audio. It takes speech input and can generate new speech, understand spoken content, or edit existing audio based on text instructions. Anyone who needs to produce, analyze, or modify speech, like podcasters, voiceover artists, or researchers, would find this useful.

435 stars.

Use this if you need to perform multiple tasks like transcribing, generating, or editing speech using simple text commands, especially for complex changes without needing to specify exact timestamps.

Not ideal if you only need a basic speech-to-text or text-to-speech tool and don't require advanced editing or combined capabilities.

audio-editing speech-synthesis speech-recognition voice-production podcast-creation

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 13 / 25

How are scores calculated?

Stars

435

Forks

Language

Python

License

MIT

Higher-rated alternatives

Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

PrzemyslawSwiderski/python-gradle-plugin

Gradle plugin to run Python projects.

alphacep/awesome-russian-speech

Russian speech technology links

ftyers/commonvoice-utils

Linguistic processing for Common Voice

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Explore Voice AI Tools

All categories Trending Voice AI directory Insights