JIA-Lab-research/MGM-Omni

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech

Score: 49 / 100 (Emerging)

MGM-Omni is an omni-chatbot that accepts long audio, video, images, and text as input and responds in both text and natural-sounding speech. It can also clone a voice from a short clip. It is aimed at creators, educators, and businesses that need to process complex multimedia conversations and generate personalized audio.

265 stars.

Use this if you need an AI assistant that can understand and respond to conversations involving lengthy speech, videos, images, and text, and generate custom voice responses.

Not ideal if your primary need is simple text-to-text interaction or if you require an AI that only handles short audio clips.

Tags: multimedia-interaction, voice-cloning, long-form-audio-processing, ai-assistants, content-creation
No package. No dependents.
Maintenance 13 / 25
Adoption 10 / 25
Maturity 15 / 25
Community 11 / 25

Stars: 265
Forks: 16
Language: Python
License: Apache-2.0
Last pushed: Mar 16, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/JIA-Lab-research/MGM-Omni"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
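
The same request can be made from a script. The sketch below uses only the Python standard library and reuses the URL from the curl command above. The helper names `build_quality_url` and `fetch_quality` are hypothetical. The response is assumed to be JSON, and its fields are not documented here, so the code returns the parsed body without interpreting it.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL copied from the curl command above. The "transformers" path
# segment is reproduced as-is; its meaning is not documented on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def build_quality_url(owner: str, repo: str) -> str:
    """Return the quality endpoint URL for an owner/repo pair."""
    # Percent-encode each path segment so unusual characters stay valid.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and parse the body as JSON (an assumption)."""
    request = urllib.request.Request(
        build_quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("JIA-Lab-research", "MGM-Omni")` performs a real network request and counts against the daily request limit, so results are worth caching if the script runs often.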