JIA-Lab-research/MGM-Omni
MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech
MGM-Omni is an omni-modal chatbot that accepts long audio, video, images, and text as input and responds in both text and natural-sounding speech. It can also clone a voice from a short clip. It is aimed at creators, educators, and businesses that need to process complex multimedia conversations and generate personalized audio.
Use this if you need an AI assistant that can understand and respond to conversations involving lengthy speech, videos, images, and text, and generate custom voice responses.
Not ideal if your primary need is simple text-to-text interaction or if you require an AI that only handles short audio clips.
Stars: 265
Forks: 16
Language: Python
License: Apache-2.0
Category: (none listed)
Last pushed: Mar 16, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/JIA-Lab-research/MGM-Omni"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
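The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and two things are assumptions not confirmed by this page: that the endpoint returns JSON, and that other repositories follow the same `/<owner>/<repo>` path pattern.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for an owner/repo pair (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay valid.
    owner_part = urllib.parse.quote(owner, safe="")
    repo_part = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_part}/{repo_part}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for a repository.

    Assumption: the response body is JSON; the page does not document
    the response schema, so the parsed value is returned as-is.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage (performs a network request, counts against the daily limit):
# data = fetch_quality("JIA-Lab-research", "MGM-Omni")
# print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes it possible to check the URL without spending any of the 100 unauthenticated requests per day.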