fixie-ai/ultravox

A fast multimodal LLM for real-time voice

51
/ 100
Established

Ultravox helps you build applications that can understand and respond to human speech in real-time, without any noticeable delay. It takes in live audio input and instantly provides a text transcription, making it perfect for interactive voice agents. Developers building voice-enabled tools and platforms would use this to create highly responsive conversational experiences.

4,377 stars.

Use this if you need an AI model that can process live spoken words and immediately output text for extremely fast, natural voice interactions.

Not ideal if your primary need is for offline audio transcription or if you're looking for a model that outputs spoken responses directly, as this currently outputs text.

voice-AI real-time-transcription conversational-AI speech-recognition voice-assistants
No Package No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

4,377

Forks

367

Language

Python

License

MIT

Last pushed

Dec 12, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/fixie-ai/ultravox"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.