nari-labs/dia2

TTS model capable of streaming conversational audio in realtime.

47 / 100 (Emerging)

This tool generates realistic conversational audio from text and streams it as it is produced, which suits interactive applications. You provide written dialogue, optionally with an existing audio sample to set the voice, and it starts producing speech before the full text has been processed. It is aimed at developers building real-time voice assistants, interactive characters, or speech-to-speech systems.

1,100 stars.

Use this if you need to generate human-like spoken dialogue in real time, where audio starts playing as soon as the first few words are available, rather than waiting for the full text.

Not ideal if you need consistently high-quality audio in one fixed voice across all generations without prior voice conditioning, or if you need to generate audio longer than two minutes.

AI voice generation, conversational AI, real-time audio, speech synthesis, interactive systems
No package, no dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 13 / 25
Community 18 / 25


Stars: 1,100
Forks: 91
Language: Python
License: Apache-2.0
Last pushed: Nov 29, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/nari-labs/dia2"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
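The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, which is inferred from the single URL above rather than from documented behavior. The names `quality_url` and `fetch_quality` are hypothetical helpers, and the response fields are not described here, so the decoded body is returned as-is.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow a <category>/<owner>/<repo> pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    segments = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(s, safe="") for s in segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data and decode it, assuming a JSON response body."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "nari-labs", "dia2")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it may be worth caching the result locally rather than fetching on every run.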
