ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

46
/ 100
Emerging

This project helps anyone who needs to quickly understand or communicate across different languages in real-time. It takes spoken language in one language and can instantly transcribe it, translate it to text, or translate it to spoken language in another language. It's designed for professionals like international communicators, multilingual content creators, or those facilitating cross-cultural discussions who need immediate, accurate translation.

1,252 stars. No commits in the last 6 months.

Use this if you need an "all-in-one" solution for converting speech to text, translating speech to text, or translating speech to synthesized speech, whether offline or simultaneously.

Not ideal if your primary need is for advanced visual translation, multimodal interactions beyond speech, or if you require support for a very niche language pair not commonly covered.

simultaneous-interpretation voice-transcription multilingual-communication audio-localization live-translation
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

1,252

Forks

102

Language

Python

License

MIT

Last pushed

Jun 29, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ictnlp/StreamSpeech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.