opendilab/CleanS2S
High-quality, streaming speech-to-speech interactive agent in a single file. A streaming, full-duplex voice-interaction agent prototype implemented in just one file!
This project helps create a Chinese-speaking virtual assistant that can have natural, real-time voice conversations. You speak into a microphone, and the assistant responds immediately with spoken Chinese, just like talking to a person. It’s designed for researchers and product developers exploring advanced voice interfaces and interactive AI agents.
499 stars.
Use this if you need a high-quality, streaming, full-duplex speech-to-speech interaction agent prototype, especially for Chinese language applications, to quickly test new ideas or demonstrate capabilities.
Not ideal if you need a ready-to-deploy, production-grade conversational AI without any development or integration work, or if your primary need is for written text-based interaction.
Stars: 499
Forks: 52
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 15, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/opendilab/CleanS2S"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
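For use from a script, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not the service's official client: the helper names `build_url` and `fetch_quality` and the parameter name `section` are hypothetical, and the code assumes the endpoint returns a JSON body, which the listing does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(section: str, repo: str) -> str:
    """Build the request URL for a repository.

    `section` is the path segment before the repository ("voice-ai" in the
    curl example); its exact meaning is an assumption. `repo` is "owner/name".
    """
    # Percent-encode each part; keep the "/" that separates owner and name.
    return f"{BASE_URL}/{quote(section, safe='')}/{quote(repo, safe='/')}"


def fetch_quality(section: str, repo: str, timeout: float = 10.0):
    """Send the GET request and decode the body.

    Assumes the response is JSON; if it is not, json.loads raises ValueError.
    """
    request = urllib.request.Request(
        build_url(section, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "opendilab/CleanS2S")` issues the same request as the curl command. Because unauthenticated access is limited to 100 requests per day, a script that polls many repositories would want to cache results rather than refetch them.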
Higher-rated alternatives
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
huggingface/speech-to-speech
Build local voice agents with open-source models
linto-ai/WebVoiceSDK
Building blocks for voice-enabled applications in the browser
Picovoice/speech-to-text-benchmark
Speech-to-text benchmark framework
vox-serve/vox-serve
A Streaming-Native Serving Engine for TTS/STS Models