opendilab/CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. A streaming, full-duplex voice interaction prototype agent implemented in just one file!

/ 100

Emerging

This project helps create a Chinese-speaking virtual assistant that can have natural, real-time voice conversations. You speak into a microphone, and the assistant responds immediately with spoken Chinese, just like talking to a person. It’s designed for researchers and product developers exploring advanced voice interfaces and interactive AI agents.

499 stars.

Use this if you need a high-quality, streaming, full-duplex speech-to-speech interaction agent prototype, especially for Chinese language applications, to quickly test new ideas or demonstrate capabilities.

Not ideal if you need a ready-to-deploy, production-grade conversational AI without any development or integration work, or if your primary need is for written text-based interaction.

conversational-ai voice-assistants human-computer-interaction language-technology interactive-systems

No Package, No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 17 / 25

Stars: 499

Forks:

Language: Python

License: Apache-2.0

Higher-rated alternatives

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

huggingface/speech-to-speech

Build local voice agents with open-source models

linto-ai/WebVoiceSDK

Building blocks for voice-enabled applications in the browser

Picovoice/speech-to-text-benchmark

Speech-to-text benchmark framework

vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models
