OpenMOSS/MOSS-Speech

MOSS-Speech is a true speech-to-speech large language model without text guidance.


Emerging

This project enables direct voice-to-voice interaction for spoken applications. You provide spoken input and it responds with spoken output, without converting to text in between. It is designed for anyone building interactive voice assistants, dialogue systems, or real-time spoken translation tools.


Use this if you need a speech-to-speech system that offers more natural conversations and avoids the limitations of text-based processing.

Not ideal if your workflow specifically requires a text transcript of the spoken input or output, or if you need to perform text-based analysis.

voice-assistants spoken-dialogue-systems real-time-voice-interaction speech-technology conversational-AI

No Package, No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 9 / 25


Stars: 127

Language: Python

License: Apache-2.0


Higher-rated alternatives

travisvn/chatterbox-tts-api

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate...

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment...

fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible...

OpenMOSS/MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis....
