MOSS-TTS and MOSS-Speech

These are complementary components of an end-to-end speech processing pipeline: MOSS-Speech handles speech-to-speech understanding and generation, while MOSS-TTS provides the specialized text-to-speech synthesis layer needed to produce high-fidelity audio output.

MOSS-TTS
55
Established
MOSS-Speech
44
Emerging
Maintenance 17/25
Adoption 10/25
Maturity 11/25
Community 17/25
Maintenance 10/25
Adoption 10/25
Maturity 15/25
Community 9/25
Stars: 922
Forks: 82
Downloads:
Commits (30d): 16
Language: Python
License: Apache-2.0
Stars: 127
Forks: 7
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
No Package No Dependents
No Package No Dependents

About MOSS-TTS

OpenMOSS/MOSS-TTS

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

The MOSS-TTS Family helps you create incredibly realistic and expressive speech and sound effects from text. You provide text and receive high-quality audio that sounds like a real person, handles multiple speakers, and can even generate unique voices or environmental sounds. This is perfect for content creators, game developers, virtual assistant designers, and anyone needing advanced audio generation.

content-creation voice-acting game-audio virtual-assistants audio-production

About MOSS-Speech

OpenMOSS/MOSS-Speech

MOSS-Speech is a true speech-to-speech large language model without text guidance.

This project helps create direct, natural voice-to-voice interactions for spoken applications. You provide spoken input, and it responds directly with spoken output, without ever converting to text in between. It's designed for anyone building interactive voice assistants, dialogue systems, or real-time spoken translation tools.

voice-assistants spoken-dialogue-systems real-time-voice-interaction speech-technology conversational-AI

Scores updated daily from GitHub, PyPI, and npm data. How scores work