snakers4/silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Established

This project offers pre-trained text-to-speech models that convert written text into natural-sounding spoken audio. You input text, select a voice and language, and receive an audio file of that text being spoken aloud. It's designed for anyone needing to generate speech from text, such as content creators, educators, or customer service departments.
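The text-in, audio-out workflow described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's definitive usage. The `torch.hub` entry point `silero_tts`, its `language` and `speaker` arguments, the model ID `v3_en`, and the `apply_tts` method follow the repository's README as best recalled and may differ between releases. The helper names `load_silero_tts`, `write_wav`, and `text_to_wav` are hypothetical.

```python
import sys
import wave
from array import array


def load_silero_tts(language="en", model_id="v3_en"):
    """Load a Silero TTS model through torch.hub (needs torch and network access).

    Assumption: entry point name, arguments, and the (model, example_text)
    return value follow the project's README; verify against the repository.
    """
    import torch  # imported lazily so the helpers below work without torch

    model, _example_text = torch.hub.load(
        repo_or_dir="snakers4/silero-models",
        model="silero_tts",
        language=language,
        speaker=model_id,
    )
    return model


def write_wav(samples, path, sample_rate=48000):
    """Write mono float samples in [-1.0, 1.0] as a 16-bit PCM WAV file."""
    # Clamp each sample, then scale it to the signed 16-bit range.
    pcm = array("h", (int(max(-1.0, min(1.0, s)) * 32767) for s in samples))
    if sys.byteorder == "big":
        pcm.byteswap()  # WAV data is little-endian
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)
        wav_file.setsampwidth(2)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm.tobytes())
    return len(pcm)


def text_to_wav(model, text, voice, path, sample_rate=48000):
    """Synthesize `text` with the chosen voice and save it to `path`.

    Assumption: `model.apply_tts` returns a 1-D sequence (for Silero, a
    torch tensor) of float samples in [-1.0, 1.0].
    """
    audio = model.apply_tts(text=text, speaker=voice, sample_rate=sample_rate)
    samples = audio.tolist() if hasattr(audio, "tolist") else list(audio)
    return write_wav(samples, path, sample_rate)
```

With torch installed, a call such as `text_to_wav(load_silero_tts(), "Hello there.", "en_0", "hello.wav")` would be the expected usage. The voice name `en_0` is likewise an assumption taken from the README's examples.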

5,822 stars. Actively maintained with 5 commits in the last 30 days.

Use this if you need to convert text into natural-sounding speech in multiple languages with minimal setup.

Not ideal if you require highly customized voice cloning or real-time, low-latency conversational AI without any development setup.

audio-content-creation e-learning voice-assistants multilingual-communication accessibility

No Package No Dependents

Maintenance 16 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

Stars: 5,822
Forks: 360
Language: Jupyter Notebook
License: not listed

Recent Releases

v5.5, 03 Feb 2026
v5.4, 30 Jan 2026
v5.2, 22 Nov 2025
v5.1, 30 Oct 2025
v5.0, 30 Oct 2025

Related tools

abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot...

JSchmie/ScrAIbe-WebUI

WebUI for ScrAIbe

isaiahbjork/orpheus-tts-local

Run Orpheus 3B Locally With LM Studio

snakers4/silero-stress

Silero Stress — pre-trained enterprise-grade automated stress and homograph disambiguation for...

MerlinCN/kinoko7danmaku

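Uses TTS to read aloud danmaku (bullet comments), gifts, captain (guard) subscriptions, and other events from Bilibili live streams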
