hcy71o/SC-CNN

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

/ 100

Emerging

This project helps create high-quality, natural-sounding speech from text using the voice of a speaker it has never heard before. By providing a short audio sample of any speaker's voice, it generates speech in that voice from your written text. This is ideal for content creators, audiobook producers, or anyone needing custom voice narration without hiring a voice actor.

No commits in the last 6 months.

Use this if you need to generate spoken audio in a variety of voices from text, especially if you need to mimic a specific voice from a small audio sample.

Not ideal if you primarily need to transcribe audio to text or analyze existing speech for insights, as its focus is on speech generation.

text-to-speech voice-synthesis audiobook-production content-creation narration

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Explore Voice AI Tools

All categories Trending Voice AI directory Insights