CMsmartvoice/Unet-TTS

One-shot TTS with Improved Unseen Speaker and Style Transfer

Emerging

This tool helps content creators and developers generate natural-sounding speech in a specific voice and style, including for speakers the model did not see during training. You provide a short audio sample of the target voice and the text you want spoken, and it outputs synthesized speech that mimics the speaker's vocal characteristics and emotional tone. It suits personalized audio content such as voiceovers or virtual-assistant voices, since a single short reference clip is all the speaker data it needs.

No commits in the last 6 months.

Use this if you need to quickly clone a voice and speaking style from a very short audio clip to synthesize new, arbitrary text.

Not ideal if you require extremely high-fidelity, indistinguishable voice cloning for sensitive applications where even minor artificiality is unacceptable.

voice-cloning audio-content-creation speech-synthesis virtual-assistants e-learning-narration

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 15 / 25

Higher-rated alternatives

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing...

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System
