lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

/ 100

Established

This project helps researchers and developers explore and modify text-to-speech technology. It takes plain text and, optionally, a speech sample, and outputs generated speech either in one of several voices or in a voice matching the sample. It is aimed at machine learning researchers, audio engineers, and anyone who wants to customize or understand text-to-speech systems.

186 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are a researcher or developer looking for a straightforward, adaptable text-to-speech system to experiment with or build upon.

Not ideal if you need a production-ready, highly accurate text-to-speech system for commercial applications without further training.

speech-synthesis audio-generation voice-cloning machine-learning-research digital-audio-processing

Stale 6m

Maintenance 2 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 15 / 25


Stars: 186

Forks:

Language: Python

License: MIT

Related tools

TensorSpeech/TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...

keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...

jxzhanggg/nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

rishikksh20/FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
