lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
This project helps researchers and developers explore and modify text-to-speech technology. It takes plain text and, optionally, a speech sample, and outputs generated speech either in various voices or in a voice matching the sample. It is aimed at machine learning researchers, audio engineers, and anyone who wants to customize or understand text-to-speech systems.
186 stars. No commits in the last 6 months. Available on PyPI.
Use this if you are a researcher or developer looking for a straightforward, adaptable text-to-speech system to experiment with or build upon.
Not ideal if you need a production-ready, highly accurate text-to-speech system for commercial applications without further training.
Stars: 186
Forks: 21
Language: Python
License: MIT
Category:
Last pushed: Aug 03, 2025
Commits (30d): 0
Dependencies: 13
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lucasnewman/nanospeech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
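The same request can be made from Python. The sketch below is only an illustration using the standard library: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the path as category/owner/repo is inferred from the single URL shown above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build a URL shaped like the curl example.

    Assumption: the three path segments are category, owner, and repo.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and parse the body.

    Assumption: the response body is JSON; the schema is not documented here.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "lucasnewman", "nanospeech")` would issue the same request as the curl command and counts against the 100 requests/day keyless limit.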
Related tools
TensorSpeech/TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
rishikksh20/FastSpeech2
PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech