met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs in the browser (WebGPU/WASM) or on a local Node.js WebSocket/REST server (CPU).
This tool helps animators, game developers, and content creators give digital characters a voice by generating natural-sounding speech directly from text. You provide text, and it returns audio along with timing data for phonemes and visemes (mouth shapes). It is aimed at anyone who needs to synchronize a character's lip movements with spoken dialogue without timing the mouth shapes by hand.
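As a rough illustration of how viseme timing data like this can drive lip-sync, the sketch below looks up which mouth shape is active at a given playback time. The `timing` object, its field names (`visemes`, `vtimes`, `vdurations`), the viseme labels, and the `visemeAt` helper are all assumptions made for this example; they are not taken from the HeadTTS documentation, so check the project's README for the actual output format.

```javascript
// Hypothetical timing data: parallel arrays of viseme labels,
// start times (ms), and durations (ms). The field names and values
// are illustrative assumptions, not confirmed HeadTTS output.
const timing = {
  visemes: ["PP", "aa", "sil"],
  vtimes: [0, 120, 300],
  vdurations: [120, 180, 100],
};

// Return the viseme label active at playback time tMs,
// or null when no viseme covers that instant.
function visemeAt(timing, tMs) {
  for (let i = 0; i < timing.visemes.length; i++) {
    const start = timing.vtimes[i];
    const end = start + timing.vdurations[i];
    if (tMs >= start && tMs < end) {
      return timing.visemes[i];
    }
  }
  return null;
}
```

An animation loop could call `visemeAt(timing, audioElement.currentTime * 1000)` on each frame and set the character's mouth shape from the result. A linear scan is enough for short utterances; a binary search over the start times would suit longer ones.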
112 stars. Available on npm.
Use this if you need to generate high-quality, free English speech with detailed lip-sync data for animated characters in a browser-based application or a local Node.js environment.
Not ideal if you need text-to-speech in languages other than English, or if you target older browsers without WebGPU support, where performance will be significantly slower.
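Because WebGPU support affects performance, an application may want to check for it before choosing how to run speech synthesis. The following is a minimal sketch of that check using the standard `navigator.gpu` property. The `pickBackend` function and the `"webgpu"` / `"wasm"` labels are hypothetical names for this example, not part of HeadTTS's API.

```javascript
// Pick a backend label based on whether the WebGPU entry point exists.
// `nav` is normally the browser's `navigator` object; it is passed in
// as a parameter so the function can also run outside a browser.
// Note: the presence of `nav.gpu` is only a first-pass check.
// `navigator.gpu.requestAdapter()` can still resolve to null when no
// suitable GPU adapter is available.
function pickBackend(nav) {
  const hasWebGPU = Boolean(nav && typeof nav === "object" && nav.gpu);
  return hasWebGPU ? "webgpu" : "wasm";
}
```

In a browser this would be called as `pickBackend(navigator)`. Using `pickBackend(globalThis.navigator)` also works in environments where `navigator` may be undefined.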
Stars: 112
Forks: 16
Language: JavaScript
License: MIT
Category:
Last pushed: Dec 08, 2025
Commits (30d): 0
Dependencies: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/met4citizen/HeadTTS"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
Related tools
thewh1teagle/kokoro-onnx
TTS with kokoro and onnx runtime
nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with...
remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch...
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...
lucasjinreal/Kokoros
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with...