frothywater/kanade-tokenizer

Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative modeling.

37
/ 100
Emerging

This tool helps researchers and engineers working with spoken language models convert raw audio into a compact, numerical representation. You provide audio files as input, and it outputs disentangled speech tokens that can be used for tasks like voice synthesis or speech recognition. It's designed for those developing or training advanced speech-related AI.

Use this if you need to process spoken audio into discrete, manageable tokens for developing generative or discriminative speech models.

Not ideal if you're looking for a direct, end-user application for transcribing audio to text or generating speech without developing models yourself.

spoken-language-modeling speech-synthesis speech-recognition audio-processing AI-model-development
No License No Package No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 5 / 25
Community 13 / 25

How are scores calculated?

Stars

85

Forks

11

Language

Python

License

Last pushed

Feb 03, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/frothywater/kanade-tokenizer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.