frothywater/kanade-tokenizer

Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative modeling.

/ 100

Emerging

This tool helps researchers and engineers working with spoken language models convert raw audio into a compact, numerical representation. You provide audio files as input, and it outputs disentangled speech tokens that can be used for tasks like voice synthesis or speech recognition. It's designed for those developing or training advanced speech-related AI.

Use this if you need to process spoken audio into discrete, manageable tokens for developing generative or discriminative speech models.

Not ideal if you're looking for a direct, end-user application for transcribing audio to text or generating speech without developing models yourself.

spoken-language-modeling speech-synthesis speech-recognition audio-processing AI-model-development

No License No Package No Dependents

Maintenance 10 / 25

Adoption 9 / 25

Maturity 5 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

guillaume-be/rust-tokenizers

Rust-tokenizer offers high-performance tokenizers for modern language models, including...

sugarme/tokenizer

NLP tokenizers written in Go language

elixir-nx/tokenizers

Elixir bindings for 🤗 Tokenizers

openscilab/tocount

ToCount: Lightweight Token Estimator

reinfer/blingfire-rs

Rust wrapper for the BlingFire tokenization library

Explore NLP Tools

All categories Trending NLP directory Insights