ftyers/commonvoice-utils

Linguistic processing for Common Voice


Established

This tool helps linguists and machine learning practitioners prepare text data for training automatic speech recognition (ASR) and text-to-speech (TTS) systems. It takes raw text, often from sources such as Wikipedia or Common Voice datasets, and outputs cleaned, segmented, and phonemized text, along with the alphabet of the language in question. It is aimed at anyone working with under-resourced languages or large text corpora for speech technology development.

No commits in the last 6 months. Available on PyPI.

Use this if you need to standardize, segment, or convert written text into phonetic representations for building or improving speech technology models in various languages.
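As a rough illustration of what "standardize, segment, and convert to phonetic representations" means in practice, the sketch below runs the three steps using only the Python standard library. It does not use this package's API: the function names `clean`, `segment`, and `phonemise` and the tiny `TOY_G2P` table are hypothetical, invented for this example, and a real grapheme-to-phoneme table is language-specific and far larger.

```python
import re
import unicodedata

# Hypothetical toy grapheme-to-phoneme table, for illustration only.
TOY_G2P = {"sh": "ʃ", "ch": "tʃ", "th": "θ"}


def clean(text):
    """Normalise Unicode to NFC and collapse runs of whitespace."""
    text = unicodedata.normalize("NFC", text)
    return re.sub(r"\s+", " ", text).strip()


def segment(text):
    """Split into sentences after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text) if s]


def phonemise(word, table=TOY_G2P):
    """Greedy longest-match lookup; unmapped characters pass through."""
    word = word.lower()
    keys = sorted(table, key=len, reverse=True)
    out = []
    i = 0
    while i < len(word):
        for key in keys:
            if word.startswith(key, i):
                out.append(table[key])
                i += len(key)
                break
        else:
            # No table entry starts here: keep the character as-is.
            out.append(word[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    raw = "  Hello   world.\nShe  chats! "
    for sentence in segment(clean(raw)):
        print(sentence, "->", " ".join(phonemise(w) for w in sentence.split()))
```

A naive sentence splitter like this one breaks on abbreviations such as "Dr." and a flat lookup table ignores context, which is why per-language rules of the kind this tool targets are needed for real corpora.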

Not ideal if you are looking for advanced natural language processing tasks like sentiment analysis, machine translation, or text summarization.

speech-recognition text-to-speech computational-linguistics language-modeling data-preparation

Stale (6 months). No dependents.

Maintenance 0 / 25

Adoption 8 / 25

Maturity 25 / 25

Community 18 / 25

Language: Python

License: AGPL-3.0

Related tools

Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

PrzemyslawSwiderski/python-gradle-plugin

Gradle plugin to run Python projects.

alphacep/awesome-russian-speech

Russian speech technology links

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
