kurianbenoy/whisper_normalizer
A python package for whisper normalizer
This tool helps improve the accuracy of speech-to-text systems by standardizing spoken text into a consistent written form. It takes raw text output from an Automatic Speech Recognition (ASR) system and converts it into a normalized version, making it easier to compare and evaluate against ground truth. This is ideal for speech technology researchers and developers working on ASR metrics and model comparisons.
Used by 2 other packages. No commits in the last 6 months. Available on PyPI.
Use this if you need to standardize text for evaluating Automatic Speech Recognition (ASR) systems, especially if you're working with English or Indic languages like Malayalam.
Not ideal if your primary need is for general natural language processing tasks not related to ASR evaluation or if you require normalization for many low-resource languages beyond those specifically supported.
Stars
76
Forks
17
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Oct 06, 2025
Commits (30d)
0
Dependencies
3
Reverse dependents
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kurianbenoy/whisper_normalizer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI