tarun7r/SpeechAlgo
A Comprehensive Speech Processing Algorithms Library for research and production use
This library helps speech researchers and audio engineers analyze and clean audio recordings. It takes raw audio files and processes them to extract key characteristics like speech presence, pitch, and phonetic features, or to remove background noise. It's designed for someone building speech-enabled applications or conducting academic research into spoken language.
Available on PyPI.
Use this if you need to understand, enhance, or extract specific information from speech recordings for applications like voice assistants, transcription services, or sound analysis.
Not ideal if you're looking for a complete, off-the-shelf speech recognition or natural language processing solution, as this focuses on foundational algorithm implementations.
Stars
15
Forks
2
Language
Python
License
MIT
Category
Last pushed
Oct 25, 2025
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tarun7r/SpeechAlgo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
cmusphinx/pocketsphinx
A small speech recognizer
tensorflow/lingvo
Lingvo
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition