Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
This tool helps convert spoken audio into written text, a process known as speech recognition or transcription. It takes audio input from a microphone or an audio file and outputs the corresponding words. This is useful for anyone needing to analyze spoken content, create captions, or automate interactions based on voice commands.
8,959 stars. Used by 19 other packages. Actively maintained with 26 commits in the last 30 days. Available on PyPI.
Use this if you need to transcribe audio files or real-time speech into text for various applications, such as meeting summaries, voice assistants, or content analysis.
Not ideal if you need highly specialized, domain-specific speech recognition without needing to integrate with multiple existing speech-to-text services.
Stars
8,959
Forks
2,434
Language
Python
License
BSD-3-Clause
Category
Last pushed
Mar 11, 2026
Commits (30d)
26
Dependencies
3
Reverse dependents
19
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Uberi/speech_recognition"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Recent Releases
Related tools
cmusphinx/pocketsphinx
A small speech recognizer
tensorflow/lingvo
Lingvo
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition
istupakov/onnx-asr
A lightweight Python package for Automatic Speech Recognition using ONNX models