Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Quality score: 85 / 100 · Verified

This library converts spoken audio into written text, a process known as speech recognition or transcription. It takes audio from a microphone or an audio file and returns the recognized words. This is useful for analyzing spoken content, creating captions, or automating interactions based on voice commands.

8,959 stars. Used by 19 other packages. Actively maintained with 26 commits in the last 30 days. Available on PyPI.

Use this if you need to transcribe audio files or real-time speech into text for various applications, such as meeting summaries, voice assistants, or content analysis.

Not ideal if you need highly specialized, domain-specific speech recognition and do not need to integrate with multiple existing speech-to-text services.
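For the audio-file use case mentioned above, a minimal sketch might look like the following. It assumes the package is installed from PyPI (`pip install SpeechRecognition`) and uses the library's `Recognizer`, `AudioFile`, and `recognize_google` calls; the helper name `transcribe_file` and the file name in the usage note are hypothetical, and `recognize_google` sends audio to an online service, so it needs network access.

```python
def transcribe_file(path: str, language: str = "en-US") -> str:
    """Transcribe a WAV/AIFF/FLAC file; returns "" if speech is unintelligible.

    `transcribe_file` is an illustrative helper, not part of the library.
    """
    # Imported here so this module still loads where the package is missing.
    import speech_recognition as sr

    recognizer = sr.Recognizer()

    # AudioFile is a context manager that opens the file as an audio source.
    with sr.AudioFile(path) as source:
        audio = recognizer.record(source)  # read the whole file into memory

    try:
        # Online engine; other recognize_* methods target other engines.
        return recognizer.recognize_google(audio, language=language)
    except sr.UnknownValueError:
        # The engine could not make out any words.
        return ""
    except sr.RequestError as exc:
        # The service was unreachable or rejected the request.
        raise RuntimeError(f"speech service request failed: {exc}") from exc
```

Calling `transcribe_file("meeting.wav")` would return the recognized text as a string. Returning an empty string for unintelligible audio is one design choice; re-raising the exception may suit pipelines that need to tell silence apart from failure.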

Tags: audio-transcription, voice-to-text, meeting-minutes, content-accessibility, voice-user-interface
Maintenance 20 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 25 / 25
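
The four component scores above add up to the 85 / 100 shown at the top of the page. The page does not state the formula, so treating the total as a plain sum of four components capped at 25 each is an assumption that merely matches the published numbers:

```python
# Component scores as listed on this page, each out of 25.
components = {"Maintenance": 20, "Adoption": 15, "Maturity": 25, "Community": 25}
MAX_PER_COMPONENT = 25

# Assumption: the headline score is the unweighted sum of the components.
total = sum(components.values())
maximum = MAX_PER_COMPONENT * len(components)

print(f"{total} / {maximum}")  # prints "85 / 100"
```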


Stars: 8,959
Forks: 2,434
Language: Python
License: BSD-3-Clause
Last pushed: Mar 11, 2026
Commits (30d): 26
Dependencies: 3
Reverse dependents: 19

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Uberi/speech_recognition"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
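
The same request can be sketched in Python with only the standard library. Two things here are assumptions rather than documented behavior: that the path follows a `category/owner/repo` pattern (inferred from the single URL above) and that the endpoint returns JSON. The function names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    segments = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(s, safe="") for s in segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (assumed to be JSON)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("voice-ai", "Uberi", "speech_recognition")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is worth considering for repeated lookups.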