shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source, offline speech-to-text package for the Bangla language. Fine-tuned on the latest Whisper speech-to-text model for optimal performance.
This tool helps people who work with spoken Bangla convert audio into written text, even without an internet connection. You feed it audio, such as recorded files or live microphone input, and it outputs the spoken words as text. It's useful for transcribing interviews, lectures, or any other spoken content in Bangla.
121 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly and accurately transcribe Bangla speech from various audio formats into text, especially when working offline.
Not ideal if you mainly need languages other than Bangla, or if a highly sensitive application requires a very low Word Error Rate (WER) and you cannot take model size into account when choosing a model.
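Word Error Rate is the word-level edit distance between a reference transcript and the model's output, divided by the number of words in the reference. The sketch below is a minimal, dependency-free way to compute it for your own test recordings. It is not part of BanglaSpeech2Text; the function name `word_error_rate` is hypothetical, and splitting on whitespace is an assumption that ignores punctuation and text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference words.

    Assumption: words are separated by whitespace; no normalization is applied.
    """
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # previous[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current.append(
                min(
                    previous[j] + 1,                      # deletion
                    current[j - 1] + 1,                   # insertion
                    previous[j - 1] + substitution_cost,  # match or substitution
                )
            )
        previous = current

    return previous[-1] / len(ref_words)
```

For example, with the reference "আমি ভাত খাই" and the hypothesis "আমি খাই", one word is deleted out of three reference words, so the function returns 1/3. Note that WER can exceed 1.0 when the hypothesis contains many inserted words.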
Stars: 121
Forks: 18
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 01, 2025
Commits (30d): 0
Dependencies: 5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shhossain/BanglaSpeech2Text"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
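The same request can be made from Python using only the standard library. This is a minimal sketch under two assumptions that the listing does not confirm: that the endpoint returns JSON, and that the path follows the `category/owner/repo` pattern seen in the single URL above. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the category/owner/repo layout
# is inferred from that one URL and is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository.

    Assumption: the response body is JSON. The response schema is not
    documented in this listing, so the parsed value is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a network request, counts toward the daily limit):
# data = fetch_quality("voice-ai", "shhossain", "BanglaSpeech2Text")
# print(json.dumps(data, indent=2, ensure_ascii=False))
```

Because the keyless tier is limited to 100 requests per day, it may be worth caching the result locally rather than calling the endpoint repeatedly.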
Related tools
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning
FL33TW00D/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️