shhossain/BanglaSpeech2Text

BanglaSpeech2Text: an open-source, offline speech-to-text package for the Bangla language, fine-tuned from the latest Whisper speech-to-text model for optimal performance.

51 / 100 (Established)

This tool helps people who work with spoken Bangla convert audio into written text, even without an internet connection. You feed it audio, such as recorded files or live microphone input, and it outputs the spoken words as text. It's useful for transcribing interviews, lectures, or any other spoken content in Bangla.

121 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly and accurately transcribe Bangla speech from various audio formats into text, especially when working offline.

Not ideal if you mainly need languages other than Bangla, or if you need an extremely low Word Error Rate (WER) for highly sensitive applications without weighing the trade-off against model size.

Bangla-transcription audio-processing content-analysis voice-to-text speech-recognition
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 16 / 25
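
The four component scores above add up to the overall 51 / 100. A minimal sketch of that arithmetic, assuming the overall score is a plain, equally weighted sum of the components (the page does not state the formula; the function name `overall_score` is illustrative):

```python
# Component scores as listed on this page, each out of 25.
components = {"Maintenance": 0, "Adoption": 10, "Maturity": 25, "Community": 16}
MAX_PER_COMPONENT = 25


def overall_score(parts):
    """Sum the component scores.

    Assumption: equal weighting, which matches the numbers shown here
    but is not confirmed by the page.
    """
    return sum(parts.values())


total = overall_score(components)
maximum = MAX_PER_COMPONENT * len(components)
print(f"{total} / {maximum}")  # prints "51 / 100"
```

Under this assumption, the zero Maintenance score accounts for 25 of the 49 missing points.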


Stars: 121
Forks: 18
Language: Python
License: Apache-2.0
Last pushed: Mar 01, 2025
Commits (30d): 0
Dependencies: 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shhossain/BanglaSpeech2Text"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
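
The same request can be made from Python with only the standard library. This is a rough sketch, not documented client code: it assumes the endpoint returns JSON and that the path follows a `category/owner/repo` layout, which is inferred from the single URL above. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL.

    Assumption: the category/owner/repo path layout is inferred from
    the one example URL shown, not from API documentation.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record without an API key.

    Assumption: the response body is JSON encoded as UTF-8.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "shhossain", "BanglaSpeech2Text")` issues the same GET request as the curl command. The response fields are not described on this page, so inspect the returned object before relying on any key.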