biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:

42
/ 100
Emerging

This project offers tools to convert spoken Thai into written text accurately, even when there's background noise or specific vocabulary from fields like finance or medicine. It takes Thai audio recordings, like interviews or voicemails, and produces a written transcript. Anyone needing to process Thai speech, such as content creators, researchers, or business analysts, would find this useful for tasks like subtitling or documentation.

186 stars. No commits in the last 6 months.

Use this if you need to reliably transcribe Thai audio into text, especially if the audio contains domain-specific language or environmental noise.

Not ideal if your primary need is for precise timestamping with the highest accuracy, as dedicated timestamp models might have slightly lower overall transcription accuracy.

Thai speech transcription audio to text content creation research documentation media analysis
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 14 / 25

How are scores calculated?

Stars

186

Forks

20

Language

Jupyter Notebook

License

MIT

Last pushed

Jul 29, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/biodatlab/thonburian-whisper"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.