Whisper Speech Transcription Transformer Models

Tools and applications for automatic speech recognition (ASR) and audio transcription using Whisper models. Includes implementations with various interfaces (API, GUI, web), fine-tuning for specific languages/accents, and integration with other AI systems. Does NOT include text-to-speech, voice cloning, audio classification without transcription, or general speech processing unrelated to transcription.

There are 17 whisper speech transcription models tracked. The highest-rated is Arkapravo-Ghosh/speech-to-text at 46/100 with 8 stars.

Get all 17 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=whisper-speech-transcription&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Model	Score	Tier	Stars	Language
1	Arkapravo-Ghosh/speech-to-text Speech to Text Transcription using OpenAI Whisper v3 and FastAPI	46	Emerging	8	Python
2	biodatlab/thonburian-whisper Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo...	42	Emerging	186	Jupyter Notebook
3	scalable-ml-deep-learning/fine_tune_whisper Fine-Tune Whisper for Italian ASR with transformers	27	Experimental	11	Jupyter Notebook
4	Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages This project presents a multilingual pipeline for both speech-to-text and...	24	Experimental	3	Jupyter Notebook
5	EdVince/whisper-trtllm Whisper in TensorRT-LLM	23	Experimental	17	C++
6	mahiiyh/asr-primer A complete implementation of an Automatic Speech Recognition (ASR) system...	21	Experimental	—	Jupyter Notebook
7	tomdewildt/whisper-experiment Experiments using the Whisper model from Open AI	21	Experimental	—	Jupyter Notebook
8	ahmedbesbes/audiolizr A bentoML-powered API to transcribe audio and make sense of it	21	Experimental	39	Python
9	romanyn36/whisperx-asr-with-fastapi WhisperX ASR is a FastAPI-based application for automatic speech...	20	Experimental	1	HTML
10	hasanhalacli/whisper-german-finetuning Fine-tune OpenAI Whisper for German speech recognition using LoRA with audio...	17	Experimental	—	Python
11	RAHB-REALTORS-Association/transcriber-describer Transcribes videos and describes them with OpenAI APIs or local models.	14	Experimental	3	Python
12	samratrajsharma/OpenAI-Whisper-Fine-Tuned-ASR-using-LoRA-PEFT End-to-end Hindi Speech AI project for improving ASR accuracy using...	13	Experimental	—	Jupyter Notebook
13	kulsoom-abdullah/Qwen2-VL-Audio-Adapter Architecture grafting: Extending Qwen2-VL with Whisper encoder for speech...	13	Experimental	—	Jupyter Notebook
14	Ailurus1/ASR-bot ASR telegram assistant for voice/video messages transcribing	11	Experimental	—	Python
15	amtam0/speech-timer Voice Enabled Sport Interval Timer using Transformers models	11	Experimental	—	JavaScript
16	alfa7g7/audio-educational-pipeline Pipeline completo para transcripción y corrección de audio educativo con...	10	Experimental	1	Jupyter Notebook
17	lalitdotdev/transcribeX Transcribe audio in minutes with OpenAI's WhisperV3 and Flash Attention v2 +...	10	Experimental	2	TypeScript

Comparisons in this category

speech-to-text and whisperx-asr-with-fastapi (46 vs 20)