Whisper Speech Transcription Transformer Models

Tools and applications for automatic speech recognition (ASR) and audio transcription using Whisper models. Includes implementations with various interfaces (API, GUI, web), fine-tuning for specific languages/accents, and integration with other AI systems. Does NOT include text-to-speech, voice cloning, audio classification without transcription, or general speech processing unrelated to transcription.

There are 17 whisper speech transcription models tracked. The highest-rated is Arkapravo-Ghosh/speech-to-text at 46/100 with 8 stars.

Get all 17 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=whisper-speech-transcription&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 Arkapravo-Ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

46
Emerging
2 biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo...

42
Emerging
3 scalable-ml-deep-learning/fine_tune_whisper

Fine-Tune Whisper for Italian ASR with transformers

27
Experimental
4 Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages

This project presents a multilingual pipeline for both speech-to-text and...

24
Experimental
5 EdVince/whisper-trtllm

Whisper in TensorRT-LLM

23
Experimental
6 mahiiyh/asr-primer

A complete implementation of an Automatic Speech Recognition (ASR) system...

21
Experimental
7 tomdewildt/whisper-experiment

Experiments using the Whisper model from Open AI

21
Experimental
8 ahmedbesbes/audiolizr

A bentoML-powered API to transcribe audio and make sense of it

21
Experimental
9 romanyn36/whisperx-asr-with-fastapi

WhisperX ASR is a FastAPI-based application for automatic speech...

20
Experimental
10 hasanhalacli/whisper-german-finetuning

Fine-tune OpenAI Whisper for German speech recognition using LoRA with audio...

17
Experimental
11 RAHB-REALTORS-Association/transcriber-describer

Transcribes videos and describes them with OpenAI APIs or local models.

14
Experimental
12 samratrajsharma/OpenAI-Whisper-Fine-Tuned-ASR-using-LoRA-PEFT

End-to-end Hindi Speech AI project for improving ASR accuracy using...

13
Experimental
13 kulsoom-abdullah/Qwen2-VL-Audio-Adapter

Architecture grafting: Extending Qwen2-VL with Whisper encoder for speech...

13
Experimental
14 Ailurus1/ASR-bot

ASR telegram assistant for voice/video messages transcribing

11
Experimental
15 amtam0/speech-timer

Voice Enabled Sport Interval Timer using Transformers models

11
Experimental
16 alfa7g7/audio-educational-pipeline

Pipeline completo para transcripción y corrección de audio educativo con...

10
Experimental
17 lalitdotdev/transcribeX

Transcribe audio in minutes with OpenAI's WhisperV3 and Flash Attention v2 +...

10
Experimental