Speech To Text Transcription Voice AI Tools

There are 31 speech to text transcription tools tracked. 2 score above 50 (established tier). The highest-rated is AbdullahHendy/live-translation at 56/100 with 13 stars.

Get all 31 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speech-to-text-transcription&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM...

56
Established
2 i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

51
Established
3 512z/podlens

Free Podwise: AI Podcast & Youtube Transcription & Understanding Agent |...

47
Emerging
4 Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

44
Emerging
5 aws-solutions/content-localization-on-aws

Automatically generate multi-language subtitles using AWS AI/ML services....

43
Emerging
6 fizamusthafa/whisper-app

This repository contains a web application for multi-lingual transcription...

42
Emerging
7 AEmotionStudio/ComfyUI-FFMPEGA

Intelligent FFMPEG agent node for ComfyUI - transforms natural language...

40
Emerging
8 gkrsv/split_audio

A rough and ready Python utility which splits audio files based on silence...

38
Emerging
9 i4Ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.

38
Emerging
10 vdutts7/ai-rapper

Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise...

33
Emerging
11 stevenlawton/GPT-Whisper-captions

Automate subtitle generation for videos using OpenAI's Whisper API and...

29
Experimental
12 nalbion/whisper-server

streaming speech to text server using Whisper

28
Experimental
13 Jayem-11/Swahili_speech_to_text

Speech to Text for Swahili Language with Whisper-small.

27
Experimental
14 lukereichold/SpeechTimestamper

Generate an accurate, timestamped transcript given an audio file and its...

26
Experimental
15 RizhongLin/PolyglotWhisperer

Transcribe, translate, and learn — Whisper + LLM video pipeline with dual...

25
Experimental
16 somosnlp/wav2vec2-spanish

Pre-train a Spanish Wav2Vec2 model using the Spanish portion of the Common...

25
Experimental
17 uqqu/sync_book

audiobook generator with smart personalized translation

22
Experimental
18 LexMainye/Kasuku-Transcriber

A speech to text web app for people with speech impairments that has support...

22
Experimental
19 stellarloop/video2text

Python API & command-line tool to easily transcribe speech-based video files...

21
Experimental
20 eray-yuztyurk/python-ai-audio-transcriber-summarizer

AI-powered tool for fast, accurate audio transcription and summarization....

20
Experimental
21 Jyotibrat/Speech-To-Text

Speech to Text model

19
Experimental
22 danielsobrado/audio-processor

Audio processor, focused on english and arabic with diarization and summarization

19
Experimental
23 udit-rawat/whisper-space

An ASR Gradio GUI based project that transcript the audion and provides NLP...

17
Experimental
24 antarades/emotion-aware-automatic-speech-recognition

An intelligent speech recognition system that combines OpenAI's Whisper for...

13
Experimental
25 MaharshPatelX/Speechitive

A Video analytics tool converting videos to M3U8 playlists using HLS and...

12
Experimental
26 singleshade8/japanese-subtitle-generator

GPU-accelerated Japanese → English subtitle generator using faster-whisper...

11
Experimental
27 bivex/voice_to_text

A Python application for real-time Russian voice-to-text transcription and...

11
Experimental
28 Ashishkumar-hub/Text-to-Speech-using-gTTS

Text to speech conversion NLP based end to end project

11
Experimental
29 ChaoticByte/audio-summarize

An audio summarizer (faster-whisper and BART glued together)

11
Experimental
30 zyovo-0829/LingxiCouplet

基于 FastAPI、Vue3、Whisper 和 DeepSeek API 的 AI 语音对联互动系统,支持语音输入、自动生成下联和智能评分。An...

10
Experimental
31 kalindasiaminwe/ChitongaASR

A natural language processing and machine learning project for a low...

10
Experimental

Comparisons in this category