Speech To Text Transcription Voice AI Tools
There are 31 speech to text transcription tools tracked. 2 score above 50 (established tier). The highest-rated is AbdullahHendy/live-translation at 56/100 with 13 stars.
Get all 31 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speech-to-text-transcription&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
AbdullahHendy/live-translation
Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM... |
|
Established |
| 2 |
i4Ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model. |
|
Established |
| 3 |
512z/podlens
Free Podwise: AI Podcast & Youtube Transcription & Understanding Agent |... |
|
Emerging |
| 4 |
Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer |
|
Emerging |
| 5 |
aws-solutions/content-localization-on-aws
Automatically generate multi-language subtitles using AWS AI/ML services.... |
|
Emerging |
| 6 |
fizamusthafa/whisper-app
This repository contains a web application for multi-lingual transcription... |
|
Emerging |
| 7 |
AEmotionStudio/ComfyUI-FFMPEGA
Intelligent FFMPEG agent node for ComfyUI - transforms natural language... |
|
Emerging |
| 8 |
gkrsv/split_audio
A rough and ready Python utility which splits audio files based on silence... |
|
Emerging |
| 9 |
i4Ds/whisper-prep
Data preparation utility for the finetuning of OpenAI's Whisper model. |
|
Emerging |
| 10 |
vdutts7/ai-rapper
Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise... |
|
Emerging |
| 11 |
stevenlawton/GPT-Whisper-captions
Automate subtitle generation for videos using OpenAI's Whisper API and... |
|
Experimental |
| 12 |
nalbion/whisper-server
streaming speech to text server using Whisper |
|
Experimental |
| 13 |
Jayem-11/Swahili_speech_to_text
Speech to Text for Swahili Language with Whisper-small. |
|
Experimental |
| 14 |
lukereichold/SpeechTimestamper
Generate an accurate, timestamped transcript given an audio file and its... |
|
Experimental |
| 15 |
RizhongLin/PolyglotWhisperer
Transcribe, translate, and learn — Whisper + LLM video pipeline with dual... |
|
Experimental |
| 16 |
somosnlp/wav2vec2-spanish
Pre-train a Spanish Wav2Vec2 model using the Spanish portion of the Common... |
|
Experimental |
| 17 |
uqqu/sync_book
audiobook generator with smart personalized translation |
|
Experimental |
| 18 |
LexMainye/Kasuku-Transcriber
A speech to text web app for people with speech impairments that has support... |
|
Experimental |
| 19 |
stellarloop/video2text
Python API & command-line tool to easily transcribe speech-based video files... |
|
Experimental |
| 20 |
eray-yuztyurk/python-ai-audio-transcriber-summarizer
AI-powered tool for fast, accurate audio transcription and summarization.... |
|
Experimental |
| 21 |
Jyotibrat/Speech-To-Text
Speech to Text model |
|
Experimental |
| 22 |
danielsobrado/audio-processor
Audio processor, focused on english and arabic with diarization and summarization |
|
Experimental |
| 23 |
udit-rawat/whisper-space
An ASR Gradio GUI based project that transcript the audion and provides NLP... |
|
Experimental |
| 24 |
antarades/emotion-aware-automatic-speech-recognition
An intelligent speech recognition system that combines OpenAI's Whisper for... |
|
Experimental |
| 25 |
MaharshPatelX/Speechitive
A Video analytics tool converting videos to M3U8 playlists using HLS and... |
|
Experimental |
| 26 |
singleshade8/japanese-subtitle-generator
GPU-accelerated Japanese → English subtitle generator using faster-whisper... |
|
Experimental |
| 27 |
bivex/voice_to_text
A Python application for real-time Russian voice-to-text transcription and... |
|
Experimental |
| 28 |
Ashishkumar-hub/Text-to-Speech-using-gTTS
Text to speech conversion NLP based end to end project |
|
Experimental |
| 29 |
ChaoticByte/audio-summarize
An audio summarizer (faster-whisper and BART glued together) |
|
Experimental |
| 30 |
zyovo-0829/LingxiCouplet
基于 FastAPI、Vue3、Whisper 和 DeepSeek API 的 AI 语音对联互动系统,支持语音输入、自动生成下联和智能评分。An... |
|
Experimental |
| 31 |
kalindasiaminwe/ChitongaASR
A natural language processing and machine learning project for a low... |
|
Experimental |