abus-aikorea/kara-audio
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.
This tool helps content creators, educators, or anyone working with video and audio to easily produce high-quality subtitles and cleaned audio. It takes YouTube video links or audio files, separates vocals from music, and generates accurate transcriptions and subtitle files in over 90 languages. Users include video editors, transcribers, and educators looking to make content more accessible or reusable.
No commits in the last 6 months.
Use this if you need to quickly create subtitles for videos, extract instrumental tracks for karaoke, or generate text transcripts from audio, especially from YouTube.
Not ideal if you are using a Mac or Linux operating system, or if you don't have an NVIDIA GPU, as performance may be significantly impacted.
Stars
67
Forks
9
Language
Python
License
GPL-3.0
Category
Last pushed
Oct 02, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/abus-aikorea/kara-audio"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI