abus-aikorea/kara-audio

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.

/ 100

Emerging

This tool helps content creators, educators, or anyone working with video and audio to easily produce high-quality subtitles and cleaned audio. It takes YouTube video links or audio files, separates vocals from music, and generates accurate transcriptions and subtitle files in over 90 languages. Users include video editors, transcribers, and educators looking to make content more accessible or reusable.
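As a rough illustration of the subtitle-file step described above, the sketch below turns timed transcript segments into SubRip (.srt) text. It is not taken from kara-audio. The `(start, end, text)` segment shape and the names `format_timestamp` and `segments_to_srt` are assumptions made for this example, chosen to resemble what Whisper-style transcribers typically return.

```python
from typing import Iterable, Tuple

# Hypothetical segment shape: (start_seconds, end_seconds, text).
Segment = Tuple[float, float, str]


def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    return "\n".join(blocks)
```

Calling `segments_to_srt` with the segments produced by a transcriber and writing the result to a `.srt` file would give a subtitle track that most video players can load.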

No commits in the last 6 months.

Use this if you need to quickly create subtitles for videos, extract instrumental tracks for karaoke, or generate text transcripts from audio, especially from YouTube.

Not ideal if you are on macOS or Linux, or if you don't have an NVIDIA GPU, as performance may be significantly worse.

subtitle-production video-editing audio-transcription content-creation media-accessibility

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 14 / 25

Language: Python

License: GPL-3.0

Higher-rated alternatives

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI
