shashikg/WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

46 / 100 (Emerging)

This tool helps convert audio recordings into written text quickly and accurately. You feed in audio files, and it produces a transcript in various formats like TXT, JSON, or SRT. It's designed for anyone who needs fast and reliable transcriptions, such as journalists, researchers, content creators, or meeting facilitators.

541 stars. No commits in the last 6 months.

Use this if you need to transcribe audio files into text exceptionally fast, especially for large volumes of audio, and require high accuracy.

Not ideal if you primarily need to translate speech into another language rather than transcribe it, or if your work does not involve audio transcription.

audio-transcription meeting-minutes content-creation qualitative-research video-captioning
Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25
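
The four category scores above, each out of 25, appear to add up to the overall 46 / 100. The page does not state the formula, so the following is only a minimal check in Python, assuming the total is a plain unweighted sum:

```python
# Category scores as listed on this page, each out of 25.
category_scores = {
    "Maintenance": 0,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 20,
}

# Assumption: the overall score is the unweighted sum of the categories.
overall = sum(category_scores.values())
max_overall = 25 * len(category_scores)

print(f"{overall} / {max_overall}")  # prints "46 / 100"
```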


Stars: 541
Forks: 73
Language: Jupyter Notebook
License: MIT
Last pushed: Aug 27, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shashikg/WhisperS2T"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
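
The same request can be made from Python with only the standard library. This is a sketch under stated assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the path segments are read as category, owner, and repository based on the URL shown above, and the response is assumed to be JSON since its schema is not documented here.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Assumption: the three path segments are category, owner, repository.
    """
    parts = (quote(p, safe="") for p in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON; its fields are not documented here,
    so the parsed object is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example (performs a network request and counts against the daily limit):
# record = fetch_quality("voice-ai", "shashikg", "WhisperS2T")
# print(record)
```

Keeping the URL construction separate from the request makes it easy to check the URL without using up any of the 100 keyless requests per day.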