tsmdt/whisply

💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and subtitle generation using OpenAI’s Whisper on CPU, Nvidia GPU and Apple MLX.

/ 100

Established

This tool converts audio and video files into text. You provide your media files, and it generates transcriptions, translations, speaker annotations, and subtitles. It is designed for anyone who needs to process many recordings, such as researchers, podcasters, or content creators, and who wants that content to be more accessible and searchable.

108 stars. Available on PyPI.

Use this if you need a fast, reliable way to transcribe, translate, or subtitle a large batch of audio or video files, including identifying different speakers.

Not ideal if you only need occasional, single-file transcription and prefer a web-based service over installing software.

transcription, media-localization, content-creation, research-analysis, accessibility

Maintenance 13 / 25

Adoption 9 / 25

Maturity 25 / 25

Community 15 / 25

Stars

108

Forks

Language

Python

License

MIT

Related tools

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

linto-ai/linto-stt

An automatic speech recognition API

linto-ai/linto-studio

Transcription and annotation interface for recorded audio or video files
