jianchang512/stt

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具，输出json、srt字幕、纯文字格式

/ 100

Established

This tool helps you convert spoken words from audio or video files into written text. You simply upload your media, choose the language and desired output format (like plain text, SRT subtitles with timestamps, or JSON), and it generates the transcript. This is perfect for content creators, transcribers, or anyone needing to quickly document spoken content from media without relying on online services.

4,331 stars.

Use this if you need to quickly and accurately transcribe audio or video content into text, subtitles, or a structured data format while keeping your data offline.

Not ideal if you require real-time transcription for live events or need advanced speaker diarization features.

transcription video-editing content-creation audio-processing subtitle-generation

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

4,331

Forks

463

Language

Python

License

GPL-3.0

Compare

stt and realtime-stt

Related tools

cyberofficial/Synthalingua

Synthalingua - Real Time Translation

Jaymon/transcribe

Convert images or audio files to plain text on the command line

developers-cosmos/Mimasa

Real time multilingual face translator

lperezmo/real-time-translator

A quick app to translate speech in real time using the Whisper API for transcribing audio,...

book000/audio-transcriber-docker

Automatically transcribe the audio of video / audio files using Speech Recognition.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights