linto-ai/linto-stt
An automatic speech recognition API
This tool helps convert spoken audio into written text, making it easier to analyze recordings or create captions. You provide an audio file or stream, and it outputs the transcription as text, with optional timestamps and confidence scores for each word. It's designed for developers or system administrators who need to integrate speech-to-text capabilities into their applications or services.
Use this if you need a flexible and deployable speech recognition system that can handle both pre-recorded audio files and real-time voice streams.
Not ideal if you are an end-user looking for a ready-to-use desktop application or a simple web interface for transcription.
Stars
81
Forks
21
Language
Python
License
AGPL-3.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/linto-ai/linto-stt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/linto-studio
Transcription and annotation interface for recorded audio or video files