linto-ai/linto-diarization
Speaker diarization service
This helps speech analysts, researchers, or anyone working with audio content to automatically identify 'who spoke when' in a recording. You feed it an audio file, and it outputs a breakdown of which speaker is talking at specific timestamps. Optionally, if you provide voice samples of known individuals, it can also tell you exactly which person (e.g., 'Alice' or 'Bob') spoke at each segment.
Use this if you need to analyze multi-speaker audio recordings to understand conversational turns, measure speaking time, or attribute dialogue to specific individuals.
Not ideal if you only need a text transcript of the audio without any information about the speakers.
Stars
28
Forks
1
Language
Python
License
AGPL-3.0
Category
Last pushed
Feb 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/linto-ai/linto-diarization"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/linto-stt
An automatic speech recognition API