Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

/ 100

Emerging

This tool helps you automatically figure out who spoke when in any audio recording, such as meetings, podcasts, or interviews. You provide examples of each person's voice once, and it outputs a timeline showing who spoke and for how long. It can even generate a full transcript, telling you "who said what." This is ideal for anyone needing to analyze spoken content, like researchers, journalists, or content creators.

Available on PyPI.

Use this if you need to quickly identify specific speakers and potentially transcribe their words from audio files, without manually listening through everything.

Not ideal if you only need a basic transcription without speaker identification or if you're working with extremely low-quality audio where voices are difficult to distinguish.

audio-analysis speech-recognition meeting-minutes podcast-production interview-transcription

Maintenance 13 / 25

Adoption 7 / 25

Maturity 18 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client...

i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

512z/podlens

Free Podwise: AI Podcast & Youtube Transcription & Understanding Agent | 播客+youtube转文字/学习/可视化AI工具

aws-solutions/content-localization-on-aws

Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated...

fizamusthafa/whisper-app

This repository contains a web application for multi-lingual transcription using OpenAI's...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights