antarades/emotion-aware-automatic-speech-recognition
A speech recognition system that combines OpenAI's Whisper for transcription with dual emotion detection models. It analyzes both audio characteristics (tone, pitch, intensity) and textual content to provide emotional context alongside transcriptions.
No commits in the last 6 months.
Stars
—
Forks
1
Language
Python
License
MIT
Category
Last pushed
Sep 13, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/antarades/emotion-aware-automatic-speech-recognition"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
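The same request can be made from Python. The sketch below is a minimal illustration using only the standard library. It assumes the three path segments after `/quality/` follow the pattern seen in the curl example, and that the response body is JSON. Neither is documented in this listing, so the code falls back to returning the raw text. The function names `build_quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(domain: str, owner: str, repo: str) -> str:
    """Join the three path segments seen in the curl example onto the base URL.

    The segment names (domain, owner, repo) are an assumption based on
    the shape of the example URL.
    """
    parts = [quote(part, safe="") for part in (domain, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(domain: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and return the parsed body.

    Assumption: the body is JSON. If parsing fails, the raw text is
    returned instead, since the response format is not documented here.
    """
    url = build_quality_url(domain, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body
```

Calling `fetch_quality("voice-ai", "antarades", "emotion-aware-automatic-speech-recognition")` issues the same GET request as the curl command. Unauthenticated calls count against the 100 requests/day limit.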
Higher-rated alternatives
AbdullahHendy/live-translation
Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client...
i4Ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
512z/podlens
Free Podwise: AI Podcast & YouTube Transcription & Understanding Agent | Podcast + YouTube transcription, learning, and visualization AI tool
Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer
aws-solutions/content-localization-on-aws
Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated...