antarades/emotion-aware-automatic-speech-recognition

An intelligent speech recognition system that combines OpenAI's Whisper for accurate transcription with dual emotion detection models. Analyzes both audio characteristics (tone, pitch, intensity) and textual content to provide comprehensive emotional context alongside transcriptions.

/ 100

Experimental

No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 0 / 25

Maturity 9 / 25

Community 2 / 25

How are scores calculated?

Stars

—

Forks

Language

Python

License

MIT

Category

speech-to-text-transcription

Last pushed

Sep 13, 2025

Commits (30d)

GitHub

Speech To Text Transcription · 31 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/antarades/emotion-aware-automatic-speech-recognition"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client...

i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

512z/podlens

Free Podwise: AI Podcast & Youtube Transcription & Understanding Agent | 播客+youtube转文字/学习/可视化AI工具

Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

aws-solutions/content-localization-on-aws

Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights