VyetGokyra/Speech_project_Vin
Multimodal Speech Emotion Recognition ViT (AST) for audio encoder and Multiscale Attention Net (MANet) for visual encoder
This helps researchers and practitioners analyze emotions expressed in speech, combining both auditory cues and visual expressions. It takes spoken audio and corresponding video footage as input and outputs a classification of the emotion being conveyed. This tool is ideal for scientists studying human emotion, psychologists, or anyone interested in automated affect analysis.
No commits in the last 6 months.
Use this if you need to automatically identify emotions from combined audio and video recordings of human speech.
Not ideal if you only have text or still images for emotion analysis, or if you need to detect subtle emotional nuances beyond a fixed set of classifications.
Stars
7
Forks
3
Language
Python
License
—
Category
Last pushed
Jan 21, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/VyetGokyra/Speech_project_Vin"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
x4nth055/emotion-recognition-using-speech
Building and training Speech Emotion Recognizer that predicts human emotions using Python,...
MiteshPuthran/Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio...
marcogdepinto/emotion-classification-from-audio-files
Understanding emotions from audio files using neural networks and multiple datasets.
maelfabien/Multimodal-Emotion-Recognition
A real time Multimodal Emotion Recognition web app for text, sound and video inputs
xiamx/awesome-sentiment-analysis
😀😄😂😠A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤