VyetGokyra/Speech_project_Vin

Multimodal Speech Emotion Recognition using a ViT-based Audio Spectrogram Transformer (AST) as the audio encoder and a Multiscale Attention Network (MANet) as the visual encoder

Score: 26 / 100 (Experimental)

This helps researchers and practitioners analyze emotions expressed in speech by combining auditory cues with visual expressions. It takes spoken audio and the corresponding video footage as input and outputs a classification of the emotion being conveyed. It is aimed at scientists studying human emotion, psychologists, and anyone interested in automated affect analysis.
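The repo's exact fusion strategy is not documented on this page, but a common approach for combining an audio encoder (such as AST) with a visual encoder (such as MANet) is late fusion of per-class probabilities. A minimal sketch, assuming a hypothetical four-class label set and illustrative function names:

```python
# Hypothetical late-fusion sketch: average class probabilities from the two
# modalities. The label set and weighting below are illustrative assumptions,
# not taken from the repository itself.
EMOTIONS = ["angry", "happy", "neutral", "sad"]  # example label set

def fuse(audio_probs, visual_probs, w_audio=0.5):
    """Weighted average of per-class probabilities from the two modalities."""
    return [w_audio * a + (1 - w_audio) * v
            for a, v in zip(audio_probs, visual_probs)]

def predict(audio_probs, visual_probs):
    """Return the emotion label with the highest fused probability."""
    fused = fuse(audio_probs, visual_probs)
    return EMOTIONS[max(range(len(fused)), key=fused.__getitem__)]

# Audio leans "sad", video leans "neutral"; fused prediction is "sad".
print(predict([0.1, 0.1, 0.2, 0.6], [0.1, 0.1, 0.5, 0.3]))  # → sad
```

In practice the encoders would emit these probability vectors per utterance and per video clip; the fusion weight can be tuned on a validation set.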

No commits in the last 6 months.

Use this if you need to automatically identify emotions from combined audio and video recordings of human speech.

Not ideal if you only have text or still images for emotion analysis, or if you need to detect subtle emotional nuances beyond a fixed set of classifications.

emotion-recognition affective-computing speech-analysis psychology-research human-computer-interaction
No License · Stale (6m) · No Package · No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 14 / 25


Stars: 7
Forks: 3
Language: Python
License: none
Last pushed: Jan 21, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/VyetGokyra/Speech_project_Vin"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
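The same endpoint can be queried from Python instead of curl. A minimal sketch; the response schema is not documented on this page, so the fetch just prints the raw JSON (the `quality_url` helper is illustrative, not part of any published client):

```python
# Sketch: building and querying the quality API endpoint shown above.
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(ecosystem: str, owner: str, repo: str) -> str:
    """Build the per-repo quality endpoint URL."""
    return f"{API_BASE}/{ecosystem}/{owner}/{repo}"

url = quality_url("ml-frameworks", "VyetGokyra", "Speech_project_Vin")
print(url)

# Uncomment to fetch (counts against the 100 requests/day keyless quota):
# with urllib.request.urlopen(url) as resp:
#     print(json.dumps(json.load(resp), indent=2))
```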