aliyzd95/modified-shemo

A modification on the Sharif Emotional Speech Database

Score: 28 / 100 (Experimental)

This project provides a refined version of the Sharif Emotional Speech Database (ShEMO), a collection of Persian speech audio and text transcripts. It takes the original ShEMO dataset, identifies and corrects misaligned audio and text files, and resolves inconsistencies in the emotional labels. The result is a more accurate dataset of Persian emotional speech, ready for use by researchers in speech technology.

No commits in the last 6 months.

Use this if you are a speech researcher or machine learning engineer working with Persian emotional speech data and need a clean, corrected dataset for training and evaluating models.

Not ideal if you are looking for a brand new emotional speech dataset rather than a corrected version of an existing one, or if you need emotional speech data in a language other than Persian.

speech-recognition natural-language-processing sentiment-analysis machine-learning-datasets persian-language
No License, Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 13 / 25


Stars: 10
Forks: 2
Language: Jupyter Notebook
License: none listed
Last pushed: Apr 16, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/aliyzd95/modified-shemo"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
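
The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which this page does not confirm, and it makes no assumption about the fields inside that body. The helper names build_quality_url and fetch_quality, and the parameter name category for the "voice-ai" path segment, are hypothetical and chosen for illustration.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    The name `category` for the first path segment is an assumption.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and parse it.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError, which the caller should be prepared to handle.
    """
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to perform the request.
    print(build_quality_url("voice-ai", "aliyzd95", "modified-shemo"))
```

Calling fetch_quality("voice-ai", "aliyzd95", "modified-shemo") performs the network request and counts toward the daily request limit stated above.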