Nexdata-AI/359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading
Indonesian Speech Dataset
This project provides a large collection of Indonesian spoken sentences, recorded by native speakers reading various texts. It offers high-quality audio recordings along with precise text transcriptions. This resource is designed for developers building or improving voice-enabled applications for the Indonesian market.
No commits in the last 6 months.
Use this if you need a substantial and accurately transcribed dataset of Indonesian speech to train and develop speech recognition or voiceprint recognition systems.
Not ideal if you are looking for conversational or spontaneous Indonesian speech, as this dataset consists of read speech.
Stars
7
Forks
2
Language
—
License
—
Category
Last pushed
Aug 08, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Nexdata-AI/359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hstsethi/in-mob-prefix
Dataset, charts, models of 4 digit mobile number prefixes in India by state, operator name.
apple/ml-spatial-librispeech
A large synthetic dataset of spatial audio with multiple labels
Nexdata-AI/207-Hours-Japanese-Speaking-English-Speech-Data-by-Mobile-Phone
Japanese Speaking English Speech Dataset
Nexdata-AI/98-Hours-Taiwan-Mandarin-Speech-Data-by-Mobile-Phone_Reading
Taiwan Speech Dataset
Nexdata-AI/607-Hours-Cantonese-Conversational-Speech-Data-by-Mobile-Phone-and-Voice-Recorder
Cantonese Conversational Speech Dataset