Nexdata-AI/359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading

Indonesian Speech Dataset

/ 100

Experimental

This project provides a large collection of Indonesian spoken sentences, recorded by native speakers reading various texts. It offers high-quality audio recordings along with precise text transcriptions. This resource is designed for developers building or improving voice-enabled applications for the Indonesian market.

No commits in the last 6 months.

Use this if you need a substantial and accurately transcribed dataset of Indonesian speech to train and develop speech recognition or voiceprint recognition systems.

Not ideal if you are looking for conversational or spontaneous Indonesian speech, as this dataset consists of read speech.

speech-recognition voice-AI-development natural-language-processing Indonesian-language-tech

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

hstsethi/in-mob-prefix

Dataset, charts, models of 4 digit mobile number prefixes in India by state, operator name.

apple/ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

Nexdata-AI/207-Hours-Japanese-Speaking-English-Speech-Data-by-Mobile-Phone

Japanese Speaking English Speech Dataset

Nexdata-AI/98-Hours-Taiwan-Mandarin-Speech-Data-by-Mobile-Phone_Reading

Taiwan Speech Dataset

Nexdata-AI/607-Hours-Cantonese-Conversational-Speech-Data-by-Mobile-Phone-and-Voice-Recorder

Cantonese Conversational Speech Dataset

Explore ML Frameworks

All categories Trending ML Framework directory Insights