Chia-Hsuan-Lee/ODSQA

ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET

28
/ 100
Experimental

This project provides a large, real-world Chinese dataset for training machines to answer questions from spoken content. It takes audio recordings of documents and spoken questions, then outputs text-based answers extracted directly from the document. This dataset is for researchers and developers building spoken question answering systems.

No commits in the last 6 months.

Use this if you are developing or researching systems that need to understand spoken Chinese documents and answer spoken questions based on that content.

Not ideal if your primary interest is in text-based question answering or if you require an English-only dataset.

spoken-language-understanding natural-language-processing machine-comprehension speech-recognition artificial-intelligence
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 12 / 25

How are scores calculated?

Stars

63

Forks

7

Language

Shell

License

Last pushed

Feb 20, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Chia-Hsuan-Lee/ODSQA"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.