Chia-Hsuan-Lee/ODSQA
ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
This project provides a large, real-world Chinese dataset for training machines to answer questions from spoken content. Systems built on it take audio recordings of documents and spoken questions as input and output text answers extracted directly from the document. The dataset is aimed at researchers and developers building spoken question answering systems.
No commits in the last 6 months.
Use this if you are developing or researching systems that need to understand spoken Chinese documents and answer spoken questions based on that content.
Not ideal if your primary interest is in text-based question answering or if you require an English-only dataset.
Stars: 63
Forks: 7
Language: Shell
License: —
Category:
Last pushed: Feb 20, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Chia-Hsuan-Lee/ODSQA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
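The request above can be wrapped in a small script when several repositories need to be looked up. The following is a minimal sketch, not a documented client: the helper names `quality_url` and `fetch_quality` are hypothetical, and the assumption that the path segments are category, owner, and repository is inferred only from the single example URL. The response format and the way a key is supplied are not described here, so the sketch prints the raw response body and sends no key.

```shell
#!/usr/bin/env bash

# Hypothetical helper: build the endpoint URL from its path segments.
# Assumes the layout /api/v1/quality/<category>/<owner>/<repo>,
# inferred from the one example URL shown above.
quality_url() {
  local category="$1" owner="$2" repo="$3"
  printf 'https://pt-edge.onrender.com/api/v1/quality/%s/%s/%s\n' \
    "$category" "$owner" "$repo"
}

# Hypothetical helper: fetch the record and print the raw response body.
# --fail makes curl exit non-zero on HTTP errors (for example, when the
# daily request limit is exceeded); --max-time bounds the wait in seconds.
fetch_quality() {
  curl --fail --silent --show-error --max-time 30 "$(quality_url "$@")"
}

# Usage (performs a network request):
#   fetch_quality nlp Chia-Hsuan-Lee ODSQA
```

Keeping URL construction separate from the network call makes it possible to check the URL without spending any of the daily request allowance.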
Higher-rated alternatives
PaddlePaddle/RocketQA
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both...
shuaihuaiyi/QA
A Chinese question answering system implemented with deep learning algorithms
allenai/deep_qa
A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)
worldbank/iQual
iQual is a package that leverages natural language processing to scale up interpretative...
seriousran/awesome-qa
😎 A curated list of the Question Answering (QA)