sherlcok314159/ChineseMRC-Data
A collection of the extractive MRC (machine reading comprehension) datasets available so far for Chinese.
This project provides a collection of datasets for training and evaluating systems that can automatically answer questions from Chinese text. It takes a question and a related passage of text as input, and outputs the precise answer extracted directly from the passage. This is ideal for anyone working with information retrieval, customer support, or content analysis in Chinese.
122 stars. No commits in the last 6 months.
Use this if you need high-quality, pre-annotated Chinese text and corresponding questions with extractable answers to build or test an automated question-answering system.
Not ideal if your task requires generating creative answers, summarizing long documents, or working with languages other than Chinese.
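To make the input/output contract described above concrete, here is a minimal Python sketch of a single extractive example. The field names (`context`, `question`, `answer_text`, `answer_start`) follow the common SQuAD-style convention and are an assumption; the individual datasets in this collection may name or nest these fields differently. The sample sentence is invented for illustration and is not taken from any of the datasets.

```python
from dataclasses import dataclass


@dataclass
class ExtractiveExample:
    """One question/passage pair with an answer that is a span of the passage.

    Hypothetical schema for illustration only; not the repo's documented format.
    """
    context: str       # passage the answer is extracted from
    question: str      # question about the passage
    answer_text: str   # answer string, expected to appear verbatim in context
    answer_start: int  # character offset of the answer inside context

    def answer_span(self):
        """Return (start, end) character offsets, end exclusive."""
        return self.answer_start, self.answer_start + len(self.answer_text)

    def is_consistent(self):
        """Check that the offsets really point at the answer text."""
        start, end = self.answer_span()
        return self.context[start:end] == self.answer_text


# Toy record written for this sketch (not real dataset content).
example = ExtractiveExample(
    context="长城位于中国北方。",
    question="长城位于哪里？",
    answer_text="中国北方",
    answer_start=4,
)

start, end = example.answer_span()
print(example.context[start:end])
print(example.is_consistent())
```

A consistency check like `is_consistent` can be worth running over a whole file before training, because character offsets are easy to corrupt when text is re-encoded or normalized.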
Stars: 122
Forks: 15
Language: —
License: —
Category:
Last pushed: Jun 20, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sherlcok314159/ChineseMRC-Data"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
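The same request can be sketched in Python with only the standard library. This is a rough illustration under assumptions: the path layout (`quality/<category>/<owner>/<repo>`) is inferred from the single URL shown above, the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which the page does not state.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL; the segment order is inferred from the one example URL."""
    parts = [urllib.parse.quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch and decode the response.

    Assumption: the endpoint returns a JSON body. Unauthenticated access is
    limited to 100 requests/day, so avoid calling this in a tight loop.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the URL needs no network access.
url = build_quality_url("nlp", "sherlcok314159", "ChineseMRC-Data")
print(url)
```

Calling `fetch_quality("nlp", "sherlcok314159", "ChineseMRC-Data")` would perform the actual request; it is left out of the snippet so that the example runs offline and does not consume the daily quota.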
Higher-rated alternatives
ymcui/cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
thunlp/MultiRD
Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"
princeton-nlp/DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval...
IndexFziQ/KMRC-Papers
A list of recent papers regarding knowledge-based machine reading comprehension.
danqi/rc-cnn-dailymail
CNN/Daily Mail Reading Comprehension Task