rifkiaputri/IDK-MRC
Code & dataset for EMNLP 2022 paper: IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension
This project provides a unique dataset and models to help build better question-answering systems for Indonesian. It takes paragraphs of Indonesian text and a set of questions (some answerable, some not) and trains models to accurately identify answers or state when an answer isn't present in the text. This is designed for researchers and NLP practitioners working on machine reading comprehension for the Indonesian language.
No commits in the last 6 months.
Use this if you are developing or evaluating machine reading comprehension systems for Indonesian text and need a robust dataset that includes unanswerable questions.
Not ideal if you are looking for a pre-built, ready-to-deploy question-answering application for end-users, or if your focus is on languages other than Indonesian.
Stars
9
Forks
2
Language
Jupyter Notebook
License
—
Category
Last pushed
Mar 20, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/rifkiaputri/IDK-MRC"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
vinhkhuc/MemN2N-babi-python
End-To-End Memory Networks for bAbI question-answering tasks
patil-suraj/question_generation
Neural question generation using transformers
nelson-liu/paraphrase-id-tensorflow
Various models and code (Manhattan LSTM, Siamese LSTM + Matching Layer, BiMPM) for the...
YuriyGuts/kaggle-quora-question-pairs
My solution to Kaggle Quora Question Pairs competition (Top 2%, Private LB log loss 0.13497).
dtrizna/slp
Shell Language Processing (SLP). Pre-processing of sh/bash/zsh/.. commands for Machine Learning models.