luohongyin/RGX
Synthetic QA generation for long documents.
This project helps generate question-answer pairs from long documents, which is useful for creating training data for question-answering systems or for quickly generating quizzes. You feed in unstructured text documents, and it outputs a list of questions with their corresponding answers and the answer's location within the text. This tool is designed for data scientists, machine learning engineers, or anyone building or evaluating AI models that need to comprehend and answer questions about large volumes of text.
No commits in the last 6 months.
Use this if you need to quickly create a large dataset of questions and answers from a collection of documents without manually writing each one.
Not ideal if you need highly nuanced, human-curated questions or if your documents are very short and don't require extensive QA generation.
Stars
16
Forks
2
Language
Python
License
—
Category
Last pushed
Jul 22, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/luohongyin/RGX"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
asahi417/lm-question-generation
Multilingual/multidomain question generation datasets, models, and python library for question...
SparkJiao/SLQA
An Unofficial Pytorch Implementation of Multi-Granularity Hierarchical Attention Fusion Networks...
MurtyShikhar/Question-Answering
TensorFlow implementation of Match-LSTM and Answer pointer for the popular SQuAD dataset.
hsinyuan-huang/FlowQA
Implementation of conversational QA model: FlowQA (with slight improvement)
allenai/aokvqa
Official repository for the A-OKVQA dataset