luohongyin/RGX

Synthetic QA generation for long documents.

/ 100

Experimental

This project helps generate question-answer pairs from long documents, which is useful for creating training data for question-answering systems or for quickly generating quizzes. You feed in unstructured text documents, and it outputs a list of questions with their corresponding answers and the answer's location within the text. This tool is designed for data scientists, machine learning engineers, or anyone building or evaluating AI models that need to comprehend and answer questions about large volumes of text.

No commits in the last 6 months.

Use this if you need to quickly create a large dataset of questions and answers from a collection of documents without manually writing each one.

Not ideal if you need highly nuanced, human-curated questions or if your documents are very short and don't require extensive QA generation.

natural-language-processing data-generation machine-learning-training information-extraction document-analysis

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

asahi417/lm-question-generation

Multilingual/multidomain question generation datasets, models, and python library for question...

SparkJiao/SLQA

An Unofficial Pytorch Implementation of Multi-Granularity Hierarchical Attention Fusion Networks...

MurtyShikhar/Question-Answering

TensorFlow implementation of Match-LSTM and Answer pointer for the popular SQuAD dataset.

hsinyuan-huang/FlowQA

Implementation of conversational QA model: FlowQA (with slight improvement)

allenai/aokvqa

Official repository for the A-OKVQA dataset

Explore NLP Tools

All categories Trending NLP directory Insights