luohongyin/RGX

Synthetic QA generation for long documents.

23
/ 100
Experimental

This project helps generate question-answer pairs from long documents, which is useful for creating training data for question-answering systems or for quickly generating quizzes. You feed in unstructured text documents, and it outputs a list of questions with their corresponding answers and the answer's location within the text. This tool is designed for data scientists, machine learning engineers, or anyone building or evaluating AI models that need to comprehend and answer questions about large volumes of text.

No commits in the last 6 months.

Use this if you need to quickly create a large dataset of questions and answers from a collection of documents without manually writing each one.

Not ideal if you need highly nuanced, human-curated questions or if your documents are very short and don't require extensive QA generation.

natural-language-processing data-generation machine-learning-training information-extraction document-analysis
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 9 / 25

How are scores calculated?

Stars

16

Forks

2

Language

Python

License

Last pushed

Jul 22, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/luohongyin/RGX"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.