svjack/docvqa-gen
Question Answering dataset generator of Document Visual in English and Chinese
This tool helps researchers and data scientists generate question-answer pairs directly from document images in both English and Chinese. You input an image containing text, such as a scanned document or a screenshot, and it outputs a list of relevant questions and their corresponding answers extracted from that image. It's designed for anyone working with document-based information who needs to create structured Q&A datasets.
No commits in the last 6 months.
Use this if you need to quickly create comprehensive question-answer datasets from various document images for training AI models or analyzing information.
Not ideal if you primarily work with plain text documents and do not require image-based question generation.
Stars
24
Forks
2
Language
Jupyter Notebook
License
—
Category
Last pushed
Apr 17, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/svjack/docvqa-gen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
cdqa-suite/cdQA
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
AMontgomerie/question_generator
An NLP system for generating reading comprehension questions
KristiyanVachev/Leaf-Question-Generation
Easy to use and understand multiple-choice question generation algorithm using T5 Transformers.
robinniesert/kaggle-google-quest
Google QUEST Q&A Labeling Kaggle Competition 6th Place Solution
cooelf/AwesomeMRC
IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading Comprehension (AAAI 2021)