svjack/docvqa-gen

Question Answering dataset generator of Document Visual in English and Chinese

/ 100

Experimental

This tool helps researchers and data scientists generate question-answer pairs directly from document images in both English and Chinese. You input an image containing text, such as a scanned document or a screenshot, and it outputs a list of relevant questions and their corresponding answers extracted from that image. It's designed for anyone working with document-based information who needs to create structured Q&A datasets.

No commits in the last 6 months.

Use this if you need to quickly create comprehensive question-answer datasets from various document images for training AI models or analyzing information.

Not ideal if you primarily work with plain text documents and do not require image-based question generation.

document-intelligence data-labeling optical-character-recognition information-extraction dataset-generation

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

cdqa-suite/cdQA

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.

AMontgomerie/question_generator

An NLP system for generating reading comprehension questions

KristiyanVachev/Leaf-Question-Generation

Easy to use and understand multiple-choice question generation algorithm using T5 Transformers.

robinniesert/kaggle-google-quest

Google QUEST Q&A Labeling Kaggle Competition 6th Place Solution

cooelf/AwesomeMRC

IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading Comprehension (AAAI 2021)

Explore Transformer Models

All categories Trending Transformer directory Insights