Zh1yuShen/HopWeaver

Try HopWeaver: The first automatic synthesis framework based on any corpora, with quality approaching manual annotation.

30
/ 100
Emerging

HopWeaver helps researchers and data scientists automatically create complex, multi-step questions from large amounts of unstructured text, like research papers or articles. It takes your text documents and generates detailed questions that require connecting information across different parts of the text. This tool is ideal for anyone who needs to build high-quality question-answering datasets for specialized fields, without the extensive time and cost of manual annotation.

No commits in the last 6 months.

Use this if you need to generate high-quality, complex questions from your text data to train or evaluate advanced question-answering systems, especially when manual question creation is too costly or slow.

Not ideal if you need simple, single-fact questions or if your text data is very small and doesn't require deep, cross-document reasoning.

natural-language-processing dataset-generation knowledge-discovery text-analytics AI-research
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 6 / 25
Maturity 15 / 25
Community 7 / 25

How are scores calculated?

Stars

24

Forks

2

Language

Python

License

MIT

Last pushed

Jul 24, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Zh1yuShen/HopWeaver"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.