Zh1yuShen/HopWeaver
Try HopWeaver: The first automatic synthesis framework based on any corpora, with quality approaching manual annotation.
HopWeaver helps researchers and data scientists automatically create complex, multi-step questions from large amounts of unstructured text, like research papers or articles. It takes your text documents and generates detailed questions that require connecting information across different parts of the text. This tool is ideal for anyone who needs to build high-quality question-answering datasets for specialized fields, without the extensive time and cost of manual annotation.
No commits in the last 6 months.
Use this if you need to generate high-quality, complex questions from your text data to train or evaluate advanced question-answering systems, especially when manual question creation is too costly or slow.
Not ideal if you need simple, single-fact questions or if your text data is very small and doesn't require deep, cross-document reasoning.
Stars
24
Forks
2
Language
Python
License
MIT
Category
Last pushed
Jul 24, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Zh1yuShen/HopWeaver"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NatLibFi/Annif
Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.
explosion/displacy
:boom: displaCy.js: An open-source NLP visualiser for the modern web
hshindo/react-nlp
Visualization of Natural Language Processing for React
microsoft/browsecloud
A web app to create and browse text visualizations for automated customer listening.
microsoft/VisTalk
A JavaScript toolkit for Natural Language-based Visualization Authoring