zjunlp/IEDatasetZoo
Information Extraction Dataset Zoo.
This is a collection of high-quality datasets for training and evaluating systems that extract specific pieces of information from text. It provides pre-processed text, often drawn from sources such as Wikipedia, news articles, or medical reports, that can be used to identify key entities, the relationships between them, and event triggers and arguments. It is aimed at researchers and practitioners in natural language processing who need well-structured data for training their models.
No commits in the last 6 months.
Use this if you are developing or testing algorithms for automatically identifying important entities, relationships, or events within large volumes of text.
Not ideal if you need tools for text generation, translation, or general-purpose text analysis like sentiment analysis or summarization.
Stars: 30
Forks: 4
Language: not specified
License: not specified
Category: (not shown)
Last pushed: Apr 09, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/zjunlp/IEDatasetZoo"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
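The same request can be made from a script. The following is a minimal Python sketch, not an official client: the helper names `build_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the single curl example above, and the response format is not documented here, so the body is returned as raw text.

```python
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(owner, repo, category="nlp"):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(owner, repo, category="nlp", timeout=10.0):
    """Fetch the raw response body as text (hypothetical helper).

    No API key is sent, so the keyless daily limit applies. The response
    format is an unknown here, so nothing is parsed.
    """
    url = build_url(owner, repo, category)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `fetch_quality("zjunlp", "IEDatasetZoo")` issues the same GET request as the curl command. Each call counts against the daily request limit, so caching the result locally is worth considering when polling many repositories.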
Higher-rated alternatives
zjunlp/OpenUE
[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text
OpenSextant/Xponents
Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction...
BaptisteBlouin/EventExtractionPapers
A list of NLP resources focused on event extraction task
philipperemy/stanford-openie-python
Stanford Open Information Extraction made simple!
uma-pi1/minie
An open information extraction system that provides compact extractions