amazon-science/wqa-multi-sentence-inference
This repository contains code used for our Multi Sentence Inference NAACL'22 paper.
This project offers tools to prepare large text datasets, like news articles or Wikipedia, for training language models. It takes raw text documents and converts them into a format suitable for tasks that require understanding relationships between multiple sentences. The primary users are researchers or engineers who are building or fine-tuning advanced natural language processing models for tasks like question answering or fact-checking.
No commits in the last 6 months.
Use this if you need to pre-train or fine-tune transformer-based language models for tasks requiring multi-sentence inference, using your own large text corpora.
Not ideal if you are looking for a pre-trained model to use directly or a tool for basic text classification or single-sentence analysis.
Stars: 12
Forks: 2
Language: Python
License: —
Category:
Last pushed: Mar 06, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/amazon-science/wqa-multi-sentence-inference"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
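The same request can be made from Python with only the standard library. The following is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the `<category>/<owner>/<repo>` path layout is inferred from the single URL shown above, and the response is only assumed to be JSON (the sketch falls back to raw text if it is not). How an API key is passed is not documented here, so the sketch makes keyless requests only.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the one example URL; the service may not accept other values.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record for one repository (keyless request).

    Assumption: the body is JSON. If it does not parse, the raw text
    is returned so the caller can inspect it.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to hit the network.
    print(build_quality_url("nlp", "amazon-science", "wqa-multi-sentence-inference"))
```

Calling `fetch_quality("nlp", "amazon-science", "wqa-multi-sentence-inference")` performs the actual request; with the keyless limit of 100 requests/day, it is worth caching the result rather than calling it in a loop.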
Higher-rated alternatives
ymcui/cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
thunlp/MultiRD
Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"
princeton-nlp/DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval...
IndexFziQ/KMRC-Papers
A list of recent papers regarding knowledge-based machine reading comprehension.
danqi/rc-cnn-dailymail
CNN/Daily Mail Reading Comprehension Task