allenai/papermage
library supporting NLP and CV research on scientific papers
papermage helps researchers analyze the structure and content of scientific papers. Given a PDF of a paper, it produces a structured document broken down into components such as pages, paragraphs, sentences, tables, and figures. It is aimed at scientists, academics, and anyone who needs to programmatically extract and work with specific elements of scientific PDFs.
791 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to programmatically understand the layout and extract specific textual or visual elements from scientific PDFs for tasks like building a QA system or a knowledge graph.
Not ideal if you're looking for an actively maintained, production-ready solution, as this project is a research prototype.
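The PDF-in, structured-document-out flow described above might look roughly like the sketch below. The `CoreRecipe` import and `run` call follow the project README as best recalled and should be checked against the installed version. `parse_pdf` and `count_layers` are hypothetical helper names introduced here, and the layer names simply mirror the components listed in the description. The sketch also assumes each layer supports `len()`.

```python
from types import SimpleNamespace


def parse_pdf(pdf_path):
    """Parse a PDF into a papermage document (requires the papermage package from PyPI)."""
    # Imported lazily so count_layers() below can be tried without papermage installed.
    # Assumption: CoreRecipe().run(path) is the entry point, as shown in the project README.
    from papermage.recipes import CoreRecipe

    return CoreRecipe().run(pdf_path)


def count_layers(doc, layer_names=("pages", "paragraphs", "sentences", "tables", "figures")):
    """Return {layer name: number of entities}; layers absent from the document count as 0."""
    return {name: len(getattr(doc, name, [])) for name in layer_names}


# Stand-in object so the sketch runs without a PDF or the library installed.
stub_doc = SimpleNamespace(pages=[1, 2], sentences=["a", "b", "c"], figures=[])
print(count_layers(stub_doc))
# → {'pages': 2, 'paragraphs': 0, 'sentences': 3, 'tables': 0, 'figures': 0}
```

With the library installed, `count_layers(parse_pdf("paper.pdf"))` would give a quick overview of what was extracted from a local file named `paper.pdf` (a placeholder path).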
Stars: 791
Forks: 64
Language: Python
License: Apache-2.0
Category:
Last pushed: Nov 08, 2024
Commits (30d): 0
Dependencies: 10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/allenai/papermage"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
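The same request can be made from Python with the standard library. This is a minimal sketch. It assumes the path follows a `category/owner/repo` pattern, inferred only from the single URL above, and that the endpoint returns JSON, which the page does not state. `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the category/owner/repo layout is inferred from one example."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record for one repository (assumption: the response body is JSON)."""
    with urllib.request.urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Only the URL is built here; call fetch_quality(...) to perform the actual request.
print(quality_url("ml-frameworks", "allenai", "papermage"))
```

Calling `fetch_quality("ml-frameworks", "allenai", "papermage")` performs the network request and counts against the daily limit.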
Related frameworks
neuml/paperai
📄 🤖 AI for medical and scientific papers
supriya46788/Research-Paper-Organizer
Open-source beginner-friendly project
asreview/asreview-makita
Workflow generator for simulation studies using the command line interface of ASReview LAB
alibaba/AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Tavris1/AI-Toolkit-Easy-Install
One-click Portable Windows installation of 'AI-Toolkit by Ostris'