Extralit/extralit
Fast and accurate systematic data extraction with LLM assistance
Extralit helps scientists, researchers, and data analysts efficiently extract specific information from scientific papers, reports, and other unstructured documents like PDFs. You feed it a document and a schema (what information you want to find), and it outputs structured data, allowing you to quickly analyze complex information. It's designed for anyone who needs to convert large volumes of text documents into usable, organized data.
Use this if you need to systematically extract precise data from many unstructured documents, especially scientific literature, and ensure high accuracy through human-in-the-loop validation.
Not ideal if you're looking for a simple keyword search tool or don't require high-accuracy, structured data extraction from complex documents.
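To make the "document plus schema in, structured data out" idea concrete, here is a minimal sketch of what a schema and a validation step could look like. It uses only plain Python and is not Extralit's actual API: the `SCHEMA` dictionary, its field names, and the `validate_record` helper are all hypothetical, chosen for illustration.

```python
# Hypothetical schema: field name -> expected Python type.
# These fields are illustrative only and do not come from Extralit.
SCHEMA = {"title": str, "sample_size": int, "outcome": str}


def validate_record(raw: dict, schema: dict = SCHEMA) -> dict:
    """Check one extracted record against the schema.

    Raises ValueError if a required field is missing and TypeError if a
    field has the wrong type. Returns only the fields named in the schema.
    """
    missing = [name for name in schema if name not in raw]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    wrong = [name for name, typ in schema.items() if not isinstance(raw[name], typ)]
    if wrong:
        raise TypeError(f"wrong type for fields: {wrong}")
    # Drop anything the schema did not ask for.
    return {name: raw[name] for name in schema}
```

In a human-in-the-loop workflow, a record that fails a check like this would presumably be routed to a reviewer rather than silently accepted.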
Stars: 41
Forks: 50
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 01, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Extralit/extralit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
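The same request can be made from Python with the standard library. This is a sketch under assumptions: the page does not document the response format, so the code assumes the endpoint returns JSON, and the helper names `quality_url` and `fetch_quality` are made up for illustration. Only the URL itself comes from the curl example above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is owner/repo.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/nlp"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for a GitHub owner/repo pair."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. The listing page does not
    confirm this, so adjust the parsing if the endpoint returns
    something else.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("Extralit", "extralit")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it is worth caching results rather than re-fetching on every run.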
Related tools
google/langextract
A Python library for extracting structured information from unstructured text using LLMs with...
Keyvanhardani/german-ocr
German-OCR is specifically trained to extract text from German documents including invoices,...
oidlabs-com/Lexoid
Multimodal document parser for high quality data understanding and extraction
xingbow/SciDaEx
Structured data extraction from research literature
parsee-ai/parsee-core
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and...