xingbow/SciDaEx
Structured data extraction from research literature
This tool helps researchers quickly extract and organize data from scientific papers. You input PDF research articles, and it outputs structured data tables, along with extracted figures and text, ready for evidence synthesis. It's designed for scientists, academics, and anyone conducting systematic reviews or meta-analyses.
No commits in the last 6 months.
Use this if you need to efficiently gather and structure specific data points, tables, and figures from a large collection of scientific literature for research synthesis.
Not ideal if you only need to read a few papers manually or are not working with a large volume of documents.
Stars
26
Forks
4
Language
Python
License
MIT
Category
Last pushed
Aug 02, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/xingbow/SciDaEx"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
google/langextract
A Python library for extracting structured information from unstructured text using LLMs with...
Extralit/extralit
Fast and accurate systemic data extraction with LLM assistance
Keyvanhardani/german-ocr
German-OCR is specifically trained to extract text from German documents including invoices,...
oidlabs-com/Lexoid
Multimodal document parser for high quality data understanding and extraction
parsee-ai/parsee-core
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and...