NeotomaDB/MetaExtractor
A repository for the UBC MDS Capstone team to develop a metadata extractor for Neotoma
This tool helps paleoecological researchers and data stewards efficiently update the Neotoma Paleoecological Database. It automatically scans newly published scientific articles, identifies those relevant to Neotoma, and extracts key data like fossil sites, taxa, ages, and geographic coordinates. The extracted information is then presented for review and correction before submission to the database.
No commits in the last 6 months.
Use this if you need to systematically find and extract paleoecological data from scientific literature to populate or update a database like Neotoma.
Not ideal if you are looking for a general-purpose scientific article search tool or if your data extraction needs are outside of paleoecology.
Stars
9
Forks
3
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/NeotomaDB/MetaExtractor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chakki-works/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
jbesomi/texthero
Text preprocessing, representation and visualization from zero to hero.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...