lias-laboratory/cidoccrm-llm-extractor
A tool for automating CIDOC CRM knowledge graph population using Large Language Models (LLMs), with a focus on RDF triple extraction from archaeological datasets.
This tool helps archaeologists and cultural heritage researchers transform their structured archaeological datasets, typically in CSV format, into a structured knowledge graph. It uses advanced AI to identify key entities and relationships, outputting these as RDF triples compliant with the CIDOC CRM standard. This enables easier integration and analysis of complex historical information.
Use this if you need to automatically convert detailed archaeological inventories or site data into a standardized, machine-readable knowledge graph format for better research and preservation.
Not ideal if your data is unstructured text documents, or if you are not working with archaeological or cultural heritage data that aligns with the CIDOC CRM standard.
Stars
9
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/lias-laboratory/cidoccrm-llm-extractor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NanoNets/docstrange
Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple...
th1nhhdk/local_ai_ocr
An local, offline (after initial setup), portable OCR software that can process images and PDF...
Dicklesworthstone/llm_aided_ocr
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking,...
emcf/thepipe
Get clean data from tricky documents, powered by vision-language models ⚡
langstruct-ai/langstruct
Extract structured data from any content using LLMs.