peng-yiwen/WiKC

A cleaned version of Wikidata taxonomy - Refined using Large Language Models

/ 100

Experimental

This project helps knowledge engineers and data scientists refine and clean large-scale taxonomies, specifically those derived from Wikidata. It takes raw Wikidata taxonomy data and processes it using large language models and graph mining techniques to produce a more accurate and structured taxonomy. The cleaned taxonomy can be used for various applications requiring precise hierarchical knowledge.

No commits in the last 6 months.

Use this if you need to create or improve a structured, clean, and semantically consistent taxonomy from a large, noisy knowledge base like Wikidata for use in knowledge graphs, search, or semantic understanding applications.

Not ideal if you're looking for a simple keyword extraction tool or a small, domain-specific ontology builder rather than a large-scale, automated taxonomy refinement pipeline.

knowledge-engineering taxonomy-management data-curation semantic-data information-architecture

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

HTML

License

MIT

Higher-rated alternatives

ananthanarayanan431/Langchain-Projects-LLM

Various projects using Large Language Model (GPT & LLAMA) other open source model from...

deepset-ai/haystack-home

Website for Haystack, the open source LLM framework

astronomer/ask-astro

An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer

jndiogo/sibila

Extract structured data from local or remote LLM models

Quhaoh233/ChatEV

ChatEV: Predicting electric vehicle charging demand as natural language processing

Explore LLM Tools

All categories Trending LLM Tool directory Insights