joelbarmettlerUZH/ConceptFormer

Towards Finding the Essence of Everything in Large Language Models

Score: 37 / 100 (Emerging)

This project is for AI researchers or data scientists working on understanding how large language models (LLMs) connect to real-world knowledge. It helps create specialized datasets like T-Rex Star and Tri-Rex by extracting entities from text and linking them to knowledge graphs like Wikidata, and then generating synthetic sentences using local LLMs. The output is structured datasets and trained model configurations that can be used to pretrain and evaluate LLMs on knowledge-intensive tasks.

Use this if you are a researcher focused on the interpretability or knowledge representation of large language models and need to generate complex, knowledge-graph-infused datasets for experimentation and training.

Not ideal if you are looking for a plug-and-play solution for general LLM fine-tuning or do not have significant computational resources (e.g., multiple high-end GPUs and hundreds of GBs of RAM).

Tags: LLM Research · Knowledge Graph Generation · Dataset Creation · AI Model Pretraining · Natural Language Processing
No License · No Package · No Dependents
Maintenance: 10 / 25
Adoption: 5 / 25
Maturity: 8 / 25
Community: 14 / 25


Stars: 13
Forks: 3
Language: Python
License: none
Last pushed: Feb 01, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/joelbarmettlerUZH/ConceptFormer"

The API is open to everyone (100 requests/day, no key needed); a free key raises the limit to 1,000 requests/day.
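The curl call above can also be wrapped in a small script. A minimal sketch in Python, assuming the `owner/repo` URL pattern shown in the example generalizes to other repositories (the `transformers` path segment is copied verbatim from the curl command; whether it varies per ecosystem is an assumption):

```python
from urllib.parse import quote

# Base of the quality endpoint, taken directly from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"

def quality_url(owner: str, repo: str) -> str:
    """Build the quality-API URL for a GitHub owner/repo pair.

    quote() percent-encodes any characters that are unsafe in a URL path.
    """
    return f"{BASE}/{quote(owner)}/{quote(repo)}"

print(quality_url("joelbarmettlerUZH", "ConceptFormer"))
# https://pt-edge.onrender.com/api/v1/quality/transformers/joelbarmettlerUZH/ConceptFormer
```

Fetching the URL (e.g. with `urllib.request.urlopen` or `requests.get`) should return the same data as the curl call, subject to the daily rate limit noted above.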