Llm Scaling Architecture Transformer Models

There are 74 llm scaling architecture models tracked. 5 score above 50 (established tier). The highest-rated is jncraton/languagemodels at 61/100 with 1,197 stars.

Get all 74 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-scaling-architecture&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Model	Score	Tier	Stars	Language
1	jncraton/languagemodels Explore large language models in 512MB of RAM	61	Established	1,197	HTML
2	microsoft/unilm Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities	57	Established	22,042	Python
3	haizelabs/verdict Inference-time scaling for LLMs-as-a-judge.	55	Established	332	Jupyter Notebook
4	albertan017/LLM4Decompile Reverse Engineering: Decompiling Binary Code with Large Language Models	54	Established	6,407	Python
5	bytedance/Sa2VA Official Repo For Pixel-LLM Codebase	54	Established	1,558	Python
6	Cardinal-Operations/ORLM ORLM: Training Large Language Models for Optimization Modeling	47	Emerging	237	Python
7	sinanuozdemir/oreilly-optimizing-llms Optimizing LLMs with Fine-Tuning and Prompt Engineering	46	Emerging	88	Jupyter Notebook
8	JIA-Lab-research/LISA Project Page for "LISA: Reasoning Segmentation via Large Language Model"	45	Emerging	2,604	Python
9	Tencent-Hunyuan/GradLoc Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR...	45	Emerging	89	Python
10	Victorwz/LongMem Official implementation of our NeurIPS 2023 paper "Augmenting Language...	44	Emerging	822	Python
11	thunlp/InfLLM The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for...	42	Emerging	395	Python
12	skit-ai/SpeechLLM This repository contains the training, inference, evaluation code for...	40	Emerging	130	Python
13	yang-ai-lab/SleepLM SleepLM: Natural-Language Intelligence for Human Sleep	40	Emerging	29	Jupyter Notebook
14	JKevin17/TM-LLM The official code for "(ISCC 2025) Network Traffic Matrix Imputation via...	40	Emerging	6	Python
15	huggingface/datablations Scaling Data-Constrained Language Models	40	Emerging	342	Jupyter Notebook
16	UCSC-VLAA/m1 [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical...	39	Emerging	48	Jupyter Notebook
17	nercone-dev/zeta-llm-tool Fully Open-source LLM Tool	38	Emerging	5	Python
18	NiuTrans/LMT Building a inclusive, scalable, and high-performance multilingual translation model	38	Emerging	125	Python
19	sshh12/llm_optimize LLM Optimize is a proof-of-concept library for doing LLM (large language...	38	Emerging	61	Python
20	StupidTrees/SplitLLM Split Learning Simulation Framework for LLMs	37	Emerging	38	Python
21	WANGXinyiLinda/concept-based-demonstration-selection Offical code of the paper Large Language Models Are Implicitly Topic Models:...	37	Emerging	75	Python
22	locuslab/massive-activations Code accompanying the paper "Massive Activations in Large Language Models"	37	Emerging	197	Python
23	pdfosborne/elsciRL The core repository of the elsciRL framework.	37	Emerging	18	Python
24	mkuchnik/relm ReLM is a Regular Expression engine for Language Models	37	Emerging	107	Python
25	luohongyin/LangCode LangCode - Improving alignment and reasoning of large language models (LLMs)...	37	Emerging	49	Python
26	VityaVitalich/STASC [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models	37	Emerging	11	Jupyter Notebook
27	OSU-STARLAB/Simul-LLM [ACL 2024] An easily extensible framework for simultaneous, text-to-text...	36	Emerging	18	Python
28	martin-wey/peft-llm-code Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning...	36	Emerging	25	Python
29	luciusssss/ZhuangBench [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly	36	Emerging	25	Python
30	ai8hyf/llm_split_recall_test Split and Recall: A simple and efficient benchmark to evaluate in-context...	35	Emerging	9	Python
31	NiuTrans/LaMaTE Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine...	34	Emerging	28	Python
32	YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language...	33	Emerging	472	Python
33	Kitsunp/Prueba-de-modelo-de-ByteLatentTransformer Este es una prueba de concepto del paper mencionado de Meta junto a otros...	33	Emerging	8	Python
34	ZigeW/data_management_LLM Collection of training data management explorations for large language models	33	Emerging	337	—
35	QwenLM/ParScale Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling	32	Emerging	476	Python
36	ymoslem/Adaptive-MT-LLM Adaptive Machine Translation with Large Language Models	32	Emerging	32	JavaScript
37	zzz47zzz/codebase-for-incremental-learning-with-llm [ACL2024] A Codebase for Incremental Learning with Large Language Models;...	31	Emerging	60	Python
38	ryoungj/ObsScaling [NeurIPS'24 Spotlight] Observational Scaling Laws	31	Emerging	60	Jupyter Notebook
39	dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving Baseline achieving 0.8 accuracy on the private test set in the ZaloAI...	31	Emerging	24	Python
40	fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI This research examines the performance of Large Language Models (GPT-3.5...	31	Emerging	3	Jupyter Notebook
41	mubingshen/MLC-SLM-Baseline The project is associated with the recently-launched INTERSPEECH 2025...	30	Emerging	50	Python
42	yinzhangyue/EoT Exchange-of-Thought: Enhancing Large Language Model Capabilities through...	30	Emerging	21	Python
43	bminixhofer/zett Code for Zero-Shot Tokenizer Transfer	29	Experimental	143	Python
44	Butanium/llm-lang-agnostic minimal code to reproduce results from Separating Tongue from Thought:...	29	Experimental	13	Jupyter Notebook
45	Y-debug-sys/LMTE [INFOCOM 2026] Official Implementation of "LMTE: Putting the {Reasoning}...	28	Experimental	4	Python
46	rhubarbwu/linguistic-collapse Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models...	28	Experimental	18	Python
47	LSquaredM/mutual_info_scaling_law (NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for...	27	Experimental	13	Python
48	Y-Research-SBU/CSR Official Repository for CSR - ICML 2025 Oral	27	Experimental	21	Python
49	millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective Code: Attention Mechanisms Perspective: Exploring LLM Processing of...	26	Experimental	12	Jupyter Notebook
50	Dahouabdelhalim/CodeSeg Replication code for "Semantic Code Segmentation with Language Models"...	26	Experimental	1	Jupyter Notebook
51	hank0316/AdaSearch This includes the original implementation of "AdaSearch: Balancing...	24	Experimental	10	—
52	HKUSTDial/megatran [VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with...	23	Experimental	11	Python
53	IAAR-Shanghai/FastMem Fast Memorization of Prompt Improves Context Awareness of Large Language...	22	Experimental	24	Python
54	lime9903/SemanticHAR LLM-based Human Activity Recognition System	22	Experimental	1	Python
55	YutongWang1216/ReflectionLLMMT Code and data realeases for the paper -- TasTe: Teaching Large Language...	21	Experimental	13	Python
56	UKPLab/arxiv2025-inherent-limits-plms Code repository for the paper "The Inherent Limits of Pretrained LLMs: The...	21	Experimental	13	Python
57	Xiaohao-Yang/LLM-ITL [ACL 2025 Main] Neural Topic Modeling with Large Language Models in the Loop	21	Experimental	11	Python
58	EastTower16/LLMDataDistill distill large scale web page text	21	Experimental	12	C++
59	efficientscaling/Z1 [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"	20	Experimental	68	Python
60	eminorhan/llm-memory Memory experiments with LLMs	20	Experimental	10	Python
61	ictnlp/FastLongSpeech FastLongSpeech is a novel framework designed to extend the capabilities of...	20	Experimental	14	Python
62	sky24h/Training-Free_Zero-Shot_Semantic_Segmentation_with_LLM_Refinement This repository contains official implementation of the paper "Training-Free...	20	Experimental	5	Jupyter Notebook
63	wyt2000/InverseCoder [AAAI 2025] The official code of the paper "InverseCoder: Unleashing the...	19	Experimental	14	Python
64	GeorgeVern/lmcor Code for the EACL 2024 paper: "Small Language Models Improve Giants by...	19	Experimental	12	Python
65	vitorhcsousa/llm-w-mlx Large Language Models with MLX	17	Experimental	1	Python
66	MaLA-LM/emma-500 EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models	17	Experimental	4	Python
67	ikeasamoahansah/univ-model A Universal Document Understanding Model (UDUM) which accepts various file types	17	Experimental	1	Jupyter Notebook
68	Keytoyze/JumpCoder Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via...	15	Experimental	27	Python
69	vishvaRam/Data-Prep-for-LLM-fine-tuning This repository helps prepare datasets for fine-tuning Large Language Models...	14	Experimental	1	Jupyter Notebook
70	VITA-Group/Data-Efficient-Scaling [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao...	13	Experimental	14	Python
71	supersimple33/Scaling-Laws A method for calculating scaling laws for LLMs from publicly available models	13	Experimental	9	Python
72	MaLA-LM/mala-500 MaLA-500: Massive Language Adaptation of Large Language Models	12	Experimental	5	Python
73	vocaliodmiku/SLI-LL Repository of the paper: "Spoken Language Intelligence of Large Language...	11	Experimental	4	—
74	pbevan1/multilingual-constitutional-ai Implementation for "Multilingual Constitutional AI"	10	Experimental	2	Jupyter Notebook