LLM Scaling Architecture Transformer Models
74 LLM scaling architecture models are tracked. Five score above 50 (Established tier). The highest-rated is jncraton/languagemodels at 61/100 with 1,197 stars.
Get the projects as JSON (the sample request below sets `limit=20`):
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-scaling-architecture&limit=20"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
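The same request can be made from a script. The following is a minimal sketch in Python using only the standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented here, so the sketch returns the parsed payload without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="llm-scaling-architecture",
              limit=20):
    """Build the request URL with the same query parameters as the curl example."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and parse the JSON payload.

    The response schema is an unknown here, so the parsed object is
    returned as-is rather than indexed by assumed field names.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one request against the daily quota. Whether a larger `limit` value returns all 74 entries in a single call is an assumption that should be checked against the API itself.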
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | jncraton/languagemodels<br>Explore large language models in 512MB of RAM | 61 | Established |
| 2 | microsoft/unilm<br>Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities | | Established |
| 3 | haizelabs/verdict<br>Inference-time scaling for LLMs-as-a-judge. | | Established |
| 4 | albertan017/LLM4Decompile<br>Reverse Engineering: Decompiling Binary Code with Large Language Models | | Established |
| 5 | bytedance/Sa2VA<br>Official Repo For Pixel-LLM Codebase | | Established |
| 6 | Cardinal-Operations/ORLM<br>ORLM: Training Large Language Models for Optimization Modeling | | Emerging |
| 7 | sinanuozdemir/oreilly-optimizing-llms<br>Optimizing LLMs with Fine-Tuning and Prompt Engineering | | Emerging |
| 8 | JIA-Lab-research/LISA<br>Project Page for "LISA: Reasoning Segmentation via Large Language Model" | | Emerging |
| 9 | Tencent-Hunyuan/GradLoc<br>Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR... | | Emerging |
| 10 | Victorwz/LongMem<br>Official implementation of our NeurIPS 2023 paper "Augmenting Language... | | Emerging |
| 11 | thunlp/InfLLM<br>The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for... | | Emerging |
| 12 | skit-ai/SpeechLLM<br>This repository contains the training, inference, evaluation code for... | | Emerging |
| 13 | yang-ai-lab/SleepLM<br>SleepLM: Natural-Language Intelligence for Human Sleep | | Emerging |
| 14 | JKevin17/TM-LLM<br>The official code for "(ISCC 2025) Network Traffic Matrix Imputation via... | | Emerging |
| 15 | huggingface/datablations<br>Scaling Data-Constrained Language Models | | Emerging |
| 16 | UCSC-VLAA/m1<br>[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical... | | Emerging |
| 17 | nercone-dev/zeta-llm-tool<br>Fully Open-source LLM Tool | | Emerging |
| 18 | NiuTrans/LMT<br>Building an inclusive, scalable, and high-performance multilingual translation model | | Emerging |
| 19 | sshh12/llm_optimize<br>LLM Optimize is a proof-of-concept library for doing LLM (large language... | | Emerging |
| 20 | StupidTrees/SplitLLM<br>Split Learning Simulation Framework for LLMs | | Emerging |
| 21 | WANGXinyiLinda/concept-based-demonstration-selection<br>Official code of the paper Large Language Models Are Implicitly Topic Models:... | | Emerging |
| 22 | locuslab/massive-activations<br>Code accompanying the paper "Massive Activations in Large Language Models" | | Emerging |
| 23 | pdfosborne/elsciRL<br>The core repository of the elsciRL framework. | | Emerging |
| 24 | mkuchnik/relm<br>ReLM is a Regular Expression engine for Language Models | | Emerging |
| 25 | luohongyin/LangCode<br>LangCode - Improving alignment and reasoning of large language models (LLMs)... | | Emerging |
| 26 | VityaVitalich/STASC<br>[ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models | | Emerging |
| 27 | OSU-STARLAB/Simul-LLM<br>[ACL 2024] An easily extensible framework for simultaneous, text-to-text... | | Emerging |
| 28 | martin-wey/peft-llm-code<br>Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning... | | Emerging |
| 29 | luciusssss/ZhuangBench<br>[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly | | Emerging |
| 30 | ai8hyf/llm_split_recall_test<br>Split and Recall: A simple and efficient benchmark to evaluate in-context... | | Emerging |
| 31 | NiuTrans/LaMaTE<br>Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine... | | Emerging |
| 32 | YuanGongND/ltu<br>Code, Dataset, and Pretrained Models for Audio and Speech Large Language... | | Emerging |
| 33 | Kitsunp/Prueba-de-modelo-de-ByteLatentTransformer<br>This is a proof of concept of the referenced Meta paper along with other... | | Emerging |
| 34 | ZigeW/data_management_LLM<br>Collection of training data management explorations for large language models | | Emerging |
| 35 | QwenLM/ParScale<br>Parallel Scaling Law for Language Model: Beyond Parameter and Inference Time Scaling | | Emerging |
| 36 | ymoslem/Adaptive-MT-LLM<br>Adaptive Machine Translation with Large Language Models | | Emerging |
| 37 | zzz47zzz/codebase-for-incremental-learning-with-llm<br>[ACL2024] A Codebase for Incremental Learning with Large Language Models;... | | Emerging |
| 38 | ryoungj/ObsScaling<br>[NeurIPS'24 Spotlight] Observational Scaling Laws | | Emerging |
| 39 | dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving<br>Baseline achieving 0.8 accuracy on the private test set in the ZaloAI... | | Emerging |
| 40 | fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI<br>This research examines the performance of Large Language Models (GPT-3.5... | | Emerging |
| 41 | mubingshen/MLC-SLM-Baseline<br>The project is associated with the recently-launched INTERSPEECH 2025... | | Emerging |
| 42 | yinzhangyue/EoT<br>Exchange-of-Thought: Enhancing Large Language Model Capabilities through... | | Emerging |
| 43 | bminixhofer/zett<br>Code for Zero-Shot Tokenizer Transfer | | Experimental |
| 44 | Butanium/llm-lang-agnostic<br>Minimal code to reproduce results from Separating Tongue from Thought:... | | Experimental |
| 45 | Y-debug-sys/LMTE<br>[INFOCOM 2026] Official Implementation of "LMTE: Putting the {Reasoning}... | | Experimental |
| 46 | rhubarbwu/linguistic-collapse<br>Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models... | | Experimental |
| 47 | LSquaredM/mutual_info_scaling_law<br>(NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for... | | Experimental |
| 48 | Y-Research-SBU/CSR<br>Official Repository for CSR - ICML 2025 Oral | | Experimental |
| 49 | millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective<br>Code: Attention Mechanisms Perspective: Exploring LLM Processing of... | | Experimental |
| 50 | Dahouabdelhalim/CodeSeg<br>Replication code for "Semantic Code Segmentation with Language Models"... | | Experimental |
| 51 | hank0316/AdaSearch<br>This includes the original implementation of "AdaSearch: Balancing... | | Experimental |
| 52 | HKUSTDial/megatran<br>[VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with... | | Experimental |
| 53 | IAAR-Shanghai/FastMem<br>Fast Memorization of Prompt Improves Context Awareness of Large Language... | | Experimental |
| 54 | lime9903/SemanticHAR<br>LLM-based Human Activity Recognition System | | Experimental |
| 55 | YutongWang1216/ReflectionLLMMT<br>Code and data releases for the paper -- TasTe: Teaching Large Language... | | Experimental |
| 56 | UKPLab/arxiv2025-inherent-limits-plms<br>Code repository for the paper "The Inherent Limits of Pretrained LLMs: The... | | Experimental |
| 57 | Xiaohao-Yang/LLM-ITL<br>[ACL 2025 Main] Neural Topic Modeling with Large Language Models in the Loop | | Experimental |
| 58 | EastTower16/LLMDataDistill<br>Distill large-scale web page text | | Experimental |
| 59 | efficientscaling/Z1<br>[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" | | Experimental |
| 60 | eminorhan/llm-memory<br>Memory experiments with LLMs | | Experimental |
| 61 | ictnlp/FastLongSpeech<br>FastLongSpeech is a novel framework designed to extend the capabilities of... | | Experimental |
| 62 | sky24h/Training-Free_Zero-Shot_Semantic_Segmentation_with_LLM_Refinement<br>This repository contains official implementation of the paper "Training-Free... | | Experimental |
| 63 | wyt2000/InverseCoder<br>[AAAI 2025] The official code of the paper "InverseCoder: Unleashing the... | | Experimental |
| 64 | GeorgeVern/lmcor<br>Code for the EACL 2024 paper: "Small Language Models Improve Giants by... | | Experimental |
| 65 | vitorhcsousa/llm-w-mlx<br>Large Language Models with MLX | | Experimental |
| 66 | MaLA-LM/emma-500<br>EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models | | Experimental |
| 67 | ikeasamoahansah/univ-model<br>A Universal Document Understanding Model (UDUM) which accepts various file types | | Experimental |
| 68 | Keytoyze/JumpCoder<br>Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via... | | Experimental |
| 69 | vishvaRam/Data-Prep-for-LLM-fine-tuning<br>This repository helps prepare datasets for fine-tuning Large Language Models... | | Experimental |
| 70 | VITA-Group/Data-Efficient-Scaling<br>[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao... | | Experimental |
| 71 | supersimple33/Scaling-Laws<br>A method for calculating scaling laws for LLMs from publicly available models | | Experimental |
| 72 | MaLA-LM/mala-500<br>MaLA-500: Massive Language Adaptation of Large Language Models | | Experimental |
| 73 | vocaliodmiku/SLI-LL<br>Repository of the paper: "Spoken Language Intelligence of Large Language... | | Experimental |
| 74 | pbevan1/multilingual-constitutional-ai<br>Implementation for "Multilingual Constitutional AI" | | Experimental |