LLM Scaling Architecture Transformer Models

74 LLM scaling architecture models are tracked. Five score above 50 (Established tier). The highest-rated is jncraton/languagemodels at 61/100 with 1,197 stars.

Get all 74 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-scaling-architecture&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
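The same request can be issued from a script. The following is a minimal sketch using only the Python standard library. The endpoint and the `domain`, `subcategory`, and `limit` parameters are copied from the curl command above; the helper names `build_url` and `fetch_quality` are hypothetical, and the shape of the JSON response is not documented here, so the sketch returns whatever the server sends without assuming field names. Whether `limit` accepts values above 20 is also not stated on this page.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken verbatim from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="llm-scaling-architecture",
              limit=20):
    """Build the query URL; the defaults mirror the curl example."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=10):
    """Fetch the URL and decode the body as JSON.

    The response schema is an unknown here, so the decoded
    object is returned as-is for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Printing the URL makes no network request.
print(build_url())
```

Calling `fetch_quality(build_url())` performs the actual request and counts against the daily quota described above.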

| # | Model | Description | Score | Tier |
|---|---|---|---|---|
| 1 | jncraton/languagemodels | Explore large language models in 512MB of RAM | 61 | Established |
| 2 | microsoft/unilm | Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities | 57 | Established |
| 3 | haizelabs/verdict | Inference-time scaling for LLMs-as-a-judge. | 55 | Established |
| 4 | albertan017/LLM4Decompile | Reverse Engineering: Decompiling Binary Code with Large Language Models | 54 | Established |
| 5 | bytedance/Sa2VA | Official Repo For Pixel-LLM Codebase | 54 | Established |
| 6 | Cardinal-Operations/ORLM | ORLM: Training Large Language Models for Optimization Modeling | 47 | Emerging |
| 7 | sinanuozdemir/oreilly-optimizing-llms | Optimizing LLMs with Fine-Tuning and Prompt Engineering | 46 | Emerging |
| 8 | JIA-Lab-research/LISA | Project Page for "LISA: Reasoning Segmentation via Large Language Model" | 45 | Emerging |
| 9 | Tencent-Hunyuan/GradLoc | Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR... | 45 | Emerging |
| 10 | Victorwz/LongMem | Official implementation of our NeurIPS 2023 paper "Augmenting Language... | 44 | Emerging |
| 11 | thunlp/InfLLM | The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for... | 42 | Emerging |
| 12 | skit-ai/SpeechLLM | This repository contains the training, inference, evaluation code for... | 40 | Emerging |
| 13 | yang-ai-lab/SleepLM | SleepLM: Natural-Language Intelligence for Human Sleep | 40 | Emerging |
| 14 | JKevin17/TM-LLM | The official code for "(ISCC 2025) Network Traffic Matrix Imputation via... | 40 | Emerging |
| 15 | huggingface/datablations | Scaling Data-Constrained Language Models | 40 | Emerging |
| 16 | UCSC-VLAA/m1 | [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical... | 39 | Emerging |
| 17 | nercone-dev/zeta-llm-tool | Fully Open-source LLM Tool | 38 | Emerging |
| 18 | NiuTrans/LMT | Building an inclusive, scalable, and high-performance multilingual translation model | 38 | Emerging |
| 19 | sshh12/llm_optimize | LLM Optimize is a proof-of-concept library for doing LLM (large language... | 38 | Emerging |
| 20 | StupidTrees/SplitLLM | Split Learning Simulation Framework for LLMs | 37 | Emerging |
| 21 | WANGXinyiLinda/concept-based-demonstration-selection | Official code of the paper Large Language Models Are Implicitly Topic Models:... | 37 | Emerging |
| 22 | locuslab/massive-activations | Code accompanying the paper "Massive Activations in Large Language Models" | 37 | Emerging |
| 23 | pdfosborne/elsciRL | The core repository of the elsciRL framework. | 37 | Emerging |
| 24 | mkuchnik/relm | ReLM is a Regular Expression engine for Language Models | 37 | Emerging |
| 25 | luohongyin/LangCode | LangCode - Improving alignment and reasoning of large language models (LLMs)... | 37 | Emerging |
| 26 | VityaVitalich/STASC | [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models | 37 | Emerging |
| 27 | OSU-STARLAB/Simul-LLM | [ACL 2024] An easily extensible framework for simultaneous, text-to-text... | 36 | Emerging |
| 28 | martin-wey/peft-llm-code | Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning... | 36 | Emerging |
| 29 | luciusssss/ZhuangBench | [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly | 36 | Emerging |
| 30 | ai8hyf/llm_split_recall_test | Split and Recall: A simple and efficient benchmark to evaluate in-context... | 35 | Emerging |
| 31 | NiuTrans/LaMaTE | Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine... | 34 | Emerging |
| 32 | YuanGongND/ltu | Code, Dataset, and Pretrained Models for Audio and Speech Large Language... | 33 | Emerging |
| 33 | Kitsunp/Prueba-de-modelo-de-ByteLatentTransformer | This is a proof of concept of the mentioned Meta paper along with other... | 33 | Emerging |
| 34 | ZigeW/data_management_LLM | Collection of training data management explorations for large language models | 33 | Emerging |
| 35 | QwenLM/ParScale | Parallel Scaling Law for Language Model: Beyond Parameter and Inference Time Scaling | 32 | Emerging |
| 36 | ymoslem/Adaptive-MT-LLM | Adaptive Machine Translation with Large Language Models | 32 | Emerging |
| 37 | zzz47zzz/codebase-for-incremental-learning-with-llm | [ACL2024] A Codebase for Incremental Learning with Large Language Models;... | 31 | Emerging |
| 38 | ryoungj/ObsScaling | [NeurIPS'24 Spotlight] Observational Scaling Laws | 31 | Emerging |
| 39 | dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving | Baseline achieving 0.8 accuracy on the private test set in the ZaloAI... | 31 | Emerging |
| 40 | fatemafaria142/Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI | This research examines the performance of Large Language Models (GPT-3.5... | 31 | Emerging |
| 41 | mubingshen/MLC-SLM-Baseline | The project is associated with the recently-launched INTERSPEECH 2025... | 30 | Emerging |
| 42 | yinzhangyue/EoT | Exchange-of-Thought: Enhancing Large Language Model Capabilities through... | 30 | Emerging |
| 43 | bminixhofer/zett | Code for Zero-Shot Tokenizer Transfer | 29 | Experimental |
| 44 | Butanium/llm-lang-agnostic | Minimal code to reproduce results from Separating Tongue from Thought:... | 29 | Experimental |
| 45 | Y-debug-sys/LMTE | [INFOCOM 2026] Official Implementation of "LMTE: Putting the {Reasoning}... | 28 | Experimental |
| 46 | rhubarbwu/linguistic-collapse | Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models... | 28 | Experimental |
| 47 | LSquaredM/mutual_info_scaling_law | (NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for... | 27 | Experimental |
| 48 | Y-Research-SBU/CSR | Official Repository for CSR - ICML 2025 Oral | 27 | Experimental |
| 49 | millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective | Code: Attention Mechanisms Perspective: Exploring LLM Processing of... | 26 | Experimental |
| 50 | Dahouabdelhalim/CodeSeg | Replication code for "Semantic Code Segmentation with Language Models"... | 26 | Experimental |
| 51 | hank0316/AdaSearch | This includes the original implementation of "AdaSearch: Balancing... | 24 | Experimental |
| 52 | HKUSTDial/megatran | [VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with... | 23 | Experimental |
| 53 | IAAR-Shanghai/FastMem | Fast Memorization of Prompt Improves Context Awareness of Large Language... | 22 | Experimental |
| 54 | lime9903/SemanticHAR | LLM-based Human Activity Recognition System | 22 | Experimental |
| 55 | YutongWang1216/ReflectionLLMMT | Code and data releases for the paper -- TasTe: Teaching Large Language... | 21 | Experimental |
| 56 | UKPLab/arxiv2025-inherent-limits-plms | Code repository for the paper "The Inherent Limits of Pretrained LLMs: The... | 21 | Experimental |
| 57 | Xiaohao-Yang/LLM-ITL | [ACL 2025 Main] Neural Topic Modeling with Large Language Models in the Loop | 21 | Experimental |
| 58 | EastTower16/LLMDataDistill | Distill large-scale web page text | 21 | Experimental |
| 59 | efficientscaling/Z1 | [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" | 20 | Experimental |
| 60 | eminorhan/llm-memory | Memory experiments with LLMs | 20 | Experimental |
| 61 | ictnlp/FastLongSpeech | FastLongSpeech is a novel framework designed to extend the capabilities of... | 20 | Experimental |
| 62 | sky24h/Training-Free_Zero-Shot_Semantic_Segmentation_with_LLM_Refinement | This repository contains official implementation of the paper "Training-Free... | 20 | Experimental |
| 63 | wyt2000/InverseCoder | [AAAI 2025] The official code of the paper "InverseCoder: Unleashing the... | 19 | Experimental |
| 64 | GeorgeVern/lmcor | Code for the EACL 2024 paper: "Small Language Models Improve Giants by... | 19 | Experimental |
| 65 | vitorhcsousa/llm-w-mlx | Large Language Models with MLX | 17 | Experimental |
| 66 | MaLA-LM/emma-500 | EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models | 17 | Experimental |
| 67 | ikeasamoahansah/univ-model | A Universal Document Understanding Model (UDUM) which accepts various file types | 17 | Experimental |
| 68 | Keytoyze/JumpCoder | Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via... | 15 | Experimental |
| 69 | vishvaRam/Data-Prep-for-LLM-fine-tuning | This repository helps prepare datasets for fine-tuning Large Language Models... | 14 | Experimental |
| 70 | VITA-Group/Data-Efficient-Scaling | [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao... | 13 | Experimental |
| 71 | supersimple33/Scaling-Laws | A method for calculating scaling laws for LLMs from publicly available models | 13 | Experimental |
| 72 | MaLA-LM/mala-500 | MaLA-500: Massive Language Adaptation of Large Language Models | 12 | Experimental |
| 73 | vocaliodmiku/SLI-LL | Repository of the paper: "Spoken Language Intelligence of Large Language... | 11 | Experimental |
| 74 | pbevan1/multilingual-constitutional-ai | Implementation for "Multilingual Constitutional AI" | 10 | Experimental |
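The tier labels in the listing appear to follow simple score cutoffs. The sketch below encodes cutoffs inferred from the listed rows only: every score above 50 is labeled Established, scores from 30 to 47 are labeled Emerging, and scores of 29 and below are labeled Experimental. The function name `tier_for` is hypothetical, the site does not state its exact thresholds here, and no listed project scores between 48 and 53, so the treatment of that range (including exactly 50) is an assumption.

```python
def tier_for(score: int) -> str:
    """Map a 0-100 score to a tier label.

    Cutoffs are inferred from the listing, not from documented rules:
    "above 50" comes from the page summary, and the 30/29 boundary
    comes from the lowest Emerging and highest Experimental rows.
    """
    if score > 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Boundary rows copied from the listing.
rows = [
    ("jncraton/languagemodels", 61),
    ("bytedance/Sa2VA", 54),
    ("Cardinal-Operations/ORLM", 47),
    ("yinzhangyue/EoT", 30),
    ("bminixhofer/zett", 29),
    ("pbevan1/multilingual-constitutional-ai", 10),
]

for name, score in rows:
    print(f"{name}: {score} -> {tier_for(score)}")
```

Comparing the output with the Tier values in the listing is a quick consistency check if the data is re-scraped or fetched from the API.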