LLM Scaling Architecture LLM Tools

Research implementations and codebases focused on scaling language models across languages, sequence lengths, and parameters—including multilingual adaptation, embedding optimization, and architectural innovations for handling massive model capacity. Does NOT include deployment infrastructure, inference optimization, or general LLM applications.

There are 42 LLM scaling architecture tools tracked. One scores above 50 (Established tier). The highest-rated is aalok-sathe/surprisal at 52/100 with 51 stars. One of the top 10 is actively maintained.

Get all 42 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-scaling-architecture&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
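For scripted access, the same request can be sketched in Python using only the standard library. The endpoint and the three query parameters are copied from the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and the response is simply parsed as JSON because its schema is not documented here. The curl example uses `limit=20`. Whether a larger `limit` returns all 42 projects in one call is an assumption that has not been verified.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken verbatim from the curl example in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the request URL; only parameters shown in the curl example are used."""
    params = {
        "domain": "llm-tools",
        "subcategory": "llm-scaling-architecture",
        "limit": limit,  # 20 mirrors the curl example; larger values are untested
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and decode the JSON payload (hypothetical helper; schema unknown)."""
    with urlopen(build_url(limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one unauthenticated request, which counts against the 100 requests/day allowance mentioned above.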

# Tool Score Tier
1 aalok-sathe/surprisal

A unified interface for computing surprisal (log probabilities) from...

52
Established
2 EvolvingLMMs-Lab/lmms-engine

A simple, unified multimodal models training engine. Lean, flexible, and...

46
Emerging
3 FunnySaltyFish/Better-Ruozhiba

[Item-by-item processing complete] A QA dataset of curated Ruozhiba questions, with every entry manually reviewed and revised

45
Emerging
4 reasoning-machines/pal

PaL: Program-Aided Language Models (ICML 2023)

45
Emerging
5 microsoft/monitors4codegen

Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of...

43
Emerging
6 YutongWang1216/DocMTAgent

Code and data releases for the paper -- DelTA: An Online Document-Level...

39
Emerging
7 FreedomIntelligence/EchoX

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for...

39
Emerging
8 merantix-momentum/acip

🗜️Codebase of the ACIP algorithm 🗜️

36
Emerging
9 Mxoder/Maxs-Awesome-Datasets

Max's awesome datasets

33
Emerging
10 ch3njust1n/smart

Self-modifying code at runtime with Large Language Models

33
Emerging
11 apenab/pyrlm-runtime

Minimal runtime for Recursive Language Models (RLMs) inspired by the MIT...

32
Emerging
12 ZetangForward/CSA-GEC

This is the official code for ``Beyond Hard Samples: Robust and Effective...

31
Emerging
13 zhiyuanpeng/SPTAR

Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models

30
Emerging
14 farukalpay/ISO-639-2023

large language model

30
Emerging
15 zjunlp/LookAheadTuning

[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews

28
Experimental
16 GeorgeVern/qe-fusion

This repo contains the code for the paper "Don't Rank, Combine! Combining...

26
Experimental
17 a-m-team/a-m-models

a-m-team's exploration in large language modeling

25
Experimental
18 nitinvetcha/DeGAML-LLM

DeGAML-LLM: Decoupling Generalization and Adaptation in Meta-Learning for...

25
Experimental
19 Lucky-Wang-Chenlong/CodeSync

[ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code...

24
Experimental
20 PrithwishJana/CoTran

Official repository for CoTran: An LLM-based code translator for...

23
Experimental
21 ictnlp/StreamUni

StreamUni is a framework that efficiently enables unified Large...

23
Experimental
22 LARK-AI-Lab/CodeScaler

The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time...

23
Experimental
23 WSE-research/Code2Code-Translations-using-LLMs-ENASE-2026

The repository to the paper Code2Code Translations using LLMs

23
Experimental
24 burcgokden/PLDR-LLM-Self-Organized-Criticality

Code used in paper titled "PLDR-LLMs Reason at Self-Organized Criticality"

21
Experimental
25 JingyingHu/ChineseL2Writing-Surprisals

Materials and code for Hu and Cong (2025) - Modeling Chinese L2 Writing...

21
Experimental
26 hmyousuf2010/bodh

A morphology-aware Bengali tokenizer for large language models.

21
Experimental
27 aakarsh/rl-llm-calibration-test

Attempt at replication of the parts of the paper "Language models (mostly)...

21
Experimental
28 AidanCooper/constrained-decoding

A guide to structured generation using constrained decoding

21
Experimental
29 tony10101105/ExpEmergence

[ICLR'25] U-shaped and Inverted-U Scaling behind Emergent Abilities of Large...

19
Experimental
30 isaacwiafe/speech_data_ghana_ug

The dataset comprises a 5,000-hour speech corpus in Akan, Ewe, Dagbani,...

17
Experimental
31 originaonxi/prm-replication

Live proof of arXiv:2603.17815 — O(N) confirmed R²=0.952, 1,984 API calls

15
Experimental
32 j-frei/CFG4FHIR

Context-Free Grammar-guided Generation of FHIR Resources Using Large Language Models

14
Experimental
33 Vidit-Ostwal/RLM-demo

Recursive Language Model Demo

13
Experimental
34 lindeng0/Replication-of-LARGE-LANGUAGE-MODELS-AN-APPLIED-ECONOMETRIC-FRAMEWORK

Replication of LLM econometric framework: leakage checks, prompt/model...

13
Experimental
35 sunwang-ai-linguist/bilingual-rlhf-semantic-repair-corpus

Daily Mandarin-English semantic alignment corpus for RLHF training, tone...

13
Experimental
36 aliasgar-m/Inventory-Opt-LLM

A comparison between Large Language Models for Inventory Optimization

13
Experimental
37 ymgw55/repro-superposition

Unofficial implementation to reproduce the experiments from "Superposition...

13
Experimental
38 sharmavasu/SMaRT

SMaRT (Small Model Reinforced Tuning) is a two-stage approach that...

12
Experimental
39 ChenDelong1999/Linguistic-Similarity

Official repo of paper "Linguistic Minimal Pairs Elicit Linguistic...

12
Experimental
40 zengikun/CXK_IKUN_Dataset

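Cai Xukun fine-tuning dataset. Contains about 100 entries on Cai Xukun, "xiao heizi" (his mock-fans), and related memes. It can be used for model fine-tuning or mixed into other datasets so that the model can play along with Kunkun memes.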

11
Experimental
41 Mwaniki-Kanyi/The.Pentagon.Movement

HARNESSING SEQ2SEQ vs CAUSAL-LLM MODELS.

11
Experimental
42 ArthurSpirling/LargeLanguageReplication

Replication for Language Models

11
Experimental
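
The tier labels in the table are consistent with simple score cutoffs. The sketch below encodes one reading of them: "above 50" for Established comes from the summary text, while the 30-point boundary between Emerging and Experimental is only inferred from the listed rows (30 is Emerging, 28 is Experimental). The function name `tier_for` and the exact boundaries are assumptions, not the site's documented scoring rules.

```python
def tier_for(score):
    """Map a 0-100 quality score to a tier label (inferred cutoffs, not official)."""
    if score > 50:    # the summary says scores "above 50" are Established
        return "Established"
    if score >= 30:   # assumed boundary: lowest Emerging row in the table scores 30
        return "Emerging"
    return "Experimental"  # highest Experimental row in the table scores 28
```

Under these assumed cutoffs, every row in the table above maps to the tier shown next to it.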