LLM Scaling Architecture LLM Tools
Research implementations and codebases focused on scaling language models across languages, sequence lengths, and parameters—including multilingual adaptation, embedding optimization, and architectural innovations for handling massive model capacity. Does NOT include deployment infrastructure, inference optimization, or general LLM applications.
There are 42 llm scaling architecture tools tracked. 1 score above 50 (established tier). The highest-rated is aalok-sathe/surprisal at 52/100 with 51 stars. 1 of the top 10 are actively maintained.
Get all 42 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-scaling-architecture&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
aalok-sathe/surprisal
A unified interface for computing surprisal (log probabilities) from... |
|
Established |
| 2 |
EvolvingLMMs-Lab/lmms-engine
A simple, unified multimodal models training engine. Lean, flexible, and... |
|
Emerging |
| 3 |
FunnySaltyFish/Better-Ruozhiba
【逐条处理完成】人为审核+修改每一条的弱智吧精选问题QA数据集 |
|
Emerging |
| 4 |
reasoning-machines/pal
PaL: Program-Aided Language Models (ICML 2023) |
|
Emerging |
| 5 |
microsoft/monitors4codegen
Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of... |
|
Emerging |
| 6 |
YutongWang1216/DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level... |
|
Emerging |
| 7 |
FreedomIntelligence/EchoX
EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for... |
|
Emerging |
| 8 |
merantix-momentum/acip
🗜️Codebase of the ACIP algorithm 🗜️ |
|
Emerging |
| 9 |
Mxoder/Maxs-Awesome-Datasets
Max的有趣数据集 / Max's awesome datasets |
|
Emerging |
| 10 |
ch3njust1n/smart
Self-modifying code at runtime with Large Language Models |
|
Emerging |
| 11 |
apenab/pyrlm-runtime
Minimal runtime for Recursive Language Models (RLMs) inspired by the MIT... |
|
Emerging |
| 12 |
ZetangForward/CSA-GEC
This is the official code for ``Beyond Hard Samples: Robust and Effective... |
|
Emerging |
| 13 |
zhiyuanpeng/SPTAR
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models |
|
Emerging |
| 14 |
farukalpay/ISO-639-2023
large language model |
|
Emerging |
| 15 |
zjunlp/LookAheadTuning
[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews |
|
Experimental |
| 16 |
GeorgeVern/qe-fusion
This repo contains the code for the paper "Don't Rank, Combine! Combining... |
|
Experimental |
| 17 |
a-m-team/a-m-models
a-m-team's exploration in large language modeling |
|
Experimental |
| 18 |
nitinvetcha/DeGAML-LLM
DeGAML-LLM: Decoupling Generalization and Adaptation in Meta-Learning for... |
|
Experimental |
| 19 |
Lucky-Wang-Chenlong/CodeSync
[ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code... |
|
Experimental |
| 20 |
PrithwishJana/CoTran
Official repository for CoTran: An LLM-based code translator for... |
|
Experimental |
| 21 |
ictnlp/StreamUni
StreamUni is a framework that efficiently enables unified Large... |
|
Experimental |
| 22 |
LARK-AI-Lab/CodeScaler
The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time... |
|
Experimental |
| 23 |
WSE-research/Code2Code-Translations-using-LLMs-ENASE-2026
The repository to the paper Code2Code Translations using LLMs |
|
Experimental |
| 24 |
burcgokden/PLDR-LLM-Self-Organized-Criticality
Code used in paper titled "PLDR-LLMs Reason at Self-Organized Criticality" |
|
Experimental |
| 25 |
JingyingHu/ChineseL2Writing-Surprisals
Materials and code for Hu and Cong (2025) - Modeling Chinese L2 Writing... |
|
Experimental |
| 26 |
hmyousuf2010/bodh
A morphology-aware Bengali tokenizer for large language models. |
|
Experimental |
| 27 |
aakarsh/rl-llm-calibration-test
Attempt at replication of the parts of the paper "Language models (mostly)... |
|
Experimental |
| 28 |
AidanCooper/constrained-decoding
A guide to structured generation using constrained decoding |
|
Experimental |
| 29 |
tony10101105/ExpEmergence
[ICLR'25] U-shaped and Inverted-U Scaling behind Emergent Abilities of Large... |
|
Experimental |
| 30 |
isaacwiafe/speech_data_ghana_ug
The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani,... |
|
Experimental |
| 31 |
originaonxi/prm-replication
Live proof of arXiv:2603.17815 — O(N) confirmed R²=0.952, 1,984 API calls |
|
Experimental |
| 32 |
j-frei/CFG4FHIR
Context-Free Grammar-guided Generation of FHIR Resources Using Large Language Models |
|
Experimental |
| 33 |
Vidit-Ostwal/RLM-demo
Recursive Language Model Demo |
|
Experimental |
| 34 |
lindeng0/Replication-of-LARGE-LANGUAGE-MODELS-AN-APPLIED-ECONOMETRIC-FRAMEWORK
Replication of LLM econometric framework: leakage checks, prompt/model... |
|
Experimental |
| 35 |
sunwang-ai-linguist/bilingual-rlhf-semantic-repair-corpus
Daily Mandarin-English semantic alignment corpus for RLHF training, tone... |
|
Experimental |
| 36 |
aliasgar-m/Inventory-Opt-LLM
A comparison between Large Language Models for Inventory Optimization |
|
Experimental |
| 37 |
ymgw55/repro-superposition
Unofficial implementation to reproduce the experiments from "Superposition... |
|
Experimental |
| 38 |
sharmavasu/SMaRT
SMaRT (Small Model Reinforced Tuning) is a two-stage approach that... |
|
Experimental |
| 39 |
ChenDelong1999/Linguistic-Similarity
Official repo of paper "Linguistic Minimal Pairs Elicit Linguistic... |
|
Experimental |
| 40 |
zengikun/CXK_IKUN_Dataset
蔡徐坤微调模型数据集 里面包含了约100条有关于蔡徐坤,小黑子,玩梗的数据,可以用于模型微调,或者可以混合进其他数据集里,使得模型会玩坤坤的梗 |
|
Experimental |
| 41 |
Mwaniki-Kanyi/The.Pentagon.Movement
HARNESSING SEQ2SEQ vs CASUAL-LLM MODELS. |
|
Experimental |
| 42 |
ArthurSpirling/LargeLanguageReplication
Replication for Language Models |
|
Experimental |