QwenLM/ParScale
Parallel Scaling Law for Language Models — Beyond Parameter and Inference Time Scaling
ParScale introduces a new way to make large language models (LLMs) more capable without drastically increasing their size or slowing them down. By running multiple variations of an input through the model in parallel, it generates richer, more accurate outputs, particularly for complex tasks like coding or math. This is designed for AI researchers and engineers who build and deploy LLMs.
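The core idea above — several transformed copies of one input flowing through the model at once, with the outputs combined — can be sketched in a few lines. This is a hypothetical toy illustration, not the actual QwenLM/ParScale implementation: `toy_model`, the scalar "prefixes", and the aggregation weights are all stand-ins invented for this example.

```python
import math

def toy_model(x):
    # Stand-in for one frozen forward pass of an LLM: maps an input
    # scalar to a small "logit" vector.
    return [math.tanh(x * k) for k in (1.0, 2.0, 3.0)]

def parscale_forward(x, prefixes, agg_logits):
    # prefixes: P learnable input transformations (here simple offsets).
    # Each transformed input gets its own forward pass; in a real system
    # these P passes run in parallel as one batched call.
    outputs = [toy_model(x + p) for p in prefixes]
    # Softmax over learned aggregation logits -> P weights summing to 1.
    m = max(agg_logits)
    exps = [math.exp(a - m) for a in agg_logits]
    z = sum(exps)
    weights = [e / z for e in exps]
    # Weighted combination of the P output vectors.
    return [sum(w * out[i] for w, out in zip(weights, outputs))
            for i in range(len(outputs[0]))]

result = parscale_forward(0.5,
                          prefixes=[0.0, 0.1, -0.1],
                          agg_logits=[0.0, 0.0, 0.0])
print(result)
```

With equal aggregation logits the weights are uniform, so the result is just the average of the three parallel outputs; training would instead learn both the input transformations and the aggregation weights.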
476 stars. No commits in the last 6 months.
Use this if you need to enhance the performance and reasoning abilities of your LLMs, especially for tasks requiring deep understanding, while being mindful of computational resources like memory and inference time.
Not ideal if your primary goal is simply to scale model parameters or inference-time compute by traditional means, since ParScale introduces a new parallel-computation paradigm rather than extending those axes.
Stars
476
Forks
24
Language
Python
License
—
Category
—
Last pushed
May 17, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/QwenLM/ParScale"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jncraton/languagemodels
Explore large language models in 512MB of RAM
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haizelabs/verdict
Inference-time scaling for LLMs-as-a-judge.
albertan017/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models
bytedance/Sa2VA
Official Repo For Pixel-LLM Codebase