TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
This project offers an advanced large language model (LLM) designed to excel in complex reasoning tasks, particularly in mathematics and coding. It takes general text prompts or specific math problems as input and generates highly accurate solutions or detailed code. This tool is ideal for AI researchers and developers who are building or evaluating sophisticated language models and need state-of-the-art performance in logical and mathematical domains.
514 stars. No commits in the last 6 months.
Use this if you are a researcher or developer focused on building or fine-tuning advanced language models and need superior performance in mathematical reasoning and code generation.
Not ideal if you are looking for a general-purpose conversational AI or a tool for simple text generation tasks without a strong emphasis on complex logical deduction.
Stars: 514
Forks: 40
Language: Python
License: Apache-2.0
Category:
Last pushed: May 20, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/TencentARC/LLaMA-Pro"
Open to everyone: 100 requests/day with no key required. A free key raises the limit to 1,000 requests/day.
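The same endpoint can be called from Python using only the standard library. This is a minimal sketch: the URL segments mirror the curl example above, the parameter names (`ecosystem`, `owner`, `repo`) are illustrative, and the assumption that the endpoint returns a JSON body should be verified against an actual response.

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(ecosystem: str, owner: str, repo: str) -> str:
    """Build the quality-API URL, mirroring the curl example above.

    Parameter names are illustrative; "transformers" is the path
    segment shown in the curl example.
    """
    return f"{API_BASE}/{ecosystem}/{owner}/{repo}"


def fetch_quality(ecosystem: str, owner: str, repo: str) -> dict:
    """Fetch the endpoint and decode the body (assumed to be JSON)."""
    with urllib.request.urlopen(quality_url(ecosystem, owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Network call; subject to the 100 requests/day anonymous limit.
    data = fetch_quality("transformers", "TencentARC", "LLaMA-Pro")
    print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes the sketch easy to adapt if the key-based tier uses a header or query parameter for authentication.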
Higher-rated alternatives
ModelCloud/GPTQModel
LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD...
intel/auto-round
🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality...
pytorch/ao
PyTorch native quantization and sparsity for training and inference
bodaay/HuggingFaceModelDownloader
Simple go utility to download HuggingFace Models and Datasets
NVIDIA/kvpress
LLM KV cache compression made easy