LARK-AI-Lab/CodeScaler

The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"

Overall score: 23 / 100 (Experimental)

This tool helps developers who are training or using large language models for code generation tasks. It takes a coding problem description and candidate code solutions, then outputs a score indicating the quality of each solution. The primary users are AI/ML engineers and researchers working on code LLMs, who need to efficiently evaluate and improve their models.
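A minimal sketch of how such an execution-free reward model might be used for test-time best-of-n reranking, assuming it is published as a Hugging Face sequence-classification checkpoint. The model ID `LARK-AI-Lab/CodeScaler-RM` and the (problem, solution) input format are assumptions for illustration, not taken from the repo; consult its documentation for the actual interface.

# Best-of-n reranking sketch with a hypothetical CodeScaler reward checkpoint.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_ID = "LARK-AI-Lab/CodeScaler-RM"  # assumed checkpoint name, not confirmed by the repo

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID, num_labels=1)
model.eval()

def score(problem: str, solution: str) -> float:
    # Encode the (problem, solution) pair and read the scalar reward head.
    inputs = tokenizer(problem, solution, truncation=True, return_tensors="pt")
    with torch.no_grad():
        return model(**inputs).logits.squeeze().item()

problem = "Write a function fib(n) that returns the n-th Fibonacci number."
candidates = [
    "def fib(n):\n    return n",
    "def fib(n):\n    a, b = 0, 1\n    for _ in range(n):\n        a, b = b, a + b\n    return a",
]
# Pick the candidate the reward model ranks highest, without executing any tests.
best = max(candidates, key=lambda c: score(problem, c))
print(best)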

Use this if you need to score the quality of generated code solutions efficiently, without running time-consuming execution-based tests.

Not ideal if your primary goal is to run traditional unit tests for correctness on fully developed software, rather than evaluating AI-generated code.

code-generation-LLM AI-model-evaluation machine-learning-engineering reinforcement-learning-from-feedback
No license · No package · No dependents
Maintenance 13 / 25
Adoption 7 / 25
Maturity 3 / 25
Community 0 / 25
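The overall score appears to be the sum of the four subscores: 13 + 7 + 3 + 0 = 23 out of 100.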

Stars: 32
Forks:
Language: Python
License: none
Last pushed: Mar 26, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/LARK-AI-Lab/CodeScaler"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
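The same record can be fetched from Python; a small sketch using the requests library (the response schema is not documented on this page, so the JSON is printed as returned):

# Query the public quality API for this repo and print the returned JSON.
import requests

url = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/LARK-AI-Lab/CodeScaler"
resp = requests.get(url, timeout=30)
resp.raise_for_status()
print(resp.json())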