madaan/pie-perf

Training language models to make programs faster

/ 100

Emerging

This project offers a specialized dataset for machine learning engineers and researchers focused on program optimization. It provides pairs of code snippets (in Python and C++) for competitive programming problems, where one version is measurably faster than the other. The dataset includes rich metadata like CPU time, memory usage, and the percentage of performance improvement, allowing researchers to train and evaluate models that can automatically generate performance-improving code edits.

No commits in the last 6 months.

Use this if you are a machine learning researcher or engineer developing and evaluating AI models for code optimization and performance improvement, and you need a robust dataset with clear performance metrics.

Not ideal if you are looking for a tool that directly optimizes your code or provides ready-to-use performance improvements for your production systems.

program-optimization competitive-programming code-generation-research performance-engineering machine-learning-for-code

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

k4black/codebleu

Pip compatible CodeBLEU metric implementation available for linux/macos/win

LiveCodeBench/LiveCodeBench

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of...

EdinburghNLP/code-docstring-corpus

Preprocessed Python functions and docstrings for automated code documentation (code2doc) and...

hendrycks/apps

APPS: Automated Programming Progress Standard (NeurIPS 2021)

solis-team/Hydra

[FSE 2026] Do Not Treat Code as Natural Language: Implications for Repository-Level Code...

Explore AI Coding Tools

All categories Trending AI Coding directory Insights