OussamaSghaier/CuREV

Harnessing Large Language Models for Curated Code Reviews

/ 100

Emerging

When training AI models to assist with software development tasks like suggesting code improvements or generating feedback, the quality of the training data is crucial. This project provides a curated dataset of code review comments, designed to improve how well Large Language Models understand and generate helpful, relevant code review feedback. It takes raw code review data and processes it into a higher-quality format, primarily benefiting AI researchers and machine learning engineers working on code-related LLMs.

No commits in the last 6 months.

Use this if you are an AI researcher or machine learning engineer looking for a higher-quality dataset to train or fine-tune large language models for tasks involving code review comment generation or code refinement.

Not ideal if you are a software developer looking for a tool to automate your code reviews directly, as this is a dataset and framework for training models, not an end-user application.

AI-training-data natural-language-processing software-engineering-AI machine-learning-research code-review-automation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

sunmh207/AI-Codereview-Gitlab

基于大模型(DeepSeek,OpenAI等)的 GitLab 自动代码审查工具；支持钉钉/企业微信/飞书推送消息和生成日报；支持Docker部署；可视化 Dashboard。

codedog-ai/codedog

Code review assistant powered by LLM

gitleaks/gitleaks

Find secrets with Gitleaks 🔑

anc95/ChatGPT-CodeReview

🐥 A code review bot powered by ChatGPT

Nikita-Filonov/ai-review

🚀 AI-powered code review tool for GitHub, GitLab, Bitbucket Cloud, Bitbucket Server, Azure...

Explore LLM Tools

All categories Trending LLM Tool directory Insights