terryyz/DataAug4Code

Source Code Data Augmentation for Deep Learning: A Survey.

21
/ 100
Experimental

This is a curated list of research papers and their associated datasets focused on using data augmentation techniques for training deep learning models on source code. It helps researchers and practitioners explore different methods for tasks like identifying code authorship, detecting code clones, finding and fixing defects, or summarizing code. Anyone working on improving automated code analysis or generation with machine learning would find this useful.

No commits in the last 6 months.

Use this if you are a researcher or machine learning engineer looking for state-of-the-art data augmentation techniques and relevant papers to enhance your deep learning models for source code tasks.

Not ideal if you are looking for an off-the-shelf tool or library to directly apply data augmentation, as this is a survey of academic literature, not an implementation.

code-analysis software-engineering-research machine-learning-on-code code-quality developer-tools-research
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 5 / 25

How are scores calculated?

Stars

66

Forks

2

Language

License

Last pushed

Jun 15, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/terryyz/DataAug4Code"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.