EngineeringSoftware/time-segmented-evaluation

Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.

Overall score: 28 / 100 (Experimental)

This project helps researchers and practitioners in machine learning for code understand how the choice of dataset split affects the measured results of code summarization models. It takes in raw code and comments with timestamps, preprocesses them, and then trains and evaluates several machine learning models under each evaluation methodology. The output quantifies how each methodology affects reported model performance, helping users choose the most appropriate method for their research or application.

No commits in the last 6 months.

Use this if you are a researcher or ML engineer working on code summarization and want to rigorously test how different data splitting strategies (like time-segmented, mixed-project, or cross-project) influence your model's real-world applicability.

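For intuition, here is a minimal sketch of the time-segmented idea (illustrative only, not code from this repository; the samples and cutoff date are hypothetical):

import datetime

# Hypothetical (code, comment, commit-timestamp) samples.
samples = [
    ("def add(a, b): return a + b", "Add two numbers.", datetime.date(2019, 3, 1)),
    ("def sub(a, b): return a - b", "Subtract b from a.", datetime.date(2021, 7, 15)),
]

CUTOFF = datetime.date(2020, 1, 1)  # hypothetical training cutoff

# Time-segmented split: train only on samples committed before the cutoff
# and evaluate on samples committed after it, so the test set mimics code
# the model could not have seen at training time.
train = [s for s in samples if s[2] < CUTOFF]
test = [s for s in samples if s[2] >= CUTOFF]

Mixed-project and cross-project splits instead vary the grouping key: samples are shuffled across projects, or whole projects are held out for testing.
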
Not ideal if you are looking for a pre-trained code summarization model to use directly in an application without needing to perform deep methodological analysis.

code-summarization machine-learning-evaluation natural-language-processing-code software-engineering-research dataset-splitting
Badges: Stale (6 months) · No package · No dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 7 / 25

Stars: 10
Forks: 1
Language: Python
License: MIT
Last pushed: Sep 06, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/EngineeringSoftware/time-segmented-evaluation"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
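
If you prefer to call the API from Python, here is a minimal sketch using requests. Only the URL comes from the curl command above; the assumptions that the endpoint returns JSON and that a key is sent in an "X-Api-Key" header are guesses, so check the API docs for the actual scheme.

import requests

URL = ("https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/"
       "EngineeringSoftware/time-segmented-evaluation")

# Keyless access is limited to 100 requests/day; pass a key for 1,000/day.
# NOTE: the header name "X-Api-Key" is assumed for illustration.
headers = {}  # e.g. {"X-Api-Key": "YOUR_KEY"}

resp = requests.get(URL, headers=headers, timeout=10)
resp.raise_for_status()
print(resp.json())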