EngineeringSoftware/time-segmented-evaluation
Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.
This project helps researchers and practitioners in machine learning for code understand how different ways of splitting a dataset impact the results of code summarization models. It takes in raw code and comments with timestamps, processes them, and then trains and evaluates several machine learning models. The output shows how various evaluation methodologies affect model performance, helping users choose the most appropriate method for their research or application.
No commits in the last 6 months.
Use this if you are a researcher or ML engineer working on code summarization and want to rigorously test how different data splitting strategies (like time-segmented, mixed-project, or cross-project) influence your model's real-world applicability.
Not ideal if you are looking for a pre-trained code summarization model to use directly in an application without needing to perform deep methodological analysis.
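The three splitting strategies named above (mixed-project, cross-project, and time-segmented) can be illustrated with a small sketch. This is not the repository's implementation: the `Sample` record, the function names, the toy data, and the use of a plain integer year as the timestamp are all assumptions made for illustration, and the validation set is omitted for brevity.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """Hypothetical record: one code/comment pair with its project and timestamp."""
    project: str
    timestamp: int  # illustrative: year the sample became available
    code: str
    comment: str


def mixed_project_split(samples, test_ratio=0.25, seed=0):
    """Shuffle all samples and split them, ignoring project and time."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def cross_project_split(samples, test_projects):
    """Hold out whole projects so train and test share no project."""
    train = [s for s in samples if s.project not in test_projects]
    test = [s for s in samples if s.project in test_projects]
    return train, test


def time_segmented_split(samples, cutoff):
    """Train on samples timestamped before the cutoff, test on the rest."""
    train = [s for s in samples if s.timestamp < cutoff]
    test = [s for s in samples if s.timestamp >= cutoff]
    return train, test


# Toy data (assumed): two projects, one sample per project per year.
SAMPLES = [
    Sample(project, year, f"def f_{project}_{year}(): pass", f"summary {year}")
    for project in ("projA", "projB")
    for year in (2018, 2019, 2020, 2021)
]
```

For example, `time_segmented_split(SAMPLES, cutoff=2020)` keeps every training sample strictly older than every test sample, whereas `mixed_project_split` may place a 2021 sample in training and a 2018 sample from the same project in testing.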
Stars: 10
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Sep 06, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/EngineeringSoftware/time-segmented-evaluation"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
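The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `build_url` and `fetch_quality` are hypothetical, the path layout (category, then owner/repository) is inferred from the single URL shown above, and because the response format is not documented here the body is returned as raw text rather than parsed.

```python
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repository>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category, repo):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # quote() leaves "/" unescaped by default, so "owner/name" stays intact.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category, repo, timeout=10.0):
    """Send a keyless GET request and return the response body as text."""
    with urllib.request.urlopen(build_url(category, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `fetch_quality("ml-frameworks", "EngineeringSoftware/time-segmented-evaluation")` issues the same request as the curl command and counts against the same daily limit.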
Higher-rated alternatives
zhihu/cuBERT
Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
ThalesGroup/ConceptBERT
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
dimitreOliveira/bert-as-a-service_TFX
End-to-end pipeline with TFX to train and deploy a BERT model for sentiment analysis.
kpi6research/Bert-as-a-Library
Bert as a Library is a Tensorflow library for quick and easy training and finetuning of models...
Kvasirs/MILES
MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification...