iSEngLab/LLM4AG
[2025 TOSEM] Exploring Automated Assertion Generation via Large Language Models
This project helps software quality assurance engineers and researchers evaluate how well large language models can automatically generate code assertions. You feed in your code together with the expected assertions, and the toolkit measures each model's accuracy, reporting performance metrics that show which models are most effective at automated assertion generation.
No commits in the last 6 months.
Use this if you are a software quality assurance professional or researcher looking to benchmark large language models for generating accurate code assertions.
Not ideal if you are looking for an out-of-the-box tool to generate assertions for your production code without evaluating model performance.
Stars: 8
Forks: —
Language: Python
License: —
Category: —
Last pushed: Jul 02, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/iSEngLab/LLM4AG"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
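For scripted access, the same endpoint can be called from Python. The sketch below is a minimal example built only on the URL shown in the curl command; the response is assumed to be JSON, and its exact field names are not documented here, so they may differ. Authenticated use is omitted because the key-passing mechanism is not described on this page.

import json
import urllib.request

# Endpoint from the curl example above; no key needed for up to 100 requests/day.
URL = "https://pt-edge.onrender.com/api/v1/quality/transformers/iSEngLab/LLM4AG"

def fetch_repo_quality(url: str = URL) -> dict:
    """Fetch the quality record for a repository and return it as a dict.

    The response is assumed to be JSON; the field names (e.g. "stars",
    "last_pushed") are assumptions, not documented here.
    """
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)

if __name__ == "__main__":
    record = fetch_repo_quality()
    print(json.dumps(record, indent=2))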
Higher-rated alternatives
UBC-MDS/fixml
LLM Tool for effective test evaluation of ML projects with curated Checklists and LLM prompts
AstraBert/DebateLLM-Championship
5 LLMs, 1vs1 matches to produce the most convincing argumentation in favor or against a random...
brains-on-code/IterativeRefactoringLLM
Replication package, supplementary materials, and analysis pipeline for our paper on iterative...
JosephTLucas/llm_test
A suite of tests to verify bias, safety, trust, and security concerns for LLMs.
ash-jyc/db84llm
College policy debate as a verbal reasoning benchmark for LLMs