iSEngLab/LLM4AG
[2025 TOSEM] Exploring Automated Assertion Generation via Large Language Models
This project helps software quality assurance engineers and researchers evaluate how well large language models can automatically generate code assertions. You feed in your code together with the expected assertions, and the toolkit measures each model's accuracy, reporting performance metrics that show which models are most effective at automated assertion generation.
No commits in the last 6 months.
Use this if you are a software quality assurance professional or researcher looking to benchmark large language models for generating accurate code assertions.
Not ideal if you are looking for an out-of-the-box tool to generate assertions for your production code without evaluating model performance.
Stars: 8
Forks: —
Language: Python
License: —
Category: —
Last pushed: Jul 02, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/iSEngLab/LLM4AG"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
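For scripted access, the same endpoint can be called from Python. The sketch below is a minimal example built only on the URL shown in the curl command; the response is assumed to be JSON, and its exact field names are not documented here, so they may differ. Authenticated use is omitted because the key-passing mechanism is not described on this page.

import json
import urllib.request

# Endpoint from the curl example above; no key needed for up to 100 requests/day.
URL = "https://pt-edge.onrender.com/api/v1/quality/transformers/iSEngLab/LLM4AG"

def fetch_repo_quality(url: str = URL) -> dict:
    """Fetch the quality record for a repository and return it as a dict.

    The response is assumed to be JSON; the field names (e.g. "stars",
    "last_pushed") are assumptions, not documented here.
    """
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)

if __name__ == "__main__":
    record = fetch_repo_quality()
    print(json.dumps(record, indent=2))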
Higher-rated alternatives
UBC-MDS/fixml
LLM Tool for effective test evaluation of ML projects with curated Checklists and LLM prompts
AstraBert/DebateLLM-Championship
5 LLMs, 1vs1 matches to produce the most convincing argumentation in favor or against a random...
brains-on-code/IterativeRefactoringLLM
Replication package, supplementary materials, and analysis pipeline for our paper on iterative...
JosephTLucas/llm_test
A suite of tests to verify bias, safety, trust, and security concerns for LLMs.
ash-jyc/db84llm
College policy debate as a verbal reasoning benchmark for LLMs