RedHatResearch/conext24-NetConfEval

Benchmark for evaluating LLMs in network configuration problems.

40
/ 100
Emerging

This project helps network operations engineers evaluate how well large language models can assist with network configuration tasks. It takes high-level network requirements and assesses a model's ability to translate them into formal specifications, API calls, routing algorithms, or low-level device configurations. The output shows which models are most effective for different stages of network setup and management.

No commits in the last 6 months.

Use this if you are a network operations engineer, researcher, or architect looking to understand the current capabilities and limitations of large language models for automating or facilitating network configuration workflows.

Not ideal if you are looking for a ready-to-deploy, production-grade tool to directly configure your network using AI, as this is an evaluation benchmark.

network-operations network-engineering network-automation traffic-engineering network-management
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 17 / 25

How are scores calculated?

Stars

34

Forks

8

Language

Python

License

MIT

Last pushed

Mar 30, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/RedHatResearch/conext24-NetConfEval"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.