microsoft/NeMoEval
A Benchmark Tool for Natural Language-based Network Management
This tool helps network engineers and operations teams evaluate how well large language models can translate natural language into network management and analysis actions. You provide natural language queries about network traffic or lifecycle management, and it assesses the quality of the code or actions generated in response. The output helps you gauge how effective AI is for these network tasks.
No commits in the last 6 months.
Use this if you are a network engineer, network architect, or operations manager evaluating the potential of large language models to assist with network traffic analysis or network lifecycle management tasks.
Not ideal if you need a plug-and-play solution to directly manage a live network, as this is a benchmark and evaluation tool, not an operational one.
Stars: 29
Forks: 7
Language: Python
License: MIT
Category:
Last pushed: Jun 18, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/microsoft/NeMoEval"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
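For scripted access, the curl command above can also be issued from Python. This is a minimal sketch: the endpoint URL is taken from this page, but the shape of the JSON response is an assumption, so the example only builds the URL and decodes whatever JSON comes back.

```python
# Sketch of calling the catalog API from Python instead of curl.
# Only the endpoint URL is taken from this page; the response
# schema is not documented here and is treated as opaque JSON.
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality/nlp"

def api_url(owner: str, repo: str) -> str:
    """Build the per-repository endpoint URL."""
    return f"{BASE}/{owner}/{repo}"

def fetch_stats(owner: str, repo: str) -> dict:
    """Fetch and decode the JSON payload for one repository."""
    with urllib.request.urlopen(api_url(owner, repo)) as resp:
        return json.load(resp)

if __name__ == "__main__":
    print(api_url("microsoft", "NeMoEval"))
```

Within the free tier, this can be called up to 100 times a day without a key; heavier use needs the free API key mentioned above.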
Related tools
FudanSELab/ClassEval
Benchmark ClassEval for class-level code generation.
apartresearch/specificityplus
👩‍💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"
claws-lab/XLingEval
Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large...
HICAI-ZJU/SciKnowEval
SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
nicolay-r/RuSentRel-Leaderboard
This is an official Leaderboard for the RuSentRel-1.1 dataset originally described in paper...