usnistgov/KAIROS

Scoring and analysis software for the evaluation of Knowledge Directed Artificial Intelligence Reasoning Over Schemas (KAIROS)

/ 100

Emerging

This software suite helps researchers and participants in the KAIROS program evaluate the performance of Knowledge Directed Artificial Intelligence Reasoning Systems. It takes in system outputs (SDF files and LDC annotations) from TA1 and TA2 systems and produces detailed score reports and statistical spreadsheets. This tool is designed for KAIROS program participants who need to assess and compare their AI system's ability to extract and reason over complex knowledge graphs.

No commits in the last 6 months.

Use this if you are a KAIROS program participant needing to formally evaluate the output of your TA1 or TA2 AI systems against NIST's criteria.

Not ideal if you are looking for a general-purpose AI model evaluation tool or if you are not involved in the KAIROS research program.

AI-evaluation knowledge-reasoning program-assessment research-evaluation NIST-standards

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

nityansuman/marvin

Web app to automatically generate subjective or an objective test and evaluate user responses...

shibing624/judger

自动作文评分工具，支持中文、英文作文智能评分，支持评分模型自训练，支持WEKA处理模型数据，支持自定义评分算法。java开发。

shubhpawar/Automated-Essay-Scoring

Automated Essay Scoring on The Hewlett Foundation dataset on Kaggle

antrixsh/trusteval

Enterprise LLM Evaluation & Responsible AI Framework — Benchmark bias, hallucination, PII...

samiali12/debateai-server

A FastAPI-powered backend that manages structured debates, analyzes arguments, and generates...

Explore NLP Tools

All categories Trending NLP directory Insights