All AI Evaluation Tools

216 tools ranked by quality score

Showing 1–100 of 216
| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | DataDog/dd-trace-js | Datadog APM client for Node.js | 95 | Verified |
| 2 | lmnr-ai/lmnr | Laminar - open-source observability platform purpose-built for AI agents. YC S24. | 88 | Verified |
| 3 | mnfst/manifest | Smart LLM Routing for OpenClaw. Cut Costs up to 70% 🦞🦚 | 87 | Verified |
| 4 | open-telemetry/opentelemetry-rust | The Rust OpenTelemetry implementation | 81 | Verified |
| 5 | tokio-rs/tracing | Application level tracing for Rust. | 78 | Verified |
| 6 | DataDog/dd-trace-go | Datadog Go Library including APM tracing, profiling, and security monitoring. | 76 | Verified |
| 7 | pinpoint-apm/pinpoint | APM (Application Performance Management) tool for large-scale distributed systems. | 76 | Verified |
| 8 | DataDog/dd-trace-py | Datadog Python APM Client | 76 | Verified |
| 9 | open-telemetry/opentelemetry-go | OpenTelemetry Go API and SDK | 76 | Verified |
| 10 | jaegertracing/jaeger-ui | Web UI for Jaeger | 76 | Verified |
| 11 | DataDog/datadog-agent | Main repository for Datadog Agent | 76 | Verified |
| 12 | open-telemetry/opentelemetry-go-instrumentation | OpenTelemetry Auto Instrumentation using eBPF | 73 | Verified |
| 13 | opentracing-contrib/nginx-opentracing | NGINX plugin for OpenTracing | 73 | Verified |
| 14 | openzipkin/zipkin | Zipkin is a distributed tracing system | 73 | Verified |
| 15 | NVIDIA/garak | the LLM vulnerability scanner | 72 | Verified |
| 16 | winsiderss/systeminformer | A free, powerful, multi-purpose tool that helps you monitor system... | 72 | Verified |
| 17 | namhyung/uftrace | Function graph tracer for C/C++/Rust/Python | 72 | Verified |
| 18 | jaegertracing/jaeger | CNCF Jaeger, a Distributed Tracing Platform | 72 | Verified |
| 19 | autogluon/fev | Forecast evaluation library | 72 | Verified |
| 20 | confident-ai/deepeval | The LLM Evaluation Framework | 71 | Verified |
| 21 | inikep/lzbench | lzbench is an in-memory benchmark of open-source compressors | 71 | Verified |
| 22 | bpftrace/bpftrace | High-level tracing language for Linux | 71 | Verified |
| 23 | gofr-dev/gofr | An opinionated GoLang framework for accelerated microservice development.... | 70 | Verified |
| 24 | SigNoz/signoz | SigNoz is an open-source observability platform native to OpenTelemetry with... | 70 | Verified |
| 25 | GreptimeTeam/greptimedb | The open-source Observability 2.0 database. One engine for metrics, logs,... | 70 | Verified |
| 26 | libbpf/libbpf | Automated upstream mirror for libbpf stand-alone build. | 70 | Verified |
| 27 | iipeace/guider | The All-in-One System Profiling and Fault Detection Tool for Linux & Android | 70 | Verified |
| 28 | pydantic/logfire | AI observability platform for production LLM and agent systems. | 69 | Established |
| 29 | CodSpeedHQ/pytest-codspeed | A pytest plugin to create benchmarks | 69 | Established |
| 30 | dotnet/BenchmarkDotNet | Powerful .NET library for benchmarking | 69 | Established |
| 31 | CodSpeedHQ/codspeed-rust | Crates to benchmark your Rust code | 67 | Established |
| 32 | alibaba/loongsuite-go-agent | OpenTelemetry Compile-Time Instrumentation for Golang | 66 | Established |
| 33 | coroot/coroot | Coroot is an open-source observability and APM tool with AI-powered Root... | 66 | Established |
| 34 | flightlessmango/MangoHud | A Vulkan and OpenGL overlay for monitoring FPS, temperatures, CPU/GPU load and more. | 66 | Established |
| 35 | metrico/gigapipe | ⭐️ The Open-Source Polyglot Observability Warehouse: Light, Fast, Cloud... | 65 | Established |
| 36 | TPC-Council/HammerDB | HammerDB: The industry standard open-source database benchmark | 64 | Established |
| 37 | DataDog/dd-trace-java | Datadog APM client for Java | 64 | Established |
| 38 | DataDog/dd-trace-php | Datadog PHP Clients | 64 | Established |
| 39 | DataDog/dd-trace-rb | Datadog's client library for Ruby | 64 | Established |
| 40 | jaegertracing/helm-charts | Helm Charts for Jaeger backend | 64 | Established |
| 41 | DataDog/dd-sdk-ios | Datadog SDK for iOS - Swift and Objective-C. | 64 | Established |
| 42 | open-telemetry/opentelemetry-ruby-contrib | Contrib Packages for the OpenTelemetry Ruby API and SDK implementation. | 64 | Established |
| 43 | gogf/gf | A powerful framework for faster, easier, and more efficient project development. | 64 | Established |
| 44 | RafaelGSS/bench-node | A powerful Node.js benchmark library | 64 | Established |
| 45 | DataDog/dd-trace-dotnet | .NET Client Library for Datadog APM | 64 | Established |
| 46 | open-telemetry/opentelemetry-php | The OpenTelemetry PHP Library | 64 | Established |
| 47 | reframe-hpc/reframe | A powerful Python framework for writing and running portable regression... | 63 | Established |
| 48 | verifywise-ai/verifywise | Complete AI governance and LLM Evals platform with support for EU AI Act,... | 63 | Established |
| 49 | rabbitmq/rabbitmq-perf-test | A load testing tool | 63 | Established |
| 50 | oushujun/EDTA | Extensive de-novo TE Annotator | 63 | Established |
| 51 | nowsecure/fsmon | Filesystem monitor tool for Linux/Android iOS/macOS | 62 | Established |
| 52 | typelevel/natchez | functional tracing for cats | 62 | Established |
| 53 | cloudflare/ebpf_exporter | Prometheus exporter for custom eBPF metrics | 62 | Established |
| 54 | zio/zio-logging | Powerful logging for ZIO 2.0 applications, with compatibility with many... | 62 | Established |
| 55 | lttng/lttng-tools | The lttng-tools project provides a session daemon (lttng-sessiond) that acts... | 62 | Established |
| 56 | efficios/babeltrace | Babeltrace /ˈbæbəltreɪs/ is an open-source trace manipulation toolkit. | 62 | Established |
| 57 | huggingface/aisheets | Build, enrich, and transform datasets using AI models with no code | 61 | Established |
| 58 | typelevel/otel4s | An OpenTelemetry library for Scala based on Cats-Effect | 61 | Established |
| 59 | fastify/fastify-zipkin | Fastify plugin for Zipkin distributed tracing system. | 61 | Established |
| 60 | dash0hq/otelbin | Web-based tool to facilitate OpenTelemetry collector configuration editing... | 61 | Established |
| 61 | iand675/hs-opentelemetry | OpenTelemetry support for the Haskell programming language | 60 | Established |
| 62 | swift-otel/swift-otel | An OpenTelemetry Protocol (OTLP) backend for Swift Log, Swift Metrics, and... | 60 | Established |
| 63 | godotengine/godot-benchmarks | Collection of benchmarks to test performance of different areas of Godot | 60 | Established |
| 64 | cilium/pwru | Packet, where are you? -- eBPF-based Linux kernel networking debugger | 60 | Established |
| 65 | instana/go-sensor | 🚀 Go Distributed Tracing & Metrics Sensor for Instana | 60 | Established |
| 66 | signalfx/tracing-examples | Examples of using third-party tracers with SignalFx | 59 | Established |
| 67 | signalfx/splunk-otel-java | Splunk Distribution of OpenTelemetry Java | 59 | Established |
| 68 | instana/nodejs | Node.js in-process collectors for Instana | 59 | Established |
| 69 | team-decent/decent-bench | A benchmarking framework for decentralized optimization | 59 | Established |
| 70 | kieker-monitoring/kieker | Kieker is an observability framework that consists of a monitoring and... | 59 | Established |
| 71 | jonahsnider/benchmark | A Node.js benchmarking library with support for multithreading and TurboFan... | 59 | Established |
| 72 | dynatrace-oss/unguard | Unguard is an insecure cloud-native microservices demo application. | 59 | Established |
| 73 | instana/python-sensor | 🐍 Python Distributed Tracing & Metrics Sensor for Instana | 58 | Established |
| 74 | munich-quantum-toolkit/bench | MQT Bench - An MQT Tool for Benchmarking Quantum Software Tools | 58 | Established |
| 75 | ertgl/tapable-tracer | Trace the connections and flows between tapable hooks. | 58 | Established |
| 76 | uio-bmi/immuneML | immuneML is a platform for machine learning analysis of adaptive immune... | 58 | Established |
| 77 | ant-research/EasyTemporalPointProcess | EasyTPP: Towards Open Benchmarking Temporal Point Processes | 57 | Established |
| 78 | nhsengland/evalsense | Tools for systematic large language model evaluations | 57 | Established |
| 79 | instana/ruby-sensor | 💎 Ruby Distributed Tracing & Metrics Sensor for Instana | 56 | Established |
| 80 | atesgoral/hrm-solutions | Human Resource Machine solutions and size/speed hacks | 56 | Established |
| 81 | bamlab/flashlight | 📱⚡️ Lighthouse for Mobile - audits your app and gives a performance score to... | 55 | Established |
| 82 | ldbc/ldbc_snb_docs | Specification of the LDBC Social Network Benchmark suite | 55 | Established |
| 83 | aliesbelik/load-testing-toolkit | Collection of open-source tools for debugging, benchmarking, load and stress... | 54 | Established |
| 84 | unitaryfoundation/metriq-gym | metriq-gym is a framework for implementing and running standard quantum... | 54 | Established |
| 85 | ryncsn/memstrack | A memory allocation tracer combined with stack trace. | 54 | Established |
| 86 | GDATASoftwareAG/motornet | Motor.NET is a microservice framework based on Microsoft.Extensions.Hosting | 54 | Established |
| 87 | argonne-lcf/THAPI | A tracing infrastructure for heterogeneous computing applications. | 54 | Established |
| 88 | DataDog/nginx-datadog | Enhance NGINX Observability and Security with Datadog's Module | 54 | Established |
| 89 | bencheeorg/benchee | Easy and extensible benchmarking in Elixir providing you with lots of statistics! | 54 | Established |
| 90 | chirpz-ai/pandaprobe | 🐼 Open source agent engineering platform: traces, evals, and metrics to... | 54 | Established |
| 91 | jnidzwetzki/pg-lock-tracer | An eBPF based lock tracer for PostgreSQL | 54 | Established |
| 92 | cau-se/theodolite | Theodolite is a framework for benchmarking the horizontal and vertical... | 53 | Established |
| 93 | bencherdev/bencher | 🐰 Bencher - Continuous Benchmarking | 53 | Established |
| 94 | hendriknielaender/zBench | 📊 zig benchmark | 53 | Established |
| 95 | DataDog/dd-trace-cpp | Datadog APM client for C++ | 53 | Established |
| 96 | cmackenzie1/tracing-ndjson | A customizable NDJSON format for tracing in Rust | 53 | Established |
| 97 | prestodb/pbench | Presto/Prestissimo Benchmark Toolset | 53 | Established |
| 98 | elastic/elastic-otel-dotnet | Elastic OpenTelemetry .NET Distribution | 53 | Established |
| 99 | signalfx/splunk-otel-dotnet | Splunk Distribution of OpenTelemetry .NET | 52 | Established |
| 100 | FrankChen021/bithon | A full stack observability platform | 52 | Established |