Trending AI Evaluation Tools

Tools with the biggest quality score improvements over the last 3 days.

# Tool Change Score Tier
1 lmnr-ai/lmnr

Laminar - open-source observability platform purpose-built for AI agents. YC S24.

+31 88 Verified
2 DataDog/dd-trace-js

Datadog APM client for Node.js

+31 95 Verified
3 mnfst/manifest

Smart LLM Routing for OpenClaw. Cut Costs up to 70% 🦞🦚

+30 87 Verified
4 autogluon/fev

Forecast evaluation library

+20 72 Verified
5 RafaelGSS/bench-node

A powerful Node.js benchmark library

+19 64 Established
6 jonahsnider/benchmark

A Node.js benchmarking library with support for multithreading and TurboFan...

+19 59 Established
7 open-telemetry/opentelemetry-rust

The Rust OpenTelemetry implementation

+17 81 Verified
8 tokio-rs/tracing

Application level tracing for Rust.

+17 78 Verified
9 omniviser/omniray

Stop guessing! You and your AI can now see live what's happening inside your...

+16 43 Emerging
10 beorn/loggily

TypeScript logger with debug-style namespaces, structured JSON, and...

+16 42 Emerging
11 chirpz-ai/pandaprobe

🐼 Open source agent engineering platform: traces, evals, and metrics to...

+15 54 Established
12 yonatan-h/express-k6-profiler

Finds bottlenecks in an Express app during load testing

+15 34 Emerging
13 falcondev-oss/workflow

Simple type-safe queue worker with durable execution based on BullMQ.

+14 42 Emerging
14 ertgl/tapable-tracer

Trace the connections and flows between tapable hooks.

+14 58 Established
15 CodSpeedHQ/pytest-codspeed

A pytest plugin to create benchmarks

+13 69 Established
16 lucsorel/pydoctrace

Generate architecture diagrams by tracing Python code execution

+13 48 Emerging
17 fastify/fastify-zipkin

Fastify plugin for Zipkin distributed tracing system.

+12 61 Established
18 team-decent/decent-bench

A benchmarking framework for decentralized optimization

+12 59 Established
19 confident-ai/deepeval

The LLM Evaluation Framework

+12 71 Verified
20 DataDog/dd-trace-py

Datadog Python APM Client

+12 76 Verified
21 pinpoint-apm/pinpoint

APM, (Application Performance Management) tool for large-scale distributed systems.

+12 76 Verified
22 NVIDIA/garak

the LLM vulnerability scanner

+12 72 Verified
23 open-telemetry/opentelemetry-go

OpenTelemetry Go API and SDK

+12 76 Verified
24 GreptimeTeam/greptimedb

The open-source Observability 2.0 database. One engine for metrics, logs,...

+12 70 Verified
25 pydantic/logfire

AI observability platform for production LLM and agent systems.

+12 69 Established
26 DataDog/datadog-agent

Main repository for Datadog Agent

+12 76 Verified
27 jaegertracing/jaeger-ui

Web UI for Jaeger

+12 76 Verified
28 open-telemetry/opentelemetry-go-instrumentation

OpenTelemetry Auto Instrumentation using eBPF

+12 73 Verified
29 DataDog/dd-trace-go

Datadog Go Library including APM tracing, profiling, and security monitoring.

+12 76 Verified
30 winsiderss/systeminformer

A free, powerful, multi-purpose tool that helps you monitor system...

+12 72 Verified
31 SigNoz/signoz

SigNoz is an open-source observability platform native to OpenTelemetry with...

+12 70 Verified
32 jaegertracing/jaeger

CNCF Jaeger, a Distributed Tracing Platform

+12 72 Verified
33 gofr-dev/gofr

An opinionated GoLang framework for accelerated microservice development....

+12 70 Verified
34 dotnet/BenchmarkDotNet

Powerful .NET library for benchmarking

+10 69 Established
35 metrico/gigapipe

⭐️ The Open-Source Polyglot Observability Warehouse: Light, Fast, Cloud...

+10 65 Established
36 coroot/coroot

Coroot is an open-source observability and APM tool with AI-powered Root...

+10 66 Established
37 CodSpeedHQ/codspeed-rust

Crates to benchmark your Rust code

+10 67 Established
38 dash0hq/otelbin

Web-based tool to facilitate OpenTelemetry collector configuration editing...

+10 61 Established
39 bpftrace/bpftrace

High-level tracing language for Linux

+10 71 Verified
40 flightlessmango/MangoHud

A Vulkan and OpenGL overlay for monitoring FPS, temperatures, CPU/GPU load and more.

+10 66 Established
41 iipeace/guider

The All-in-One System Profiling and Fault Detection Tool for Linux & Android

+10 70 Verified
42 opentracing-contrib/nginx-opentracing

NGINX plugin for OpenTracing

+10 73 Verified
43 openzipkin/zipkin

Zipkin is a distributed tracing system

+10 73 Verified
44 namhyung/uftrace

Function graph tracer for C/C++/Rust/Python

+10 72 Verified
45 cmackenzie1/tracing-ndjson

A customizable NDJSON format for tracing in Rust

+9 53 Established
46 libbpf/libbpf

Automated upstream mirror for libbpf stand-alone build.

+7 70 Verified
47 alibaba/loongsuite-go-agent

OpenTelemetry Compile-Time Instrumentation for Golang

+7 66 Established
48 inikep/lzbench

lzbench is an in-memory benchmark of open-source compressors

+7 71 Verified
49 komoju/komoju-datadog

Rust Datadog instrumentation

+5 48 Emerging
50 huggingface/aisheets

Build, enrich, and transform datasets using AI models with no code

+3 61 Established
51 cloudflare/ebpf_exporter

Prometheus exporter for custom eBPF metrics

+3 62 Established
52 bamlab/flashlight

📱⚡️ Lighthouse for Mobile - audits your app and gives a performance score to...

+3 55 Established
53 gogf/gf

A powerful framework for faster, easier, and more efficient project development.

+3 64 Established
54 kolloch/reqray

Log call tree summaries after each request for rust programs instrumented...

+3 35 Emerging
55 cilium/pwru

Packet, where are you? -- eBPF-based Linux kernel networking debugger

+3 60 Established
56 tracewayapp/opentelemetry-symfony-bundle

Pure-PHP OpenTelemetry instrumentation for Symfony - automatic HTTP,...

+2 35 Emerging