Trending AI Evaluation Tools
Tools with the biggest quality score improvements over the last 3 days.
| # | Tool | Change | Score | Tier |
|---|---|---|---|---|
| 1 |
lmnr-ai/lmnr
Laminar - open-source observability platform purpose-built for AI agents. YC S24. |
+31 | 88 | Verified |
| 2 |
DataDog/dd-trace-js
Datadog APM client for Node.js |
+31 | 95 | Verified |
| 3 |
mnfst/manifest
Smart LLM Routing for OpenClaw. Cut Costs up to 70% 🦞🦚 |
+30 | 87 | Verified |
| 4 |
autogluon/fev
Forecast evaluation library |
+20 | 72 | Verified |
| 5 |
RafaelGSS/bench-node
A powerful Node.js benchmark library |
+19 | 64 | Established |
| 6 |
jonahsnider/benchmark
A Node.js benchmarking library with support for multithreading and TurboFan... |
+19 | 59 | Established |
| 7 |
open-telemetry/opentelemetry-rust
The Rust OpenTelemetry implementation |
+17 | 81 | Verified |
| 8 |
tokio-rs/tracing
Application level tracing for Rust. |
+17 | 78 | Verified |
| 9 |
omniviser/omniray
Stop guessing! You and your AI can now see live what's happening inside your... |
+16 | 43 | Emerging |
| 10 |
beorn/loggily
TypeScript logger with debug-style namespaces, structured JSON, and... |
+16 | 42 | Emerging |
| 11 |
chirpz-ai/pandaprobe
🐼 Open source agent engineering platform: traces, evals, and metrics to... |
+15 | 54 | Established |
| 12 |
yonatan-h/express-k6-profiler
Finds bottlenecks in an Express app during load testing |
+15 | 34 | Emerging |
| 13 |
falcondev-oss/workflow
Simple type-safe queue worker with durable execution based on BullMQ. |
+14 | 42 | Emerging |
| 14 |
ertgl/tapable-tracer
Trace the connections and flows between tapable hooks. |
+14 | 58 | Established |
| 15 |
CodSpeedHQ/pytest-codspeed
A pytest plugin to create benchmarks |
+13 | 69 | Established |
| 16 |
lucsorel/pydoctrace
Generate architecture diagrams by tracing Python code execution |
+13 | 48 | Emerging |
| 17 |
fastify/fastify-zipkin
Fastify plugin for Zipkin distributed tracing system. |
+12 | 61 | Established |
| 18 |
team-decent/decent-bench
A benchmarking framework for decentralized optimization |
+12 | 59 | Established |
| 19 |
confident-ai/deepeval
The LLM Evaluation Framework |
+12 | 71 | Verified |
| 20 |
DataDog/dd-trace-py
Datadog Python APM Client |
+12 | 76 | Verified |
| 21 |
pinpoint-apm/pinpoint
APM, (Application Performance Management) tool for large-scale distributed systems. |
+12 | 76 | Verified |
| 22 |
NVIDIA/garak
the LLM vulnerability scanner |
+12 | 72 | Verified |
| 23 |
open-telemetry/opentelemetry-go
OpenTelemetry Go API and SDK |
+12 | 76 | Verified |
| 24 |
GreptimeTeam/greptimedb
The open-source Observability 2.0 database. One engine for metrics, logs,... |
+12 | 70 | Verified |
| 25 |
pydantic/logfire
AI observability platform for production LLM and agent systems. |
+12 | 69 | Established |
| 26 |
DataDog/datadog-agent
Main repository for Datadog Agent |
+12 | 76 | Verified |
| 27 |
jaegertracing/jaeger-ui
Web UI for Jaeger |
+12 | 76 | Verified |
| 28 |
open-telemetry/opentelemetry-go-instrumentation
OpenTelemetry Auto Instrumentation using eBPF |
+12 | 73 | Verified |
| 29 |
DataDog/dd-trace-go
Datadog Go Library including APM tracing, profiling, and security monitoring. |
+12 | 76 | Verified |
| 30 |
winsiderss/systeminformer
A free, powerful, multi-purpose tool that helps you monitor system... |
+12 | 72 | Verified |
| 31 |
SigNoz/signoz
SigNoz is an open-source observability platform native to OpenTelemetry with... |
+12 | 70 | Verified |
| 32 |
jaegertracing/jaeger
CNCF Jaeger, a Distributed Tracing Platform |
+12 | 72 | Verified |
| 33 |
gofr-dev/gofr
An opinionated GoLang framework for accelerated microservice development.... |
+12 | 70 | Verified |
| 34 |
dotnet/BenchmarkDotNet
Powerful .NET library for benchmarking |
+10 | 69 | Established |
| 35 |
metrico/gigapipe
⭐️ The Open-Source Polyglot Observability Warehouse: Light, Fast, Cloud... |
+10 | 65 | Established |
| 36 |
coroot/coroot
Coroot is an open-source observability and APM tool with AI-powered Root... |
+10 | 66 | Established |
| 37 |
CodSpeedHQ/codspeed-rust
Crates to benchmark your Rust code |
+10 | 67 | Established |
| 38 |
dash0hq/otelbin
Web-based tool to facilitate OpenTelemetry collector configuration editing... |
+10 | 61 | Established |
| 39 |
bpftrace/bpftrace
High-level tracing language for Linux |
+10 | 71 | Verified |
| 40 |
flightlessmango/MangoHud
A Vulkan and OpenGL overlay for monitoring FPS, temperatures, CPU/GPU load and more. |
+10 | 66 | Established |
| 41 |
iipeace/guider
The All-in-One System Profiling and Fault Detection Tool for Linux & Android |
+10 | 70 | Verified |
| 42 |
opentracing-contrib/nginx-opentracing
NGINX plugin for OpenTracing |
+10 | 73 | Verified |
| 43 |
openzipkin/zipkin
Zipkin is a distributed tracing system |
+10 | 73 | Verified |
| 44 |
namhyung/uftrace
Function graph tracer for C/C++/Rust/Python |
+10 | 72 | Verified |
| 45 |
cmackenzie1/tracing-ndjson
A customizable NDJSON format for tracing in Rust |
+9 | 53 | Established |
| 46 |
libbpf/libbpf
Automated upstream mirror for libbpf stand-alone build. |
+7 | 70 | Verified |
| 47 |
alibaba/loongsuite-go-agent
OpenTelemetry Compile-Time Instrumentation for Golang |
+7 | 66 | Established |
| 48 |
inikep/lzbench
lzbench is an in-memory benchmark of open-source compressors |
+7 | 71 | Verified |
| 49 |
komoju/komoju-datadog
Rust Datadog instrumentation |
+5 | 48 | Emerging |
| 50 |
huggingface/aisheets
Build, enrich, and transform datasets using AI models with no code |
+3 | 61 | Established |
| 51 |
cloudflare/ebpf_exporter
Prometheus exporter for custom eBPF metrics |
+3 | 62 | Established |
| 52 |
bamlab/flashlight
📱⚡️ Lighthouse for Mobile - audits your app and gives a performance score to... |
+3 | 55 | Established |
| 53 |
gogf/gf
A powerful framework for faster, easier, and more efficient project development. |
+3 | 64 | Established |
| 54 |
kolloch/reqray
Log call tree summaries after each request for rust programs instrumented... |
+3 | 35 | Emerging |
| 55 |
cilium/pwru
Packet, where are you? -- eBPF-based Linux kernel networking debugger |
+3 | 60 | Established |
| 56 |
tracewayapp/opentelemetry-symfony-bundle
Pure-PHP OpenTelemetry instrumentation for Symfony - automatic HTTP,... |
+2 | 35 | Emerging |