Agenta-AI/agenta
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
Agenta helps product and engineering teams build reliable applications powered by Large Language Models (LLMs). It provides tools to refine prompts, test them against varied inputs and test cases, analyze the LLM's responses, and monitor behavior and performance metrics once deployed.
3,923 stars. Actively maintained with 322 commits in the last 30 days.
Use this if you are building LLM-powered applications and need a systematic way to manage prompts, evaluate model responses, and observe performance in production.
Not ideal if you are solely using LLMs for simple, one-off tasks and do not require iterative development, testing, or production monitoring.
Stars: 3,923
Forks: 492
Language: TypeScript
License: —
Category: —
Last pushed: Mar 13, 2026
Commits (30d): 322
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/Agenta-AI/agenta"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
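
For programmatic use, here is a minimal TypeScript sketch of calling the same endpoint. The response field names (stars, forks, commits_30d) are assumptions about the JSON shape, and the "x-api-key" header name is hypothetical; check the API docs for the actual contract.

// Minimal sketch: fetch this repo's quality metrics from the endpoint above.
// Field names and the "x-api-key" header are assumptions, not confirmed API details.

const ENDPOINT =
  "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/Agenta-AI/agenta";

async function fetchQuality(apiKey?: string): Promise<void> {
  const res = await fetch(ENDPOINT, {
    // Without a key you get 100 requests/day; a free key raises that to 1,000.
    headers: apiKey ? { "x-api-key": apiKey } : {},
  });
  if (!res.ok) {
    throw new Error(`Request failed: ${res.status} ${res.statusText}`);
  }
  const data = await res.json();
  // Assumed fields; adjust to the actual response shape.
  console.log(`Stars: ${data.stars}, Forks: ${data.forks}`);
  console.log(`Commits (30d): ${data.commits_30d}`);
}

fetchQuality().catch(console.error);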
Related tools
langfuse/langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management,...
Arize-ai/phoenix
AI Observability & Evaluation
Mirascope/mirascope
The LLM Anti-Framework
Helicone/helicone
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
algorithmicsuperintelligence/optillm
Optimizing inference proxy for LLMs