📖 The AI Tool Bible

LLM observability

Editorial picks for "best llm observability".

9 tools

All Evaluation

Helicone

Evaluation · Platform (any LLM)
8.3

Open-source LLM observability — one-line proxy install.

Freemium· Free 100k req/mo; Pro from $25/moobservabilitycost tracking

Arize AI

Evaluation · Multi-model

Enterprise observability and evaluation platform for LLM agents and generative AI applications.

Freemium· Free tier and OSS Phoenix; paid/enterprise tiers via salesllm-observabilityagent-evaluation

Athina AI

Evaluation · Multi-model

Collaborative LLM evaluation and observability platform for teams shipping AI features to production.

Freemium· Starter free (10k logs/mo); Pro & Enterprise customllm-evaluationprompt-management

Fiddler AI

Evaluation · Fiddler Centor (proprietary evaluators)

Enterprise AI observability and guardrails platform for monitoring agents, LLMs, and ML models in production.

Enterprise· Tiered plans; contact salesllm-observabilityagent-monitoring

Langfuse

Evaluation · Model-agnostic

Open-source LLM observability, prompt management, and evaluation in one platform.

Freemium· Free self-host & Hobby tier; Core $29/mo, Pro $199/mo, Enterprise $2,499/mollm-observabilityprompt-management

Maxim AI

Evaluation · Multi-model

End-to-end evaluation, simulation, and observability platform for shipping production-grade AI agents.

Freemium· Free tier; 14-day trial on paid plans; custom enterprise pricingagent-evaluationllm-observability

Opik

Evaluation · Multi-model

Open-source LLM observability and evaluation platform for debugging and monitoring AI agents in production.

Freemium· Free open-source self-host; free Cloud tier (no card); Enterprise contact salesllm-tracingagent-evaluation

Puzzlet AI

Agents · Multi-model

Git-native prompt management and observability platform for teams shipping LLM applications.

Freemiumprompt-managementllm-observability

Respan (formerly Keywords AI)

Evaluation · Multi-model (500+ via gateway)

LLM engineering platform combining a multi-model gateway with tracing, evals, and prompt management.

Freemium· Free tier; paid plans (pricing not public); enterprise on requestllm-observabilityprompt-management