Pisama
Multi-agent failure detection & self-healing platform
Pisama detects 22 failure modes in LLM agent systems, from infinite loops and hallucinations to coordination breakdowns and prompt injection. Built on the MAST taxonomy, Pisama provides production-grade observability for any multi-agent framework.
- Get started in 5 minutes -- Install Pisama, send your first trace, and see failure detection in action.
- API Reference -- Complete REST API documentation for traces, detections, healing, analytics, and integrations.
- Detection Reference -- Detailed documentation for all 22 failure mode detectors with accuracy benchmarks.
Why Pisama?
The problem
LLM agents fail silently. A coding agent loops for 40 minutes burning tokens. A research agent hallucinates citations. A customer support agent drifts from its persona. A multi-agent pipeline drops critical context between handoffs. These failures are invisible to standard monitoring tools.
The solution
Pisama provides purpose-built failure detection for AI agent systems:
| Capability | Description |
|---|---|
| 22 failure detectors | Covering planning, execution, verification, and cross-cutting concerns |
| Tiered detection | Cost-aware escalation from hash-based ($0.00) to LLM judge ($0.05) |
| Framework agnostic | Works with LangGraph, CrewAI, AutoGen, n8n, Dify, OpenClaw, Claude Code |
| Self-healing | Automated fix generation, approval workflows, and rollback capabilities |
| OTEL native | Built on OpenTelemetry with gen_ai.* semantic conventions |
| Production accuracy | 9 of the 22 detectors reach F1 >= 0.80, with continuous calibration |
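The tiered detection row can be illustrated with a minimal sketch. This is not Pisama's implementation: the function names (`hash_tier`, `detect_loop`, `stub_judge`), the repetition threshold, and the stubbed judge are assumptions made for illustration, and only the two cost figures come from the table. The idea is to run the free hash-based check first and pay for the LLM judge only when the cheap tier is inconclusive.

```python
from __future__ import annotations

import hashlib
from collections import Counter
from typing import Callable, Optional

# Cost figures taken from the capability table; everything else is hypothetical.
HASH_TIER_COST = 0.00
LLM_JUDGE_COST = 0.05


def hash_tier(steps: list[str], threshold: int = 3) -> Optional[bool]:
    """Cheap tier: flag a loop when an identical step occurs `threshold` times.

    Returns True when a loop is found, or None when the check is inconclusive
    (a hash comparison alone cannot prove that no loop exists).
    """
    digests = Counter(hashlib.sha256(s.encode("utf-8")).hexdigest() for s in steps)
    if digests and max(digests.values()) >= threshold:
        return True
    return None


def detect_loop(
    steps: list[str], judge: Callable[[list[str]], bool]
) -> tuple[bool, float]:
    """Run tiers in cost order and return (verdict, total cost spent)."""
    verdict = hash_tier(steps)
    if verdict is not None:
        return verdict, HASH_TIER_COST
    # Escalate only when the free tier could not decide.
    return judge(steps), HASH_TIER_COST + LLM_JUDGE_COST


def stub_judge(steps: list[str]) -> bool:
    """Placeholder for a real LLM judge call; always answers 'no loop'."""
    return False


if __name__ == "__main__":
    print(detect_loop(["search('x')"] * 3, stub_judge))          # (True, 0.0)
    print(detect_loop(["plan", "search", "write"], stub_judge))  # (False, 0.05)
```

In this sketch the repeated tool call is caught by the hash tier at no cost, while the non-repeating trace falls through to the judge and incurs the higher cost.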
Supported frameworks
- LangGraph -- State graph analysis, node-level detection
- n8n -- Workflow validation, AI node monitoring, webhook integration
- Dify -- App monitoring, workflow execution tracking
- CrewAI -- Agent role validation, task delegation analysis
- AutoGen -- Multi-agent conversation monitoring
- OpenClaw -- Session and agent tracking
- Claude Code -- Tool call capture and session analysis
Architecture at a glance
    Trace Sources          Pisama Platform                          Outputs
    --------------         ---------------                          -------
    LangGraph   ----+
    n8n         ----+--->  Ingestion  --->  Detection Engine  --->  Dashboard
    Dify        ----+      Pipeline         (22 detectors)          Alerts
    CrewAI      ----+      (OTEL/           Tiered escalation       Fix suggestions
    Claude Code ----+      webhook)         LLM Judge               Self-healing
                                                                    API / webhooks
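As a rough illustration of the flow in the diagram, the sketch below passes one span through an ingestion step, a list of detectors, and an output step. All names here (`ingest`, `run_detectors`, `token_budget_detector`, the 10,000-token budget) are hypothetical and do not reflect Pisama's API, and the toy detector is not claimed to be one of the 22. The `gen_ai.*` keys follow the OpenTelemetry GenAI semantic conventions.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Detection:
    detector: str
    message: str


def ingest(raw_span: dict) -> dict:
    """Ingestion step: keep only the gen_ai.* attributes of a raw span."""
    attrs = raw_span.get("attributes", {})
    return {k: v for k, v in attrs.items() if k.startswith("gen_ai.")}


def token_budget_detector(span: dict, budget: int = 10_000) -> Optional[Detection]:
    """Toy detector (hypothetical): flag spans whose token usage exceeds a budget."""
    used = span.get("gen_ai.usage.input_tokens", 0) + span.get(
        "gen_ai.usage.output_tokens", 0
    )
    if used > budget:
        return Detection("token_budget", f"{used} tokens exceeds budget of {budget}")
    return None


def run_detectors(
    span: dict, detectors: list[Callable[[dict], Optional[Detection]]]
) -> list[Detection]:
    """Detection step: run every detector and keep only the positive results."""
    results = (detector(span) for detector in detectors)
    return [r for r in results if r is not None]


# Example span as it might arrive from a trace source.
raw = {
    "name": "chat",
    "attributes": {
        "gen_ai.request.model": "example-model",
        "gen_ai.usage.input_tokens": 9000,
        "gen_ai.usage.output_tokens": 2500,
        "http.method": "POST",  # dropped by ingest(): not a gen_ai.* attribute
    },
}

alerts = run_detectors(ingest(raw), [token_budget_detector])

# Output step: here the alerts are simply printed.
for alert in alerts:
    print(f"{alert.detector}: {alert.message}")
```

Keeping detectors as plain callables over a normalized span means new checks can be added to the list without touching the ingestion or output steps.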
Quick links
- Installation -- Full setup guide
- Configuration -- Environment variables reference
- Failure Modes -- Understanding what Pisama detects
- Integrations -- Connect your agent framework
- Deployment -- Production deployment options
- Contributing -- Development setup and guidelines