Auto-Instrumentation¶

Add Pisama failure detection to any LLM application with one line of code. No trace construction, no SDK hooks, no configuration needed.

Install¶

pip install pisama-auto

Quick Start¶

import pisama_auto
pisama_auto.init()

# All subsequent LLM calls are automatically traced
import anthropic
client = anthropic.Anthropic()
response = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello"}],
)
# ^ This call is traced with OTEL gen_ai.* semantic conventions

How It Works¶

pisama_auto.init() does two things:

Sets up an OTEL tracer that exports spans to Pisama (or console for local debugging)
Monkey-patches LLM SDKs to wrap API calls with OTEL spans

Each LLM call emits a span with:

gen_ai.system — provider (anthropic, openai)
gen_ai.request.model — model name
gen_ai.usage.input_tokens / output_tokens / total_tokens — token counts
gen_ai.response.stop_reason — why the model stopped
Error attributes on failure

Supported Libraries¶

Library	Methods Patched	Status
`anthropic`	`messages.create()`, `messages.stream()`	GA
`openai`	`chat.completions.create()`	GA

Configuration¶

Export to Pisama Platform¶

pisama_auto.init(api_key="ps_...")

Or set the PISAMA_API_KEY environment variable:

export PISAMA_API_KEY=ps_your_key_here
python my_app.py

Custom Endpoint¶

pisama_auto.init(
    api_key="ps_...",
    endpoint="https://api.pisama.ai/api/v1/traces/ingest",
)

Or set PISAMA_ENDPOINT.

Local-Only (No Export)¶

pisama_auto.init()  # No API key — traces go to console only

Selective Patching¶

pisama_auto.init(auto_patch=False)  # Don't patch anything automatically

from pisama_auto.patches import patch
patch("anthropic")  # Only patch Anthropic SDK

Integration with Detection¶

Traces exported by pisama-auto flow through the standard Pisama detection pipeline:

Spans arrive at the OTEL ingestion endpoint
The OTEL parser extracts states from gen_ai.* attributes
Detectors analyze the trace for failures
Results appear in the dashboard

This works with all 20 core detectors and the full tiered detection pipeline.