← Traces

agent.run

trace_id: trace_00002g · tenant: acme

4 spans·5384msok

Waterfall

Each row is one span. Indent shows parent depth; the bar shows position within the trace.

agent.run
5383.8ms
Attributes (2)
{
  "feature": "support_bot",
  "user_id": "u_heavy_11"
}
anthropic.messages.create
1169.5ms
Attributes (10)
{
  "model": "claude-opus-4-7",
  "feature": "support_bot",
  "user_id": "u_heavy_11",
  "tokens_prompt": 270,
  "tokens_output": 317,
  "tokens_total": 587,
  "input": "Summarize the Q3 earnings report attached. Focus on revenue, margin, and guidance.",
  "output": "(answer to: Summarize the Q3 earnings report attache…)",
  "eval_scores": {
    "Hallucination": 0.8611808571033179
  },
  "hallucination_details": {
    "claims": [
      {
        "text": "Q3 revenue was $4.2B.",
        "verdict": "contradicted"
      },
      {
        "text": "YoY growth was 38%.",
        "verdict": "contradicted"
      },
      {
        "text": "Operating margin expanded to 27%.",
        "verdict": "unsupported"
      },
      {
        "text": "Net income grew faster than revenue.",
        "verdict": "supported"
      }
    ],
    "supported": 1,
    "contradicted": 2,
    "unsupported": 1,
    "total": 4,
    "score": 0.25
  }
}
anthropic.messages.create
1770.4ms
Attributes (10)
{
  "model": "claude-opus-4-7",
  "feature": "support_bot",
  "user_id": "u_heavy_11",
  "tokens_prompt": 387,
  "tokens_output": 224,
  "tokens_total": 611,
  "input": "Given the customer ticket below, draft a refund response that follows policy P-204.",
  "output": "(answer to: Given the customer ticket below, draft a…)",
  "eval_scores": {
    "Hallucination": 0.9871122930198908
  },
  "hallucination_details": {
    "claims": [
      {
        "text": "Day 1 plan fits under the $200/day soft cap.",
        "verdict": "supported"
      },
      {
        "text": "Belém Tower opens at 9am.",
        "verdict": "contradicted"
      },
      {
        "text": "Time Out Market lunch costs $18.",
        "verdict": "unsupported"
      },
      {
        "text": "Castelo dinner costs $42.",
        "verdict": "unsupported"
      }
    ],
    "supported": 1,
    "contradicted": 1,
    "unsupported": 2,
    "total": 4,
    "score": 0.25
  }
}
anthropic.messages.create
2360.0ms
Attributes (10)
{
  "model": "claude-opus-4-7",
  "feature": "support_bot",
  "user_id": "u_heavy_11",
  "tokens_prompt": 620,
  "tokens_output": 231,
  "tokens_total": 851,
  "input": "Translate the user manual section to Japanese, preserving the table structure.",
  "output": "(answer to: Translate the user manual section to Jap…)",
  "eval_scores": {
    "Hallucination": 0.9195683976169676
  },
  "hallucination_details": {
    "claims": [
      {
        "text": "Q3 revenue was $4.2B.",
        "verdict": "contradicted"
      },
      {
        "text": "YoY growth was 38%.",
        "verdict": "contradicted"
      },
      {
        "text": "Operating margin expanded to 27%.",
        "verdict": "unsupported"
      },
      {
        "text": "Net income grew faster than revenue.",
        "verdict": "supported"
      }
    ],
    "supported": 1,
    "contradicted": 2,
    "unsupported": 1,
    "total": 4,
    "score": 0.25
  }
}

Faithfulness · anthropic.messages.create0.25

1 supported · 2 contradicted · 1 unsupported of 4 claims

  • contradicted

    Q3 revenue was $4.2B.

  • contradicted

    YoY growth was 38%.

  • unsupported

    Operating margin expanded to 27%.

  • supported

    Net income grew faster than revenue.

Faithfulness · anthropic.messages.create0.25

1 supported · 1 contradicted · 2 unsupported of 4 claims

  • supported

    Day 1 plan fits under the $200/day soft cap.

  • contradicted

    Belém Tower opens at 9am.

  • unsupported

    Time Out Market lunch costs $18.

  • unsupported

    Castelo dinner costs $42.

Faithfulness · anthropic.messages.create0.25

1 supported · 2 contradicted · 1 unsupported of 4 claims

  • contradicted

    Q3 revenue was $4.2B.

  • contradicted

    YoY growth was 38%.

  • unsupported

    Operating margin expanded to 27%.

  • supported

    Net income grew faster than revenue.