← Traces

agent.run

trace_id: trace_000068 · tenant: acme

6 spans·5072msok

Waterfall

Each row is one span. Indent shows parent depth; the bar shows position within the trace.

agent.run
5051.4ms
Attributes (2)
{
  "feature": "moderation",
  "user_id": "u_heavy_49"
}
openai.chat.completions.create
1940.8ms
Attributes (10)
{
  "model": "gpt-4-mini",
  "feature": "moderation",
  "user_id": "u_heavy_49",
  "tokens_prompt": 158,
  "tokens_output": 80,
  "tokens_total": 238,
  "input": "Summarize the Q3 earnings report attached. Focus on revenue, margin, and guidance.",
  "output": "(answer to: Summarize the Q3 earnings report attache…)",
  "eval_scores": {
    "Hallucination": 0.6146426439285279
  },
  "hallucination_details": {
    "claims": [
      {
        "text": "Policy P-204(b) applies to this request.",
        "verdict": "supported"
      },
      {
        "text": "The refund amount is $189.99.",
        "verdict": "contradicted"
      },
      {
        "text": "The card on file ends in 4421.",
        "verdict": "unsupported"
      },
      {
        "text": "Refunds settle within 3 business days.",
        "verdict": "supported"
      }
    ],
    "supported": 2,
    "contradicted": 1,
    "unsupported": 1,
    "total": 4,
    "score": 0.5
  }
}
openai.chat.completions.create
2352.1ms
Attributes (10)
{
  "model": "gpt-4-mini",
  "feature": "moderation",
  "user_id": "u_heavy_49",
  "tokens_prompt": 86,
  "tokens_output": 38,
  "tokens_total": 124,
  "input": "Given the customer ticket below, draft a refund response that follows policy P-204.",
  "output": "(answer to: Given the customer ticket below, draft a…)",
  "eval_scores": {
    "Hallucination": 0.5712026035413146
  },
  "hallucination_details": {
    "claims": [
      {
        "text": "Q3 revenue was $4.2B.",
        "verdict": "contradicted"
      },
      {
        "text": "YoY growth was 38%.",
        "verdict": "contradicted"
      },
      {
        "text": "Operating margin expanded to 27%.",
        "verdict": "unsupported"
      },
      {
        "text": "Net income grew faster than revenue.",
        "verdict": "supported"
      }
    ],
    "supported": 1,
    "contradicted": 2,
    "unsupported": 1,
    "total": 4,
    "score": 0.25
  }
}
tool.search
223.8ms
Attributes (3)
{
  "tool": "search",
  "feature": "moderation",
  "user_id": "u_heavy_49"
}
tool.code_exec
195.1ms
Attributes (3)
{
  "tool": "code_exec",
  "feature": "moderation",
  "user_id": "u_heavy_49"
}
tool.calculator
239.7ms
Attributes (3)
{
  "tool": "calculator",
  "feature": "moderation",
  "user_id": "u_heavy_49"
}

Faithfulness · openai.chat.completions.create0.50

2 supported · 1 contradicted · 1 unsupported of 4 claims

  • supported

    Policy P-204(b) applies to this request.

  • contradicted

    The refund amount is $189.99.

  • unsupported

    The card on file ends in 4421.

  • supported

    Refunds settle within 3 business days.

Faithfulness · openai.chat.completions.create0.25

1 supported · 2 contradicted · 1 unsupported of 4 claims

  • contradicted

    Q3 revenue was $4.2B.

  • contradicted

    YoY growth was 38%.

  • unsupported

    Operating margin expanded to 27%.

  • supported

    Net income grew faster than revenue.