
agent.run

trace_id: trace_000020 · tenant: soylent

4 spans · 3321ms · ok

Waterfall

Each row is one span. Indent shows parent depth; the bar shows position within the trace.
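
The row layout described above can be sketched in a few lines of Python. This is a minimal illustration, not the viewer's actual rendering code: the `Span` class, its `start_ms` field, and the `render_row` helper are hypothetical names, and the trace shown here lists only durations, so per-span start offsets would have to come from the underlying span data.

```python
from dataclasses import dataclass


@dataclass
class Span:
    name: str
    depth: int          # 0 for the root span, 1 for its children, and so on
    start_ms: float     # offset from the trace start (assumed field, not shown in this view)
    duration_ms: float


def render_row(span: Span, trace_ms: float, width: int = 40) -> str:
    """Render one waterfall row: an indented name plus a positioned bar."""
    # Scale the span's start and length onto a fixed-width character track.
    left = min(int(span.start_ms / trace_ms * width), width - 1)
    length = max(1, round(span.duration_ms / trace_ms * width))
    length = min(length, width - left)  # never draw past the end of the track
    bar = " " * left + "#" * length + " " * (width - left - length)
    # Two spaces of indent per level of parent depth.
    label = "  " * span.depth + span.name
    return f"{label:<36}|{bar}| {span.duration_ms:.1f}ms"


# The root span starts at offset 0 and fills the whole track.
print(render_row(Span("agent.run", 0, 0.0, 3320.6), trace_ms=3320.6))
```

Child rows would be rendered the same way with `depth=1`; their bars shift right by however far into the trace each call started.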

agent.run
3320.6ms
Attributes (2)
{
  "feature": "moderation",
  "user_id": "u_heavy_2"
}
openai.chat.completions.create
1055.9ms
Attributes (10)
{
  "model": "gpt-4-mini",
  "feature": "moderation",
  "user_id": "u_heavy_2",
  "tokens_prompt": 100,
  "tokens_output": 79,
  "tokens_total": 179,
  "input": "Summarize the Q3 earnings report attached. Focus on revenue, margin, and guidance.",
  "output": "(answer to: Summarize the Q3 earnings report attache…)",
  "eval_scores": {
    "Hallucination": 0.9537539148470388
  },
  "hallucination_details": {
    "claims": [
      {
        "text": "Day 1 plan fits under the $200/day soft cap.",
        "verdict": "supported"
      },
      {
        "text": "Belém Tower opens at 9am.",
        "verdict": "contradicted"
      },
      {
        "text": "Time Out Market lunch costs $18.",
        "verdict": "unsupported"
      },
      {
        "text": "Castelo dinner costs $42.",
        "verdict": "unsupported"
      }
    ],
    "supported": 1,
    "contradicted": 1,
    "unsupported": 2,
    "total": 4,
    "score": 0.25
  }
}
openai.chat.completions.create
526.8ms
Attributes (9)
{
  "model": "gpt-4-mini",
  "feature": "moderation",
  "user_id": "u_heavy_2",
  "tokens_prompt": 198,
  "tokens_output": 88,
  "tokens_total": 286,
  "input": "Given the customer ticket below, draft a refund response that follows policy P-204.",
  "output": "(answer to: Given the customer ticket below, draft a…)",
  "eval_scores": {
    "Hallucination": 0.9174977915594354
  }
}
openai.chat.completions.create
1649.8ms
Attributes (9)
{
  "model": "gpt-4-mini",
  "feature": "moderation",
  "user_id": "u_heavy_2",
  "tokens_prompt": 291,
  "tokens_output": 73,
  "tokens_total": 364,
  "input": "Translate the user manual section to Japanese, preserving the table structure.",
  "output": "(answer to: Translate the user manual section to Jap…)",
  "eval_scores": {
    "Hallucination": 0.9494923364603892
  }
}

Faithfulness · openai.chat.completions.create · 0.25

1 supported · 1 contradicted · 2 unsupported of 4 claims

  • supported

    Day 1 plan fits under the $200/day soft cap.

  • contradicted

    Belém Tower opens at 9am.

  • unsupported

    Time Out Market lunch costs $18.

  • unsupported

    Castelo dinner costs $42.
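
The 0.25 score is consistent with counting supported claims over all claims (1 of 4). The trace does not state the formula, so the sketch below assumes `score = supported / total`; the `faithfulness` function is a hypothetical helper, while the claim texts and verdicts are copied from the `hallucination_details` attribute above.

```python
from collections import Counter


def faithfulness(claims):
    """Tally verdicts and score the share of supported claims.

    Assumption: score = supported / total. This matches the numbers in
    this trace but is not confirmed by the source.
    """
    counts = Counter(claim["verdict"] for claim in claims)
    total = len(claims)
    return {
        "supported": counts["supported"],
        "contradicted": counts["contradicted"],
        "unsupported": counts["unsupported"],
        "total": total,
        "score": counts["supported"] / total if total else 0.0,
    }


claims = [
    {"text": "Day 1 plan fits under the $200/day soft cap.", "verdict": "supported"},
    {"text": "Belém Tower opens at 9am.", "verdict": "contradicted"},
    {"text": "Time Out Market lunch costs $18.", "verdict": "unsupported"},
    {"text": "Castelo dinner costs $42.", "verdict": "unsupported"},
]

result = faithfulness(claims)
print(result["score"])  # 0.25
```

Under this assumption, contradicted and unsupported claims both count against the score equally; a scorer that penalized contradictions more heavily would give a different number for the same verdicts.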