Waterfall
Each row is one span. Indent shows parent depth; the bar shows position within the trace.
agent.run
5383.8ms
Attributes (2)
{
"feature": "support_bot",
"user_id": "u_heavy_11"
}anthropic.messages.create
1169.5ms
Attributes (10)
{
"model": "claude-opus-4-7",
"feature": "support_bot",
"user_id": "u_heavy_11",
"tokens_prompt": 270,
"tokens_output": 317,
"tokens_total": 587,
"input": "Summarize the Q3 earnings report attached. Focus on revenue, margin, and guidance.",
"output": "(answer to: Summarize the Q3 earnings report attache…)",
"eval_scores": {
"Hallucination": 0.8611808571033179
},
"hallucination_details": {
"claims": [
{
"text": "Q3 revenue was $4.2B.",
"verdict": "contradicted"
},
{
"text": "YoY growth was 38%.",
"verdict": "contradicted"
},
{
"text": "Operating margin expanded to 27%.",
"verdict": "unsupported"
},
{
"text": "Net income grew faster than revenue.",
"verdict": "supported"
}
],
"supported": 1,
"contradicted": 2,
"unsupported": 1,
"total": 4,
"score": 0.25
}
}anthropic.messages.create
1770.4ms
Attributes (10)
{
"model": "claude-opus-4-7",
"feature": "support_bot",
"user_id": "u_heavy_11",
"tokens_prompt": 387,
"tokens_output": 224,
"tokens_total": 611,
"input": "Given the customer ticket below, draft a refund response that follows policy P-204.",
"output": "(answer to: Given the customer ticket below, draft a…)",
"eval_scores": {
"Hallucination": 0.9871122930198908
},
"hallucination_details": {
"claims": [
{
"text": "Day 1 plan fits under the $200/day soft cap.",
"verdict": "supported"
},
{
"text": "Belém Tower opens at 9am.",
"verdict": "contradicted"
},
{
"text": "Time Out Market lunch costs $18.",
"verdict": "unsupported"
},
{
"text": "Castelo dinner costs $42.",
"verdict": "unsupported"
}
],
"supported": 1,
"contradicted": 1,
"unsupported": 2,
"total": 4,
"score": 0.25
}
}anthropic.messages.create
2360.0ms
Attributes (10)
{
"model": "claude-opus-4-7",
"feature": "support_bot",
"user_id": "u_heavy_11",
"tokens_prompt": 620,
"tokens_output": 231,
"tokens_total": 851,
"input": "Translate the user manual section to Japanese, preserving the table structure.",
"output": "(answer to: Translate the user manual section to Jap…)",
"eval_scores": {
"Hallucination": 0.9195683976169676
},
"hallucination_details": {
"claims": [
{
"text": "Q3 revenue was $4.2B.",
"verdict": "contradicted"
},
{
"text": "YoY growth was 38%.",
"verdict": "contradicted"
},
{
"text": "Operating margin expanded to 27%.",
"verdict": "unsupported"
},
{
"text": "Net income grew faster than revenue.",
"verdict": "supported"
}
],
"supported": 1,
"contradicted": 2,
"unsupported": 1,
"total": 4,
"score": 0.25
}
}Faithfulness · anthropic.messages.create0.25
1 supported · 2 contradicted · 1 unsupported of 4 claims
- contradicted
Q3 revenue was $4.2B.
- contradicted
YoY growth was 38%.
- unsupported
Operating margin expanded to 27%.
- supported
Net income grew faster than revenue.
Faithfulness · anthropic.messages.create0.25
1 supported · 1 contradicted · 2 unsupported of 4 claims
- supported
Day 1 plan fits under the $200/day soft cap.
- contradicted
Belém Tower opens at 9am.
- unsupported
Time Out Market lunch costs $18.
- unsupported
Castelo dinner costs $42.
Faithfulness · anthropic.messages.create0.25
1 supported · 2 contradicted · 1 unsupported of 4 claims
- contradicted
Q3 revenue was $4.2B.
- contradicted
YoY growth was 38%.
- unsupported
Operating margin expanded to 27%.
- supported
Net income grew faster than revenue.