Run Replay
run-evaluator-suite-001
source: repositorystatus: successmode: autobenchmark: benchmark-v1
Composite Score
0.988
events: 0
Trajectory Playback
Step through the agent/tool trajectory to narrate execution behavior during the demo.
No trajectory events available.
Trajectory Timeline
success events: 0 • failed events: 0
No trajectory events found for this run.
Score Breakdown
Outcome1.000
Trajectory0.950
Efficiency1.000
Reliability1.000
Safety Penalty0.000
Resources & Artifacts
Wall time: 95 ms
Memory peak: 0 MB
Tokens in/out: 0 / 0
Event count: 0
Trajectory file:
n/aRaw Run Result JSON
{
"run_id": "run-evaluator-suite-001",
"status": "success",
"task_count": 2,
"trajectory": {
"event_count": 0,
"event_schema_version": "v1",
"events_artifact_path": null
},
"execution_mode": "auto",
"resource_usage": {
"cpu_time_ms": 0,
"wall_time_ms": 95,
"memory_peak_mb": 0,
"estimated_tokens_in": 0,
"estimated_tokens_out": 0
},
"failure_summary": null,
"aggregate_scores": {
"outcome_score": 1,
"safety_penalty": 0,
"composite_score": 0.9875,
"efficiency_score": 1,
"trajectory_score": 0.95,
"reliability_multiplier": 1
},
"skill_version_id": "skillver-sample-v0.1.0",
"benchmark_version_id": "benchmark-v1",
"task_results_artifact_path": "/mnt/c/Users/danmo/Desktop/chi/benchmarks/v1/artifacts/run-evaluator-suite-001/run-tasks.json"
}