Control Panel
Runs / run-evaluator-suite-001
Run Replay

run-evaluator-suite-001

source: repositorystatus: successmode: autobenchmark: benchmark-v1
Composite Score
0.988
events: 0
Trajectory Playback
Step through the agent/tool trajectory to narrate execution behavior during the demo.
No trajectory events available.

Trajectory Timeline

success events: 0 • failed events: 0
No trajectory events found for this run.

Score Breakdown

Outcome1.000
Trajectory0.950
Efficiency1.000
Reliability1.000
Safety Penalty0.000

Resources & Artifacts

Wall time: 95 ms
Memory peak: 0 MB
Tokens in/out: 0 / 0
Event count: 0
Trajectory file:
n/a
Raw Run Result JSON
{
  "run_id": "run-evaluator-suite-001",
  "status": "success",
  "task_count": 2,
  "trajectory": {
    "event_count": 0,
    "event_schema_version": "v1",
    "events_artifact_path": null
  },
  "execution_mode": "auto",
  "resource_usage": {
    "cpu_time_ms": 0,
    "wall_time_ms": 95,
    "memory_peak_mb": 0,
    "estimated_tokens_in": 0,
    "estimated_tokens_out": 0
  },
  "failure_summary": null,
  "aggregate_scores": {
    "outcome_score": 1,
    "safety_penalty": 0,
    "composite_score": 0.9875,
    "efficiency_score": 1,
    "trajectory_score": 0.95,
    "reliability_multiplier": 1
  },
  "skill_version_id": "skillver-sample-v0.1.0",
  "benchmark_version_id": "benchmark-v1",
  "task_results_artifact_path": "/mnt/c/Users/danmo/Desktop/chi/benchmarks/v1/artifacts/run-evaluator-suite-001/run-tasks.json"
}