Run Replay

run-evaluator-suite-001

source: repositorystatus: successmode: autobenchmark: benchmark-v1

Composite Score

0.988

events: 0

Back to runs•Artifacts API JSON

Trajectory Playback

Step through the agent/tool trajectory to narrate execution behavior during the demo.

No trajectory events available.

Trajectory Timeline

success events: 0 • failed events: 0

No trajectory events found for this run.

Score Breakdown

Outcome1.000

Trajectory0.950

Efficiency1.000

Reliability1.000

Safety Penalty0.000

Resources & Artifacts

Wall time: 95 ms

Memory peak: 0 MB

Tokens in/out: 0 / 0

Event count: 0

Trajectory file:

n/a

Raw Run Result JSON

{
  "run_id": "run-evaluator-suite-001",
  "skill_version_id": "skillver-sample-v0.1.0",
  "benchmark_version_id": "benchmark-v1",
  "status": "success",
  "aggregate_scores": {
    "outcome_score": 1,
    "trajectory_score": 0.95,
    "efficiency_score": 1,
    "reliability_multiplier": 1,
    "safety_penalty": 0,
    "composite_score": 0.9875
  },
  "resource_usage": {
    "wall_time_ms": 95,
    "cpu_time_ms": 0,
    "memory_peak_mb": 0,
    "estimated_tokens_in": 0,
    "estimated_tokens_out": 0
  },
  "failure_summary": null,
  "trajectory": {
    "event_schema_version": "v1",
    "event_count": 0,
    "events_artifact_path": null
  },
  "execution_mode": "auto",
  "task_count": 2,
  "task_results_artifact_path": "/mnt/c/Users/danmo/Desktop/chi/benchmarks/v1/artifacts/run-evaluator-suite-001/run-tasks.json"
}