Phase 1 Workflow

Human Re-Annotation Platform

Review the original AgentDojo trace, complete the propagation fields in a survey flow, and export an annotation CSV that stays easy to analyze later.

Session Snapshot

Your progress is saved to your account each time you click Save. Use Export to download a local copy.

1 Dataset Status

180-sample annotation dataset loaded automatically.

2 Trace Coverage

All 180 trace files are bundled with the app and load on demand.

3 Export Naming

Pick a filename for your annotation pass. The app preserves the original CSV schema and writes the human-annotated fields back into it.
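For readers who want to reproduce the export outside the app, here is a minimal sketch of a schema-preserving write-back. It is not the app's actual code. The column names (`sample_id`, `human_label`, `human_notes`) and the function `export_annotations` are hypothetical; substitute whatever columns the loaded dataset uses.

```python
import csv
import io

# Hypothetical human-field names; the real schema comes from the loaded dataset.
HUMAN_FIELDS = ("human_label", "human_notes")


def export_annotations(source_csv: str,
                       annotations: dict[str, dict[str, str]],
                       id_column: str = "sample_id") -> str:
    """Return CSV text with the same columns, in the same order, as source_csv."""
    reader = csv.DictReader(io.StringIO(source_csv))
    fieldnames = list(reader.fieldnames or [])

    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()

    for row in reader:
        human = annotations.get(row[id_column], {})
        for field in HUMAN_FIELDS:
            # Overwrite only human fields that exist in the schema and were filled in.
            if field in fieldnames and field in human:
                row[field] = human[field]
        writer.writerow(row)

    return out.getvalue()
```

Because the header is copied from the source file and no columns are added, a downstream analysis script that reads the original dataset should be able to read the exported file unchanged.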

Trace Summary

The benchmark labels and machine candidate are visible, but the original trace remains the primary evidence source.

Machine candidate reference

Original Trace

The timeline below is built from the raw run JSON. It shows injections, messages, tool calls, tool outputs, and the exact text that human annotators are reviewing.
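As a rough illustration of how such a timeline could be derived, the sketch below flattens a run JSON into an ordered list of events. The layout it expects is an assumption, not a documented format: a top-level `injections` mapping, a `messages` list whose entries carry `role`, string `content`, and optional `tool_calls` with `function` and `args`, and tool messages that carry a `tool_call` entry. The function name `build_timeline` is hypothetical.

```python
import json


def build_timeline(raw_json: str) -> list[dict]:
    """Flatten a run JSON into ordered timeline events (assumed layout)."""
    run = json.loads(raw_json)
    events = []

    # Assumption: "injections" maps an injection name to the injected text.
    for name, text in (run.get("injections") or {}).items():
        events.append({"kind": "injection", "label": name, "text": text})

    for message in run.get("messages") or []:
        role = message.get("role", "unknown")
        if role == "tool":
            # Assumption: tool messages record which call produced them.
            call = message.get("tool_call") or {}
            events.append({
                "kind": "tool_output",
                "label": call.get("function", "tool"),
                "text": message.get("content") or "",
            })
            continue

        if message.get("content"):
            events.append({"kind": "message", "label": role,
                           "text": message["content"]})
        for call in message.get("tool_calls") or []:
            events.append({
                "kind": "tool_call",
                "label": call.get("function", "unknown"),
                "text": json.dumps(call.get("args", {}), sort_keys=True),
            })

    return events
```

Each event keeps the original text verbatim, so the timeline displays the same text the raw JSON contains.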

Raw JSON trace