MemGUI-Bench MemGUI-Bench

Trajectory comparison

MemGUI-Bench Arena

Compare two trajectory-enabled agents task by task, inspect pass/fail buckets, and open their step traces side by side.

VS
Loading trajectory bundles...

Outcome Matrix

Tasks

Select two agents to compare their trajectories.