three platforms, one clock
Three small agents stand on three separate platforms. Each has its own button. From far away, the task looks easy: move to the button and press it. But the team does not win when one agent succeeds. It wins only when at least two buttons are pressed at the same time.
without a cooperation layer
Each agent can learn to reach its own button. That still does not solve the task. One arrives too early. One is still crossing. One has already released. Every action can make sense locally, while the team still fails globally.
with a cooperation layer
Each agent still sees only its own small world. But now, something about the team reaches it.
Not everything. Not raw data.
Just enough to change the next move.
One agent slows down.
Another holds near the button.
Another closes the gap.
For the first time,
their separate movements line up.
without a cooperation layer
The task was not reaching the buttons.
The task was reaching the moment together.
Not centralized control, not hand coded waiting. A learned layer
that turns local policies into coordinated timing.