Policy Gradient Landscape

Reward Regime

Binary mode exposes the good/bad split, common-refinement cells, and the induced switching graph.

Display

Direction-Space Geometry

Dynamics In Parameter Space

Feature-Space Overlays

Drift vectors are the cell-wise directions that define the switching map.

Cells and the induced graph require binary good/bad rewards.

Simulation


White circle = (draggable)

Actions: drag on canvas

Presets