Bellman Equation: Value Iteration

Watch the Bellman update propagate values through a 3×4 grid world

Click "Next Step" to run the first Bellman backup iteration
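
For reference, one Bellman backup can be written as below. This is the standard textbook form, not necessarily the exact convention this demo uses: the transition model P, reward R, and discount factor γ are assumed notation, since the page does not state them. The second line is one common way to define the "Max ΔV" readout.

```latex
V_{k+1}(s) = \max_{a} \sum_{s'} P(s' \mid s, a)\,\bigl[ R(s, a, s') + \gamma\, V_k(s') \bigr]

\Delta_k = \max_{s} \bigl| V_{k+1}(s) - V_k(s) \bigr|
```

Iteration usually stops once \Delta_k falls below a small tolerance.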

Grid World — State Values V(s)

Iteration: 0  |  Max ΔV: (not yet computed)  |  Not started
Legend:
High value (positive)
Near zero
Low value (negative)
Wall (impassable)
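
The propagation shown in the grid can be sketched in a few lines of Python. This is a minimal sketch, not the demo's actual code. The page does not give its layout or parameters, so everything here is an assumption chosen to resemble the classic textbook grid: a wall at (1, 1), terminal cells worth +1 at (0, 3) and -1 at (1, 3), deterministic moves, a step reward of -0.04, and a discount of 0.9. All names (`WALL`, `TERMINALS`, `bellman_sweep`, `value_iteration`) are hypothetical.

```python
# Assumed layout (not given on the page): 3 rows x 4 columns,
# one wall, a +1 terminal and a -1 terminal.
ROWS, COLS = 3, 4
WALL = {(1, 1)}
TERMINALS = {(0, 3): 1.0, (1, 3): -1.0}
STEP_REWARD = -0.04  # assumed cost per move
GAMMA = 0.9          # assumed discount factor
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def move(state, action):
    """Deterministic move; hitting the wall or an edge leaves the agent in place."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < ROWS and 0 <= c < COLS) or (r, c) in WALL:
        return state
    return (r, c)


def bellman_sweep(V):
    """One synchronous Bellman backup over all states.

    Returns the new value table and the largest absolute change (max |dV|).
    """
    new_V = dict(V)
    max_delta = 0.0
    for s in V:
        if s in TERMINALS:
            continue  # terminal values stay fixed at their rewards
        new_V[s] = max(STEP_REWARD + GAMMA * V[move(s, a)] for a in ACTIONS)
        max_delta = max(max_delta, abs(new_V[s] - V[s]))
    return new_V, max_delta


def value_iteration(tol=1e-6, max_iters=1000):
    """Repeat sweeps until the largest change drops below tol."""
    V = {(r, c): 0.0 for r in range(ROWS) for c in range(COLS)
         if (r, c) not in WALL}
    V.update(TERMINALS)
    for k in range(1, max_iters + 1):
        V, max_delta = bellman_sweep(V)
        if max_delta < tol:
            break
    return V, k


if __name__ == "__main__":
    V, sweeps = value_iteration()
    print(f"converged after {sweeps} sweeps")
    for r in range(ROWS):
        print("  ".join(
            " wall " if (r, c) in WALL else f"{V[(r, c)]:+.3f}"
            for c in range(COLS)))
```

Each call to `bellman_sweep` corresponds to one press of "Next Step": values start at zero everywhere except the terminals, and each sweep pushes the terminal rewards one cell further into the grid. A synchronous sweep (reading only the old table) is used here for clarity; an in-place update would also converge, though the intermediate values would differ.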