Comic Series: Markov Decision Processes with High Rewards
I guess we would want to jump into the hole.
Imagine yourself in a grid-like world with two end goals (absorbing states), what would you do? Would you reach for the drumstick or drop into the hole?