Sample environment
Transitions between states are probabilistic, and are represented as a Markov Decision Process.