Part 1: On the Markov Decision Model which forms the theoretical foundation of reinforcement learning problems