If the only difference between two mdps is the value of the discount factor then they must have the same optimal policy? Explain your reasoning.