On the convergence of the average expected return in dynamic programming