Reinforcement Learning is Direct Adaptive Optimal Control