On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference (Extended Abstract)