Model-Based Reinforcement Learning Variable Impedance Control for Human-Robot Collaboration