Learning to Optimize Control Policies and Evaluate Reproduction Performance from Human Demonstrations