2P1-S-022 Learning of Decision Making at Free Kicks Using Policy Gradient Methods (RoboCup 2, Mega-Integration in Robotics and Mechatronics to Assist Our Daily Lives)