"Think Before You Speak": Improving Multi-Action Dialog Policy by Planning Single-Action Dialogs