Distilling Knowledge Learned in BERT for Text Generation