Appendix for Data Diversification: A Simple Strategy For Neural Machine Translation