The Paradox of the Compositionality of Natural Language: A Neural Machine Translation Case Study