Conference on Empirical Methods in Natural Language Processing, Fourth Workshop on Vision and Language