Conditions for vocabulary acquisition in multi-modal and multilingual environments