Visuo-auditory Multimodal Emotional Structure to Improve Human-Robot-Interaction