Dimension-based Attention in Learning and Understanding Spoken Language