Detecting Highlighted Video Clips Through Emotion-Enhanced Audio-Visual Cues