Speech Driven Talking Face Generation From a Single Image and an Emotion Condition