Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset