Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration