Federated Transfer Learning with Multimodal Data