Real Estate Attribute Prediction from Multiple Visual Modalities with Missing Data