Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living