AutoEncoder-Driven Multimodal Collaborative Learning for Medical Image Synthesis