Variational Hetero-Encoder Randomized Generative Adversarial Networks for Joint Image-Text Modeling