Heterogeneous Dual-Task Clustering with Visual-Textual Information