A Survey on Self-Supervised Learning Approaches for Improving Multimodal Representation Learning