A simple, efficient and scalable contrastive masked autoencoder for learning visual representations