Entropy-driven Unsupervised Keypoint Representation Learning in Videos