Self-supervised network distillation: An effective approach to exploration in sparse reward environments