StepFormer: Self-Supervised Step Discovery and Localization in Instructional Videos