Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models