Toward Understanding the Importance of Noise in Training Neural Networks