THE CONVERGENCE OF STOCHASTIC GRADIENT ALGORITHMS APPLIED TO LEARNING IN NEURAL NETWORKS