Chapter 2 LEARNING RATE ADAPTATION IN STOCHASTIC GRADIENT DESCENT