A model of online learning as a Linear Quadratic Gaussian (LQG) optimal control problem with random matrices