Experiments With Scalable Gradient-based Hyperparameter Optimization for Deep Neural Networks by