Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent