Fix your classifier: the marginal value of training the last weight layer