论文信息 - Training trajectories, mini-batch losses and the curious role of the learning rate - 字舞流文

Training trajectories, mini-batch losses and the curious role of the learning rate

M. Sandler | A. Zhmoginov | Max Vladymyrov | Nolan Miller