Recycling: Semi-Supervised Learning With Noisy Labels in Deep Neural Networks

Nowadays, as noisy labels, also known as erroneous labels, are becoming increasingly ubiquitous in daily life, it is of great importance to be able to learn from them. This paper efficiently uses semi-supervised learning to highlight a method for utilizing all training data. The proposed algorithm comprises two stages: In the first stage, a small portion of training data regarded as clean samples is selected by the small-loss criteria with a proposed rejection ratio in a mini-batch manner. The proposed rejection ratio is determined by the number of classes regardless of the noise ratio. In the second stage, semi-supervised learning is used to update the parameters of the network with selected clean samples (labeled data) and the remaining noisy samples (unlabeled data) using the Renyi entropy regularization for low-density separation among classes. The proposed algorithm is believed to have the capability to train deep neural network robustly to noisy labels, although noise ratio is unknown. The experimental results considering the MNIST and CIFAR-10 datasets confirm that the proposed algorithm achieves the best accuracy among sample-selection methods.