Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks

Class imbalance occurs in many real-world applications, including image classification, where the number of images in each class differs significantly. When trained on imbalanced data, generative adversarial networks (GANs) lean toward majority-class samples. Two recent methods, Balancing GAN (BAGAN) and improved BAGAN (BAGAN-GP), were proposed as augmentation tools to handle this problem and restore balance to the data. The former pre-trains the autoencoder weights in an unsupervised manner, but it is unstable when images from different categories have similar features. The latter improves on BAGAN by enabling supervised autoencoder training, but its pre-training is biased toward the majority classes. In this work, we propose a novel Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks (CAPGAN) as an augmentation tool to generate realistic synthetic images. In particular, we use a conditional convolutional variational autoencoder with supervised and balanced pre-training to initialize the GAN, which is then trained with a gradient penalty. Our proposed method outperforms other state-of-the-art methods on highly imbalanced versions of MNIST, Fashion-MNIST, CIFAR-10, and two medical imaging datasets. It synthesizes high-quality minority-class samples as measured by Fréchet inception distance, structural similarity index measure, and perceptual quality.
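The abstract does not say how the pre-training is balanced. One common way to keep an autoencoder from favoring majority classes is to draw every batch with equal class probability, and the sketch below illustrates that idea only. The function name `balanced_batch_indices`, the toy 950/50 label split, and the batch size are assumptions made for illustration, not details taken from CAPGAN.

```python
import numpy as np

def balanced_batch_indices(labels, batch_size, rng):
    """Draw sample indices so that every class is equally likely,
    regardless of how many samples it has (hypothetical helper)."""
    labels = np.asarray(labels)
    classes = np.unique(labels)
    # Pick a class uniformly at random for every slot in the batch.
    chosen = rng.choice(classes, size=batch_size)
    # For each chosen class, pick one of its samples uniformly,
    # with replacement, so minority samples are reused as needed.
    return np.array([rng.choice(np.flatnonzero(labels == c)) for c in chosen])

# Toy imbalanced label vector (assumed): 950 majority, 50 minority samples.
rng = np.random.default_rng(0)
labels = np.array([0] * 950 + [1] * 50)

idx = balanced_batch_indices(labels, batch_size=2000, rng=rng)
minority_share = float(np.mean(labels[idx] == 1))
print(f"minority share in the sampled batch: {minority_share:.2f}")
```

In the raw data the minority class makes up 5% of the samples, while a batch drawn this way contains roughly equal shares of both classes. Sampling with replacement repeats minority images, so in practice it would usually be combined with augmentation or a similar safeguard against overfitting.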
