Paper Information - Style-Neutralized Pattern Classification Based on Adversarially Trained Upgraded U-Net

Traditional machine learning approaches usually assume that the data used for model training and the data encountered in real applications are independent and identically distributed (i.i.d.). However, several related research topics have shown that this condition does not always hold in real scenarios. One particular case arises when patterns carry diverse and changeable style information. This paper introduces a novel classification framework, the Style Neutralization Generative Adversarial Classifier (SN-GAC), which is based on an upgraded U-Net architecture and trained adversarially within the Generative Adversarial Network (GAN) framework, to perform classification on such disparate and inconsistent data. The generative model in SN-GAC neutralizes style information in the original style-discriminative patterns (style-source) by learning a mapping from them to their style-free counterparts (the corresponding standard examples, standard-target). A well-trained generator in the SN-GAC framework can produce the targeted style-neutralized data (generated-target), which satisfy the i.i.d. condition. Additionally, SN-GAC is trained adversarially: an independent discriminator monitors and supervises the training of the generator by distinguishing between real and generated data. At the same time, an auxiliary classifier embedded in the discriminator assigns the correct class label to both the real and the generated data. This process proves effective in helping the generator produce high-quality, human-readable, style-neutralized patterns. It is then further fine-tuned to improve the final classification performance. Extensive experiments demonstrate the effectiveness of the proposed SN-GAC framework: it outperforms several relevant state-of-the-art baselines on two empirical data sets in the non-i.i.d. data classification task.
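The roles described in the abstract (a generator mapping style-source to generated-target, and a discriminator with an embedded auxiliary classifier) can be illustrated with a minimal loss sketch. This is not the paper's objective: the abstract does not state the loss functions, so the binary cross-entropy adversarial term, the softmax cross-entropy class term, the L1 distance to the standard-target, the `l1_weight` parameter, and all function names below are assumptions chosen for illustration.

```python
import math

def binary_cross_entropy(p, target):
    """BCE for a single probability p against a 0/1 target (assumed adversarial loss)."""
    eps = 1e-12
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(target * math.log(p) + (1 - target) * math.log(1.0 - p))

def softmax_cross_entropy(logits, label):
    """Cross-entropy of a softmax over `logits` against an integer class label."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def l1_distance(a, b):
    """Mean absolute difference between two flattened images (assumed reconstruction term)."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

def discriminator_loss(p_real, p_fake, logits_real, logits_fake, label):
    """Real-vs-generated term plus the auxiliary class term on both real and generated data."""
    adv = binary_cross_entropy(p_real, 1) + binary_cross_entropy(p_fake, 0)
    cls = softmax_cross_entropy(logits_real, label) + softmax_cross_entropy(logits_fake, label)
    return adv + cls

def generator_loss(p_fake, logits_fake, label, generated, standard, l1_weight=1.0):
    """Fool the discriminator, preserve the class, and stay close to the standard-target."""
    adv = binary_cross_entropy(p_fake, 1)
    cls = softmax_cross_entropy(logits_fake, label)
    rec = l1_distance(generated, standard)
    return adv + cls + l1_weight * rec
```

Here `p_real` and `p_fake` stand for the discriminator's probability that a standard-target or a generated-target is real, and `logits_real` and `logits_fake` for the auxiliary classifier's outputs on the same inputs. In a full training loop these values would come from the U-Net generator and the discriminator network, which are omitted here.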
