Deep learning for multi-label scene classification

Scene classification is an important topic in computer vision. When weather conditions look similar, extracting discriminative features from outdoor images is difficult. In this thesis, I present a novel approach to classifying cloudy and sunny weather images. Inspired by recent work on deep convolutional neural networks and spatial pyramid matching, I build a model based on the ImageNet dataset. Starting with parameters learned from other classification tasks, I fine-tune the model using outdoor images. Experiments demonstrate that the classifier can achieve state-of-the-art accuracy.

Multi-label learning is a variant of supervised learning in which the task is to predict a set of labels for each example, since an example can belong to multiple classes. It generalises the popular multi-class classification problem, in which each sample has exactly one class label. It applies to a wide range of tasks, including text categorisation and semantic image labelling. Much research has addressed multi-label learning with different approaches. For the multi-label task, I train a neural network from scratch on artificially generated images. The model is learned by minimising an error function based on the Hamming distance, optimised through backpropagation. The model shows a high capability for generalisation.
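The abstract does not spell out the exact form of the Hamming-based error function, so the following is only a minimal sketch under stated assumptions. It shows the standard Hamming loss on binary label vectors, plus one possible differentiable relaxation (mean absolute difference between target labels and predicted probabilities) that could be minimised with backpropagation. The function names `hamming_loss`, `soft_hamming_loss` and `threshold`, the 0.5 cutoff, and the toy data are all illustrative assumptions, not taken from the thesis.

```python
from typing import List, Sequence


def threshold(probs: Sequence[Sequence[float]], cutoff: float = 0.5) -> List[List[int]]:
    """Turn per-label probabilities into hard 0/1 label predictions.

    The 0.5 cutoff is an assumption made for this sketch.
    """
    return [[1 if p >= cutoff else 0 for p in row] for row in probs]


def hamming_loss(y_true: Sequence[Sequence[int]], y_pred: Sequence[Sequence[int]]) -> float:
    """Fraction of (example, label) slots where prediction and truth disagree."""
    mismatches = 0
    total = 0
    for t_row, p_row in zip(y_true, y_pred):
        if len(t_row) != len(p_row):
            raise ValueError("label vectors must have the same length")
        for t, p in zip(t_row, p_row):
            mismatches += int(t != p)
            total += 1
    if total == 0:
        raise ValueError("no labels to compare")
    return mismatches / total


def soft_hamming_loss(y_true: Sequence[Sequence[int]], probs: Sequence[Sequence[float]]) -> float:
    """Relaxed Hamming loss: mean |target - probability| over all label slots.

    Unlike the hard version, this varies smoothly with the probabilities,
    so a gradient-based optimiser can use it. It is one possible surrogate,
    assumed here for illustration.
    """
    diff_sum = 0.0
    total = 0
    for t_row, p_row in zip(y_true, probs):
        if len(t_row) != len(p_row):
            raise ValueError("label vectors must have the same length")
        for t, p in zip(t_row, p_row):
            diff_sum += abs(t - p)
            total += 1
    if total == 0:
        raise ValueError("no labels to compare")
    return diff_sum / total


# Toy data: two images, three candidate labels each (hypothetical values).
y_true = [[1, 0, 1], [0, 1, 0]]
probs = [[0.9, 0.2, 0.4], [0.1, 0.8, 0.7]]

hard = hamming_loss(y_true, threshold(probs))
soft = soft_hamming_loss(y_true, probs)
print(f"hard Hamming loss: {hard:.3f}, relaxed loss: {soft:.3f}")
```

On this toy data, two of the six label slots are wrong after thresholding, so the hard loss is 2/6. The relaxed version also penalises the predictions that are correct but not confident.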
