Scene Semantic Recognition Based on Modified Fuzzy C-Mean and Maximum Entropy Using Object-to-Object Relations

With advances in machine vision systems (e.g., artificial eye, unmanned aerial vehicles, surveillance monitoring) scene semantic recognition (SSR) technology has attracted much attention due to its related applications such as autonomous driving, tourist navigation, intelligent traffic and remote aerial sensing. Although tremendous progress has been made in visual interpretation, several challenges remain (i.e., dynamic backgrounds, occlusion, lack of labeled data, changes in illumination, direction, and size). Therefore, we have proposed a novel SSR framework that intelligently segments the locations of objects, generates a novel Bag of Features, and recognizes scenes via Maximum Entropy. First, denoising and smoothing are applied on scene data. Second, modified Fuzzy C-Means integrates with super-pixels and Random Forest for the segmentation of objects. Third, these segmented objects are used to extract a novel Bag of Features that concatenate different blobs, multiple orientations, Fourier transform and geometrical points over the objects. An Artificial Neural Network recognizes the multiple objects using the different patterns of objects. Finally, labels are estimated via Maximum Entropy model. During experimental evaluation, our proposed system illustrated a remarkable mean accuracy rate of 90.07% over the MSRC dataset and 89.26% over the Caltech 101 for object recognition, and 93.53% over the Pascal-VOC12 dataset for scene recognition, respectively. The proposed system should be applicable to various emerging technologies, such as augmented reality, to represent the real-world environment for military training and engineering design, as well as for entertainment, artificial eyes for visually impaired people and traffic monitoring to avoid congestion or road accidents.

[1]  Renaud Dubé,et al.  SegMatch: Segment based place recognition in 3D point clouds , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Hong Huo,et al.  Global-Local Attention Network for Aerial Scene Classification , 2019, IEEE Access.

[3]  Ahmad Jalal,et al.  Region and Decision Tree-Based Segmentations for Multi-Objects Detection and Classification in Outdoor Scenes , 2019, 2019 International Conference on Frontiers of Information Technology (FIT).

[4]  Kibum Kim,et al.  Automatic Recognition of Human Interaction via Hybrid Descriptors and Maximum Entropy Markov Model Using Depth Sensors , 2020, Entropy.

[5]  Xinhang Song,et al.  Scene Recognition With Prototype-Agnostic Scene Layout , 2019, IEEE Transactions on Image Processing.

[6]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[8]  Lin Xie,et al.  Improved spatial pyramid matching for scene recognition , 2018, Pattern Recognit..

[9]  Min Guo,et al.  Multi-value image segmentation based on FCM algorithm and Graph Cut Theory , 2016, 2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[10]  Ahmad Jalal,et al.  Scene Understanding and Recognition: Statistical Segmented Model using Geometrical Features and Gaussian Naïve Bayes , 2019, 2019 International Conference on Applied and Engineering Mathematics (ICAEM).

[11]  Yu-Chiang Frank Wang,et al.  Spot and Learn: A Maximum-Entropy Patch Sampler for Few-Shot Image Classification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Bingbing Ni,et al.  HCP: A Flexible CNN Framework for Multi-Label Image Classification , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Yong Zhang,et al.  Image Segmentation Based on FCM with Mahalanobis Distance , 2010, ICICA.

[14]  Hui Wei,et al.  Bag of contour fragments for improvement of object segmentation , 2019, Applied Intelligence.

[15]  Charles W. Lee,et al.  Training Feedforward Neural Networks: An Algorithm Giving Improved Generalization , 1997, Neural Networks.

[16]  Arun Kumar Sangaiah,et al.  Object Tracking in Vary Lighting Conditions for Fog Based Intelligent Surveillance of Public Spaces , 2018, IEEE Access.

[17]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[18]  Ahmad Jalal,et al.  Wearable Inertial Sensors for Daily Activity Analysis Based on Adam Optimization and the Maximum Entropy Markov Model , 2020, Entropy.

[19]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[20]  Balakrishna Gokaraju,et al.  Gabor filter based entropy and energy features for basic scene recognition , 2016, 2016 IEEE Applied Imagery Pattern Recognition Workshop (AIPR).

[21]  Zhuowen Tu,et al.  Deep FisherNet for Image Classification , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Kandarpa Kumar Sarma,et al.  Artificial Neural Network (ANN) Based Object Recognition Using Multiple Feature Sets , 2012, Soft Computing Techniques in Vision Science.

[23]  Lee Jiann-Der,et al.  Object recognition using a neural network with optimal feature extraction , 1997 .

[24]  Fei-Fei Li,et al.  What, where and who? Classifying events by scene and object recognition , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[25]  Horst Bischof,et al.  Interactive Texture Segmentation using Random Forests and Total Variation , 2009, BMVC.

[26]  Qunchao Mi,et al.  A sound-based video clipping framework toward sports scenes , 2018, J. Vis. Commun. Image Represent..

[27]  Luca Benini,et al.  An Event-Driven Ultra-Low-Power Smart Visual Sensor , 2016, IEEE Sensors Journal.

[28]  Hamza Abdellahoum,et al.  CSFCM: An improved fuzzy C-Means image segmentation algorithm using a cooperative approach , 2021, Expert Syst. Appl..

[29]  Yuqing Hou,et al.  Efficient Kidney Segmentation in Micro-CT Based on Multi-Atlas Registration and Random Forests , 2018, IEEE Access.

[30]  Bin Liu,et al.  Image classification method rationally utilizing spatial information of the image , 2019, Multimedia Tools and Applications.

[31]  Xiao Liu,et al.  Random Geometric Prior Forest for Multiclass Object Segmentation , 2015, IEEE Transactions on Image Processing.

[32]  Cécile Barat,et al.  Spatial histograms of soft pairwise similar patches to improve the bag-of-visual-words model , 2015, Comput. Vis. Image Underst..

[33]  Mohsen Guizani,et al.  A Spark-Based Parallel Fuzzy $c$ -Means Segmentation Algorithm for Agricultural Image Big Data , 2019, IEEE Access.

[34]  Senxiang Lu,et al.  An Accurate Object Detector With Effective Feature Extraction by Intrinsic Prior Knowledge , 2020, IEEE Access.

[35]  Mohammed Affan Zidan,et al.  A Novel Autonomous Perceptron Model for Pattern Classification Applications , 2019, Entropy.

[36]  Qiong Ran,et al.  Multi-focus Color Image Fusion using Laplacian Filter and Discrete Fourier Transformation with Qualitative Error Image Metrics , 2019, ICCCV 2019.

[37]  Samy Bengio,et al.  Learning semantic relationships for better action retrieval in images , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Nanning Zheng,et al.  Learning to Detect a Salient Object , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Mircea-Florin Vaida,et al.  Random forest feature selection approach for image segmentation , 2017, International Conference on Machine Vision.

[40]  Qing Li,et al.  Improving Image Classification Accuracy With ELM and CSIFT , 2019, Computing in Science & Engineering.

[41]  Mohamed Akil,et al.  Unsupervised image segmentation based on local pixel clustering and low-level region merging , 2016, 2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP).

[42]  Le Minh Hieu,et al.  An Iterative Mean Filter for Image Denoising , 2019, IEEE Access.

[43]  Bin Xiao,et al.  Object recognition based on the Region of Interest and optimal Bag of Words model , 2016, Neurocomputing.

[44]  Ling Shao,et al.  Learning Object-to-Class Kernels for Scene Classification , 2014, IEEE Transactions on Image Processing.

[45]  Piotr Szuster,et al.  Blob Extraction Algorithm in Detection of Convective Cells for Data Fusion , 2019, Journal of Telecommunications and Information Technology.

[46]  Kibum Kim,et al.  A Novel Statistical Method for Scene Classification Based on Multi-Object Categorization and Logistic Regression , 2020, Sensors.

[47]  Francesco Palmieri,et al.  Objective priors from maximum entropy in data classification , 2013, Inf. Fusion.

[48]  Wei Xia,et al.  Multi-View Classification via Adaptive Discriminant Analysis , 2019, IEEE Access.

[49]  Sabine Süsstrunk,et al.  Multi-spectral SIFT for scene category recognition , 2011, CVPR 2011.

[50]  Parvin Razzaghi,et al.  Parametric and nonparametric context models: A unified approach to scene parsing , 2018, Pattern Recognit..

[51]  Chun-Chieh Wang,et al.  Multi-Scale Analysis Based Ball Bearing Defect Diagnostics Using Mahalanobis Distance and Support Vector Machine , 2013, Entropy.

[52]  Zhipeng Ye,et al.  Hierarchical abstract semantic model for image classification , 2015, J. Electronic Imaging.

[53]  S. V. N. Vishwanathan,et al.  SPF-GMKL: generalized multiple kernel learning with a million kernels , 2012, KDD.

[54]  Yang Yang,et al.  Multi-Temporal Remote Sensing Image Registration Using Deep Convolutional Features , 2018, IEEE Access.

[55]  Yuan Xie,et al.  Instance-Level Salient Object Segmentation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Mikko Nuutinen,et al.  Performance measure of image and video quality assessment algorithms: subjective root-mean-square error , 2016, J. Electronic Imaging.

[57]  Miao Qi,et al.  Object Detection Based on Multiple Information Fusion Net , 2020, Applied Sciences.

[58]  Ying Liu,et al.  Context-aware attention network for image recognition , 2019, Neural Computing and Applications.

[59]  Antonio Criminisi,et al.  TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.

[60]  Andrew Clement,et al.  Semantic and Functional Relationships Among Objects Increase the Capacity of Visual Working Memory , 2018, Journal of experimental psychology. Learning, memory, and cognition.

[61]  Zahir M. Hussain,et al.  A feature-based structural measure: an image similarity measure for face recognition , 2017 .

[62]  Nicholas Roy,et al.  Indoor scene recognition through object detection , 2010, 2010 IEEE International Conference on Robotics and Automation.

[63]  Alan Wee-Chung Liew,et al.  Automatic Image Region Annotation by Genetic Algorithm-Based Joint Classifier and Feature Selection in Ensemble System , 2018, ACIIDS.

[64]  Stewart Worrall,et al.  Adapting Semantic Segmentation Models for Changes in Illumination and Camera Perspective , 2018, IEEE Robotics and Automation Letters.

[65]  Ahmad Jalal,et al.  Salient Segmentation based Object Detection and Recognition using Hybrid Genetic Transform , 2019, 2019 International Conference on Applied and Engineering Mathematics (ICAEM).

[66]  Shuang Song,et al.  Multiple Objects Positioning and Identification Method Based on Magnetic Localization System , 2016, IEEE Transactions on Magnetics.

[67]  Raimondo Schettini,et al.  Spatial Sampling Network for Fast Scene Understanding , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[68]  Serge N. Demidenko,et al.  Moments and maximum entropy method for expanded uncertainty estimation in measurements , 2017, 2017 IEEE International Instrumentation and Measurement Technology Conference (I2MTC).

[69]  Vijay John,et al.  Gabor Filter and Gershgorin Disk-based Convolutional Filter Constraining for Image Classification , 2017 .

[70]  Yi Zhang,et al.  PSANet: Point-wise Spatial Attention Network for Scene Parsing , 2018, ECCV.

[71]  Tetsuya Shimamura,et al.  Parametric Wiener Filter Based on Image Power Spectrum Sparsity , 2018, Journal of Signal Processing.

[72]  Xinhang Song,et al.  Multi-Scale Multi-Feature Context Modeling for Scene Recognition in the Semantic Manifold , 2017, IEEE Transactions on Image Processing.

[73]  Patrick Siarry,et al.  Improved spatial fuzzy c-means clustering for image segmentation using PSO initialization, Mahalanobis distance and post-segmentation correction , 2013, Digit. Signal Process..

[74]  Shiguang Shan,et al.  Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[75]  Wei-Ping Zhu,et al.  Multi-scale context for scene labeling via flexible segmentation graph , 2016, Pattern Recognit..

[76]  Wenzhong Guo,et al.  Salient Object Segmentation Based on Superpixel and Background Connectivity Prior , 2018, IEEE Access.