Automated Sustainable Multi-Object Segmentation and Recognition via Modified Sampling Consensus and Kernel Sliding Perceptron

Object recognition in depth images is challenging and persistent task in machine vision, robotics, and automation of sustainability. Object recognition tasks are a challenging part of various multimedia technologies for video surveillance, human–computer interaction, robotic navigation, drone targeting, tourist guidance, and medical diagnostics. However, the symmetry that exists in real-world objects plays a significant role in perception and recognition of objects in both humans and machines. With advances in depth sensor technology, numerous researchers have recently proposed RGB-D object recognition techniques. In this paper, we introduce a sustainable object recognition framework that is consistent despite any change in the environment, and can recognize and analyze RGB-D objects in complex indoor scenarios. Firstly, after acquiring a depth image, the point cloud and the depth maps are extracted to obtain the planes. Then, the plane fitting model and the proposed modified maximum likelihood estimation sampling consensus (MMLESAC) are applied as a segmentation process. Then, depth kernel descriptors (DKDES) over segmented objects are computed for single and multiple object scenarios separately. These DKDES are subsequently carried forward to isometric mapping (IsoMap) for feature space reduction. Finally, the reduced feature vector is forwarded to a kernel sliding perceptron (KSP) for the recognition of objects. Three datasets are used to evaluate four different experiments by employing a cross-validation scheme to validate the proposed model. The experimental results over RGB-D object, RGB-D scene, and NYUDv1 datasets demonstrate overall accuracies of 92.2%, 88.5%, and 90.5% respectively. These results outperform existing state-of-the-art methods and verify the suitability of the method.

[1]  Harihara Santosh Dadi,et al.  Improved Face Recognition Rate Using HOG Features and SVM Classifier , 2016 .

[2]  B. Caputo,et al.  A Performance Evaluation of Exact and Approximate Match Kernels for Object Recognition , 2010 .

[3]  Zhi Jin,et al.  Depth image-based plane detection , 2018, Big Data Analytics.

[4]  Olivier Alata,et al.  Is there a best color space for color image characterization or representation based on Multivariate Gaussian Mixture Model? , 2009, Comput. Vis. Image Underst..

[5]  Mohammed Bennamoun,et al.  RGB-D Object Recognition and Grasp Detection Using Hierarchical Cascaded Forests , 2017, IEEE Transactions on Robotics.

[6]  Emmanuel Zenou,et al.  Using Shape Descriptors for UAV Detection , 2018, IRIACV.

[7]  Giuseppina Piscitelli,et al.  Artificial Intelligence and Machine Learning Applications in Smart Production: Progress, Trends, and Directions , 2020 .

[8]  Humberto Bustince,et al.  New method to assess barley nitrogen nutrition status based on image colour analysis , 2009 .

[9]  Ioannis Kompatsiaris,et al.  H-RANSAC: A HYBRID POINT CLOUD SEGMENTATION COMBINING 2D AND 3D DATA , 2018, ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[10]  Xu Yong,et al.  Three-stage network for age estimation , 2019 .

[11]  Zhong Liu,et al.  An Effective 3D Shape Descriptor for Object Recognition with RGB-D Sensors , 2017, Sensors.

[12]  Michela Bertolotto,et al.  Octree-based region growing for point cloud segmentation , 2015 .

[13]  Ahmad Jalal,et al.  Scene Understanding and Recognition: Statistical Segmented Model using Geometrical Features and Gaussian Naïve Bayes , 2019, 2019 International Conference on Applied and Engineering Mathematics (ICAEM).

[14]  Duoqian Miao,et al.  Influence of kernel clustering on an RBFN , 2019, CAAI Trans. Intell. Technol..

[15]  Ahmad Jalal,et al.  Wearable Inertial Sensors for Daily Activity Analysis Based on Adam Optimization and the Maximum Entropy Markov Model , 2020, Entropy.

[16]  Yambem Jina Chanu,et al.  A Survey on Image Segmentation Methods using Clustering Techniques , 2017, European Journal of Engineering and Technology Research.

[17]  Xianghong Hua,et al.  Indoor Point Cloud Segmentation Using Iterative Gaussian Mapping and Improved Model Fitting , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Leonardo Acho,et al.  Image Segmentation Based on Statistical Confidence Intervals , 2018, Entropy.

[19]  Kibum Kim,et al.  A Novel Statistical Method for Scene Classification Based on Multi-Object Categorization and Logistic Regression , 2020, Sensors.

[20]  Binh Thai Pham,et al.  Computational Hybrid Machine Learning Based Prediction of Shear Capacity for Steel Fiber Reinforced Concrete Beams , 2020, Sustainability.

[21]  Ziad Alqadi,et al.  Analysis of color image features extraction using texture methods , 2019, TELKOMNIKA (Telecommunication Computing Electronics and Control).

[22]  Hyuk Choi,et al.  Hole Filling Method for Depth Image Based Rendering Based on Boundary Decision , 2017, IEEE Signal Processing Letters.

[23]  Muhammad Khan,et al.  A Survey: Image Segmentation Techniques , 2014 .

[24]  Yu Kong,et al.  Learning hierarchical 3D kernel descriptors for RGB-D action recognition , 2016, Comput. Vis. Image Underst..

[25]  Shu Zhang,et al.  Pose detection of parallel robot based on improved RANSAC algorithm , 2019, Measurement and Control.

[26]  Seba Susan,et al.  New shape descriptor in the context of edge continuity , 2019, CAAI Trans. Intell. Technol..

[27]  Kibum Kim,et al.  Automatic Recognition of Human Interaction via Hybrid Descriptors and Maximum Entropy Markov Model Using Depth Sensors , 2020, Entropy.

[28]  Maja J. Mataric,et al.  A spatio-temporal extension to Isomap nonlinear dimension reduction , 2004, ICML.

[29]  Bahram Lavi Sefidgari,et al.  Comparative Study of the Behavior of Feature Reduction Methods in Person Re-identification Task , 2018, ICPRAM.

[30]  Majid Ali Khan Quaid,et al.  Wearable sensors based human behavioral pattern recognition using statistical features and reweighted genetic algorithm , 2019, Multimedia Tools and Applications.

[31]  Khurram Khurshid,et al.  On the improvement of foreground-background model-based object tracker , 2017, IET Comput. Vis..

[32]  Ying Wang,et al.  A survey on algorithms of hole filling in 3D surface reconstruction , 2016, The Visual Computer.

[33]  Kui Jia,et al.  Canonical Correlation Analysis Regularization: An Effective Deep Multiview Learning Baseline for RGB-D Object Recognition , 2019, IEEE Transactions on Cognitive and Developmental Systems.

[34]  Ngai-Man Cheung,et al.  On-Device Scalable Image-Based Localization via Prioritized Cascade Search and Fast One-Many RANSAC , 2018, IEEE Transactions on Image Processing.

[35]  Reza Safabakhsh,et al.  Novel Adaptive Genetic Algorithm Sample Consensus , 2017, Appl. Soft Comput..

[36]  Hamidullah Binol,et al.  Recognition of the stomach cancer images with probabilistic HOG feature vector histograms by using HOG features , 2017, 2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY).

[37]  Fuchun Sun,et al.  Multi-Modal Local Receptive Field Extreme Learning Machine for object recognition , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[38]  Amjad Rehman,et al.  A Sustainable Deep Learning Framework for Object Recognition Using Multi-Layers Deep Features Fusion and Selection , 2020, Sustainability.

[39]  Jiri Matas,et al.  Graph-Cut RANSAC , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[40]  Daijin Kim,et al.  Robust human activity recognition from depth video using spatiotemporal multi-fused features , 2017, Pattern Recognit..