A Systematic Deep Learning Based Overhead Tracking and Counting System Using RGB-D Remote Cameras

Automatic head tracking and counting using depth imagery has various practical applications in security, logistics, queue management, space utilization and visitor counting. However, no currently available system can clearly distinguish between a human head and other objects in order to track and count people accurately. For this reason, we propose a novel system that can track people by monitoring their heads and shoulders in complex environments and also count the number of people entering and exiting the scene. Our system is split into six phases; at first, preprocessing is done by converting videos of a scene into frames and removing the background from the video frames. Second, heads are detected using Hough Circular Gradient Transform, and shoulders are detected by HOG based symmetry methods. Third, three robust features, namely, fused joint HOG-LBP, Energy based Point clouds and Fused intra-inter trajectories are extracted. Fourth, the Apriori-Association is implemented to select the best features. Fifth, deep learning is used for accurate people tracking. Finally, heads are counted using Cross-line judgment. The system was tested on three benchmark datasets: the PCDS dataset, the MICC people counting dataset and the GOTPD dataset and counting accuracy of 98.40%, 98%, and 99% respectively was achieved. Our system obtained remarkable results.

[1]  Mario Vento,et al.  Counting people by RGB or depth overhead cameras , 2016, Pattern Recognit. Lett..

[2]  Tsong-Yi Chen,et al.  A Cost-Effective People-Counter for a Crowd of Moving People Based on Two-Stage Segmentation , 2012, J. Inf. Hiding Multim. Signal Process..

[3]  Guesh Dagnew,et al.  Deep learning approach for microarray cancer data classification , 2020, CAAI Trans. Intell. Technol..

[4]  Cristina Losada-Gutierrez,et al.  Headgear Accessories Classification Using an Overhead Depth Sensor , 2017, Sensors.

[5]  Rui Jiang,et al.  Object tracking on event cameras with offline-online learning , 2020, CAAI Trans. Intell. Technol..

[6]  Benedetto Barabino,et al.  An Offline Framework for Handling Automatic Passenger Counting Raw Data , 2014, IEEE Transactions on Intelligent Transportation Systems.

[7]  Jing Lv,et al.  People-flow counting in complex environments by combining depth and color information , 2016, Multimedia Tools and Applications.

[8]  Wei Liang,et al.  Recognising human interaction from videos by a discriminative model , 2014, IET Comput. Vis..

[9]  Zhengyou Zhang,et al.  Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..

[10]  Xu Yong,et al.  Three-stage network for age estimation , 2019 .

[11]  Munkhjargal Gochoo,et al.  Modeling Two-Person Segmentation and Locomotion for Stereoscopic Action Identification: A Sustainable Video Surveillance System , 2021, Sustainability.

[12]  Munkhjargal Gochoo,et al.  Stochastic Remote Sensing Event Classification over Adaptive Posture Estimation via Multifused Data and Deep Belief Network , 2021, Remote. Sens..

[13]  Tae-Seong Kim,et al.  Human Activity Recognition via Recognized Body Parts of Human Depth Silhouettes for Residents Monitoring Services at Smart Home , 2013 .

[14]  Jean-Philippe Thiran,et al.  Counting Pedestrians in Video Sequences Using Trajectory Clustering , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Yi Sun,et al.  Head-and-Shoulder Detection in Varying Pose , 2005, ICNC.

[16]  Kibum Kim,et al.  Automatic Recognition of Human Interaction via Hybrid Descriptors and Maximum Entropy Markov Model Using Depth Sensors , 2020, Entropy.

[17]  Jehad Ali,et al.  3D Head Pose Estimation through Facial Features and Deep Convolutional Neural Networks , 2021, Computers, Materials & Continua.

[18]  Tan-Hsu Tan,et al.  Novel IoT-Based Privacy-Preserving Yoga Posture Recognition System Using Low-Resolution Infrared Sensors and Deep Learning , 2019, IEEE Internet of Things Journal.

[19]  Kibum Kim,et al.  A Novel Statistical Method for Scene Classification Based on Multi-Object Categorization and Logistic Regression , 2020, Sensors.

[20]  Munkhjargal Gochoo,et al.  Robust Active Shape Model via Hierarchical Feature Extraction with SFS-Optimized Convolution Neural Network for Invariant Human Age Classification , 2021, Electronics.

[21]  Alvaro Fernandez-Rincon,et al.  Robust people detection using depth information from an overhead Time-of-Flight camera , 2017, Expert Syst. Appl..

[22]  Kibum Kim,et al.  Automated Sustainable Multi-Object Segmentation and Recognition via Modified Sampling Consensus and Kernel Sliding Perceptron , 2020, Symmetry.

[23]  Jianxin Li,et al.  Benchmark Data and Method for Real-Time People Counting in Cluttered Scenes Using Depth Sensors , 2018, IEEE Transactions on Intelligent Transportation Systems.

[24]  Andrea Fusiello,et al.  Tracking Human Motion with Multiple Cameras Using an Articulated Model , 2009, MIRAGE.

[25]  Normaziah Abdul Aziz,et al.  Improved Malware Detection Model with Apriori Association Rule and Particle Swarm Optimization , 2019, Secur. Commun. Networks.

[26]  Weria Khaksar,et al.  Facial Expression Recognition Using Salient Features and Convolutional Neural Network , 2017, IEEE Access.

[27]  Maria Mahmood,et al.  Students’ behavior mining in e-learning environment using cognitive processes with information technologies , 2019, Education and Information Technologies.

[28]  Ahmad Jalal,et al.  Wearable Inertial Sensors for Daily Activity Analysis Based on Adam Optimization and the Maximum Entropy Markov Model , 2020, Entropy.

[29]  Majid Ali Khan Quaid,et al.  Wearable sensors based human behavioral pattern recognition using statistical features and reweighted genetic algorithm , 2019, Multimedia Tools and Applications.