Overhead View Person Detection Using YOLO

In video surveillance system, one of the important task is to detect person. In recent years, different computer vision and deep learning algorithms have been developed, which provides robust person detection results. Majority of these developed techniques focused on frontal and asymmetric views. Therefore, in this paper, person detection has been performed from a significantly changed perspective i.e. overhead view. A deep learning model i.e. YOLO (You Look Only Once) has been explored in the context of person detection from overhead view. The model is trained on frontal view data set and tested on overhead view person data set. Furthermore, overhead view person counting has been performed using information of classified bounding box. The YOLO model produces significantly good results with TPR of 95% and FPR up to 0.2%.

[1]  Imran Ahmed,et al.  Person Detection from Overhead View: A Survey , 2019, International Journal of Advanced Computer Science and Applications.

[2]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[3]  John N. Carter,et al.  A robust person detector for overhead views , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[4]  Imran Ahmed,et al.  A robust algorithm for detecting people in overhead views , 2017, Cluster Computing.

[5]  Ali Yeon Md Shakaff,et al.  A robust multimedia surveillance system for people counting , 2017, Multimedia Tools and Applications.

[6]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[7]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Filip Malawski Top-view people counting in public transportation using Kinect , 2014 .

[9]  Thomas B. Moeslund,et al.  Pedestrian Counting with Occlusion Handling Using Stereo Thermal Cameras , 2016, Sensors.

[10]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Zhiqiang Li,et al.  Deep People Counting with Faster R-CNN and Correlation Tracking , 2016, ICIMCS.

[12]  Wei Fang,et al.  A novel YOLO-Based real-time people counting approach , 2017, 2017 International Smart Cities Conference (ISC2).

[13]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Nikos Komodakis,et al.  Object Detection via a Multi-region and Semantic Segmentation-Aware CNN Model , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[15]  Imran Ahmed,et al.  Energy Efficient Camera Solution for Video Surveillance , 2019, International Journal of Advanced Computer Science and Applications.

[16]  Liangliang Sun,et al.  Counting people by using a single camera without calibration , 2016, 2016 Chinese Control and Decision Conference (CCDC).

[17]  Yury Vizilter,et al.  Pedestrian detection in video surveillance using fully convolutional YOLO neural network , 2017, Optical Metrology.

[18]  B. Michaelis,et al.  Facial expression recognition based on Haar-like feature detection , 2008, Pattern Recognition and Image Analysis.

[19]  Jianxin Li,et al.  Benchmark Data and Method for Real-Time People Counting in Cluttered Scenes Using Depth Sensors , 2018, IEEE Transactions on Intelligent Transportation Systems.

[20]  Imran Ahmed,et al.  Person detector for different overhead views using machine learning , 2019, Int. J. Mach. Learn. Cybern..

[21]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[22]  Mario Vento,et al.  An efficient and effective method for people detection from top-view depth cameras , 2017, 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[23]  Satarupa Mukherjee,et al.  Anovel framework for automatic passenger counting , 2011, 2011 18th IEEE International Conference on Image Processing.

[24]  Mario Vento,et al.  A versatile and effective method for counting people on either RGB or depth overhead cameras , 2015, 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[25]  Awais Adnan,et al.  Robust Background Subtraction Based Person’s Counting From Overhead View , 2018, 2018 9th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON).

[26]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[27]  Bing-Fei Wu,et al.  The design and implementation of a vision-based people counting system in buses , 2016, 2016 International Conference on System Science and Engineering (ICSSE).

[28]  Gwanggil Jeon,et al.  Efficient topview person detector using point based transformation and lookup table , 2019, Comput. Commun..

[29]  Yi Li,et al.  R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[30]  Jakub Nalepa,et al.  Real-Time People Counting from Depth Images , 2015, BDAS.

[31]  Mario Vento,et al.  Counting people by RGB or depth overhead cameras , 2016, Pattern Recognit. Lett..

[32]  Arun Kumar Sangaiah,et al.  A Robust Features-Based Person Tracker for Overhead Views in Industrial Environment , 2018, IEEE Internet of Things Journal.

[33]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[34]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.