Incorporating wheelchair users in people detection

A wheelchair users detector is presented to extend people detection, providing a more general solution to detect people in environments such as houses adapted for independent and assisted living, hospitals, healthcare centers and senior residences. A wheelchair user model is incorporated in a detector whose detections are afterwards combined with the ones obtained using traditional people detectors (we define these as standing people detectors). We have trained a model for classical (DPM) and two for modern (Faster-RCNN and YOLOv3) detection algorithms, to compare their performance. Besides the extensibility proposed with respect to people detection, a dataset of video sequences has been recorded in a real in-door senior residence environment containing wheelchairs users and standing people and it has been released together with the associated ground-truth.

[1]  Chang-Te Lin,et al.  Applying Image Technology to Detect and Track the Wheelchair Patient Safety , 2013 .

[2]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  Michael Felsberg,et al.  The Visual Object Tracking VOT2015 Challenge Results , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[4]  Jin Young Choi,et al.  Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Joseph Redmon,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[6]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  L. Delahoche,et al.  Generic method for recognition of a wheelchair, even with a low resolution-effective sensor , 2004, 2004 IEEE International Conference on Industrial Technology, 2004. IEEE ICIT '04..

[9]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Álvaro García-Martín,et al.  Post-processing approaches for improving people detection performance , 2015, Comput. Vis. Image Underst..

[11]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Sergio A. Velastin,et al.  Intelligent distributed surveillance systems: a review , 2005 .

[13]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[15]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Bernt Schiele,et al.  Pictorial structures revisited: People detection and articulated pose estimation , 2009, CVPR.

[17]  Pau-Choo Chung,et al.  Recovery of 3-D location and orientation of a wheelchair in a calibrated environment by using single perspective geometry , 2007, TENCON 2007 - 2007 IEEE Region 10 Conference.

[18]  Stefan Roth,et al.  People-tracking-by-detection and people-detection-by-tracking , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Huchuan Lu,et al.  Deep visual tracking: Review and experimental comparison , 2018, Pattern Recognit..

[20]  Franck Multon,et al.  Fall Detection With Multiple Cameras: An Occlusion-Resistant Method Based on 3-D Silhouette Vertical Distribution , 2011, IEEE Transactions on Information Technology in Biomedicine.

[21]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[22]  Bohyung Han,et al.  Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Piotr Dollár,et al.  Crosstalk Cascades for Frame-Rate Pedestrian Detection , 2012, ECCV.

[24]  David Gerónimo Gómez,et al.  Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Ignacio Parra,et al.  Combination of Feature Extraction Methods for SVM Pedestrian Detection , 2007, IEEE Transactions on Intelligent Transportation Systems.

[26]  Katsuhiko Sakaue,et al.  Wheelchair recognition by using stereo vision and histogram of oriented gradients (HOG) in real environments , 2009, 2009 Workshop on Applications of Computer Vision (WACV).

[27]  Pau-Choo Chung,et al.  Assistance Instruments Detection Using Geometry Constrained Knowledge for Health Care Centers , 2010, 2010 5th International Conference on Future Information Technology.

[28]  Álvaro García-Martín,et al.  Robust Real Time Moving People Detection in Surveillance Scenarios , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[29]  Bernt Schiele,et al.  Multi-cue onboard pedestrian detection , 2009, CVPR.

[30]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[31]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[32]  Xiaogang Wang,et al.  DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection , 2014, ArXiv.

[33]  Jitendra Malik,et al.  Deformable part models are convolutional neural networks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Duan-Yu Chen,et al.  Robust wheelchair pedestrian detection using sparse representation , 2012, 2012 Visual Communications and Image Processing.

[35]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[37]  Huchuan Lu,et al.  Visual Tracking via Coarse and Fine Structural Local Sparse Appearance Models , 2016, IEEE Transactions on Image Processing.

[38]  Xiaogang Wang,et al.  Visual Tracking with Fully Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[39]  Mubarak Shah,et al.  Wheelchair Detection in a Calibrated Environment , 2002 .

[40]  Sergio A. Velastin,et al.  Backgroundless detection of pedestrians in cluttered conditions based on monocular images: a review , 2012 .

[41]  Osama Masoud,et al.  Estimating pedestrian counts in groups , 2008, Comput. Vis. Image Underst..

[42]  Pau-Choo Chung,et al.  Wheelchair Detection Using Cascaded Decision Tree , 2010, IEEE Transactions on Information Technology in Biomedicine.

[43]  Bernt Schiele,et al.  Pedestrian detection in crowded scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[44]  Nadia Magnenat-Thalmann,et al.  Fall Detection Based on Body Part Tracking Using a Depth Camera , 2015, IEEE Journal of Biomedical and Health Informatics.

[45]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.