Generation and evaluation of synthetic models for training people detectors

There is a large demand in the area of video-surveillance, especially in people detection, which has caused a large increase in the number of researches and resources in this field. As training images and annotations are not always available, it is important to consider the cost involved in creating the detector models. For example, for elderly people detection, the detector must have into account different positions such as standing, sitting, in a wheelchair, etc. Therefore, this work has the main objective of reducing the amount of resources needed to generate the detection model, saving the cost of having to record new sequences and generate the associated annotations for a detector training. To achieve this, three synthetic image datasets have been created in order to train three different models, evaluating which model is optimal and finally analyzing its feasibility by comparing it with a people detector for wheelchair users trained with real images. Other people detection scenarios in which this technique could be applied are, for example, people riding horses or motorbikes, or people carrying supermarket carts. The synthetic datasets have been generated by combining images of standing people with wheelchair images, combining image patches, and segmenting sections of people (trunk, legs, etc.) to add them to the wheelchair image. As expected, the obtained results have a reduction of efficiency (between 21 and 25%) in exchange for the enormous saving in human annotation and resources to record real images.

[1]  Álvaro García-Martín,et al.  People detection in surveillance: classification and evaluation , 2015, IET Comput. Vis..

[2]  Ignacio Parra,et al.  Combination of Feature Extraction Methods for SVM Pedestrian Detection , 2007, IEEE Transactions on Intelligent Transportation Systems.

[3]  Pau-Choo Chung,et al.  Wheelchair Detection Using Cascaded Decision Tree , 2010, IEEE Transactions on Information Technology in Biomedicine.

[4]  Mubarak Shah,et al.  Wheelchair Detection in a Calibrated Environment , 2002 .

[5]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  L. Delahoche,et al.  Generic method for recognition of a wheelchair, even with a low resolution-effective sensor , 2004, 2004 IEEE International Conference on Industrial Technology, 2004. IEEE ICIT '04..

[7]  Álvaro García-Martín,et al.  On collaborative people detection and tracking in complex scenarios , 2012, Image Vis. Comput..

[8]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Luc Van Gool,et al.  Coupled Detection and Trajectory Estimation for Multi-Object Tracking , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[10]  Pau-Choo Chung,et al.  Recovery of 3-D location and orientation of a wheelchair in a calibrated environment by using single perspective geometry , 2007, TENCON 2007 - 2007 IEEE Region 10 Conference.

[11]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.