Human Tracking and Pose Estimation in Open Spaces

Keywords: multi-person tracking ; human pose estimation ; surveillance ; conditional random field ; adaptation ; optimization ; temporal filtering ; manifold alignment These Ecole polytechnique federale de Lausanne EPFL, n° 6276 (2014)Programme doctoral Genie electriqueFaculte des sciences et techniques de l'ingenieurInstitut de genie electrique et electroniqueLaboratoire de l'IDIAPJury: Prof. C.N. Jones (president) ; Dr J.-M. Odobez (directeur) ; Dr F. Fleuret, Dr P. Perez, Prof. T. Xiang (rapporteurs) Public defense: 2014-8-11 Reference doi:10.5075/epfl-thesis-6276Print copy in library catalog Record created on 2014-08-06, modified on 2017-05-10

[1]  Xin Zhang,et al.  Object class detection: A survey , 2013, CSUR.

[2]  Mohan M. Trivedi,et al.  Understanding human interactions with track and body synergies (TBS) captured from multiple views , 2008, Comput. Vis. Image Underst..

[3]  Yaakov Bar-Shalom,et al.  Sonar tracking of multiple targets using joint probabilistic data association , 1983 .

[4]  Dariu Gavrila,et al.  A mixed generative-discriminative framework for pedestrian classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Michael Brady,et al.  Automatic Human Behaviour Recognition and Explanation for CCTV Video Surveillance , 2008 .

[6]  Anand Singh Jalal,et al.  The State-of-the-Art in Visual Object Tracking , 2012, Informatica.

[7]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Ian D. Reid,et al.  Guiding Visual Surveillance by Tracking Human Attention , 2009, BMVC.

[9]  Ramakant Nevatia,et al.  Robust Object Tracking by Hierarchical Association of Detection Responses , 2008, ECCV.

[10]  Martial Michel,et al.  The CLEAR 2007 Evaluation , 2007, CLEAR.

[11]  Ying Wang,et al.  Recognize Multi-people Interaction Activity by PCA-HMMs , 2006, ACCV.

[12]  Jian Yao,et al.  Multi-Camera Multi-Person 3D Space Tracking with MCMC in Surveillance Scenarios , 2008, ECCV 2008.

[13]  Ramakant Nevatia,et al.  Online Learned Discriminative Part-Based Appearance Models for Multi-human Tracking , 2012, ECCV.

[14]  Cor J. Veenman,et al.  Resolving Motion Correspondence for Densely Moving Points , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Duc Phu Chau,et al.  Object Tracking in Videos: Approaches and Issues , 2013, ArXiv.

[16]  Teddy Ko,et al.  A survey on behavior analysis in video surveillance for homeland security applications , 2008, 2008 37th IEEE Applied Imagery Pattern Recognition Workshop.

[17]  Alexander Artikis,et al.  Behaviour Recognition using the Event Calculus , 2009, AIAI.

[18]  Alexandre Heili,et al.  Combined estimation of location and body pose in surveillance video , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[19]  Alexandre Heili,et al.  Exploiting Long-Term Connectivity and Visual Motion in CRF-Based Multi-Person Tracking , 2014, IEEE Transactions on Image Processing.

[20]  Pascal Fua,et al.  Multi-camera Tracking and Atypical Motion Detection with Behavioral Maps , 2008, ECCV.

[21]  Jean-Marc Odobez,et al.  Embedding Motion in Model-Based Stochastic Tracking , 2004, IEEE Transactions on Image Processing.

[22]  Yong Pei,et al.  Integrating multi-stage depth-induced contextual information for human action recognition and localization , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[23]  Andrew McCallum,et al.  An Introduction to Conditional Random Fields , 2010, Found. Trends Mach. Learn..

[24]  Michel Bierlaire,et al.  Behavioral Priors for Detection and Tracking of Pedestrians in Video Sequences , 2006, International Journal of Computer Vision.

[25]  Shaogang Gong,et al.  Reidentification by Relative Distance Comparison , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Patrick Pérez,et al.  Data fusion for visual tracking with particles , 2004, Proceedings of the IEEE.

[27]  Xiaogang Wang,et al.  Unsupervised Salience Learning for Person Re-identification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[29]  David Beymer,et al.  Real-Time Tracking of Multiple People Using Continuous Detection , 1999 .

[30]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[31]  Subramanian Ramanathan,et al.  No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion , 2013, 2013 IEEE International Conference on Computer Vision.

[32]  Alexandre Heili,et al.  Detection-based multi-human tracking using a CRF model , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[33]  Pascal Fua,et al.  Multi-Commodity Network Flow for Tracking Multiple People , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Alessio Del Bue,et al.  Social interaction discovery by statistical analysis of F-formations , 2011, BMVC.

[35]  Gian Luca Foresti,et al.  Dynamic Models for People Detection and Tracking , 2008, 2008 IEEE Fifth International Conference on Advanced Video and Signal Based Surveillance.

[36]  Thomas S. Huang,et al.  Image processing , 1971 .

[37]  James M. Rehg,et al.  Statistical Color Models with Application to Skin Detection , 2004, International Journal of Computer Vision.

[38]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[39]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Mun Wai Lee,et al.  Human Upper Body Pose Estimation in Static Images , 2004, ECCV.

[41]  Frank Dellaert,et al.  An MCMC-Based Particle Filter for Tracking Multiple Interacting Targets , 2004, ECCV.

[42]  Jean-Marc Odobez,et al.  We are not contortionists: Coupled adaptive learning for head and body orientation estimation in surveillance video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Subramanian Ramanathan,et al.  Exploring Transfer Learning Approaches for Head Pose Classification from Multi-view Surveillance Images , 2013, International Journal of Computer Vision.

[44]  Peter V. Gehler,et al.  Poselet Conditioned Pictorial Structures , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  Shin'ichi Satoh,et al.  Robust Recognition of Specific Human Behaviors in Crowded Surveillance Video Sequences , 2010, EURASIP J. Adv. Signal Process..

[46]  Samuel S. Blackman,et al.  Multiple-Target Tracking with Radar Applications , 1986 .

[47]  Pascal Fua,et al.  Tracking multiple people under global appearance constraints , 2011, 2011 International Conference on Computer Vision.

[48]  Jake K. Aggarwal,et al.  Video Retrieval of Human Interactions Using Model-Based Motion Tracking and Multi-layer Finite State Automata , 2003, CIVR.

[49]  Ramakant Nevatia,et al.  Learning affinities and dependencies for multi-target tracking using a CRF model , 2011, CVPR 2011.

[50]  Ben J. A. Kröse,et al.  Detecting F-formations as dominant sets , 2011, ICMI '11.

[51]  Narendra Ahuja,et al.  Improving head and body pose estimation through semi-supervised manifold alignment , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[52]  Ramakant Nevatia,et al.  Bayesian human segmentation in crowded situations , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[53]  Konrad Schindler,et al.  Challenges of Ground Truth Evaluation of Multi-target Tracking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[54]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[56]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[57]  Fatih Murat Porikli,et al.  Pedestrian Detection via Classification on Riemannian Manifolds , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Slawomir Bak,et al.  Multiple-shot human re-identification by Mean Riemannian Covariance Grid , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[60]  Daniel P. Huttenlocher,et al.  Tracking non-rigid objects in complex scenes , 1993, 1993 (4th) International Conference on Computer Vision.

[61]  Ian D. Reid,et al.  Estimating Gaze Direction from Low-Resolution Faces in Video , 2006, ECCV.

[62]  Shuicheng Yan,et al.  Synchronized Submanifold Embedding for Person-Independent Pose Estimation and Beyond , 2009, IEEE Transactions on Image Processing.

[63]  Ramakant Nevatia,et al.  Multi-target tracking by on-line learned discriminative appearance models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[64]  Shaogang Gong,et al.  A Unified Bayesian Framework for Adaptive Visual Tracking , 2009, BMVC.

[65]  Daniel D. Lee,et al.  Semisupervised alignment of manifolds , 2005, AISTATS.

[66]  Sridhar Mahadevan,et al.  Sparse Manifold Alignment , 2012 .

[67]  Shaogang Gong,et al.  Head Pose Classification in Crowded Scenes , 2009, BMVC.

[68]  Rama Chellappa,et al.  Estimation of Object Motion Parameters from Noisy Images , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[69]  Bernt Schiele,et al.  Monocular 3D pose estimation and tracking by detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[70]  Jean-Marc Odobez,et al.  Short-Term Spatio–Temporal Clustering Applied to Multiple Moving Speakers , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[71]  Noel E. O'Connor,et al.  Anti-social behavior detection in audio-visual surveillance systems , 2009 .

[72]  Shaogang Gong,et al.  Human pose estimation using structural support vector machines , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[73]  Shaogang Gong,et al.  Learning human pose in crowd , 2010, MPVA '10.

[74]  Alessandro Perina,et al.  Person re-identification by symmetry-driven accumulation of local features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[75]  Bernt Schiele,et al.  New features and insights for pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[76]  Tieniu Tan,et al.  An Integrated Traffic and Pedestrian Model-Based Vision System , 1997, BMVC.

[77]  Rita Cucchiara,et al.  Multi-stage Sampling with Boosting Cascades for Pedestrian Detection in Images and Videos , 2010, ECCV.

[78]  Jean-Marc Odobez,et al.  Robust Multiresolution Estimation of Parametric Motion Models , 1995, J. Vis. Commun. Image Represent..

[79]  Somayeh Danafar,et al.  Action Recognition for Surveillance Applications Using Optic Flow and SVM , 2007, ACCV.

[80]  R. Collins,et al.  Marked point processes for crowd counting , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[81]  Huadong Ma,et al.  Human detection using multi-camera and 3D scene knowledge , 2011, 2011 18th IEEE International Conference on Image Processing.

[82]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[83]  Afshin Dehghan,et al.  GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs , 2012, ECCV.

[84]  Francesco Setti,et al.  Group detection in still images by F-formation modeling: A comparative study , 2013, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS).

[85]  Subhransu Maji,et al.  Action recognition from a distributed representation of pose and appearance , 2011, CVPR 2011.

[86]  Luc Van Gool,et al.  Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[87]  Dmitry B. Goldgof,et al.  Understanding Transit Scenes: A Survey on Human Behavior-Recognition Algorithms , 2010, IEEE Transactions on Intelligent Transportation Systems.

[88]  Larry S. Davis,et al.  Shape-Based Human Detection and Segmentation via Hierarchical Part-Template Matching , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[89]  Duc Phu Chau,et al.  Multi-target tracking by discriminative analysis on Riemannian manifold , 2012, 2012 19th IEEE International Conference on Image Processing.

[90]  Jian Yao,et al.  Fast human detection from videos using covariance features , 2008, ECCV 2008.

[91]  Stanley T. Birchfield,et al.  Elliptical head tracking using intensity gradients and color histograms , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[92]  Slawomir Bak,et al.  Learning to Match Appearances by Correlations in a Covariance Metric Space , 2012, ECCV.

[93]  Ramakant Nevatia,et al.  An online learned CRF model for multi-target tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[94]  Frank Dellaert,et al.  Efficient particle filter-based tracking of multiple interacting targets using an MRF-based motion model , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[95]  Robert T. Collins,et al.  Multi-target Tracking by Lagrangian Relaxation to Min-cost Network Flow , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[96]  Jean-Marc Odobez,et al.  MRF-based motion segmentation exploiting a 2D motion model robust estimation , 1995, Proceedings., International Conference on Image Processing.

[97]  Ian D. Reid,et al.  Stable multi-target tracking in real-time surveillance video , 2011, CVPR 2011.

[98]  Bo Li,et al.  A review on vision-based pedestrian detection in intelligent transportation systems , 2012, Proceedings of 2012 9th IEEE International Conference on Networking, Sensing and Control.

[99]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[100]  Rainer Stiefelhagen,et al.  Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics , 2008, EURASIP J. Image Video Process..

[101]  Chang Huang,et al.  Learning to associate: HybridBoosted multi-target tracker for crowded scene , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[102]  Jitendra Malik,et al.  Poselets: Body part detectors trained using 3D human pose annotations , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[103]  Robert T. Collins,et al.  Multitarget data association with higher-order motion models , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[104]  Alexandre Heili,et al.  A joint estimation of head and body orientation cues in surveillance video , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[105]  Jean-Marc Odobez,et al.  Parameter estimation and contextual adaptation for a multi-object tracking CRF model , 2013, 2013 IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (PETS).

[106]  Ian D. Reid,et al.  Unsupervised learning of a scene-specific coarse gaze estimator , 2011, 2011 International Conference on Computer Vision.

[107]  Duc Phu Chau,et al.  A multi-feature tracking algorithm enabling adaptation to context variations , 2011, ICDP.

[108]  Paul W. Fieguth,et al.  Color-based tracking of heads and other mobile objects at video frame rates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[109]  M. Shah,et al.  Object tracking: A survey , 2006, CSUR.

[110]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[111]  Jean-Marc Odobez,et al.  Using particles to track varying numbers of interacting people , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[112]  FuaPascal,et al.  Multicamera People Tracking with a Probabilistic Occupancy Map , 2008 .

[113]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[114]  Feng Wu,et al.  Very Fast Template Matching , 2002, ECCV.

[115]  F. Fleuret,et al.  Multiple object tracking using flow linear programming , 2009, 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance.

[116]  Mohamed R. Amer,et al.  Multiobject tracking as maximum weight independent set , 2011, CVPR 2011.

[117]  Bingbing Ni,et al.  RGBD-HuDaAct: A color-depth video database for human daily activity recognition , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[118]  Eranda C Ela,et al.  Assignment Problems , 1964, Comput. J..

[119]  Mubarak Shah,et al.  A noniterative greedy algorithm for multiframe point correspondence , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[120]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[121]  François Fleuret,et al.  Exact Acceleration of Linear Object Detectors , 2012, ECCV.

[122]  Fatih Murat Porikli,et al.  Region Covariance: A Fast Descriptor for Detection and Classification , 2006, ECCV.

[123]  Dariu Gavrila,et al.  A Bayesian, Exemplar-Based Approach to Hierarchical Shape Matching , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[124]  Takahiro Okabe,et al.  Appearance-based head pose estimation with scene-specific adaptation , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[125]  A. Kendon Studies in the behavior of social interaction , 1977 .

[126]  Ramakant Nevatia,et al.  Global data association for multi-object tracking using network flows , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[127]  Daniel Snow,et al.  Pedestrian detection using boosted features over many frames , 2008, 2008 19th International Conference on Pattern Recognition.

[128]  Shai Avidan,et al.  Ensemble Tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[129]  Vladimir Kolmogorov,et al.  What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[130]  Vittorio Murino,et al.  Socially intelligent surveillance and monitoring: Analysing social dimensions of physical space , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[131]  Anupam Agrawal,et al.  A survey on activity recognition and behavior understanding in video surveillance , 2012, The Visual Computer.