Simultaneous Object Detection, Tracking, and Event Recognition

The common internal structure and algorithmic organization of object detection, detection-based tracking, and event recognition facilitates a general approach to integrating these three components. This supports multidirectional information flow between these components allowing object detection to influence tracking and event recognition and event recognition to influence tracking and object detection. The performance of the combination can exceed the performance of the components in isolation. This can be done with linear asymptotic complexity.

[1]  Demetri Terzopoulos,et al.  A Cognitive Vision System for Space Robotics , 2004 .

[2]  Irfan A. Essa,et al.  Exploiting human actions and object context for recognition tasks , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Gu Xu,et al.  An HMM-based framework for video semantic analysis , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Daniel P. Huttenlocher,et al.  Distance Transforms of Sampled Functions , 2012, Theory Comput..

[5]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[6]  Bo Wu,et al.  Robust Object Tracking based on Detection with Soft Decision , 2008, 2008 IEEE Workshop on Motion and video Computing.

[7]  C Tomasi,et al.  Shape and motion from image streams: a factorization method. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[8]  David A. McAllester,et al.  Cascade object detection with deformable part models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Charless C. Fowlkes,et al.  Globally-optimal greedy algorithms for tracking a variable number of objects , 2011, CVPR 2011.

[10]  Sven J. Dickinson,et al.  A Research Roadmap of Cognitive Vision , 2005 .

[11]  Yves Lucet New sequential exact Euclidean distance transform algorithms based on convex analysis , 2009, Image Vis. Comput..

[12]  Ramakant Nevatia,et al.  Key Object Driven Multi-category Object Recognition, Localization and Tracking Using Spatio-temporal Context , 2008, ECCV.

[13]  Sven J. Dickinson,et al.  Spatiotemporal Contour Grouping Using Abstract Part Models , 2010, ACCV.

[14]  Larry S. Davis,et al.  Objects in Action: An Approach for Combining Action Understanding and Object Perception , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[16]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[17]  Jeffrey Mark Siskind,et al.  A Maximum-Likelihood Approach to Visual Event Classification , 1996, ECCV.

[18]  Desmond P. Taylor,et al.  Convolutional Codes and Their Performance in Communication Systems , 2007 .

[19]  Sven J. Dickinson,et al.  Video In Sentences Out , 2012, UAI.

[20]  Rama Chellappa,et al.  A generic approach to simultaneous tracking and verification in video , 2002, IEEE Trans. Image Process..

[21]  Xiaokang Yang,et al.  Event recognition with time varying Hidden Markov Model , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[22]  Svetha Venkatesh,et al.  Combining image regions and human activity for indirect object recognition in indoor wide-angle views , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[23]  Shiqiang Yang,et al.  Motion based event recognition using HMM , 2002, Object recognition supported by user interaction for service robots.

[24]  Alex Pentland,et al.  Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  D. Castañón Efficient algorithms for finding the K best paths through a trellis , 1990 .

[26]  K. X. M. Tzeng,et al.  Convolutional Codes and 'Their Performance in Communication Systems , 1971 .

[27]  Jon M. Kleinberg,et al.  Fast Algorithms for Large-State-Space HMMs with Applications to Web Usage Analysis , 2003, NIPS.

[28]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[29]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Anthony G. Cohn,et al.  Cognitive Vision: Integrating Symbolic Qualitative Representations with Computer Vision , 2006, Cognitive Vision Systems.

[33]  William T. Freeman,et al.  Orientation Histograms for Hand Gesture Recognition , 1995 .

[34]  Jack K. Wolf,et al.  Finding the best set of K paths through a trellis with application to multitarget tracking , 1989 .

[35]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .