Minimal-latency human action recognition using reliable-inference

We present a probabilistic reliable-inference framework to address the issue of rapid detection of human actions with low error rates. The approach determines the shortest video exposures needed for low-latency recognition by sequentially evaluating a series of posterior ratios for different action classes. If a subsequence is deemed unreliable or confusing, additional video frames are incorporated until a reliable classification to a particular action can be made. Results are presented for multiple action classes and subsequence durations, and are compared to alternative probabilistic approaches. The framework provides a means to accurately classify human actions using the least amount of temporal information.

[1]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[4]  G Johansson,et al.  Spatio-temporal differentiation and integration in visual motion perception , 1976, Psychological research.

[5]  Fang Liu,et al.  Finding periodicity in space and time , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[6]  C. W. Therrien,et al.  Decision, Estimation and Classification: An Introduction to Pattern Recognition and Related Topics , 1989 .

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  Dariu Gavrila,et al.  Pedestrian Detection from a Moving Vehicle , 2000, ECCV.

[9]  Manuele Bicego,et al.  Investigating hidden Markov models' capabilities in 2D shape classification , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Venugopal V. Veeravalli,et al.  A sequential procedure for multihypothesis testing , 1994, IEEE Trans. Inf. Theory.

[12]  John H. L. Hansen,et al.  Likelihood decision boundary estimation between HMM pairs in speech recognition , 1998, IEEE Trans. Speech Audio Process..

[13]  R. Nelson,et al.  Low level recognition of human motion (or how to get your man without finding his body parts) , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[14]  Hironobu Fujiyoshi,et al.  Moving target classification and tracking from real-time video , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[15]  Tomaso A. Poggio,et al.  Pedestrian detection using wavelet templates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Jessica K. Hodgins,et al.  Interactive control of avatars animated with human motion data , 2002, SIGGRAPH.

[17]  James W. Davis,et al.  A reliable-inference framework for recognition of human actions , 2003, Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, 2003..

[18]  Tieniu Tan,et al.  Recent developments in human motion analysis , 2003, Pattern Recognit..

[19]  David G. Stork,et al.  Pattern Classification , 1973 .

[20]  James W. Davis Sequential Reliable-Inference for Rapid Detection of Human Actions , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[21]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[22]  Alain Biem,et al.  A model selection criterion for classification: application to HMM topology optimization , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[23]  Yoram Singer,et al.  The Hierarchical Hidden Markov Model: Analysis and Applications , 1998, Machine Learning.

[24]  Alex Pentland,et al.  Coupled hidden Markov models for complex action recognition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[26]  Edward H. Adelson,et al.  Analyzing and recognizing walking figures in XYT , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Ruediger Dillmann,et al.  Human Motion Analysis: A Review , 1997 .

[28]  Monique Thonnat,et al.  Recurrent Bayesian Network for the Recognition of Human Behaviors from Video , 2003, ICVS.

[29]  Dariu Gavrila,et al.  The Visual Analysis of Human Movement: A Survey , 1999, Comput. Vis. Image Underst..

[30]  Qiang Ji,et al.  Multi-View Face Tracking with Factorial and Switching HMM , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[31]  J. Andel Sequential Analysis , 2022, The SAGE Encyclopedia of Research Design.

[32]  James W. Davis,et al.  Visual Categorization of Children and Adult Walking Styles , 2001, AVBPA.

[33]  J. Little,et al.  Recognizing People by Their Gait: The Shape of Motion , 1998 .

[34]  Allan D. Jepson,et al.  What Makes a Good Feature , 1992 .

[35]  Rómer Rosales,et al.  Inferring body pose without tracking body parts , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[36]  Jake K. Aggarwal,et al.  Human Motion Analysis: A Review , 1999, Comput. Vis. Image Underst..

[37]  Yanxi Liu,et al.  Gait Sequence Analysis Using Frieze Patterns , 2002, ECCV.

[38]  I-Cheng Chang,et al.  The model-based human body motion analysis system , 2000, Image Vis. Comput..

[39]  Matthew Brand,et al.  Discovery and Segmentation of Activities in Video , 2000, IEEE Trans. Pattern Anal. Mach. Intell..