Probabilistic Parsing in Action Recognition

This report addresses the problem of using probabilistic formal languages to describe and understand actions with explicit structure. The paper explores a probabilistic mechanisms of parsing the uncertain input string aided by a stochastic context-free grammar. This method, originating in speech recognition, allows for combination of a statistical recognition approach with a syntactical one in a unified syntactic-semantic framework for action recognition. The basic approach is to design the recognition system in a two-level architecture. The first level, a set of independently trained component event detectors, produces the likelihoods of each component model. The outputs of these detectors provide the input stream for a stochastic context-free parsing mechanism. Any decisions about supposed structure of the input are deferred to the parser, which attempts to combine the maximum amount of the candidate events into a most likely sequence according to a given Stochastic ContextFree Grammar (SCFG). The grammar and parser enforce longer range temporal constraints, disambiguate or correct uncertain or mis-labeled low level detections, and allow the inclusion of a priori knowledge about the structure of temporal events in a given domain. The method takes into consideration the continuous character of the input and performs “structural rectification” of it in order to account for misalignments and ungrammatical symbols in the stream. The presented technique of such a rectification uses the structure probability maximization to drive the segmentation.

[1]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[2]  Andreas Stolcke,et al.  An Efficient Probabilistic Context-Free Parsing Algorithm that Computes Prefix Probabilities , 1994, CL.

[3]  Frederick Jelinek,et al.  Basic Methods of Probabilistic Context Free Grammars , 1992 .

[4]  Alberto Sanfeliu,et al.  Automatic Recognition of Bidimensional Models Learned by Grammatical Inference in Outdoor Scenes , 1996, SSPR.

[5]  Matthew Brand,et al.  Understanding manipulation in video , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[6]  Jonathan D. Courtney Automatic video indexing via object motion analysis , 1997, Pattern Recognit..

[7]  Michael L. Baird,et al.  A paradigm for semantic picture recognition , 1973, Pattern Recognit..

[8]  Andreas Stolcke,et al.  Bayesian learning of probabilistic language models , 1994 .

[9]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[10]  Alex Pentland,et al.  Space-time gestures , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[11]  B. John Oommen,et al.  Optimal and Information Theoretic Syntactic Pattern Recognition for Traditional Errors , 1996, SSPR.

[12]  Alfred V. Aho,et al.  A Minimum Distance Error-Correcting Parser for Context-Free Languages , 1972, SIAM J. Comput..

[13]  Jonathan H. Connell,et al.  A colony architecture for an artificial creature , 1989 .

[14]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[15]  King-Sun Fu,et al.  Attributed Grammar-A Tool for Combining Syntactic and Statistical Approaches to Pattern Recognition , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[16]  Jay Earley,et al.  An efficient context-free parsing algorithm , 1970, Commun. ACM.

[17]  Alberto Sanfeliu,et al.  Efficient Recognition of a Class of Context-Sensitive Languages Described by Augmented Regular Expressions , 1996, SSPR.

[18]  H. Bunke,et al.  PARSING MULTIVALUED STRINGS AND ITS APPLICATION TO IMAGE AND WAVEFORM RECOGNITION , 1990 .

[19]  King-Sun Fu,et al.  A Step Towards Unification of Syntactic and Statistical Pattern Recognition , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.