On Stacking Probabilistic Temporal Models with Bidirectional Information Flow

We discuss hierarchical combinations of probabilistic models where the upper layer is crafted for predicting time-series data. The combination of models makes the naı̈ve Bayes assumption, stating that the latent variables of the models are independent given the time-indexed label variables. In this setting an additional independence assumption between time steps and mildly inconsistent results are often accepted to make inference computationally feasible. We discuss how the application of approximate inference to the practically intractable joint model instead, shifts the need for these simplifications from model design time to inference time, and the application of loopy belief propagation to the joint model realizes bidirectional communication between models during inference. A first empirical evaluation of the proposed architecture on an activity recognition task demonstrates the benefits of the layered architecture and examines the effects of bidirectional information flow.

[1]  Sebastian Thrun,et al.  Recognizing Activities with Multiple Cues , 2007, Workshop on Human Motion.

[2]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[3]  Larry S. Davis,et al.  Event Modeling and Recognition Using Markov Logic Networks , 2008, ECCV.

[4]  Günther Palm,et al.  uulmMAD - A Human Action Recognition Dataset for Ground-Truth Evaluation and Investigation of View Invariances , 2014, MPRSS.

[5]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[6]  Gang Hua,et al.  Tracking articulated body by dynamic Markov network , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7]  Kai-Uwe Kühnberger,et al.  Neural-symbolic networks for cognitive capacities , 2014, BICA 2014.

[8]  Eric Horvitz,et al.  A Comparison of HMMs and Dynamic Bayesian Networks for Recognizing Office Activities , 2005, User Modeling.

[9]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Michael Glodek,et al.  A layered architecture for probabilistic complex pattern recognition to detect user preferences , 2014, BICA 2014.

[11]  Eric Horvitz,et al.  Layered representations for learning and inferring office activity from multiple sensory channels , 2004, Comput. Vis. Image Underst..

[12]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[13]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[14]  Stuart J. Russell,et al.  First-Order Probabilistic Languages: Into the Unknown , 2007, ILP.