Simultaneous positive sequential vectors modeling and unsupervised feature selection via continuous hidden Markov models

Abstract Since positive data vectors are often naturally generated in various real-life applications, positive vectors modeling has become an important research topic. In this article, we tackle the problem of modeling positive sequential vectors through continuous hidden Markov models (HMMs). Motivated by several recent studies in which the generalized inverted Dirichlet (GID) distribution has provided better performance than the Gaussian distribution for modeling positive data, instead of adopting Gaussian mixture models (GMM) as the emission density for conventional continuous HMMs, we theoretically propose a novel HMM by considering the mixture of GID distributions as the emission density. Moreover, to cope with high-dimensional data which may contain irrelevant features, an unsupervised localized feature selection method is incorporated with our model, which results in a unified framework that can simultaneously perform positive sequential data modeling and feature selection. To learn the proposed model, we develop a convergence-guaranteed algorithm based on variational Bayes. The advantages of our model are demonstrated through both simulated data sets and a real-life application about human action recognition.

[1]  Nizar Bouguila,et al.  Unsupervised Anomaly Intrusion Detection via Localized Bayesian Feature Selection , 2011, 2011 IEEE 11th International Conference on Data Mining.

[2]  Geoffrey J. McLachlan,et al.  The EM Algorithm , 2012 .

[3]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[4]  Nizar Bouguila,et al.  Robust simultaneous positive data clustering and unsupervised feature selection using generalized inverted Dirichlet mixture models , 2014, Knowl. Based Syst..

[5]  Shichao Zhang,et al.  Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Nizar Bouguila,et al.  A novel approach for modeling positive vectors with inverted Dirichlet-based hidden Markov models , 2020, Knowl. Based Syst..

[7]  Nizar Bouguila,et al.  Axially Symmetric Data Clustering Through Dirichlet Process Mixture Models of Watson Distributions , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Ezzeddine Zagrouba,et al.  Abnormal behavior recognition for intelligent video surveillance systems: A review , 2018, Expert Syst. Appl..

[9]  Aristidis Likas,et al.  Bayesian feature and model selection for Gaussian mixture models , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Stephen C. Adams,et al.  A survey of feature selection methods for Gaussian mixture models and hidden Markov models , 2017, Artificial Intelligence Review.

[11]  Ioannis D. Schizas,et al.  On unsupervised simultaneous kernel learning and data clustering , 2020, Pattern Recognit..

[12]  Stéphane Robin,et al.  Hidden Markov Models with mixtures as emission distributions , 2012, Statistics and Computing.

[13]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[14]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[15]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[16]  Wentao Fan,et al.  Positive Sequential Data Modeling Using Continuous Hidden Markov Models Based on Inverted Dirichlet Mixtures , 2019, IEEE Access.

[17]  Lawrence Carin,et al.  Variational Bayes for continuous hidden Markov models and its application to active learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Jing Hua,et al.  Simultaneous Localized Feature Selection and Model Detection for Gaussian Mixtures , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Yuhui Zheng,et al.  Student’s t-Hidden Markov Model for Unsupervised Learning Using Localized Feature Selection , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Robert P. W. Duin,et al.  STATISTICAL PATTERN RECOGNITION , 2005 .

[21]  Sander Dieleman,et al.  Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video , 2015, International Journal of Computer Vision.

[22]  Anil K. Jain,et al.  Simultaneous feature selection and clustering using mixture models , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Jesús Ariel Carrasco-Ochoa,et al.  A new Unsupervised Spectral Feature Selection Method for mixed data: A filter approach , 2017, Pattern Recognit..

[24]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[25]  Anil K. Jain,et al.  Unsupervised Learning of Finite Mixture Models , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Nizar Bouguila,et al.  Online Learning of Hierarchical Pitman–Yor Process Mixture of Generalized Dirichlet Distributions With Feature Selection , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[27]  Henry Leung,et al.  Simultaneous Feature and Model Selection for Continuous Hidden Markov Models , 2012, IEEE Signal Processing Letters.

[28]  Xiaofeng Zhu,et al.  One-Step Multi-View Spectral Clustering , 2019, IEEE Transactions on Knowledge and Data Engineering.

[29]  Nizar Bouguila,et al.  Nonparametric Hierarchical Bayesian Models for Positive Data Clustering Based on Inverted Dirichlet-Based Distributions , 2019, IEEE Access.

[30]  Nizar Bouguila,et al.  Bayesian learning of inverted Dirichlet mixtures for SVM kernels generation , 2013, Neural Computing and Applications.

[31]  Sotirios Chatzis,et al.  A variational Bayesian methodology for hidden Markov models utilizing Student's-t mixtures , 2011, Pattern Recognit..

[32]  Qiang Fu,et al.  Ensemble clustering based on evidence extracted from the co-association matrix , 2019, Pattern Recognit..

[33]  Naiming Qi,et al.  Density peak clustering based on relative density relationship , 2020, Pattern Recognit..

[34]  Volodymyr Melnykov,et al.  On finite mixture modeling and model-based clustering of directed weighted multilayer networks , 2020, Pattern Recognit..

[35]  Sotirios Chatzis,et al.  Hidden Markov Models with Nonelliptically Contoured State Densities , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[38]  Nizar Bouguila,et al.  Variational Bayesian inference for infinite generalized inverted Dirichlet mixtures with feature selection and its application to clustering , 2015, Applied Intelligence.

[39]  Christopher J. R. Illingworth,et al.  On the effective depth of viral sequence data , 2017, Virus evolution.

[40]  Wanqing Li,et al.  Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[41]  Kewei Cheng,et al.  Feature Selection , 2016, ACM Comput. Surv..

[42]  Xiaofeng Zhu,et al.  Local and Global Structure Preservation for Robust Unsupervised Spectral Feature Selection , 2018, IEEE Transactions on Knowledge and Data Engineering.

[43]  Nizar Bouguila,et al.  Modeling and Clustering Positive Vectors via Nonparametric Mixture Models of Liouville Distributions , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[44]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[45]  Stephen C. Adams,et al.  Feature Selection for Hidden Markov Models and Hidden Semi-Markov Models , 2016, IEEE Access.

[46]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[47]  Nizar Bouguila,et al.  Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection , 2013, Pattern Recognit..

[48]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[49]  Ying Wu,et al.  Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Nizar Bouguila,et al.  Proportional data modeling with hidden Markov models based on generalized Dirichlet and Beta-Liouville mixtures applied to anomaly detection in public areas , 2016, Pattern Recognit..

[51]  Hong Shen,et al.  User clustering in a dynamic social network topic model for short text streams , 2017, Inf. Sci..

[52]  Nizar Bouguila,et al.  Variational Bayesian Learning of Generalized Dirichlet-Based Hidden Markov Models Applied to Unusual Events Detection , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[53]  Nizar Bouguila,et al.  Positive vectors clustering using inverted Dirichlet finite mixture models , 2012, Expert Syst. Appl..