A Gaussian-Von Mises Hidden Markov Model for Clustering Multivariate Linear-Circular Data

A multivariate hidden Markov model is proposed for clustering mixed linear and circular time-series data with missing values. The model combines von Mises and normal densities to describe the distribution of the data under each latent regime, with parameters that depend on the state of an unobserved Markov chain. Parameters are estimated with an EM algorithm that treats the states of the latent chain and the missing values as two distinct sources of incomplete information. The model is applied to multivariate marine data to identify sea regimes.
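
To make the structure of such a model concrete, the following is a minimal sketch, not the estimation procedure of the paper. It assumes one linear and one circular variable per time point, conditional independence of the two given the latent state, and values missing at random, so that a missing component simply drops out of the state-dependent density. The function names (`log_emission`, `forward_loglik`) and all parameter names are hypothetical. The sketch only evaluates the log-likelihood with the forward recursion, which is the quantity an EM algorithm of this kind would increase; it does not perform the M-step.

```python
import numpy as np


def log_emission(y_lin, y_circ, mean, sd, mu, kappa):
    """Log-density of one observation under each of K states.

    NaN marks a missing value; a missing component contributes 0 to the
    log-density, i.e. it is integrated out (assumed missing at random).
    """
    logp = np.zeros(len(mean))
    if not np.isnan(y_lin):
        # Normal log-density for the linear variable
        logp += -0.5 * np.log(2 * np.pi * sd**2) - 0.5 * ((y_lin - mean) / sd) ** 2
    if not np.isnan(y_circ):
        # von Mises log-density for the circular variable (angle in radians)
        logp += kappa * np.cos(y_circ - mu) - np.log(2 * np.pi * np.i0(kappa))
    return logp


def logsumexp(a, axis=None):
    """Numerically stable log(sum(exp(a))) along an axis."""
    m = np.max(a, axis=axis, keepdims=True)
    s = m + np.log(np.sum(np.exp(a - m), axis=axis, keepdims=True))
    return np.squeeze(s, axis=axis)


def forward_loglik(Y, pi0, P, mean, sd, mu, kappa):
    """Log-likelihood of a (T, 2) series [linear, circular] via the forward recursion.

    pi0 is the initial state distribution, P the K x K transition matrix.
    """
    log_P = np.log(P)
    log_alpha = np.log(pi0) + log_emission(Y[0, 0], Y[0, 1], mean, sd, mu, kappa)
    for t in range(1, len(Y)):
        # Propagate through the chain, then weight by the emission density
        log_alpha = logsumexp(log_alpha[:, None] + log_P, axis=0)
        log_alpha = log_alpha + log_emission(Y[t, 0], Y[t, 1], mean, sd, mu, kappa)
    return float(logsumexp(log_alpha))


# Illustrative two-state example with made-up parameter values
pi0 = np.array([0.6, 0.4])
P = np.array([[0.9, 0.1], [0.2, 0.8]])
mean, sd = np.array([1.0, 3.0]), np.array([0.5, 1.0])
mu, kappa = np.array([0.0, np.pi]), np.array([2.0, 5.0])
Y = np.array([[1.2, 0.1], [np.nan, 3.0], [2.8, np.nan]])
print(forward_loglik(Y, pi0, P, mean, sd, mu, kappa))
```

Dropping missing components from the emission term is only valid under the independence and missing-at-random assumptions stated above; a model with within-state dependence between variables would need the corresponding marginal densities instead.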
