A Variable Order Hidden Markov Model with Dependence Jumps

Hidden Markov models (HMMs) are a popular approach for modeling sequential data, typically based on the assumption of a first- or moderate-order Markov chain. However, in many real-world scenarios the modeled data entail temporal dynamics the patterns of which change over time. In this paper, we address this problem by proposing a novel HMM formulation, treating temporal dependencies as latent variables over which inference is performed. Specifically, we introduce a hierarchical graphical model comprising two hidden layers: on the first layer, we postulate a chain of latent observation-emitting states, the temporal dependencies between which may change over time; on the second layer, we postulate a latent first-order Markov chain modeling the evolution of temporal dynamics (dependence jumps) pertaining to the first-layer latent process. As a result of this construction, our method allows for effectively modeling non-homogeneous observed data, where the patterns of the entailed temporal dynamics may change over time. We devise efficient training and inference algorithms for our model, following the expectation-maximization paradigm. We demonstrate the efficacy and usefulness of our approach considering several real-world datasets. As we show, our model allows for increased modeling and predictive performance compared to the alternative methods, while offering a good trade-off between the resulting increases in predictive performance and computational complexity.

[1]  Sotirios Chatzis,et al.  Gaussian Process-Mixture Conditional Heteroscedasticity , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Christopher D. Carothers,et al.  VOGUE: A variable order hidden Markov model with duration based on frequent sequence mining , 2010, TKDD.

[3]  R. Cont Empirical properties of asset returns: stylized facts and statistical issues , 2001 .

[4]  P. Manimaran,et al.  Modelling Financial Time Series , 2006 .

[5]  Abdelaziz Kriouile,et al.  Automatic word recognition based on second-order hidden Markov models , 1994, IEEE Trans. Speech Audio Process..

[6]  George Tauchen,et al.  Volatility Jumps , 2008 .

[7]  T. Bollerslev,et al.  Generalized autoregressive conditional heteroskedasticity , 1986 .

[8]  Jean-François Mari,et al.  A second-order HMM for high performance word and phoneme-based continuous speech recognition , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  Haikady N. Nagaraja,et al.  Inference in Hidden Markov Models , 2006, Technometrics.

[10]  Ben M. Herbst,et al.  Estimating the pen trajectories of static signatures using hidden Markov models , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Michael I. Jordan,et al.  Variational inference for Dirichlet process mixtures , 2006 .

[12]  Ran El-Yaniv,et al.  On Prediction Using Variable Order Markov Models , 2004, J. Artif. Intell. Res..

[13]  George Papadourakis,et al.  A Nonstationary Hidden Markov Model with Approximately Infinitely-Long Time-Dependencies , 2014, ISVC.

[14]  Yu Wei,et al.  Forecasting crude oil market volatility: Further evidence using GARCH-class models , 2010 .

[15]  Miguel Lázaro-Gredilla,et al.  Variational Heteroscedastic Gaussian Process Regression , 2011, ICML.

[16]  C. Brownlees,et al.  A Practical Guide to Volatility Forecasting through Calm and Storm , 2011 .

[17]  Stefan Trück,et al.  Modelling and forecasting volatility in the gold market , 2012 .

[18]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[19]  Andreas Karathanasopoulos,et al.  Modeling and Trading the EUR/USD Exchange Rate Using Machine Learning Techniques , 2012 .

[20]  T. Rydén,et al.  Stylized Facts of Daily Return Series and the Hidden Markov Model , 1998 .

[21]  Chris Brooks,et al.  Benchmarks and the accuracy of GARCH model estimation , 2001 .

[22]  Trevor Darrell,et al.  Hidden Conditional Random Fields , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Jan Bulla,et al.  Stylized facts of financial time series and hidden semi-Markov models , 2006, Comput. Stat. Data Anal..

[24]  P. Deb Finite Mixture Models , 2008 .

[25]  Bingbing Ni,et al.  RGBD-HuDaAct: A color-depth video database for human daily activity recognition , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[26]  C. Granger,et al.  A long memory property of stock market returns and a new model , 1993 .

[27]  Sotirios Chatzis,et al.  Margin-maximizing classification of sequential data with infinitely-long temporal dependencies , 2013, Expert Syst. Appl..

[28]  Ravi Kumar,et al.  Are web users really Markovian? , 2012, WWW.

[29]  James W. Davis,et al.  The representation and recognition of human movement using temporal templates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[30]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[31]  C. Granger,et al.  Nonstationarities in Stock Returns , 2005, Review of Economics and Statistics.

[32]  Andrew Gordon Wilson,et al.  Copula Processes , 2010, NIPS.

[33]  Richard Washington,et al.  Learning to Automatically Detect Features for Mobile Robots Using Second-Order Hidden Markov Models , 2003, IJCAI 2003.

[34]  Sotirios Chatzis,et al.  Robust Sequential Data Modeling Using an Outlier Tolerant Hidden Markov Model , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[36]  Charles G. Renfro,et al.  Benchmarks and software standards: A case study of GARCH procedures , 1998 .

[37]  Bjørn Eraker Do Stock Prices and Volatility Jump? Reconciling Evidence from Spot and Option Prices , 2004 .

[38]  Nicholas G. Polson,et al.  The Impact of Jumps in Volatility and Returns , 2000 .

[39]  Sotirios Chatzis,et al.  A novel corporate credit rating system based on Student's-t hidden Markov models , 2016, Expert Syst. Appl..

[40]  P. Bühlmann,et al.  Variable Length Markov Chains: Methodology, Computing, and Software , 2004 .

[41]  J. Lothian Some new stylized facts of floating exchange rates , 1998 .

[42]  Shunzheng Yu,et al.  Hidden semi-Markov models , 2010, Artif. Intell..

[43]  Sotirios Chatzis,et al.  A conditional random field-based model for joint sequence segmentation and classification , 2013, Pattern Recognit..

[44]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[45]  Christos Dimitrakakis,et al.  Bayesian variable order Markov models , 2010, AISTATS.

[46]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[47]  Johan A. du Preez,et al.  Efficient backward decoding of high-order hidden Markov models , 2010, Pattern Recognit..