Learning short multivariate time series models through evolutionary and sparse matrix computation

Multivariate time series (MTS) data are widely available in different fields including medicine, finance, bioinformatics, science and engineering. Modelling MTS data accurately is important for many decision making activities. One area that has been largely overlooked so far is the particular type of time series where the data set consists of a large number of variables but with a small number of observations. In this paper we describe the development of a novel computational method based on Natural Computation and sparse matrices that bypasses the size restrictions of traditional statistical MTS methods, makes no distribution assumptions, and also locates the associated parameters. Extensive results are presented, where the proposed method is compared with both traditional statistical and heuristic search techniques and evaluated on a number of criteria. The results have implications for a wide range of applications involving the learning of short MTS models.

[1]  Fred L. Collopy,et al.  Error Measures for Generalizing About Forecasting Methods: Empirical Comparisons , 1992 .

[2]  Z. Zlatev Computational Methods for General Sparse Matrices , 1991 .

[3]  David J. Hand,et al.  Deconstructing Statistical Questions , 1994 .

[4]  P. Whittle,et al.  Prediction and Regulation. , 1965 .

[5]  Xiaohui Liu,et al.  Consensus clustering and functional interpretation of gene-expression data , 2004, Genome Biology.

[6]  G. W. Snedecor Statistical Methods , 1964 .

[7]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[8]  Xiaohui Liu,et al.  A spatio-temporal Bayesian network classifier for understanding visual field deterioration , 2005, Artif. Intell. Medicine.

[9]  Amrita Ghatak Vector autoregression modelling and forecasting growth of South Korea , 1998 .

[10]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[11]  H. Akaike A new look at the statistical model identification , 1974 .

[12]  Yuval Shahar,et al.  A Framework for Knowledge-Based Temporal Abstraction , 1997, Artif. Intell..

[13]  Martin Casdagli,et al.  Nonlinear Modeling And Forecasting , 1992 .

[14]  Samir W. Mahfoud Niching methods for genetic algorithms , 1996 .

[15]  James E. Baker,et al.  Adaptive Selection Methods for Genetic Algorithms , 1985, International Conference on Genetic Algorithms.

[16]  P. Pfeifer,et al.  A Three-Stage Iterative Procedure for Space-Time Modeling , 1980 .

[17]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[18]  Peter Bauer,et al.  An Extension of the MOSUM Technique for Quality Control , 1980 .

[19]  G. W. Stewart,et al.  Matrix Algorithms: Volume 1, Basic Decompositions , 1998 .

[20]  Xiaohui Liu,et al.  Predicting glaucomatous visual field deterioration through short multivariate time series modelling , 2002, Artif. Intell. Medicine.

[21]  Stephen Swift,et al.  Modelling and forecasting of glaucomatous visual fields using genetic algorithms , 1999 .

[22]  Hamparsum Bozdogan,et al.  Subset selection in vector autoregressive models using the genetic algorithm with informational complexity as the fitness function , 1998 .

[23]  Jeff Harrison,et al.  Applied Bayesian Forecasting and Time Series Analysis , 1994 .

[24]  Paul R. Cohen,et al.  Efficient Mining of Statistical Dependencies , 1999, IJCAI.

[25]  Ron Meir,et al.  Nonparametric Time Series Prediction Through Adaptive Model Selection , 2000, Machine Learning.

[26]  Allan Tucker,et al.  Extending Evolutionary Programming methods to the learning of dynamic Bayesian networks , 1999 .

[27]  Douglas G. Altman,et al.  Practical statistics for medical research , 1990 .

[28]  Gilbert Syswerda,et al.  Uniform Crossover in Genetic Algorithms , 1989, ICGA.

[29]  Xiaohui Liu,et al.  Evolutionary Computation to Search for Strongly Correlated Variables in High-Dimensional Time-Series , 1999, IDA.

[30]  Mohammed Waleed Kadous,et al.  Learning Comprehensible Descriptions of Multivariate Time Series , 1999, ICML.

[31]  P. Pfeifer,et al.  Identification and Interpretation of First Order Space-Time ARMA Models , 1980 .

[32]  C. Chatfield Model uncertainty, data mining and statistical inference , 1995 .

[33]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1992, Artificial Intelligence.

[34]  Karen Holden,et al.  Vector auto regression modeling and forecasting , 1995 .

[35]  David E. Goldberg,et al.  Real-coded Genetic Algorithms, Virtual Alphabets, and Blocking , 1991, Complex Syst..

[36]  K. Dejong,et al.  An analysis of the behavior of a class of genetic adaptive systems , 1975 .

[37]  E. Winzeler,et al.  Genomics, gene expression and DNA arrays , 2000, Nature.

[38]  Andreas S. Weigend,et al.  Time Series Prediction: Forecasting the Future and Understanding the Past , 1994 .

[39]  Lawrence Davis,et al.  Adapting Operator Probabilities in Genetic Algorithms , 1989, ICGA.

[40]  Kenneth Alan De Jong,et al.  An analysis of the behavior of a class of genetic adaptive systems. , 1975 .

[41]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .