Data analysis and statistical estimation for time series: improving presentation and interpretation

In our days in the social sciences, time series (or longitudinal data) are ubiquitous, used in any analytic process, with the main scope to estimate or predict the future. The main issues are represented by the large variety of time series (sometime with an unknown size), the identification of outliers, and by the impossibility to estimate the error or numerical stability of statistical analysis. This paper proposed a matrix-based model for predictive analytics and, using a statistical estimation for different finite samples extracted from time series, estimated the residual and factorial variance for a group of samples. The proposed methods are applied on different samples of social data: number of births in a community, number of inhabitants, natural mobility of population, life expectancy (by sex and area), life expectancy at birth, fertility rate, infant mortality rate.

[1]  A. Pickard,et al.  A median model for predicting United States population-based EQ-5D health state preferences. , 2010, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[2]  Stefano Marrone,et al.  Performability Modeling of Exceptions-Aware Systems in Multiformalism Tools , 2011, ASMTA.

[3]  João Saboia Autoregressive Integrated Moving Average (ARIMA) Models for Birth Forecasting , 1977 .

[4]  Rupert G. Miller Beyond ANOVA, basics of applied statistics , 1987 .

[5]  Ian T. Foster,et al.  Homeostatic and tendency-based CPU load predictions , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[6]  Lennart Ljung,et al.  Analysis of a general recursive prediction error identification algorithm , 1981, Autom..

[7]  George Mastorakis,et al.  Resource usage prediction for optimal and balanced provision of multimedia services , 2014, 2014 IEEE 19th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD).

[8]  H. Akaike Fitting autoregressive models for prediction , 1969 .

[9]  Yusuf Gurefe,et al.  Multiplicative Adams Bashforth–Moulton methods , 2011, Numerical Algorithms.

[10]  N. Leonenko,et al.  Hypothesis testing for Fisher–Snedecor diffusion , 2012 .

[11]  Anabela Simões,et al.  Prediction in evolutionary algorithms for dynamic environments , 2014, Soft Comput..

[12]  Shuang Qin,et al.  A parametric bootstrap approach for two-way ANOVA in presence of possible interactions with unequal variances , 2013, J. Multivar. Anal..

[13]  George Mastorakis,et al.  Predicting and quantifying the technical debt in cloud software engineering , 2014, 2014 IEEE 19th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD).

[14]  Nik Bessis,et al.  Defining Minimum Requirements of Inter-collaborated Nodes by Measuring the Weight of Node Interactions , 2010, 2010 International Conference on Complex, Intelligent and Software Intensive Systems.

[15]  Nik Bessis,et al.  Towards Inter-cloud Simulation Performance Analysis: Exploring Service-Oriented Benchmarks of Clouds in SimIC , 2013, 2013 27th International Conference on Advanced Information Networking and Applications Workshops.

[16]  Mauro Iacono,et al.  Adaptive monitoring of marine disasters with intelligent mobile sensor networks , 2010, 2010 IEEE Workshop on Environmental Energy and Structural Monitoring Systems.

[17]  Sifeng Liu,et al.  Advances in grey systems research , 2010 .

[18]  Richard Wolski,et al.  Multivariate Resource Performance Forecasting in the Network Weather Service , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[19]  Paul H. C. Eilers,et al.  Efficient two-dimensional smoothing with PP-spline ANOVA mixed models and nested bases , 2013, Comput. Stat. Data Anal..

[20]  J. H. Wilkinson,et al.  AN ESTIMATE FOR THE CONDITION NUMBER OF A MATRIX , 1979 .

[21]  Ciprian Dobre,et al.  Resource usage prediction algorithms for optimal selection of multimedia content delivery methods , 2015, 2015 IEEE International Conference on Communications (ICC).

[22]  Richard A. Berk,et al.  Applied Time Series Analysis for the Social Sciences , 1980 .

[23]  Frances Y. Kuo,et al.  The smoothing effect of the ANOVA decomposition , 2010, J. Complex..

[24]  Weida Zhou,et al.  Time series prediction using sparse regression ensemble based on $$\ell _2$$ℓ2–$$\ell _1$$ℓ1 problem , 2015, Soft Comput..