Big Data Time Series Analysis: An Approach with the Size of the Time Series

Data analysis has importance in real-time data. It is collective representation of all the facts of population or variables under interest. As velocity of data generating process (DGP) is third very important identity in Big Data prospective, so this cannot be managed through routine statistical tools of analysis. Big data time series recording is in principle on packet of information in every unit interval of time and increase size of the series. For that reason modelling can be performed after adding the respective information. This gives us challenges to take a decision regarding size of the series for understanding the data generating process. Present paper is proposing an alternative approach of time series analysis in which data generating process is studies based on size of time series. Here, we discuss the convergence of model in term of order. An empirical analysis is carried out on the time series of sectoral indices of National Stock Exchange (India) to show the convergence of the model. This convergence is also advocating the fact that if a series is fitted after certain size, we get a constant model. This supports that true model forecasted rightly the future value and also data is always convergent is nature. Study reveals that the realization of big data time series may be in considered in respect to size of the series, so identified DGP can be best representative intervals for respective study period. Keywords— Time series; Big Data; Velocity of Big Data; ARIMA model; GARCH model.

[1]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[2]  D. Piccolo A DISTANCE MEASURE FOR CLASSIFYING ARIMA MODELS , 1990 .

[3]  S. Durlauf,et al.  Interpreting Tests of the Convergence Hypothesis , 1994 .

[4]  Seref Sagiroglu,et al.  Big data: A review , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[5]  Martin Hilbert,et al.  The World’s Technological Capacity to Store, Communicate, and Compute Information , 2011, Science.

[6]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[7]  R. Engle Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation , 1982 .

[8]  Elizabeth Bradley Analysis of time series , 2003 .

[9]  Ronald J. Brachman,et al.  Brief Application Description; Visual Data Mining: Recognizing Telephone Calling Fraud , 2004, Data Mining and Knowledge Discovery.

[10]  L. Ong,et al.  Asia's Stock Markets: Are There Crouching Tigers and Hidden Dragons? , 2014, SSRN Electronic Journal.

[11]  Han Liu,et al.  Challenges of Big Data Analysis. , 2013, National science review.

[12]  Guy Holmes,et al.  The World’s Technological Capacity to Store, Compute and Communicate Information that has Already Been Created and Does not Need to be Done Again – 2012 , 2012 .

[13]  Robert H. Shumway,et al.  Time series analysis and its applications : with R examples , 2017 .

[14]  Murat Kulahci,et al.  Introduction to Time Series Analysis and Forecasting , 2008 .

[15]  George E. P. Box,et al.  Time Series Analysis: Forecasting and Control , 1977 .

[16]  P. A. Blight The Analysis of Time Series: An Introduction , 1991 .

[17]  Melnned M. Kantardzic Big Data Analytics , 2013, Lecture Notes in Computer Science.

[18]  Ruey S. Tsay,et al.  Analysis of Financial Time Series , 2005 .