Rare earth elements price forecasting by means of transgenic time series developed with ARIMA models

Abstract A time series can be thought of as a numerical organism with a continuous nature from a chronological point of view and something that is permanently updated. Up to this moment time series research related with their features, traits, and characteristics, is mainly focused on data mining, in order to discover hidden information or specific knowledge within the time series or their transformations. However, time series representation is crucial, as they are difficult to handle in their original structure due to their high dimensionality. In this paper, the “theory of transgenic time series” is developed, and applied to the forecasting of several rare earth oxide prices: dysprosium, europium, terbium, neodymium, and praseodymium oxides. This theory addresses, specifically, the existence of metal price cycles and the presence of anomalous phenomena that the theory allows to eliminate from the time series, improving the accuracy of the forecast. After representing the time series in a way that allows their genome to be sequenced, a restriction enzyme is defined in order to create a genetically modified time series. There was no need to develop DNA ligases as time series can be cut and pasted without further considerations. Results clearly state that transgenic time series lead to more accurate short term forecasts in cases where a consistent time series genome can be represented. Further research should address the feasibility of developing more accurate long term forecasts by adding new gene sequences based on the time series genome, in order to achieve greater confidence from investors and professional advisers in the feasibility studies developed for future mining investment projects. Finally, it has to be remarked that this theory has nothing to do with “genetic algorithms”, a metaheuristic that was inspired by the natural selection process and not by the sequencing and manipulation of the genome.

[1]  Pengjian Shang,et al.  Compositional segmentation of time series in the financial markets , 2015, Appl. Math. Comput..

[2]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[3]  Pei-Chann Chang,et al.  A Trend-Based Segmentation Method and the Support Vector Regression for Financial Time Series Forecasting , 2012 .

[4]  Alicja Krzemień,et al.  Forecasting European thermal coal spot prices , 2015, Journal of Sustainable Mining.

[5]  Walter C. Labys,et al.  The existence of metal price cycles , 1998 .

[6]  Qiang Wang,et al.  Time series analysis with multiple resolutions , 2010, Inf. Syst..

[7]  Xindong Wu,et al.  10 Challenging Problems in Data Mining Research , 2006, Int. J. Inf. Technol. Decis. Mak..

[8]  Salim Lahmiri,et al.  A variational mode decompoisition approach for analysis and forecasting of economic and financial time series , 2016, Expert Syst. Appl..

[9]  F. J. D. C. Juez,et al.  Forecasting the COMEX copper spot price by means of neural networks and ARIMA models , 2015 .

[10]  P. J. García Nieto,et al.  Prediction of work-related accidents according to working conditions using support vector machines , 2011, Appl. Math. Comput..

[11]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .

[12]  T. Liao,et al.  An adaptive genetic clustering method for exploratory mining of feature vector and time series data , 2006 .

[13]  Simon Fong,et al.  Financial time series pattern matching with extended UCR Suite and Support Vector Machine , 2016, Expert Syst. Appl..

[14]  Maciej Krawczak,et al.  An approach to dimensionality reduction in time series , 2014, Inf. Sci..

[15]  Sergio Greco,et al.  A time series representation model for accurate and fast similarity detection , 2009, Pattern Recognit..

[16]  Tzung-Pei Hong,et al.  Cluster-based genetic segmentation of time series with DWT , 2009, Pattern Recognit. Lett..

[17]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[18]  Francisco Javier de Cos Juez,et al.  Missing data imputation of questionnaires by means of genetic algorithms with different fitness functions , 2017, J. Comput. Appl. Math..

[19]  Markus Nilsson,et al.  Dimensionality reduction of fMRI time series data using locally linear embedding , 2010, Magnetic Resonance Materials in Physics, Biology and Medicine.

[20]  Rob J Hyndman,et al.  Automatic Time Series Forecasting: The forecast Package for R , 2008 .

[21]  Jui-Sheng Chou,et al.  Time series analytics using sliding window metaheuristic optimization-based machine learning system for identifying building energy consumption patterns , 2016 .

[22]  E. Sober Instrumentalism, Parsimony, and the Akaike Framework , 2002, Philosophy of Science.

[23]  María Victoria Riesgo García,et al.  Rare earth elements mining investment: It is not all about China , 2017 .

[24]  Qiang Wang,et al.  A dimensionality reduction technique for efficient time series similarity analysis , 2008, Inf. Syst..

[25]  Akihiro Sato,et al.  A Comprehensive Analysis of Time Series Segmentation on Japanese Stock Prices , 2012, 1205.0332.