A latent information function to extend domain attributes to improve the accuracy of small-data-set forecasting

In the current highly competitive manufacturing environment, it is important to have effective and efficient control of manufacturing systems to obtain and maintain competitive advantages. However, developing appropriate forecasting models for such systems can be challenging in their early stages, as the sample sizes are usually very small, and thus there is limited data available for analysis. The technique of virtual sample generation is one way to address this issue, but this method is usually not directly applied to time series data. This research thus develops a Latent Information function to analyze data features and extract hidden information, in order to learn from small data sets considering timing factors. The experimental results obtained using the Synthetic Control Chart Time Series and aluminum price datasets show that the proposed method can significantly improve forecasting accuracy, and thus is considered an appropriate procedure to forecast manufacturing outputs based on small samples.

[1]  Der-Chiang Li,et al.  Employing box-and-whisker plots for learning more knowledge in TFT-LCD pilot runs , 2012 .

[2]  Yaser S. Abu-Mostafa,et al.  Learning from hints in neural networks , 1990, J. Complex..

[3]  Fengming M. Chang,et al.  Using data continualization and expansion to improve small data set learning accuracy for early flexible manufacturing system (FMS) scheduling , 2006 .

[4]  Chun-Wu Yeh,et al.  An improved grey-based approach for early manufacturing data forecasting , 2009, Comput. Ind. Eng..

[5]  Huang Chong-fu,et al.  Principle of information diffusion , 1997 .

[6]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[7]  Der-Chiang Li,et al.  Forecasting short-term electricity consumption using the adaptive grey-based approach—An Asian case , 2012 .

[8]  Der-Chiang Li,et al.  Utilization of virtual samples to facilitate cancer identification for DNA microarray data in the early stages of an investigation , 2009, Inf. Sci..

[9]  Long-Sheng Chen,et al.  Using Functional Virtual Population as assistance to learn scheduling knowledge in dynamic manufacturing environments , 2003 .

[10]  Shiro Masuda,et al.  A New Reliability Prediction Model in Manufacturing Systems , 2010, IEEE Transactions on Reliability.

[11]  Claudio Moraga,et al.  A diffusion-neural-network for learning from small samples , 2004, Int. J. Approx. Reason..

[12]  Leslie S. Smith,et al.  A neural network-based framework for the reconstruction of incomplete data sets , 2010, Neurocomputing.

[13]  W. G. Marchal,et al.  Basic Statistics for Business and Economics , 1994 .

[14]  V. Cristina Ivanescu,et al.  Bootstrapping to solve the limited data problem in production control: an application in batch process industries , 2006, J. Oper. Res. Soc..

[15]  Tomaso Poggio,et al.  Incorporating prior information in machine learning by creating virtual examples , 1998, Proc. IEEE.

[16]  Sungzoon Cho,et al.  Virtual sample generation using a population of networks , 2004, Neural Processing Letters.

[17]  Tommy W. S. Chow,et al.  Effective feature selection scheme using mutual information , 2005, Neurocomputing.

[18]  D. Berry,et al.  Statistics: Theory and Methods , 1990 .

[19]  Der-Chiang Li,et al.  Utilize bootstrap in small data set learning for pilot run modeling of manufacturing systems , 2008, Expert Syst. Appl..

[20]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[21]  J. Scott Armstrong,et al.  Beyond Accuracy: Comparison of Criteria Used to Select Forecasting Methods , 1995 .

[22]  Ian Witten,et al.  Data Mining , 2000 .

[23]  Der-Chiang Li,et al.  Using virtual sample generation to build up management knowledge in the early manufacturing stages , 2006, Eur. J. Oper. Res..

[24]  Douglas J. Dalrymple Sales forecasting practices: Results from a United States survey , 1987 .

[25]  Sifeng Liu,et al.  Grey Information: Theory and Practical Applications (Advanced Information and Knowledge Processing) , 2005 .

[26]  Sam Efromovich,et al.  Adaptive nonparametric density estimation with missing observations , 2013 .

[27]  Yi Lin,et al.  Grey Information - Theory and Practical Applications , 2005, Advanced Information and Knowledge Processing.