Modeling high dimensional frequency spectral data based on virtual sample generation technique

In many situations, such as medical records of rare diseases, early stages of flexible manufacturing system and continuous industrial process, only small training samples can be obtained to construct prediction model. When modelling with high dimensional spectral data, it is very much difficulty to construct efficient and effective prediction model with such a small sample. This research proposes a new virtual sample generation (VSG) approach to model mechanical vibration and acoustic spectra. At first, prior knowledge about the actual training samples is used to produce input of virtual sample. Then, partial least squares (PLS) is used to extract spectral features for reducing features dimension. Thirdly, genetic algorithm (GA) and backup propagation neural networks (BPNN) based feasibility-based programming (FBP) model is used to generate virtual sample's output. At last, shell vibration and acoustic spectral data of a laboratory-scale ball mill are used to verify performance of the proposed method.

[1]  L. Cooper,et al.  When Networks Disagree: Ensemble Methods for Hybrid Neural Networks , 1992 .

[2]  Long-Sheng Chen,et al.  Using Functional Virtual Population as assistance to learn scheduling knowledge in dynamic manufacturing environments , 2003 .

[3]  Claudio Moraga,et al.  A diffusion-neural-network for learning from small samples , 2004, Int. J. Approx. Reason..

[4]  Yaser S. Abu-Mostafa,et al.  Hints , 2018, Neural Computation.

[5]  Der-Chiang Li,et al.  Extending Attribute Information for Small Data Set Classification , 2012, IEEE Transactions on Knowledge and Data Engineering.

[6]  Der-Chiang Li,et al.  A forecasting model for small non-equigap data sets considering data weights and occurrence possibilities , 2014, Comput. Ind. Eng..

[7]  Der-Chiang Li,et al.  Utilization of virtual samples to facilitate cancer identification for DNA microarray data in the early stages of an investigation , 2009, Inf. Sci..

[8]  Yoshihiko Hamamoto,et al.  Improvement of the Parzen classifier in small training sample size situations , 2001, Intell. Data Anal..

[9]  Christopher R. Houck,et al.  A Genetic Algorithm for Function Optimization: A Matlab Implementation , 2001 .

[10]  Guozhong An,et al.  The Effects of Adding Noise During Backpropagation Training on a Generalization Performance , 1996, Neural Computation.

[11]  Sungzoon Cho,et al.  Virtual sample generation using a population of networks , 2004, Neural Processing Letters.

[12]  Der-Chiang Li,et al.  The Generalized-Trend-Diffusion modeling algorithm for small data sets in the early stages of manufacturing systems , 2010, Eur. J. Oper. Res..

[13]  Der-Chiang Li,et al.  Using mega-trend-diffusion and artificial samples in small data set learning for early flexible manufacturing system scheduling knowledge , 2007, Comput. Oper. Res..

[14]  Tianyou Chai,et al.  Modeling Load Parameters of Ball Mill in Grinding Process Based on Selective Ensemble Multisensor Information , 2013, IEEE Transactions on Automation Science and Engineering.

[15]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Robert P. W. Duin,et al.  Bagging and Boosting for the Nearest Mean Classifier: Effects of Sample Size on Diversity and Accuracy , 2002, Multiple Classifier Systems.

[17]  Der-Chiang Li,et al.  A genetic algorithm-based virtual sample generation technique to improve small data set learning , 2014, Neurocomputing.

[18]  Der-Chiang Li,et al.  Using virtual sample generation to build up management knowledge in the early manufacturing stages , 2006, Eur. J. Oper. Res..

[19]  Oleksandr Makeyev,et al.  Neural network with ensembles , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[20]  T. Poggio,et al.  Recognition and Structure from one 2D Model View: Observations on Prototypes, Object Classes and Symmetries , 1992 .

[21]  Jian Tang,et al.  Experimental analysis of wet mill load based on vibration signals of laboratory-scale ball mill shell , 2010 .

[22]  Chai Tian,et al.  Operational Optimization and Feedback Control for Complex Industrial Processes , 2013 .

[23]  John Shawe-Taylor,et al.  Bounding Sample Size with the Vapnik-Chervonenkis Dimension , 1993, Discrete Applied Mathematics.

[24]  Wei Tang,et al.  Ensembling neural networks: Many could be better than all , 2002, Artif. Intell..

[25]  Der-Chiang Li,et al.  Using mega-fuzzification and data trend estimation in small data set learning for early FMS scheduling knowledge , 2006, Comput. Oper. Res..

[26]  R. Duin Small sample size generalization , 1995 .

[27]  Jianpei Zhang,et al.  A novel virtual sample generation method based on Gaussian distribution , 2011, Knowl. Based Syst..

[28]  Zhi-Hua Zhou,et al.  NeC4.5: Neural Ensemble Based C4.5 , 2004, IEEE Trans. Knowl. Data Eng..