Deep forest regression based on cross-layer full connection

Key process parameters such as production qualities and environmental pollution indices are difficult to be measured online in complex industrial processes. High time and economic costs make only limited small sample data be obtained to build process models, while the deep neural network model requires massive training samples. Although the deep forest algorithm is based on nonneural network structure, it mainly is utilized to effectively address classification problems. Owing to the above problems, a new deep forest regression algorithm based on cross-layer full connection is proposed. First of all, sub-forest prediction values of the input layer forest module are processed to obtain the layer regression vector, which is combined with the raw feature vector as the input of the middle layer forest model. And then, a cross-layer full connection way connecting the former layer regression vector contributes to an augmented layer regression vector. Meanwhile, the deep layer’s number is adaptively adjusted via verifying the validation error. In the end, the output layer forest model is trained by using the augmented layer regression vector originated from the middle layer forest model and the raw feature vector. Sequentially, the maximum information flow is effectively ensured by information sharing. Moreover, the proposed method has the advantages of simple hyper-parameter setting criterion. Simulation results based on benchmark and industrial data show that the proposed method has equal or better performance than several state-of-art methods.

[1]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[2]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[3]  I-Cheng Yeh,et al.  Modeling of strength of high-performance concrete using artificial neural networks , 1998 .

[4]  Philip S. Yu,et al.  Is random model better? On its accuracy and efficiency , 2003, Third IEEE International Conference on Data Mining.

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[7]  I-Cheng Yeh,et al.  Modeling slump flow of concrete using second-order regressions and artificial neural networks , 2007 .

[8]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[9]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .

[10]  Tianyou Chai,et al.  On-line principal component analysis with application to process modeling , 2012, Neurocomputing.

[11]  Peter Kontschieder,et al.  Deep Neural Decision Forests , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[12]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[13]  Olivier Sigaud,et al.  Many regression algorithms, one unified model: A review , 2015, Neural Networks.

[14]  Xiantong Zhen,et al.  Multi-scale deep networks and regression forests for direct bi-ventricular volume estimation , 2016, Medical Image Anal..

[15]  Mohammad Hossein Rafiei,et al.  A Novel Machine Learning Model for Estimation of Sale Prices of Real Estate Units , 2016 .

[16]  Xianping Fu,et al.  Appearance-based gaze estimation using deep features and random forest regression , 2016, Knowl. Based Syst..

[17]  Kevin Miller,et al.  Forward Thinking: Building Deep Random Forests , 2017, ArXiv.

[18]  Chris Aldrich,et al.  An IsaMill™ Soft Sensor based on Random Forests and Principal Component Analysis , 2017 .

[19]  Yuanqing Xia,et al.  A deep Boltzmann machine and multi-grained scanning forest ensemble collaborative method and its application to industrial fault diagnosis , 2018, Comput. Ind..

[20]  Ying Xie,et al.  Forest fire forecasting using ensemble learning approaches , 2019, Neural Computing and Applications.

[21]  John G. Lyons,et al.  A soft sensor for prediction of mechanical properties of extruded PLA sheet using an instrumented slit die and machine learning algorithms , 2018, Polymer Testing.

[22]  Yunsoo Choi,et al.  A data ensemble approach for real-time air quality forecasting using extremely randomized trees and deep neural networks , 2019, Neural Computing and Applications.

[23]  Omer Deperlioglu,et al.  An enhanced diabetic retinopathy detection and classification approach using deep convolutional neural network , 2019, Neural Computing and Applications.

[24]  Ji Feng,et al.  Deep forest , 2017, IJCAI.

[25]  Yong Yan,et al.  Online prediction of biomass moisture content in a fluidized bed dryer using electrostatic sensor arrays and the Random Forest method , 2019, Fuel.

[26]  Haiyang Wang,et al.  Dense adaptive cascade forest: a self-adaptive deep ensemble for classification problems , 2018, Soft Computing.

[27]  Xinxin Zhang,et al.  Multi-objective prediction of coal-fired boiler with a deep hybrid neural networks , 2020 .

[28]  Xiaobing Kong,et al.  Wind speed forecasting using deep neural network with feature selection , 2020, Neurocomputing.