A^2DTEL: Attention-Aware based Deep Tree Ensemble Learning

In recent years, random forest, as a classic ensemble learning method, has been widely used in many real scenarios benefited from its advantages such as superior performance and easy implementation. Random forest has high generalization ability and can achieve quantitative backtracking of the importance of decision feature parameters during the prediction processing, but it is fail to effectively learn the representation from the input like some deep learning based methods can do, which limits the prediction accuracy unavoidably. To address this issue, this paper proposes an attention-aware deep tree ensemble learning method. Specifically, it introduces the idea of deep learning into the random forest to design a multilayer stacking prediction model. In this way, the prediction result of the upper layer can be weighted to derive the attention parameter of the input sample at the current layer so that the layered importance sampling can be achieved. Compared with the traditional random forest model, the proposed method can learn representation from the input in a more effectively way. In experimental results, the proposed method can outperform other baselines in two real classification tasks significantly.

[1]  Chun Zhang,et al.  A weighted random forest approach to improve predictive performance for power system transient stability assessment , 2016, 2016 IEEE PES Asia-Pacific Power and Energy Engineering Conference (APPEEC).

[2]  Venkatesh Saligrama,et al.  Feature-Budgeted Random Forest , 2015, ICML.

[3]  Yao Zhao,et al.  EA-LSTM: Evolutionary Attention-based LSTM for Time Series Prediction , 2018, Knowl. Based Syst..

[4]  Jianfeng Gao,et al.  Deep stacking networks for information retrieval , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[6]  Wenjuan Li,et al.  Improving the Performance of Neural Networks with Random Forest in Detecting Network Intrusions , 2013, ISNN.

[7]  Yunming Ye,et al.  Hybrid Random Forests: Advantages of Mixed Trees in Classifying Text Data , 2012, PAKDD.

[8]  Takuji Kinkyo,et al.  Financial Hazard Map: Financial Vulnerability Predicted by a Random Forests Classification Model , 2018 .

[9]  Diego Cabrera,et al.  Gearbox fault diagnosis based on deep random forest fusion of acoustic and vibratory signals , 2016 .

[10]  Elias Symeonakis,et al.  Modelling land cover change in a Mediterranean environment using Random Forests and a multi-layer neural network model , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[11]  Constantin F. Aliferis,et al.  A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification , 2008, BMC Bioinformatics.