A Survey on Ensemble Methods for High Dimensional Data Classification in Biomedicine Field

The volume of the information is growing exceptionally large. So, there is growing interest to help people to categorize, handle and control these resources. In last few years, the data mining is applied widely to discover the knowledge from information system. Classification is one of the tools which are used for data mining. Ensemble methods proved to be superior to individual classification method for high dimensional datasets. Hence, this paper surveys many ensemble methods along with bagging, boosting and random forest. It gives the idea about the earlier proposed categories of ensemble methods. It also discusses about the performance of ensemble methods.

[1]  Cheng Hao Jin,et al.  Ensemble method for classification of high-dimensional data , 2014, 2014 International Conference on Big Data and Smart Computing (BIGCOMP).

[2]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[3]  Sergio Escalera,et al.  A Framework towards the Unification of Ensemble Classification Methods , 2013, 2013 12th International Conference on Machine Learning and Applications.

[4]  Sung-Bae Cho,et al.  Cancer classification using ensemble of neural networks with multiple significant gene subsets , 2007, Applied Intelligence.

[5]  Ashfaqur Rahman,et al.  Ensemble classifier generation using non-uniform layered clustering and Genetic Algorithm , 2013, Knowl. Based Syst..

[6]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[7]  Aboul Ella Hassanien,et al.  Ensemble-based classifiers for prostate cancer diagnosis , 2013, 2013 9th International Computer Engineering Conference (ICENCO).

[8]  Jun Ni,et al.  An Improved Ensemble Learning Method for Classifying High-Dimensional and Imbalanced Biomedicine Data , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.