Multiobjective PSO-based rank aggregation: Application in gene ranking from microarray data

Rank aggregation refers to the problem of aggregating a number of individual preference rankings of some items to yield a consensus ranking. This problem is usually posed as an optimization problem in which the average distance of the aggregated ranking from the input rank lists is minimized in order to obtain a consensus. However, it may be noted that minimizing the average distance does not always ensure an impartial aggregation. This is because the aggregated ranking may be very similar to one input ranking, whereas at the same time, it may be distant from another. This may reduce the overall average distance while producing a biased aggregated ranking. This possibility motivates us to pose the problem of rank aggregation as a multiobjective optimization problem. Here the objective is not only to minimize the average distance of the aggregated ranking from the input rank lists, but also to minimize the standard deviation of the distance values simultaneously to avoid any bias towards any particular input ranking. In order to minimize both the objective functions simultaneously, we propose a multiobjective particle swarm optimization (PSO) based rank aggregation algorithm in this article. The performance of the proposed technique is demonstrated for some artificial datasets. Moreover, its application in gene ranking from microarray gene expression data is studied and the performance is compared with that of some state-of-art techniques to establish its superiority.

[1]  Frank A. Farris The Gini Index and Measures of Inequality , 2010, Am. Math. Mon..

[2]  Ujjwal Maulik,et al.  A Survey of Multiobjective Evolutionary Clustering , 2015, ACM Comput. Surv..

[3]  Mark Voorneveld,et al.  Characterization of Pareto dominance , 2003, Oper. Res. Lett..

[4]  Ujjwal Maulik,et al.  A Survey of Multiobjective Evolutionary Algorithms for Data Mining: Part I , 2014, IEEE Transactions on Evolutionary Computation.

[5]  Jason Wittenberg,et al.  Clarify: Software for Interpreting and Presenting Statistical Results , 2003 .

[6]  Ali Rıza Yıldız,et al.  A novel particle swarm optimization approach for product design and manufacturing , 2008 .

[7]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[8]  Shili Lin,et al.  Rank aggregation methods , 2010 .

[9]  In-Hee Lee,et al.  A filter-based feature selection approach for identifying potential biomarkers for lung cancer , 2011, Journal of Clinical Bioinformatics.

[10]  Wei Pan,et al.  A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments , 2002, Bioinform..

[11]  Erwie Zahara,et al.  A hybrid genetic algorithm and particle swarm optimization for multimodal functions , 2008, Appl. Soft Comput..

[12]  Sven Laur,et al.  Robust rank aggregation for gene list integration and meta-analysis , 2012, Bioinform..

[13]  Ahmad Bagheri,et al.  HEPSO: High exploration particle swarm optimization , 2014, Inf. Sci..

[14]  G. Marsaglia,et al.  Evaluating Kolmogorov's distribution , 2003 .

[15]  G. Box NON-NORMALITY AND TESTS ON VARIANCES , 1953 .

[16]  A. Mukhopadhyay,et al.  Identifying most relevant non-redundant gene markers from gene expression data using PSO-based graph -theoretic approach , 2012, 2012 2nd IEEE International Conference on Parallel, Distributed and Grid Computing.

[17]  Minqiang Li,et al.  A hybrid niching PSO enhanced with recombination-replacement crowding strategy for multimodal function optimization , 2012, Appl. Soft Comput..

[18]  Ronaldo C. Prati,et al.  Combining feature ranking algorithms through rank aggregation , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[19]  H. Lilliefors On the Kolmogorov-Smirnov Test for Normality with Mean and Variance Unknown , 1967 .

[20]  Ujjwal Maulik,et al.  Gene-Expression-Based Cancer Subtypes Prediction Through Feature Selection and Transductive SVM , 2013, IEEE Transactions on Biomedical Engineering.

[21]  Pablo M. Granitto,et al.  Gene Set Enrichment Analysis Using Non-parametric Scores , 2007, BSB.

[22]  Verónica Bolón-Canedo,et al.  A review of microarray datasets and applied feature selection methods , 2014, Inf. Sci..

[23]  Tao Qin,et al.  Supervised rank aggregation , 2007, WWW '07.

[24]  R. Graham,et al.  Spearman's Footrule as a Measure of Disarray , 1977 .

[25]  Richard Mankiewicz The Story of Mathematics , 2001 .

[26]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.

[27]  Salwani Abdullah,et al.  Hybridising harmony search with a Markov blanket for gene selection problems , 2014, Inf. Sci..

[28]  Thomas E. Potok,et al.  Document clustering using particle swarm optimization , 2005, Proceedings 2005 IEEE Swarm Intelligence Symposium, 2005. SIS 2005..

[29]  Russ B. Altman,et al.  Nonparametric methods for identifying differentially expressed genes in microarray data , 2002, Bioinform..

[30]  Kalyanmoy Deb,et al.  Multi-objective optimization using evolutionary algorithms , 2001, Wiley-Interscience series in systems and optimization.

[31]  Anirban Mukhopadhyay,et al.  A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[32]  Michael N. Vrahatis,et al.  Particle Swarm Optimization and Intelligence: Advances and Applications , 2010 .

[33]  Koji Kadota,et al.  Evaluating methods for ranking differentially expressed genes applied to microArray quality control data , 2011, BMC Bioinformatics.

[34]  Stefan Roepcke,et al.  Genome-wide expression patterns of invasion front, inner tumor mass and surrounding normal epithelium of colorectal tumors , 2007, Molecular Cancer.

[35]  E. Lehmann,et al.  Nonparametrics: Statistical Methods Based on Ranks , 1976 .

[36]  Ali R. Yildiz,et al.  Comparison of evolutionary-based optimization algorithms for structural design optimization , 2013, Eng. Appl. Artif. Intell..

[37]  Juan-Zi Li,et al.  A multi-objective evolutionary algorithm for feature selection based on mutual information with a new redundancy measure , 2015, Inf. Sci..

[38]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[39]  Ali R. Yildiz,et al.  A new hybrid particle swarm optimization approach for structural design optimization in the automotive industry , 2012 .

[40]  Wilfred Ng,et al.  Identifying Differentially Expressed Genes via Weighted Rank Aggregation , 2011, 2011 IEEE 11th International Conference on Data Mining.

[41]  Javed A. Aslam,et al.  Condorcet fusion for improved retrieval , 2002, CIKM '02.

[42]  Ali R. Yildiz,et al.  A comparative study of population-based optimization algorithms for turning operations , 2012, Inf. Sci..

[43]  Kiran Solanki,et al.  Multi-objective optimization of vehicle crashworthiness using a new particle swarm based approach , 2012 .

[44]  Giovanni Acampora,et al.  Enhancing ontology alignment through a memetic aggregation of similarity measures , 2013, Inf. Sci..

[45]  M Reyes Sierra,et al.  Multi-Objective Particle Swarm Optimizers: A Survey of the State-of-the-Art , 2006 .

[46]  Ruth Etzioni,et al.  Combining Results of Microarray Experiments: A Rank Aggregation Approach , 2006 .

[47]  Hui Wang,et al.  Diversity enhanced particle swarm optimization with neighborhood search , 2013, Inf. Sci..

[48]  Welch Bl THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARLANCES ARE INVOLVED , 1947 .

[49]  Anirban Mukhopadhyay,et al.  A PSO-based rank aggregation algorithm for ranking genes from microarray data , 2013, PCI '13.

[50]  Wagner Meira,et al.  Competence-Conscious Associative Rank Aggregation , 2011, J. Inf. Data Manag..

[51]  Narasimhan Sundararajan,et al.  Self regulating particle swarm optimization algorithm , 2015, Inf. Sci..