Multiobjective Optimization in Bioinformatics and Computational Biology

This paper reviews the application of multiobjective optimization in the fields of bioinformatics and computational biology. A survey of existing work, organized by application area, forms the main body of the review, following an introduction to the key concepts in multiobjective optimization. An original contribution of the review is the identification of five distinct "contexts," giving rise to multiple objectives: These are used to explain the reasons behind the use of multiobjective optimization in each application area and also to point the way to potential future uses of the technique

[1]  Lothar Thiele,et al.  Multiobjective genetic programming: reducing bloat using SPEA2 , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[2]  Joshua D. Knowles,et al.  An Evolutionary Approach to Multiobjective Clustering , 2007, IEEE Transactions on Evolutionary Computation.

[3]  Alex Alves Freitas,et al.  On rule interestingness measures , 1999, Knowl. Based Syst..

[4]  Roĭtberg Ma,et al.  [Pareto-optimal alignment of biological sequences]. , 1999 .

[5]  I Zwir,et al.  Automated Biological Sequence Description by Genetic Multiobjective Generalized Clustering , 2002, Annals of the New York Academy of Sciences.

[6]  Matthias Ehrgott,et al.  Multicriteria Optimization , 2005 .

[7]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems (Genetic and Evolutionary Computation) , 2006 .

[8]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.

[9]  Luis Herrera,et al.  A Hybrid Promoter Analysis Methodology for Prokaryotic Genomes , 2009, Fuzzy Systems in Bioinformatics and Computational Biology.

[10]  Jasbir S. Arora,et al.  Survey of multi-objective optimization methods for engineering , 2004 .

[11]  Indraneel Das On characterizing the “knee” of the Pareto curve based on Normal-Boundary Intersection , 1999 .

[12]  Joshua D. Knowles,et al.  Semi-supervised feature selection via multiobjective optimization , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[13]  Hitoshi Iba,et al.  Selecting informative genes using a multiobjective evolutionary algorithm , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[14]  Bhabesh Nath,et al.  Multi-objective rule mining using genetic algorithms , 2004, Inf. Sci..

[15]  M A Roĭtberg,et al.  [Pareto-optimal alignment of biological sequences]. , 1999, Biofizika.

[16]  Jason Weston,et al.  Semi-supervised Protein Classification Using Cluster Kernels , 2003, NIPS.

[17]  Jonathan E. Fieldsend,et al.  Pareto evolutionary neural networks , 2005, IEEE Transactions on Neural Networks.

[18]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[19]  Stefan Schuster,et al.  YANA – a software tool for analyzing flux modes, gene-expression and enzyme activities , 2005, BMC Bioinformatics.

[20]  Roy L. Johnston,et al.  Definition of a `guiding function' in global optimization: a hybrid approach combining energy and R-factor in structure solution from powder diffraction data , 2000 .

[21]  Jonathan E. Fieldsend,et al.  Multi-class ROC analysis from a multi-objective optimisation perspective , 2006, Pattern Recognit. Lett..

[22]  Johann Gasteiger,et al.  A Graph-Based Genetic Algorithm and Its Application to the Multiobjective Evolution of Median Molecules , 2004, J. Chem. Inf. Model..

[23]  Peter J. Fleming,et al.  Assessing the performance of multiobjective genetic algorithms for optimization of a batch process scheduling problem , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[24]  Sanjeev Garg,et al.  Multiobjective optimization of a free radical bulk polymerization reactor using genetic algorithm , 1999 .

[25]  V. Cutello,et al.  A multi-objective evolutionary approach to the protein structure prediction problem , 2006, Journal of The Royal Society Interface.

[26]  A. Brazma,et al.  Towards reconstruction of gene networks from expression data by supervised learning , 2003, Genome Biology.

[27]  Victor J. Rayward-Smith,et al.  Developments on a Multi-objective Metaheuristic (MOMH) Algorithm for Finding Interesting Sets of Classification Rules , 2005, EMO.

[28]  Alex Alves Freitas,et al.  A critical review of multi-objective optimization in data mining: a position paper , 2004, SKDD.

[29]  Douglas B Kell,et al.  Theodor Bücher Lecture. Metabolomics, modelling and machine learning in systems biology - towards an understanding of the languages of cells. Delivered on 3 July 2005 at the 30th FEBS Congress and the 9th IUBMB conference in Budapest. , 2006, The FEBS journal.

[30]  Kalyanmoy Deb,et al.  Multi-objective evolutionary algorithms: introducing bias among Pareto-optimal solutions , 2003 .

[31]  Yaochu Jin,et al.  Multi-Objective Machine Learning (Studies in Computational Intelligence) (Studies in Computational Intelligence) , 2006 .

[32]  Byoung-Tak Zhang,et al.  Multi-objective Evolutionary Probe Design Based on Thermodynamic Criteria for HPV Detection , 2004, PRICAI.

[33]  Anil K. Jain,et al.  Clustering ensembles: models of consensus and weak partitions , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Gary B. Lamont,et al.  Solving the Protein Structure Prediction Problem Through a Multiobjective Genetic Algorithm , 2002 .

[35]  Mehrdad Tamiz,et al.  Multi-objective meta-heuristics: An overview of the current state-of-the-art , 2002, Eur. J. Oper. Res..

[36]  HandlJulia,et al.  Computational cluster validation in post-genomic data analysis , 2005 .

[37]  Ken Brodlie,et al.  Navigating high-dimensional spaces to support design steering , 2000, Proceedings Visualization 2000. VIS 2000 (Cat. No.00CH37145).

[38]  Xavier Gandibleux,et al.  A survey and annotated bibliography of multiobjective combinatorial optimization , 2000, OR Spectr..

[39]  Horst Bunke,et al.  On Median Graphs: Properties, Algorithms, and Applications , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Christian Igel,et al.  Multi-objective Model Selection for Support Vector Machines , 2005, EMO.

[41]  Hussein A. Abbass,et al.  Pareto neuro-evolution: constructing ensemble of neural networks using multi-objective optimization , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[42]  Ralph E. Steuer,et al.  Multiple Criteria Decision Making, Multiattribute Utility Theory: The Next Ten Years , 1992 .

[43]  Ralph E. Steuer Multiple criteria optimization , 1986 .

[44]  Sushmita Mitra,et al.  Computational Intelligence in Bioinformatics , 2005, Trans. Rough Sets.

[45]  Hussein A. Abbass,et al.  Speeding Up Backpropagation Using Multiobjective Evolutionary Algorithms , 2003, Neural Computation.

[46]  Carla E. Brodley,et al.  Feature Selection for Unsupervised Learning , 2004, J. Mach. Learn. Res..

[47]  Colas Schretter,et al.  OligoFaktory: a visual tool for interactive oligonucleotide design , 2006, Bioinform..

[48]  Peter J. Fleming,et al.  Genetic Algorithms for Multiobjective Optimization: FormulationDiscussion and Generalization , 1993, ICGA.

[49]  Christian Igel,et al.  Evolutionary Multi-Objective Optimisation Of Neural Networks For Face Detection , 2004, Int. J. Comput. Intell. Appl..

[50]  G. K. Suraishkumar,et al.  Multi-objective optimization in Aspergillus niger fermentation for selective product enhancement , 2005, Bioprocess and biosystems engineering.

[51]  Frances H. Arnold,et al.  Molecular breeding of carotenoid biosynthetic pathways , 2000, Nature Biotechnology.

[52]  Xavier Llorà,et al.  Bounding the Effect of Noise in Multiobjective Learning Classifier Systems , 2003, Evolutionary Computation.

[53]  Roberto J. Bayardo,et al.  Mining the most interesting rules , 1999, KDD '99.

[54]  A FreitasAlex A critical review of multi-objective optimization in data mining , 2004 .

[55]  Gary G. Yen,et al.  Hierarchical Rank Density Genetic Algorithm for Radial-Basis Function Neural Network Design , 2003, Int. J. Comput. Intell. Appl..

[56]  Theodor J. Stewart,et al.  Multiple criteria decision analysis - an integrated approach , 2001 .

[57]  Leon Poladian,et al.  Multi-objective evolutionary algorithms and phylogenetic inference with multiple data sets , 2006, Soft Comput..

[58]  Sanjoy Das,et al.  A multi-objective GA-simplex hybrid approach for gene regulatory network models , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[59]  H. Halsall-Whitney,et al.  Multicriteria optimization of gluconic acid production using net flow , 2003, Bioprocess and biosystems engineering.

[60]  Alfred O. Hero,et al.  Pareto-Optimal Methods for Gene Ranking , 2004, J. VLSI Signal Process..

[61]  Gisbert Schneider,et al.  Computer-based de novo design of drug-like molecules , 2005, Nature Reviews Drug Discovery.

[62]  Edwin D. de Jong,et al.  Reducing bloat and promoting diversity using multi-objective methods , 2001 .

[63]  R. K. Ursem Multi-objective Optimization using Evolutionary Algorithms , 2009 .

[64]  K. Deb,et al.  Reliable classification of two-class cancer data using evolutionary algorithms. , 2003, Bio Systems.

[65]  Valerie J. Gillet,et al.  Generation of multiple pharmacophore hypotheses using multiobjective optimisation techniques , 2004, J. Comput. Aided Mol. Des..

[66]  Hisao Ishibuchi,et al.  Fuzzy rule selection by multi-objective genetic local search algorithms and rule evaluation measures in data mining , 2004, Fuzzy Sets Syst..

[67]  Daniel Baum Multiple Semi-flexible 3D Superposition of Drug-Sized Molecules , 2005, CompLife.

[68]  Richard Bonneau,et al.  An improved protein decoy set for testing energy functions for protein structure prediction , 2003, Proteins.

[69]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[70]  E. Besada-Portas,et al.  Multiobjective optimization and multivariable control of the beer fermentation process with the use of evolutionary algorithms , 2004, Journal of Zhejiang University. Science.

[71]  Joshua D. Knowles,et al.  ParEGO: a hybrid algorithm with on-line landscape approximation for expensive multiobjective optimization problems , 2006, IEEE Transactions on Evolutionary Computation.

[72]  R. S. Laundy,et al.  Multiple Criteria Optimisation: Theory, Computation and Application , 1989 .

[73]  Flávio Bortolozzi,et al.  Unsupervised feature selection using multi-objective genetic algorithms for handwritten word recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[74]  Clarisse Dhaenens,et al.  A multicriteria genetic algorithm to analyze microarray data , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[75]  A. Ferligoj,et al.  Direct multicriteria clustering algorithms , 1992 .

[76]  Matthias Ehrgott,et al.  Optimisation of beam directions in intensity modulated radiation therapy planning , 2003, OR Spectr..

[77]  Frances H. Arnold,et al.  When blind is better: Protein design by evolution , 1998, Nature Biotechnology.

[78]  Jacques Cohen,et al.  Bioinformatics—an introduction for computer scientists , 2004, CSUR.

[79]  Julio Vera,et al.  MetMAP: An integrated Matlab TM package for analysis and optimization of metabolic systems , 2003, Silico Biol..

[80]  Rajeev Kumar,et al.  Multiobjective genetic algorithm partitioning for hierarchical learning of high-dimensional pattern spaces: a learning-follows-decomposition strategy , 1998, IEEE Trans. Neural Networks.

[81]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[82]  Richard A. Watson,et al.  Reducing Local Optima in Single-Objective Problems by Multi-objectivization , 2001, EMO.

[83]  Mikkel T. Jensen,et al.  Guiding Single-Objective Optimization Using Multi-objective Methods , 2003, EvoWorkshops.

[84]  Warren S. Sarle,et al.  Stopped Training and Other Remedies for Overfitting , 1995 .

[85]  Kalyanmoy Deb,et al.  Finding Knees in Multi-objective Optimization , 2004, PPSN.

[86]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[87]  Arlindo L. Oliveira,et al.  Biclustering algorithms for biological data analysis: a survey , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[88]  Marcel J. T. Reinders,et al.  Multi-criterion optimization for genetic network modeling , 2003, Signal Process..

[89]  Guido Stehr,et al.  Performance trade-off analysis of analog circuits by normal-boundary intersection , 2003, Proceedings 2003. Design Automation Conference (IEEE Cat. No.03CH37451).

[90]  Joshua D. Knowles,et al.  On semi-supervised clustering via multiobjective optimization , 2006, GECCO.

[91]  Douglas B. Kell,et al.  Computational cluster validation in post-genomic data analysis , 2005, Bioinform..

[92]  Yimin Liu,et al.  Integrating Multi-Objective Genetic Algorithm and Validity Analysis for Locating and Ranking Alternative Clustering , 2005, Informatica.

[93]  Luiz Eduardo Soares de Oliveira,et al.  A Methodology for Feature Selection Using Multiobjective Genetic Algorithms for Handwritten Digit String Recognition , 2003, Int. J. Pattern Recognit. Artif. Intell..

[94]  Valerie J Gillet,et al.  Multiobjective optimization in quantitative structure-activity relationships: deriving accurate and interpretable QSARs. , 2002, Journal of medicinal chemistry.

[95]  Filippo Menczer,et al.  Evolutionary model selection in unsupervised learning , 2002, Intell. Data Anal..

[96]  DaeEun Kim,et al.  Structural Risk Minimization on Decision Trees Using an Evolutionary Multiobjective Optimization , 2004, EuroGP.

[97]  Xin Yao,et al.  DIVACE: Diverse and Accurate Ensemble Learning Algorithm , 2004, IDEAL.

[98]  Donato Malerba,et al.  A Comparative Analysis of Methods for Pruning Decision Trees , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[99]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[100]  Philip E. Bourne,et al.  A New Scoring Function and Associated Statistical Significance for Structure Alignment by CE , 2004, J. Comput. Biol..

[101]  Robert M. Hubley,et al.  Evolutionary algorithms for the selection of single nucleotide polymorphisms , 2003, BMC Bioinformatics.

[102]  Alan R. Fersht,et al.  Directed evolution of new catalytic activity using the α/β-barrel scaffold , 2000, Nature.

[103]  Johann Gasteiger,et al.  Superposition of Three-Dimensional Chemical Structures Allowing for Conformational Flexibility by a Hybrid Method , 1998, J. Chem. Inf. Comput. Sci..

[104]  Alfred O. Hero,et al.  Multicriteria Gene Screening for Analysis of Differential Expression with DNA Microarrays , 2004, EURASIP J. Adv. Signal Process..

[105]  M. Tristem Molecular Evolution — A Phylogenetic Approach. , 2000, Heredity.

[106]  Eric Bauer,et al.  An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.

[107]  Theodor J. Stewart,et al.  Multiple Criteria Decision Analysis , 2001 .

[108]  Silvia Curteanu,et al.  Alternatives for multiobjective optimization of a polymerization process , 2006 .

[109]  Vladimir Brusic,et al.  Multi-Objective Evolutionary Algorithm for Discovering Peptide Binding Motifs , 2006, EvoWorkshops.

[110]  S. Ruzika,et al.  Approximation Methods in Multiobjective Programming , 2005 .

[111]  Byoung-Tak Zhang,et al.  Multiobjective evolutionary optimization of DNA sequences for reliable DNA computing , 2005, IEEE Transactions on Evolutionary Computation.

[112]  Marta Cascante,et al.  Multicriteria optimization of biochemical systems by linear programming: application to production of ethanol by Saccharomyces cerevisiae. , 2003, Biotechnology and bioengineering.

[113]  Kishalay Mitra,et al.  Multiobjective dynamic optimization of a semi-batch epoxy polymerization process , 2004, Comput. Chem. Eng..

[114]  Kalyanmoy Deb,et al.  Multiclass protein fold recognition using multiobjective evolutionary algorithms , 2004, 2004 Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[115]  Christian Fonteix,et al.  Multicriteria optimization of a single-cell oil production , 2004, Eur. J. Oper. Res..

[116]  A. Messac,et al.  Smart Pareto filter: obtaining a minimal representation of multiobjective design space , 2004 .

[117]  Steffen Schulze-Kremer Application of Evolutionary Computation to Protein Folding with Specialized Operators , 2003 .

[118]  Alfred O. Hero,et al.  Pareto-optimal methods for gene analysis , 2002 .

[119]  M. Ierapetritou,et al.  Novel quantitative tools for engineering analysis of hepatocyte cultures in bioartificial liver systems. , 2005, Biotechnology and bioengineering.

[120]  P. J. Fleming,et al.  The good of the many outweighs the good of the one: evolutionary multi-objective optimization , 2003 .

[121]  Kalyanmoy Deb,et al.  Multi-Class Protein Fold Recognition Using Multi-Objective Evolutionary Algorithms , 2004 .

[122]  Joshua D. Knowles,et al.  Feature subset selection in unsupervised learning via multiobjective optimization , 2006 .

[123]  Andreas Zell,et al.  Multi-objective Model Optimization for Inferring Gene Regulatory Networks , 2005, EMO.

[124]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[125]  Pablo Moscato,et al.  Inferring Phylogenetic Trees Using Evolutionary Algorithms , 2002, PPSN.

[126]  Junhyong Kim,et al.  Separate Versus Combined Analysis of Phylogenetic Evidence , 1995 .

[127]  D. Kell Metabolomics, modelling and machine learning in systems biology – towards an understanding of the languages of cells , 2006, The FEBS journal.

[128]  Peter Willett,et al.  Designing focused libraries using MoSELECT. , 2002, Journal of molecular graphics & modelling.

[129]  Bernhard Sendhoff,et al.  Neural network regularization and ensembling using multi-objective evolutionary algorithms , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[130]  Ralph L. Keeney,et al.  Decisions with multiple objectives: preferences and value tradeoffs , 1976 .

[131]  R. L. Keeney,et al.  Decisions with Multiple Objectives: Preferences and Value Trade-Offs , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[132]  Kishalay Mitra,et al.  Towards a better understanding of the epoxy-polymerization process using multi-objective evolutionary computation , 2004 .

[133]  Valerie J Gillet,et al.  Designing combinatorial libraries optimized on multiple objectives. , 2004, Methods in molecular biology.

[134]  V. J. Rayward-Smith,et al.  Data mining rules using multi-objective evolutionary algorithms , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[135]  C. Fonseca,et al.  GENETIC ALGORITHMS FOR MULTI-OBJECTIVE OPTIMIZATION: FORMULATION, DISCUSSION, AND GENERALIZATION , 1993 .

[136]  D. Mount Bioinformatics: Sequence and Genome Analysis , 2001 .

[137]  Frans Coenen,et al.  Threshold Tuning for Improved Classification Association Rule Mining , 2005, PAKDD.

[138]  Hitoshi Iba,et al.  Prediction of tumor outcome based on gene expression data , 2008, Wuhan University Journal of Natural Sciences.

[139]  D. Brutlag,et al.  Highly specific protein sequence motifs for genome analysis. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[140]  J. C. Schön,et al.  Combined method for ab initio structure solution from powder diffraction data , 1999 .

[141]  M. Anastasio,et al.  Multiobjective genetic optimization of diagnostic classifiers with implications for generating receiver operating characteristic curves , 1999, IEEE Transactions on Medical Imaging.

[142]  J M Blackburn,et al.  Directed evolution of new catalytic activity using the alpha/beta-barrel scaffold. , 2000, Nature.

[143]  Linda R. Petzold,et al.  OPTIMAL PERFORMANCE OF THE HEAT-SHOCK GENE REGULATORY NETWORK , 2005 .