A review on data mining and continuous optimization applications in computational biology and medicine.

An emerging research area in computational biology and biotechnology is devoted to mathematical modeling and prediction of gene-expression patterns; it nowadays requests mathematics to deeply understand its foundations. This article surveys data mining and machine learning methods for an analysis of complex systems in computational biology. It mathematically deepens recent advances in modeling and prediction by rigorously introducing the environment and aspects of errors and uncertainty into the genetic context within the framework of matrix and interval arithmetics. Given the data from DNA microarray experiments and environmental measurements, we extract nonlinear ordinary differential equations which contain parameters that are to be determined. This is done by a generalized Chebychev approximation and generalized semi-infinite optimization. Then, time-discretized dynamical systems are studied. By a combinatorial algorithm which constructs and follows polyhedra sequences, the region of parametric stability is detected. In addition, we analyze the topological landscape of gene-environment networks in terms of structural stability. As a second strategy, we will review recent model selection and kernel learning methods for binary classification which can be used to classify microarray data for cancerous cells or for discrimination of other kind of diseases. This review is practically motivated and theoretically elaborated; it is devoted to a contribution to better health care, progress in medicine, a better education, and more healthy living conditions.

[1]  Gerhard-Wilhelm Weber,et al.  Mathematical Modeling and Approximation of Gene Expression Patterns , 2004, OR.

[2]  M. O. Dayhoff A model of evolutionary change in protein , 1978 .

[3]  Nikolaj Blom,et al.  Prediction of proprotein convertase cleavage sites. , 2004, Protein engineering, design & selection : PEDS.

[4]  G.,et al.  Stability Analysis of Gene Expression Patterns by Dynamical Systems and a Combinatorial Algorithm , 2005 .

[5]  Robert K. Brayton,et al.  Stability of dynamical systems: A constructive approach , 1979 .

[6]  P. Holmes,et al.  Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields , 1983, Applied Mathematical Sciences.

[7]  J. Gebert,et al.  Inference of Gene Expression Patterns by Using a Hybrid System Formulation An Algorithmic Approach to Local State Transition Matrices , 2004 .

[8]  R. Hettich,et al.  Approximation und Optimierung , 1982 .

[9]  J. Rückmann,et al.  On generalized semi-infinite programming , 2006 .

[10]  Hubertus Th. Jongen,et al.  Generalized semi-infinite optimization: A first order optimality condition and examples , 1998, Math. Program..

[11]  Fatma Byilmaz A MATHEMATICAL MODELING AND APPROXIMATION OF GENE EXPRESSION PATTERNS BY LINEAR AND QUADRATIC REGULATORY RELATIONS AND ANALYSIS OF GENE NETWORKS , 2004 .

[12]  T. Spector,et al.  Epigenetic differences arise during the lifetime of monozygotic twins. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Hiroyuki Ogata,et al.  AAindex: Amino Acid Index Database , 1999, Nucleic Acids Res..

[14]  T. Knudsen,et al.  Reprogramming of genetic networks during initiation of the Fetal Alcohol Syndrome , 2007, Developmental dynamics : an official publication of the American Association of Anatomists.

[15]  G.-W. Weber,et al.  Optimization of gene-environment networks in the presence of errors and uncertainty with Chebychev approximation , 2008 .

[16]  Röbbe Wünschiers,et al.  An algorithmic approach to analyse genetic networks and biological energy production: an introduction and contribution where OR meets biology , 2009 .

[17]  R. Feil,et al.  Environmental and nutritional effects on the epigenetic regulation of genes. , 2006, Mutation research.

[18]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[19]  Hubertus Th. Jongen,et al.  Nonlinear Optimization in Finite Dimensions - Morse Theory, Chebyshev Approximation, Transversality, Flows, Parametric Aspects , 2000 .

[20]  S. Edvinsson,et al.  Cardiovascular and diabetes mortality determined by nutrition during parents' and grandparents' slow growth period , 2002, European Journal of Human Genetics.

[21]  Rengül Çetin-Atalay,et al.  Implicit motif distribution based hybrid computational kernel for sequence classification , 2005, Bioinform..

[22]  Erik Kropat,et al.  A survey on OR and mathematical methods applied on gene-environment networks , 2009, Central Eur. J. Oper. Res..

[23]  H. Öktem A survey on piecewise-linear models of regulatory dynamical systems , 2005 .

[24]  Mesut Tastan,et al.  ANALYSIS AND PREDICTION OF GENE EXPRESSION PATTERNS BY DYNAMICAL SYSTEMS, AND BY A COMBINATORIAL ALGORITHM , 2005 .

[25]  Satoru Miyano,et al.  Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data of Bacillus Subtilis Using Differential Equations , 2002, Pacific Symposium on Biocomputing.

[26]  H. Iba,et al.  Inferring a system of differential equations for a gene regulatory network by using genetic programming , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[27]  Ivan Bratko,et al.  GenePath: a system for inference of genetic networks and proposal of genetic experiments , 2003, Artif. Intell. Medicine.

[28]  Nicole Radde,et al.  A New Approach for Modeling Procaryotic Biochemical Networks With Differential Equations , 2006 .

[29]  S. Altschul Amino acid substitution matrices from an information theoretic perspective , 1991, Journal of Molecular Biology.

[30]  Oliver Stein,et al.  Bi-Level Strategies in Semi-Infinite Programming , 2003 .

[31]  J. Gebert,et al.  Analyzing and optimizing genetic network structure via path-finding , 2004 .

[32]  G. Yagil,et al.  On the relation between effector concentration and the rate of induced enzyme synthesis. , 1971, Biophysical journal.

[33]  Gerhard-Wilhelm Weber,et al.  On optimization, dynamics and uncertainty: A tutorial for gene-environment networks , 2009, Discret. Appl. Math..

[34]  Gerhard-Wilhelm Weber Generalized semi-infinite optimization: On iteration procedures and topological aspects , 1996 .

[35]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[36]  H. Jongen,et al.  On parametric nonlinear programming , 1991 .

[37]  Gerhard-Wilhelm Weber,et al.  Generalized semi-infinite optimization: On some foundations , 1999 .

[38]  Gerhard-Wilhelm Weber,et al.  Modeling gene regulatory networks with piecewise linear differential equations , 2007, Eur. J. Oper. Res..

[39]  J. Shawe-Taylor,et al.  Pattern Analysis for the Prediction of Eukoryatic Pro-peptide Cleavage Sites ? , 2007 .

[40]  Gerhard-Wilhelm Weber,et al.  Optimization and dynamics of gene-environment networks with intervals , 2007 .

[41]  Carlos Cordon-Cardo,et al.  DNA Microchips: Technical and Practical Considerations , 2000 .

[42]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[43]  Gerhard-Wilhelm Weber,et al.  On generalized semi-infinite optimization of genetic networks , 2007 .

[44]  Gerhard-Wilhelm Weber,et al.  Challenges in the Optimization of Biosystems II: Mathematical Modeling and Stability Analysis of Gene-Expression Patterns in an Extended Space and with Runge-Kutta Discretization , 2005, OR.

[45]  Gerhard-Wilhelm Weber,et al.  Generalized semi-infinite optimization and related topics , 1999 .

[46]  G. Weber,et al.  MODELING AND PREDICTION OF GENE-EXPRESSION PATTERNS RECONSIDERED WITH RUNGEKUTTA DISCRETIZATION , 2004 .

[47]  Hubertus Th. Jongen,et al.  Nonlinear optimization: Characterization of structural stability , 1991, J. Glob. Optim..

[48]  Ravindra K. Ahuja,et al.  Network Flows: Theory, Algorithms, and Applications , 1993 .

[49]  Gerhard-Wilhelm Weber,et al.  Mathematical contributions to dynamics and optimization of gene-environment networks , 2008 .

[50]  Röbbe Wünschiers,et al.  An algorithm to analyze stability of gene-expression patterns , 2006, Discret. Appl. Math..

[51]  Sui Huang Gene expression profiling, genetic networks, and cellular states: an integrating concept for tumorigenesis and drug discovery , 1999, Journal of Molecular Medicine.

[52]  Röbbe Wünschiers,et al.  Genetic Networks and Anticipation of Gene Expression Patterns , 2004 .

[53]  Theodor Bröcker,et al.  Differentiable Germs and Catastrophes , 1975 .

[54]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[55]  Clifford H. Thurber,et al.  Parameter estimation and inverse problems , 2005 .