The practical problems of post-genomic biology

describes the emergence of new postgenomic disciplines like proteomics, epigenomics, and phenomics. He proposes that integrated “omic” databases in the future will allow the in silico re-enacting of the entire organism once the totality of the information at the various levels, from genome to proteome and functions, is available. This is an attractive idea: just think of virtual organisms that could serve for virtual clinical trials. However, the “omic” strategy is based on the same linear, monocausal, deterministic thinking that has reigned in molecular biology over the past decades2; it just extrapolates it to the entire genome. This is brute-force, genocentric reductionism in the guise of entireness, rather than a novel integrative approach devoted to wholeness, or as Nature Biotechnology put it in an editorial: “Complicated is not complex.”3 In fact, the present efforts in the drug development industry to simulate organismal function still ignore the very elementary parts, and instead employ the traditional “top-down” modeling approach of physiologists and systems engineers, in which organs and cells are black boxes. Such approaches vastly dominated a recent workshop on computer modeling in biology, despite the promising conference title: “From gene to organ.”* With their dream of simulating organisms “from bottom-up,” genomics scientists touch upon the old riddle of genome-to-phenome mapping. They will have to overcome the mentality of data collecting, clustering, and classification, and develop a qualitatively new conceptual understanding of the complexity of living organisms. Help in tackling this daunting task is already on its way: not only are physical scientists increasingly interested in studying complex systems in general, but one of the first institutions devoted to such studies, the New England Complex Systems Institute (NECSI) in Boston, MA, has launched collaborations with molecular biologists to harness the data flood triggered by the success of genomic technology. To counter, but not stifle Evans’ enthusiasm, let me give an overview of some of the major obstacles and challenges that we will have to confront in a post-genomic endeavor that will ultimately lead to the design of cyber-guinea pigs based on our genomic databases: Ontological questions concerning database structure and representation of “gene function”. A major goal of making genome databases more user-friendly is the systematic annotation of genes with functional properties, such as the biochemical function, interaction partners, and physiological role. This has worked pretty well for simple organisms, as demonstrated in the Saccharomyces Genome Database4. However, in more complex organisms, the same task represents an ontological challenge, as most proteins are embedded in a regulatory network and don’t just code for metabolic enzymes. For instance, genes such as ras, myc, rho, and NF-κB either stimulate growth and survival, or they induce apoptosis. The “function” of a gene product appears to depend on its “cellular context.” This raises fundamental questions. For example, what is a “gene function?” What is its “context?” And what is the modular entity to which the concept of context applies: a protein domain, a single protein, a stable complex, or a whole pathway? Recently, the principle of “modularity” in cellular processes has been proposed to facilitate the task of dealing with biological functions5. For example, a specific signal transduction pathway could be viewed as a functional module. However, it remains to be seen whether modules are “natural kinds” or just logical constructs of our mind and how their boundaries have to be delineated given the extensive “crosstalk” between the historically defined pathways. These questions must be addressed first before useful higher level “omic” databases can be constructed. Limitations of computational power. In proposing the in silico re-enacting of organismal function, Evans seems to ignore the immensity of computational capacity required. Take the human genome with 100,000 genes and let every gene be simply either “on” (expressed) or “off ” (silent). This minimal, idealized, and discrete setting alone would lead to the astronomical number of 1030,000 possible gene expression profiles! The computing and testing of all these patterns with the existing serial computers would take more time than the age of

[1]  Richard C. Strohman,et al.  The coming Kuhnian revolution in biology , 1997, Nature Biotechnology.

[2]  D. S. Coffey Self-organization, complexity and chaos: The new biology for medicine , 1998, Nature Medicine.

[3]  U. Alon,et al.  Robustness in bacterial chemotaxis , 2022 .

[4]  Glen A. Evans,et al.  Designer science and the “omic” revolution , 2000, Nature Biotechnology.

[5]  M. Savageau Michaelis-Menten mechanism reconsidered: implications of fractal kinetics. , 1995, Journal of theoretical biology.

[6]  A. Minton,et al.  Macromolecular crowding: biochemical, biophysical, and physiological consequences. , 1993, Annual review of biophysics and biomolecular structure.

[7]  B. Palsson,et al.  The underlying pathway structure of biochemical reaction networks. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[8]  D. Wood,et al.  Computation with biomolecules. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Kara Dolinski,et al.  Integrating functional genomic information into the Saccharomyces Genome Database , 2000, Nucleic Acids Res..

[10]  D. Fell,et al.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks , 2000, Nature Biotechnology.

[11]  Sui Huang Gene expression profiling, genetic networks, and cellular states: an integrating concept for tumorigenesis and drug discovery , 1999, Journal of Molecular Medicine.

[12]  H. Kacser,et al.  MOlecular democracy: who shares the controls? , 1979, Biochemical Society transactions.

[13]  A. Arkin,et al.  It's a noisy business! Genetic regulation at the nanomolar scale. , 1999, Trends in genetics : TIG.

[14]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[15]  Stuart A. Kauffman,et al.  ORIGINS OF ORDER , 2019, Origins of Order.

[16]  Peter V. E. McClintock,et al.  Random fluctuations: Unsolved problems of noise , 1999, Nature.

[17]  A. Arkin,et al.  Simulation of prokaryotic genetic circuits. , 1998, Annual review of biophysics and biomolecular structure.

[18]  Yaneer Bar-Yam,et al.  Dynamics Of Complex Systems , 2019 .

[19]  Donald E. Ingber,et al.  The structural and mechanical complexity of cell-growth control , 1999, Nature Cell Biology.