Exons – original building blocks of proteins?

In a recent paper, Walter Gilbert's group has estimated the number of original exons from which all extant proteins might have been constructed. The approach used is subjected to a critical analysis here. It is shown that there are flawed assumptions about both the mechanism and generality of exon‐shuffling and in the sequence comparison procedures employed, the latter failing to distinguish chance similarity from similarity due to common ancestry. These methodological errors lead to the omission of many known cases of exonshuffling and the inclusion of others which may not be genuine. In consequence, the analysis from the Gilbert group cannot give a reliable estimate of those modules that actually participated in exon‐shuffling and provides no information on the number of protein archetypes that did not participate in these processes.

[1]  K. Raju,et al.  Primary structures of bovine elastin a, b, and c deduced from the sequences of cDNA clones. , 1987, The Journal of biological chemistry.

[2]  L. Patthy,et al.  Intron‐dependent evolution: Preferred types of exons and introns , 1987, FEBS letters.

[3]  A. Jacquier,et al.  Self-splicing group II and nuclear pre-mRNA introns: how similar are they? , 1990, Trends in biochemical sciences.

[4]  R. Doolittle Similar amino acid sequences: chance or common ancestry? , 1981, Science.

[5]  J. Rosenbloom,et al.  Structure of the 3' portion of the bovine elastin gene. , 1985, Biochemistry.

[6]  Ming-Qun Xu,et al.  Bacterial origin of a chloroplast intron: conserved self-splicing group I introns in cyanobacteria , 1990, Science.

[7]  W. Gilbert Why genes in pieces? , 1978, Nature.

[8]  J. Myers,et al.  Human proα1(I) collagen gene structure reveals evolutionary conservation of a pattern of introns and exons , 1984, Nature.

[9]  P. Newman,et al.  Structure of the human gene encoding granule membrane protein-140, a member of the selectin family of adhesion receptors for leukocytes. , 1990, The Journal of biological chemistry.

[10]  T. Traut Do exons code for structural or functional units in proteins? , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[11]  M Levitt,et al.  Alignment of the amino acid sequences of distantly related proteins using variable gap penalties. , 1986, Protein engineering.

[12]  J. Erickson,et al.  Chlamydomonas reinhardii gene for the 32 000 mol. wt. protein of photosystem II contains four large introns and is located entirely within the chloroplast inverted repeat , 1984, The EMBO journal.

[13]  W. Kuang,et al.  Molecular structure of the human albumin gene is revealed by nucleotide sequence within q11-22 of chromosome 4. , 1986, The Journal of biological chemistry.

[14]  R. Bastos,et al.  Molecular structure of the gene and the 5'-flanking region of the human lymphocyte immunoglobulin E receptor. , 1987, Nucleic acids research.

[15]  M. Mörl,et al.  Integration of group II intron bl1 into a foreign RNA by reversal of the self-splicing reaction in vitro , 1990, Cell.

[16]  John Rogers,et al.  Split-gene evolution: Exon shuffling and intron insertion in serine protease genes , 1985, Nature.

[17]  E. Fuchs,et al.  Remarkable conservation of structure among intermediate filament genes , 1984, Cell.

[18]  D. Hickey,et al.  A general model for the evolution of nuclear pre-mRNA introns. , 1989, Journal of theoretical biology.

[19]  L. Patthy Evolutionary Assembly of Blood Coagulation Proteins , 1990, Seminars in thrombosis and hemostasis.

[20]  P. E. Gibbs,et al.  Origin of structural domains of the serum-albumin gene family and a predicted structure of the gene for vitamin D-binding protein. , 1987, Molecular biology and evolution.

[21]  R. MacGillivray,et al.  Characterization of the human blood coagulation factor XII gene. Intron/exon gene organization and analysis of the 5'-flanking region. , 1987, The Journal of biological chemistry.

[22]  L. Patthy Evolution of blood coagulation and fibrinolysis. , 1990, Blood coagulation & fibrinolysis : an international journal in haemostasis and thrombosis.

[23]  M. Keller,et al.  Structure of the Euglena gracilis chloroplast gene (psbA) coding for the 32‐kDa protein of Photosystem II , 1984 .

[24]  T. Cavalier-smith,et al.  Selfish DNA and the origin of introns , 1985, Nature.

[25]  K. Drickamer,et al.  Characterization of the gene encoding the major rat liver asialoglycoprotein receptor. , 1985, The Journal of biological chemistry.

[26]  L. Patthy,et al.  Detecting distant homologies of mosaic proteins. Analysis of the sequences of thrombomodulin, thrombospondin complement components C9, C8 alpha and C8 beta, vitronectin and plasma cell membrane glycoprotein PC-1. , 1988, Journal of molecular biology.

[27]  R. Owens,et al.  Exon structure of the collagen‐binding domain of human fibronectin , 1986, FEBS letters.

[28]  F. Kafatos,et al.  DNA sequence transfer between two high-cysteine chorion gene families in the silkmoth Bombyx mori. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[29]  L. Patthy Evolution of the proteases of blood coagulation and fibrinolysis by assembly from modules , 1985, Cell.

[30]  W. Gilbert,et al.  How big is the universe of exons? , 1990, Science.

[31]  P. Young,et al.  Intragenic amplification and divergence in the mouse α-fetoprotein gene , 1981, Nature.

[32]  P Argos,et al.  Repeating structure of chick tropoelastin revealed by complementary DNA cloning. , 1987, Biochemistry.

[33]  L. Patthy,et al.  Detecting homology of distantly related proteins with consensus sequences. , 1987, Journal of molecular biology.

[34]  T. Tanaka,et al.  Partial structure of the gene for chicken cartilage proteoglycan core protein. , 1988, The Journal of biological chemistry.

[35]  J. Griffin,et al.  Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils. , 1990, The Journal of biological chemistry.

[36]  H. Lodish,et al.  Structure and organization of the murine band 3 gene. , 1987, The Journal of biological chemistry.

[37]  L Peltonen,et al.  Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[38]  F C Kafatos,et al.  Origin of evolutionary novelty in proteins: how a high-cysteine chorion protein has evolved. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[39]  J. Rogers,et al.  How were introns inserted into nuclear genes? , 1989, Trends in genetics : TIG.

[40]  P. Perlman,et al.  Mobile introns and intron-encoded proteins. , 1989, Science.

[41]  F. Michel,et al.  The introns of the Euglenagracilis chloroplast gene which codes for the 32-kDa protein of photosystem II: Evidence for structural homologies with class II introns , 1985 .

[42]  J. Rogers The role of introns in evolution , 1990, FEBS letters.

[43]  J. Palmer,et al.  An ancient group I intron shared by eubacteria and chloroplasts , 1990, Science.

[44]  M. Gribskov,et al.  [9] Profile analysis , 1990 .

[45]  T. Springer,et al.  Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule. , 1990, The Journal of biological chemistry.

[46]  J. Gingrich,et al.  The Euglena gracilis chloroplast ribulose-1,5-bisphosphate carboxylase gene. I. Complete DNA sequence and analysis of the nine intervening sequences. , 1985, The Journal of biological chemistry.

[47]  R. Doolittle Redundancies in Protein Sequences , 1989 .

[48]  J. Palmer,et al.  The gain of two chloroplast tRNA introns marks the green algal ancestors of land plants , 1990, Nature.

[49]  J. McCarrey Molecular evolution of the human Pgk-2 retroposon. , 1990, Nucleic acids research.

[50]  R F Doolittle,et al.  Similar amino acid sequences revisited. , 1989, Trends in biochemical sciences.