Ancient large-scale genome duplications: phylogenetic and linkage analyses shed light on chordate genome evolution.

Paralogous genes from several families were found in four human chromosome regions (4p16, 5q33-35, 8p12-21, and 10q24-26), suggesting that their common ancestral region underwent several rounds of large-scale duplication. Searches in the EMBL databases, followed by phylogenetic analyses, showed that cognates (orthologs) of human duplicated genes can be found in other vertebrates, including bony fishes. In contrast, within each family, only one gene showing the same high degree of similarity with all the duplicated mammalian genes was found in nonvertebrates (echinoderms, insects, nematodes). This indicates that large-scale duplications occurred after the echinoderms/chordates split and before the bony vertebrate radiation. It has been suggested that two rounds of gene duplication occurred in the vertebrate lineage after the separation of Amphioxus and craniate (vertebrates + Myxini) ancestors. Before these duplications, the genes that have led to the families of paralogous genes in vertebrates must have been physically linked in the craniate ancestor. Linkage of some of these genes can be found in the Drosophila melanogaster and Caenorhabditis elegans genomes, suggesting that they were linked in the triploblast Metazoa ancestor.

[1]  S. Higashijima,et al.  Two FGF-receptor homologues of Drosophila: one is expressed in mesodermal primordium in early embryos. , 1993, Development.

[2]  V. Wheaton,et al.  An ankyrin-related gene (unc-44) is necessary for proper axonal guidance in Caenorhabditis elegans , 1995, The Journal of cell biology.

[3]  D. Steiner,et al.  Evolution of the insulin gene superfamily. Sequence of a preproinsulin-like growth factor cDNA from the Atlantic hagfish. , 1991, The Journal of biological chemistry.

[4]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[5]  D. Steiner,et al.  Evolution of the insulin superfamily: cloning of a hybrid insulin/insulin-like growth factor cDNA from amphioxus. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Sydney Brenner,et al.  The molecular evolution of genes and proteins: a tale of two serines , 1988, Nature.

[7]  N. Leslie,et al.  The chicken urokinase-type plasminogen activator gene. , 1990, The Journal of biological chemistry.

[8]  P. Holland,et al.  Hox genes and chordate evolution. , 1996, Developmental biology.

[9]  E. Fisher,et al.  Paralogy mapping: identification of a region in the human MHC triplicated onto human chromosomes 1 and 9 allows the prediction and isolation of novel PBX and NOTCH loci. , 1996, Genomics.

[10]  C Kappen,et al.  Duplication of large genomic regions during the evolution of vertebrate homeobox genes. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[11]  François Coulier,et al.  Of Worms and Men: An Evolutionary Perspective on the Fibroblast Growth Factor (FGF) and FGF Receptor Families , 1997, Journal of Molecular Evolution.

[12]  Samuel E. Lux,et al.  Analysis of cDNA for human erythrocyte ankyrin indicates a repeated structure with homology to tissue-differentiation and cell-cycle control proteins , 1990, Nature.

[13]  M. Kasahara,et al.  Chromosomal duplication and the emergence of the adaptive immune system. , 1997, Trends in genetics : TIG.

[14]  V. Chapman,et al.  Close physical linkage of the FLT1 and FLT3 genes on chromosome 13 in man and chromosome 5 in mouse. , 1993, Oncogene.

[15]  M. Flajnik,et al.  Isolation of low molecular mass polypeptide complementary DNA clones from primitive vertebrates. Implications for the origin of MHC class I-restricted antigen presentation. , 1996, Journal of immunology.

[16]  E. Volpi,et al.  Detailed physical analysis of a 1.5-megabase YAC contig containing the MXI1 and ADRA2A genes. , 1997, Genomics.

[17]  H. Horvitz,et al.  An FGF receptor signaling pathway is required for the normal cell migrations of the sex myoblasts in C. elegans hermaphrodites , 1995, Cell.

[18]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[19]  P. Buchwald,et al.  Two novel human pancreatic lipase related proteins, hPLRP1 and hPLRP2. Differences in colipase dependence and in lipase activity. , 1992, The Journal of biological chemistry.

[20]  Winston A Hide,et al.  Structure and evolution of the lipase superfamily. , 1992, Journal of lipid research.

[21]  W. Fitch Distinguishing homologous from analogous proteins. , 1970, Systematic zoology.

[22]  T. Bonner,et al.  Cloning and expression of the vesamicol binding protein from the marine ray Torpedo , 1994, FEBS letters.

[23]  M. Caron,et al.  Chromosomal organization of adrenergic receptor genes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[24]  J. Venter,et al.  Cloning, localization, and permanent expression of a Drosophila octopamine receptor , 1990, Neuron.

[25]  Jordi Garcia-Fernàndez,et al.  Archetypal organization of the amphioxus Hox gene cluster , 1994, Nature.

[26]  R. Raff,et al.  Evidence for a clade of nematodes, arthropods and other moulting animals , 1997, Nature.

[27]  M. Adams,et al.  How many genes in the human genome? , 1994, Nature Genetics.

[28]  B. Shilo,et al.  breathless, a Drosophila FGF receptor homolog, is essential for migration of tracheal and specific midline glial cells. , 1992, Genes & development.

[29]  Roderic D. M. Page,et al.  TreeView: an application to display phylogenetic trees on personal computers , 1996, Comput. Appl. Biosci..

[30]  T. Ikemura,et al.  Chromosomal localization of the proteasome Z subunit gene reveals an ancient chromosomal duplication involving the major histocompatibility complex. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[31]  G. Rubin,et al.  The Role of the Genome Project in Determining Gene Function: Insights from Model Organisms , 1996, Cell.

[32]  J. Vincent,et al.  Early Emergence of Three Dopamine D1 Receptor Subtypes in Vertebrates , 1997, The Journal of Biological Chemistry.

[33]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[34]  K. Saigo,et al.  Identification of four FGF receptor genes in Medaka fish (Oryzias latipes) , 1992, FEBS Letters.

[35]  T. Segawa,et al.  [5-Hydroxytryptamine receptors]. , 1989, Nihon yakurigaku zasshi. Folia pharmacologica Japonica.

[36]  A. Bird,et al.  Number of CpG islands and genes in human and mouse. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Margaret R. Thomson,et al.  Vertebrate genome evolution and the zebrafish gene map , 1998, Nature Genetics.

[38]  W. Pearson Rapid and sensitive sequence comparison with FASTP and FASTA. , 1990, Methods in enzymology.

[39]  A. Sidow,et al.  Gene duplications and the origins of vertebrate development. , 1994, Development (Cambridge, England). Supplement.

[40]  N. Risch,et al.  Gene loss and gain in the evolution of the vertebrates. , 1994, Development (Cambridge, England). Supplement.

[41]  G. Wagner,et al.  Phylogenetic reconstruction of vertebrate Hox cluster duplications. , 1997, Molecular biology and evolution.

[42]  A. Alfonso,et al.  The Caenorhabditis elegans unc-17 gene: a putative vesicular acetylcholine transporter. , 1993, Science.

[43]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[44]  D. Birnbaum,et al.  Coamplification in human breast-tumors and physical linkage at chromosomal band 12p13, of ccnd2 and fgf6 genes. , 1994, International journal of oncology.

[45]  Margaret R. Thomson,et al.  Vertebrate genome evolution and the zebrafish gene map , 1998, Nature Genetics.

[46]  S. Brenner,et al.  Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome , 1993, Nature.

[47]  H. Lehrach,et al.  Linkage of TATA-binding protein and proteasome subunit C5 genes in mice and humans reveals synteny conserved between mammals and invertebrates. , 1997, Genomics.

[48]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[49]  J. Stein,et al.  Avian adipose lipoprotein lipase: cDNA sequence and reciprocal regulation of mRNA levels in adipose and heart. , 1989, Biochimica et biophysica acta.

[50]  Davis Cg,et al.  The many faces of epidermal growth factor repeats. , 1990 .

[51]  E. Zifa,et al.  5-Hydroxytryptamine receptors. , 1992, Pharmacological reviews.

[52]  A. Sidow Gen(om)e duplications in the evolution of early vertebrates. , 1996, Current opinion in genetics & development.

[53]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.

[54]  J. Spieth,et al.  Gene structure and organization in Caenorhabditis elegans. , 1996, Current opinion in genetics & development.

[55]  J. Spring,et al.  Vertebrate evolution by interspecific hybridisation – are we polyploid? , 1997, FEBS letters.

[56]  Stephen S. Gisselbrecht,et al.  heartless encodes a fibroblast growth factor receptor (DFR1/DFGF-R2) involved in the directional migration of early mesodermal cells in the Drosophila embryo. , 1996, Genes & development.

[57]  C. Davis,et al.  The many faces of epidermal growth factor repeats. , 1990, The New biologist.

[58]  Barbara J. Wold,et al.  Structural gene sets active in embryos and adult tissues of the sea urchin , 1976, Cell.

[59]  B. Shilo,et al.  Heartless, a Drosophila FGF receptor homolog, is essential for cell migration and establishment of several mesodermal lineages. , 1996, Genes & development.

[60]  L. Lundin,et al.  Evolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse. , 1993, Genomics.