The temporal distribution of gene duplication events in a set of highly conserved human gene families.

Using a data set of protein translations associated with map positions in the human genome, we identified 1520 mapped highly conserved gene families. By comparing sharing of families between genomic windows, we identified 92 potentially duplicated blocks in the human genome containing 422 duplicated members of these families. Using branching order in the phylogenetic trees, we timed gene duplication events in these families relative to the primate-rodent divergence, the amniote-amphibian divergence, and the deuterostome-protostome divergence. The results showed similar patterns of gene duplication times within duplicated blocks and outside duplicated blocks. Both within and outside duplicated blocks, numerous duplications were timed prior to the deuterostome-protostome divergence, whereas others occurred after the amniote-amphibian divergence. Thus, neither gene duplication in general nor duplication of genomic blocks could be attributed entirely to polyploidization early in vertebrate history. The strongest signal in the data was a tendency for intrachromosomal duplications to be more recent than interchromosomal duplications, consistent with a model whereby tandem duplication-whether of single genes or of genomic blocks-may be followed by eventual separation of duplicates due to chromosomal rearrangements. The rate of separation of tandemly duplicated gene pairs onto separated chromosomes in the human lineage was estimated at 1.7 x 10(-9) per gene-pair per year.

[1]  A. Hughes,et al.  Gene duplication and the structure of eukaryotic genomes. , 2001, Genome research.

[2]  K. Strimmer,et al.  Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree Topologies , 1996 .

[3]  D. Haussler,et al.  Assembly of the working draft of the human genome with GigAssembler. , 2001, Genome research.

[4]  L. Lundin,et al.  Evolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse. , 1993, Genomics.

[5]  Erik L. L. Sonnhammer,et al.  A workbench for large-scale sequence homology analysis , 1994, Comput. Appl. Biosci..

[6]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[7]  Xun Gu,et al.  Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution , 2002, Nature Genetics.

[8]  K. H. Wolfe Yesterday's polyploids and the mystery of diploidization , 2001, Nature Reviews Genetics.

[9]  Karsten Hokamp,et al.  Extensive genomic duplication during early chordate evolution , 2002, Nature Genetics.

[10]  A. Sidow Gen(om)e duplications in the evolution of early vertebrates. , 1996, Current opinion in genetics & development.

[11]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[12]  Austin L. Hughes,et al.  Phylogenies of Developmentally Important Proteins Do Not Support the Hypothesis of Two Rounds of Genome Duplication Early in Vertebrate History , 1999, Journal of Molecular Evolution.

[13]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[14]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[15]  A. Force,et al.  The probability of preservation of a newly arisen gene duplicate. , 2001, Genetics.

[16]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[17]  G. Jékely,et al.  The Evolution of the Calpain Family as Reflected in Paralogous Chromosome Regions , 1999, Journal of Molecular Evolution.

[18]  S. O’Brien,et al.  The promise of comparative genomics in mammals. , 1999, Science.

[19]  A. Hughes The evolution of functionally novel proteins after gene duplication , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[20]  M. Adams,et al.  Recent Segmental Duplications in the Human Genome , 2002, Science.

[21]  N. M. Brooke,et al.  A molecular timescale for vertebrate evolution , 1998, Nature.

[22]  G. Glazko,et al.  Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[23]  A Rzhetsky,et al.  Phylogenetic test of the molecular clock and linearized trees. , 1995, Molecular biology and evolution.

[24]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[25]  A. Hughes,et al.  Phylogenetic tests of the hypothesis of block duplication of homologous genes on human chromosomes 6, 9, and 1. , 1998, Molecular biology and evolution.

[26]  André Gilles,et al.  Evidence of en bloc duplication in vertebrate genomes , 2002, Nature Genetics.

[27]  A. Hughes,et al.  Pattern and timing of gene duplication in animal genomes. , 2001, Genome research.

[28]  Robert L. Carroll,et al.  Vertebrate Paleontology and Evolution , 1988 .

[29]  A. Hughes,et al.  Ancient genome duplications did not structure the human Hox-bearing chromosomes. , 2001, Genome research.

[30]  M. Kasahara,et al.  Chromosomal duplication and the emergence of the adaptive immune system. , 1997, Trends in genetics : TIG.

[31]  A. Meyer,et al.  Gene and genome duplications in vertebrates: the one-to-four (-to-eight in fish) rule and the evolution of novel gene functions. , 1999, Current opinion in cell biology.

[32]  M. Nei,et al.  Molecular Evolution and Phylogenetics , 2000 .

[33]  E. Eichler,et al.  Segmental duplications and the evolution of the primate genome , 2002, Nature Reviews Genetics.

[34]  John C. Wootton,et al.  Statistics of Local Complexity in Amino Acid Sequences and Sequence Databases , 1993, Comput. Chem..