A comparative phylogenetic approach for dating whole genome duplication events

MOTIVATION Whole genome duplications have played a major role in determining the structure of eukaryotic genomes. Current evidence revealing large blocks of duplicated chromatin yields new insights into the evolutionary history of species, but also presents a major challenge for researchers attempting to utilize comparative genomics techniques. Understanding the timing of duplication events relative to divergence among taxa is critical to accurate and comprehensive cross-species comparisons. RESULTS We describe a large-scale approach to estimate the timing of duplication events in a phylogenetic context. The methodology has been previously utilized for analysis of Arabidopsis and Saccharomyces duplication events. This new implementation provides a more flexible and reusable framework for these analyses. Scripts written in the Python programming language drive a number of freely available bioinformatics programs, creating a no-cost tool for researchers. The usefulness of the approach is demonstrated through genome-scale analysis of Arabidopsis and Oryza (rice) duplications. AVAILABILITY Software and documentation are freely available from http://plantgenome.agtec.uga.edu/bioinformatics/dating/

[1]  Karsten Hokamp,et al.  Extensive genomic duplication during early chordate evolution , 2002, Nature Genetics.

[2]  Klaas Vandepoele,et al.  Evidence That Rice and Other Cereals Are Ancient Aneuploids Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.014019. , 2003, The Plant Cell Online.

[3]  W. McCombie,et al.  Syntenic Relationships between Medicago truncatulaand Arabidopsis Reveal Extensive Divergence of Genome Organization1,212 , 2003, Plant Physiology.

[4]  L. Stein,et al.  Comparative genomics between rice and Arabidopsis shows scant collinearity in gene order. , 2001, Genome research.

[5]  Brandon S Gaut,et al.  Patterns of nucleotide substitution among simultaneously duplicated gene pairs in Arabidopsis thaliana. , 2002, Molecular biology and evolution.

[6]  Todd J. Vision,et al.  Fast identification and statistical evaluation of segmental homologies in comparative maps , 2003, ISMB.

[7]  Yasuko Takahashi,et al.  Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events , 2022 .

[8]  Guillaume Blanc,et al.  Structural divergence of chromosomal segments that arose from successive duplication events in the Arabidopsis genome. , 2003, Nucleic acids research.

[9]  Mark Johnston,et al.  Yeast genome duplication was followed by asynchronous differentiation of duplicated genes , 2003, Nature.

[10]  Wen-Hsiung Li,et al.  Divergence in the spatial pattern of gene expression between human duplicate genes. , 2003, Genome research.

[11]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[12]  M. Delseny,et al.  Extensive Duplication and Reshuffling in the Arabidopsis Genome , 2000, Plant Cell.

[13]  Yvan Saeys,et al.  Investigating ancient duplication events in the Arabidopsis genome , 2004, Journal of Structural and Functional Genomics.

[14]  Jonathan A. Eisen,et al.  The age of the Arabidopsis thaliana genome duplication , 2003, Plant Molecular Biology.

[15]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[16]  K. Hokamp,et al.  A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. , 2003, Genome research.

[17]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[18]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[19]  Andreas Wagner,et al.  GenomeHistory: a software tool and its application to fully sequenced genomes. , 2002, Nucleic acids research.

[20]  Brad A. Chapman,et al.  Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events , 2003, Nature.

[21]  D. G. Brown,et al.  The origins of genomic duplications in Arabidopsis. , 2000, Science.

[22]  Thomas Mitchell-Olds,et al.  Plant evolutionary genomics. , 2002, Current opinion in plant biology.

[23]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[24]  A. Paterson,et al.  Comparative mapping of Arabidopsis thaliana and Brassica oleracea chromosomes reveals islands of conserved organization. , 1994, Genetics.

[25]  Daniel G Peterson,et al.  Structure and evolution of cereal genomes. , 2003, Current opinion in genetics & development.

[26]  D. Lockhart,et al.  Functional Genomics , 2002, Springer Netherlands.

[27]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[28]  C. Elsik,et al.  Comparative Genomics of Plant Chromosomes , 2000, Plant Cell.

[29]  Klaas Vandepoele,et al.  The hidden duplication past of Arabidopsis thaliana , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[31]  John P. Huelsenbeck,et al.  MrBayes 3: Bayesian phylogenetic inference under mixed models , 2003, Bioinform..

[32]  J. Raes,et al.  The automatic detection of homologous regions (ADHoRe) and its application to microcolinearity between Arabidopsis and rice. , 2002, Genome research.

[33]  D. Nicolae,et al.  Rapid divergence in expression between duplicate genes inferred from microarray data. , 2002, Trends in genetics : TIG.