Whole-genome phylogeny of mammals: Evolutionary information in genic and nongenic regions

Ten complete mammalian genome sequences were compared by using the “feature frequency profile” (FFP) method of alignment-free comparison. This comparison technique reveals that the whole nongenic portion of mammalian genomes contains evolutionary information that is similar to their genic counterparts—the intron and exon regions. We partitioned the complete genomes of mammals (such as human, chimp, horse, and mouse) into their constituent nongenic, intronic, and exonic components. Phylogenic species trees were constructed for each individual component class of genome sequence data as well as the whole genomes by using standard tree-building algorithms with FFP distances. The phylogenies of the whole genomes and each of the component classes (exonic, intronic, and nongenic regions) have similar topologies, within the optimal feature length range, and all agree well with the evolutionary phylogeny based on a recent large dataset, multispecies, and multigene-based alignment. In the strictest sense, the FFP-based trees are genome phylogenies, not species phylogenies. However, the species phylogeny is highly related to the whole-genome phylogeny. Furthermore, our results reveal that the footprints of evolutionary history are spread throughout the entire length of the whole genome of an organism and are not limited to genes, introns, or short, highly conserved, nongenic sequences that can be adversely affected by factors (such as a choice of sequences, homoplasy, and different mutation rates) resulting in inconsistent species phylogenies.

[1]  S. Brenner,et al.  Late changes in spliceosomal introns define clades in vertebrate evolution. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Gary Benson,et al.  Evolutionary History of Mammalian Transposons Determined by Genome-Wide Defragmentation , 2007, PLoS Comput. Biol..

[3]  Diana J. Kao,et al.  Parallel adaptive radiations in two major clades of placental mammals , 2001, Nature.

[4]  M. Stanhope,et al.  Molecules consolidate the placental mammal tree. , 2004, Trends in ecology & evolution.

[5]  Jianzhi Zhang,et al.  Rates of Conservative and Radical Nonsynonymous Nucleotide Substitutions in Mammalian Nuclear Genes , 2000, Journal of Molecular Evolution.

[6]  O. Madsen,et al.  Sequence gaps join mice and men: phylogenetic evidence from deletions in two proteins. , 2002, Molecular biology and evolution.

[7]  A. Harcourt The Chimpanzees of Gombe. Patterns of Behavior, Jane Goodall. Belknap Press of Harvard University Press, Cambridge, Massachussets (1986), xii, +671. Price $30 , 1988 .

[8]  R. Nowak,et al.  Walker's mammals of the world , 1968 .

[9]  Ying Zhang,et al.  Distance conservation of transcription regulatory motifs in human promoters , 2008, Comput. Biol. Chem..

[10]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[11]  S. O’Brien,et al.  Molecular phylogenetics and the origins of placental mammals , 2001, Nature.

[12]  J. Dopazo,et al.  The human phylome , 2007, Genome Biology.

[13]  Joaquín Dopazo,et al.  PhylomeDB: a database for genome-wide collections of gene phylogenies , 2007, Nucleic Acids Res..

[14]  Pavel A Pevzner,et al.  Mammalian phylogenomics comes of age. , 2004, Trends in genetics : TIG.

[15]  Ting Wang,et al.  The UCSC Genome Browser Database: update 2009 , 2008, Nucleic Acids Res..

[16]  Sudhir Kumar,et al.  Mutation rates in mammalian genomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  J. Thewissen,et al.  A retroposon analysis of Afrotherian phylogeny. , 2005, Molecular biology and evolution.

[18]  M. Kiefmann,et al.  Retroposed Elements as Archives for the Evolutionary History of Placental Mammals , 2006, PLoS biology.

[19]  K. Misawa,et al.  Revisiting the Glires concept--phylogenetic analysis of nuclear sequences. , 2003, Molecular phylogenetics and evolution.

[20]  N Okada,et al.  SINE insertions: powerful tools for molecular systematics. , 2000, BioEssays : news and reviews in molecular, cellular and developmental biology.

[21]  G. Helt,et al.  Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution , 2005, Science.

[22]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[23]  Alexandre Reymond,et al.  Evolutionary Discrimination of Mammalian Conserved Non-Genic Sequences (CNGs) , 2003, Science.

[24]  Nancy F. Hansen,et al.  Comparative analyses of multi-species sequences from targeted genomic regions , 2003, Nature.

[25]  Masami Hasegawa,et al.  Maximum Likelihood Analysis of the Complete Mitochondrial Genomes of Eutherians and a Reevaluation of the Phylogeny of Bats and Insectivores , 2001, Journal of Molecular Evolution.

[26]  G. Petsko My worries are no longer behind me , 2007, Genome Biology.

[27]  E. Harley,et al.  Housekeeping genes for phylogenetic analysis of eutherian relationships. , 2006, Molecular biology and evolution.

[28]  D. Hillis,et al.  Ribosomal RNA secondary structure: compensatory mutations and implications for phylogenetic analysis. , 1993, Molecular biology and evolution.

[29]  P. Holland,et al.  Rare genomic changes as a tool for phylogenetics. , 2000, Trends in ecology & evolution.

[30]  D. Penny,et al.  Genome-scale phylogeny and the detection of systematic biases. , 2004, Molecular biology and evolution.

[31]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[32]  M. Hasegawa,et al.  Pegasoferae, an unexpected mammalian clade revealed by tracking ancient retroposon insertions. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[33]  T. J. Robinson,et al.  Indel evolution of mammalian introns and the utility of non-coding nuclear markers in eutherian phylogenetics. , 2007, Molecular phylogenetics and evolution.

[34]  L. Foulds,et al.  Testing the theory of evolution by comparing phylogenetic trees constructed from five different protein sequences , 1982, Nature.

[35]  C. Pond,et al.  Walker's Mammals of the World, 4th Edition, Ronald M. Nowak, John L. Paradiso. The Johns Hopkins University Press, Baltimore, Maryland (1983), 1xi, +1-568 (Vol. I), xxv+569-1362 (Vol. II). Price $65.00 , 1984 .

[36]  A. Reymond,et al.  Conserved non-genic sequences — an unexpected feature of mammalian genomes , 2005, Nature Reviews Genetics.

[37]  H Kishino,et al.  Freeing phylogenies from artifacts of alignment. , 1992, Molecular biology and evolution.

[38]  A. V. van Zeeland,et al.  Effect of nucleotide excision repair on hprt gene mutations in rodent cells exposed to DNA ethylating agents. , 1997, Mutagenesis.

[39]  F. Delsuc,et al.  Phylogenomics: the beginning of incongruence? , 2006, Trends in genetics : TIG.

[40]  Webb Miller,et al.  Using genomic data to unravel the root of the placental mammal phylogeny. , 2007, Genome research.

[41]  P. Pevzner,et al.  Breakpoint graphs and ancestral genome reconstructions. , 2009, Genome research.

[42]  Bruno Nyffeler,et al.  Early History of Mammals Is Elucidated with the ENCODE Multiple Species Sequencing Data , 2007, PLoS genetics.

[43]  Eric D. Green,et al.  Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets , 2008, Molecular biology and evolution.

[44]  S. Carroll,et al.  Bushes in the Tree of Life , 2006, PLoS biology.

[45]  T H Jukes,et al.  Rates of transition and transversion in coding sequences since the human-rodent divergence. , 1994, Genomics.

[46]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[47]  Se-Ran Jun,et al.  Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions , 2009, Proceedings of the National Academy of Sciences.

[48]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[49]  C R Woese,et al.  Archaeal phylogeny: reexamination of the phylogenetic position of Archaeoglobus fulgidus in light of certain composition-induced artifacts. , 1991, Systematic and applied microbiology.