How to usefully compare homologous plant genes and chromosomes as DNA sequences.

There are four sequenced and publicly available plant genomes to date. With many more slated for completion, one challenge will be to use comparative genomic methods to detect novel evolutionary patterns in plant genomes. This research requires sequence alignment algorithms to detect regions of similarity within and among genomes. However, different alignment algorithms are optimized for identifying different types of homologous sequences. This review focuses on plant genome evolution and provides a tutorial for using several sequence alignment algorithms and visualization tools to detect useful patterns of conservation: conserved non-coding sequences, false positive noise, subfunctionalization, synteny, annotation errors, inversions and local duplications. Our tutorial encourages the reader to experiment online with the reviewed tools as a companion to the text.

[1]  F. Delsuc Comparative Genomics , 2010, Lecture Notes in Computer Science.

[2]  M. Freeling,et al.  The evolutionary position of subfunctionalization, downgraded. , 2008, Genome dynamics.

[3]  J. Poulain,et al.  The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla , 2007, Nature.

[4]  Brian C. Thomas,et al.  G-Boxes, Bigfoot Genes, and Environmental Response: Characterization of Intragenomic Conserved Noncoding Sequences in Arabidopsis[W] , 2007, The Plant Cell Online.

[5]  Brian C. Thomas,et al.  Arabidopsis intragenomic conserved noncoding sequence , 2007, Proceedings of the National Academy of Sciences.

[6]  T. Wicker,et al.  Comparison of orthologous loci from small grass genomes Brachypodium and rice: implications for wheat genomics and grass genome annotation. , 2007, The Plant journal : for cell and molecular biology.

[7]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[8]  David A. Nix,et al.  Large-Scale Turnover of Functional Transcription Factor Binding Sites in Drosophila , 2006, PLoS Comput. Biol..

[9]  Brian C. Thomas,et al.  Gene-balanced duplications, like tetraploidy, provide predictable drive to increase morphological complexity. , 2006, Genome research.

[10]  Brian C. Thomas,et al.  Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes. , 2006, Genome research.

[11]  Helen Pearson,et al.  Genetics: What is a gene? , 2006, Nature.

[12]  R W Doerge,et al.  Genomewide Nonadditive Gene Regulation in Arabidopsis Allotetraploids , 2006, Genetics.

[13]  E. Koonin Orthologs, Paralogs, and Evolutionary Genomics 1 , 2005 .

[14]  Steven Maere,et al.  Genome duplication and the origin of angiosperms. , 2005, Trends in ecology & evolution.

[15]  Sarah F. Smith,et al.  Highly conserved regulatory elements around the SHH gene may contribute to the maintenance of conserved synteny across human chromosome 7q36.3. , 2005, Genomics.

[16]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[17]  Burkhard Morgenstern,et al.  Multiple alignment of genomic sequences using CHAOS, DIALIGN and ABC , 2005, Nucleic Acids Res..

[18]  Nicole C Riddle,et al.  Dosage balance in gene regulation: biological implications. , 2005, Trends in genetics : TIG.

[19]  Jonathan F Wendel,et al.  Polyploidy and Genome Evolution in Plants This Review Comes from a Themed Issue on Genome Studies and Molecular Genetics Edited , 2022 .

[20]  M. Kreitman,et al.  Functional Evolution of a cis-Regulatory Module , 2005, PLoS biology.

[21]  Klaas Vandepoele,et al.  Ancient duplication of cereal genomes. , 2005, The New phytologist.

[22]  Klaudia Walter,et al.  Open access, freely available online PLoS BIOLOGY Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2022 .

[23]  E. Koonin Orthologs, paralogs, and evolutionary genomics. , 2005, Annual review of genetics.

[24]  W. Miller,et al.  Mulan: multiple-sequence local alignment and visualization for studying function and evolution. , 2005, Genome research.

[25]  Webb Miller,et al.  Evolution and functional classification of vertebrate gene deserts. , 2005, Genome research.

[26]  Gregory M. Cooper,et al.  ABC: software for interactive browsing of genomic multiple sequence alignment data , 2004, BMC Bioinformatics.

[27]  Hideki Innan,et al.  Very Low Gene Duplication Rate in the Yeast Genome , 2004, Science.

[28]  Georg Haberer,et al.  Transcriptional Similarities, Dissimilarities, and Conservation of cis-Elements in Duplicated Genes of Arabidopsis1[w] , 2004, Plant Physiology.

[29]  Michael Freeling,et al.  Genomic duplication, fractionation and the origin of regulatory novelty. , 2004, Genetics.

[30]  Jens Stoye,et al.  Benchmarking tools for the alignment of functional noncoding DNA , 2004, BMC Bioinformatics.

[31]  A. Bashir,et al.  Conserved noncoding sequences in the grasses. , 2003, Genome research.

[32]  Inna Dubchak,et al.  Glocal alignment: finding rearrangements during alignment , 2003, ISMB.

[33]  S. Moose,et al.  Conserved Noncoding Sequences among Cultivated Cereal Genomes Identify Candidate Regulatory Sequence Elements and Patterns of Promoter Evolution Online version contains Web-only data. Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.010181. , 2003, The Plant Cell Online.

[34]  Chuong B. Do,et al.  Access the most recent version at doi: 10.1101/gr.926603 References , 2003 .

[35]  D. Haussler,et al.  Human-mouse alignments with BLASTZ. , 2003, Genome research.

[36]  Nicholas L. Bray,et al.  AVID: A global alignment program. , 2003, Genome research.

[37]  Michael Brudno,et al.  Fast and sensitive alignment of large genomic sequences , 2002, Proceedings. IEEE Computer Society Bioinformatics Conference.

[38]  S. Goff,et al.  Utility and distribution of conserved noncoding sequences in the grasses , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Lior Pachter,et al.  VISTA : visualizing global DNA sequence alignments of arbitrary length , 2000, Bioinform..

[40]  R. Hardison Conserved noncoding sequences are reliable guides to regulatory elements. , 2000, Trends in genetics : TIG.

[41]  W. Miller,et al.  Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. , 2000, Science.

[42]  R. Gibbs,et al.  PipMaker--a web server for aligning two genomic DNA sequences. , 2000, Genome research.

[43]  A. Force,et al.  The probability of duplicate gene preservation by subfunctionalization. , 2000, Genetics.

[44]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[45]  A. Force,et al.  Preservation of duplicate genes by complementary, degenerative mutations. , 1999, Genetics.

[46]  Burkhard Morgenstern,et al.  DIALIGN2: Improvement of the segment to segment approach to multiple sequence alignment , 1999, German Conference on Bioinformatics.

[47]  Burkhard Morgenstern,et al.  DIALIGN: finding local similarities by multiple sequence alignment , 1998, Bioinform..

[48]  What was the evolutionary synthesis? , 1993, Trends in ecology & evolution.

[49]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[50]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[51]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[52]  Yasuko Takahashi,et al.  Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events , 2022 .