PIECE: a database for plant gene structure comparison and evolution

Gene families often differ to varying degrees in their exon–intron structures, depending on their distinct evolutionary histories. Comparative analysis of gene structures is important for understanding the evolutionary and functional relationships of genes within plant species. Here, we present a comparative genomics database named PIECE (http://wheat.pw.usda.gov/piece) for Plant Intron and Exon Comparison and Evolution studies. The database contains all the annotated genes extracted from 25 sequenced plant genomes. These genes were classified based on Pfam motifs, and phylogenetic trees were pre-constructed for each gene category. PIECE provides a user-friendly interface for different types of searches and a graphical viewer that, for each gene family, displays a gene structure pattern diagram linked to the resulting bootstrapped dendrogram. The gene structure evolution of orthologous gene groups was determined using the GLOOME, Exalign and GECA software programs, which can be accessed within the database. PIECE also provides a web server version of the GSDraw software for drawing schematic diagrams of gene structures. PIECE is a powerful tool for comparing gene sequences and provides valuable insights into the evolution of gene structure in plant genomes.
