Improved criteria and comparative genomics tool provide new insights into grass paleogenomics

In the past decade, a number of bioinformatics tools have been developed to perform comparative genomics studies in plants and animals. However, most of the publicly available and user friendly tools lack common standards for the identification of robust orthologous relationships between genomes leading non-specialists to often over interpret the results of large scale comparative sequence analyses. Recently, we have established a number of improved parameters and tools to define significant relationships between genomes as a basis to develop paleogenomics studies in grasses. Here, we describe our approaches and propose the development of community-based standards that can be used in comparative genomic studies to (i) identify robust sets of orthologous gene pairs, (ii) derive complete sets of chromosome to chromosome relationships within and between genomes and (iii) model common paleo-ancestor genome structures. The rice and sorghum genome sequences are used to exemplify step-by-step a methodology that should allow users to perform accurate comparative genome analyses in their favourite species. Finally, we describe two applications for accurate gene annotation and synteny-based cloning of agronomically important traits.

[1]  Joachim Messing,et al.  Reconstruction of monocotelydoneous proto-chromosomes reveals faster evolution in plants than in animals , 2009, Proceedings of the National Academy of Sciences.

[2]  S. Praud,et al.  Genomics in cereals: from genome-wide conserved orthologous set (COS) sequences to candidate genes for trait dissection , 2009, Functional & Integrative Genomics.

[3]  Sachin Kumar,et al.  Orthology between genomes of Brachypodium, wheat and rice , 2009, BMC Research Notes.

[4]  J. Messing,et al.  The 'inner circle' of the cereal genomes. , 2009, Current opinion in plant biology.

[5]  Kris Popendorf,et al.  Accurate identification of orthologous segments among multiple genomes , 2009, Bioinform..

[6]  J. Bennetzen,et al.  Comparative sequence analysis of MONOCULM1-orthologous regions in 14 Oryza genomes , 2009, Proceedings of the National Academy of Sciences.

[7]  Caroline Dean,et al.  Growth and development: a broad view of fine detail. , 2009, Current opinion in plant biology.

[8]  Mihaela M. Martis,et al.  The Sorghum bicolor genome and the diversification of grasses , 2009, Nature.

[9]  Haibao Tang,et al.  Unraveling ancient hexaploidy through multiply-aligned angiosperm gene maps. , 2008, Genome research.

[10]  Haibao Tang,et al.  Finding and Comparing Syntenic Regions among Arabidopsis and the Outgroups Papaya, Poplar, and Grape: CoGe with Rosids1[W] , 2008, Plant Physiology.

[11]  Yuji Nagata,et al.  GenomeMatcher: A graphical user interface for DNA sequence comparison , 2008, BMC Bioinformatics.

[12]  Christine Fong,et al.  PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes , 2008, BMC Bioinformatics.

[13]  Jian Pei,et al.  OrthoCluster: a new tool for mining synteny blocks and applications in comparative genomics , 2008, EDBT '08.

[14]  Natalie G Ahn,et al.  Proteomics and genomics: perspectives on drug and target discovery , 2008, Current Opinion in Chemical Biology.

[15]  B. Gill,et al.  Genomic targeting and mapping of tiller inhibition gene (tin3) of wheat using ESTs and synteny with rice , 2008, Functional & Integrative Genomics.

[16]  M. Ferguson-Smith,et al.  Mammalian karyotype evolution , 2007, Nature Reviews Genetics.

[17]  Christophe Périn,et al.  GreenPhylDB: a database for plant comparative genomics , 2007, Nucleic Acids Res..

[18]  Lincoln Stein,et al.  Gramene: a growing plant comparative genomics resource , 2007, Nucleic Acids Res..

[19]  Anthony Levasseur,et al.  Ancestral animal genomes reconstruction. , 2007, Current opinion in immunology.

[20]  Amit U. Sinha,et al.  Cinteny: flexible analysis and visualization of synteny and genome rearrangements in multiple organisms , 2007, BMC Bioinformatics.

[21]  T. Derrien,et al.  AutoGRAPH: an interactive web server for automating and visualizing comparative genome maps , 2007, Bioinform..

[22]  T. Wicker,et al.  Comparison of orthologous loci from small grass genomes Brachypodium and rice: implications for wheat genomics and grass genome annotation. , 2007, The Plant journal : for cell and molecular biology.

[23]  Eileen Kraemer,et al.  SynView: a GBrowse-compatible approach to visualizing comparative genome data , 2006, Bioinform..

[24]  L. Stein,et al.  SynBrowse: a synteny browser for comparative sequence analysis , 2005, Bioinform..

[25]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[26]  李佩芳 International Rice Genome Sequencing Project. 2005. The map-based sequence of the rice genome. , 2005 .

[27]  Pierre Baldi,et al.  Statistical detection of chromosomal homology using shared-gene density alone , 2005, Bioinform..

[28]  Erik L. L. Sonnhammer,et al.  Inparanoid: a comprehensive database of eukaryotic orthologs , 2004, Nucleic Acids Res..

[29]  Steven Salzberg,et al.  DAGchainer: a tool for mining segmental genome duplications and synteny , 2004, Bioinform..

[30]  Pavel A Pevzner,et al.  Mammalian phylogenomics comes of age. , 2004, Trends in genetics : TIG.

[31]  Mark Rebeiz,et al.  GenePalette: a universal software tool for genome sequence visualization and analysis. , 2004, Developmental biology.

[32]  T. J. Robinson,et al.  Cytogenetics and cladistics. , 2004, Systematic biology.

[33]  Greg Elgar,et al.  A Fugu-Human Genome Synteny Viewer: web software for graphical display and annotation reports of synteny between Fugu genomic sequence and human genes. , 2004, Nucleic acids research.

[34]  P. Pevzner,et al.  Reconstructing the genomic architecture of ancestral mammals: lessons from human, mouse, and rat genomes. , 2004, Genome research.

[35]  Inna Dubchak,et al.  Automated whole-genome multiple alignment of rat, mouse, and human. , 2004, Genome research.

[36]  C. Stoeckert,et al.  OrthoMCL: identification of ortholog groups for eukaryotic genomes. , 2003, Genome research.

[37]  Todd J. Vision,et al.  Fast identification and statistical evaluation of segmental homologies in comparative maps , 2003, ISMB.

[38]  P. Baldi,et al.  LineUp: statistical detection of chromosomal homology with application to plant comparative genomics. , 2003, Genome research.

[39]  S. Salzberg,et al.  Using MUMmer to Identify Similar Regions in Large Sequence Sets , 2003, Current protocols in bioinformatics.

[40]  J. Raes,et al.  The automatic detection of homologous regions (ADHoRe) and its application to microcolinearity between Arabidopsis and rice. , 2002, Genome research.

[41]  G. Tesler GRIMM: genome rearrangements web server , 2002, Bioinform..

[42]  K. Theres,et al.  The Lateral suppressor (Ls) gene of tomato encodes a new member of the VHIID protein family. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[44]  R. Durbin,et al.  A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. , 1995, Gene.

[45]  C. Arrecubieta,et al.  Sequence and transcriptional analysis of a DNA region involved in the production of capsular polysaccharide in Streptococcus pneumoniae type 3. , 1995, Gene.

[46]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[47]  G. Muehlbauer,et al.  Genetics and Genomics of the Triticeae , 2009 .

[48]  Sonja J. Prohaska,et al.  BMC Bioinformatics BioMed Central Methodology article SynBlast: Assisting the analysis of conserved synteny information , 2008 .

[49]  J. Salse,et al.  Identification and Characterization of Shared Duplications between Rice and Wheat Provide New Insight into Grass Genome Evolution , 2008 .

[50]  Burkhard Morgenstern,et al.  Alignment of genomic sequences using DIALIGN. , 2007, Methods in molecular biology.

[51]  Jonathan Crabtree,et al.  Sybil: methods and software for multiple genome comparison and visualization. , 2007, Methods in molecular biology.

[52]  R. Varshney,et al.  Genomics-Assisted Crop Improvement , 2007 .

[53]  Bernard B. Suh,et al.  Reconstructing contiguous regions of an ancestral genome. , 2006, Genome research.

[54]  Jia Liu,et al.  The TIGR rice genome annotation resource: annotating the rice genome and creating resources for plant biologists , 2003, Nucleic Acids Res..

[55]  D. Church,et al.  Cross-species sequence comparisons: a review of methods and available resources. , 2003, Genome research.