The bioinformatics challenges in comparative analysis of cereal genomes—an overview

Comparative genomic analysis is the cornerstone of in silico approaches to understanding biological systems and processes across cereal species such as rice, wheat and barley, with the aim of identifying genes of agronomic interest. Genomic repositories are nearly doubling in size every year, which has significant implications for the way bioinformatics analyses are carried out. In this overview, the concepts and technology underpinning bioinformatics as applied to comparative genomic analysis are considered in the context of other manuscripts appearing in this issue of Functional and Integrative Genomics.
