Gene Model Detection Using Mass Spectrometry

The utility of a genome sequence in biological research depends entirely on the comprehensive description of all of its functional elements. Analysis of genome sequences is still predominantly gene-centric (i.e., identifying gene models/open reading frames). In this article, we describe a proteomics-based method for identifying open reading frames that are missed by computational algorithms. Mass spectrometry-based identification of peptides and proteins from biological samples provide evidence for the expression of the genome sequence at the protein level. This proteogenomic annotation method combines computationally predicted ORFs and the genome sequence with proteomics to identify novel gene models. We also describe our proteogenomic mapping pipeline - a set of computational tools that automate the proteogenomic annotation work flow. This pipeline is available for download at www.agbase.msstate.edu/tools/ .

[1]  Bindu Nanduri,et al.  Proteomic analysis using an unfinished bacterial genome: The effects of subminimum inhibitory concentrations of antibiotics on Mannheimia haemolytica virulence factor expression , 2005, Proteomics.

[2]  E. Birney,et al.  EGASP: the human ENCODE Genome Annotation Assessment Project , 2006, Genome Biology.

[3]  Lincoln Stein,et al.  Genome annotation: from sequence to biology , 2001, Nature Reviews Genetics.

[4]  S. Gottesman Micros for microbes: non-coding regulatory RNAs in bacteria. , 2005, Trends in genetics : TIG.

[5]  Mahalingam Ramkumar,et al.  Quantitative analysis of Streptococcus pneumoniae TIGR4 response to in vitro iron restriction by 2‐D LC ESI MS/MS , 2008, Proteomics.

[6]  Bindu Nanduri,et al.  Comparative Proteomic Analysis of Listeria monocytogenes Strains F2365 and EGD , 2008, Applied and Environmental Microbiology.

[7]  Bindu Nanduri,et al.  Effects of Subminimum Inhibitory Concentrations of Antibiotics on the Pasteurella multocida Proteome: A Systems Approach , 2008, Comparative and functional genomics.

[8]  Jacob D. Jaffe,et al.  Proteogenomic mapping as a complementary method to perform genome annotation , 2004, Proteomics.

[9]  Richard D. Smith,et al.  Whole proteome analysis of post-translational modifications: applications of mass-spectrometry for proteogenomic annotation. , 2007, Genome research.

[10]  Nan Wang,et al.  AgBase: a functional genomics resource for agriculture , 2006, BMC Genomics.

[11]  F. McCarthy,et al.  Modeling a whole organ using proteomics: The avian bursa of Fabricius , 2006, Proteomics.

[12]  Jacob D. Jaffe,et al.  The complete genome and proteome of Mycoplasma mobile. , 2004, Genome research.

[13]  Daniel B. Goodman,et al.  Comparative proteogenomics: combining mass spectrometry and comparative genomics to analyze multiple genomes. , 2008, Genome research.