Improved annotation through genome-scale metabolic modeling of Aspergillus oryzae

BackgroundSince ancient times the filamentous fungus Aspergillus oryzae has been used in the fermentation industry for the production of fermented sauces and the production of industrial enzymes. Recently, the genome sequence of A. oryzae with 12,074 annotated genes was released but the number of hypothetical proteins accounted for more than 50% of the annotated genes. Considering the industrial importance of this fungus, it is therefore valuable to improve the annotation and further integrate genomic information with biochemical and physiological information available for this microorganism and other related fungi. Here we proposed the gene prediction by construction of an A. oryzae Expressed Sequence Tag (EST) library, sequencing and assembly. We enhanced the function assignment by our developed annotation strategy. The resulting better annotation was used to reconstruct the metabolic network leading to a genome scale metabolic model of A. oryzae.ResultsOur assembled EST sequences we identified 1,046 newly predicted genes in the A. oryzae genome. Furthermore, it was possible to assign putative protein functions to 398 of the newly predicted genes. Noteworthy, our annotation strategy resulted in assignment of new putative functions to 1,469 hypothetical proteins already present in the A. oryzae genome database. Using the substantially improved annotated genome we reconstructed the metabolic network of A. oryzae. This network contains 729 enzymes, 1,314 enzyme-encoding genes, 1,073 metabolites and 1,846 (1,053 unique) biochemical reactions. The metabolic reactions are compartmentalized into the cytosol, the mitochondria, the peroxisome and the extracellular space. Transport steps between the compartments and the extracellular space represent 281 reactions, of which 161 are unique. The metabolic model was validated and shown to correctly describe the phenotypic behavior of A. oryzae grown on different carbon sources.ConclusionA much enhanced annotation of the A. oryzae genome was performed and a genome-scale metabolic model of A. oryzae was reconstructed. The model accurately predicted the growth and biomass yield on different carbon sources. The model serves as an important resource for gaining further insight into our understanding of A. oryzae physiology.

[1]  Eugene W. Myers,et al.  Basic local alignment search tool. Journal of Molecular Biology , 1990 .

[2]  Chittibabu Guda,et al.  TARGET: a new method for predicting protein subcellular localization in eukaryotes , 2005, Bioinform..

[3]  J. Nielsen,et al.  Analysis of Aspergillus nidulans metabolism at the genome-scale , 2008, BMC Genomics.

[4]  Kara Dolinski,et al.  Saccharomyces cerevisiae S288C genome annotation: a working hypothesis , 2006, Yeast.

[5]  Jens Nielsen,et al.  Physiological Engineering Aspects Of Penicillium Chrysogenum , 1997 .

[6]  M Carlsen,et al.  Influence of carbon source on alpha-amylase production by Aspergillus oryzae. , 2001, Applied microbiology and biotechnology.

[7]  J. Nielsen,et al.  From genomes to in silico cells via metabolic networks. , 2005, Current opinion in biotechnology.

[8]  Antoine Quint,et al.  Scalable Vector Graphics , 2020, Definitions.

[9]  B. Palsson,et al.  Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. , 2003, Genome research.

[10]  Christina A. Cuomo,et al.  Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae , 2005, Nature.

[11]  R. Overbeek,et al.  Missing genes in metabolic pathways: a comparative genomics approach. , 2003, Current opinion in chemical biology.

[12]  M. Berriman,et al.  Hot and sexy moulds! , 2006, Nature Reviews Microbiology.

[13]  H. Bonarius,et al.  Flux analysis of underdetermined metabolic networks: the quest for the missing constraints. , 1997 .

[14]  J. Nielsen,et al.  Metabolic model integration of the bibliome, genome, metabolome and reactome of Aspergillus niger , 2008, Molecular systems biology.

[15]  H. Kitano,et al.  Computational systems biology , 2002, Nature.

[16]  Jenn-Kang Hwang,et al.  Prediction of protein subcellular localization , 2006, Proteins.

[17]  Jens Nielsen,et al.  Reconstruction of the central carbon metabolism of Aspergillus niger. , 2003, European journal of biochemistry.

[18]  S. Baker Aspergillus niger genomics: past, present and into the future. , 2006, Medical mycology.

[19]  William H. Majoros,et al.  Corrigendum: Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus , 2006, Nature.

[20]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[21]  David R Westhead,et al.  Annotating the Plasmodium genome and the enigma of the shikimate pathway. , 2004, Trends in parasitology.

[22]  K Asai,et al.  Recognition of human genes by stochastic parsing. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[23]  K. Isono,et al.  Genome sequencing and analysis of Aspergillus oryzae , 2005, Nature.

[24]  Osamu Gotoh,et al.  Homology-based gene structure prediction: simplified matching algorithm using a translated codon (tron) and improved accuracy by allowing for long gaps , 2000, Bioinform..

[25]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[26]  J. Cooney,et al.  Microbodies in fungi: a review , 1990, Journal of Industrial Microbiology.

[27]  B. Palsson,et al.  Metabolic modelling of microbes: the flux-balance approach. , 2002, Environmental microbiology.

[28]  The Aspergilli , 1927, Nature.

[29]  de Winde,et al.  University of Groningen Genome sequencing and analysis of the versatile cell factory Aspergillus niger CBS 513.88 Pel, , 2006 .

[30]  C. Ball,et al.  Saccharomyces Genome Database. , 2002, Methods in enzymology.

[31]  J. A. Roubos,et al.  Genome sequencing and analysis of the versatile cell factory Aspergillus niger CBS 513.88 , 2007, Nature Biotechnology.

[32]  Masayuki Machida,et al.  Whole genome comparison of Aspergillus flavus and A. oryzae. , 2006, Medical mycology.

[33]  William H. Majoros,et al.  Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus , 2005, Nature.

[34]  M. D. de Groot,et al.  Metabolic Control Analysis of Xylose Catabolism in Aspergillus , 2008, Biotechnology progress (Print).

[35]  Steven Salzberg,et al.  GlimmerM, Exonomy and Unveil: three ab initio eukaryotic genefinders , 2003, Nucleic Acids Res..

[36]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[37]  Jens Nielsen,et al.  Identification of Enzymes and Quantification of Metabolic Fluxes in the Wild Type and in a Recombinant Aspergillus oryzae Strain , 1999, Applied and Environmental Microbiology.

[38]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[39]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[40]  Gustavo H. Goldman,et al.  The aspergilli: genomics, medical aspects, biotechnology, and research methods. , 2007 .

[41]  B. Rost Twilight zone of protein sequence alignments. , 1999, Protein engineering.

[42]  Edison T Liu,et al.  Integrative biology and systems biology , 2005, Molecular systems biology.