Detecting variants with Metabolic Design, a new software tool to design probes for explorative functional DNA microarray development

Background: Microorganisms display vast diversity, and each one has its own set of genes, cell components and metabolic reactions. To assess their huge unexploited metabolic potential in different ecosystems, we need high-throughput tools, such as functional microarrays, that allow the simultaneous analysis of thousands of genes. However, most classical functional microarrays use specific probes that monitor only known sequences, and so fail to cover the full microbial gene diversity present in complex environments. We have thus developed an algorithm, implemented in the user-friendly program Metabolic Design, to design efficient explorative probes.

Results: We first validated our approach by studying eight enzymes involved in the degradation of polycyclic aromatic hydrocarbons from the model strain Sphingomonas paucimobilis sp. EPA505, using a designed microarray of 8,048 probes. As expected, microarray assays identified the targeted set of genes induced during biodegradation kinetics experiments with various pollutants. We then confirmed the identity of these new genes by sequencing, and corroborated the quantitative discrimination of our microarray by quantitative real-time PCR. Finally, we assessed the metabolic capacities of microbial communities in soil contaminated with aromatic hydrocarbons. The results show that our probe design, in terms of both sensitivity and explorative quality, can be used to study a complex environment efficiently.

Conclusions: We successfully used our microarray to detect the expression of genes encoding enzymes involved in polycyclic aromatic hydrocarbon degradation in the model strain. In addition, DNA microarray experiments performed on soil polluted by organic pollutants, without prior sequence assumptions, demonstrated high specificity and sensitivity for gene detection. Metabolic Design is thus a powerful, efficient tool that can be used to design explorative probes and monitor metabolic pathways in complex environments, and it may also be used to study any group of genes. The Metabolic Design software is freely available from the authors and can be downloaded and modified under a general public license.
