Accurately quantifying low-abundant targets amid similar sequences by revealing hidden correlations in oligonucleotide microarray data

Microarrays have enabled the determination of how thousands of genes are expressed to coordinate function within single organisms. Yet applications to natural or engineered communities where different organisms interact to produce complex properties are hampered by theoretical and technological limitations. Here we describe a general method to accurately identify low-abundant targets in systems containing complex mixtures of homologous targets. We combined an analytical predictor of nonspecific probe–target interactions (cross-hybridization) with an optimization algorithm that iteratively deconvolutes true probe–target signal from raw signal affected by spurious contributions (cross-hybridization, noise, background, and unequal specific hybridization response). The method was capable of quantifying, with unprecedented specificity and accuracy, ribosomal RNA (rRNA) sequences in artificial and natural communities. Controlled experiments with spiked rRNA into artificial and natural communities demonstrated the accuracy of identification and quantitative behavior over different concentration ranges. Finally, we illustrated the power of this methodology for accurate detection of low-abundant targets in natural communities. We accurately identified Vibrio taxa in coastal marine samples at their natural concentrations (<0.05% of total bacteria), despite the high potential for cross-hybridization by hundreds of different coexisting rRNAs, suggesting this methodology should be expandable to any microarray platform and system requiring accurate identification of low-abundant targets amid pools of similar sequences.

[1]  J. Sambrook,et al.  DNA Microarrays: A Molecular Cloning Manual , 2002 .

[2]  S. Acinas,et al.  Fine-scale phylogenetic architecture of a complex bacterial community , 2004, Nature.

[3]  Jizhong Zhou,et al.  Evaluation of 50-mer oligonucleotide arrays for detecting microbial populations in environmental samples. , 2004, BioTechniques.

[4]  Alexander W Peterson,et al.  Hybridization of mismatched or partially matched DNA at surfaces. , 2002, Journal of the American Chemical Society.

[5]  R. Stoughton,et al.  Use of hybridization kinetics for differentiating specific from non-specific binding to oligonucleotide microarrays. , 2002, Nucleic acids research.

[6]  M. Polz,et al.  Diversity and Dynamics of a North Atlantic Coastal Vibrio Community , 2004, Applied and Environmental Microbiology.

[7]  D. Leckband,et al.  A quantitative assessment of heterogeneity for surface-immobilized proteins. , 2001, Analytical chemistry.

[8]  Jizhong Zhou,et al.  Detection of Genes Involved in Biodegradation and Biotransformation in Microbial Communities by Using 50-Mer Oligonucleotide Microarrays , 2004, Applied and Environmental Microbiology.

[9]  G. Grinstein,et al.  Modeling of DNA microarray data by using physical properties of hybridization , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Brendan J. Frey,et al.  GenXHC: a probabilistic generative model for cross-hybridization compensation in high-density genome-wide microarray data , 2005, ISMB.

[11]  Felix Naef,et al.  Absolute mRNA concentrations from sequence-specific calibration of oligonucleotide arrays. , 2003, Nucleic acids research.

[12]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  O. White,et al.  Environmental Genome Shotgun Sequencing of the Sargasso Sea , 2004, Science.

[14]  Rafael A. Irizarry,et al.  Stochastic Models Inspired by Hybridization Theory for Short Oligonucleotide Arrays , 2005, J. Comput. Biol..

[15]  Eliot Marshall,et al.  Getting the Noise Out of Gene Arrays , 2004, Science.

[16]  Jiasen Lu,et al.  Assessment of the sensitivity and specificity of oligonucleotide (50mer) microarrays. , 2000, Nucleic acids research.

[17]  J. A. Aas,et al.  Microbial Risk Indicators of Early Childhood Caries , 2005, Journal of Clinical Microbiology.

[18]  G. W. Hatfield,et al.  DNA microarrays and gene expression , 2002 .

[19]  J. Banfield,et al.  Community structure and metabolism through reconstruction of microbial genomes from the environment , 2004, Nature.

[20]  Michael Wagner,et al.  16S rRNA Gene-Based Oligonucleotide Microarray for Environmental Monitoring of the Betaproteobacterial Order “Rhodocyclales” , 2005, Applied and Environmental Microbiology.

[21]  Joseph L DeRisi,et al.  E-Predict: a computational strategy for species identification based on observed DNA microarray hybridization patterns , 2005, Genome Biology.

[22]  Chanathip Pharino,et al.  Genotypic Diversity Within a Natural Coastal Bacterioplankton Population , 2005, Science.

[23]  Yudong D. He,et al.  Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer , 2001, Nature Biotechnology.

[24]  Jason D. Gans,et al.  Computational Improvements Reveal Great Bacterial Diversity and High Metal Toxicity in Soil , 2005, Science.

[25]  Jean-Marie Rouillard,et al.  OligoArray: genome-scale oligonucleotide design for microarrays , 2002, Bioinform..

[26]  Steen Knudsen,et al.  Design of oligonucleotides for microarrays and perspectives for design of multi-transcriptome arrays , 2003, Nucleic Acids Res..

[27]  Chunlei Wu,et al.  Sequence dependence of cross-hybridization on short oligo microarrays , 2005, Nucleic acids research.

[28]  J. Cavanaugh Biostatistics , 2005, Definitions.

[29]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[30]  K. Aldape,et al.  A model of molecular interactions on short oligonucleotide microarrays , 2003, Nature Biotechnology.

[31]  Dorothea K. Thompson,et al.  Development and Evaluation of Functional Gene Arrays for Detection of Selected Genes in the Environment , 2001, Applied and Environmental Microbiology.

[32]  J. Tiedje,et al.  Quantitative Detection of Microbial Genes by Using DNA Microarrays , 2002, Applied and Environmental Microbiology.

[33]  Stephan Preibisch,et al.  Specific and nonspecific hybridization of oligonucleotide probes on microarrays. , 2004, Biophysical journal.

[34]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[35]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[36]  Michael B. Eisen,et al.  Rapid quantitative profiling of complex microbial populations , 2006, Nucleic acids research.

[37]  E. Southern,et al.  Molecular interactions on microarrays , 1999, Nature Genetics.

[38]  D. Relman The human body as microbial observatory , 2002, Nature Genetics.

[39]  B. Ward,et al.  Oligonucleotide Microarray for the Study of Functional Gene Diversity in the Nitrogen Cycle in the Environment , 2003, Applied and Environmental Microbiology.

[40]  Clemens Richert,et al.  ChipCheck-A Program Predicting Total Hybridization Equilibria for DNA Binding to Small Oligonucleotide Microarrays , 2003, J. Chem. Inf. Comput. Sci..

[41]  N. Stralis-Pavese,et al.  Development and validation of a diagnostic microbial microarray for methanotrophs. , 2003, Environmental microbiology.

[42]  Jörg Peplies,et al.  Application and validation of DNA microarrays for the 16S rRNA-based analysis of marine bacterioplankton. , 2004, Environmental microbiology.

[43]  S. Preibisch,et al.  Base pair interactions and hybridization isotherms of matched and mismatched oligonucleotide probes on microarrays. , 2005, Langmuir : the ACS journal of surfaces and colloids.

[44]  A. Lagarde,et al.  DNA Microarrays: A Molecular Cloning Manual. , 2003 .

[45]  Vincent J. Denef,et al.  Validation of a more sensitive method for using spotted oligonucleotide DNA microarrays for functional genomics studies on bacterial communities. , 2003, Environmental microbiology.

[46]  Philip Hugenholtz,et al.  Impact of Culture-Independent Studies on the Emerging Phylogenetic View of Bacterial Diversity , 1998, Journal of bacteriology.

[47]  John Quackenbush,et al.  Assessing unmodified 70-mer oligonucleotide probe performance on glass-slide microarrays , 2003, Genome Biology.