Improving comparability between microarray probe signals by thermodynamic intensity correction.

Signals from different oligonucleotide probes against the same target show great variation in intensities. However, detection of differences along a sequence e.g. to reveal intron/exon architecture, transcription boundary as well as simple absent/present calls depends on comparisons between different probes. It is therefore of great interest to correct for the variation between probes. Much of this variation is sequence dependent. We demonstrate that a thermodynamic model for hybridization of either DNA or RNA to a DNA microarray, which takes the sequence-dependent probe affinities into account significantly reduces the signal fluctuation between probes targeting the same gene transcript. For a test set of tightly tiled yeast genes, the model reduces the variance by up to a factor ∼1/3. As a consequence of this reduction, the model is shown to yield a more accurate determination of transcription start sites for a subset of yeast genes. In another application, we identify present/absent calls for probes hybridized to the sequenced Escherichia coli strain O157:H7 EDL933. The model improves the correct calls from 85 to 95% relative to raw intensity measures. The model thus makes applications which depend on comparisons between probes aimed at different sections of the same target more reliable.

[1]  Rasmus Wernersson FeatureExtract—extraction of sequence annotation made easy , 2005, Nucleic Acids Res..

[2]  Felix Naef,et al.  Absolute mRNA concentrations from sequence-specific calibration of oligonucleotide arrays. , 2003, Nucleic acids research.

[3]  Koji Hayashi,et al.  Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110 , 2006, Molecular systems biology.

[4]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[5]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Toralf Kirsten,et al.  Interactions in Oligonucleotide Hybrid Duplexes on Microarrays , 2004 .

[7]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[8]  G. Helt,et al.  Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution , 2005, Science.

[9]  Tyson A. Clark,et al.  Genomewide Analysis of mRNA Processing in Yeast Using Splicing-Specific Microarrays , 2002, Science.

[10]  P. Stadler,et al.  Sensitivity of Microarray Oligonucleotide Probes: Variability and Effect of Base Composition , 2004 .

[11]  S. Brunak,et al.  New weakly expressed cell cycle‐regulated genes in yeast , 2005, Yeast.

[12]  D. Ussery,et al.  Design of a Seven-Genome Escherichia coli Microarray for Comparative Genomic Profiling , 2006, Journal of bacteriology.

[13]  G. Phillips,et al.  Identification of transcribed sequences in Arabidopsis thaliana by using high-resolution genome tiling arrays. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[14]  N. W. Davis,et al.  Genome sequence of enterohaemorrhagic Escherichia coli O157:H7 , 2001, Nature.

[15]  R. Holmes,et al.  Shiga-like toxin-converting phages from Escherichia coli strains that cause hemorrhagic colitis or infantile diarrhea. , 1984, Science.

[16]  R. Stoughton,et al.  Experimental annotation of the human genome using microarray technology , 2001, Nature.

[17]  G. Grinstein,et al.  Modeling of DNA microarray data by using physical properties of hybridization , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Zhijin Wu,et al.  Preprocessing of oligonucleotide array data , 2004, Nature Biotechnology.

[19]  Rafael A. Irizarry,et al.  A Model-Based Background Adjustment for Oligonucleotide Expression Arrays , 2004 .

[20]  M. Muir Physical Chemistry , 1888, Nature.

[21]  G. Grinstein,et al.  Relationship between gene expression and observed intensities in DNA microarrays—a modeling study , 2006, Nucleic acids research.

[22]  G. Church,et al.  Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset , 2005, Genome Biology.

[23]  Henrik Bjørn Nielsen,et al.  OligoWiz 2.0—integrating sequence feature annotation into the design of microarray probes , 2005, Nucleic Acids Res..

[24]  K. Aldape,et al.  A model of molecular interactions on short oligonucleotide microarrays , 2003, Nature Biotechnology.

[25]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[26]  J. Sambrook,et al.  Molecular Cloning: A Laboratory Manual , 2001 .

[27]  Steen Knudsen,et al.  Design of oligonucleotides for microarrays and perspectives for design of multi-transcriptome arrays , 2003, Nucleic Acids Res..

[28]  Sandya Liyanarachchi,et al.  A high performance test of differential gene expression for oligonucleotide arrays , 2003, Genome Biology.