A disattenuated correlation estimate when variables are measured with error: Illustration estimating cross‐platform correlations

Previous cross-platform reproducibility studies have compared consistency of intensities as well as consistency of fold changes across different platforms using Pearson's correlation coefficient. In this study, we propose the use of measurement error models for estimating gene-specific correlations. Additionally, gene-specific reliability estimates are shown to be useful in prioritizing clones for sequence verification rather than selecting clones using a simple random sample. The proposed 'disattenuated' correlation may prove useful in a wide variety of studies when both X and Y are measured with error, such as in confirmation studies of microarray gene expression values, wherein more reliable laboratory assays such as real-time polymerase chain reaction are used.

[1]  Kazuho Ikeo,et al.  CIBEX: center for information biology gene expression database. , 2003, Comptes rendus biologies.

[2]  Wei-Min Liu,et al.  Robust estimators for expression analysis , 2002, Bioinform..

[3]  W. Wong,et al.  Functional annotation and network reconstruction through cross-platform integration of microarray data , 2005, Nature Biotechnology.

[4]  Zoltan Szallasi,et al.  Increased measurement accuracy for sequence-verified microarray probes. , 2004, Physiological genomics.

[5]  Z. Szallasi,et al.  Sequence-matched probes produce increased cross-platform consistency and more reproducible biological results in microarray-based gene expression measurements. , 2004, Nucleic acids research.

[6]  B. De Moor,et al.  Comparison and meta-analysis of microarray data: from the bench to the computer desk. , 2003, Trends in genetics : TIG.

[7]  P. Collins,et al.  Performance comparison of one-color and two-color platforms within the Microarray Quality Control (MAQC) project , 2006, Nature Biotechnology.

[8]  John Quackenbush,et al.  Multiple-laboratory comparison of microarray platforms , 2005, Nature Methods.

[9]  Charles L. Kooperberg,et al.  Improved Background Correction for Spotted DNA Microarrays , 2002, J. Comput. Biol..

[10]  Lucila Ohno-Machado,et al.  Analysis of matched mRNA measurements from two different microarray technologies , 2002, Bioinform..

[11]  Catalin C. Barbacioru,et al.  Evaluation of DNA microarray results with quantitative gene expression platforms , 2006, Nature Biotechnology.

[12]  P. Brown,et al.  Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Kathleen F. Kerr,et al.  Evaluation of methods for oligonucleotide array data via quantitative real-time PCR , 2006, BMC Bioinformatics.

[14]  Sergio Contrino,et al.  ArrayExpress—a public repository for microarray gene expression data at the EBI , 2004, Nucleic Acids Res..

[15]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[16]  Catherine Brooksbank,et al.  An open letter to the scientific journals , 2002, Bioinform..

[17]  J. Neter,et al.  Applied Linear Regression Models , 1983 .

[18]  Gavin Sherlock,et al.  Funding high-throughput data sharing , 2004, Nature Biotechnology.

[19]  M. Dosemeci,et al.  Occupational exposure to ionizing radiation and electromagnetic fields in relation to the risk of thyroid cancer in Sweden. , 2006, Scandinavian journal of work, environment & health.

[20]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[21]  Dennis B. Troup,et al.  NCBI GEO: mining millions of expression profiles—database and tools , 2004, Nucleic Acids Res..

[22]  Richard Simon,et al.  A random variance model for detection of differential gene expression in small microarray experiments , 2003, Bioinform..

[23]  Gavin Sherlock,et al.  The Stanford Microarray Database accommodates additional microarray platforms and data formats , 2004, Nucleic Acids Res..

[24]  Isaac S. Kohane,et al.  Redefinition of Affymetrix probe sets by sequence overlap with cDNA microarray probes reduces cross-platform inconsistencies in cancer-associated gene expression measurements , 2005, BMC Bioinformatics.

[25]  J. Bunn,et al.  Comparing dietary recall data for mothers and children obtained on two occasions in a case-control study of environmental factors and childhood brain tumours. , 1997, International journal of epidemiology.

[26]  Debashis Ghosh,et al.  Statistical issues and methods for meta-analysis of microarray data: a case study in prostate cancer , 2003, Functional & Integrative Genomics.

[27]  M. Nieuwenhuijsen,et al.  Chlorination disinfection byproducts in water and their association with adverse reproductive outcomes: a review , 2000, Occupational and environmental medicine.

[28]  T. Barrette,et al.  Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. , 2002, Cancer research.

[29]  Christian Pilarsky,et al.  Meta-analysis of microarray data on pancreatic cancer defines a set of commonly dysregulated genes , 2005, Oncogene.

[30]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[31]  Hanlee P. Ji,et al.  The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. , 2006, Nature biotechnology.

[32]  James W. Hardin,et al.  The Regression-calibration Method for Fitting Generalized Linear Models with Additive Measurement Error , 2003 .

[33]  Kevin R. Coombes,et al.  Differences in gene expression between B-cell chronic lymphocytic leukemia and normal B cells: a meta-analysis of three microarray studies , 2004, Bioinform..

[34]  R. Lempicki,et al.  Evaluation of gene expression measurements from commercial microarray platforms. , 2003, Nucleic acids research.

[35]  H. Ledebur,et al.  A mammalian artificial chromosome engineering system (ACE System) applicable to biopharmaceutical protein production, transgenesis and gene-based cell therapy. , 2004, Nucleic acids research.

[36]  Fuad G. Gwadry,et al.  Comparing cDNA and oligonucleotide array data: concordance of gene expression across platforms for the NCI-60 cancer cells , 2003, Genome Biology.

[37]  Lei Liu,et al.  A study of inter-lab and inter-platform agreement of DNA microarray data , 2005, BMC Genomics.

[38]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[39]  C. Garrett,et al.  Microarrays in cancer research. , 2004, Anticancer Research.

[40]  Giovanni Parmigiani,et al.  A Cross-Study Comparison of Gene Expression Studies for the Molecular Classification of Lung Cancer , 2004, Clinical Cancer Research.

[41]  Ravi Kothapalli,et al.  Microarray results: how accurate are they? , 2002, BMC Bioinformatics.