Normalizing bead-based microRNA expression data: a measurement error model-based approach

MOTIVATION Compared with complementary DNA (cDNA) or messenger RNA (mRNA) microarray data, microRNA (miRNA) microarray data are harder to normalize due to the facts that the total number of miRNAs is small, and that the majority of miRNAs usually have low expression levels. In bead-based microarrays, the hybridization is completed in several pools. As a result, the number of miRNAs tested in each pool is even smaller, which poses extra difficulty to intrasample normalization and ultimately affects the quality of the final profiles assembled from various pools. In this article, we consider a measurement error model-based method for bead-based microarray intrasample normalization. RESULTS In this study, results from quantitative real-time PCR (qRT-PCR) assays are used as 'gold standards' for validation. The performance of the proposed measurement error model-based method is evaluated via a simulation study and real bead-based miRNA expression data. Simulation results show that the new method performs well to assemble complete profiles from subprofiles from various pools. Compared with two intrasample normalization methods recommended by the manufacturer, the proposed approach produces more robust final complete profiles and results in better agreement with the qRT-PCR results in identifying differentially expressed miRNAs, and hence improves the reproducibility between the two microarray platforms. Meaningful results are obtained by the proposed intrasample normalization method, together with quantile normalization as a subsequent complemental intersample normalization method. AVAILABILITY Datasets and R package are available at http://gauss.usouthal.edu/publ/beadsme/.

[1]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[2]  G. Rao The miR-17/92 Polycistron Is Up-regulated in Sonic Hedgehog–Driven Medulloblastomas and Induced by N-myc in Sonic Hedgehog–Treated Cerebellar Neural Precursors , 2010 .

[3]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[4]  Torsten Haferlach,et al.  Distinctive microRNA signature of acute myeloid leukemia bearing cytoplasmic mutated nucleophosmin , 2008, Proceedings of the National Academy of Sciences.

[5]  Bin Wang,et al.  Estimating smooth distribution function in the presence of heteroscedastic measurement errors , 2010, Comput. Stat. Data Anal..

[6]  Giacomo Gamberoni,et al.  Using miRNA expression data for the study of human cancer , 2008 .

[7]  David M. Rocke,et al.  A Model for Measurement Error for Gene Expression Arrays , 2001, J. Comput. Biol..

[8]  H. Horvitz,et al.  MicroRNA expression profiles classify human cancers , 2005, Nature.

[9]  Thomas D. Schmittgen,et al.  Real-time PCR quantification of precursor and mature microRNA. , 2008, Methods.

[10]  Paul K Crane,et al.  lordif: An R Package for Detecting Differential Item Functioning Using Iterative Hybrid Ordinal Logistic Regression/Item Response Theory and Monte Carlo Simulations. , 2011, Journal of statistical software.

[11]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[12]  Bin Wang,et al.  Deconvolution Estimation in Measurement Error Models: The R Package decon. , 2011, Journal of statistical software.

[13]  B. Everitt,et al.  Large sample standard errors of kappa and weighted kappa. , 1969 .

[14]  David M. Rocke,et al.  A Two-Component Model for Measurement Error in Analytical Chemistry , 1995 .

[15]  Monica Chiogna,et al.  A modified LOESS normalization applied to microRNA arrays: a comparative evaluation , 2009, Bioinform..

[16]  Thomas D. Schmittgen,et al.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. , 2001, Methods.

[17]  S. Dudoit,et al.  STATISTICAL METHODS FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN REPLICATED cDNA MICROARRAY EXPERIMENTS , 2002 .

[18]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[19]  Jacob Cohen,et al.  The Equivalence of Weighted Kappa and the Intraclass Correlation Coefficient as Measures of Reliability , 1973 .

[20]  Bin Wang,et al.  A personalized microRNA microarray normalization method using a logistic regression model , 2009, Bioinform..

[21]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[22]  O. Fodstad,et al.  Gene Expression Profiles Classify Human Osteosarcoma Xenografts According to Sensitivity to Doxorubicin, Cisplatin, and Ifosfamide , 2009, Clinical Cancer Research.

[23]  Timothy S Davison,et al.  Analyzing micro-RNA expression using microarrays. , 2006, Methods in enzymology.

[24]  References , 1971 .

[25]  Bin Wang,et al.  Systematic Evaluation of Three microRNA Profiling Platforms: Microarray, Beads Array, and Quantitative Real-Time PCR Array , 2011, PloS one.

[26]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[27]  Sylvain Pradervand,et al.  Impact of normalization on miRNA microarray expression profiling. , 2009, RNA.

[28]  Kazuhiko Hayashi,et al.  Systematic analysis of microRNA expression of RNA extracted from fresh frozen and formalin-fixed paraffin-embedded samples. , 2007, RNA.

[29]  Trey Ideker,et al.  Testing for Differentially-Expressed Genes by Maximum-Likelihood Analysis of Microarray Data , 2000, J. Comput. Biol..