On non-detects in qPCR data

Motivation: Quantitative real-time PCR (qPCR) is one of the most widely used methods to measure gene expression. Despite extensive research on qPCR laboratory protocols, normalization and statistical analysis, little attention has been given to qPCR non-detects, those reactions that fail to produce a minimum amount of signal.

Results: We show that the common methods of handling qPCR non-detects lead to biased inference. Furthermore, we show that non-detects are not missing completely at random and are likely missing not at random. We propose a model of the missing data mechanism and develop a method to directly model non-detects as missing data. Finally, we show that our approach results in a sizeable reduction in bias when estimating both absolute and differential gene expression.

Availability and implementation: The proposed algorithm is implemented in the R package nondetects. This package also contains the raw data for the three example datasets used in this manuscript. The package is freely available at http://mnmccall.com/software and as part of the Bioconductor project.

Contact: mccallm@gmail.com
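
The bias described under Results can be illustrated with a small simulation. This is a minimal sketch and not the algorithm implemented in the nondetects package: it assumes, purely for illustration, that true Ct values are normally distributed, that the probability of a non-detect follows a logistic curve in the true Ct value, and that the common handling methods are either replacing each non-detect with a fixed cap of 40 cycles or discarding it. The function name `simulate` and all parameter values are hypothetical.

```python
import math
import random
import statistics


def simulate(n=20000, true_mean=35.0, sd=1.5, midpoint=37.0, cap=40.0, seed=1):
    """Compare two simple ways of handling non-detects on simulated Ct values.

    Assumed (not taken from the paper): Gaussian true Ct values and a
    logistic non-detect probability that rises with the true Ct value,
    so the data are missing not at random.
    """
    rng = random.Random(seed)
    true_ct = [rng.gauss(true_mean, sd) for _ in range(n)]

    # A reaction is detected unless a uniform draw falls below the
    # logistic non-detect probability for its true Ct value.
    detected = [
        rng.random() >= 1.0 / (1.0 + math.exp(-(ct - midpoint)))
        for ct in true_ct
    ]

    # Handling 1: replace every non-detect with the cycle cap.
    capped = [ct if d else cap for ct, d in zip(true_ct, detected)]
    # Handling 2: drop non-detects and average the detected values only.
    dropped = [ct for ct, d in zip(true_ct, detected) if d]

    return {
        "true_mean": statistics.fmean(true_ct),
        "capped_mean": statistics.fmean(capped),
        "dropped_mean": statistics.fmean(dropped),
        "nondetect_rate": 1.0 - len(dropped) / n,
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.3f}")
```

Under these assumptions, replacing non-detects with the cap pulls the mean Ct upward (apparent expression too low), while discarding them pulls it downward, because the discarded reactions are disproportionately those with high true Ct values. Neither shortcut recovers the true mean, which is the motivation for modeling the missing data mechanism directly.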
