Robust transcriptional tumor signatures applicable to both formalin-fixed paraffin-embedded and fresh-frozen samples

Formalin-fixed paraffin-embedded (FFPE) samples represent a valuable resource for clinical researches. However, FFPE samples are usually considered an unreliable source for gene expression analysis due to the partial RNA degradation. In this study, through comparing gene expression profiles between FFPE samples and paired fresh-frozen (FF) samples for three cancer types, we firstly showed that expression measurements of thousands of genes had at least two-fold change in FFPE samples compared with paired FF samples. Therefore, for a transcriptional signature based on risk scores summarized from the expression levels of the signature genes, the risk score thresholds trained from FFPE (or FF) samples could not be applied to FF (or FFPE) samples. On the other hand, we found that more than 90% of the relative expression orderings (REOs) of gene pairs in the FF samples were maintained in their paired FFPE samples and largely unaffected by the storage time. The result suggested that the REOs of gene pairs were highly robust against partial RNA degradation in FFPE samples. Finally, as a case study, we developed a REOs-based signature to distinguish liver cirrhosis from hepatocellular carcinoma (HCC) using FFPE samples. The signature was validated in four datasets of FFPE samples and eight datasets of FF samples. In conclusion, the valuable FFPE samples can be fully exploited to identify REOs-based diagnostic and prognostic signatures which could be robustly applicable to both FF samples and FFPE samples with degraded RNA.

[1]  M. Monden,et al.  Analysis of chemical modification of RNA from formalin-fixed samples and optimization of molecular biology applications for such samples. , 1999, Nucleic acids research.

[2]  A. Nicholson,et al.  Impact of collection and storage of lung tumor tissue on whole genome expression profiling. , 2012, The Journal of molecular diagnostics : JMD.

[3]  Peter Beyerlein,et al.  Robust gene expression and mutation analyses of RNA-sequencing of formalin-fixed diagnostic tumor samples , 2015, Scientific Reports.

[4]  N. Blow Tissue preparation: Tissue issues , 2007, Nature.

[5]  I. Ng,et al.  TCGA whole-transcriptome sequencing data reveals significantly dysregulated genes and signaling pathways in hepatocellular carcinoma , 2015, Frontiers of Medicine.

[6]  Ting Liu,et al.  FOXP1 and SPINK1 reflect the risk of cirrhosis progression to HCC with HBV infection. , 2015, Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie.

[7]  L. Waldron,et al.  mRNA transcript quantification in archival samples using multiplexed, color-coded probes , 2011, BMC biotechnology.

[8]  John D. Minna,et al.  A Five-Gene and Corresponding Protein Signature for Stage-I Lung Adenocarcinoma Prognosis , 2010, Clinical Cancer Research.

[9]  R. Stoyanova,et al.  Successful application of microarray technology to microdissected formalin-fixed, paraffin-embedded tissue. , 2007, The Journal of molecular diagnostics : JMD.

[10]  A. Ensari,et al.  From RNA isolation to microarray analysis: Comparison of methods in FFPE tissues. , 2016, Pathology, research and practice.

[11]  M. Fernö,et al.  An improved genetic system for detection and analysis of protein nuclear import signals. , 2007 .

[12]  Lorenza Mittempergher,et al.  Gene Expression Profiles from Formalin Fixed Paraffin Embedded Breast Cancer Tissue Are Largely Comparable to Fresh Frozen Matched Tissue , 2011, PloS one.

[13]  H. Lee,et al.  Effects of Fixation and Storage of Human Tissue Samples on Nucleic Acid Preservation , 2014, Korean journal of pathology.

[14]  S. L. Chen,et al.  Expression of claudin 10 protein in hepatocellular carcinoma: impact on survival , 2011, Journal of Cancer Research and Clinical Oncology.

[15]  Bin Li,et al.  Extensive increase of microarray signals in cancers calls for novel normalization assumptions , 2011, Comput. Biol. Chem..

[16]  Haiyang Xie,et al.  Cyclin-dependent kinase inhibitor 3 is overexpressed in hepatocellular carcinoma and promotes tumor cell proliferation. , 2012, Biochemical and biophysical research communications.

[17]  Da Yang,et al.  An individualized prognostic signature for gastric cancer patients treated with 5-Fluorouracil-based chemotherapy and distinct multi-omics characteristics of prognostic groups , 2016, Oncotarget.

[18]  B. Leyland-Jones,et al.  DNA extraction from formalin-fixed, paraffin-embedded tissue. , 2009, Cold Spring Harbor protocols.

[19]  Dongsheng Tu,et al.  Differential Role of Estrogen Receptor Beta in Early Versus Metastatic Non-small Cell Lung Cancer , 2012, Hormones and Cancer.

[20]  S. Hamilton-Dutoit,et al.  Towards quantitative mRNA analysis in paraffin-embedded tissues using real-time reverse transcriptase-polymerase chain reaction: a methodological study on lymph nodes from melanoma patients. , 2003, The Journal of molecular diagnostics : JMD.

[21]  Crispin J. Miller,et al.  Methods comparison for high-resolution transcriptional analysis of archival material on Affymetrix Plus 2.0 and Exon 1.0 microarrays. , 2009, BioTechniques.

[22]  Joel S. Parker,et al.  Adjustment of systematic microarray data biases , 2004, Bioinform..

[23]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[24]  Andrew B. Nobel,et al.  Merging two gene-expression studies via cross-platform normalization , 2008, Bioinform..

[25]  H. Allgayer,et al.  Look who's talking: deregulated signaling in colorectal cancer. , 2012, Cancer genomics & proteomics.

[26]  Libin Chen,et al.  Critical limitations of prognostic signatures based on risk scores summarized from gene expression levels: a case study for resected stage I non-small-cell lung cancer , 2016, Briefings Bioinform..

[27]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[28]  Silke von Ahlfen,et al.  Determinants of RNA Quality from FFPE Samples , 2007, PloS one.

[29]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[30]  Martin Weisser,et al.  An optimized workflow for improved gene expression profiling for formalin-fixed, paraffin-embedded tumor samples , 2013, Journal of Clinical Bioinformatics.

[31]  Peter Regitnig,et al.  Genomic index of sensitivity to endocrine therapy for breast cancer. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[32]  J. Minna,et al.  A 12-Gene Set Predicts Survival Benefits from Adjuvant Chemotherapy in Non–Small Cell Lung Cancer Patients , 2013, Clinical Cancer Research.

[33]  T. Sørlie,et al.  Reliable PCR quantitation of estrogen, progesterone and ERBB2 receptor mRNA from formalin-fixed, paraffin-embedded tissue is independent of prior macro-dissection , 2013, Virchows Archiv.

[34]  P. Provero,et al.  Accurate Data Processing Improves the Reliability of Affymetrix Gene Expression Profiles from FFPE Samples , 2014, PloS one.

[35]  Zheng Guo,et al.  Differential expression analysis for individual cancer samples based on robust within-sample relative gene expression orderings across multiple profiling platforms , 2016, Oncotarget.

[36]  Anne-Mette K. Hein,et al.  Next-Generation Sequencing of RNA and DNA Isolated from Paired Fresh-Frozen and Formalin-Fixed Paraffin-Embedded Samples of Human Cancer and Normal Tissue , 2014, PloS one.

[37]  Daniel Q. Naiman,et al.  Statistical Applications in Genetics and Molecular Biology Classifying Gene Expression Profiles from Pairwise mRNA Comparisons , 2011 .

[38]  Yunyan Gu,et al.  A relative ordering-based predictor for tamoxifen-treated estrogen receptor-positive breast cancer patients: multi-laboratory cohort validation , 2013, Breast Cancer Research and Treatment.

[39]  Charles M Perou,et al.  Agreement in breast cancer classification between microarray and quantitative reverse transcription PCR from fresh-frozen and formalin-fixed, paraffin-embedded tissues. , 2007, Clinical chemistry.

[40]  M. Cronin,et al.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. , 2004, The New England journal of medicine.

[41]  Tieliu Shi,et al.  A comparison of batch effect removal methods for enhancement of prediction performance using MAQC-II microarray gene expression data , 2010, The Pharmacogenomics Journal.

[42]  T. Golub,et al.  Expression Profiling of Archival Tumors for Long-term Health Studies , 2012, Clinical Cancer Research.

[43]  Zheng Guo,et al.  Tamoxifen therapy benefit predictive signature coupled with prognostic signature of post-operative recurrent risk for early stage ER+ breast cancer , 2015, Oncotarget.

[44]  X. Chen,et al.  Comparison of Nanostring nCounter® Data on FFPE Colon Cancer Samples and Affymetrix Microarray Data on Matched Frozen Tissues , 2016, PloS one.

[45]  Thomas E. Royce,et al.  Whole-Genome Gene Expression Profiling of Formalin-Fixed, Paraffin-Embedded Tissue Samples , 2009, PloS one.

[46]  Daniel Q. Naiman,et al.  Simple decision rules for classifying human cancers from gene expression profiles , 2005, Bioinform..

[47]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[48]  Krishna R. Kalari,et al.  Gene Expression, Single Nucleotide Variant and Fusion Transcript Discovery in Archival Material from Breast Tumors , 2013, PloS one.

[49]  M. Vijver,et al.  Technology Insight: tuning into the genetic orchestra using microarrays—limitations of DNA microarrays in clinical practice , 2006, Nature Clinical Practice Oncology.

[50]  L. Buturovic,et al.  Validation and reproducibility of a microarray-based gene expression test for tumor identification in formalin-fixed, paraffin-embedded specimens. , 2011, The Journal of molecular diagnostics : JMD.

[51]  R. Gascoyne,et al.  A novel method of amplification of FFPET-derived RNA enables accurate disease classification with microarrays. , 2010, The Journal of molecular diagnostics : JMD.

[52]  T. Godfrey,et al.  Quantitative mRNA expression analysis from formalin-fixed, paraffin-embedded tissues using 5' nuclease quantitative reverse transcription-polymerase chain reaction. , 2000, The Journal of molecular diagnostics : JMD.

[53]  Jennifer Clarke,et al.  Statistical expression deconvolution from mixed tissue samples , 2010, Bioinform..

[54]  J. Vandesompele,et al.  Straightforward and sensitive RT-qPCR based gene expression analysis of FFPE samples , 2016, Scientific Reports.

[55]  W. Leong,et al.  Clinical relevance of DNA microarray analyses using archival formalin-fixed paraffin-embedded breast cancer specimens. , 2011, BMC cancer.

[56]  J. Foekens,et al.  A Method to Correlate mRNA Expression Datasets Obtained from Fresh Frozen and Formalin-Fixed, Paraffin-Embedded Tissue Samples: A Matter of Thresholds , 2015, PloS one.

[57]  Y. Morine,et al.  High NEK2 Expression Is a Predictor of Tumor Recurrence in Hepatocellular Carcinoma Patients After Hepatectomy. , 2016, Anticancer Research.

[58]  R. Boyce,et al.  Preliminary Comparison of Quantity, Quality, and Microarray Performance of RNA Extracted From Formalin-fixed, Paraffin-embedded, and Unfixed Frozen Tissue Samples , 2006, The journal of histochemistry and cytochemistry : official journal of the Histochemistry Society.

[59]  C. Möller-Levet,et al.  Exon-array profiling unlocks clinically and biologically relevant gene signatures from formalin-fixed paraffin-embedded tumour samples , 2011, British Journal of Cancer.

[60]  Qiang Sun,et al.  Individual-level analysis of differential expression of genes and pathways for personalized medicine , 2015, Bioinform..

[61]  Donald Geman,et al.  Merging microarray data from separate breast cancer studies provides a robust prognostic test , 2008, BMC Bioinformatics.

[62]  Rainer Breitling,et al.  Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments , 2004, FEBS letters.

[63]  Xianxiao Zhou,et al.  Rank-based predictors for response and prognosis of neoadjuvant taxane-anthracycline-based chemotherapy in breast cancer , 2013, Breast Cancer Research and Treatment.

[64]  B. Nielsen,et al.  Gene-expression analysis of a colorectal cancer-specific discriminatory transcript set on formalin-fixed, paraffin-embedded (FFPE) tissue samples , 2015, Diagnostic Pathology.

[65]  E. Lander Array of hope , 1999, Nature Genetics.

[66]  Kai Song,et al.  A rank-based transcriptional signature for predicting relapse risk of stage II colorectal cancer identified with proper data sources , 2016, Oncotarget.

[67]  David Venet,et al.  Most Random Gene Expression Signatures Are Significantly Associated with Breast Cancer Outcome , 2011, PLoS Comput. Biol..

[68]  Richard D. Kennedy,et al.  RNA expression analysis from formalin fixed paraffin embedded tissues , 2008, Histochemistry and Cell Biology.

[69]  Sergio Contrino,et al.  ArrayExpress—a public repository for microarray gene expression data at the EBI , 2004, Nucleic Acids Res..

[70]  S. Cheong,et al.  Transcriptional profiling of oral squamous cell carcinoma using formalin-fixed paraffin-embedded samples. , 2010, Oral oncology.

[71]  Jean-Pierre A. Kocher,et al.  A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis , 2013, Bioinform..

[72]  Zheng Guo,et al.  An individualized prognostic signature and multi-omics distinction for early stage hepatocellular carcinoma patients with surgical resection , 2016, Oncotarget.