Reference-free transcriptome exploration reveals novel RNAs for prostate cancer diagnosis

A reference-free computational workflow for the discovery of unannotated RNA biomarkers retrieves novel noncoding RNAs with high potential for prostate cancer diagnosis. The use of RNA-sequencing technologies held a promise of improved diagnostic tools based on comprehensive transcript sets. However, mining human transcriptome data for disease biomarkers in clinical specimens are restricted by the limited power of conventional reference-based protocols relying on unique and annotated transcripts. Here, we implemented a blind reference-free computational protocol, DE-kupl, to infer yet unreferenced RNA variations from total stranded RNA-sequencing datasets of tissue origin. As a bench test, this protocol was powered for detection of RNA subsequences embedded into putative long noncoding (lnc)RNAs expressed in prostate cancer. Through filtering of 1,179 candidates, we defined 21 lncRNAs that were further validated by NanoString for robust tumor-specific expression in 144 tissue specimens. Predictive modeling yielded a restricted probe panel enabling more than 90% of true-positive detections of cancer in an independent The Cancer Genome Atlas cohort. Remarkably, this clinical signature made of only nine unannotated lncRNAs largely outperformed PCA3, the only used prostate cancer lncRNA biomarker, in detection of high-risk tumors. This modular workflow is highly sensitive and can be applied to any pathology or clinical application.

[1]  D. Gautheret,et al.  Bridging the gap between reference and real transcriptomes , 2019, Genome Biology.

[2]  Jian Zhang,et al.  SEdb: a comprehensive human super-enhancer database , 2018, Nucleic Acids Res..

[3]  B. Trock,et al.  Validation of the Decipher Test for predicting adverse pathology in candidates for prostate cancer active surveillance , 2018, Prostate Cancer and Prostatic Diseases.

[4]  E. Leucci Cancer development and therapy resistance: spotlights on the dark side of the genome. , 2018, Pharmacology & therapeutics.

[5]  A. Frankish,et al.  Towards a complete map of the human long non-coding RNA transcriptome , 2018, Nature Reviews Genetics.

[6]  Hao Chen,et al.  Tumor-Derived Exosomal Long Noncoding RNAs as Promising Diagnostic Biomarkers for Prostate Cancer , 2018, Cellular Physiology and Biochemistry.

[7]  Shanrong Zhao,et al.  Evaluation of two main RNA-seq approaches for gene quantification in clinical RNA sequencing: polyA+ selection versus rRNA depletion , 2018, Scientific Reports.

[8]  Daniel Gautheret,et al.  DE-kupl: exhaustive capture of biological variation in RNA-seq data through k-mer decomposition , 2017, Genome Biology.

[9]  Ira W. Deveson,et al.  Universal alternative splicing of noncoding exons , 2017, bioRxiv.

[10]  Jie Tang,et al.  Environmental Research and Public Health Long Non-coding Rna as Potential Biomarker for Prostate Cancer: Is It Making a Difference? , 2022 .

[11]  E. Klein,et al.  Low PCA3 expression is a marker of poor differentiation in localized prostate tumors: exploratory analysis from 12,076 patients , 2017, Oncotarget.

[12]  A. Rosenkrantz,et al.  Risk Stratification by Urinary Prostate Cancer Gene 3 Testing Before Magnetic Resonance Imaging-Ultrasound Fusion-targeted Prostate Biopsy Among Men With No History of Biopsy. , 2017, Urology.

[13]  A. Tewari,et al.  The Use of Biomarkers in Prostate Cancer Screening and Treatment. , 2017, Reviews in urology.

[14]  A. Morillon,et al.  History, Discovery, and Classification of lncRNAs. , 2017, Advances in experimental medicine and biology.

[15]  Asha A. Nair,et al.  Impact of RNA degradation on fusion detection by RNA-seq , 2016, BMC Genomics.

[16]  Roland Eils,et al.  Complex heatmaps reveal patterns and correlations in multidimensional genomic data , 2016, Bioinform..

[17]  Gianluca Bontempi,et al.  Portraying breast cancers with long noncoding RNAs , 2016, Science Advances.

[18]  Howard Y. Chang,et al.  Long Noncoding RNAs in Cancer Pathways. , 2016, Cancer cell.

[19]  Howard Y. Chang,et al.  Unique features of long non-coding RNA biogenesis and function , 2015, Nature Reviews Genetics.

[20]  Kimberly R. Kukurba,et al.  RNA Sequencing and Analysis. , 2015, Cold Spring Harbor protocols.

[21]  G. Calin,et al.  The Clinical Relevance of Long Non-Coding RNAs in Cancer , 2015, Cancers.

[22]  D. Gautheret,et al.  VING: a software for visualization of deep sequencing signals , 2015, BMC Research Notes.

[23]  R. Sidman,et al.  PRUNE2 is a human prostate cancer suppressor regulated by the intronic long noncoding RNA PCA3 , 2015, Proceedings of the National Academy of Sciences.

[24]  S. Dhanasekaran,et al.  The landscape of long noncoding RNAs in the human transcriptome , 2015, Nature Genetics.

[25]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[26]  J. Hou,et al.  Development and prospective multicenter evaluation of the long noncoding RNA MALAT-1 as a diagnostic urinary biomarker for prostate cancer , 2014, Oncotarget.

[27]  A. Pizarro,et al.  Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data , 2014, bioRxiv.

[28]  C. Perou,et al.  Comparison of RNA-Seq by poly (A) capture, ribosomal RNA depletion, and DNA microarray for expression profiling , 2014, BMC Genomics.

[29]  Ruth Etzioni,et al.  Overdiagnosis and overtreatment of prostate cancer. , 2014, European urology.

[30]  P. Kantoff,et al.  Enhancer RNAs participate in androgen receptor-driven looping that selectively enhances gene activation , 2014, Proceedings of the National Academy of Sciences.

[31]  Francesca Demichelis,et al.  PCAT-1, a long noncoding RNA, regulates BRCA2 and controls homologous recombination in cancer. , 2014, Cancer research.

[32]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[33]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[34]  S. Dhanasekaran,et al.  The long noncoding RNA SChLAP1 promotes aggressive prostate cancer and antagonizes the SWI/SNF complex , 2013, Nature Genetics.

[35]  Zhen Su,et al.  Integrative genomic analyses reveal clinically relevant long non-coding RNA in human cancer , 2013 .

[36]  Andrew J Vickers,et al.  Prostate cancer screening: facts, statistics, and interpretation in response to the US Preventive Services Task Force Review. , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[37]  S. Willard,et al.  Regulators of gene expression as biomarkers for prostate cancer. , 2012, American journal of cancer research.

[38]  Carl Kingsford,et al.  A fast, lock-free approach for efficient parallel counting of occurrences of k-mers , 2011, Bioinform..

[39]  A. Partin,et al.  Review of the literature: PCA3 for prostate cancer risk assessment and prognostication. , 2011, Reviews in urology.

[40]  Tso-Jung Yen,et al.  Discussion on "Stability Selection" by Meinshausen and Buhlmann , 2010 .

[41]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[42]  Fabio Galasso,et al.  PCA3: a new tool to diagnose prostate cancer (PCa) and a guidance in biopsy decisions. Preliminary report of the UrOP study. , 2010, Archivio italiano di urologia, andrologia : organo ufficiale [di] Societa italiana di ecografia urologica e nefrologica.

[43]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[44]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[45]  Leonard S. Marks,et al.  APTIMA PCA3 molecular urine test: development of a method to aid in the diagnosis of prostate cancer. , 2006, Clinical chemistry.

[46]  Debashis Ghosh,et al.  Classification and Selection of Biomarkers in Genomic Data Using LASSO , 2005, Journal of biomedicine & biotechnology.

[47]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[48]  L. Kiemeney,et al.  DD3(PCA3), a very sensitive and specific marker to detect prostate tumors. , 2002, Cancer research.

[49]  A. D'Amico,et al.  Calculated prostate carcinoma volume , 1998, Cancer.