Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer

Background: Phenotype prediction problems are usually ill-posed, because the number of samples is very small compared with the number of genetic probes examined. This complicates the sampling of the defective genetic pathways, because of the high number of possible discriminatory genetic networks involved. In this research, we outline three novel sampling algorithms used to identify, classify and characterize the defective pathways in phenotype prediction problems: the Fisher’s ratio sampler, the Holdout sampler and the Random sampler. We apply each one to the analysis of the genetic pathways involved in tumor behavior and outcomes of triple negative breast cancer (TNBC). Altered biological pathways are identified using the most frequently sampled genes and are compared to those obtained via Bayesian Networks (BNs).

Results: The Random, Fisher’s ratio and Holdout samplers were more accurate and robust than BNs, while providing comparable insights about disease genomics.

Conclusions: The three samplers tested are good alternatives to Bayesian Networks, since they are less computationally demanding. Importantly, this analysis confirms the concept of “biological invariance”, since the altered pathways should be independent of the sampling methodology and the classifier used for their inference. Nevertheless, some modifications to Bayesian Networks are still needed before they can correctly sample the uncertainty space in phenotype prediction problems, since the probabilistic parameterization of the uncertainty space is not unique and the use of the optimum network might falsify the pathway analysis.
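The idea of ranking genes by how often they are sampled can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a two-class problem, uses the common definition of Fisher's ratio (squared difference of class means over the sum of class variances), and combines it with random holdouts of the samples. The function names, the synthetic data and all parameter values (`n_draws`, `top_k`, `holdout`) are hypothetical choices made for illustration only.

```python
import numpy as np


def fisher_ratio(X, y):
    """Per-gene Fisher's ratio for a two-class problem (labels 0 and 1).

    Assumed definition: (mean_0 - mean_1)^2 / (var_0 + var_1).
    """
    X0, X1 = X[y == 0], X[y == 1]
    num = (X0.mean(axis=0) - X1.mean(axis=0)) ** 2
    den = X0.var(axis=0) + X1.var(axis=0) + 1e-12  # guard against zero variance
    return num / den


def sample_gene_frequencies(X, y, n_draws=200, top_k=3, holdout=0.25, seed=0):
    """Fraction of random holdouts in which each gene ranks in the top_k.

    Hypothetical scheme: each draw discards about `holdout` of the samples,
    ranks genes by Fisher's ratio on the rest, and counts the top_k genes.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_genes = X.shape
    counts = np.zeros(n_genes, dtype=int)
    for _ in range(n_draws):
        keep = rng.random(n_samples) >= holdout
        # Skip degenerate draws with fewer than two samples in either class.
        if min((y[keep] == 0).sum(), (y[keep] == 1).sum()) < 2:
            continue
        ranking = np.argsort(fisher_ratio(X[keep], y[keep]))
        counts[ranking[-top_k:]] += 1
    return counts / n_draws


# Synthetic example: 40 samples, 50 genes, only genes 0-2 differ between classes.
rng = np.random.default_rng(42)
y = np.repeat([0, 1], 20)
X = rng.normal(size=(40, 50))
X[y == 1, :3] += 3.0

freq = sample_gene_frequencies(X, y)
most_sampled = np.argsort(freq)[-3:].tolist()
print(sorted(most_sampled))
```

In this toy setting the most frequently sampled genes are the ones that were shifted between classes. With real expression data, the high-frequency genes would be the candidates passed on to pathway analysis.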
