Pathway-based classification of breast cancer subtypes.

Cancer heterogeneity represents a major hurdle in the development of effective theranostic strategies, as it prevents to devise unique and maximally efficient diagnostic, prognostic and therapeutic procedures even for patients affected by the same tumor type. Computational techniques can nowadays leverage the huge and ever increasing amount of (epi)genomic data to tackle this problem, therefore providing new and valuable instruments for decision support to biologists and pathologists, in the broad sphere of precision medicine. In this context, we here introduce a novel cancer subtype classifier from gene expression data and we apply it to two different Breast Cancer datasets, from TCGA and GEO repositories. The classifier is based on Support Vector Machines and relies on the information about the relevant pathways involved in breast cancer development to reduce the huge variable space. Among the main results, we show that the classifier accuracy is preserved at excellent values even when the variable space is reduced by a 20-fold, hence providing a precious tool for cancer patient profiling even in case of limited experimental resources.

[1]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[2]  A. Børresen-Dale,et al.  Subtype‐specific micro‐RNA expression signatures in breast cancer progression , 2016, International journal of cancer.

[3]  Arvindhan G. Nagarajan,et al.  Oncogene-directed alterations in cancer cell metabolism. , 2016, Trends in cancer.

[4]  Gianluca Bontempi,et al.  TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data , 2015, Nucleic acids research.

[5]  Giancarlo Mauri,et al.  Algorithmic methods to infer the evolutionary trajectories in cancer progression , 2015, Proceedings of the National Academy of Sciences.

[6]  Jonathan H. Chan,et al.  Pathway activity transformation for multi-class classification of lung cancer datasets , 2015, Neurocomputing.

[7]  Wei Liu,et al.  Topologically inferring pathway activity toward precise cancer classification via integrating genomic and metabolomic data: prostate cancer as a case , 2015, Scientific Reports.

[8]  Gianluca Bontempi,et al.  Integrative Analysis with Monte Carlo Cross-Validation Reveals miRNAs Regulating Pathways Cross-Talk in Aggressive Breast Cancer , 2015, BioMed research international.

[9]  Habibollah Haron,et al.  Semi-supervised SVM-based Feature Selection for Cancer Classification using Microarray Gene Expression Data , 2015, IEA/AIE.

[10]  F. Feng,et al.  Development and Validation of a Novel Platform-Independent Metastasis Signature in Human Breast Cancer , 2015, PloS one.

[11]  Carlos Caldas,et al.  BRCA1‐like signature in triple negative breast cancer: Molecular and clinical characterization reveals subgroups with therapeutic potential , 2015, Molecular oncology.

[12]  Dong Xu,et al.  Classification of lung cancer using ensemble-based feature selection and machine learning methods. , 2015, Molecular bioSystems.

[13]  Lazaros G. Papageorgiou,et al.  Pathway activity inference for multiclass disease classification through a mathematical programming optimisation framework , 2014, BMC Bioinformatics.

[14]  Charles Swanton,et al.  Tumour heterogeneity and the evolution of polyclonal drug resistance , 2014, Molecular oncology.

[15]  Qihua Tan,et al.  Classification of Breast Cancer Subtypes by combining Gene Expression and DNA Methylation Data , 2014, J. Integr. Bioinform..

[16]  Giancarlo Mauri,et al.  Integration of mRNA Expression Profile, Copy Number Alterations, and microRNA Expression Levels in Breast Cancer to Improve Grade Definition , 2014, PloS one.

[17]  Nikolay M. Borisov,et al.  Signaling pathway cloud regulation for in silico screening and ranking of the potential geroprotective drugs , 2014, Front. Genet..

[18]  G. Mauri,et al.  Combination of gene expression and genome copy number alteration has a prognostic value for breast cancer , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[19]  G. Landberg,et al.  Cell cycle regulators cyclin D1 and CDK4/6 have estrogen receptor-dependent divergent functions in breast cancer migration and stem cell-like activity , 2013, Cell cycle.

[20]  Giancarlo Mauri,et al.  Copy-Number Alterations for Tumor Progression Inference , 2013, AIME.

[21]  Zhijun Dai,et al.  TSG: a new algorithm for binary and multi-class cancer classification and informative genes selection , 2013, BMC Medical Genomics.

[22]  L. Pusztai,et al.  Cancer heterogeneity: implications for targeted therapeutics , 2013, British Journal of Cancer.

[23]  F. Di Virgilio,et al.  Purines, purinergic receptors, and cancer. , 2012, Cancer research.

[24]  San-Gang Wu,et al.  Predictive value of breast cancer molecular subtypes in Chinese patients with four or more positive nodes after postmastectomy radiotherapy. , 2012, Breast.

[25]  Hyung-Seok Choi,et al.  A pathway-based classification of breast cancer integrating data on differentially expressed genes, copy number variations and MicroRNA target genes , 2012, Molecules and cells.

[26]  Ching Y. Suen,et al.  A novel hybrid CNN-SVM classifier for recognizing handwritten digits , 2012, Pattern Recognit..

[27]  Ara Darzi,et al.  Preparing for precision medicine. , 2012, The New England journal of medicine.

[28]  R. Arceci Intratumor Heterogeneity and Branched Evolution Revealed by Multiregion Sequencing , 2012 .

[29]  Charles DeLisi,et al.  Pathway-based classification of cancer subtypes , 2012, Biology Direct.

[30]  Alain Chédotal,et al.  Novel roles for Slits and netrins: axon guidance cues as anticancer targets? , 2011, Nature Reviews Cancer.

[31]  T. Nielsen,et al.  Breast cancer subtypes and the risk of local and regional relapse. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[32]  Sanjay Ranka,et al.  Pathway-BasedFeature Selection Algorithm for Cancer Microarray Data , 2010, Adv. Bioinformatics.

[33]  Doheon Lee,et al.  Inferring Pathway Activity toward Precise Disease Classification , 2008, PLoS Comput. Biol..

[34]  Emmanuel Barillot,et al.  Classification of microarray data using gene networks , 2007, BMC Bioinformatics.

[35]  Joshy George,et al.  Genetic reclassification of histologic grade delineates new clinical subtypes of breast cancer. , 2006, Cancer research.

[36]  M. J. van de Vijver,et al.  Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. , 2006, Journal of the National Cancer Institute.

[37]  Jun Lu,et al.  Pathway level analysis of gene expression using singular value decomposition , 2005, BMC Bioinformatics.

[38]  A. Gown,et al.  Immunohistochemical and Clinical Characterization of the Basal-Like Subtype of Invasive Breast Carcinoma , 2004, Clinical Cancer Research.

[39]  R. Tibshirani,et al.  Repeated observation of breast tumor subtypes in independent gene expression data sets , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[40]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[41]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[42]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[43]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[44]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[45]  T. Barbui,et al.  The Effect of Very-low-dose Warfarin on Markers of Hypercoagulation in Metastatic Breast Cancer: Results from a Randomized Trial , 1998, Thrombosis and Haemostasis.

[46]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .