Cancer subtype identification pipeline: A classifusion approach

Classifying cancer patients into treatment groups is essential for appropriate diagnosis and, in turn, for increasing survival. A series of papers, published largely in the breast cancer domain, has leveraged Computational Intelligence (CI) developments and tools, resulting in groundbreaking advances such as the classification of cancer into newly identified classes, which has led to improved treatment options. However, the current literature on the use of CI for this purpose is fragmented, making further advances challenging. This paper captures the developments in this area so far, with the goal of establishing a clear, step-by-step pipeline for cancer subtype identification. Building on this pipeline, the paper identifies key potential advances in CI at each individual step, thus establishing a roadmap for future research. The aim of the paper is therefore to engage the CI community in addressing these research challenges and in leveraging the strong potential of CI in this important area. Finally, we present a small set of recent findings on the Nottingham Tenovus Primary Breast Carcinoma Series that enable a higher number of patients to be classified into one of the identified breast cancer groups, and we introduce Classifusion: a combination of the results of multiple classifiers.
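
The abstract describes Classifusion only as a combination of the results of multiple classifiers and does not state the combination rule. As a minimal sketch of one common way such a combination could work, the Python example below assumes a simple agreement vote: a patient is assigned to a class only if enough classifiers agree, and is otherwise left unclassified. The function name `combine_votes`, the `min_agreement` parameter, and the class labels are hypothetical and chosen for illustration; they are not taken from the paper.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def combine_votes(labels: Sequence[Hashable], min_agreement: int) -> Optional[Hashable]:
    """Combine the labels predicted by several classifiers for one patient.

    Assumed rule (not from the paper): return the most frequent label if it is
    the unique winner and at least `min_agreement` classifiers predicted it;
    otherwise return None, meaning the patient stays unclassified.
    """
    if not labels:
        return None
    ranked = Counter(labels).most_common()
    top_label, top_count = ranked[0]
    # A tie between the two most frequent labels gives no clear winner.
    if len(ranked) > 1 and ranked[1][1] == top_count:
        return None
    return top_label if top_count >= min_agreement else None


# Hypothetical predictions from three classifiers for four patients.
predictions = [
    ["class_1", "class_1", "class_1"],  # full agreement
    ["class_2", "class_2", "class_3"],  # two of three agree
    ["class_1", "class_2", "class_3"],  # no agreement
    ["class_3", "class_3", "class_3"],
]

assigned = [combine_votes(p, min_agreement=2) for p in predictions]
```

Under this assumed rule, lowering `min_agreement` assigns more patients to a class at the cost of accepting weaker consensus, which is the kind of trade-off any combination scheme has to settle.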
