Membrane Computing to Model Feature Selection of Microarray Cancer Data

Cancer is the main public health issue in most places of the world due to the difficulties in its early diagnosis and to begin early treatment. In the recent years many techniques have been proposed to tackle the high dimensionality in cancer datasets. This paper proposed a membrane-inspired feature selection method to utilize the potentials of membrane computing features such as decentralization, non-determinism, and maximal parallel computing for feature selection of cancer data. Kernel p system- one of the variants of membrane computing- is defined based on multi objective binary particle swarm optimization feature selection method through nine steps involving definitions of objects, compartments, rules and output. Matlab software is used to model the proposed approach. The proposed model evaluated by cell line data of breast cancer with six samples of papillary infiltrating ductal carcinoma and Carcinosarcoma disease state. As the initial attempt to come out with a model, division rule and sequential computation on Matlab are used as tools to define potential of membrane computing in trading space against time computation. The evaluation results indicate the proposed model based on kernel p system computation with distributed compartments is capable in distinguishing marker genes.

[1]  Verónica Bolón-Canedo,et al.  A review of microarray datasets and applied feature selection methods , 2014, Inf. Sci..

[2]  David W. Corne,et al.  Modeling the Dynamics of HIV Infection with Conformon-P Systems and Cellular Automata , 2007, Workshop on Membrane Computing.

[3]  Anirban Mukhopadhyay,et al.  A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO , 2014, PloS one.

[4]  Natalia Shulzhenko,et al.  Microarrays for cancer diagnosis and classification. , 2007, Advances in experimental medicine and biology.

[5]  Wai-Fong Yin,et al.  Draft Genome Sequence of Aeromonas caviae Strain L12, a Quorum-Sensing Strain Isolated from a Freshwater Lake in Malaysia , 2015, Genome Announcements.

[6]  Huan Liu,et al.  Feature Selection for Clustering: A Review , 2018, Data Clustering: Algorithms and Applications.

[7]  A. Jemal,et al.  Cancer statistics, 2014 , 2014, CA: a cancer journal for clinicians.

[8]  C. Rosenow,et al.  Monitoring gene expression using DNA microarrays. , 2000, Current opinion in microbiology.

[9]  Ferrante Neri,et al.  An Optimization Spiking Neural P System for Approximately Solving Combinatorial Optimization Problems , 2014, Int. J. Neural Syst..

[10]  Mario J. Pérez-Jiménez,et al.  A Study of the Robustness of the EGFR Signalling Cascade Using Continuous Membrane Systems , 2005, IWINAC.

[11]  Florentin Ipate,et al.  A Kernel P System , 2012 .

[12]  Miguel Rocha,et al.  A platform for the selection of genes in DNA microarraydata using evolutionary algorithms , 2007, GECCO '07.

[13]  Mario J. Pérez-Jiménez,et al.  Simulating Avascular Tumors with Membrane Systems , 2005 .

[14]  Yungho Leu,et al.  A novel hybrid feature selection method for microarray data analysis , 2011, Appl. Soft Comput..

[15]  Giancarlo Mauri,et al.  A comparison of machine learning techniques for survival prediction in breast cancer , 2011, BioData Mining.

[16]  Daniel Birnbaum,et al.  DNA Arrays in Clinical Oncology: Promises and Challenges , 2003, Laboratory Investigation.