Identifying Potential Clinical Syndromes of Hepatocellular Carcinoma Using PSO-Based Hierarchical Feature Selection Algorithm

Hepatocellular carcinoma (HCC) is one of the most common malignant tumors. Clinical symptoms attributable to HCC are usually absent, thus often miss the best therapeutic opportunities. Traditional Chinese Medicine (TCM) plays an active role in diagnosis and treatment of HCC. In this paper, we proposed a particle swarm optimization-based hierarchical feature selection (PSOHFS) model to infer potential syndromes for diagnosis of HCC. Firstly, the hierarchical feature representation is developed by a three-layer tree. The clinical symptoms and positive score of patient are leaf nodes and root in the tree, respectively, while each syndrome feature on the middle layer is extracted from a group of symptoms. Secondly, an improved PSO-based algorithm is applied in a new reduced feature space to search an optimal syndrome subset. Based on the result of feature selection, the causal relationships of symptoms and syndromes are inferred via Bayesian networks. In our experiment, 147 symptoms were aggregated into 27 groups and 27 syndrome features were extracted. The proposed approach discovered 24 syndromes which obviously improved the diagnosis accuracy. Finally, the Bayesian approach was applied to represent the causal relationships both at symptom and syndrome levels. The results show that our computational model can facilitate the clinical diagnosis of HCC.

[1]  Licheng Jiao,et al.  Multiple Parameter Selection for LS-SVM Using Smooth Leave-One-Out Error , 2005, ISNN.

[2]  Jianzhong Wang,et al.  Maximum weight and minimum redundancy: A novel framework for feature subset selection , 2013, Pattern Recognit..

[3]  Kin Keung Lai,et al.  Hybrid approaches based on LSSVR model for container throughput forecasting: A comparative study , 2013, Appl. Soft Comput..

[4]  D. Woodfield Hepatocellular carcinoma. , 1986, The New Zealand medical journal.

[5]  S. G. Ponnambalam,et al.  An elitist strategy genetic algorithm for integrated layout design , 2012 .

[6]  Senjian An,et al.  Fast cross-validation algorithms for least squares support vector machine and kernel ridge regression , 2007, Pattern Recognit..

[7]  Mohammad Reza Keyvanpour,et al.  A NOVEL EMBEDDED FEATURE SELECTION METHOD: A COMPARATIVE STUDY IN THE APPLICATION OF TEXT CATEGORIZATION , 2013, Appl. Artif. Intell..

[8]  F. Izzo,et al.  A new prognostic system for hepatocellular carcinoma: A retrospective study of 435 patients , 1998, Hepatology.

[9]  A. Jemal,et al.  Global cancer statistics , 2011, CA: a cancer journal for clinicians.

[10]  Robert G. Cowell,et al.  Local Propagation in Conditional Gaussian Bayesian Networks , 2005, J. Mach. Learn. Res..

[11]  H. Pomares,et al.  A heuristic method for parameter selection in LS-SVM: Application to time series prediction , 2011 .

[12]  Satoru Miyano,et al.  A filter based feature selection algorithm using null space of covariance matrix for DNA microarray gene expression data , 2012 .

[13]  A. Massi Pavan,et al.  Least squares support vector machine for short-term prediction of meteorological time series , 2012, Theoretical and Applied Climatology.

[14]  J. Bruix,et al.  Diagnosis of hepatic nodules 20 mm or smaller in cirrhosis: Prospective validation of the noninvasive diagnostic criteria for hepatocellular carcinoma , 2007, Hepatology.

[15]  Jaung-Geng Lin,et al.  Utilization pattern of traditional Chinese medicine for liver cancer patients in Taiwan , 2012, BMC Complementary and Alternative Medicine.

[16]  Philippe Leray,et al.  Probabilistic graphical models for genetic association studies , 2012, Briefings Bioinform..

[17]  M. Colombo,et al.  Epidemiology of hepatocellular carcinoma. , 1995, The Italian journal of gastroenterology.

[18]  Sadiq M. Sait,et al.  Binary particle swarm optimization (BPSO) based state assignment for area minimization of sequential circuits , 2013, Appl. Soft Comput..

[19]  L. Schwartz,et al.  The use of imaging in the diagnosis and staging of hepatobiliary malignancies. , 2007, Surgical oncology clinics of North America.

[20]  Xue-wen Chen,et al.  Improving Bayesian Network Structure Learning with Mutual Information-Based Node Ordering in the K2 Algorithm , 2008, IEEE Transactions on Knowledge and Data Engineering.

[21]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[22]  Qingling Duan,et al.  A novel force field parameter optimization method based on LSSVR for ECEPP , 2011, FEBS letters.

[23]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[24]  A. Rezaee Jordehi,et al.  Parameter selection in particle swarm optimisation: a survey , 2013, J. Exp. Theor. Artif. Intell..

[25]  Cheng-Hong Yang,et al.  Comparison of Classification Algorithms with Wrapper-Based Feature Selection for Predicting Osteoporosis Outcome Based on Genetic Factors in a Taiwanese Women Population , 2013, International journal of endocrinology.

[26]  Zhen Yang,et al.  Genetic algorithm-least squares support vector regression based predicting and optimizing model on carbon fiber composite integrated conductivity , 2010 .

[27]  Alexander G. Gray,et al.  Sparse high-dimensional fractional-norm support vector machine via DC programming , 2013, Comput. Stat. Data Anal..

[28]  Wensheng Zhang,et al.  Improved heuristic equivalent search algorithm based on Maximal Information Coefficient for Bayesian Network Structure Learning , 2013, Neurocomputing.

[29]  Ian R. Fasel,et al.  A learning approach to hierarchical feature selection and aggregation for audio classification , 2010, Pattern Recognit. Lett..

[30]  Haytham Elghazel,et al.  A semi-supervised feature ranking method with ensemble learning , 2012, Pattern Recognit. Lett..