Artificial neural networks and decision tree model analysis of liver cancer proteomes.

Hepatocellular carcinoma (HCC) is a heterogeneous cancer and usually diagnosed at late advanced tumor stages of high lethality. The present study attempted to obtain a proteome-wide analysis of HCC in comparison with adjacent non-tumor liver tissues, in order to facilitate biomarkers' discovery and to investigate the mechanisms of HCC development. A cohort of 66 Chinese patients with HCC was included for proteomic profiling study by two-dimensional gel electrophoresis (2-DE) analysis. Artificial neural network (ANN) and decision tree (CART) data-mining methods were employed to analyze the profiling data and to delineate significant patterns and trends for discriminating HCC from non-malignant liver tissues. Protein markers were identified by tandem MS/MS. A total of 132 proteome datasets were generated by 2-DE expression profiling analysis, and each with 230 consolidated protein expression intensities. Both the data-mining algorithms successfully distinguished the HCC phenotype from other non-malignant liver samples. The detection sensitivity and specificity of ANN were 96.97% and 87.88%, while those of CART were 81.82% and 78.79%, respectively. The three biological classifiers in the CART model were identified as cytochrome b5, heat shock 70 kDa protein 8 isoform 2, and cathepsin B. The 2-DE-based proteomic profiling approach combined with the ANN or CART algorithm yielded satisfactory performance on identifying HCC and revealed potential candidate cancer biomarkers.

[1]  Nathalie Wong,et al.  Prediction of liver fibrosis and cirrhosis in chronic hepatitis B infection by serum proteomic fingerprinting: a pilot study. , 2005, Clinical chemistry.

[2]  C. Balabaud,et al.  Proteomic analysis of differentially expressed proteins in hepatocellular carcinoma developed in patients with chronic viral hepatitis C , 2005, Proteomics.

[3]  N. Iizuka,et al.  Proteomic profiling of heat shock protein 70 family members as biomarkers for hepatitis C virus‐related hepatocellular carcinoma , 2003, Proteomics.

[4]  T. Poon,et al.  Proteome analysis and its impact on the discovery of serological tumor markers. , 2001, Clinica chimica acta; international journal of clinical chemistry.

[5]  H. Lee,et al.  Nuclear matrix of calreticulin in hepatocellular carcinoma. , 2000, Cancer research.

[6]  Martin T. Hagan,et al.  Neural network design , 1995 .

[7]  K. Sheahan,et al.  Stage-specific increases in cathepsin B messenger RNA content in human colorectal carcinoma. , 1991, Cancer research.

[8]  J. Luk,et al.  Identification of liver-intestine cadherin in hepatocellular carcinoma--a potential disease marker. , 2003, Biochemical and biophysical research communications.

[9]  M. Feitelson,et al.  Genetic mechanisms of hepatocarcinogenesis , 2002, Oncogene.

[10]  Hoguen Kim,et al.  Proteomic alterations of the variants of human aldehyde dehydrogenase isozymes correlate with hepatocellular carcinoma , 2002, International journal of cancer.

[11]  J. Luk,et al.  Proteomic profiling of hepatocellular carcinoma in Chinese cohort reveals heat‐shock proteins (Hsp27, Hsp70, GRP78) up‐regulation and their associated prognostic values , 2006, Proteomics.

[12]  R. Dwek,et al.  A Proteomic Approach for the Discovery of Early Detection Markers of Hepatocellular Carcinoma , 2002, Disease markers.

[13]  K. Jaskiewicz,et al.  Overexpression of cathepsin B correlates with angiogenesis in colon adenocarcinoma. , 2004, Neoplasma.

[14]  Jie Liu,et al.  Estrogen signaling in livers of male mice with hepatocellular carcinoma induced by exposure to arsenic in utero. , 2004, Journal of the National Cancer Institute.

[15]  S. Seki,et al.  Proteome analysis of rat hepatic stellate cells , 2000, Hepatology.

[16]  T. Yip,et al.  Comprehensive proteomic profiling identifies serum proteomic signatures for detection of hepatocellular carcinoma and its subtypes. , 2003, Clinical chemistry.

[17]  H. El‐Serag Hepatocellular carcinoma: an epidemiologic view. , 2002, Journal of clinical gastroenterology.

[18]  Will Dwinnell Data enhancement — filtering for neural network success , 2000 .

[19]  Jung Eun Shim,et al.  An integrated proteome database for two‐dimensional electrophoresis data analysis and laboratory information management system , 2002, Proteomics.

[20]  J. Luk,et al.  Proteomic identification of Ku70/Ku80 autoantigen recognized by monoclonal antibody against hepatocellular carcinoma , 2005, Proteomics.

[21]  S. Willich,et al.  Occupational exposure and liver cancer in women: results of the Multicentre International Liver Tumour Study (MILTS). , 2000, Occupational medicine.

[22]  R. Schmelzeisen,et al.  Comparison of Fuzzy Inference, Logistic Regression, and Classification Trees (CART) , 2003, Methods of Information in Medicine.