Classification of intramural metastases and lymph node metastases of esophageal cancer from gene expression based on boosting and projective adaptive resonance theory.

Esophageal cancer is a well-known cancer with poorer prognosis than other cancers. An optimal and individualized treatment protocol based on accurate diagnosis is urgently needed to improve the treatment of cancer patients. For this purpose, it is important to develop a sophisticated algorithm that can manage a large amount of data, such as gene expression data from DNA microarrays, for optimal and individualized diagnosis. Marker gene selection is essential in the analysis of gene expression data. We have already developed a combination method of the use of the projective adaptive resonance theory and that of a boosted fuzzy classifier with the SWEEP operator denoted PART-BFCS. This method is superior to other methods, and has four features, namely fast calculation, accurate prediction, reliable prediction, and rule extraction. In this study, we applied this method to analyze microarray data obtained from esophageal cancer patients. A combination method of PART-BFCS and the U-test was also investigated. It was necessary to use a specific type of BFCS, namely, BFCS-1,2, because the esophageal cancer data were very complexity. PART-BFCS and PART-BFCS with the U-test models showed higher performances than two conventional methods, namely, k-nearest neighbor (kNN) and weighted voting (WV). The genes including CDK6 could be found by our methods and excellent IF-THEN rules could be extracted. The genes selected in this study have a high potential as new diagnosis markers for esophageal cancer. These results indicate that the new methods can be used in marker gene selection for the diagnosis of cancer patients.

[1]  Yoav Freund,et al.  An Adaptive Version of the Boost by Majority Algorithm , 1999, COLT.

[2]  R. Sutherland,et al.  Overexpression of the cell cycle inhibitor p16INK4A in high-grade prostatic intraepithelial neoplasia predicts early relapse in prostate cancer patients. , 2001, Clinical cancer research : an official journal of the American Association for Cancer Research.

[3]  Y. Freund,et al.  Discussion of the Paper \additive Logistic Regression: a Statistical View of Boosting" By , 2000 .

[4]  H Kato,et al.  Prognostic evaluation for squamous cell carcinomas of the lower thoracic esophagus treated with three-field lymph node dissection. , 2001, European journal of cardio-thoracic surgery : official journal of the European Association for Cardio-thoracic Surgery.

[5]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[6]  Taizo Hanai,et al.  Fuzzy Neural Network Applied to Gene Expression Profiling for Predicting the Prognosis of Diffuse Large B‐cell Lymphoma , 2002, Japanese journal of cancer research : Gann.

[7]  L C Harrison,et al.  Fuzzy neural network-based prediction of the motif for MHC class II binding peptides. , 2001, Journal of bioscience and bioengineering.

[8]  Jacques Ferlay,et al.  Estimating the world cancer burden: Globocan 2000 , 2001, International journal of cancer.

[9]  Yoshiki Uchikawa,et al.  On fuzzy modeling using fuzzy neural networks with the back-propagation algorithm , 1992, IEEE Trans. Neural Networks.

[10]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[11]  Hiroyuki Honda,et al.  Construction of robust prognostic predictors by using projective adaptive resonance theory as a gene filtering method , 2005, Bioinform..

[12]  Hiroyuki Honda,et al.  A New Reliable Cancer Diagnosis Method Using Boosted Fuzzy Classifier with a SWEEP Operator Method , 2005 .

[13]  Axel Benner,et al.  Genomic and protein expression profiling identifies CDK6 as novel independent prognostic marker in medulloblastoma. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[14]  Lydia Sánchez,et al.  Hodgkin and Reed-Sternberg cells harbor alterations in the major tumor suppressor pathways and cell-cycle checkpoints: analyses using tissue microarrays. , 2003, Blood.

[15]  Hiroyuki Honda,et al.  Prediction of peptide binding to major histocompatibility complex class II molecules through use of boosted fuzzy classifier with SWEEP operator method. , 2006, Journal of bioscience and bioengineering.

[16]  Robert E. Schapire,et al.  The strength of weak learnability , 1990, Mach. Learn..

[17]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[18]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.