Two applications of the LSA machine

We present two applications of a learning algorithm that combines logarithmic simulated annealing with the perceptron algorithm. The implementation of the learning algorithm is called LSA machine and has been successfully applied already to the classification of liver tissue from CT images. We investigate the performance of the LSA machine on two sets of numerical data: The Wisconsin breast cancer diagnosis (WBCD) database and microarray data published by Golub et al. (1999). The WBCD data consist of 683 samples with 9 input values that are divided into 444 benign cases (positive examples) and 239 malignant cases (negative examples). The LSA machine has been trained on 50% and 75% of the entire sample set, and the test has been performed on the remaining samples. In both cases, we obtain a correct classification close to 99% which is comparable to the best results published on WBCD data. The training set of the microarray data consists of I I samples of acute myeloid leukemia (AML) and 27 samples of acute lymphoblastic leukemia (ALL), each of them with 7129 input values (gene-expression data). For the test, 14 AML samples and 20 ALL samples are used. We obtain a single classification error (which is a ALL test sample) on seven genes only, which improves on the results published by Golub et al. (1999) by using the model of self-organising maps. Our result is competitive to the best results published for support vector machines.

[1]  Hans Ulrich Simon,et al.  Robust Trainability of Single Neurons , 1995, J. Comput. Syst. Sci..

[2]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Chak-Kuen Wong,et al.  Combining the Perceptron Algorithm with Logarithmic Simulated Annealing , 2001, Neural Processing Letters.

[4]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[5]  Rudy Setiono,et al.  Generating concise and accurate classification rules for breast cancer diagnosis , 2000, Artif. Intell. Medicine.

[6]  Bruce E. Hajek,et al.  Cooling Schedules for Optimal Annealing , 1988, Math. Oper. Res..

[7]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[8]  N. Maughan,et al.  An introduction to arrays , 2001, The Journal of pathology.

[9]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[10]  Jianping Zhang,et al.  Selecting Typical Instances in Instance-Based Learning , 1992, ML.

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  Chak-Kuen Wong,et al.  Bounded-depth threshold circuits for computer-assisted CT image classification , 2002, Artif. Intell. Medicine.

[13]  Yoav Freund,et al.  Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[14]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[15]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[16]  Yves Crama,et al.  Local Search in Combinatorial Optimization , 2018, Artificial Neural Networks.

[17]  Moshe Sipper,et al.  Fuzzy CoCo: a cooperative-coevolutionary approach to fuzzy modeling , 2001, IEEE Trans. Fuzzy Syst..

[18]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[19]  Manfred K. Warmuth,et al.  On Weak Learning , 1995, J. Comput. Syst. Sci..

[20]  Partha S. Vasisht Computational Analysis of Microarray Data , 2003 .

[21]  Peter L. Bartlett,et al.  The Sample Complexity of Pattern Classification with Neural Networks: The Size of the Weights is More Important than the Size of the Network , 1998, IEEE Trans. Inf. Theory.

[22]  Chak-Kuen Wong,et al.  A Simulated Annealing and Resampling Method for Training Perceptrons to Classify Gene-Expression Data , 2002, ICANN.

[23]  Geoffrey Holmes,et al.  Experiences with a weighted decision tree learner , 2001 .

[24]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.