Soft Computing Methods for Global, Local and Personalized Modeling and Applications in Bioinformatics

The paper is a comparative study of major modeling and pattern discovery approaches applicable to the area of Bioinformatics and the area of decision support systems in general. These approaches include inductive versus transductive reasoning, global, local, and personalized modeling and their potentials are illustrated on a case study of gene expression and clinical data related to cancer outcome prognosis. While inductive modeling is used to develop a model (function) from data on the whole problem space and then to recall it on new data, transductive modeling is concerned with the creation of single model for every new input vector based on some closest vectors from the existing problem space. The paper uses several techniques to illustrate these approaches – multiple linear regression, Bayesian inference, support vector machines, evolving connectionist systems (ECOS), weighted kNN – each of them providing different accuracy on specific problem and facilitating the discovery of different patterns and rules from data.

[1]  Harry Wechsler,et al.  Watch List Face Surveillance Using Transductive Inference , 2004, ICBA.

[2]  Nikola Kasabov,et al.  Evolving connectionist systems , 2002 .

[3]  Pong C. Yuen,et al.  Transductive Learning: Learning Iris Data with Two Labeled Data , 2001, ICANN.

[4]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[5]  Nikola Kasabov,et al.  Evolving Connectionist Systems: Methods and Applications in Bioinformatics, Brain Study and Intelligent Machines , 2002, IEEE Transactions on Neural Networks.

[6]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[7]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[8]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[9]  Ayhan Demiriz,et al.  Semi-Supervised Support Vector Machines , 1998, NIPS.

[10]  John Quackenbush Microarray data normalization and transformation , 2002, Nature Genetics.

[11]  Tapio Elomaa,et al.  Machine Learning: ECML 2002 , 2002, Lecture Notes in Computer Science.

[12]  Jocelyn Dow,et al.  Biochemistry: Molecules, Cells, and the Body , 1995 .

[13]  Tao Xiong,et al.  A combined SVM and LDA approach for classification , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[14]  T. Poggio,et al.  Multiclass cancer diagnosis using tumor gene expression signatures , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Hong Liu,et al.  Evolutionary semi-supervised fuzzy clustering , 2003, Pattern Recognit. Lett..

[16]  Christian A. Rees,et al.  Molecular portraits of human breast tumours , 2000, Nature.

[17]  F. Crick Central Dogma of Molecular Biology , 1970, Nature.

[18]  Z. Bosnic,et al.  Evaluation of prediction reliability in regression using the transduction principle , 2003, The IEEE Region 8 EUROCON 2003. Computer as a Tool..

[19]  Matjaz Kukar,et al.  Transductive reliability estimation for medical diagnosis , 2003, Artif. Intell. Medicine.

[20]  Nikola Kasabov,et al.  A method for modelling genetic regulatory networks by using evolving connectionist systems and microarray gene expression data , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[21]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[22]  Nello Cristianini,et al.  Large Margin Trees for Induction and Transduction , 1999, ICML.

[23]  Vladimir Brusic,et al.  Computer model for recognition of functional transcription start sites in RNA polymerase II promoters of vertebrates. , 2003, Journal of molecular graphics & modelling.

[24]  ndtedited by Julio Collado-Vides and Ralf Hofest Gene regulation and metabolism , 2002 .

[25]  Patrik D'haeseleer,et al.  Genetic network inference: from co-expression clustering to reverse engineering , 2000, Bioinform..

[26]  Bernhard Schölkopf,et al.  Feature selection and transduction for prediction of molecular bioactivity for drug design , 2003, Bioinform..

[27]  Thorsten Joachims,et al.  Transductive Learning via Spectral Graph Partitioning , 2003, ICML.

[28]  Doulaye Dembélé,et al.  Fuzzy C-means Method for Clustering Microarray Data , 2003, Bioinform..

[29]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[30]  Yudong D. He,et al.  A Gene-Expression Signature as a Predictor of Survival in Breast Cancer , 2002 .

[31]  Nikola Kasabov,et al.  Fuzzy clustering of gene expression data , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[32]  Alfred C. Weaver,et al.  Biometric authentication , 2006, Computer.

[33]  Shaoning Pang,et al.  Inductive vs transductive inference, global vs local models: SVM, TSVM, and SVMT for gene expression classification problems , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[34]  Nikola K. Kasabov,et al.  DENFIS: dynamic evolving neural-fuzzy inference system and its application for time-series prediction , 2002, IEEE Trans. Fuzzy Syst..

[35]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[36]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[37]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[38]  L. Penland,et al.  Use of a cDNA microarray to analyse gene expression patterns in human cancer , 1996, Nature Genetics.

[39]  Guoping Wang,et al.  Learning with progressive transductive support vector machine , 2003, Pattern Recognit. Lett..

[40]  David Botstein,et al.  The Stanford Microarray Database: data access and quality assessment tools , 2003, Nucleic Acids Res..

[41]  Jiang Li,et al.  Transductive inference for color-based particle filter tracking , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[42]  Nikola K. Kasabov,et al.  TWRBF - Transductive RBF Neural Network with Weighted Data Normalization , 2004, ICONIP.

[43]  Shaoning Pang,et al.  Transductive support vector machines and applications in bioinformatics for promoter recognition , 2003, International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003.

[44]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[45]  Kurt Hornik,et al.  Artificial Neural Networks — ICANN 2001 , 2001, Lecture Notes in Computer Science.

[46]  Rainer Fuchs,et al.  Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters , 2001, Bioinform..

[47]  Nikola Kasabov,et al.  GA-parameter optimisation of evolving connectionist systems for classification and a case study from bioinformatics , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[48]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[49]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[50]  Nikola K. Kasabov,et al.  Evolving fuzzy neural networks for supervised/unsupervised online knowledge-based learning , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[51]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[52]  Hitoshi Iba,et al.  Evolutionary modeling and inference of gene network , 2002, Inf. Sci..

[53]  Arjen Van Ooyen,et al.  Modeling neural development , 2003 .

[54]  Alexander Gammerman,et al.  Transductive Confidence Machines for Pattern Recognition , 2002, ECML.

[55]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.