Transductive support vector machines and applications in bioinformatics for promoter recognition

This paper introduces a novel transductive support vector machine (TSVM) model and compares it with the traditional inductive SVM on a key problem in bioinformatics - promoter recognition. While inductive reasoning is concerned with the development of a model (a function) to approximate data from the whole problem space (induction), and consecutively using this model to predict output values for a new input vector (deduction), in the transductive inference systems a model is developed for every new input vector based on some closest to the new vector data from an existing database and this model is used to predict only the output for this vector. The TSVM outperforms by far the inductive SVM models applied on the same problems. Analysis is given on the advantages and disadvantages of the TSVM. Hybrid TSVM-evolving connections systems are discussed as directions for future research.

[1]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[2]  Jonathan D. Cryer,et al.  Time Series Analysis , 1986 .

[3]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[4]  Gerald Sommer,et al.  Pattern Recognition by Self-Organizing Neural Networks , 1994 .

[5]  J. Jones,et al.  Uses and abuses of statistical models for evaluating trauma care. , 1995, The Journal of trauma.

[6]  Farmer,et al.  Predicting chaotic time series. , 1987, Physical review letters.

[7]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[8]  Steen Knudsen,et al.  Promoter2.0: for the recognition of PolII promoter sequences , 1999, Bioinform..

[9]  J. Tattersall,et al.  Artificial intelligence: a new approach for prescription and monitoring of hemodialysis therapy. , 2001, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[10]  T. Werner,et al.  Highly specific localization of promoter regions in large genomic sequences by PromoterInspector: a novel context analysis approach. , 2000, Journal of molecular biology.

[11]  K. Anderson,et al.  Cardiovascular disease risk profiles. , 1991, American heart journal.

[12]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[13]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[14]  J. Fickett,et al.  Eukaryotic promoter recognition. , 1997, Genome research.

[15]  Hilbert J. Kappen,et al.  On-line learning processes in artificial neural networks , 1993 .

[16]  Guoping Wang,et al.  Learning with progressive transductive support vector machine , 2003, Pattern Recognit. Lett..

[17]  Nikola Kasabov,et al.  Evolving Connectionist Systems: Methods and Applications in Bioinformatics, Brain Study and Intelligent Machines , 2002, IEEE Transactions on Neural Networks.

[18]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[19]  Gregory R. Grant,et al.  Bioinformatics - The Machine Learning Approach , 2000, Comput. Chem..

[20]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[21]  Nikola K. Kasabov,et al.  DENFIS: dynamic evolving neural-fuzzy inference system and its application for time-series prediction , 2002, IEEE Trans. Fuzzy Syst..

[22]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[23]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[24]  Nikola Kasabov,et al.  Foundations Of Neural Networks, Fuzzy Systems, And Knowledge Engineering [Books in Brief] , 1996, IEEE Transactions on Neural Networks.

[25]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[26]  Mahesan Niranjan,et al.  On-line Q-learning using connectionist systems , 1994 .

[27]  T. Werner Models for prediction and recognition of eukaryotic promoters , 1999, Mammalian Genome.

[28]  Rudolf Kruse,et al.  Neuro-Fuzzy Systems , 1998 .

[29]  Nikola Kasabov,et al.  A method for modelling genetic regulatory networks by using evolving connectionist systems and microarray gene expression data , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[30]  Nikola Kasabov,et al.  Evolving Fuzzy Neural Networks for On-line Knowledge Discovery , 2001 .

[31]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[32]  Hans-Heinrich Bothe,et al.  Neuro Fuzzy Systems , 1998, NC.

[33]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[34]  Tarun Khanna,et al.  Foundations of neural networks , 1990 .

[35]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[36]  Graham C. Goodwin,et al.  Adaptive filtering prediction and control , 1984 .

[37]  Nikola K. Kasabov,et al.  Evolving fuzzy neural networks for supervised/unsupervised online knowledge-based learning , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[38]  Shaoning Pang,et al.  Membership authentication in the dynamic group by face classification using SVM ensemble , 2003, Pattern Recognit. Lett..