Weighted Automata Sequence Kernel

Sequence kernels are widely used for learning from sequential data. The literature includes a variety of sequence kernels. In this paper, we present a general framework to deal with sequence kernels, termed weighted automata sequence kernel. In fact, the mapping of a string s to a high dimensional feature space can be modeled by a formal power series that can be realized by a weighted automaton (WA) As representing all subsequences. The computation of the kernel between two strings K(s,t) is the behavior of the WA As,t = As ∩ At. Thus, as formulated, it can be generalized to sequence set kernel. For the kernel computation efficiency, we propose a forward lookup automata intersection technique to prevent non successful ∈-paths while evaluating the WA computations. The experiments use the Reuters-21578 collection. The results reveal that the kernel evaluation using our proposed technique is faster than that using the standard intersection. We also study the relationship between our general framework and a variety of common sequence kernels. Finally, based on our general framework, we describe the essential to create a new sequence-set kernel that can also be seen as a tree kernel.

[1]  Jean Berstel,et al.  Rational series and their languages , 1988, EATCS monographs on theoretical computer science.

[2]  Yves Schabes,et al.  Speech Recognition by Composition of Weighted Finite Automata , 1997 .

[3]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[4]  Djelloul Ziadi,et al.  Efficient List-Based Computation of the String Subsequence Kernel , 2014, LATA.

[5]  Fernando Pereira,et al.  Weighted Automata in Text and Speech Processing , 2005, ArXiv.

[6]  Mehryar Mohri,et al.  Semiring Frameworks and Algorithms for Shortest-Distance Problems , 2002, J. Autom. Lang. Comb..

[7]  Arto Salomaa,et al.  Semirings, Automata and Languages , 1985 .

[8]  Juho Rousu,et al.  Efficient Computation of Gapped Substring Kernels on Large Alphabets , 2005, J. Mach. Learn. Res..

[9]  András Kornai Extended finite state models of language , 1996, Nat. Lang. Eng..

[10]  Jacques Sakarovitch,et al.  Elements of Automata Theory , 2009 .

[11]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[12]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[13]  Emmanuel Roche,et al.  Finite-State Language Processing , 1997 .

[14]  Eleazar Eskin,et al.  The Spectrum Kernel: A String Kernel for SVM Protein Classification , 2001, Pacific Symposium on Biocomputing.

[15]  Mehryar Mohri,et al.  Rational Kernels: Theory and Algorithms , 2004, J. Mach. Learn. Res..

[16]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .