Efficient Wu Manber String Matching Algorithm for Large Number of Patterns

String matching is one of the most important concepts in computer science, with real-life applications such as intrusion detection systems, data mining, and plagiarism detection systems. Many string matching algorithms exist for finding patterns in a text; they are categorized as single-pattern or multiple-pattern matching algorithms. The Wu-Manber (WM) algorithm is a multiple-pattern algorithm and one of the best-performing string matching algorithms. The performance of WM depends on the tables built in its preprocessing phase: the prefix table, the shift table, and the hash table. We introduce a new algorithm, the Efficient Wu-Manber (EWM) algorithm, an improved version of WM with respect to running time. EWM eliminates the prefix table, which goes unused in most cases in WM; constructs two shift tables instead of a single one; and, in the hash table, replaces the linear data structure used by WM (a linked list) with a nonlinear one (an AVL tree), which reduces the number of nodes traversed to find an exact match. Experimental results and analysis show that EWM performs better than WM and its existing improved variants, and also better than various string matching tools.
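To make the roles of the shift table and the hash table concrete, the following is a minimal sketch of a Wu-Manber-style search, not the authors' EWM implementation. Several choices are assumptions made for illustration: the block size `B = 2`, the use of Python dictionaries in place of hashed arrays, and the names `build_tables` and `find_all`. The prefix table is left out, as in EWM. Each hash bucket is kept as a sorted list searched with `bisect`, which only stands in for the ordered lookup an AVL tree would provide; no AVL tree is implemented here. The two-shift-table construction is not reproduced, because the abstract does not specify it.

```python
from bisect import bisect_left

B = 2  # block size in characters (an assumption for this sketch)


def build_tables(patterns):
    """Build the shift table and hash buckets from the first m characters
    of each pattern, where m is the length of the shortest pattern."""
    m = min(len(p) for p in patterns)
    if m < B:
        raise ValueError("every pattern must have at least B characters")
    default_shift = m - B + 1  # safe shift for blocks that occur in no pattern
    shift = {}
    buckets = {}
    for p in patterns:
        # j is the (exclusive) end position of a block inside p[:m].
        for j in range(B, m + 1):
            block = p[j - B:j]
            shift[block] = min(shift.get(block, default_shift), m - j)
        # Bucket key: the last block of the m-character window.
        buckets.setdefault(p[m - B:m], []).append((p[:m], p))
    for entries in buckets.values():
        entries.sort()  # sorted order enables binary search (AVL stand-in)
    return m, default_shift, shift, buckets


def find_all(text, patterns):
    """Return (start_index, pattern) for every occurrence in text."""
    m, default_shift, shift, buckets = build_tables(patterns)
    matches = []
    i = m  # exclusive end index of the current m-character window
    while i <= len(text):
        block = text[i - B:i]
        s = shift.get(block, default_shift)
        if s > 0:
            i += s  # the block cannot end a match here, so skip ahead
            continue
        # Shift of 0: some pattern's window ends with this block.
        start = i - m
        window = text[start:i]
        entries = buckets[block]
        k = bisect_left(entries, (window,))  # first entry with key >= window
        while k < len(entries) and entries[k][0] == window:
            candidate = entries[k][1]
            if text.startswith(candidate, start):  # verify the full pattern
                matches.append((start, candidate))
            k += 1
        i += 1
    return matches


if __name__ == "__main__":
    pats = ["announce", "annual", "annually"]
    print(find_all("CPM_annual_conference_announce", pats))
```

In this sketch, the binary search narrows a bucket to the candidates whose first m characters equal the current window, so only those candidates are verified against the text. A linked-list bucket would instead have to be walked node by node, which is the cost the abstract says EWM reduces.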
