GPU based Suffix Array Pattern Matching Approach for Big Data

Big data has been an emerging problem these days. To solve this problem Hadoop has evolved as a most widely used tool and adopted by various popular MNCs like Facebook and Yahoo. To search large number of pattern in big data is a challenging task. Map/Reduce is used to write codes to perform pattern matching on big data. In this work OpenCL is combined with Apache Hadoop to write fast Map/Reduce for pattern matching in data using suffix arrays.

[1]  Gediminas Adomavicius,et al.  New Recommendation Techniques for Multicriteria Rating Systems , 2007, IEEE Intelligent Systems.

[2]  Jun Wang,et al.  An improved method of keywords extraction based on short technology text , 2010, Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010).

[3]  Xiao Zhang,et al.  A high-level energy consumption model for heterogeneous data centers , 2013, Simul. Model. Pract. Theory.

[4]  Vivek Sarkar,et al.  HadoopCL: MapReduce on Distributed Heterogeneous Platforms through Seamless Integration of Hadoop and OpenCL , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[5]  Christophe Nicolle,et al.  Understandable Big Data: A survey , 2015, Comput. Sci. Rev..

[6]  Nisha Agrawal,et al.  Performance analysis between aparapi (a parallel API) and JAVA by implementing sobel edge detection Algorithm , 2013, 2013 National Conference on Parallel Computing Technologies (PARCOMPTECH).

[7]  S. Niwattanakul,et al.  Using of Jaccard Coefficient for Keywords Similarity , 2022 .

[8]  Jinjun Chen,et al.  KASR: A Keyword-Aware Service Recommendation Method on MapReduce for Big Data Applications , 2014, IEEE Transactions on Parallel and Distributed Systems.

[9]  Randy H. Katz,et al.  How Hadoop Clusters Break , 2013, IEEE Software.

[10]  Xiao Peng,et al.  A Low-Cost Power Measuring Technique for Virtual Machine in Cloud Environments , 2013 .

[11]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[12]  Jarmo Takala,et al.  OpenCL-based design methodology for application-specific processors , 2010, 2010 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation.

[13]  Gediminas Adomavicius,et al.  Context-aware recommender systems , 2008, RecSys '08.

[14]  Anna-Lan Huang,et al.  Similarity Measures for Text Document Clustering , 2008 .

[15]  Alexander Felfernig,et al.  Basic Approaches in Recommendation Systems , 2014, Recommendation Systems in Software Engineering.