A Binary-based MapReduce Analysis for Cloud Logs

Abstract: Efficiently managing and analyzing cloud logs is difficult and expensive due to their growth in size and variety of formats. In this paper, we propose a binary-based approach for frequency mining of correlated attacks in log data. The approach is designed to work with the MapReduce programming model. We present initial experimental results, which serve as input to a data mining algorithm that helps predict the likelihood of correlated attacks.
