Open Source Big Data Analytics Technique

In this mobile computing and business era, a huge amount of data is generated, which is Big Data. Such a large data becomes unmanageable and cannot be used for analytics using traditional methods. A large number of fields and sectors, ranging from economic and business activities, involve with Big Data problems. Big Data analytics is extremely valuable to make decisions for increasing productivity in businesses, which gives us a lot of opportunities to make great progresses in many fields. So, this paper discusses approaches and environments for carrying out analytics for Big Data applications. It revolves around important areas of analytics, Big Data, tools, and data base used. A comparative study is done and tabulated on parameters like Data Base used, real-time analytics, size, etc. Then on open source technology Kibana, Elastic search and JASON Query, a big data analytics experimental setup is done. Analytics is done in many dimensions like domain counts, percentile gross margins, sector-wise count, etc. Their drawn results are recorded and reported in form of graphs.

[1]  Joseph M. Hellerstein,et al.  MAD Skills: New Analysis Practices for Big Data , 2009, Proc. VLDB Endow..

[2]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[3]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[4]  Liang Dong,et al.  Starfish: A Self-tuning System for Big Data Analytics , 2011, CIDR.

[5]  Dilpreet Singh,et al.  A survey on platforms for big data analytics , 2014, Journal of Big Data.

[6]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[7]  Milind A. Bhandarkar,et al.  MapReduce programming with apache Hadoop , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[8]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[9]  Jun Bai,et al.  Feasibility analysis of big log data real time search based on Hbase and ElasticSearch , 2013, 2013 Ninth International Conference on Natural Computation (ICNC).

[10]  Divyakant Agrawal,et al.  Big data and cloud computing: current state and future opportunities , 2011, EDBT/ICDT '11.