Scalable Big Data Analysis in Cloud Environment: A Review

The mounting of industrial advancements have led to substantial quantity of data from unusual areas, like health care, user- generated data, internet, financial companies etc. The term big data was discovered to confine the significance of this rising trend. Due to its complete volume, big data exhibits exclusive personality as comparison to traditional data. Big data may be structured, unstructured or semi-structured which requires more analysis. Its processing is achieved due to its high volume, velocity, value, variety and veracity. It has a lot of Challenges. Thus, cloud computing is being deployed in order to regulate the big data requirements. The security and privacy is not well maintained in cloud. Big data systems are decomposed into four modules called data generation, data acquisition, data storage and data analysis. These modules are otherwise known as big data value chain. Big data is widely established in these recent years, and implemented Hadoop framework for addressing big data challenges.

[1]  Sailaja Arsi,et al.  SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING , 2014 .

[2]  J. Alberto Espinosa,et al.  Big Data: Issues and Challenges Moving Forward , 2013, 2013 46th Hawaii International Conference on System Sciences.

[3]  M Markus Maier,et al.  Towards a big data reference architecture , 2013 .

[4]  Peter Mork,et al.  From Data to Decisions: A Value Chain for Big Data , 2013, IT Professional.

[5]  Ivor W. Tsang,et al.  The Emerging "Big Dimensionality" , 2014, IEEE Computational Intelligence Magazine.

[6]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[7]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[8]  David J. DeWitt,et al.  Parallel database systems: the future of high performance database systems , 1992, CACM.

[9]  Yuri Demchenko,et al.  Architecture Framework and Components for the Big Data Ecosystem , 2013 .

[10]  Hector Garcia-Molina,et al.  Parallel crawlers , 2002, WWW.

[11]  Vice President,et al.  Big Data and Current Cloud Computing Issues and Challenges , 2014 .

[12]  Yi Zhang,et al.  Novelty and redundancy detection in adaptive filtering , 2002, SIGIR '02.

[13]  Dharm Singh Jat,et al.  Big data: Emerging technological paradigm and challenges , 2015, 2015 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC).

[14]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[15]  Manjit Kaur,et al.  BIG Data and Methodology-A review , 2013 .