State of Big Data Analysis in the Cloud

Big Data is data that either is too large, grows too fast, or does not fit into traditional architectures. Within such data can be valuable information that can be discovered through data analysis. With the emergence of cloud computing services, big data processing has become a less costly task. In this paper, we examine the current trends and characteristics of Big Data, its analysis and how these are presenting challenges in data collection, storage and management in cloud computing.

[1]  Jennifer Widom,et al.  Challenges and Opportunities with Big Data 2012-2 , 2011 .

[2]  Mingquan Wu,et al.  On Wide Area Network Optimization , 2012, IEEE Communications Surveys & Tutorials.

[3]  Lei Gao,et al.  Serving large-scale batch computed data with project Voldemort , 2012, FAST.

[4]  Prashant Malik,et al.  Cassandra: a decentralized structured storage system , 2010, OPSR.

[5]  Divyakant Agrawal,et al.  Big data and cloud computing: current state and future opportunities , 2011, EDBT/ICDT '11.

[6]  Jaroslav Pokorný,et al.  NoSQL databases: a step to database scalability in web environment , 2011, iiWAS '11.

[7]  Brian Tierney,et al.  Efficient data transfer protocols for big data , 2012, 2012 IEEE 8th International Conference on E-Science.

[8]  Melnned M. Kantardzic Big Data Analytics , 2013, Lecture Notes in Computer Science.

[9]  Keqiu Li,et al.  Big Data Processing in Cloud Computing Environments , 2012, 2012 12th International Symposium on Pervasive Systems, Algorithms and Networks.

[10]  Edmon Begoli,et al.  Design Principles for Effective Knowledge Discovery from Big Data , 2012, 2012 Joint Working IEEE/IFIP Conference on Software Architecture and European Conference on Software Architecture.

[11]  Pete Wyckoff,et al.  Hive - A Warehousing Solution Over a Map-Reduce Framework , 2009, Proc. VLDB Endow..

[12]  Aravind Menon,et al.  Big data @ facebook , 2012 .

[13]  Ravi Kumar,et al.  Pig latin: a not-so-foreign language for data processing , 2008, SIGMOD Conference.

[14]  Wesley Chou Optimizing the WAN between Branch Offices and the Data Center , 2009, IT Professional.

[15]  Ismail Ari,et al.  Data stream analytics and mining in the cloud , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[16]  J. Alberto Espinosa,et al.  Big Data: Issues and Challenges Moving Forward , 2013, 2013 46th Hawaii International Conference on System Sciences.