Emergence of NoSQL Platforms for Big Data Needs

Big data is revolutionizing world in the age of Internet. The wide variety of areas like online businesses, electronic health management, social networking, demographics, geographic information systems, online education etc. are gaining insight from big data principles. Big data is comprised of heterogeneous datasets which are too large to be handled by traditional relational database systems. An important reason for explosion of interest in big data is that it has become cheap to store volumes of data and there is a major rise in computation capacity. To extract valuable patterns from big data, one needs to choose a right platform for capturing, organizing, searching and analyzing the context of voluminous data in combination with traditional enterprise database management systems. Different platforms supporting big data management by many software organizations enable easy use of services. These platforms mainly focus on data storage, management, processing, and distribution and on data analytics. Various NoSQL data stores like Cassandra, MongoDB and Hadoop HBASE etc. are in use today to acquire, manage, store and query big data. NoSQL databases are inherently schema-less and permit records to have variable number of fields, making them distinct from other non-relational databases like hierarchical databases and object-oriented databases. These are highly scalable and well suited for dynamic data structures. NoSQL data is characterized by being basically available and eventually consistent. The frameworks like MapReduce, Dryad etc. support processing of large amounts of data in parallel and hence the management and analysis of big data. The technologies like GNU R and Apache MAHOUT are also useful in exploring big data for finding relevant valuable patterns. This article aims at giving an overview of the rationales behind NoSQL movement as well as various big data platforms useful in today’s competitive world.

[1]  Ao Lei,et al.  Exploration on Big Data Oriented Data Analyzing and Processing Technology , 2013 .

[2]  Chen Li,et al.  Big data platforms: What's next? , 2012, XRDS.

[3]  Aart J. C. Bik,et al.  Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[4]  Chen Li,et al.  Inside "Big Data management": ogres, onions, or parfaits? , 2012, EDBT '12.

[5]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[6]  Michael Stonebraker,et al.  New opportunities for New SQL , 2012, CACM.

[7]  Rares Vernica,et al.  Hyracks: A flexible and extensible foundation for data-intensive computing , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[8]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[9]  Hans-Wolfgang Loidl,et al.  Comparing High Level MapReduce Query Languages , 2011, APPT.

[10]  Ana Paula Appel,et al.  Weights and Multi-Edges in Link Prediction , 2014 .

[11]  Clarence J M Tauro,et al.  Comparative Study of the New Generation, Agile, Scalable, High Performance NOSQL Databases , 2012 .

[12]  Eric A. Brewer,et al.  Towards robust distributed systems (abstract) , 2000, PODC '00.

[13]  Nijaz Bajgoric Server Operating Environment and Business Continuity Drivers , 2009 .

[14]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[15]  M. Courtney Puzzling out big data , 2012 .

[16]  John Wang,et al.  Encyclopedia of Business Analytics and Optimization , 2018 .

[17]  Nijaz Bajgoric Continuous Computing Technologies for Enhancing Business Continuity , 2008 .

[18]  Lakhmi Jain,et al.  Computational Economics: A Perspective from Computational Intelligence , 2006 .

[19]  Michael Stonebraker,et al.  SQL databases v. NoSQL databases , 2010, CACM.

[20]  Jaroslav Pokorný,et al.  NoSQL databases: a step to database scalability in web environment , 2011, iiWAS '11.

[21]  Sandeep Chanda,et al.  Introducing Non-Relational Databases , 2013 .

[22]  Nancy A. Lynch,et al.  Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services , 2002, SIGA.

[23]  Virginia M. Miori,et al.  The Pollyanna Problem: Assignment of Participants in a Gift Exchange , 2014, Int. J. Bus. Intell. Res..