A Survey on NoSQL Databases

NoSQL databases have gained popularity in the recent years and have been successful in many production systems. The goal of this document is to understand the current needs that have led to the evolution of NoSQL data stores, why relational database systems were not able to meet these requirements and a brief discussion of some of the successful NoSQL data stores. We will study the common concepts underlying these data stores and how they compromise on ACID properties to achieve high scalability and availability. We also look at how the database community looks at this evolution: will it supersede the RDBMS (or) just a passing cloud?

[1]  E. F. CODD,et al.  A relational model of data for large shared data banks , 1970, CACM.

[2]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[3]  Michael Stonebraker,et al.  The Case for Shared Nothing , 1985, HPTS.

[4]  David R. Karger,et al.  Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web , 1997, STOC '97.

[5]  Eric A. Brewer,et al.  Towards robust distributed systems (abstract) , 2000, PODC '00.

[6]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[7]  Nancy A. Lynch,et al.  Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services , 2002, SIGA.

[8]  GhemawatSanjay,et al.  The Google file system , 2003 .

[9]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[10]  Michael Stonebraker,et al.  The End of an Architectural Era (It's Time for a Complete Rewrite) , 2007, VLDB.

[11]  Douglas Stott Parker,et al.  Map-reduce-merge: simplified relational data processing on large clusters , 2007, SIGMOD '07.

[12]  Werner Vogels,et al.  Eventually consistent , 2008, CACM.

[13]  A. Lakshman,et al.  Cassandra: a structured storage system on a P2P network , 2009, ACM Symposium on Parallelism in Algorithms and Architectures.

[14]  Hans De Sterck,et al.  Case Study of Scientific Data Processing on a Cloud Using Hadoop , 2009, HPCS.

[15]  Michael Stonebraker,et al.  SQL databases v. NoSQL databases , 2010, CACM.

[16]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[17]  Neal Leavitt,et al.  Will NoSQL Databases Live Up to Their Promise? , 2010, Computer.

[18]  Ruichun Hou,et al.  Cache and consistency in NOSQL , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[19]  Doug Woos Consistent Hashing , 2011, Encyclopedia of Parallel Computing.

[20]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[21]  Chen Feng,et al.  CCIndex for Cassandra: A Novel Scheme for Multi-dimensional Range Queries in Cassandra , 2011, 2011 Seventh International Conference on Semantics, Knowledge and Grids.

[22]  Michael Stonebraker,et al.  10 rules for scalable performance in 'simple operation' datastores , 2011, Commun. ACM.

[23]  Massimo Carro,et al.  NoSQL Databases , 2014, ArXiv.

[24]  Follow Us on Twitter at Http://twitter.com/blogcacm from Michael Stonebraker's " the Nosql Discussion Has Nothing to Do with Sql " , .