Comparison of Graph Databases and Relational Databases When Handling Large-Scale Social Data

Over the past few years, the rapid development of mobile technology has led more people to use mobile social applications such as Facebook, Twitter and Weibo in their daily lives, and the amount of social data keeps growing. Finding a suitable way to store and process this data, especially at large scale, is therefore important for social network companies. Relational databases, which represent data as tables, have traditionally been widely used in legacy applications. Graph databases, a kind of NoSQL database, are developing rapidly to handle the growing amount of unstructured and semi-structured data. Each storage approach has its own advantages: a relational database is the more mature technology, while a graph database handles graph-like data more easily. This research compares the capabilities of relational databases and graph databases for storing and processing large-scale social data. The comparison combines two kinds of analysis: a quantitative analysis of storage cost and execution time, and a qualitative analysis of five criteria, namely maturity, ease of programming, flexibility, security and data visualization. A simple mobile social application is also developed for the experiments. The comparison is intended to determine which kind of database is more suitable for handling large-scale social data, and future research can extend it to more graph database models and to real-world social data sets.
