Storage Solutions for Big Data Systems: A Qualitative Study and Comparison

Big data systems development is full of challenges in view of the variety of application areas and domains that this technology promises to serve. Typically, fundamental design decisions involved in big data systems design include choosing appropriate storage and computing infrastructures. In this age of heterogeneous systems that integrate different technologies for optimized solution to a specific real world problem, big data system are not an exception to any such rule. As far as the storage aspect of any big data system is concerned, the primary facet in this regard is a storage infrastructure and NoSQL seems to be the right technology that fulfills its requirements. However, every big data application has variable data characteristics and thus, the corresponding data fits into a different data model. This paper presents feature and use case analysis and comparison of the four main data models namely document oriented, key value, graph and wide column. Moreover, a feature analysis of 80 NoSQL solutions has been provided, elaborating on the criteria and points that a developer must consider while making a possible choice. Typically, big data storage needs to communicate with the execution engine and other processing and visualization technologies to create a comprehensive solution. This brings forth second facet of big data storage, big data file formats, into picture. The second half of the research paper compares the advantages, shortcomings and possible use cases of available big data file formats for Hadoop, which is the foundation for most big data computing technologies. Decentralized storage and blockchain are seen as the next generation of big data storage and its challenges and future prospects have also been discussed.

[1]  Norbert Ritter,et al.  Real-Time Data Management for Big Data , 2018, EDBT.

[2]  Behrouz Sefid-Dashti,et al.  Thrift Service Composition: Toward Extending BPEL , 2018, SCIOT '18.

[3]  Rohit Anand,et al.  Big Data Analysis and Compression for Indoor Air Quality , 2019 .

[4]  Parmeet Kaur,et al.  Empirical Investigation of Trends in NoSQL-Based Big-Data Solutions in the Last Decade , 2018, 2018 Eleventh International Conference on Contemporary Computing (IC3).

[5]  John Kolb,et al.  Steel: Simplified Development and Deployment of Edge-Cloud Applications , 2018, HotCloud.

[6]  Bela Stantic,et al.  Precomputing architecture for flexible and efficient big data analytics , 2018, Vietnam Journal of Computer Science.

[7]  Mouad Banane,et al.  A Survey on RDF Data Store Based on NoSQL Systems for the Semantic Web Applications , 2018 .

[8]  Ganesh Chandra Deka Chapter Three - NoSQL Web Crawler Application , 2018, Adv. Comput..

[9]  Javed Ferzund,et al.  Modern Data Formats for Big Bioinformatics Data Analytics , 2017, ArXiv.

[10]  Mansaf Alam,et al.  Big Data Computing Using Cloud-Based Technologies, Challenges and Future Perspectives , 2017, ArXiv.

[11]  Jorge Bernardino,et al.  Graph Databases Comparison: AllegroGraph, ArangoDB, InfiniteGraph, Neo4J, and OrientDB , 2018, DATA.

[12]  Mansaf Alam,et al.  A survey on scholarly data: From big data perspective , 2017, Inf. Process. Manag..

[13]  Beng Chin Ooi,et al.  Chain of Trust: Can Trusted Hardware Help Scaling Blockchains? , 2018, ArXiv.

[14]  Jyotir Moy Chatterjee,et al.  NoSQL Database Classification: New Era of Databases for Big Data , 2019, Int. J. Knowl. Based Organ..

[15]  Kristina Chodorow,et al.  MongoDB - The Definitive Guide: Powerful and Scalable Data Storage , 2019 .

[16]  Pabitra Mitra,et al.  The big data system, components, tools, and technologies: a survey , 2018, Knowledge and Information Systems.

[17]  H. Peter Hofstee,et al.  Supporting Columnar In-memory Formats on FPGA: The Hardware Design of Fletcher for Apache Arrow , 2019, ARC.

[18]  Rupali Ahuja Hadoop Framework for Handling Big Data Needs , 2018 .

[19]  Rob Reagan Azure Data Storage Overview , 2018 .

[20]  Monther Aldwairi,et al.  IMPLEMENTING BI-TEMPORAL PROPERTIES INTO VARIOUS NOSQL DATABASE CATEGORIES , 2019, International Journal of Computing.

[21]  Claudio Gutierrez,et al.  An introduction to Graph Data Management , 2017, Graph Data Management.

[22]  Pavel Masek,et al.  Performance testing of NoSQL and RDBMS for storing big data in e-applications , 2018, 2018 3rd International Conference on Intelligent Green Building and Smart Grid (IGBSG).

[23]  Rami Melhem,et al.  Dynamic Elasticity for Distributed Graph Analytics , 2018 .

[24]  Massimo Carro,et al.  NoSQL Databases , 2014, ArXiv.

[25]  Liu Chen,et al.  A Survey on NoSQL Stores , 2018, ACM Comput. Surv..

[26]  Jordi Cabot,et al.  Advanced prefetching and caching of models with PrefetchML , 2019, Software & Systems Modeling.

[27]  Ee-Chien Chang,et al.  Towards Scaling Blockchain Systems via Sharding , 2018, SIGMOD Conference.

[28]  Stanley M. Huff,et al.  The Pluripotent Rendering of Clinical Data for Precision Medicine , 2017, MedInfo.

[29]  Juan M. Corchado,et al.  Tendencies of Technologies and Platforms in Smart Cities: A State-of-the-Art Review , 2018, Wirel. Commun. Mob. Comput..

[30]  Hellen Adams,et al.  Patent and Trademark Office , 2017 .

[31]  Jorge Bernardino,et al.  Consistency Models of NoSQL Databases , 2019, Future Internet.

[32]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[33]  Zibin Zheng,et al.  Blockchain challenges and opportunities: a survey , 2018, Int. J. Web Grid Serv..

[34]  Juan Sequeda,et al.  G-CORE: A Core for Future Graph Query Languages , 2017, SIGMOD Conference.

[35]  Galyna Kondratenko,et al.  Multi-criteria decision making for selecting a rational IoT platform , 2018, 2018 IEEE 9th International Conference on Dependable Systems, Services and Technologies (DESSERT).

[36]  Vivek Shrivastava,et al.  Survey on RDBMS and NoSQL Databases , 2018 .

[37]  Cevdet Aykanat,et al.  Scaling sparse matrix-matrix multiplication in the accumulo database , 2019, Distributed and Parallel Databases.

[38]  Kostas E. Psannis,et al.  Secure integration of IoT and Cloud Computing , 2018, Future Gener. Comput. Syst..

[39]  Michael Hippler,et al.  pymzML v2.0: introducing a highly compressed and seekable gzip format , 2018, Bioinform..

[40]  Michael Hausenblas,et al.  Apache Drill: Interactive Ad-Hoc Analysis at Scale , 2013, Big Data.

[41]  Javier Vargas,et al.  Performance Evaluation of NoSQL and SQL Queries in Response Time for the E-government , 2018, 2018 International Conference on eDemocracy & eGovernment (ICEDEG).

[42]  Gidon Gershinsky Efficient Analytics on Encrypted Data , 2018, SYSTOR.

[43]  Xi Chen,et al.  A Distributed Cooperative Control Framework for Synchronized Reconnection of a Multi-Bus Microgrid , 2017, IEEE Transactions on Smart Grid.

[44]  Syed Akhter Hossain,et al.  NoSQL Database: New Era of Databases for Big data Analytics - Classification, Characteristics and Comparison , 2013, ArXiv.

[45]  Eduardo Castelló Ferrer The blockchain: a new framework for robotic swarm systems , 2016, Proceedings of the Future Technologies Conference (FTC) 2018.

[46]  Neal Leavitt,et al.  Will NoSQL Databases Live Up to Their Promise? , 2010, Computer.

[47]  Mirosław Płaza,et al.  Analysis of parallel computational models for clustering , 2018, Symposium on Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments (WILGA).

[48]  Xiaoqin Yin,et al.  Research and Application of Large Data Query Technology Based on NoSQL Database , 2018 .

[49]  Jeremy Kepner,et al.  A Billion Updates per Second Using 30, 000 Hierarchical In-Memory D4M Databases , 2019, ArXiv.

[50]  Hidayat Ur Rahman,et al.  Programming and Pre-Processing Systems for Big Data Storage and Visualization , 2018 .

[51]  Travis E. Oliphant,et al.  Python for Scientific Computing , 2007, Computing in Science & Engineering.

[52]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[53]  Hao Jiang,et al.  Boosting data filtering on columnar encoding with SIMD , 2018, DaMoN.

[54]  Zach Smith Joining and aggregating datasets using CouchDB , 2018 .

[55]  Oliver Gutsche,et al.  Striped Data Server for Scalable Parallel Data Analysis , 2017 .

[56]  Abhishek Kumar,et al.  Comparison Study of Different NoSQL and Cloud Paradigm for Better Data Storage Technology , 2018 .

[57]  Markus Krötzsch,et al.  Practical Linked Data Access via SPARQL: The Case of Wikidata , 2018, LDOW@WWW.

[58]  Priyanka Chawla,et al.  Analyzing Performance of Apache Pig and Apache Hive with Hadoop , 2019 .

[59]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[60]  Partha Pratim Ray A survey on Internet of Things architectures , 2018, J. King Saud Univ. Comput. Inf. Sci..

[61]  Engin Zeydan,et al.  An Artificial Intelligence Enabled Data Analytics Platform for Digital Advertisement , 2019, 2019 22nd Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN).

[62]  Ivan Perl,et al.  Design Of Specialized Storage for Heterogeneous Project Data , 2018, 2018 23rd Conference of Open Innovations Association (FRUCT).

[63]  Brian Donohue,et al.  Towards a methodology for lossless data exchange between NoSQL data structures , 2018, Defense + Security.

[64]  Alza A. Mahmood Automated Algorithm for Data Migration from Relational to NoSQL Databases , 2018 .

[65]  Hassan Hajjdiab,et al.  Cloud Storage Comparative Analysis Amazon Simple Storage vs. Microsoft Azure Blob Storage , 2018 .

[66]  Samiya Khan,et al.  Educational intelligence: Applying cloud-based big data analytics to the Indian education sector , 2016, 2016 2nd International Conference on Contemporary Computing and Informatics (IC3I).

[67]  Vellingiri Jayagopal,et al.  Data Management and Big Data Analytics: Data Management in Digital Economy , 2019 .

[68]  Vinod Shokeen,et al.  Comparison of Approaches of Distributed Satellite Image Edge Detection on Hadoop , 2018, 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT).

[69]  Ivan Kovacevic,et al.  Novel BI data architectures , 2018, 2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[70]  Martin Grund,et al.  Impala: A Modern, Open-Source SQL Engine for Hadoop , 2015, CIDR.

[71]  Min Gu,et al.  Optical storage arrays: a perspective for future big data storage , 2014, Light: Science & Applications.

[72]  J Prabhu,et al.  Comparison of NoSQL Database and Traditional Database-An emphatic analysis , 2018 .

[73]  Rupali Gill,et al.  Performance Analysis of RDBMS and Hadoop Components with Their File Formats for the Development of Recommender Systems , 2018, 2018 3rd International Conference for Convergence in Technology (I2CT).

[74]  Pierluigi Siano,et al.  A Survey and Evaluation of the Potentials of Distributed Ledger Technology for Peer-to-Peer Transactive Energy Exchanges in Local Energy Markets , 2019, IEEE Systems Journal.

[75]  Yunjun Gao,et al.  UlTraMan: A Unified Platform for Big Trajectory Data Management and Analytics , 2018, Proc. VLDB Endow..

[76]  Mohammed El Malki,et al.  MPT: Suite Tools to Support Performance Tuning in NoSQL Systems , 2018, ICEIS.

[77]  Nur Izura Udzir,et al.  Security of NoSQL Database Against Intruders , 2019, Recent Patents on Engineering.

[78]  Junshan Li,et al.  Research on NoSQL Database Technology , 2018 .

[79]  Sharayu Moharir,et al.  Effects of storage heterogeneity in distributed cache systems , 2018, 2018 16th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt).

[80]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[81]  Francesc D. Muñoz-Escoí,et al.  CAP Theorem: Revision of Its Related Consistency Models , 2019, Comput. J..

[82]  Andrea C. Arpaci-Dusseau,et al.  Revisiting Concurrency in High-Performance NoSQL Databases , 2018, HotStorage.

[83]  Muhammad Imran,et al.  Managing big RDF data in clouds: Challenges, opportunities, and solutions , 2018 .

[84]  Klas Segeljakt,et al.  A Scala DSL for Rust code generation , 2018 .

[85]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[86]  Mark Hwang Graph Processing Using SAP HANA: A Teaching Case. , 2018 .

[87]  Naw Thiri Wai Khin,et al.  Query Classification based Information Retrieval System , 2018, 2018 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS).

[88]  Jens Lehmann,et al.  A Stitch in Time Saves Nine - SPARQL querying of Property Graphs using Gremlin Traversals , 2018, ArXiv.

[89]  Maria Fazio,et al.  An OAIS-Based Hospital Information System on the Cloud: Analysis of a NoSQL Column-Oriented Approach , 2018, IEEE Journal of Biomedical and Health Informatics.

[90]  Abdessamad Belangour,et al.  Meta-modeling of Zookeeper and MapReduce processing , 2018, 2018 International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS).

[91]  Maristela Holanda,et al.  Graph Databases in Molecular Biology , 2018, BSB.

[92]  Mohd Abdul Ahad,et al.  Handling Small Size Files in Hadoop: Challenges, Opportunities, and Review , 2018, Soft Computing in Data Analytics.

[93]  S. P. Pingat,et al.  PHOTOGROUP: DECENTRALIZED WEB APPLICATION USING ETHEREUM BLOCKCHAIN , 2018 .

[94]  Xi Zheng Database as a Service - Current Issues and Its Future , 2018, ArXiv.

[95]  Huimin Lu,et al.  Brain Intelligence: Go beyond Artificial Intelligence , 2017, Mobile Networks and Applications.

[96]  Mário Rodrigues,et al.  Experimental Evaluation of Big Data Analytical Tools , 2018, EMCIS.

[97]  Earl E. Swartzlander,et al.  Data Compression Device Based on Modified LZ4 Algorithm , 2018, IEEE Transactions on Consumer Electronics.

[98]  Gur Mauj Saran Srivastava,et al.  “Big” Data Management in Cloud Computing Environment , 2018, Harmony Search and Nature Inspired Optimization Algorithms.

[99]  Martin Brain,et al.  Application of Abstract Interpretation to the Automotive Electronic Control System , 2019, VMCAI.

[100]  Konstantinos Tserpes,et al.  Performance Evaluation of MongoDB and PostgreSQL for Spatio-temporal Data , 2019, EDBT/ICDT Workshops.

[101]  Paolo Atzeni,et al.  Data Modeling Across the Evolution of Database Technology , 2018, A Comprehensive Guide Through the Italian Database Research.

[102]  Takaaki Goto,et al.  A Framework to Convert NoSQL to Relational Model , 2018, ACIT 2018.

[103]  Bogdan Iancu,et al.  Saving Large Semantic Data in Cloud: A Survey of the Main DBaaS Solutions , 2018 .

[104]  Muhammad Younas,et al.  Testing of transactional services in NoSQL key-value databases , 2018, Future Gener. Comput. Syst..

[105]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[106]  Dong Ryeol Shin,et al.  Deep Learning Frameworks on Apache Spark: A Review , 2019 .

[107]  Octavian Postolache,et al.  Big Data Storage for a Health Predictive System , 2018, 2018 International Symposium in Sensing and Instrumentation in IoT Era (ISSI).

[108]  Mohd Vasim Ahamad,et al.  Big Data Analytics Tools and Platform in Big Data Landscape , 2018 .

[109]  Moditha Hewasinghage,et al.  Modeling Strategies for Storing Data in Distributed Heterogeneous NoSQL Databases , 2018, ER.

[110]  Marimuthu Palaniswami,et al.  Real-Time Urban Microclimate Analysis Using Internet of Things , 2018, IEEE Internet of Things Journal.

[111]  Siu-Ming Yiu,et al.  Decentralized Search on Decentralized Web , 2018, CIDR.

[112]  Sebastian Link,et al.  SQL schema design: foundations, normal forms, and normalization , 2018, Inf. Syst..

[113]  Raju Kumar Mishra The Era of Big Data, Hadoop, and Other Big Data Processing Frameworks , 2018 .

[114]  Rahul Batra A History of SQL and Relational Databases , 2018 .

[115]  Spyros Sioutas,et al.  A NoSQL Database Approach for Modeling Heterogeneous and Semi-Structured Information , 2018, 2018 9th International Conference on Information, Intelligence, Systems and Applications (IISA).

[116]  Victor I. Chang,et al.  A Proposed Solution and Future Direction for Blockchain-Based Heterogeneous Medicare Data in Cloud Environment , 2018, Journal of Medical Systems.

[117]  J. Ritchie Carroll,et al.  A COMPARISON OF PHASOR COMMUNICATIONS PROTOCOLS , 2019 .

[118]  Genaína Nunes Rodrigues,et al.  Statistical analysis of Amazon EC2 cloud pricing models , 2019, Concurr. Comput. Pract. Exp..

[119]  Juan José Cuadrado Gallego,et al.  Analysis of the impact of file formats for open data analytics efficiency: a case study with R , 2016 .

[120]  Alexander Shraer,et al.  FoundationDB Record Layer: A Multi-Tenant Structured Datastore , 2019, SIGMOD Conference.

[121]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[122]  Ryan M White Open Data Standards for Administrative Data Processing , 2018 .

[123]  Bidyut Biman Sarkar,et al.  NoSQL Overview and Performance Testing of HBase Over Multiple Nodes with MySQL , 2018, Advances in Intelligent Systems and Computing.

[124]  Gergely Mezei,et al.  Benchmarking Graph Database Backends - What Works Well with Wikidata? , 2019, Acta Cybern..

[125]  Kostas Magoutis,et al.  Replica-Group Leadership Change as a Performance Enhancing Mechanism in NoSQL Data Stores , 2018, 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS).

[126]  Lars George,et al.  HBase - The Definitive Guide: Random Access to Your Planet-Size Data , 2011 .

[127]  Stephen W. Keckler,et al.  Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks , 2017, 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA).

[128]  Min Song,et al.  Understanding User’s Interests in NoSQL Databases in Stack Overflow , 2018 .

[129]  Rajkumar Buyya,et al.  Next generation cloud computing: New trends and research directions , 2017, Future Gener. Comput. Syst..

[130]  Rajkumar Buyya,et al.  Big Data computing and clouds: Trends and future directions , 2013, J. Parallel Distributed Comput..

[131]  Rinkle Rani,et al.  Comparative study of NoSQL databases for big data storage , 2018 .

[132]  Ian T. Foster,et al.  Performance, Resilience, and Security in Moving Data from the Fog to the Cloud: The DYNAMO Transfer Framework Approach , 2018, IDCS.

[133]  Mehmet S. Aktas,et al.  On the Performance Analysis of Map-Reduce Programming Model on In-Memory NoSQL Storage Platforms: A Case Study , 2018, 2018 International Congress on Big Data, Deep Learning and Fighting Cyber Terrorism (IBIGDELFT).

[134]  Bibhudatta Sahoo,et al.  Storage Size Estimation for Schemaless Big Data Applications: A JSON-based Overview , 2018 .

[135]  Khalid Moussaid,et al.  A Performed Load Balancing Algorithm for Public Cloud Computing Using Ant Colony Optimization , 2018 .

[136]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[137]  Anwar ul Haque,et al.  Performance Comparison of State of Art NoSql Technologies Using Apache Spark , 2018, IntelliSys.

[138]  Vagelis Hristidis,et al.  A Comparative Study of Secondary Indexing Techniques in LSM-based NoSQL Databases , 2018, SIGMOD Conference.

[139]  Chen Liang,et al.  An optimized data hiding scheme for Deflate codes , 2018, Soft Comput..

[140]  Seema Rawat,et al.  Analytical Planning and Implementation of Big Data Technology Working at Enterprise Level , 2018 .

[141]  Sanjay L. Nalbalwar,et al.  Fusion Execution of NaCl on Tree-Shaped MSA , 2018 .

[142]  Yanlong Zhai,et al.  Hadoop Perfect File: A fast access container for small files with direct in disc metadata access , 2019, ArXiv.

[143]  Cole Hudson,et al.  Avro: Overview and Implications for Metadata Processing , 2018 .

[144]  Oscar Novo,et al.  Blockchain Meets IoT: An Architecture for Scalable Access Management in IoT , 2018, IEEE Internet of Things Journal.

[145]  Jeang-Kuo Chen,et al.  A Study of NoSQL Database for Enterprises , 2018, 2018 International Symposium on Computer, Consumer and Control (IS3C).

[146]  Valentin Puente,et al.  Memory Hierarchy Characterization of NoSQL Applications through Full-System Simulation , 2018, IEEE Transactions on Parallel and Distributed Systems.

[147]  Gugulothu Narsimha,et al.  Miscegenation of scalable and DEP3K performance evaluation of nosql-cassandra for bigdata applications deployed in cloud , 2018 .

[148]  Kun Hu,et al.  A Progressive Web Application on Ancient Roman Empire Coins and Relevant Historical Figures with Graph Database , 2018, EuroMed.

[149]  Pedro Martins,et al.  A Study over NoSQL Performance , 2019, WorldCIST.

[150]  Jim Webber,et al.  A programmatic introduction to Neo4j , 2018, SPLASH '12.

[151]  Debajyoti Pal,et al.  Big Data in Smart-Cities: Current Research and Challenges , 2018, Indonesian Journal of Electrical Engineering and Informatics (IJEEI).

[152]  Peter A. Flach,et al.  A Big Data platform for smart meter data analytics , 2019, Comput. Ind..

[153]  Muhammad Shiraz,et al.  Big Data: Survey, Technologies, Opportunities, and Challenges , 2014, TheScientificWorldJournal.

[154]  Mehreen Ansar,et al.  Biometric Encryption in Cloud Computing: A Systematic Review , 2018 .

[155]  Maria Cláudia Reis Cavalcanti,et al.  Data modeling for analytical queries on document-oriented DBMS , 2018, SAC.

[156]  Mansaf Alam,et al.  Cloud based Big Data Analytics: A Survey of Current Research and Future Directions , 2015, ArXiv.

[157]  Paolo Tell,et al.  From Scrum to Agile: a journey to tackle the challenges of distributed development in an Agile team , 2018, ICSSP.

[158]  Yaling Zhang,et al.  A Blockchain-Based Framework for Data Sharing With Fine-Grained Access Control in Decentralized Storage Systems , 2018, IEEE Access.

[159]  Marten van Sinderen,et al.  Semantic Interoperability for the IoT: Analysis of JSON for Linked Data , 2018 .

[160]  S. Kami Makki,et al.  Measuring the Performance of Data Placement Structures for MapReduce-based Data Warehousing Systems , 2018 .

[161]  Adolfo Muñoz,et al.  Executing Complexity-Increasing Queries in Relational (MySQL) and NoSQL (MongoDB and EXist) Size-Growing ISO/EN 13606 Standardized EHR Databases , 2018, Journal of visualized experiments : JoVE.

[162]  Milko Marinov,et al.  NoSQL approach for sensor data storage and retrieval , 2018, 2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[163]  Stefan Plantikow,et al.  Cypher: An Evolving Query Language for Property Graphs , 2018, SIGMOD Conference.