Extracting Fuzzy Summaries from NoSQL Graph Databases

Linguistic summaries have been studied for many years and allow to sum up large volumes of data in a very intuitive manner. They have been studied over several types of data. However, few works have been led on graph databases. Graph databases are becoming popular tools and have recently gained significant recognition with the emergence of the so-called NoSQL graph databases. These databases allow users to handle huge volumes of data (e.g., scientific data, social networks). There are several ways to consider graph summaries. In this paper, we detail the specificities of NoSQL graph databases and we discuss how to summarize them by introducing several types of linguistic summaries, namely structure summaries, data structure summaries and fuzzy summaries. We present extraction methods that have been tested over synthetic and real database experimentations.

[1]  Charu C. Aggarwal,et al.  Managing and Mining Graph Data , 2010, Managing and Mining Graph Data.

[2]  Rui Jorge Almeida,et al.  Linguistic summaries of categorical time series for septic shock patient data , 2013, 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[3]  Doina Caragea,et al.  Graph Databases , 2019, Encyclopedia of Big Data Technologies.

[4]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[5]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[6]  Claudio Gutierrez,et al.  Survey of graph database models , 2008, CSUR.

[7]  Evimaria Terzi,et al.  GraSS: Graph Structure Summarization , 2010, SDM.

[8]  A. John MINING GRAPH DATA , 2022 .

[9]  Mario Vento,et al.  Thirty Years Of Graph Matching In Pattern Recognition , 2004, Int. J. Pattern Recognit. Artif. Intell..

[10]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[11]  Slawomir Zadrozny,et al.  Linguistic database summaries and their protoforms: towards natural language based knowledge discovery tools , 2005, Inf. Sci..

[12]  Ronald R. Yager,et al.  A new approach to the summarization of data , 1982, Inf. Sci..

[13]  Luc De Raedt,et al.  A perspective on inductive databases , 2002, SKDD.

[14]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[15]  Frank Neven,et al.  Inferring XML Schema Definitions from XML Data , 2007, VLDB.

[16]  Bernadette Bouchon-Meunier,et al.  Fuzzy linguistic summaries: Where are we, where can we go? , 2012, 2012 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr).

[17]  Anna Wilbik,et al.  Tracking of multiple target types with a single neural extended Kalman filter , 2010 .

[18]  Ronald R. Yager,et al.  Finding fuzzy and gradual functional dependencies with SummarySQL , 1999, Fuzzy Sets Syst..

[19]  Lawrence B. Holder,et al.  Mining Graph Data: Cook/Mining Graph Data , 2006 .

[20]  Anne Laurent,et al.  Fuzzy Queries over NoSQL Graph Databases: Perspectives for Extending the Cypher Language , 2014, IPMU.