Design of Declarative Graph Query Languages: On the Choice between Value, Pattern and Object Based Representations for Graphs

Graphs play a major role in many applications in business, science, and social sciences allowing modeling and analysis of interactions among objects and their structural relationships. The recent push for analytical methods to study inter-object associations in these domains has resulted in many interesting algorithms such as page rank and random walk, and systems such as TALE, SUMMA, QuickSI and C-Tree. While these algorithms address different aspects of graph analysis with varying degrees of effectiveness, their focus is not on declarative querying of graphs except GraphQL and IsoSearch. However, the first-order framework on which both GraphQL and IsoSearch are based does not allow for many queries of interest. In this paper, we argue that this limitation may be, at least partially, removed by implementing novel graph operators for relational databases. We present the idea of minimum hub covers as a basis for a compact and non-redundant representation of graphs in nested relational databases, and leverage this representation, called g-relations, toward graph operator implementation for graph isomorphisms and graph comparisons. This is an ongoing research and hence, depicts partial results.

[1]  Sherif Sakr,et al.  Graph indexing and querying: a review , 2010, Int. J. Web Inf. Syst..

[2]  Gabriel Valiente,et al.  Algorithms on Trees and Graphs , 2002, Springer Berlin Heidelberg.

[3]  Ehud Gudes,et al.  Subsea: an efficient heuristic algorithm for subgraph isomorphism , 2009, Data Mining and Knowledge Discovery.

[4]  Hasan M. Jamil A Novel Knowledge Representation Framework for Computing Sub-Graph Isomorphic Queries in Interaction Network Databases , 2009, 2009 21st IEEE International Conference on Tools with Artificial Intelligence.

[5]  Jiawei Han,et al.  On graph query optimization in large networks , 2010, Proc. VLDB Endow..

[6]  Wei Jin,et al.  SAPPER: Subgraph Indexing and Approximate Matching in Large Graphs , 2010, Proc. VLDB Endow..

[7]  Giuseppe Psaila,et al.  A New SQL-like Operator for Mining Association Rules , 1996, VLDB.

[8]  Jiawei Han,et al.  The Multi-Relational Skyline Operator , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[9]  Wei Wang,et al.  Graph Database Indexing Using Structured Graph Decomposition , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[10]  Wilfred Ng,et al.  Fg-index: towards verification-free query processing on graph databases , 2007, SIGMOD '07.

[11]  Dennis Shasha,et al.  GraphGrep: A fast and universal method for querying graphs , 2002, Object recognition supported by user interaction for service robots.

[12]  José Júlio Alferes,et al.  Abduction in well-founded semantics and generalized stable models via tabled dual programs , 2003, Theory and Practice of Logic Programming.

[13]  Jianzhong Li,et al.  Coding-based Join Algorithms for Structural Queries on Graph-Structured XML Document , 2008, World Wide Web.

[14]  Sourav S. Bhowmick,et al.  In the Search of NECTARs from Evolutionary Trees , 2009, DASFAA.

[15]  Xuemin Lin,et al.  NOVA: A Novel and Efficient Framework for Finding Subgraph Isomorphism Mappings in Large Graphs , 2010, DASFAA.

[16]  Hasan M. Jamil Computing subgraph isomorphic queries using structural unification and minimum graph structures , 2011, SAC '11.

[17]  Jeffrey Xu Yu,et al.  Taming verification hardness: an efficient algorithm for testing subgraph isomorphism , 2008, Proc. VLDB Endow..

[18]  Philip S. Yu,et al.  Graph indexing: a frequent structure-based approach , 2004, SIGMOD '04.

[19]  Kazi Zakia Sultana,et al.  IsoKEGG: A logic based system for querying biological pathways in KEGG , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[20]  Yannis E. Ioannidis,et al.  On the Computation of the Transitive Closure of Relational Operators , 1986, VLDB.

[21]  Philip S. Yu,et al.  GString: A Novel Approach for Efficient Search in Graph Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[22]  Dennis Shasha,et al.  SING: Subgraph search In Non-homogeneous Graphs , 2010, BMC Bioinformatics.

[23]  Shijie Zhang,et al.  TreePi: A Novel Graph Indexing Method , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[24]  Henry F. Korth,et al.  Query Languages for Nested Relational Databases , 1987, NF².

[25]  Jianzhong Li,et al.  Hash-base subgraph query processing method for graph-structured XML documents , 2008, Proc. VLDB Endow..

[26]  Roded Sharan,et al.  Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks , 2006, J. Comput. Biol..

[27]  Jignesh M. Patel,et al.  SAGA: a subgraph matching tool for biological graphs , 2007, Bioinform..

[28]  Mario Vento,et al.  An Improved Algorithm for Matching Large Graphs , 2001 .

[29]  Kaspar Riesen,et al.  Graph Embedding in Vector Spaces by Means of Prototype Selection , 2007, GbRPR.

[30]  Shijie Zhang,et al.  GADDI: distance index based subgraph matching in biological networks , 2009, EDBT '09.

[31]  Ambuj K. Singh,et al.  Graphs-at-a-time: query language and access methods for graph databases , 2008, SIGMOD Conference.

[32]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[33]  Katsuhisa Horimoto,et al.  BMC Systems Biology BioMed Central Methodology article , 2008 .

[34]  Jignesh M. Patel,et al.  TALE: A Tool for Approximate Large Graph Matching , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[35]  Nisheeth Shrivastava,et al.  Mining (Social) Network Graphs to Detect Random Link Attacks , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[36]  Hans-Jörg Schek,et al.  The relational model with relation-valued attributes , 1986, Inf. Syst..

[37]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[38]  Shijie Zhang,et al.  SUMMA: subgraph matching in massive graphs , 2010, CIKM.

[39]  Pavel Zezula,et al.  Indexing Structure for Graph-Structured Data , 2009, Mining Complex Data.

[40]  Georgia Garani Nest and Unnest Operators in Nested Relations , 2008, Data Sci. J..

[41]  Wojciech Szpankowski,et al.  Detecting Conserved Interaction Patterns in Biological Networks , 2006, J. Comput. Biol..

[42]  Konstantinos Sagonas,et al.  XSB as an efficient deductive database engine , 1994, SIGMOD '94.

[43]  Michael Garland,et al.  On the Visualization of Social and other Scale-Free Networks , 2008, IEEE Transactions on Visualization and Computer Graphics.

[44]  Serge Abiteboul,et al.  Deep Equality Revisited , 1995, DOOD.