Quo Vadis, Web Queries?

Various query languages for Web and Semantic Web data, both for practical use and as an area of research in the scientific community, have emerged in recent years. At the same time, the broad adoption of the internet where keyword search is used in many applications, e.g. search engines, has familiarized casual users with using keyword queries to retrieve information on the internet. Unlike this easy-to-use querying, traditional query languages require knowledge of the language itself as well as of the data to be queried. Keyword-based query languages for XML and RDF bridge the gap between the two, aiming at enabling simple querying of semi-structured data, which is relevant e.g. in the context of the emerging Semantic Web. This article presents an overview of traditional query languages for XML and RDF, focused on emerging preeminent exemplars in each field, and contrasts these languages with the field of keyword querying for XML and RDF.

[1]  Tim Furche,et al.  Evaluating Complex Queries Against XML Streams with Polynomial Combined Complexity , 2004, BNCOD.

[2]  Felix Weigel A Survey of Indexing Techniques for Semistructured Documents, Institute of Computer Science, LMU, Mu , 2002 .

[3]  Axel Polleres,et al.  XSPARQL: Traveling between the XML and RDF Worlds - and Avoiding the XSLT Pilgrimage , 2008, ESWC.

[4]  Patrick E. O'Neil,et al.  ORDPATHs: insert-friendly XML node labels , 2004, SIGMOD '04.

[5]  Gabriel M. Kuper,et al.  Structural properties of XPath fragments , 2003, Theor. Comput. Sci..

[6]  Yi Chen,et al.  Identifying meaningful return information for XML keyword search , 2007, SIGMOD '07.

[7]  Julie Beth Lovins,et al.  Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.

[8]  Georg Gottlob,et al.  The complexity of XPath query evaluation , 2003, PODS.

[9]  Sihem Amer-Yahia,et al.  XML retrieval: db/ir in theory, web in practice , 2007, VLDB.

[10]  Chong Wang,et al.  SPARK: Adapting Keyword Query to Semantic Search , 2007, ISWC/ASWC.

[11]  Tim Furche,et al.  RDF Querying: Language Constructs and Evaluation Methods Compared , 2006, Reasoning Web.

[12]  Sherif Sakr,et al.  XQuery on SQL Hosts , 2004, VLDB.

[13]  Tok Wang Ling,et al.  On boosting holism in XML twig pattern matching using structural indexing techniques , 2005, SIGMOD '05.

[14]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[15]  Claudio Gutiérrez,et al.  Querying RDF Data from a Graph Database Perspective , 2005, ESWC.

[16]  Torsten Grust,et al.  MonetDB/XQuery: a fast XQuery processor powered by a relational engine , 2006, SIGMOD Conference.

[17]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[18]  Jianyong Wang,et al.  Effective keyword search for valuable lcas over xml documents , 2007, CIKM '07.

[19]  Yehoshua Sagiv,et al.  XSEarch: A Semantic Search Engine for XML , 2003, VLDB.

[20]  Dan Olteanu,et al.  SPEX: Streamed and Progressive Evaluation of XPath , 2007, IEEE Transactions on Knowledge and Data Engineering.

[21]  Vassilis Christophides,et al.  Viewing the Semantic Web through RVL Lenses , 2003, SEMWEB.

[22]  Tim Furche,et al.  Querying the standard and Semantic Web using Xcerpt and visXcerpt , 2005 .

[23]  Andy Seaborne,et al.  Three Implementations of SquishQL, a Simple RDF Query Language , 2002, SEMWEB.

[24]  Ur Informatik,et al.  A Gentle Introduction into Xcerpt, a Rule-Based Query and Transformation Language for XML , 2002 .

[25]  Alin Deutsch,et al.  The NEXT Logical Framework for XQuery , 2004, VLDB.

[26]  Torsten. Grust,et al.  Accelerating XPath location steps , 2002, SIGMOD '02.

[27]  Alin Deutsch,et al.  XML-QL: A Query Language for XML , 1998 .

[28]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[29]  Georg Gottlob,et al.  Monadic queries over tree-structured data , 2002, Proceedings 17th Annual IEEE Symposium on Logic in Computer Science.

[30]  Ioana Manolescu,et al.  Integrating Keyword Search into XML Query Processing , 2000, BDA.

[31]  Thomas Schwentick,et al.  XPath query containment , 2004, SGMD.

[32]  Ulf Leser,et al.  Fast and practical indexing and querying of very large graphs , 2007, SIGMOD '07.

[33]  Georg Gottlob,et al.  The complexity of acyclic conjunctive queries , 2001, JACM.

[34]  Michael J. Carey,et al.  The BEA streaming XQuery processor , 2004, The VLDB Journal.

[35]  Jan Hidders Satisfiability of XPath Expressions , 2003, DBPL.

[36]  Alessandro Campi,et al.  Design and implementation of a graphical interface to XQuery , 2003, SAC '03.

[37]  Georg Lausen,et al.  Access to Objects by Path Expressions and Rules , 1994, VLDB.

[38]  Vassilis Christophides,et al.  Declarative Languages for Querying Portal Catalogs , 2000, DELOS.

[39]  Torsten Grust,et al.  Jump Through Hoops to Grok the Loops Pathfinder ’ s Purely Relational Account of XQuery-style Iteration Semantics , 2008 .

[40]  Karen Sparck Jones Automatic keyword classification for information retrieval , 1971 .

[41]  Peter T. Wood,et al.  On the Equivalence of XML Patterns , 2000, Computational Logic.

[42]  Cong Yu,et al.  Schema-Free XQuery , 2004, VLDB.

[43]  Marcelo Arenas,et al.  Semantics and complexity of SPARQL , 2006, TODS.

[44]  Tim Furche Implementation of web query languages reconsidered: beyond tree and single-language algebras at (almost) no cost , 2008 .

[45]  Yehoshua Sagiv,et al.  Interconnection semantics for keyword search in XML , 2005, CIKM '05.

[46]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .

[47]  François Bry,et al.  Querying the Web Reconsidered: A Practical Introduction to Xcerpt , 2004, Extreme Markup Languages®.

[48]  Wei-Ying Ma,et al.  Probabilistic query expansion using query logs , 2002, WWW '02.

[49]  Alexander Borgida,et al.  Efficient management of transitive relationships in large data and knowledge bases , 1989, SIGMOD '89.

[50]  John Davies,et al.  QuizRDF: search technology for the semantic Web , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[51]  W. Bruce Croft,et al.  An Association Thesaurus for Information Retrieval , 1994, RIAO.

[52]  Letizia Tanca,et al.  XML document summarization: using XQuery for synopsis creation , 2004, Proceedings. 15th International Workshop on Database and Expert Systems Applications, 2004..

[53]  Georg Gottlob,et al.  XPath query evaluation: improving time and space efficiency , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[54]  Shaul Dar,et al.  DTL's DataSpot: Database Exploration Using Plain Language , 1998, VLDB.

[55]  Michael Benedikt,et al.  XPath leashed , 2009, CSUR.

[56]  Alon Y. Halevy,et al.  Efficient query reformulation in peer data management systems , 2004, SIGMOD '04.

[57]  Daniela Florescu,et al.  Quilt: An XML Query Language for Heterogeneous Data Sources , 2000, WebDB.

[58]  gillian austen ILRT: The Institute for Learning and Research Technology , 1996 .

[59]  Sihem Amer-Yahia,et al.  Texquery: a full-text search extension to xquery , 2004, WWW '04.

[60]  Tim Furche,et al.  Xcerpt and visXcerpt: Twin Query Languages for the Semantic Web , 2004 .

[61]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[62]  Peter Fankhauser XQuery formal semantics state and challenges , 2001, SGMD.

[63]  Norman May,et al.  Quantifiers in XQuery , 2003, Proceedings of the Fourth International Conference on Web Information Systems Engineering, 2003. WISE 2003..

[64]  Divesh Srivastava,et al.  Holistic twig joins: optimal XML pattern matching , 2002, SIGMOD '02.

[65]  Steffen Staab,et al.  Networked graphs: a declarative mechanism for SPARQL rules, SPARQL views and RDF data integration on the web , 2008, WWW.

[66]  Yannis Papakonstantinou,et al.  Efficient keyword search for smallest LCAs in XML databases , 2005, SIGMOD '05.

[67]  Vassilis Christophides,et al.  RQL: a declarative query language for RDF , 2002, WWW.

[68]  Roy Goldman,et al.  DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases , 1997, VLDB.

[69]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[70]  Enrico Motta,et al.  SemSearch: A Search Engine for the Semantic Web , 2006, EKAW.

[71]  Tim Furche,et al.  Visual Languages: A Matter of Style , 2007, VLL.

[72]  Alin Deutsch,et al.  Containment and Integrity Constraints for XPath Fragments , 2001 .

[73]  Menzo Windhouwer,et al.  Querying XML documents made easy: nearest concept queries , 2001, Proceedings 17th International Conference on Data Engineering.

[74]  Guido Moerkotte,et al.  Evaluating queries with generalized path expressions , 1996, SIGMOD '96.

[75]  Tim Furche,et al.  Web and Semantic Web Query Languages: A Survey , 2005, Reasoning Web.

[76]  Sebastian Schaffert,et al.  Xcerpt: a rule-based query and transformation language for the web , 2004 .

[77]  Elke A. Rundensteiner,et al.  Rainbow: multi-XQuery optimization using materialized XML views , 2003, SIGMOD '03.

[78]  Alberto O. Mendelzon,et al.  Foundations of semantic web databases , 2004, PODS.

[79]  Stephan Kepser,et al.  A Simple Proof for the Turing-Completeness of XSLT and XQuery , 2004, Extreme Markup Languages®.

[80]  Maarten Marx,et al.  First Order Paths in Ordered Trees , 2005, ICDT.

[81]  Peter Fankhauser,et al.  XQuery by the Book: The IPSI XQuery Demonstrator , 2002, EDBT.

[82]  V. S. Subrahmanian,et al.  RDF aggregate queries and views , 2005, 21st International Conference on Data Engineering (ICDE'05).

[83]  Alin Deutsch,et al.  A Query Language for XML , 1999, Comput. Networks.

[84]  Dan Brickley,et al.  The syntactic web , 2001 .

[85]  Sihem Amer-Yahia,et al.  XML Full-Text Search: Challenges and Opportunities , 2005, VLDB.

[86]  Maarten Marx,et al.  Conditional XPath, the first order complete XPath dialect , 2004, PODS.

[87]  Feng Shao,et al.  XRANK: ranked keyword search over XML documents , 2003, SIGMOD '03.

[88]  Nicola Onose,et al.  XQuery at your web service , 2004, WWW '04.

[89]  S. Sudarshan,et al.  Keyword searching and browsing in databases using BANKS , 2002, Proceedings 18th International Conference on Data Engineering.

[90]  Jim Melton,et al.  An early look at XQuery , 2002, SGMD.

[91]  Stefano Paraboschi,et al.  Active XQuery , 2002, Proceedings 18th International Conference on Data Engineering.

[92]  Clemens Ley RDFLog: It's like Datalog for RDF , 2008 .

[93]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[94]  Edith Cohen,et al.  Reachability and distance queries via 2-hop labels , 2002, SODA '02.

[95]  Laks V. S. Lakshmanan,et al.  On the evaluation of tree pattern queries , 2006, ICSOFT.

[96]  Jim Melton,et al.  An early look at XQuery API for Java#8482; (XQJ) , 2004, SGMD.

[97]  Christoph Koch,et al.  On the complexity of nonrecursive XQuery and functional query languages on complex values , 2006, TODS.

[98]  Tim Furche,et al.  GRDDLing with Xcerpt: Learn one, get one free! , 2007 .

[99]  Jennifer Widom,et al.  The Lorel query language for semistructured data , 1997, International Journal on Digital Libraries.

[100]  Vassilis Christophides,et al.  Querying the Semantic Web with RQL , 2003, Comput. Networks.

[101]  James Clark,et al.  XSL Transformations (XSLT) Version 1.0 , 1999 .

[102]  David W. Shipman,et al.  The functional data model and the data languages DAPLEX , 1981, TODS.

[103]  Tim Furche,et al.  XPath: Looking Forward , 2002, EDBT Workshops.

[104]  Haofen Wang,et al.  Q2Semantic: A Lightweight Keyword Interface to Semantic Search , 2008, ESWC.

[105]  Michael Benedikt,et al.  Interpreting Tree-to-Tree Queries , 2006, ICALP.

[106]  Moshé M. Zloof Query-by-Example: A Data Base Language , 1977, IBM Syst. J..

[107]  Klaus U. Schulz,et al.  The BIRD Numbering Scheme for XML and Tree Databases - Deciding and Reconstructing Tree Relations Using Efficient Arithmetic Operations , 2005, XSym.

[108]  Divesh Srivastava,et al.  Index Structures for Matching XML Twigs Using Relational Query Processors , 2005, 21st International Conference on Data Engineering Workshops (ICDEW'05).

[109]  Tim Furche,et al.  Towards Data-Integration on the Semantic Web: Querying RDF with Xcerpt , 2005 .

[110]  Jacques Le Maitre,et al.  Extending xQuery with transformation operators , 2003, DocEng '03.

[111]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[112]  Pascal Francq,et al.  Index and Search XML Documents by Combining Content and Structure , 2006, International Conference on Internet Computing.

[113]  Wolfgang Nejdl,et al.  Can all tags be used for search? , 2008, CIKM '08.

[114]  Peter F. Patel-Schneider,et al.  The Yin/Yang web: XML syntax and RDF semantics , 2002, WWW '02.

[115]  Gerhard Weikum,et al.  HOPI: An Efficient Connection Index for Complex XML Document Collections , 2004, EDBT.

[116]  Tim Furche,et al.  Querying the Web Reconsidered: Design Principles for Versatile Web Query Languages , 2005, Int. J. Semantic Web Inf. Syst..

[117]  M. Tamer Özsu,et al.  A comprehensive XQuery to SQL translation using dynamic interval encoding , 2003, SIGMOD '03.

[118]  Torsten Grust,et al.  Accelerating XPath evaluation in any RDBMS , 2004, TODS.

[119]  Shinichi Morishita,et al.  Amoeba Join: Overcoming Structural Fluctuations in XML Data , 2006, WebDB.

[120]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[121]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[122]  Maarten Marx,et al.  XPath with Conditional Axis Relations , 2004, EDBT.

[123]  Li Chen,et al.  Stack-based Algorithms for Pattern Matching on DAGs , 2005, VLDB.

[124]  Arjohn Kampman,et al.  SeRQL: A Second Generation RDF Query Language , 2003 .

[125]  Elke A. Rundensteiner,et al.  Honey, I shrunk the XQuery!: an XML algebra optimization approach , 2002, WIDM '02.

[126]  Tim Furche,et al.  XcerptRDF: A Pattern-based Answer to the Versatile Web Challenge , 2008 .

[127]  Robert E. Tarjan,et al.  Three Partition Refinement Algorithms , 1987, SIAM J. Comput..

[128]  Makoto Murata,et al.  XML access control using static analysis , 2006, TSEC.

[129]  Tim Furche,et al.  An efficient single-pass query evaluator for XML data streams , 2004, SAC '04.

[130]  Donald D. Chamberlin XQuery: An XML query language , 2002, IBM Syst. J..

[131]  Vassilis Christophides,et al.  Querying RDF Descriptions for Community Web Portals , 2001, BDA.

[132]  P. Wadler Two semantics for XPath , 2000 .

[133]  François Bry,et al.  Towards Aggregated Answers for Semistructured Data , 2001, ICDT.

[134]  François Bry,et al.  Xcerpt and visXcerpt: From Pattern-Based to Visual Querying of XML and Semistructured Data , 2003, VLDB.

[135]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[136]  Axel Polleres,et al.  From SPARQL to rules (and back) , 2007, WWW '07.

[137]  Quanzhong Li,et al.  On the Effectiveness of Flexible Querying Heuristics for XML Data , 2007, XSym.

[138]  Tim Furche,et al.  Efficient evaluation of n-ary conjunctive queries over trees and graphs , 2006, WIDM '06.

[139]  Torsten Grust,et al.  eXrQuy: Order Indifference in XQuery , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[140]  Paul F. Dietz Maintaining order in a linked list , 1982, STOC '82.

[141]  Divesh Srivastava,et al.  A System for Keyword Proximity Search on XML Databases , 2003, VLDB.

[142]  Robert E. Tarjan,et al.  Fast Algorithms for Finding Nearest Common Ancestors , 1984, SIAM J. Comput..

[143]  Thomas Schwentick,et al.  Two-variable logic on data trees and XML reasoning , 2009, JACM.

[144]  Michael J. Franklin,et al.  A Fast Index for Semistructured Data , 2001, VLDB.

[145]  Tim Furche,et al.  Taming Existence in RDF Querying , 2008, RR.

[146]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[147]  Gillian Dobbie,et al.  Mining Association Rules from XML Data using XQuery , 2004, ACSW.

[148]  Menzo Windhouwer,et al.  Efficient Relational Storage and Retrieval of XML Documents , 2000, WebDB.

[149]  Jussi Myllymaki,et al.  An evaluation of binary xml encoding optimizations for fast stream based xml processing , 2004, WWW '04.

[150]  Philip Wadler,et al.  XQuery from the Experts: A Guide to the W3C XML Query Language , 2003 .

[151]  Stefanie Scherzinger,et al.  FluXQuery: An Optimizing XQuery Processor for Streaming XML Data , 2004, VLDB.

[152]  Dan Suciu,et al.  Containment and equivalence for an XPath fragment , 2002, PODS.

[153]  Rémi Gilleron,et al.  Retrieving Top Relaxed Tightest Fragments for XML Keyword Search , 2008 .

[154]  Vassilis Christophides,et al.  RQL: A Functional Query Language for RDF , 2004 .

[155]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[156]  Hewlett-Packard Round-tripping between XML and RDF , 2004 .

[157]  R. G. G. Cattell,et al.  Recent books , 2000, IEEE Spectrum.

[158]  Pierre Genevès,et al.  XPath Formal Semantics and Beyond: a Coq based approach , 2004 .

[159]  Tim Furche,et al.  XML Perspectives on RDF Querying: Towards integrated Access to Data and Metadata on the Web , 2005, Grundlagen von Datenbanken.

[160]  Klaus U. Schulz,et al.  Complete answer aggregates for treelike databases: a novel approach to combine querying and navigation , 2001, TOIS.

[161]  Philip S. Yu,et al.  Dual Labeling: Answering Graph Reachability Queries in Constant Time , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[162]  Amélie Marian,et al.  Implementing Xquery 1.0: The Galax Experience , 2003, VLDB.

[163]  Paul J. Walmsley,et al.  XML Schema Part 0: Primer Second Edition , 2004 .

[164]  Xiang Zhang,et al.  Ontology summarization based on rdf sentence graph , 2007, WWW '07.

[165]  Sujeet Pradhan,et al.  An algebraic query model for effective and efficient retrieval of XML fragments , 2006, VLDB.

[166]  Yi Chen,et al.  XSeek: A Semantic XML Search Engine Using Keywords , 2007, VLDB.

[167]  Jorge Pérez,et al.  Minimal Deductive Systems for RDF , 2007, ESWC.

[168]  Jignesh M. Patel,et al.  Structural joins: a primitive for efficient XML query pattern matching , 2002, Proceedings 18th International Conference on Data Engineering.

[169]  Vagelis Hristidis,et al.  Keyword proximity search on XML graphs , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[170]  Nancy Wiegand,et al.  Investigating XQuery for querying across database object types , 2002, SGMD.

[171]  Sebastian Schaffert,et al.  IkeWiki: A User-Friendly Semantic Wiki , 2006 .

[172]  Carlo Zaniolo,et al.  The database language GEM , 1983, SIGMOD '83.