A fuzzy extension of the XPath query language

Today the current state of the art in querying XML data is represented by XPath and XQuery, both of which rely on Boolean conditions for node selection. Boolean selection is too restrictive when users do not use or even know the data structure precisely, e.g. when queries are written based on a summary rather than on a schema. In this paper we describe a XML querying framework, called FuzzyXPath, based on Fuzzy Set Theory, which relies on fuzzy conditions for the definition of flexible constraints on stored data. A function called “deep-similar” is introduced to replace XPath’s typical “deep-equal” function. The main goal is to provide a degree of similarity between two XML trees, assessing whether they are similar both structure-wise and content-wise. Several query examples are discussed in the field of XML based metadata for e-learning.

[1]  Ernesto Damiani,et al.  FXPath: Flexible Querying of XML Documents , 2002 .

[2]  Letizia Tanca,et al.  Fuzzy Techniques for XML Data Smushing , 2001, Fuzzy Days.

[3]  Steven J. DeRose,et al.  XML Path Language (XPath) , 1999 .

[4]  Letizia Tanca,et al.  An Approximate Querying Environment for XML Data , 2004 .

[5]  Paolo Ciaccia,et al.  The Collection Index to Support Complex Approximate Queries on XML Documents , 2003, Xsym.

[6]  Letizia Tanca,et al.  Fuzzy XML queries via context-based choice of aggregations , 2000, Kybernetika.

[7]  Janusz Kacprzyk,et al.  Database Queries with Fuzzy Linguistic Quantifiers , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[8]  H. V. Jagadish,et al.  Evaluating Structural Similarity in XML Documents , 2002, WebDB.

[9]  Henri Prade,et al.  Flexible querying of semistructured data: A fuzzy‐set‐based approach , 2007, Int. J. Intell. Syst..

[10]  Zongmin Ma,et al.  Extending object-oriented databases for fuzzy information modeling , 2004, Inf. Syst..

[11]  Ernesto Damiani,et al.  Fuzzy service selection in a distributed object-oriented environment , 2001, IEEE Trans. Fuzzy Syst..

[12]  Mohand Boughanem,et al.  Qualitative pattern matching with linguistic terms , 2004, AI Commun..

[13]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[14]  Patrick Bosc,et al.  Soft Querying, a New Feature for Database Management Systems , 1994, DEXA.

[15]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[16]  Henri Prade,et al.  Flexible querying of semistructured data: A fuzzy-set-based approach: Research Articles , 2007 .

[17]  Torsten Schlieder Schema-Driven Evaluation of Approximate Tree-Pattern Queries , 2002, EDBT.

[18]  Cong Yu,et al.  Schema summarization , 2006, VLDB.

[19]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[20]  Amit Kumar,et al.  Correlating XML data streams using tree-edit distance embeddings , 2003, PODS '03.

[21]  Laks V. S. Lakshmanan,et al.  FleXPath: flexible structure and full-text querying for XML , 2004, SIGMOD '04.

[22]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[23]  Henri Prade,et al.  Fuzzy Logic Techniques in Multimedia Database Querying: A Preliminary Investigation of the Potentials , 2001, IEEE Trans. Knowl. Data Eng..

[24]  Dan Simon,et al.  Sum Normal Optimization of Fuzzy Membership Functions , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[25]  Patrick Bosc,et al.  On Representation-Based Querying of Databases Containing Ill-known Values , 1997, ISMIS.

[26]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[27]  Paolo Tiberio,et al.  Approximate Query Answering for a Heterogeneous XML Document Base , 2004, WISE.

[28]  Slawomir Zadrozny,et al.  Internet as a Challenge to Fuzzy Querying , 2003, Intelligent Exploration of the Web.

[29]  Raymond T. Ng,et al.  Very large data bases , 1994 .

[30]  Letizia Tanca,et al.  Blind Queries to XML Data , 2000, DEXA.

[31]  Patrice Buche,et al.  Approximate Querying of XML Fuzzy Data , 2006, FQAS.

[32]  Patrick Bosc,et al.  Quantified statements in a flexible relational query language , 1995, SAC '95.

[33]  Olga Pons,et al.  A Server for Fuzzy SQL Queries , 1998, FQAS.

[34]  Bernd Reusch Proceedings of the International Conference, 7th Fuzzy Days on Computational Intelligence, Theory and Applications , 2001 .

[35]  Sihem Amer-Yahia,et al.  Tree Pattern Relaxation , 2002, EDBT.

[36]  Alberto H. F. Laender,et al.  Automatic web news extraction using tree edit distance , 2004, WWW '04.

[37]  Paul Van Dooren,et al.  A MEASURE OF SIMILARITY BETWEEN GRAPH VERTICES . WITH APPLICATIONS TO SYNONYM EXTRACTION AND WEB SEARCHING , 2002 .

[38]  Ernesto Damiani,et al.  Bottom-Up Extraction and Trust-Based Refinement of Ontology Metadata , 2007, IEEE Transactions on Knowledge and Data Engineering.

[39]  L. Zadeh,et al.  Fuzzy Logic for the Management of Uncertainty , 1992 .

[41]  Hua-Gang Li,et al.  FLUX: fuzzy content and structure matching of XML range queries , 2006, WWW '06.

[42]  Patrick Bosc,et al.  Fuzzy querying in conventional databases , 1992 .