CSC: Supporting Queries on Compressed Cached XML

Whenever a client frequently has to retrieve, to query and to locally transform large parts of a huge XML document that is stored on a remote web information server, data exchange from the server to the client may become a serious bottleneck that simply limits scaling of the amount of information that can be processed locally on the client by a client-based application. We present Compressed Structure Caching (CSC) as a solution that reduces the amount of data exchange by a combination of the following techniques: compression of the XML document's structure, client-side caching of the structure and of already received XML content, inference and optimized loading of the content needed on the client to answer a given query. We provide a performance evaluation that demonstrates that our approach significantly reduces the amount of data exchange from server to client.

[1]  Stefan Böttcher,et al.  XML index compression by DTD subtraction , 2007, ICEIS.

[2]  Wilfred Ng,et al.  XQzip: Querying Compressed XML Using Structural Indexing , 2004, EDBT.

[3]  Dan Suciu,et al.  XMill: an efficient compressor for XML data , 2000, SIGMOD '00.

[4]  Stefanie Scherzinger,et al.  XML Prefiltering as a String Matching Problem , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[5]  Stefan Böttcher,et al.  BSBC: Towards a Succinct Data Format for XML Streams , 2008, WEBIST.

[6]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[7]  Stefan Böttcher,et al.  Caching XML Data on Mobile Web Clients , 2004, International Conference on Internet Computing.

[8]  Serge Abiteboul,et al.  Representing and querying XML with incomplete information , 2001, PODS '01.

[9]  Hamid Pirahesh,et al.  A Framework for Using Materialized XPath Views in XML Query Processing , 2004, VLDB.

[10]  Patrick E. O'Neil,et al.  ORDPATHs: insert-friendly XML node labels , 2004, SIGMOD '04.

[11]  Michael Benedikt,et al.  XPath satisfiability in the presence of DTDs , 2008, JACM.

[12]  Peter Buneman,et al.  Edinburgh Research Explorer Path Queries on Compressed XML , 2022 .

[13]  Stefan Böttcher,et al.  Evaluating XPath Queries on XML Data Streams , 2007, BNCOD.

[14]  Jae-Woo Chang,et al.  Advances in Web-Age Information Management , 2001, Lecture Notes in Computer Science.

[15]  Mong-Li Lee,et al.  Efficient Mining of XML Query Patterns for Caching , 2003, VLDB.

[16]  Tim Furche,et al.  XPath: Looking Forward , 2002, EDBT Workshops.

[17]  Xmldm,et al.  XML-Based Data Management and Multimedia Engineering — EDBT 2002 Workshops , 2002, Lecture Notes in Computer Science.

[18]  Sebastian Maneth,et al.  Efficient Memory Representation of XML Documents , 2005, DBPL.

[19]  Dan Suciu,et al.  Query Caching and View Selection for XML Databases , 2005, VLDB.

[20]  Z. Meral Özsoyoglu,et al.  Rewriting XPath Queries Using Materialized Views , 2005, VLDB.

[21]  James Cheney Compressing XML with multiplexed hierarchical PPM models , 2001, Proceedings DCC 2001. Data Compression Conference.

[22]  Mark Levene,et al.  XCQ: A queriable XML compression system , 2006, Knowledge and Information Systems.

[23]  Jan Hidders Satisfiability of XPath Expressions , 2003, DBPL.

[24]  M. Schkolnick,et al.  9th International Conference on Very Large Data Bases , 1983, Very Large Data Bases Conference.

[25]  Amélie Marian,et al.  Projecting XML Documents , 2003, VLDB.

[26]  Stefan Böttcher Cache Consistency in Mobile XML Databases , 2006, WAIM.

[27]  R. Watson,et al.  Data Management , 1980, Bone Marrow Transplantation.

[28]  Stefan Böttcher,et al.  Data Management for Mobile Ajax Web 2.0 Applications , 2007, DEXA.

[29]  M. Tamer Özsu,et al.  A succinct physical storage scheme for efficient evaluation of path queries in XML , 2004, Proceedings. 20th International Conference on Data Engineering.