XML summarization: A survey

eXtensible Markup Language (XML) is one of the standard data representation nowadays. It can be used in various applications as its flexibility and easy to use so the need to summarize XML document become increasingly an important topic to save time and cost. For these reasons, there are more interest for developing tools for summarizing XML Documents. This paper surveys different approaches for summarizing XML documents regarding to both its structure and content.

[1]  Roy Goldman,et al.  DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases , 1997, VLDB.

[2]  Jeffrey F. Naughton,et al.  Estimating the Selectivity of XML Path Expressions for Internet Scale Applications , 2001, VLDB.

[3]  Maya Ramanath,et al.  A rank-rewrite framework for summarizing XML documents , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[4]  Hongjun Lu,et al.  Bloom Histogram: Path Selectivity Estimation for XML Data with Updates , 2004, VLDB.

[5]  José de Aguiar Moraes Filho Summarizing XML documents: contributions, empirical studies, and challenges , 2009 .

[6]  Vassilis J. Tsotras,et al.  XML Structural Summaries , 2008, Proc. VLDB Endow..

[7]  Mounia Lalmas,et al.  Learning to summarise XML documents using content and structure , 2005, CIKM '05.

[8]  Gudrun Fischer,et al.  A Template-Based Approach to Summarize XML Collections , 2005, LWA.

[9]  Tomislava Lauc,et al.  CROXMLSUM – the System for XML Document Summarization in Croatian , 2007 .

[10]  M. Tamer Özsu,et al.  XSEED: Accurate and Fast Cardinality Estimation for XPath Queries , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[11]  Ehud Gudes,et al.  Exploiting local similarity for indexing paths in graph-structured data , 2002, Proceedings 18th International Conference on Data Engineering.

[12]  Jakub Marciniak XML Schema and Data Summarization , 2010, ICAISC.

[13]  Ping Yan,et al.  A framework of summarizing XML documents with schemas , 2013, Int. Arab J. Inf. Technol..