Uma Estratégia baseada em Técnicas de KDD para apoiar o Projeto Físico em SGBDs XML Nativos

Technological advances have contributed for an expressive increase of Web information, in terms of volume and diversity. Much of this information is organized as XML documents and come from many different sources. Content management for heterogeneous XML documents does not provide efficient mechanisms for guiding the storage of these documents, in such a way that facilitates their retrieval. Therefore, this paper presents a strategy based on Knowledge Discovery and Data Mining to guide the storage of heterogeneous XML documents. A case study and a performance comparative analysis illustrate the potential of the proposed strategy.

[1]  Won Kim,et al.  Preparations for semantics-based XML mining , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[2]  Cong Yu,et al.  TIMBER: A native XML database , 2002, The VLDB Journal.

[3]  Siu-Ming Yiu,et al.  An efficient and scalable algorithm for clustering XML documents by structure , 2004, IEEE Transactions on Knowledge and Data Engineering.

[4]  William Kwok-Wai Cheung,et al.  Integrating element and term semantics for similarity-based XML document clustering , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[5]  Ronaldo Goldschmidt,et al.  Data Mining: um Guia Prático , 2005 .

[6]  Alan F. Smeaton,et al.  Automatic Phrase Recognition and Extraction from Text , 1997, BCS-IRSG Annual Colloquium on IR Research.

[7]  Danny Brian,et al.  The Definitive Guide to Berkeley DB XML (Definitive Guide) , 2006 .

[8]  Peter Dale,et al.  Guidelines for the Establishment and Development of Monolingual Thesauri. Second Revised Edition. , 1981 .

[9]  Ioana Manolescu,et al.  A Test Platform for the INEX Heterogeneous Track , 2004, INEX.

[10]  H. Schoning Tamino - a DBMS designed for XML , 2001, Proceedings 17th International Conference on Data Engineering.

[11]  Elio Masciari,et al.  Fast detection of XML structural similarity , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  H. V. Jagadish,et al.  Evaluating Structural Similarity in XML Documents , 2002, WebDB.

[13]  Jérôme Euzenat,et al.  A Survey of Schema-Based Matching Approaches , 2005, J. Data Semant..

[14]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[15]  Cong Yu,et al.  Schema-Free XQuery , 2004, VLDB.