A structured documents retrieval method supporting attribute-based structure information

Retrieval methods for structured documents have been studied extensively, but most of this work targets documents whose structure information is expressed by elements. When elements are used to describe a document structure, the structure becomes static and difficult to extend, so many standards describe document structure with attributes instead. Most existing systems, however, mainly support element-based structured documents and handle attribute-based ones poorly. We therefore propose a new indexing method that supports attribute-based structured documents. In our index scheme, element-based and attribute-based structure information are seamlessly integrated to describe a general document structure. We also consider the search methods that are possible under the proposed index and implement each of them. Finally, we run experiments on each method using a document actually used in business and analyze the results.
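
To make the distinction concrete, the following is a minimal sketch of how element-based and attribute-based structure information could be folded into one index. It is an illustration under stated assumptions, not the index scheme proposed in the paper: the sample markup, the use of a `type` attribute as the structural label, and the helper names `structure_name`, `build_index`, and `search` are all hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical samples: the same logical structure written two ways.
ELEMENT_BASED = "<book><chapter><section>XML indexing</section></chapter></book>"
ATTRIBUTE_BASED = (
    '<div type="book"><div type="chapter">'
    '<div type="section">XML indexing</div></div></div>'
)


def structure_name(elem, attr="type"):
    """Return the structural label: the attribute value if present, else the tag."""
    return elem.get(attr, elem.tag)


def build_index(xml_text, attr="type"):
    """Map each lowercased term to the set of structure paths it occurs under."""
    index = defaultdict(set)

    def walk(elem, path):
        path = path + [structure_name(elem, attr)]
        # Only the text directly at the start of an element is indexed here;
        # tail text after child elements is ignored to keep the sketch short.
        for term in (elem.text or "").lower().split():
            index[term].add("/".join(path))
        for child in elem:
            walk(child, path)

    walk(ET.fromstring(xml_text), [])
    return dict(index)


def search(index, term, path_suffix):
    """Return the structure paths containing `term` that end with `path_suffix`."""
    return sorted(p for p in index.get(term.lower(), ()) if p.endswith(path_suffix))
```

Because the structural label is resolved in one place, `build_index(ELEMENT_BASED)` and `build_index(ATTRIBUTE_BASED)` yield the same index in this sketch, and a query such as `search(index, "XML", "chapter/section")` works identically on either form.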
