A Multi-Agent Framework for Storage and Retrieval of Documents from Distributed XML Collections

A distributed and dynamic multi-agent system for integrated performing of gathering, processing, and retrieval of highly heterogeneous XML documents' collections is presented. Collections of XML documents are assumed to be distributed among nodes of a network. The management process consists of a few steps: documents in each node are clustered, clusters' representatives are computed and stored in metadata repository, queries are confronted with metadata repository, distributed retrieval is carried out over relevant clusters, retrieval results are integrated and final answer is presented to an end user. Major functions of such strategy were designed and implemented on JADE platform. Actual implementations of the proposed multi-agent system can vary, depending on applied models of XML documents' clustering, indexing and retrieval.

[1]  Evaggelia Pitoura,et al.  Peer-to-peer management of XML data: issues and research challenges , 2005, SGMD.

[2]  Jung Soon Ro An evaluation of the applicability of ranking algorithms to improve the effectiveness of full‐text retrieval. II. On the effectiveness of ranking algorithms on full‐text retrieval , 1988 .

[3]  Kaizhong Zhang,et al.  Simple Fast Algorithms for the Editing Distance Between Trees and Related Problems , 1989, SIAM J. Comput..

[4]  Philip N. Klein,et al.  Computing the Edit-Distance between Unrooted Ordered Trees , 1998, ESA.

[5]  Kaizhong Zhang,et al.  Approximate Tree Matching in the Presence of Variable Length Don't Cares , 1994, J. Algorithms.

[6]  Jung Soon Ro An evaluation of the applicability of ranking algorithms to improve the effectiveness of full‐text retrieval. I. On the effectiveness of full‐text retrieval , 1988 .

[7]  Riccardo Ortale,et al.  Distance-based Clustering of XML Documents , 2003 .

[8]  H. V. Jagadish,et al.  Evaluating Structural Similarity in XML Documents , 2002, WebDB.

[9]  Hélène Touzet,et al.  Tree edit distance with gaps , 2003, Inf. Process. Lett..

[10]  Sara Stoecklin,et al.  An XML Distance Measure , 2005, DMIN.

[11]  Yangyong Zhu,et al.  Similarity Metric for XML Documents , 2003 .

[12]  Kaizhong Zhang,et al.  Algorithms for the constrained editing distance between ordered labeled trees and related problems , 1995, Pattern Recognit..

[13]  Norman Winterbottom,et al.  Performing joins without decompression in a compressed database system , 2003, SGMD.

[14]  Lusheng Wang,et al.  Alignment of trees: an alternative to tree edit , 1995 .

[15]  G. Italiano,et al.  Algorit[h]ms - ESA '98 : 6th Annual European Symposium, Venice, Italy, August 24-26, 1998 : proceedings , 1998 .