Affinity-based probabilistic reasoning and document clustering on the WWW

The World Wide Web (WWW) has become one of the fastest growing applications on the Internet today. More and more information sources have linked online through WWW, but finding information on the WWW is also a great challenge. For most users, the information retrieved is not well organized and the access time is considered high on the WWW currently. Therefore, there is a need to develop a good mechanism to organize and manage the tremendous size and various kinds of information to facilitate the functionality of a search engine for information retrieval on the WWW. In response to such a demand we propose a Markov Model Mediator (MMM) mechanism which employs affinity based data mining techniques to organize and manage the information sources so that the most relevant documents are clustered together to achieve higher recall and precision values for information retrieval on the WWW.

[1]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[2]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[3]  Véronique Benzaken,et al.  Enhancing Performance in a Persistent Object Store: Clustering Strategies in O2 , 1990, POS.

[4]  Richard T. Snodgrass,et al.  Semantic Clustering , 1990, POS.

[5]  Kien A. Hua,et al.  A decomposition-based simulated annealing technique for data clustering , 1994, PODS '94.

[6]  Rangasami L. Kashyap,et al.  A Spatio-Temporal Semantic Model for Multimedia Database Systems and Multimedia Information Systems , 2001, IEEE Trans. Knowl. Data Eng..

[7]  J. Banerjee,et al.  Clustering a DAG for CAD Databases , 1988, IEEE Trans. Software Eng..

[8]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[9]  David J. DeWitt,et al.  Object and File Management in the EXODUS Extensible Database System , 1986, VLDB.

[10]  Rangasami L. Kashyap,et al.  Information Retrieval Using Markov Model Mediators In Multimedia Database Systems , 1998 .

[11]  Jeffrey F. Naughton,et al.  On the performance of object clustering techniques , 1992, SIGMOD '92.

[12]  Hongjun Lu,et al.  NeuroRule: A Connectionist Approach to Data Mining , 1995, VLDB.

[13]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[14]  Hans-Peter Kriegel,et al.  Knowledge Discovery in Large Spatial Databases: Focusing Techniques for Efficient Class Identification , 1995, SSD.

[15]  Peter C. Cheeseman,et al.  Bayesian Classification (AutoClass): Theory and Results , 1996, Advances in Knowledge Discovery and Data Mining.