Cluster-based adaptive information retrieval

The paper discusses the issues involved in the design of a complete information retrieval system based on user-oriented clustering schemes. Clusters are constructed taking into account the users' perception of similarity between documents. The system accumulates feed-back from the users and employs it to construct user-oriented clusters. An optimization function to improve the effectiveness of the clustering process is developed. A retrieval process based on the clustering scheme is described. The system developed is experimentally validated and compared with existing systems.<<ETX>>

[1]  Vijay V. Raghavan,et al.  User-oriented document clustering: a framework for learning in information retrieval , 1986, SIGIR '86.

[2]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[3]  Keinosuke Fukunaga,et al.  A Branch and Bound Clustering Algorithm , 1975, IEEE Transactions on Computers.

[4]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[5]  W. Bruce Croft,et al.  Document clustering: An evaluation of some experiments with the cranfield 1400 collection , 1975, Inf. Process. Manag..

[6]  Mohamed A. Ismail,et al.  Multidimensional data clustering utilizing hybrid search strategies , 1989, Pattern Recognit..

[7]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[8]  David S. Johnson,et al.  Some Simplified NP-Complete Graph Problems , 1976, Theor. Comput. Sci..

[9]  RICHARD C. DUBES,et al.  How many clusters are best? - An experiment , 1987, Pattern Recognit..

[10]  Robert T. Dattola Experiments with a fast algorithm for automatic classification , 1971 .

[11]  Vijay V. Raghavan,et al.  Optimal Determination of User-Oriented Clusters: An Application for the Reproductive Plan , 1987, ICGA.

[12]  Clement T. Yu A clustering algorithm based on user queries , 1974, J. Am. Soc. Inf. Sci..

[13]  Gerard Salton,et al.  Generation and search of clustered files , 1978, TODS.

[14]  Vijay V. Raghavan,et al.  Optimal determination of user-oriented clusters , 1987, SIGIR '87.

[15]  Carolyn J. Crouch,et al.  A cluster-based approach to thesaurus construction , 1988, SIGIR '88.

[16]  Robert E. Jensen,et al.  A Dynamic Programming Algorithm for Cluster Analysis , 1969, Oper. Res..

[17]  Ellen M. Vdorhees The cluster hypothesis revisited , 1985, SIGIR 1985.

[18]  Cecilia R. Aragon,et al.  Optimization by Simulated Annealing: An Experimental Evaluation; Part I, Graph Partitioning , 1989, Oper. Res..

[19]  L. B. Doyle BREAKING THE COST BARRIER IN AUTOMATIC CLASSIFICATION , 1966 .

[20]  Ellen M. Vdorhees,et al.  The cluster hypothesis revisited , 1985, SIGIR '85.

[21]  K. Sparck Jones,et al.  A TEST FOR THE SEPARATION OF RELEVANT AND NON‐RELEVANT DOCUMENTS IN EXPERIMENTAL RETRIEVAL COLLECTIONS , 1973 .

[22]  Clement T. Yu Adaptive document clustering , 1985, SIGIR '85.

[23]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[24]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[25]  Anil K. Jain,et al.  Clustering techniques: The user's dilemma , 1976, Pattern Recognit..

[26]  Sartaj Sahni,et al.  Simulated Annealing and Combinatorial Optimization , 1986, DAC 1986.

[27]  R. Sibson,et al.  A model for taxonomy , 1968 .

[28]  Vijay V. Raghavan,et al.  Retrieval system evaluation using recall and precision: problems and answers , 1989, SIGIR '89.

[29]  Vijay V. Raghavan,et al.  Near-Optimal Algorithms for the Boundary Selection Problem in User-Oriented Information Retrieval , 1989, IFIP Congress.