论文信息 - QCS: A Tool for Querying, Clustering, and Summarizing Documents

QCS: A Tool for Querying, Clustering, and Summarizing Documents

The QCS information retrieval (IR) system is presented as a tool for querying, clustering, and summarizing document sets. QCS has been developed as a modular development framework, and thus facilitates the inclusion of new technologies targeting these three IR tasks. Details of the system architecture, the QCS interface, and preliminary results are presented.

Dianne P. O'Leary | John M. Conroy | Daniel M. Dunlavy

[1] L. Baum,et al. A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[2] John M. Conroy,et al. Performance of a Three-Stage System for Multi-Document Summarization , 2003 .

[3] Hamparsum Bozdogan,et al. Statistical Data Mining and Knowledge Discovery , 2004 .

[4] W. Nelson Francis,et al. FREQUENCY ANALYSIS OF ENGLISH USAGE: LEXICON AND GRAMMAR , 1983 .

[5] Inderjit S. Dhillon,et al. Concept Decompositions for Large Sparse Text Data Using Clustering , 2004, Machine Learning.

[6] Andrei Mikheev,et al. Tagging Sentence Boundaries , 2000, ANLP.

[7] Dianne P. O'Leary,et al. Text Summarization via Hidden Markov Models and Pivoted QR Matrix Decomposition , 2001 .

[8] Tamara G. Kolda,et al. A semidiscrete matrix decomposition for latent semantic indexing information retrieval , 1998, TOIS.

[9] Susan T. Dumais,et al. Improving the retrieval of information from external sources , 1991 .

[10] Inderjit S. Dhillon,et al. Iterative clustering of high dimensional text data augmented by local search , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[11] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.