Document Representation and Quality of Text: An Analysis

[1]  Farhad Oroumchian,et al.  N-gram and Local Context Analysis for Persian text retrieval , 2007, 2007 9th International Symposium on Signal Processing and Its Applications.

[2]  Mostafa Keikha,et al.  Using Rich Document Representation in XML Information Retrieval , 2006, INEX.

[3]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[4]  Farhad Oroumchian,et al.  Rich Document Representation for Document Clustering , 2004, RIAO.

[5]  George Karypis,et al.  Centroid-Based Document Classification: Analysis and Experimental Results , 2000, PKDD.

[6]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[7]  Robert N. Oddy,et al.  An application of plausible reasoning to information retrieval , 1996, SIGIR '96.

[8]  Claudia Pearce,et al.  TELLTALE: Experiments in a Dynamic Hypertext Environment for Degraded and Multilingual Data , 1996, J. Am. Soc. Inf. Sci..

[9]  Susan T. Dumais,et al.  Using Linear Algebra for Intelligent Information Retrieval , 1995, SIAM Rev..

[10]  Fabio Crestani,et al.  Probability kinematics in information retrieval , 1995, SIGIR '95.

[11]  M Damashek,et al.  Gauging Similarity with n-Grams: Language-Independent Categorization of Text , 1995, Science.

[12]  Joon Ho Lee,et al.  Properties of extended Boolean models in information retrieval , 1994, SIGIR '94.

[13]  Elizabeth D. Liddy,et al.  Text categorization for multiple users based on semantic features from a machine-readable dictionary , 1994, TOIS.

[14]  Ryszard S. Michalski,et al.  The Logic of Plausible Reasoning: A Core Theory , 1989, Cogn. Sci..

[15]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[16]  Paul R. Cohen,et al.  The evolution and performance of the GRANT System , 1987, IEEE Expert.

[17]  Elena M. Zamora,et al.  The use of trigram analysis for spelling error detection , 1981, Inf. Process. Manag..

[18]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[19]  Ching Y. Suen,et al.  n-Gram Statistics for Natural Language Understanding and Text Processing , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.