Improving Methods for Single-label Text Categorization

[1]  Gordon V. Cormack,et al.  Validity and power of t-test for comparing MAP and GMAP , 2007, SIGIR.

[2]  Arlindo L. Oliveira,et al.  Semi-supervised single-label text categorization using centroid-based classifiers , 2007, SAC '07.

[3]  Yaxin Bi,et al.  COMBINING MULTIPLE CLASSIFIERS USING DEMPSTER'S RULE FOR TEXT CATEGORIZATION , 2007, Appl. Artif. Intell..

[4]  Pável Calado,et al.  IR-BASE: An integrated framework for the research and teaching of information retrieval technologies , 2007 .

[5]  S. Sathiya Keerthi,et al.  Large scale semi-supervised linear SVMs , 2006, SIGIR.

[6]  Arlindo L. Oliveira,et al.  Empirical Evaluation of Centroid-based Models for Single-label Text Categorization , 2006 .

[7]  Moustafa Ghanem,et al.  A novel refinement approach for text categorization , 2005, CIKM '05.

[8]  Yaxin Bi,et al.  On combining classifier mass functions for text categorization , 2005, IEEE Transactions on Knowledge and Data Engineering.

[9]  Joydeep Ghosh,et al.  Model-based overlapping clustering , 2005, KDD '05.

[10]  Moustafa Ghanem,et al.  Using dragpushing to refine centroid text classifiers , 2005, SIGIR '05.

[11]  Mark Sanderson,et al.  Information retrieval system evaluation: effort, sensitivity, and reliability , 2005, SIGIR '05.

[12]  Pushpak Bhattacharyya,et al.  A model for handling approximate, noisy or incomplete labeling in text classification , 2005, ICML.

[13]  Yiming Yang,et al.  An experimental study on large-scale web categorization , 2005, WWW '05.

[14]  Fabrizio Sebastiani,et al.  An Analysis of the Relative Hardness of Reuters-21578 Subsets , 2003 .

[15]  Susan T. Dumais,et al.  The Combination of Text Classifiers Using Reliability Indicators , 2016, Information Retrieval.

[16]  Chirag Shah,et al.  Evaluating high accuracy retrieval techniques , 2004, SIGIR '04.

[17]  Raymond J. Mooney,et al.  Integrating constraints and metric learning in semi-supervised clustering , 2004, ICML.

[18]  Roberto Basili,et al.  Complex Linguistic Features for Text Classification: A Comprehensive Study , 2004, ECIR.

[19]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[20]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[21]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[22]  Verayuth Lertnattee,et al.  Effect of term distributions on centroid-based text categorization , 2004, Inf. Sci..

[23]  Arlindo L. Oliveira,et al.  An Empirical Comparison of Text Categorization Methods , 2003, SPIRE.

[24]  C. Ling,et al.  AUC: a Statistically Consistent and more Discriminating Measure than Accuracy , 2003, IJCAI.

[25]  Yiming Yang,et al.  A scalability analysis of classifiers in text categorization , 2003, SIGIR.

[26]  Cornelis H. A. Koster,et al.  Taming Wild Phrases , 2003, ECIR.

[27]  Fabrizio Sebastiani,et al.  Supervised term weighting for automated text categorization , 2003, SAC '03.

[28]  Susan T. Dumais,et al.  Probabilistic combination of text classifiers using reliability indicators: models and results , 2002, SIGIR '02.

[29]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[30]  Peter Jackson,et al.  Combining multiple classifiers for text categorization , 2001, CIKM '01.

[31]  Wai Lam,et al.  A meta-learning approach for text categorization , 2001, SIGIR '01.

[32]  Vipin Kumar,et al.  Text Categorization Using Weight Adjusted k-Nearest Neighbor Classification , 2001, PAKDD.

[33]  Stan Matwin,et al.  A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization , 2001 .

[34]  O. Mangasarian,et al.  Semi-Supervised Support Vector Machines for Unlabeled Data Classification , 2001 .

[35]  John Caron,et al.  Experiments with LSA scoring: optimal rank and basis , 2001 .

[36]  Kenneth Ward Church,et al.  Using Bins to Empirically Estimate Term Weights for Text Categorization , 2001, EMNLP.

[37]  George Karypis,et al.  Centroid-Based Document Classification: Analysis and Experimental Results , 2000, PKDD.

[38]  Jihoon Yang,et al.  A Fast Algorithm for Hierarchical Text Classification , 2000, DaWaK.

[39]  Vibhu O. Mittal,et al.  Bridging the lexical chasm: statistical approaches to answer-finding , 2000, SIGIR '00.

[40]  George Karypis,et al.  Weight Adjustment Schemes for a Centroid Based Classifier , 2000 .

[41]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[42]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[43]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[44]  David E. Johnson,et al.  Maximizing Text-Mining Performance , 1999 .

[45]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[46]  William W. Cohen,et al.  Context-sensitive learning methods for text categorization , 1999, TOIS.

[47]  Andrew McCallum,et al.  Using Maximum Entropy for Text Classification , 1999 .

[48]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Report , 1999, TREC.

[49]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[50]  J. C. BurgesChristopher A Tutorial on Support Vector Machines for Pattern Recognition , 1998 .

[51]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[52]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[53]  Peter Willett,et al.  Readings in information retrieval , 1997 .

[54]  Daphne Koller,et al.  Hierarchically Classifying Documents Using Very Few Words , 1997, ICML.

[55]  Thorsten Joachims,et al.  A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization , 1997, ICML.

[56]  David J. Miller,et al.  A Mixture of Experts Classifier with Learning Based on Both Labelled and Unlabelled Data , 1996, NIPS.

[57]  W. Bruce Croft,et al.  Combining classifiers in text categorization , 1996, SIGIR '96.

[58]  Hinrich Schütze,et al.  Method combination for document filtering , 1996, SIGIR '96.

[59]  Karen Spärck Jones,et al.  Natural language processing for information retrieval , 1996, CACM.

[60]  Hinrich Schütze,et al.  A comparison of classifiers and document representations for the routing problem , 1995, SIGIR '95.

[61]  David A. Landgrebe,et al.  The effect of unlabeled samples in reducing the small sample size problem and mitigating the Hughes phenomenon , 1994, IEEE Trans. Geosci. Remote. Sens..

[62]  Yiming Yang,et al.  Expert network: effective and efficient learning from human decisions in text categorization and retrieval , 1994, SIGIR '94.

[63]  David A. Hull Improving text retrieval for the routing problem using latent semantic indexing , 1994, SIGIR '94.

[64]  David A. Hull Using statistical testing in the evaluation of retrieval experiments , 1993, SIGIR.

[65]  David D. Lewis Text representation for intelligent text retrieval: a classification-oriented view , 1992 .

[66]  David L. Waltz,et al.  Classifying news stories using memory based reasoning , 1992, SIGIR '92.

[67]  David D. Lewis,et al.  An evaluation of phrasal and clustered representations on a text categorization task , 1992, SIGIR '92.

[68]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[69]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[70]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[71]  Peter Willett,et al.  Document Retrieval Systems , 1988 .

[72]  Stephen Robertson,et al.  Probabilistic Automatic Indexing by Learning from Human indexers , 1984, J. Documentation.

[73]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[74]  Gerard Salton,et al.  The SMART Retrieval System , 1971 .

[75]  Michael E. Lesk,et al.  Computer Evaluation of Indexing and Text Processing , 1968, JACM.

[76]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .