Feature Engineering for a Symbolic Approach to Text Classification

