Extracting key-substring-group features for text classification
暂无分享,去创建一个
[1] Yoram Singer,et al. BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.
[2] Peter Jackson,et al. Natural language processing for online applications : text retrieval, extraction and categorization , 2002 .
[3] Ah-Hwee Tan,et al. On Machine Learning Methods for Chinese Document Categorization , 2003, Applied Intelligence.
[4] Dan Gusfield,et al. Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .
[5] Eugene L. Lawler,et al. Sublinear approximate string matching and biological applications , 1994, Algorithmica.
[6] Alexander J. Smola,et al. Fast Kernels for String and Tree Matching , 2002, NIPS.
[7] Donald Ervin Knuth,et al. The Art of Computer Programming, 2nd Ed. (Addison-Wesley Series in Computer Science and Information , 1978 .
[8] Akiko Aizawa. Linguistic Techniques to Improve the Performance of Automatic Text Categorization , 2001, NLPRS.
[9] A. Bratko,et al. Spam Filtering Using Compression Models , 2005 .
[10] Maxime Crochemore,et al. Algorithms on strings , 2007 .
[11] Chia-Hui Chang,et al. IEPAD: information extraction based on pattern discovery , 2001, WWW '01.
[12] Kenneth Ward Church,et al. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus , 2001, Computational Linguistics.
[13] Naftali Tishby,et al. Discriminative Feature Selection via Multiclass Variable Memory Markov Model , 2002, EURASIP J. Adv. Signal Process..
[14] Ralf Herbrich,et al. Learning Kernel Classifiers: Theory and Algorithms , 2001 .
[15] Oren Etzioni,et al. Grouper: A Dynamic Clustering Interface to Web Search Results , 1999, Comput. Networks.
[16] Eleazar Eskin,et al. The Spectrum Kernel: A String Kernel for SVM Protein Classification , 2001, Pacific Symposium on Biocomputing.
[17] Ah-Hwee Tan,et al. A Comparative Study on Chinese Text Categorization Methods , 2000, PRICAI Workshop on Text and Web Mining.
[18] Efstathios Stamatatos,et al. Automatic Text Categorization In Terms Of Genre and Author , 2000, CL.
[19] Sung-Hyon Myaeng,et al. Text genre classification with genre-revealing and subject-revealing features , 2002, SIGIR '02.
[20] Esko Ukkonen,et al. On-line construction of suffix trees , 1995, Algorithmica.
[21] Ning Wu,et al. On Compression-Based Text Classification , 2005, ECIR.
[22] David R. Karger,et al. Scatter/Gather: a cluster-based approach to browsing large document collections , 1992, SIGIR '92.
[23] Lee-Feng Chien,et al. Automatic acquisition of phrasal knowledge for English-Chinese bilingual information retrieval , 1998, SIGIR '98.
[24] Jean-Michel Renders,et al. Word-Sequence Kernels , 2003, J. Mach. Learn. Res..
[25] Robert E. Schapire,et al. The Boosting Approach to Machine Learning An Overview , 2003 .
[26] Jiong Yang,et al. CLUSEQ: efficient and effective sequence clustering , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).
[27] Joshua Goodman,et al. A bit of progress in language modeling , 2001, Comput. Speech Lang..
[28] Hinrich Schütze,et al. Automatic Detection of Text Genre , 1997, ACL.
[29] Nello Cristianini,et al. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .
[30] Kenneth Ward Church,et al. Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.
[31] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[32] Thorsten Joachims,et al. Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.
[33] Nello Cristianini,et al. Classification using String Kernels , 2000 .
[34] Thorsten Joachims,et al. A statistical learning learning model of text classification for support vector machines , 2001, SIGIR '01.
[35] Ian H. Witten. Applications of Lossless Compression in Adaptive Text Mining , 2000 .
[36] D. Holmes,et al. The Federalist Revisited: New Directions in Authorship Attribution , 1995 .
[37] Xi Chen,et al. Text classification with kernels on the multinomial manifold , 2005, SIGIR '05.
[38] Fabrizio Sebastiani,et al. Machine learning in automated text categorization , 2001, CSUR.
[39] Yiming Yang,et al. A re-examination of text categorization methods , 1999, SIGIR '99.
[40] Evgeniy Gabrilovich,et al. Text categorization with many redundant features: using aggressive feature selection to make SVMs competitive with C4.5 , 2004, ICML.
[41] Ian H. Witten,et al. Text categorization using compression models , 2000, Proceedings DCC 2000. Data Compression Conference.
[42] Hinrich Schütze,et al. Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.
[43] Alexander J. Smola,et al. Learning with kernels , 1998 .
[44] Dustin Boswell,et al. Introduction to Support Vector Machines , 2002 .
[45] Lee-Feng Chien,et al. PAT-tree-based keyword extraction for Chinese information retrieval , 1997, SIGIR '97.
[46] David D. Lewis,et al. Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.
[47] John D. Lafferty,et al. Diffusion Kernels on Statistical Manifolds , 2005, J. Mach. Learn. Res..
[48] Yiming Yang,et al. A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.
[49] Ian H. Witten,et al. Data Compression Using Adaptive Coding and Partial String Matching , 1984, IEEE Trans. Commun..
[50] R. Rosenfeld,et al. Two decades of statistical language modeling: where do we go from here? , 2000, Proceedings of the IEEE.
[51] CHENGXIANG ZHAI,et al. A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.
[52] Susan T. Dumais,et al. Inductive learning algorithms and representations for text categorization , 1998, CIKM '98.
[53] John G. Proakis,et al. Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..
[54] Alexander Dekhtyar,et al. Information Retrieval , 2018, Lecture Notes in Computer Science.
[55] James H. Martin,et al. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .
[56] Dell Zhang,et al. Semantic, Hierarchical, Online Clustering of Web Search Results , 2004, APWeb.
[57] Thorsten Joachims,et al. Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.
[58] Nello Cristianini,et al. Kernel Methods for Pattern Analysis , 2003, ICTAI.
[59] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.
[60] Ian H. Witten,et al. Text mining: a new frontier for lossless compression , 1999, Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096).
[61] Thorsten Joachims,et al. A Statistical Learning Model of Text Classification for Support Vector Machines. , 2001, SIGIR 2002.
[62] David J. Harper,et al. Using compression based language models for text categorization. , 2003 .
[63] F ChenStanley,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.
[64] Dana Ron,et al. The power of amnesia: Learning probabilistic automata with variable memory length , 1996, Machine Learning.
[65] Mark Levene,et al. A Suffix Tree Approach to Email Filtering , 2005, ArXiv.
[66] David Thomas,et al. The Art in Computer Programming , 2001 .
[67] John G. Cleary,et al. Unbounded length contexts for PPM , 1995, Proceedings DCC '95 Data Compression Conference.
[68] Dale Schuurmans,et al. Augmenting Naive Bayes Classifiers with Statistical Language Models , 2004, Information Retrieval.
[69] Tony Jebara,et al. Probability Product Kernels , 2004, J. Mach. Learn. Res..
[70] Michael A. Bender,et al. The LCA Problem Revisited , 2000, LATIN.
[71] George Kingsley Zipf,et al. Human behavior and the principle of least effort , 1949 .