Learning to classify short and sparse text & web with hidden topics from large-scale data collections
暂无分享,去创建一个
[1] Ludovic Denoyer,et al. The XML Wikipedia Corpus , 2006 .
[2] Wei-Ying Ma,et al. Learning to cluster web search results , 2004, SIGIR '04.
[3] Thomas Hofmann,et al. Latent semantic models for collaborative filtering , 2004, TOIS.
[4] Gregor Heinrich. Parameter estimation for text analysis , 2009 .
[5] Fabrizio Sebastiani,et al. Machine learning in automated text categorization , 2001, CSUR.
[6] Danushka Bollegala,et al. Measuring semantic similarity between words using web search engines , 2007, WWW '07.
[7] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.
[8] T. Landauer,et al. Indexing by Latent Semantic Analysis , 1990 .
[9] Mehran Sahami,et al. A web-based kernel function for measuring the similarity of short text snippets , 2006, WWW '06.
[10] Tom Minka,et al. Expectation-Propogation for the Generative Aspect Model , 2002, UAI.
[11] W. Bruce Croft,et al. LDA-based document models for ad-hoc retrieval , 2006, SIGIR.
[12] Thorsten Joachims,et al. Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.
[13] Mark Steyvers,et al. Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.
[14] Donald Geman,et al. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .
[15] Inderjit S. Dhillon,et al. Concept Decompositions for Large Sparse Text Data Using Clustering , 2004, Machine Learning.
[16] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[17] Adam L. Berger,et al. A Maximum Entropy Approach to Natural Language Processing , 1996, CL.
[18] Sebastian Thrun,et al. Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.
[19] Evgeniy Gabrilovich,et al. Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.
[20] Frank Keller,et al. Using the Web to Overcome Data Sparseness , 2002, EMNLP.
[21] Ludovic Denoyer,et al. The Wikipedia XML Corpus , 2006, INEX.
[22] Oren Etzioni,et al. Grouper: A Dynamic Clustering Interface to Web Search Results , 1999, Comput. Networks.
[23] Péter Schönhofen. Identifying document topics using the Wikipedia category network , 2009, Web Intell. Agent Syst..
[24] Jorge Nocedal,et al. On the limited memory BFGS method for large scale optimization , 1989, Math. Program..
[25] Susan T. Dumais,et al. Similarity Measures for Short Segments of Text , 2007, ECIR.
[26] Somnath Banerjee,et al. Clustering short texts using wikipedia , 2007, SIGIR.
[27] Ran El-Yaniv,et al. Distributional Word Clusters vs. Words for Text Categorization , 2003, J. Mach. Learn. Res..
[28] Thomas Hofmann,et al. Text categorization by boosting automatically extracted concepts , 2003, SIGIR.
[29] Andrew McCallum,et al. Distributional clustering of words for text classification , 1998, SIGIR '98.
[30] Shourya Roy,et al. A hierarchical monothetic document clustering algorithm for summarization and browsing search results , 2004, WWW '04.
[31] Nando de Freitas,et al. An Introduction to MCMC for Machine Learning , 2004, Machine Learning.
[32] John D. Lafferty,et al. A correlated topic model of Science , 2007, 0708.3601.
[33] R. Bekkerman. Distributional Word Clusters vs , 2006 .
[34] Christopher Meek,et al. Improving Similarity Measures for Short Segments of Text , 2007, AAAI.
[35] Lise Getoor,et al. A Latent Dirichlet Model for Unsupervised Entity Resolution , 2005, SDM.