Paper information: A Neural Network for Text Representation

Text categorization and retrieval tasks often depend on a good representation of textual data. Departing from the classical vector space model, several probabilistic models, such as PLSA, have been proposed recently. In this paper, we propose a neural-network-based, non-probabilistic solution that jointly captures a rich representation of words and documents. Experiments on two information retrieval tasks, using the TDT2 database and the TREC-8 and TREC-9 query sets, showed better performance for the proposed neural network model than for PLSA and the classical TFIDF representation.

Samy Bengio | Mikaela Keller
