论文信息 - Using Stochastic Helmholtz Machine for Text Learning

Using Stochastic Helmholtz Machine for Text Learning

We present an approach for text analysis, especially for topic words extraction and document classification, based on a probabilistic generative model. Generative models are useful since they can extract the underlying causal structure of data objects. For this model, a stochastic Helmholtz machine is used and it is fitted using the wake-sleep algorithm, a simple stochastic learning algorithm. Given a document set, the Helmholtz machine tries to capture the correlation of the words used in the set, thus can extract various semantic features for a set of documents. We present some experimental results on topic words extraction for TDT-2 and TREC-8 ad-hoc data sets. And for another real-world document set, 20 Newsgroup collection, a categorization is performed and the performance is compared with that of naive Bayes classifier, another simple generative model. Additionally, we present a preliminary work to make Helmholtz machines more appropriate for processing text documents.

Byoung-Tak Zhang | Jeong Ho Chang | Byoung-Tak Zhang | J. Chang

[1] Geoffrey E. Hinton,et al. The Helmholtz Machine , 1995, Neural Computation.

[2] Ken Lang,et al. NewsWeeder: Learning to Filter Netnews , 1995, ICML.

[3] Daphne Koller,et al. Using machine learning to improve information access , 1998 .

[4] R. Zemel,et al. Learning sparse multiple cause models , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[5] Andrew McCallum,et al. A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[6] H. Sebastian Seung,et al. Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[7] Brendan J. Frey,et al. Graphical Models for Machine Learning and Digital Communication , 1998 .

[8] Virginia R. de Sa,et al. Using Helmholtz Machines to Analyze Multi-channel Neuronal Recordings , 1997, NIPS.

[9] Geoffrey E. Hinton,et al. Varieties of Helmholtz Machine , 1996, Neural Networks.

[10] Naftali Tishby,et al. Document clustering using word clusters via the information bottleneck method , 2000, SIGIR '00.

[11] Yee Whye Teh,et al. Rate-coded Restricted Boltzmann Machines for Face Recognition , 2000, NIPS.

[12] Geoffrey E. Hinton,et al. The "wake-sleep" algorithm for unsupervised neural networks. , 1995, Science.