A Nested HDP for Hierarchical Topic Models
暂无分享,去创建一个
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP is a generalization of the nested Chinese restaurant process (nCRP) that allows each word to follow its own path to a topic node according to a document-specific distribution on a shared tree. This alleviates the rigid, single-path formulation of the nCRP, allowing a document to more easily express thematic borrowings as a random effect. We demonstrate our algorithm on 1.8 million documents from The New York Times.
[1] Michael I. Jordan,et al. Hierarchical Dirichlet Processes , 2006 .
[2] Chong Wang,et al. Stochastic variational inference , 2012, J. Mach. Learn. Res..
[3] Chong Wang,et al. Nested Hierarchical Dirichlet Processes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[4] Thomas L. Griffiths,et al. The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies , 2007, JACM.