论文信息 - Sparse Additive Text Models with Low Rank Background

Sparse Additive Text Models with Low Rank Background

The sparse additive model for text modeling involves the sum-of-exp computing, whose cost is consuming for large scales. Moreover, the assumption of equal background across all classes/topics may be too strong. This paper extends to propose sparse additive model with low rank background (SAM-LRB) and obtains simple yet efficient estimation. Particularly, employing a double majorization bound, we approximate log-likelihood into a quadratic lower-bound without the log-sum-exp terms. The constraints of low rank and sparsity are then simply embodied by nuclear norm and l1-norm regularizers. Interestingly, we find that the optimization task of SAM-LRB can be transformed into the same form as in Robust PCA. Consequently, parameters of supervised SAM-LRB can be efficiently learned using an existing algorithm for Robust PCA based on accelerated proximal gradient. Besides the supervised case, we extend SAM-LRB to favor unsupervised and multifaceted scenarios. Experiments on three real data demonstrate the effectiveness and efficiency of SAM-LRB, compared with a few state-of-the-art models.

Lei Shi

[1] Xu Ling,et al. Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[2] Chong Wang,et al. Variational inference in nonconjugate models , 2012, J. Mach. Learn. Res..

[3] John Wright,et al. Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Matrices via Convex Optimization , 2009, NIPS.

[4] Mário A. T. Figueiredo. Adaptive Sparseness Using Jeffreys Prior , 2001, NIPS.

[5] Eric P. Xing,et al. Staying Informed: Supervised and Semi-Supervised Multi-View Topical Analysis of Ideological Perspective , 2010, EMNLP.

[6] Martin Jaggi,et al. A Simple Algorithm for Nuclear Norm Regularized Problems , 2010, ICML.

[7] Michael J. Paul,et al. A Two-Dimensional Topic-Aspect Model for Discovering Multi-Faceted Topics , 2010, AAAI.

[8] Chong Wang,et al. Decoupling Sparsity and Smoothness in the Discrete Hierarchical Dirichlet Process , 2009, NIPS.

[9] Eric P. Xing,et al. Sparse Additive Generative Models of Text , 2011, ICML.

[10] Arvind Ganesh,et al. Fast Convex Optimization Algorithms for Exact Recovery of a Corrupted Low-Rank Matrix , 2009 .

[11] Marc Teboulle,et al. A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[12] Noah A. Smith,et al. Discovering Factions in the Computational Linguistics Community , 2012, Discoveries@ACL.

[13] Jean Ponce,et al. Efficient Optimization for Discriminative Latent Class Models , 2010, NIPS.

[14] David M. Blei,et al. Supervised Topic Models , 2007, NIPS.

[15] Yun Jiang,et al. Discovering Different Types of Topics: Factored Topic Models , 2013, IJCAI.

[16] Junfeng Yang,et al. Linearized augmented Lagrangian and alternating direction methods for nuclear norm minimization , 2012, Math. Comput..

[17] Guillaume Bouchard. Efficient Bounds for the Softmax Function and Applications to Approximate Inference in Hybrid models , 2008 .

[18] T. Minka. Estimating a Dirichlet distribution , 2012 .

[19] Lawrence Carin,et al. Bayesian Robust Principal Component Analysis , 2011, IEEE Transactions on Image Processing.

[20] Nicolas Vayatis,et al. Estimation of Simultaneously Sparse and Low Rank Matrices , 2012, ICML.

[21] John Wright,et al. Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Matrices via Convex Optimization , 2009, NIPS.

[22] Mark Dredze,et al. Shared Components Topic Models , 2012, HLT-NAACL.

[23] D. Böhning. Multinomial logistic regression algorithm , 1992 .

[24] Michael I. Jordan,et al. A Variational Approach to Bayesian Logistic Regression Models and their Extensions , 1997, AISTATS.

[25] John D. Lafferty,et al. Correlated Topic Models , 2005, NIPS.

[26] Eric P. Xing,et al. MedLDA: maximum margin supervised topic models , 2012, J. Mach. Learn. Res..

[27] Alexander J. Smola,et al. Discovering geographical topics in the twitter stream , 2012, WWW.

[28] Xi Chen,et al. Smoothing proximal gradient method for general structured sparse regression , 2010, The Annals of Applied Statistics.

[29] Jonathan Eckstein. Augmented Lagrangian and Alternating Direction Methods for Convex Optimization: A Tutorial and Some Illustrative Computational Results , 2012 .

[30] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..