Contrastive Learning for Neural Topic Model

Recent empirical studies show that adversarial topic models (ATMs) can successfully capture the semantic patterns of a document by differentiating it from a dissimilar sample. However, this discriminative-generative architecture has two important drawbacks: (1) it does not relate similar documents, which share the same document-word distribution over salient words; (2) it restricts the ability to integrate external information, such as document sentiment, which has been shown to benefit the training of neural topic models. To address these issues, we revisit the adversarial topic architecture from the viewpoint of mathematical analysis, propose a novel approach that reformulates the discriminative goal as an optimization problem, and design a novel sampling method that facilitates the integration of external variables. The reformulation encourages the model to incorporate relations among similar samples and enforces a constraint on the similarity among dissimilar ones, while the sampling method, based on the internal input and the reconstructed output, informs the model of the salient words that contribute to the main topic. Experimental results show that our framework outperforms other state-of-the-art neural topic models in terms of topic coherence on three common benchmark datasets spanning various domains, vocabulary sizes, and document lengths.
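The two ingredients described above can be illustrated with a minimal sketch: a positive sample built from the words the model reconstructs most strongly (assumed salient), a negative sample built from the weakly reconstructed words, and an InfoNCE-style contrastive loss over topic representations. This is a simplified illustration under our own assumptions (function names, the top-k/bottom-k selection rule, and the cosine-similarity loss are illustrative choices, not the paper's exact formulation):

```python
import numpy as np

def build_pos_neg(bow, recon, k=10):
    """Illustrative sampling based on input and reconstructed output:
    keep the k in-document words with the highest reconstruction weight
    as the positive sample, and the k lowest as the negative sample."""
    present = np.nonzero(bow)[0]                  # word ids present in the document
    order = present[np.argsort(-recon[present])]  # sort by descending recon weight
    pos_ids, neg_ids = order[:k], order[-k:]
    pos, neg = np.zeros_like(bow), np.zeros_like(bow)
    pos[pos_ids] = bow[pos_ids]                   # salient words only
    neg[neg_ids] = bow[neg_ids]                   # weakly reconstructed words only
    return pos, neg

def contrastive_loss(z, z_pos, z_neg, tau=0.5):
    """InfoNCE-style objective on topic representations: pull the anchor
    toward its positive sample and push it away from the negative one."""
    def cos(a, b):
        return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12)
    s_pos = np.exp(cos(z, z_pos) / tau)
    s_neg = np.exp(cos(z, z_neg) / tau)
    return -np.log(s_pos / (s_pos + s_neg))
```

In a full model, `z`, `z_pos`, and `z_neg` would be the encoder's topic distributions for the document and its two constructed samples, and this loss would be added to the reconstruction objective of the underlying topic model.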
