Discovering New Intents with Deep Aligned Clustering

Discovering new intents is a crucial task in a dialogue system. Most existing methods are limited in transferring the prior knowledge from known intents to new intents. These methods also have difficulties in providing high-quality supervised signals to learn clustering-friendly features for grouping unlabeled intents. In this work, we propose an effective method (Deep Aligned Clustering) to discover new intents with the aid of limited known intent data. Firstly, we leverage a few labeled known intent samples as prior knowledge to pre-train the model. Then, we perform k-means to produce cluster assignments as pseudo-labels. Moreover, we propose an alignment strategy to tackle the label inconsistency during clustering assignments. Finally, we learn the intent representations under the supervision of the aligned pseudo-labels. With an unknown number of new intents, we predict the number of intent categories by eliminating low-confidence intent-wise clusters. Extensive experiments on two benchmark datasets show that our method is more robust and achieves substantial improvements over the state-of-the-art methods.(Code available at this https URL)

[1]  Geoffrey Zweig,et al.  Clustering novel intents in a conversational interaction system with semantic parsing , 2015, INTERSPEECH.

[2]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[3]  Hua Xu,et al.  Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement , 2019, AAAI.

[4]  Chen Change Loy,et al.  Online Deep Clustering for Unsupervised Representation Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Yangming Li,et al.  DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification , 2020, AAAI.

[6]  G. Krishna,et al.  Agglomerative clustering using the concept of mutual nearest neighbourhood , 1978, Pattern Recognit..

[7]  Andrew Zisserman,et al.  Learning to Discover Novel Visual Categories via Deep Transfer Clustering , 2019 .

[8]  Hua Xu,et al.  Deep Unknown Intent Detection with Margin Loss , 2019, ACL.

[9]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[10]  Matthew Henderson,et al.  Efficient Intent Detection with Dual Sentence Encoders , 2020, NLP4CONVAI.

[11]  Zsolt Kira,et al.  Multi-class Classification without Multi-class Labels , 2019, ICLR.

[12]  Zsolt Kira,et al.  Learning to cluster in order to Transfer across domains and tasks , 2017, ICLR.

[13]  Chih-Li Huo,et al.  Slot-Gated Modeling for Joint Slot Filling and Intent Prediction , 2018, NAACL.

[14]  K. Chidananda Gowda A feature reduction and unsupervised classification algorithm for multispectral data , 1984, Pattern Recognit..

[15]  Alessandro Moschitti,et al.  Supervised Clustering of Questions into Intents for Dialog System Applications , 2018, EMNLP.

[16]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[17]  Gökhan Tür,et al.  A weakly-supervised approach for discovering new user intents from search query logs , 2013, INTERSPEECH.

[18]  Arindam Banerjee,et al.  Active Semi-Supervision for Pairwise Constrained Clustering , 2004, SDM.

[19]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[20]  Raymond J. Mooney,et al.  Integrating constraints and metric learning in semi-supervised clustering , 2004, ICML.

[21]  Pavel Král,et al.  Unsupervised Dialogue Act Induction using Gaussian Mixtures , 2016, EACL.

[22]  Matthijs Douze,et al.  Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[23]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[24]  Dhruv Batra,et al.  Joint Unsupervised Learning of Deep Representations and Image Clusters , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Yangming Li,et al.  A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding , 2019, EMNLP.

[26]  Yi Yang,et al.  Dialog Intent Induction with Deep Multi-View Clustering , 2019, EMNLP/IJCNLP.

[27]  Bo Yang,et al.  Towards K-means-friendly Spaces: Simultaneous Deep Learning and Clustering , 2016, ICML.

[28]  Hongxia Jin,et al.  A Bi-Model Based RNN Semantic Frame Parsing Model for Intent Detection and Slot Filling , 2018, NAACL.

[29]  Srinivas Bangalore,et al.  Intent Discovery Through Unsupervised Semantic Text Clustering , 2018, INTERSPEECH.

[30]  Ali Farhadi,et al.  Unsupervised Deep Embedding for Clustering Analysis , 2015, ICML.

[31]  Thomas Wolf,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[32]  Claire Cardie,et al.  Constrained K-means Clustering with Background Knowledge , 2001, ICML.

[33]  Lingfeng Wang,et al.  Deep Adaptive Image Clustering , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34]  Xu Sun,et al.  Auto-Dialabel: Labeling Dialogue Data with Unsupervised Learning , 2018, EMNLP.

[35]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[36]  Qingkai Min,et al.  Dialogue State Induction Using Neural Latent Variable Models , 2020, IJCAI.

[37]  Lingjia Tang,et al.  An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction , 2019, EMNLP.

[38]  Srinivasan Parthasarathy,et al.  Open Intent Extraction from Natural Language Interactions , 2020, WWW.

[39]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.