aDMSCN: A Novel Perspective for User Intent Prediction in Customer Service Bots

As one of the core components of customer service bot, User Intent Prediction (UIP) aims at predicting users? intents (usually represented as predefined user questions) before they ask, and has been widely applied in real applications. However, when developing a machine learning system for this problem, two critical issues, i.e., the problem of feature drift and class imbalance, may emerge and seriously deprave the system performance. Moreover, various scenarios may arise due to business demands, making the aforementioned problems much more severe. To address these two problems, we propose an attention-based Deep Multi-instance Sequential Cross Network (aDMSCN) to deal with the UIP task. On the one hand,the UIP task can be subtly formalized as multi-instance learning(MIL) task with an attention-based method proposed to alleviate the influences of feature drift. To the best of our knowledge, this is the first attempt to model the problem from a MIL perspective.On the other hand, a ratio-sensitive loss is also developed in our model, which can mitigate the negative impact of class imbalance. Extensive experiments on both offline real-world datasets and on-line A/B testing show that our proposed framework significantly out performs other state-of-art methods for the UIP task.

[1]  Ivan Laptev,et al.  Weakly supervised object recognition with convolutional neural networks , 2014 .

[2]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[3]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[4]  Tat-Seng Chua,et al.  Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks , 2017, IJCAI.

[5]  Ohad Shamir,et al.  Learning to classify with missing and corrupted features , 2008, ICML '08.

[6]  Max Welling,et al.  Attention-based Deep Multiple Instance Learning , 2018, ICML.

[7]  Tat-Seng Chua,et al.  Neural Factorization Machines for Sparse Predictive Analytics , 2017, SIGIR.

[8]  Jun Zhou,et al.  Locally Connected Deep Learning Framework for Industrial-scale Recommender Systems , 2017, WWW.

[9]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..

[10]  Zhi-Hua Zhou,et al.  Multi-Instance Learning Based Web Mining , 2005, Applied Intelligence.

[11]  Jaume Amores,et al.  Multiple instance classification: Review, taxonomy and comparative study , 2013, Artif. Intell..

[12]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[13]  Heng-Tze Cheng,et al.  Wide & Deep Learning for Recommender Systems , 2016, DLRS@RecSys.

[14]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[15]  Dong Yu,et al.  Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features , 2016, KDD.

[16]  Guorui Zhou,et al.  Deep Interest Network for Click-Through Rate Prediction , 2017, KDD.

[17]  Zhi-Hua Zhou,et al.  Multi-Instance Learning with Key Instance Shift , 2017, IJCAI.

[18]  W. Bruce Croft,et al.  User Intent Prediction in Information-seeking Conversations , 2019, CHIIR.

[19]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[20]  Cen Chen,et al.  AntProphet: an Intention Mining System behind Alipay's Intelligent Customer Service Bot , 2019, IJCAI.

[21]  Jian Tang,et al.  AutoInt: Automatic Feature Interaction Learning via Self-Attentive Neural Networks , 2018, CIKM.

[22]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Ying Gao,et al.  A rank-based Prediction Algorithm of Learning User's Intention , 2012 .

[24]  BengioYoshua,et al.  A neural probabilistic language model , 2003 .

[25]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Jean Paul Barddal,et al.  A survey on feature drift adaptation: Definition, benchmark, challenges and future directions , 2017, J. Syst. Softw..

[27]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[28]  Xing Xie,et al.  xDeepFM: Combining Explicit and Implicit Feature Interactions for Recommender Systems , 2018, KDD.

[29]  Jun Wang,et al.  Deep Learning over Multi-field Categorical Data - - A Case Study on User Response Prediction , 2016, ECIR.

[30]  Gang Fu,et al.  Deep & Cross Network for Ad Click Predictions , 2017, ADKDD@KDD.

[31]  Chih-Jen Lin,et al.  Field-aware Factorization Machines for CTR Prediction , 2016, RecSys.

[32]  Jun Zhou,et al.  Reinforcement Learning for User Intent Prediction in Customer Service Bots , 2019, SIGIR.

[33]  Yunming Ye,et al.  DeepFM: A Factorization-Machine based Neural Network for CTR Prediction , 2017, IJCAI.

[34]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[35]  Chang Zhou,et al.  Deep Interest Evolution Network for Click-Through Rate Prediction , 2018, AAAI.