Cross-domain aspect extraction for sentiment analysis: A transductive learning approach

Abstract Aspect-Based Sentiment Analysis (ABSA) is a promising approach to analyze consumer reviews at a high level of detail, where the opinion about each feature of the product or service is considered. ABSA usually explores supervised inductive learning algorithms, which requires intense human effort for the labeling process. In this paper, we investigate Cross-Domain Transfer Learning approaches, in which aspects already labeled in some domains can be used to support the aspect extraction of another domain where there are no labeled aspects. Existing cross-domain transfer learning approaches learn classifiers from labeled aspects in the source domain and then apply these classifiers in the target domain, i.e., two separate stages that may cause inconsistency due to different feature spaces. To overcome this drawback, we present an innovative approach called CD-ALPHN (Cross-Domain Aspect Label Propagation through Heterogeneous Networks). First, we propose a heterogeneous network-based representation that combines different features (labeled aspects, unlabeled aspects, and linguistic features) from source and target domain as nodes in a single network. Second, we propose a label propagation algorithm for aspect extraction from heterogeneous networks, where the linguistic features are used as a bridge for this propagation. Our algorithm is based on a transductive learning process, where we explore both labeled and unlabeled aspects during the label propagation. Experimental results show that the CD-ALPHN outperforms the state-of-the-art methods in scenarios where there is a high-level of inconsistency between the source and target domains — the most common scenario in real-world applications.

[1]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[2]  Raymond Y. K. Lau,et al.  Product aspect extraction supervised with online domain knowledge , 2014, Knowl. Based Syst..

[3]  Michael K. Ng,et al.  Transductive Multilabel Learning via Label Set Propagation , 2013, IEEE Transactions on Knowledge and Data Engineering.

[4]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[5]  Alneu de Andrade Lopes,et al.  A parameter-free label propagation algorithm using bipartite heterogeneous networks for text classification , 2014, SAC.

[6]  George Michailidis,et al.  Graph-Based Semi-Supervised Learning With Big Data , 2020, Cognitive Analytics.

[7]  Arjun Mukherjee,et al.  Aspect Extraction with Automated Prior Knowledge Learning , 2014, ACL.

[8]  Raymond Y. K. Lau,et al.  Social analytics: Learning fuzzy product ontologies for aspect-oriented sentiment analysis , 2014, Decis. Support Syst..

[9]  Peng Hao,et al.  Transfer learning using computational intelligence: A survey , 2015, Knowl. Based Syst..

[10]  Pradeep Kumar,et al.  A New Relevance Measure for Heterogeneous Networks , 2015, DaWaK.

[11]  Witold Pedrycz,et al.  Particle Competition and Cooperation in Networks for Semi-Supervised Learning , 2012, IEEE Trans. Knowl. Data Eng..

[12]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[13]  Thanassis Tiropanis,et al.  The Role of Data Science in Web Science , 2016, IEEE Intelligent Systems.

[14]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[15]  Erik Cambria,et al.  Affective Computing and Sentiment Analysis , 2016, IEEE Intelligent Systems.

[16]  Yiming Yang,et al.  Cross-Domain Kernel Induction for Transfer Learning , 2017, AAAI.

[17]  Thorsten Joachims,et al.  Transductive Learning via Spectral Graph Partitioning , 2003, ICML.

[18]  Fei Song,et al.  Feature Selection for Sentiment Analysis Based on Content and Syntax Models , 2011, Decis. Support Syst..

[19]  Alneu de Andrade Lopes,et al.  Optimization and label propagation in bipartite heterogeneous networks to improve transductive classification of texts , 2016, Inf. Process. Manag..

[20]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[21]  Arjun Mukherjee,et al.  Aspect Extraction through Semi-Supervised Modeling , 2012, ACL.

[22]  Yu-N Cheah,et al.  A two-fold rule-based model for aspect extraction , 2017, Expert Syst. Appl..

[23]  Claire Cardie,et al.  39. Opinion mining and sentiment analysis , 2014 .

[24]  Jianmin Wang,et al.  Transfer Learning with Graph Co-Regularization , 2012, IEEE Transactions on Knowledge and Data Engineering.

[25]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[26]  Ricardo M. Marcacini,et al.  On combining Websensors and DTW distance for kNN Time Series Forecasting , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[27]  Kim Schouten,et al.  Survey on Aspect-Level Sentiment Analysis , 2016, IEEE Transactions on Knowledge and Data Engineering.

[28]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[29]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[30]  Ricardo M. Marcacini,et al.  Aspect-based Sentiment Analysis using Semi-supervised Learning in Bipartite Heterogeneous Networks , 2016, J. Inf. Data Manag..

[31]  Huimin Zhao,et al.  Adapting sentiment lexicons to domain-specific social media texts , 2017, Decis. Support Syst..

[32]  Morad Benyoucef,et al.  Consumer behavior in social commerce: A literature review , 2016, Decis. Support Syst..

[33]  Salwani Abdullah,et al.  Approaches to Cross-Domain Sentiment Analysis: A Systematic Literature Review , 2017, IEEE Access.

[34]  Yu-N Cheah,et al.  Aspect extraction in sentiment analysis: comparative analysis and survey , 2016, Artificial Intelligence Review.

[35]  Suresh Manandhar,et al.  SemEval-2014 Task 4: Aspect Based Sentiment Analysis , 2014, *SEMEVAL.

[36]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[37]  Jeff A. Bilmes,et al.  Soft-Supervised Learning for Text Classification , 2008, EMNLP.

[38]  Hui Xiong,et al.  Cross-Domain Learning from Multiple Sources: A Consensus Regularization Perspective , 2010, IEEE Transactions on Knowledge and Data Engineering.

[39]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[40]  Bernt Schiele,et al.  Transfer Learning in a Transductive Setting , 2013, NIPS.

[41]  Pushpak Bhattacharyya,et al.  Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis , 2017, Knowl. Based Syst..

[42]  Ronaldo Florence,et al.  Constrained Hierarchical Clustering for News Events , 2017, IDEAS.

[43]  Lei Zhang,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2012, Mining Text Data.

[44]  Pu Zhang,et al.  A statistical approach to opinion target extraction using domain relevance , 2016, 2016 2nd IEEE International Conference on Computer and Communications (ICCC).

[45]  Qiong Wu,et al.  A two-stage framework for cross-domain sentiment classification , 2011, Expert Syst. Appl..

[46]  Philip S. Yu,et al.  Adaptation Regularization: A General Framework for Transfer Learning , 2014, IEEE Transactions on Knowledge and Data Engineering.

[47]  Jun Yan,et al.  Active Sentiment Domain Adaptation , 2017, ACL.

[48]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[49]  Hui Xiong,et al.  Transfer learning from multiple source domains via consensus regularization , 2008, CIKM '08.

[50]  Philip S. Yu,et al.  Domain Invariant Transfer Kernel Learning , 2015, IEEE Transactions on Knowledge and Data Engineering.

[51]  Guodong Zhou,et al.  Active Learning for Cross-domain Sentiment Classification , 2013, IJCAI.