Active learning enhanced semi-automatic annotation tool for aspect-based sentiment analysis

Aspect-based sentiment analysis has become a popular research field that makes it possible to quantify textual evaluations of different aspects of products and services. Methods of aspect-based sentiment analysis built on machine learning usually depend on manually annotated training corpora. To facilitate the creation of such corpora, dedicated annotation tools are needed. In this work, we propose a semi-automatic annotation tool that uses active learning to make document annotation more effective. The use of active learning adapted to the needs of aspect-based sentiment analysis is the main difference between the proposed solution and existing annotation tools. We applied the tool in the domain of hotel evaluations. The results of our experiments confirmed that the quality of the annotation suggestions, measured by F1, increases faster with active learning than in the scenario without it.
