Aspect based citation sentiment analysis using linguistic patterns for better comprehension of scientific knowledge

Almost unrestricted access to a plethora of research has come with a potential drawback: extracting relevant scientific publications is no longer a straightforward task. The usual approach is to search citation indexes, but these also return a large number of pertinent papers, and even a narrowly focused paper can accumulate thousands of citations. In such a scenario, citation text can be a key indicator of the importance and relevance of a paper to the researcher, based on different aspects of the cited work such as technique, corpus, method, task, concept, measure, model and tool. This paper presents a novel approach that identifies aspect-level sentiments to reveal hidden patterns in scholarly big data. The proposed methodology comprises two levels. At the first level, it extracts the aspects from the citation sentences using the pattern of opinionated phrases around the aspect. At the second level, it detects the sentiment polarity of the identified aspect from nearby words and associates it with the corresponding aspect category using a linguistic rule-based approach. We consider the words before, after and around the aspect using n-gram based features: ‘N-gram after’, ‘N-gram before’ and ‘N-gram around’. Our results reveal that the ‘N-gram around’ feature performed better than the other features and that the SVM outperformed the other classifiers considered for all N-gram models.
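The three context features named above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the aspect lexicon `ASPECT_TERMS`, the whitespace tokenizer, the window size `n` and the function names are all assumptions made for illustration, and the sketch covers only the collection of context words before, after and around an aspect term, not the pattern-based aspect extraction or the classification step.

```python
from typing import Dict, List, Tuple

# Hypothetical aspect lexicon (an assumption): the category names come from the
# abstract, but using them directly as trigger words is purely illustrative.
ASPECT_TERMS = {"technique", "corpus", "method", "task",
                "concept", "measure", "model", "tool"}


def tokenize(sentence: str) -> List[str]:
    """Lowercase whitespace tokenizer that strips surrounding punctuation."""
    tokens = (tok.strip(".,;:()[]\"'").lower() for tok in sentence.split())
    return [tok for tok in tokens if tok]


def context_windows(tokens: List[str], index: int, n: int) -> Dict[str, List[str]]:
    """Collect up to n tokens before, after, and around the token at `index`."""
    before = tokens[max(0, index - n):index]
    after = tokens[index + 1:index + 1 + n]
    # "around" is taken here as the before and after windows joined together,
    # which is one possible reading of the 'N-gram around' feature.
    return {"before": before, "after": after, "around": before + after}


def aspect_contexts(sentence: str, n: int = 2) -> List[Tuple[str, Dict[str, List[str]]]]:
    """Return (aspect, windows) pairs for every lexicon term found in the sentence."""
    tokens = tokenize(sentence)
    return [
        (tok, context_windows(tokens, i, n))
        for i, tok in enumerate(tokens)
        if tok in ASPECT_TERMS
    ]


if __name__ == "__main__":
    citation = "Their method performs remarkably well on this corpus."
    for aspect, windows in aspect_contexts(citation, n=2):
        print(aspect, windows)
```

In a full pipeline, window contents like these would typically be turned into feature vectors and passed to a classifier such as an SVM; that step is omitted here because the abstract does not specify how it is configured.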
