Citation Function Classification Based on Ontologies and Convolutional Neural Networks

In recent years, citations have been used increasingly to improve methods for evaluating the quality of publications. Traditional measures such as the impact factor depend only on the citation count. Recently, citation functions, or purposes, have gained attention as a way to improve such evaluation. Citation function classification aims to identify the reasons an author cites previous literature. Several approaches have been proposed for classifying citation functions in scholarly publications. However, these approaches consider neither the author's characteristics, such as author information, nor the publication level, both of which can be useful for citation function classification. In addition, previous studies mainly used classical machine learning techniques, such as support vector machines and neural networks, with a number of manually created features. Manual feature representation is time-consuming and error-prone. To address these problems, we propose a citation function classification model that combines ontologies with convolutional neural networks (CNNs). In our model, ontologies are used to represent the author's characteristics and the citations semantically. We then incorporate this representation into a CNN model to classify citations into six functions. We conducted experiments on a public dataset and showed that the proposed approach performs well compared with existing techniques in terms of accuracy.
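The general idea of feeding an ontology-derived representation into a CNN classifier can be illustrated with a minimal sketch. The abstract does not specify the architecture, so everything here is an assumption made for illustration: the function names, the dimensions, the choice to concatenate the ontology features with max-pooled convolution outputs, and the random (untrained) weights. Only the number of output classes, six, comes from the text.

```python
import math
import random

NUM_CLASSES = 6  # the abstract states six citation functions; their labels are not given there


def conv1d_max_pool(embeddings, filt):
    """Slide one filter over the token sequence; return the max ReLU activation."""
    width = len(filt)
    best = 0.0  # ReLU floor: negative activations are clipped to zero
    for start in range(len(embeddings) - width + 1):
        window = embeddings[start:start + width]
        activation = sum(
            w * x
            for filt_row, emb_row in zip(filt, window)
            for w, x in zip(filt_row, emb_row)
        )
        best = max(best, activation)
    return best


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(embeddings, ontology_features, filters, weights, biases):
    """Concatenate pooled CNN features with ontology features, then apply a linear layer."""
    pooled = [conv1d_max_pool(embeddings, f) for f in filters]
    features = pooled + list(ontology_features)
    scores = [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(scores)


# Toy usage with hypothetical sizes and random, untrained parameters.
rng = random.Random(0)
EMB_DIM, SEQ_LEN, FILTER_WIDTH, NUM_FILTERS, ONTO_DIM = 4, 7, 3, 5, 3

# Stand-in for word embeddings of one citation sentence.
embeddings = [[rng.uniform(-1, 1) for _ in range(EMB_DIM)] for _ in range(SEQ_LEN)]
# Stand-in for an ontology-derived vector describing the author and the citation.
ontology_features = [1.0, 0.0, 0.5]

filters = [
    [[rng.uniform(-1, 1) for _ in range(EMB_DIM)] for _ in range(FILTER_WIDTH)]
    for _ in range(NUM_FILTERS)
]
weights = [
    [rng.uniform(-1, 1) for _ in range(NUM_FILTERS + ONTO_DIM)]
    for _ in range(NUM_CLASSES)
]
biases = [0.0] * NUM_CLASSES

probs = classify(embeddings, ontology_features, filters, weights, biases)
predicted = max(range(NUM_CLASSES), key=probs.__getitem__)
```

In this sketch the convolution sees only the citation text, and the ontology vector joins at the classification layer. Other fusion points, such as appending ontology features to each token embedding, would be equally plausible readings of the abstract.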
