CCRO: Citation’s Context & Reasons Ontology

Research papers can be visualized as a networked information space that contains a collection of information entities, inter-connected by directed links, commonly known as citation graph. There is a possibility to enrich the citation graph with meaningful relations using semantic tags. We have discovered and evaluated more than 150 citations’ reasons from the existing published literature to be represented as citation tags. Many of these reasons have overlapped and diffused meanings. Annotating such a large volume of citation graphs with citation’s reasons manually is nearly impossible, giving rise to a need to discover the citation’s reasons automatically with high accuracy. The first step towards this is developing a minimal set of citation’s context and reasons that are disjoint in nature. It would be a great help to the reasoning system if these reasons are represented in a formal way in the form of an ontology. By adopting a well-defined scientific methodology to formulate an ontology of citation reasons, we have reduced 150 reasons into only eight disjoint reasons, using an iterative process of sentiment analysis, collaborative meanings, and experts’ opinions. Based on our findings and experiments, we have proposed a citation’s context and reasons ontology (CCRO) that provides abstract conceptualization required to organize citations’ relations. CCRO has been verified, validated, and assessed by using the well-defined procedures and tools proposed in the literature for ontology evaluation. The results show that the proposed ontology is concise, complete, and consistent. For the instantiation and mapping of ontology classes on real data, we have developed a mapping graph among the verbs with predicative complements in the English Language, the verbs extracted from the selected corpus using the NLP and CCRO classes. Using this mapping graph, the mapping of ontology classes in each citation’s sentiment is explained with a complete mapping on the selected dataset.

[1]  Dragomir R. Radev,et al.  NLP-driven citation analysis for scientometrics , 2016, Natural Language Engineering.

[2]  Stephen Cranefield,et al.  Context identification of sentences in related work sections using a conditional random field: towards intelligent digital libraries , 2010, JCDL '10.

[3]  Angelo Di Iorio,et al.  Evaluating Citation Functions in CiTO: Cognitive Issues , 2014, ESWC.

[4]  Xiaojun Wan,et al.  Are all literature citations equally important? Automatic citation strength estimation and its applications , 2014, J. Assoc. Inf. Sci. Technol..

[5]  Staša Milojević,et al.  Citation content analysis (CCA): A framework for syntactic and semantic analysis of citation content , 2012, J. Assoc. Inf. Sci. Technol..

[6]  Katherine W. McCain,et al.  Citation context analysis and aging patterns of journal articles in molecular genetics , 1989, Scientometrics.

[7]  E. Garfield When to Cite , 1996, The Library Quarterly.

[8]  J. Ioannidis,et al.  Why Current Publication Practices May Distort Science , 2008, PLoS medicine.

[9]  Andrea Giovanni Nuzzolese,et al.  Describing bibliographic references in RDF , 2014, SePublica.

[10]  Awais Athar,et al.  Sentiment analysis of scientific citations , 2014 .

[11]  Barbara Tillett,et al.  What is FRBR? A conceptual model for the bibliographic universe , 2005 .

[12]  Muhammad Abdul Qadir,et al.  Ontological Errors - Inconsistency, Incompleteness and Redundancy , 2008, ICEIS.

[13]  Hinrich Schütze,et al.  Towards a Generic and Flexible Citation Classifier Based on a Faceted Classification Scheme , 2012, COLING.

[14]  Daryl E. Chubin,et al.  Content Analysis of References: Adjunct or Alternative to Citation Counting? , 1975 .

[15]  Yaoyun Zhang,et al.  Citation Sentiment Analysis in Clinical Trial Papers , 2015, AMIA.

[16]  Muhammad Rafi,et al.  Classification of Research Citations (CRC) , 2015, CLBib@ISSI.

[17]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[18]  Dirk Noël Beth Levin. English Verb Classes and Alternations: A Preliminary Investigation , 1995 .

[19]  Boris Motik,et al.  HermiT: An OWL 2 Reasoner , 2014, Journal of Automated Reasoning.

[20]  Bluma C. Peritz,et al.  A classification of citation roles for the social sciences and related fields , 1983, Scientometrics.

[21]  Bei Yu,et al.  Automated citation sentiment analysis: What can we learn from biomedical researchers , 2013, ASIST.

[22]  Silvio Peroni,et al.  FaBiO and CiTO: Ontologies for describing bibliographic resources and citations , 2012, J. Web Semant..

[23]  Alan Ruttenberg,et al.  The SWAN biomedical discourse ontology , 2008, J. Biomed. Informatics.

[24]  Simone Teufel,et al.  An annotation scheme for citation function , 2009, SIGDIAL Workshop.

[25]  Niket Tandon,et al.  Citation Context Sentiment Analysis for Structured Summarization of Research Papers , 2012 .

[26]  Awais Athar,et al.  Sentiment Analysis of Citations using Sentence Structure-Based Features , 2011, ACL.

[27]  Asunción Gómez-Pérez,et al.  Ontological Engineering: With Examples from the Areas of Knowledge Management, e-Commerce and the Semantic Web , 2004, Advanced Information and Knowledge Processing.

[28]  Simone Teufel,et al.  Automatic classification of citation function , 2006, EMNLP.

[29]  Eugene Garfield,et al.  Highly cited old papers , 1979, Journal of the American Society for Information Science.

[30]  C. O. Frost The Use of Citations in Literary Research: A Preliminary Classification of Citation Functions , 1979, The Library Quarterly.

[31]  José M. Gómez,et al.  Survey in sentiment, polarity and function analysis of citation , 2014, ArgMining@ACL.

[32]  M. Moravcsik,et al.  Some Results on the Function and Quality of Citations , 1975 .

[33]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[34]  Umut Al,et al.  A content-based citation analysis study based on text categorization , 2017, Scientometrics.

[35]  Oren Etzioni,et al.  Identifying Meaningful Citations , 2015, AAAI Workshop: Scholarly Big Data.

[36]  S. Cozzens Comparing the Sciences: Citation Context Analysis of Papers from Neuropharmacology and the Sociology of Science , 1985 .

[37]  Simone Teufel,et al.  Context-Enhanced Citation Sentiment Detection , 2012, NAACL.

[38]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[39]  Fabio Vitali,et al.  The Document Components Ontology (DoCO) , 2016, Semantic Web.

[40]  Ulrich Schäfer,et al.  Ensemble-style Self-training on Citation Classification , 2011, IJCNLP.

[41]  I. Spiegel-Rosing Science Studies: Bibliometric and Content Analysis , 1977 .

[42]  Dragomir R. Radev,et al.  The ACL anthology network corpus , 2009, Language Resources and Evaluation.

[43]  Boris Motik,et al.  OWL 2 Web Ontology Language XML Serialization , 2008 .

[44]  In-Cheol Kim,et al.  Automated classification of author's sentiments in citation using machine learning techniques: A preliminary study , 2015, 2015 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB).

[45]  Amit P. Sheth,et al.  CEVO: Comprehensive EVent Ontology Enhancing Cognitive Annotation , 2017, ArXiv.

[46]  Ben-Ami Lipetz,et al.  Improvement of the selectivity of citation indexes to science literature through inclusion of citation relationship indicators , 1965 .

[47]  Manasi S. Patwardhan,et al.  Context based citation summary of research articles: A step towards qualitative citation index , 2015, 2015 International Conference on Computer, Communication and Control (IC4).

[48]  Yifan He,et al.  Towards Fine-grained Citation Function Classification , 2013, RANLP.