Semantic Annotation of Scholarly Documents and Citations

Scholarly publishing is in the middle of a revolution based on the use of Web-related technologies as medium of communication. In this paper we describe our ongoing study of semantic publishing and automatic annotation of scholarly documents, presenting several models and tools for the automatic annotation of structural and semantic components of documents. In particular, we focus on citations and their automatic classification obtained by CiTalO, a framework that combines ontology learning techniques with NLP techniques.

[1]  Silvio Peroni,et al.  FaBiO and CiTO: Ontologies for describing bibliographic resources and citations , 2012, J. Web Semant..

[2]  Enrico Motta,et al.  Mining Semantic Relations between Research Areas , 2012, SEMWEB.

[3]  Angelo Di Iorio,et al.  Towards the Automatic Identification of the Nature of Citations , 2013, SePublica.

[4]  Simone Teufel,et al.  An Architecture for Language Processing for Scientific Texts , 2006 .

[5]  Anita de Waard From Proteins to Fairytales: Directions in Semantic Publishing , 2010, IEEE Intelligent Systems.

[6]  Enrico Motta,et al.  Making Sense of Research with Rexplore , 2012, International Semantic Web Conference.

[7]  Brett T. Litz,et al.  Emotional numbing in combat-related post-traumatic stress disorder: A critical review and reformulation , 1992 .

[8]  Aldo Gangemi,et al.  Knowledge Extraction Based on Discourse Representation Theory and Linguistic Frames , 2012, EKAW.

[9]  Angelo Di Iorio,et al.  Dealing with structural patterns of XML documents , 2014, J. Assoc. Inf. Sci. Technol..

[10]  Brian Davis,et al.  Knowledge Engineering and Knowledge Management , 2012, Lecture Notes in Computer Science.

[11]  Steve Pettifer,et al.  Utopia documents: linking scholarly literature with research data , 2010, Bioinform..

[12]  Aldo Gangemi,et al.  The OntoWordNet Project: Extension and Axiomatization of Conceptual Relations in WordNet , 2003, OTM.

[13]  Robert Meersman,et al.  On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE , 2003, Lecture Notes in Computer Science.

[14]  Steve Pettifer,et al.  Ceci n'est pas un hamburger: modelling and representing the scholarly article , 2011, Learn. Publ..

[15]  Angelo Di Iorio,et al.  Recognising document components in XML-based academic articles , 2013, ACM Symposium on Document Engineering.

[16]  David M. Shotton,et al.  Semantic publishing: the coming revolution in scientific journal publishing , 2009, Learn. Publ..

[17]  Angelo Di Iorio,et al.  A Semantic Web approach to everyday overlapping markup , 2011, J. Assoc. Inf. Sci. Technol..

[18]  Fabio Vitali,et al.  Faceted documents: describing document characteristics using semantic lenses , 2012, DocEng '12.

[19]  Jeff Heflin,et al.  The Semantic Web – ISWC 2012 , 2012, Lecture Notes in Computer Science.

[20]  Hwee Tou Ng,et al.  It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text , 2010, ACL.

[21]  Andrei Voronkov,et al.  PDFX: fully-automated PDF-to-XML conversion of scientific literature , 2013, ACM Symposium on Document Engineering.