Open Research Knowledge Graph: Towards Machine Actionability in Scholarly Communication

Despite improved findability of and access to scientific knowledge in recent decades, scholarly communication continues to be document-based. Scientific knowledge remains locked in representations that are inadequate for machine processing. In this article, we present initial steps towards next generation digital libraries and infrastructures that acquire, curate, publish and process scholarly knowledge semantically, in machine readable form leveraging knowledge graphs. The primary contribution of this work is to present and discuss early developments of a system designed to crowdsource machine readable descriptions of research contributions published in scholarly articles and a knowledge graph infrastructure for description storage and access. We report on the results of a first experimental evaluation of the concept and its implementation with the participants of a recent international conference. The results suggest that users find such a system useful, and the possibilities it could enable intriguing.

[1]  Jens Lehmann,et al.  Old is Gold: Linguistic Driven Approach for Entity and Relation Linking of Short Text , 2019, NAACL.

[2]  Bianca Kramer,et al.  Ten Hot Topics around Scholarly Publishing , 2019, Publ..

[3]  Júlio Cesar dos Reis,et al.  Automated Coding of Medical Diagnostics from Free-Text: The Role of Parameters Optimization and Imbalanced Classes , 2018, DILS.

[4]  Erhard Rahm,et al.  A Learning-Based Approach to Combine Medical Annotation Results - (Short Paper) , 2018, DILS.

[5]  Andreas Friedrich,et al.  Interactive Visualization for Large-Scale Multi-factorial Research Designs , 2018, DILS.

[6]  Manuel Prinz,et al.  Towards Research Infrastructures that Curate Scientific Information: A Use Case in Life Sciences , 2018, DILS.

[7]  Brandon M. Malone,et al.  Knowledge Graph Completion to Predict Polypharmacy Side Effects , 2018, DILS.

[8]  Martin Hofmann-Apitius,et al.  Converting Alzheimer's Disease Map into a Heavyweight Ontology: A Formal Network to Integrate Data , 2018, DILS.

[9]  Maria-Esther Vidal,et al.  Towards a Knowledge Graph for Science , 2018, WIMS.

[10]  Jens Lehmann,et al.  Two for one: querying property graph databases using SPARQL via gremlinator , 2018, GRADES/NDA@SIGMOD/PODS.

[11]  Doug Downey,et al.  Construction of the Literature Graph in Semantic Scholar , 2018, NAACL.

[12]  Jens Lehmann,et al.  Why Reinvent the Wheel: Let's Build Question Answering Systems Together , 2018, WWW.

[13]  Krzysztof Janowicz,et al.  The GeoLink knowledge graph , 2018 .

[14]  Jens Lehmann,et al.  EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs , 2018, SEMWEB.

[15]  Angelo Di Iorio,et al.  Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles , 2017, PeerJ Prepr..

[16]  Christoph Lange,et al.  Towards a Knowledge Graph Representing Research Findings by Semantifying Survey Articles , 2017, TPDL.

[17]  Maria-Esther Vidal,et al.  Integration of Scholarly Communication Metadata Using Knowledge Graphs , 2017, TPDL.

[18]  Bianca Kramer,et al.  The Scholarly Commons - principles and practices to guide research communication , 2017 .

[19]  Maria-Esther Vidal,et al.  Towards an Integrated Graph Algebra for Graph Pattern Matching with Gremlin , 2017, DEXA.

[20]  Ruben Verborgh,et al.  Decentralised Authoring, Annotations and Notifications for a Read-Write Web with dokieli , 2017, ICWE.

[21]  Amir Aryani,et al.  Research Graph: Building a Distributed Graph of Scholarly Works using Research Data Switchboard , 2017 .

[22]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[23]  Tim DiLauro,et al.  The RMap Project: Capturing and Preserving Associations amongst Multi-Part Distributed Publications , 2015, JCDL.

[24]  Pietro Baroni,et al.  Automatic evaluation of design alternatives with quantitative argumentation , 2015, Argument Comput..

[25]  Michael Günther,et al.  Introducing Wikidata to the Linked Data Web , 2014, SEMWEB.

[26]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[27]  Lutz Bornmann,et al.  Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references , 2014, J. Assoc. Inf. Sci. Technol..

[28]  Karen R McElfresh,et al.  Development of the research lifecycle model for library services. , 2013, Journal of the Medical Library Association : JMLA.

[29]  Gerhard Weikum,et al.  YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract , 2013, IJCAI.

[30]  Carole A. Goble,et al.  Micropublications: a semantic model for claims, evidence, arguments and annotations in biomedical communications , 2013, Journal of Biomedical Semantics.

[31]  Christoph Lange,et al.  Ontologies and languages for representing mathematical knowledge on the Semantic Web , 2013, Semantic Web.

[32]  Jens Lehmann,et al.  LinkedGeoData: A core for a web of spatial open data , 2012, Semantic Web.

[33]  Robert B. Allen,et al.  Supporting Structured Browsing for Full-Text Scientific Research Reports , 2012, ArXiv.

[34]  Gerhard Weikum,et al.  Robust Disambiguation of Named Entities in Text , 2011, EMNLP.

[35]  Sean Bechhofer,et al.  Research Objects: Towards Exchange and Reuse of Digital Knowledge , 2010 .

[36]  Arif E. Jinha Article 50 million: an estimate of the number of scholarly articles in existence , 2010, Learn. Publ..

[37]  Paolo Ferragina,et al.  TAGME: on-the-fly annotation of short text fragments (by wikipedia entities) , 2010, CIKM.

[38]  Dejing Dou,et al.  Ontology-based information extraction: An introduction and a survey of current approaches , 2010, J. Inf. Sci..

[39]  H. Jansen,et al.  The Logic of Qualitative Survey Research and its Position in the Field of Social Research Methods , 2010 .

[40]  Boyan Brodaric,et al.  SKIing with DOLCE: toward an e-Science Knowledge Infrastructure , 2008, FOIS.

[41]  Leen Breure,et al.  Modeling Rhetoric in Scientific Publications , 2008 .

[42]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[43]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[44]  Siegfried Handschuh,et al.  SALT - Semantically Annotated LaTeX for scientific publications , 2007 .

[45]  J. Ioannidis,et al.  Why Most Published Research Findings Are False , 2005, PLoS medicine.

[46]  Barend Mons,et al.  Which gene did you mean? , 2005, BMC Bioinformatics.

[47]  Alia I. Abdelmoty,et al.  Building a Geographical Ontology for Intelligent Spatial Search on the Web , 2005, Databases and Applications.

[48]  Alexander Hars,et al.  Designing Scientific Knowledge Infrastructures: The Contribution of Epistemology , 2001, Inf. Syst. Frontiers.

[49]  Nikos I. Karacapilidis,et al.  The Zeno argumentation framework , 1997, ICAIL '97.

[50]  J. Armstrong,et al.  Peer review for journals: Evidence on quality control, fairness, and innovation , 1997 .

[51]  Vera G. Meister Towards a Knowledge Graph for a Research Group with Focus on Qualitative Analysis of Scholarly Papers , 2017, SemSci@ISWC.

[52]  Paolo Manghi,et al.  The Scholix Framework for Interoperability in Data-Literature Information Exchange , 2017, D Lib Mag..

[53]  Wolfram Wöß,et al.  Towards a Definition of Knowledge Graphs , 2016, SEMANTiCS.

[54]  Fabio Vitali,et al.  The Document Components Ontology (DoCO) , 2016, Semantic Web.

[55]  Paul Donohoe,et al.  The Long Road to JATS , 2015 .

[56]  Silvio Peroni,et al.  The Semantic Publishing and Referencing Ontologies , 2014 .

[57]  Hugo Fjelsted Alrøe,et al.  Second-Order Science of Interdisciplinary Research: A Polyocular Framework for Wicked Problems , 2014 .

[58]  Paul T. Groth,et al.  The anatomy of a nanopublication , 2010, Inf. Serv. Use.

[59]  Bo-Christer Björk,et al.  Scientific journal publishing: yearly volume and open access availability , 2009, Inf. Res..

[60]  Herbert Van de Sompel,et al.  All aboard: toward a machine-friendly scholarly communication system , 2009, The Fourth Paradigm.

[61]  Jenny Fry,et al.  Scholarly research and information practices: a domain analytic approach , 2006, Inf. Process. Manag..