Authoring and Publishing of Units and Quantities in Semantic Documents

This paper shows how an explicit representation of units and quantities can improve the experience of semantically published documents, and provides a first authoring method in this respect. To exemplify the potential and practical advantages of encoding explicit semantics regarding units w.r.t. user experience, we demonstrate a unit system preference service, which enables the user to choose the system of units for the displayed paper. By semantically publishing units, we obtain a basis for a wide range of applications and services such as unknown unit lookup, unit and quantity semantic search and unit and quantity manipulation. Enabling semantic publishing for units is also presented in the context of a large collection of legacy scientific documents (the arXMliv corpus), where our approach allows to non-invasively enrich legacy publications.

[1]  Christoph Lange,et al.  The Planetary System: Web 3.0 & Active Documents for STEM , 2011, ICCS.

[2]  Joseph B. Collins OpenMath Content Dictionaries for SI Quantities and Units , 2009, Calculemus/MKM.

[3]  Michael Kohlhase,et al.  Using L A T E X as a Semantic Markup Format , 2008 .

[4]  Robert G. Raskin,et al.  Knowledge representation in the semantic web for Earth and environmental terminology (SWEET) , 2005, Comput. Geosci..

[5]  Marcel Heldoorn,et al.  The SIunits package∗ Consistent application of SI units , 2007 .

[6]  Christoph Lange,et al.  Ontologies and languages for representing mathematical knowledge on the Semantic Web , 2013, Semantic Web.

[7]  James H. Davenport,et al.  Unit Knowledge Management , 2008, AISC/MKM/Calculemus.

[8]  Ondřej Klobušník,et al.  ArXiv.org e-print archive , 2004 .

[9]  Christoph Lange,et al.  Publishing Math Lecture Notes as Linked Data , 2010, ESWC.

[10]  J. Stratford,et al.  Creating an Extensible Unit Converter Using OpenMath as the Representation of the Semantics of the Units , 2008 .

[11]  Christoph Lange,et al.  Integrating Web Services into Active Mathematical Documents , 2009, Calculemus/MKM.

[12]  Stephen M. Watt,et al.  Mathematical Markup Language (MathML) Version 3.0 , 2001, WWW 2001.

[13]  Michael Kohlhase,et al.  MathWebSearch 0 . 4 A Semantic Search Engine for Mathematics , 2008 .

[14]  R. Burchfield Oxford English dictionary , 1982 .

[15]  Michael Kohlhase,et al.  An Architecture for Linguistic and Semantic Analysis on the arXMLiv Corpus , 2009, GI Jahrestagung.

[16]  Bruce R. Miller,et al.  Transforming Large Collections of Scientific Publications to XML , 2010, Math. Comput. Sci..

[17]  Michael J. Barany,et al.  '[B]ut this is blog maths and we're free to make up conventions as we go along': Polymath1 and the modalities of 'massively collaborative mathematics' , 2010, Int. Sym. Wikis.

[18]  Michael Kohlhase,et al.  Semantics of OpenMath and MathML3 , 2012, Math. Comput. Sci..

[19]  Christoph Lange,et al.  Semantics of Governmental Statistics Data , 2010 .

[20]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[21]  Oscar Naim,et al.  Word add-in for ontology recognition: semantic enrichment of scientific literature , 2010, BMC Bioinformatics.

[22]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[23]  Michael Kohlhase,et al.  Using as a Semantic Markup Format , 2008, Math. Comput. Sci..

[24]  Siegfried Handschuh,et al.  SALT - Semantically Annotated LaTeX for scientific publications , 2007 .