Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles

Purpose. This paper introduces the Research Articles in Simplified HTML (or RASH), which is aWeb-first format for writingHTML-based scholarly papers; it is accompanied by the RASH Framework, a set of tools for interacting with RASH-based articles. The paper also presents an evaluation that involved authors and reviewers of RASH articles submitted to the SAVE-SD 2015 and SAVE-SD 2016 workshops. Design. RASH has been developed aiming to: be easy to learn and use; share scholarly documents (and embedded semantic annotations) through the Web; support its adoption within the existing publishing workflow. Findings. The evaluation study confirmed that RASH is ready to be adopted in workshops, conferences, and journals and can be quickly learnt by researchers who are familiar with HTML. Research Limitations. The evaluation study also highlighted some issues in the adoption of RASH, and in general of HTML formats, especially by less technically savvy users. Moreover, additional tools are needed, e.g., for enabling additional conversions from/to existing formats such as OpenXML. Practical Implications. RASH (and its Framework) is another step towards enabling the definition of formal representations of the meaning of the content of an article, facilitating its automatic discovery, enabling its linking to semantically related articles, providing access to data within the article in actionable form, and allowing integration of data between papers. Social Implications. RASH addresses the intrinsic needs related to the various users of a scholarly article: researchers (focussing on its content), readers (experiencing new ways for browsing it), citizen scientists (reusing available data formally defined within it through semantic annotations), publishers (using the advantages of new technologies as envisioned by the Semantic Publishing movement). Value. RASH helps authors to focus on the organisation of their texts, supports them in the task of semantically enriching the content of articles, and leaves all the issues about validation, visualisation, conversion, and semantic data extraction to the various tools developed within its Framework. How to cite this article Peroni et al. (2017), Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles. PeerJ Comput. Sci. 3:e132; DOI 10.7717/peerj-cs.132 Subjects Digital Libraries, World Wide Web and Web Science

[1]  Peroni Silvio,et al.  Outcomes of SAVE-SD 2015 and 2016 questionnaires on RASH and analysis of RDF annotations in the RASH papers. , 2016 .

[2]  S. Pemberton,et al.  Accessible Rich Internet Applications (WAI-ARIA) 1.0 , 2014 .

[3]  Silvio Peroni,et al.  The Semantic Publishing and Referencing Ontologies , 2014 .

[4]  Angelo Di Iorio,et al.  The RASH JavaScript Editor (RAJE): A Wordprocessor for Writing Web-first Scholarly Articles , 2017, DocEng.

[5]  Angelo Di Iorio,et al.  Recognising document components in XML-based academic articles , 2013, ACM Symposium on Document Engineering.

[6]  Steve Pettifer,et al.  Ceci n'est pas un hamburger: modelling and representing the scholarly article , 2011, Learn. Publ..

[7]  Angelo Di Iorio,et al.  It ROCS!: The RASH Online Conversion Service , 2016, WWW.

[8]  Fabien L. Gandon,et al.  RDF 1.1 XML Syntax , 2014 .

[9]  Silvio Peroni,et al.  FaBiO and CiTO: Ontologies for describing bibliographic resources and citations , 2012, J. Web Semant..

[10]  Silvio Peroni Semantic Web Technologies and Legal Scholarly Publishing , 2014 .

[11]  C. M. Sperberg-McQueen,et al.  W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures , 2012 .

[12]  Fabio Vitali,et al.  Towards accessible graphs in HTML-based scientific articles , 2017, 2017 14th IEEE Annual Consumer Communications & Networking Conference (CCNC).

[13]  Angelo Di Iorio,et al.  A first approach to the automatic recognition of structural patterns in XML documents , 2012, DocEng '12.

[14]  Angelo Di Iorio,et al.  Dealing with structural patterns of XML documents , 2014, J. Assoc. Inf. Sci. Technol..

[15]  Christopher Alexander,et al.  The Timeless Way of Building , 1979 .

[16]  Fabio Vitali,et al.  The Document Components Ontology (DoCO) , 2016, Semantic Web.

[17]  David M. Shotton,et al.  Adventures in Semantic Publishing: Exemplar Semantic Enhancements of a Research Article , 2009, PLoS Comput. Biol..