On the communication of scientific data: The Full-Metadata Format

In this paper, we introduce a scientific format for text-based data files, which facilitates storing and communicating tabular data sets. The so-called Full-Metadata Format builds on the widely used INI-standard and is based on four principles: readable self-documentation, flexible structure, fail-safe compatibility, and searchability. As a consequence, all metadata required to interpret the tabular data are stored in the same file, allowing for the automated generation of publication-ready tables and graphs and the semantic searchability of data file collections. The Full-Metadata Format is introduced on the basis of three comprehensive examples. The complete format and syntax are given in the appendix.

[1]  Andreas W. Liehr,et al.  High throughput testing platform for organic Solar Cells , 2008 .

[2]  James Campbell,et al.  Big Opportunities in Access to "Small Science" Data , 2007, Data Sci. J..

[3]  Andreas W. Liehr,et al.  Improving the Traditional Information Management in Natural Sciences , 2009, Data Sci. J..

[4]  Terence Parr The Definitive ANTLR Reference: Building Domain-Specific Languages , 2007 .

[5]  B. Taylor,et al.  CODATA recommended values of the fundamental physical constants: 2006 | NIST , 2007, 0801.0028.

[6]  Alexander S. Szalay,et al.  VOTable Format Definition Version 1.1 , 2004 .

[7]  Edward J. Shaya,et al.  Specifics on a XML Data Format for Scientific Data , 2001 .

[8]  H. SHULL,et al.  Atomic Units , 1959, Nature.

[9]  Gang Li,et al.  Accurate Measurement and Characterization of Organic Solar Cells , 2006 .

[10]  Paul F. Uhlir Open Data for Global Science: A Review of Recent Developments in National and International Scientific Data Policies and Related Proposals , 2007, Data Sci. J..

[11]  F. Rademakers,et al.  ROOT — An object oriented data analysis framework , 1997 .

[12]  Andreas W. Liehr,et al.  Re-oxygenation of haemoglobin in livores after post-mortem exposure to a cold environment , 2008, International Journal of Legal Medicine.

[13]  Nico Bruns,et al.  Amphiphilic conetworks as activating carriers for the enhancement of enzymatic activity in supercritical CO2 , 2008, Biotechnology and bioengineering.

[14]  Kyle Cranmer,et al.  Explicit state representation and the ATLAS event data model: Theory and practice , 2008 .

[15]  Caroline van Wijk,et al.  Evaluating File Formats for Long-term Preservation , 2008, iPRES.

[16]  Jens Klump,et al.  Data publication in the open access initiative , 2006, Data Sci. J..

[17]  Moritz Riede,et al.  Identification and Analysis of Key Parameters in Organic Solar Cells , 2006 .

[18]  Andreas W. Liehr,et al.  Datamining and analysis of the key parameters in organic solar cells , 2006, SPIE Photonics Europe.

[19]  K Zimmermann,et al.  Pyphant - A Python Framework for Modelling Reusable Information Processing Tasks , 2007 .

[20]  Francois Yergeau UTF-8, a transformation format of ISO 10646 , 1998, RFC.

[21]  Competing Cations U ser's Guide and Reference Manual , 2007 .

[22]  Luca Callegaro CODATA recommended values , 2012 .

[23]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .