Strategy for Extensible, Evolving Terminology for the Materials Genome Initiative Efforts

Intuitive, flexible, and evolving terminology plays a significant role in capitalizing on recommended knowledge representation models for materials engineering applications. In this article, we propose a rules-based approach and illustrate it with initial examples from a growing corpus of materials terms in the National Institute of Standards and Technology (NIST) Materials Data Repository. The method aims to establish a common, consistent, and evolving set of rules for creating or extending terminology as needed to describe materials data. The rules are intended to be simple and generalizable, so that users can understand and extend them and groups can apply them to their own repositories. The rules generate terms that facilitate machine processing and decision making.
