NMRexSeer: Metadata extraction and search for large scale Nuclear Magnetic Resonance (NMR) experimental data

Sciences have become both complex and demanding for cutting-edge technology and resources to perform experiments. Since 1997, the Environmental Molecular Sciences Laboratory (EMSL) has served as a user facility housing resources for global scientists to perform experiments necessary to their research. Overtime, the generated data has become both massive and redundant. To encourage better management and reuse of such experimental data, MyEMSL has emerged as an in-house centralized data management tool that collects and distributes data from the experiments at EMSL. Nuclear Magnetic Resonance Spectroscopy (NMR) is one of the major experiment resources that EMSL houses. We discuss NMRexSeer, a proposed digital library system that automatically extracts and indexes NMR specific metadata from NMR experimental data packages. The system also generates visualized previews and provides a search interface for easy access and discovery of desired data.

[1]  Prasenjit Mitra,et al.  An algorithm search engine for software developers , 2011, SUITE '11.

[2]  Tobias Schreck,et al.  Content-based layouts for exploratory metadata search in scientific research data , 2012, JCDL '12.

[3]  C. Lee Giles,et al.  Automatic Detection of Pseudocodes in Scholarly Documents Using Machine Learning , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[4]  C. Lee Giles,et al.  A generalized topic modeling approach for automatic document annotation , 2015, International Journal on Digital Libraries.

[5]  C. Lee Giles,et al.  Improving algorithm search using the algorithm co-citation network , 2012, JCDL '12.

[6]  Jeffery S. Horsburgh,et al.  ONEMercury: Towards Automatic Annotation of Environmental Science Metadata , 2012, LISC@ISWC.

[7]  C. Lee Giles,et al.  ChemXSeer: a digital library and data repository for chemical kinetics , 2007, CIMS '07.

[8]  Cornelia Caragea,et al.  PDFMEF: A Multi-Entity Knowledge Extraction Framework for Scholarly Documents and Semantic Search , 2015, K-CAP.

[9]  John A. Kunze,et al.  Dublin Core Metadata for Resource Discovery , 1998, RFC.

[10]  J. Keeler Understanding NMR Spectroscopy , 2005 .

[11]  C. Lee Giles,et al.  Automatic tag recommendation for metadata annotation using probabilistic topic modeling , 2013, JCDL '13.

[12]  Na Li,et al.  oreChem ChemXSeer: a semantic digital library for chemistry , 2010, JCDL '10.

[13]  C. Lee Giles,et al.  A classification scheme for algorithm citation function in scholarly works , 2013, JCDL '13.

[14]  Lior Rokach,et al.  A figure search engine architecture for a chemistry digital library , 2013, JCDL '13.

[15]  Ranjeet Devarakonda,et al.  Mercury- Distributed Metadata Management, Data Discovery and Access System , 2007 .

[16]  Kun Bai,et al.  TableSeer: automatic table metadata extraction and searching in digital libraries , 2007, JCDL '07.

[17]  C. Lee Giles,et al.  A hybrid approach to discover semantic hierarchical sections in scholarly documents , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[18]  C. Lee Giles,et al.  Building a Search Engine for Algorithms , 2014 .

[19]  Wenyi Huang,et al.  Towards building a scholarly big data platform: Challenges, lessons and opportunities , 2014, IEEE/ACM Joint Conference on Digital Libraries.

[20]  Oliver Hofmann,et al.  ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level , 2010, Bioinform..