Mars Target Encyclopedia: Rock and Soil Composition Extracted From the Literature

We have built an information extraction system, the Mars Target Encyclopedia, that takes planetary science publications as input and extracts scientific knowledge about the compositions of rock and soil targets. The extracted knowledge is stored in a searchable database that can help scientists compare new discoveries with what is already known much more quickly. To date, we have applied the system to ∼6000 documents and achieved 41–56% precision in the extracted information.
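To make the reported precision figure concrete, here is a minimal sketch of how precision is conventionally computed for extracted items: the fraction of extracted items that are correct. The function name, the (target, component) pair representation, and the sample values are illustrative assumptions, not the system's actual data model or evaluation code.

```python
def precision(extracted, gold):
    """Return the fraction of extracted items that also appear in the gold set."""
    extracted_set = set(extracted)
    if not extracted_set:
        # No extractions: report 0.0 here by convention (an assumption).
        return 0.0
    return len(extracted_set & set(gold)) / len(extracted_set)


# Hypothetical (target, component) pairs, invented purely for illustration.
extracted = [
    ("Target_A", "Fe"),
    ("Target_A", "Mg"),
    ("Target_B", "Si"),
    ("Target_C", "Ca"),
]
gold = [
    ("Target_A", "Fe"),
    ("Target_B", "Si"),
    ("Target_B", "Al"),
]

# Two of the four extracted pairs are in the gold set.
print(precision(extracted, gold))
```

Precision measures only how many extracted items are correct; it says nothing about correct items the system failed to extract, which recall would capture.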
