Open Data Platform for Knowledge Access in Plant Health Domain : VESPA Mining

Important data are locked in ancient literature. It would be uneconomic to produce these data again and today or to extract them without the help of text mining technologies. Vespa is a text mining project whose aim is to extract data on pest and crops interactions, to model and predict attacks on crops, and to reduce the use of pesticides. A few attempts proposed an agricultural information access. Another originality of our work is to parse documents with a dependency of the document architecture.

[1]  B. Carpenter,et al.  LingPipe for 99.99% Recall of Gene Mentions , 2007 .

[2]  Pertti Vakkari How specific thesauri and a general thesaurus cover lay persons’ vocabularies concerning health, nutrition and social services , 2011 .

[3]  Tomaz Bartol,et al.  Assessment of Food and Nutrition Related Descriptors in Agricultural and Biomedical Thesauri , 2009, MTSR.

[4]  Yi Zhang,et al.  Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies , 2014, LREC.

[5]  Ian H. Witten,et al.  Mining Domain-Specific Thesauri from Wikipedia: A Case Study , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[6]  Ellen Riloff,et al.  An Empirical Study of Automated Dictionary Construction for Information Extraction in Three Domains , 1996, Artif. Intell..

[7]  Isabelle Mougenot,et al.  ThesauForm - Traits: A web based collaborative tool to develop a thesaurus for plant functional diversity research , 2012, Ecol. Informatics.

[8]  Nicolas Turenne,et al.  BELUGA : un outil pour l'analyse dynamique des connaissances de la littérature scientifique d'un domaine - Première application au cas des maladies à prions , 2004, EGC.

[9]  Nicolas Turenne,et al.  x.ent: R Package for Entities and Relations Extraction based on Unsupervised Learning and Document Structure , 2015, ArXiv.

[10]  Dan Roth,et al.  Probabilistic Reasoning for Entity & Relation Recognition , 2002, COLING.

[11]  Mihai Surdeanu,et al.  Customizing an Information Extraction System to a New Domain , 2011, RELMS@ACL.