A decision support system for eco-efficient biorefinery process comparison using a semantic approach

We define a decision support system for eco-efficient biorefinery process comparison.Uncertainty is managed all along the pipeline for eco-design indicator computations.Data extraction from textual documents is guided by a Termino-ontological resource.Flexible structured database querying is guided by a Termino-ontological resource. Enzymatic hydrolysis of the main components of lignocellulosic biomass is one of the promising methods to further upgrading it into biofuels. Biomass pre-treatment is an essential step in order to reduce cellulose crystallinity, increase surface and porosity and separate the major constituents of biomass. Scientific literature in this domain is increasing fast and could be a valuable source of data. As these abundant scientific data are mostly in textual format and heterogeneously structured, using them to compute biomass pre-treatment efficiency is not straightforward. This paper presents the implementation of a Decision Support System (DSS) based on an original pipeline coupling knowledge engineering (KE) based on semantic web technologies, soft computing techniques and environmental factor computation. The DSS allows using data found in the literature to assess environmental sustainability of biorefinery systems. The pipeline permits to: (1) structure and integrate relevant experimental data, (2) assess data source reliability, (3) compute and visualize green indicators taking into account data imprecision and source reliability. This pipeline has been made possible thanks to innovative researches in the coupling of ontologies, uncertainty management and propagation. In this first version, data acquisition is done by experts and facilitated by a termino-ontological resource. Data source reliability assessment is based on domain knowledge and done by experts. The operational prototype has been used by field experts on a realistic use case (rice straw). The obtained results have validated the usefulness of the system. Further work will address the question of a higher automation level for data acquisition and data source reliability assessment.

[1]  Kristina Lerman,et al.  Semi-automatically Mapping Structured Sources into the Semantic Web , 2012, ESWC.

[2]  M. Delwiche,et al.  Methods for Pretreatment of Lignocellulosic Biomass for Efficient Hydrolysis and Biofuel Production , 2009 .

[3]  Shinji Fujimoto,et al.  Wet disk milling pretreatment without sulfuric acid for enzymatic hydrolysis of rice straw. , 2009, Bioresource technology.

[4]  Henrik Bindslev,et al.  Plasma-Assisted Pretreatment of Wheat Straw for Ethanol Production , 2011, Applied biochemistry and biotechnology.

[5]  Jun'ichi Tsujii,et al.  A Rich Feature Vector for Protein-Protein Interaction Extraction from Multiple Corpora , 2009, EMNLP.

[6]  Franjo Cecelja,et al.  Semantic approach for pre-assessment of environmental indicators in Industrial Symbiosis , 2015 .

[7]  Hajo Rijgersberg,et al.  How semantics can improve engineering processes: A case of units of measure and quantities , 2011, Adv. Eng. Informatics.

[8]  Barbara Rosario,et al.  Multi-way Relation Classification: Application to Protein-Protein Interactions , 2005, HLT.

[9]  Chul-Hwan Kim,et al.  Effect of torrefaction for the pretreatment of rice straw for ethanol production. , 2013, Journal of the science of food and agriculture.

[10]  Sébastien Destercke,et al.  A flexible bipolar querying approach with imprecise data and guaranteed results , 2011, Fuzzy Sets Syst..

[11]  Udo Hahn,et al.  Event Extraction from Trimmed Dependency Graphs , 2009, BioNLP@HLT-NAACL.

[12]  Jari Björne,et al.  Extracting Complex Biological Events with Rich Graph-Based Feature Sets , 2009, BioNLP@HLT-NAACL.

[13]  Alon Y. Halevy,et al.  Principles of Data Integration , 2012 .

[14]  J. Y. Zhu,et al.  Woody biomass pretreatment for cellulosic ethanol production: Technology and energy consumption evaluation , 2010 .

[15]  Tayebeh Behzad,et al.  Improvement of saccharification and ethanol production from rice straw by NMMO and [BMIM][OAc] pretreatments. , 2013 .

[16]  Franjo Cecelja,et al.  An ontological approach towards enabling processing technologies participation in industrial symbiosis , 2013, Comput. Chem. Eng..

[17]  Quang Le Minh,et al.  A Pattern Approach for Biomedical Event Annotation , 2011, Proceedings of BioNLP Shared Task 2011 Workshop.

[18]  Liliana Ibanescu,et al.  Fuzzy Web Data Tables Integration Guided by an Ontological and Terminological Resource , 2013, IEEE Transactions on Knowledge and Data Engineering.

[19]  Deyu Zhou,et al.  Biomedical Relation Extraction: From Binary to Complex , 2014, Comput. Math. Methods Medicine.

[20]  Jong-In Han,et al.  Effect of nitric acid on pretreatment and fermentation for enhancing ethanol production of rice straw. , 2014, Carbohydrate polymers.

[21]  Natalya F. Noy,et al.  Semantic integration: a survey of ontology-based approaches , 2004, SGMD.

[22]  Ziqi Zhang,et al.  Towards Efficient and Effective Semantic Table Interpretation , 2014, SEMWEB.

[23]  Barbara Rosario,et al.  Classifying Semantic Relations in Bioscience Texts , 2004, ACL.

[24]  Xiaoyan Zhu,et al.  Discovering Patterns to Extract Protein-Protein Interactions from Full Biomedical Texts , 2004, NLPBA/BioNLP.

[25]  Hamid Zilouei,et al.  Organosolv pretreatment of rice straw for efficient acetone, butanol, and ethanol production. , 2014, Bioresource technology.

[26]  Paul Buitelaar,et al.  LexInfo: A declarative model for the lexicon-ontology interface , 2011, J. Web Semant..

[27]  Daniel P. Miranker,et al.  QODI: Query as Context in Automatic Data Integration , 2013, International Semantic Web Conference.

[28]  Yvan Saeys,et al.  Analyzing text in search of bio-molecular events: a high-precision machine learning framework , 2009, BioNLP@HLT-NAACL.

[29]  Christophe Roche,et al.  Ontoterminology - A New Paradigm for Terminology , 2009, KEOD.

[30]  Kalpana Raja,et al.  PPInterFinder—a mining tool for extracting causal relations on human proteins from literature , 2013, Database J. Biol. Databases Curation.

[31]  Abdellatif Barakat,et al.  Innovative combined dry fractionation technologies for rice straw valorization to biofuels , 2015 .

[32]  Philipp Cimiano,et al.  Linking Lexical Resources and Ontologies on the Semantic Web with Lemon , 2011, ESWC.

[33]  Takashi Yanagida,et al.  Combination of hot compressed water treatment and wet disk milling for high sugar recovery yield in enzymatic hydrolysis of rice straw. , 2012, Bioresource technology.

[34]  Xiaoyan Zhu,et al.  Protein-protein interaction extraction from bio-literature with compact features and data sampling strategy , 2011, 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI).

[35]  Hao Yu,et al.  Discovering patterns to extract protein-protein interactions from the literature: Part II , 2005, Bioinform..

[36]  Sébastien Destercke,et al.  Evaluating Data Reliability: An Evidential Answer with Application to a Web-Enabled Data Warehouse , 2013, IEEE Transactions on Knowledge and Data Engineering.

[37]  Madalina Croitoru,et al.  A Decision Support System to design modified atmosphere packaging for fresh produce based on a bipolar flexible querying approach , 2015, Comput. Electron. Agric..

[38]  Jun'ichi Tsujii Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task , 2009 .

[39]  Anne-Lyse Minard,et al.  Multi-class SVM for Relation Extraction from Clinical Reports , 2011, RANLP.

[40]  Peter M. A. Sloot,et al.  Extracting Biological Events from Text Using Simple Syntactic Patterns , 2011, BioNLP@ACL.

[41]  Liliana Ibanescu,et al.  An Ontological and Terminological Resource for n-ary Relation Annotation in Web Data Tables , 2011, OTM Conferences.

[42]  Peter Murray-Rust,et al.  ChemicalTagger: A tool for semantic text-mining in chemistry , 2011, J. Cheminformatics.

[43]  G. Schoenau,et al.  Grinding performance and physical properties of non-treated and steam exploded barley, canola, oat and wheat straw , 2011 .

[44]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[45]  Steffen Staab,et al.  What Is an Ontology? , 2009, Handbook on Ontologies.

[46]  Abdellatif Barakat,et al.  Eco-friendly dry chemo-mechanical pretreatments of lignocellulosic biomass: Impact on energy and yield of the enzymatic hydrolysis , 2014 .