Introduction to Linked Data and Its Lifecycle on the Web

With Linked Data, a very pragmatic approach towards achieving the vision of the Semantic Web has gained traction in recent years. The term Linked Data refers to a set of best practices for publishing and interlinking structured data on the Web. While many standards, methods and technologies developed within the Semantic Web community are applicable to Linked Data, there are also a number of specific characteristics of Linked Data which have to be considered. In this article we introduce the main concepts of Linked Data. We present an overview of the Linked Data lifecycle and discuss individual approaches as well as the state of the art with regard to extraction, authoring, linking, enrichment and quality of Linked Data. We conclude with a discussion of issues, limitations and further research and development challenges of Linked Data. This article is an updated version of a similar lecture given at Reasoning Web Summer School 2011.
