Exploiting Open Data to Improve the Business Intelligence & Business Discovery Experience

The extent to which data mining tools are able to make efficient use of an open data oriented strategy in a smart city is limited. In a sense that it is not fully automated, incompatible or has to be supervised. These sets of tools may offer the possibility to import a dataset in a certain predefined standardized format, still, they do not make it a part of their workflow and algorithms in a fully unsupervised manner (i.e without ongoing human guidance). In a departure from previous research works, in this paper, we present a middleware architecture that exploits open data as background knowledge by acting as a bridge between data mining tools and open data resources.

[1]  Alberto Anguita,et al.  OntoDataClean: Ontology-Based Integration and Preprocessing of Distributed Data , 2006, ISBMDA.

[2]  Saso Dzeroski,et al.  OntoDM-KDD: Ontology for Representing the Knowledge Discovery Process , 2013, Discovery Science.

[3]  Enrico Motta,et al.  Walking Linked Data: a Graph Traversal Approach to Explain Clusters , 2014, COLD.

[4]  Daniel Xodo,et al.  Data Mining With Ontologies: Implementations, Findings and Frameworks , 2007 .

[5]  Martin Hepp,et al.  Using Semantic Web Technologies for Data Quality Management , 2013, Handbook of Data Quality.

[6]  Howard J. Hamilton,et al.  An Ontology-Based Approach to Data Cleaning , 2005 .

[7]  Yongheng Wang,et al.  Outlier detection from massive short documents using domain ontology , 2010, 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[8]  Saso Dzeroski,et al.  OntoDM: An Ontology of Data Mining , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[9]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[10]  Saso Dzeroski,et al.  Ontology of core data mining entities , 2014, Data Mining and Knowledge Discovery.

[11]  Heiko Paulheim,et al.  Semantic Web in data mining and knowledge discovery: A comprehensive survey , 2016, J. Web Semant..

[12]  Abraham Bernstein,et al.  A survey of intelligent assistants for data analysis , 2013, CSUR.

[13]  Sung-Hyon Myaeng,et al.  Feature Selection Using a Semantic Hierarchy for Event Recognition and Type Classification , 2013, IJCNLP.

[14]  Ryutaro Ichise,et al.  LiDDM: A Data Mining System for Linked Data , 2011, LDOW.

[15]  Saso Dzeroski,et al.  Representing Entities in the OntoDM Data Mining Ontology , 2010, Inductive Databases and Constraint-Based Data Mining.

[16]  Martin Hepp,et al.  Using SPARQL and SPIN for Data Quality Management on the Semantic Web , 2010, BIS.

[17]  Hussein A. Abbass,et al.  A Comparative Study for Domain Ontology Guided Feature Extraction , 2003, ACSC.

[18]  Martin Hepp,et al.  Swiqa - a semantic web information quality assessment framework , 2011, ECIS.

[19]  Ye Ye,et al.  Domain ontology-based feature reduction for high dimensional drug data and its application to 30-day heart failure readmission prediction , 2013, 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing.

[20]  Aba-Sah Dadzie,et al.  Approaches to visualising Linked Data: A survey , 2011, Semantic Web.

[21]  Melanie Hilario,et al.  Ontology-Based Meta-Mining of Knowledge Discovery Workflows , 2011, Meta-Learning in Computational Intelligence.

[22]  Heiko Paulheim,et al.  Automated Feature Generation from Structured Knowledge , 2011 .

[23]  Steffen Staab,et al.  What Is an Ontology? , 2009, Handbook on Ontologies.

[24]  Nada Lavrac,et al.  Automating Knowledge Discovery Workflow Composition Through Ontology-Based Planning , 2011, IEEE Transactions on Automation Science and Engineering.

[25]  Abraham Bernstein,et al.  Towards cooperative planning of data mining workflows , 2009 .

[26]  Heiko Paulheim Exploiting Linked Open Data as Background Knowledge in Data Mining , 2013, DMoLD.

[27]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[28]  M. Hilario,et al.  A Data Mining Ontology for Algorithm Selection and Meta-Mining , 2009 .

[29]  Heiko Paulheim,et al.  Feature Selection in Hierarchical Feature Spaces , 2014, Discovery Science.

[30]  Martin Hepp,et al.  Using Semantic Web Resources for Data Quality Management , 2010, EKAW.

[31]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[32]  Enrico Motta,et al.  Dedalo: Looking for Clusters Explanations in a Labyrinth of Linked Data , 2014, ESWC.

[33]  Zoubida Kedad,et al.  Ontology-Based Data Cleaning , 2002, NLDB.