Ontology-based data mining model management for self-service knowledge discovery

Data mining (DM) models are knowledge-intensive information products that enable knowledge creation and discovery. As large volume of data is generated with high velocity from a variety of sources, there is a pressing need to place DM model selection and self-service knowledge discovery in the hands of the business users. However, existing knowledge discovery and data mining (KDDM) approaches do not sufficiently address key elements of data mining model management (DMMM) such as model sharing, selection and reuse. Furthermore, they are mainly from a knowledge engineer’s perspective, while the business requirements from business users are often lost. To bridge these semantic gaps, we propose an ontology-based DMMM approach for self-service model selection and knowledge discovery. We develop a DM3 ontology to translate the business requirements into model selection criteria and measurements, provide a detailed deployment architecture for its integration within an organization’s KDDM application, and use the example of a student loan company to demonstrate the utility of the DM3.

[1]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[2]  Kweku-Muata Osei-Bryson,et al.  Evaluation of decision trees: a multi-criteria approach , 2004, Comput. Oper. Res..

[3]  Neal Leavitt,et al.  Data Mining for the Corporate Masses? , 2002, Computer.

[4]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[5]  Michael Uschold,et al.  Ontologies: principles, methods and applications , 1996, The Knowledge Engineering Review.

[6]  Rini van Solingen,et al.  Goal Question Metric (GQM) Approach , 2002 .

[7]  Vicent J. Botti,et al.  An ontological-based knowledge-representation formalism for case-based argumentation , 2015, Inf. Syst. Frontiers.

[8]  Yuh-Jen Chen,et al.  Development of a method for ontology-based empirical knowledge representation and reasoning , 2010, Decis. Support Syst..

[9]  Bing Liu,et al.  Managing large collections of data mining models , 2008, CACM.

[10]  Silvio Peroni,et al.  FaBiO and CiTO: Ontologies for describing bibliographic resources and citations , 2012, J. Web Semant..

[11]  Philip A. Bernstein,et al.  Model management 2.0: manipulating richer mappings , 2007, SIGMOD '07.

[12]  Alan R. Hevner,et al.  Design Science in Information Systems Research , 2004, MIS Q..

[13]  David G. Schwartz,et al.  From Open IS Semantics to the Semantic Web: The Road Ahead , 2003, IEEE Intell. Syst..

[14]  Matt-Mouley Bouamrane,et al.  Development of an ontology for a preoperative risk assessment clinical decision support system , 2009, 2009 22nd IEEE International Symposium on Computer-Based Medical Systems.

[15]  Michael Gruninger,et al.  Methodology for the Design and Evaluation of Ontologies , 1995, IJCAI 1995.

[16]  Schubert Foo,et al.  Ontology research and development. Part 1 - a review of ontology generation , 2002, J. Inf. Sci..

[17]  Gregory Piatetsky-Shapiro,et al.  The KDD process for extracting useful knowledge from volumes of data , 1996, CACM.

[18]  Sylvain Delisle,et al.  Bridging the gap between data mining and decision support: A case-based reasoning and ontology approach , 2008, Intell. Data Anal..

[19]  Dorothy E. Leidner,et al.  Review: Knowledge Management and Knowledge Management Systems: Conceptual Foundations and Research Issues , 2001, MIS Q..

[20]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[21]  Saso Dzeroski,et al.  OntoDM: An Ontology of Data Mining , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[22]  Jaroslaw A. Chudziak,et al.  Ontological Learning Assistant for Knowledge Discovery and Data Mining , 2009, 2009 International Multiconference on Computer Science and Information Technology.

[23]  Tania Tudorache,et al.  Web-Protege: A Lightweight OWL Ontology Editor for the Web , 2008, OWLED.

[24]  James A. Thom,et al.  Requirements-oriented methodology for evaluating ontologies , 2009, Inf. Syst..

[25]  James D. McKeen,et al.  Knowledge management and organizational performance: an exploratory analysis , 2009, J. Knowl. Manag..

[26]  Claudia Diamantini,et al.  A virtual mart for knowledge discovery in databases , 2013, Inf. Syst. Frontiers.

[27]  Gonzalo Mariscal,et al.  A survey of data mining and knowledge discovery process models and methodologies , 2010, The Knowledge Engineering Review.

[28]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[29]  Michael J. A. Berry,et al.  Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management , 2004 .

[30]  Stuart Zweben,et al.  Development and application of a white box approach to integration testing , 1984, J. Syst. Softw..

[31]  Asunción Gómez-Pérez,et al.  Building a chemical ontology using Methontology and the Ontology Design Environment , 1999, IEEE Intell. Syst..

[32]  T. Davenport Competing on analytics. , 2006, Harvard business review.

[33]  Lily Sun,et al.  An ontological modelling of user requirements for personalised information provision , 2010, Inf. Syst. Frontiers.

[34]  Aldo Gangemi,et al.  Modelling Ontology Evaluation and Validation , 2006, ESWC.

[35]  Ralph Kimball,et al.  The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling , 1996 .

[36]  Marta E. Zorrilla,et al.  A service oriented architecture to provide data mining services for non-expert data miners , 2013, Decis. Support Syst..

[37]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[38]  Rohanizadeh Seyyed Soroush,et al.  A Proposed Data Mining Methodology and its Application to Industrial Procedures , 2009 .

[39]  Ricardo Vilalta,et al.  A Perspective View and Survey of Meta-Learning , 2002, Artificial Intelligence Review.

[40]  Vladan Devedzic,et al.  Understanding ontological engineering , 2002, CACM.

[41]  Andrés Montoyo,et al.  Applying model-driven engineering to the development of Rich Internet Applications for Business Intelligence , 2013, Inf. Syst. Frontiers.

[42]  Kweku-Muata Osei-Bryson,et al.  Evaluation of an integrated Knowledge Discovery and Data Mining process model , 2012, Expert Syst. Appl..

[43]  Ernestina Menasalvas Ruiz,et al.  An Engineering Approach to Data Mining Projects , 2007, IDEAL.

[44]  Roger Alan Pick,et al.  Meta-modeling concepts and tools for model management: a systems approach , 1994 .