Ontology-Based Workflow Generation for Intelligent Big Data Analytics

Big Data analytics provide support for decision making by discovering patterns and other useful information from large set of data. Organizations utilizing advanced analytics techniques to gain real value from Big Data will grow faster than their competitors and seize new opportunities. Cross-Industry Standard Process for Data Mining (CRISP-DM) is an industry-proven way to build predictive analytics models across the enterprise. However, the manual process in CRISP-DM hinders faster decision making on real-time application for efficient data analysis. In this paper, we present an approach to automate the process using Automatic Service Composition (ASC). Focusing on the planning stage of ASC, we propose an ontology-based workflow generation method to automate the CRISP-DM process. Ontology and rules are designed to infer workflow for data analytics process according to the properties of the datasets as well as user needs. Empirical study of our prototyping system has proved the efficiency of our workflow generation method.

[1]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[2]  Dana S. Nau,et al.  SHOP2: An HTN Planning System , 2003, J. Artif. Intell. Res..

[3]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[4]  Michael N. Huhns,et al.  A Scalable Architecture for Automatic Service Composition , 2014, IEEE Transactions on Services Computing.

[5]  Tom Fawcett,et al.  Data Science and its Relationship to Big Data and Data-Driven Decision Making , 2013, Big Data.

[6]  Ioan Salomie,et al.  Semantic Web Service Composition Method Based on Fluent Calculus , 2009, 2009 11th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing.

[7]  Ahmed K. Elmagarmid,et al.  Composing Web services on the Semantic Web , 2003, The VLDB Journal.

[8]  Patrick Martin,et al.  Assisting developers of Big Data Analytics Applications when deploying on Hadoop clouds , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[9]  T. Sasipraba,et al.  AN APPROACH FOR GRAPH BASED PLANNING AND QUALITY DRIVEN COMPOSITION OF WEB SERVICES , 2011 .

[10]  James A. Hendler,et al.  HTN planning for Web Service composition using SHOP2 , 2004, J. Web Semant..

[11]  Steffen Staab,et al.  The Ontology Inference Layer OIL , 2000 .