Historical Data Analysis through Data Mining From an Outsourcing Perspective: The Three-Phases Model

The process of historical data analysis through data mining has proven valuable for the industrial environment. There are many models available that describe the in-house process of data mining. However, many companies either do not have in-house skills or do not wish to invest in performing in-house data mining. This paper investigates the applicability of two well-established data mining process models in an outsourcing context. The authors observe that both models cannot properly accommodate several key aspects in this context; therefore, this paper proposes the Three-phases method, which consists of data retrieval, data mining and results implementation within an organization. Each element is presented as a visual method fragment, and the model is validated through expert interviews and an extensive case study at a large Dutch staffing company. Both validation techniques substantiate the authors’ claim that the Three-phases model accurately describes the data mining process from an outsourcing perspective.

[1]  Peter Kawalek,et al.  Goal-based business process models: creation and evaluation , 1997, Bus. Process. Manag. J..

[2]  Ian Witten,et al.  Data Mining , 2000 .

[3]  Zhexue Huang,et al.  CLUSTERING LARGE DATA SETS WITH MIXED NUMERIC AND CATEGORICAL VALUES , 1997 .

[4]  M. J. Earl,et al.  The Risks of Outsourcing IT , 1996 .

[5]  Jude W. Shavlik,et al.  Using neural networks for data mining , 1997, Future Gener. Comput. Syst..

[6]  Philip S. Yu,et al.  Domain Driven Data Mining , 2015 .

[7]  Erhard Rahm,et al.  Data Cleaning: Problems and Current Approaches , 2000, IEEE Data Eng. Bull..

[8]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[9]  H. van Driel Oude nieuwe media , 1999 .

[10]  Wil M. P. van der Aalst,et al.  Process mining: a research agenda , 2004, Comput. Ind..

[11]  Barbara von Halle,et al.  Business Rules Applied: Building Better Systems Using the Business Rules Approach , 2001 .

[12]  N. Venkatraman,et al.  An Empirical Study of Information Technology Outsourcing: Benefits, Risks, and Performance Implications , 1995, ICIS.

[13]  C. Apte,et al.  Data mining with decision trees and decision rules , 1997, Future Gener. Comput. Syst..

[14]  Sjaak Brinkkemper,et al.  Situational method engineering for informational system project approaches , 1994, Methods and Associated Tools for the Information Systems Life Cycle.

[15]  W. H. Inmon,et al.  The data warehouse and data mining , 1996, CACM.

[16]  Sjaak Brinkkemper,et al.  Method engineering: engineering of information systems development methods and tools , 1996, Inf. Softw. Technol..

[17]  Michael J. Shaw,et al.  Knowledge management and data mining for marketing , 2001, Decis. Support Syst..

[18]  W. Wilhelm,et al.  Strategic, tactical and operational decisions in multi-national logistics networks: A review and discussion of modelling issues , 2000 .

[19]  Ivo Düntsch,et al.  Uncertainty Measures of Rough Set Prediction , 1998, Artif. Intell..

[20]  Ramakrishnan Srikant,et al.  Mining Association Rules with Item Constraints , 1997, KDD.

[21]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[22]  Anne C. Rouse,et al.  Revisiting IT outsourcing risks: Analysis of a survey of Australia’s Top 1000 organizations , 2003 .

[23]  Hongjun Lu,et al.  Cleansing Data for Mining and Warehousing , 1999, DEXA.