Power to the People! - Meta-Algorithmic Modelling in Applied Data Science

This position paper first defines the research field of applied data science at the intersection of domain expertise, data mining, and engineering capabilities, with particular attention to analytical applications. We then propose a meta-algorithmic approach for applied data science with societal impact based on activity recipes. Our people-centred motto from an applied data science perspective translates to design science research which focuses on empowering domain experts to sensibly apply data mining techniques through prototypical software implementations supported by meta-algorithmic recipes.

[1]  Inge van de Weerd,et al.  Understanding users' behavior with software operation data mining , 2014, Comput. Hum. Behav..

[2]  Niklas Lavesson,et al.  User-oriented Assessment of Classification Model Understandability , 2011, SCAI.

[3]  Sjaak Brinkkemper,et al.  Method engineering: engineering of information systems development methods and tools , 1996, Inf. Softw. Technol..

[4]  Illhoi Yoo,et al.  Data Mining in Healthcare and Biomedicine: A Survey of the Literature , 2012, Journal of Medical Systems.

[5]  Martin Bichler,et al.  Design science in information systems research , 2006, Wirtschaftsinf..

[6]  David L. Olson,et al.  Comparative analysis of data mining methods for bankruptcy prediction , 2012, Decis. Support Syst..

[7]  Geza Joos,et al.  On the Accuracy Versus Transparency Trade-Off of Data-Mining Models for Fast-Response PMU-Based Catastrophe Predictors , 2012, IEEE Transactions on Smart Grid.

[8]  Marco R. Spruit,et al.  Effective and Efficient Classification of Topically-Enriched Domain-Specific Text Snippets: The TETSC Method , 2015, Int. J. Strateg. Decis. Sci..

[9]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[10]  Johannes Gehrke,et al.  Accurate intelligible models with pairwise interactions , 2013, KDD.

[11]  Marco R. Spruit,et al.  Transitioning to a Data Driven Mental Health Practice: Collaborative Expert Sessions for Knowledge and Hypothesis Finding , 2016, Comput. Math. Methods Medicine.

[12]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[13]  David B. Fogel,et al.  Techniques for Extracting Classification and Regression Rules from Artificial Neural Networks , 2003 .

[14]  Huan Liu,et al.  Feature selection for classification: A review , 2014 .

[15]  Paulo Cortez,et al.  Using sensitivity analysis and visualization techniques to open black box data mining models , 2013, Inf. Sci..

[16]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[17]  Lars Niklasson,et al.  Accuracy vs. comprehensibility in data mining models , 2004 .

[18]  Marco R. Spruit,et al.  The Performance Mining Method: Extracting Performance Knowledge from Software Operation Data , 2015, Int. J. Bus. Intell. Res..

[19]  Marco R. Spruit,et al.  Historical Data Analysis through Data Mining From an Outsourcing Perspective: The Three-Phases Model , 2010, Int. J. Bus. Intell. Res..

[20]  Kevin Leyton-Brown,et al.  Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms , 2012, KDD.

[21]  Steven J. Simske Meta-Algorithmics: Patterns for Robust, Low Cost, High Quality Systems , 2013 .