Closing the Gap Between Experts and Novices Using Analytics-as-a-Service: An Experimental Study

Generating insights and value from data has become an important asset for organizations. At the same time, the need for experts in analytics is increasing and the number of analytics applications is growing. Recently, a new trend has emerged, i.e. analytics-as-a-service platforms, that makes it easier to apply analytics both for novice and expert users. In this study, the authors approach these new services by conducting a full-factorial experiment where both inexperienced and experienced users take on an analytics task with an analytics-as-a-service technology. The research proves that although experts in analytics still significantly outperform novices, these web-based platforms do offer an advantage to inexperienced users. Furthermore, the authors find that analytics-as-a-service does not offer the same benefits across different analytics tasks. That is, they observe better performance for supervised analytics tasks. Moreover, this study indicates that there are significant differences between novices. The most important distinction lies in the approach they take on the task. Novices who follow a more complex, although structured, workflow behave more similarly to experts and, thus, also perform better. The findings can aid managers in their hiring and training strategy with regards to both business users and data scientists. Moreover, it can guide managers in the development of an enterprise-wide analytics culture. Finally, the results can inform vendors about the design and development of these platforms.

[1]  Bart Baesens,et al.  Defining analytics maturity indicators: A survey approach , 2017, Int. J. Inf. Manag..

[2]  Pei-Yu Sharon Chen,et al.  The Impact and Implications of On-Demand Services on Market Structure , 2013, Inf. Syst. Res..

[3]  T. Davenport big data @ work , 2014 .

[4]  Dursun Delen,et al.  Leveraging the capabilities of service-oriented decision support systems: Putting analytics and big data in cloud , 2013, Decis. Support Syst..

[5]  Bart Baesens,et al.  API for prediction and machine learning: poll results and analysis , 2015 .

[6]  Guillem Pratx,et al.  Cloud computing for big data , 2019, Big Data in Radiation Oncology.

[7]  Marta E. Zorrilla,et al.  A service oriented architecture to provide data mining services for non-expert data miners , 2013, Decis. Support Syst..

[8]  Véronique Van Vlasselaer,et al.  Determining the use of data quality metadata (DQM) for decision making purposes and its impact on decision outcomes - An exploratory study , 2016, Decis. Support Syst..

[9]  Nikolay Borissov,et al.  Cloud Computing – A Classification, Business Models, and Research Directions , 2009, Bus. Inf. Syst. Eng..

[10]  Michael A. Fligner,et al.  Distribution-Free Two-Sample Tests for Scale , 1976 .

[11]  Pablo Montero,et al.  TSclust: An R Package for Time Series Clustering , 2014 .

[12]  H. Levene Robust tests for equality of variances , 1961 .

[13]  Terrence August,et al.  Cloud Implications on Software Network Structure and Security Risks , 2014, Inf. Syst. Res..

[14]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[15]  Detmar W. Straub,et al.  Validating Instruments in MIS Research , 1989, MIS Q..

[16]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[17]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[18]  M. E. Johnson,et al.  A Comparative Study of Tests for Homogeneity of Variances, with Applications to the Outer Continental Shelf Bidding Data , 1981 .

[19]  Arumugam Seetharaman,et al.  The usage and adoption of cloud computing by small and medium businesses , 2013, Int. J. Inf. Manag..

[20]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[21]  J. Gower A General Coefficient of Similarity and Some of Its Properties , 1971 .

[22]  Martin Gilje Jaatun,et al.  Enhancing accountability in the cloud , 2016, Int. J. Inf. Manag..

[23]  Subhajyoti Bandyopadhyay,et al.  Cloud computing - The business perspective , 2011, Decis. Support Syst..

[24]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[25]  J. J. Higgins,et al.  The aligned rank transform for nonparametric factorial analyses using only anova procedures , 2011, CHI.

[26]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[27]  Detmar W. Straub,et al.  Validation in Information Systems Research: A State-of-the-Art Assessment , 2001, MIS Q..

[28]  Jan vom Brocke,et al.  Comparing Business Intelligence and Big Data Skills , 2014, Business & Information Systems Engineering.

[29]  Sam Ransbotham,et al.  Beyond the hype: The hard work behind analytics success , 2016 .

[30]  Anil K. Bera,et al.  A test for normality of observations and regression residuals , 1987 .

[31]  Murray Campbell,et al.  Analytics Ecosystem Transformation: A Force for Business Model Innovation , 2011, 2011 Annual SRII Global Conference.

[32]  D. Darling,et al.  A Test of Goodness of Fit , 1954 .

[33]  Paul Alpar,et al.  Self-Service Business Intelligence , 2016, Bus. Inf. Syst. Eng..

[34]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[35]  Jeanne G. Harris,et al.  Competing on Analytics: The New Science of Winning , 2007 .

[36]  Hui Xiong,et al.  Understanding and Enhancement of Internal Clustering Validation Measures , 2013, IEEE Transactions on Cybernetics.

[37]  Neal Leavitt Bringing big analytics to the masses , 2013, Computer.

[38]  Thomas H. Davenport,et al.  Big Data at Work: Dispelling the Myths, Uncovering the Opportunities , 2014 .

[39]  Piotr Indyk,et al.  Mining the stock market (extended abstract): which measure is best? , 2000, KDD '00.

[40]  Bart Baesens,et al.  Analytics in a Big Data World: The Essential Guide to Data Science and its Applications , 2014 .

[41]  Amela Karahasanovic,et al.  A survey of controlled experiments in software engineering , 2005, IEEE Transactions on Software Engineering.