Data-Driven Meets Theory-Driven Research in the Era of Big Data: Opportunities and Challenges for Information Systems Research

The era of big data provides many opportunities for conducting impactful research from both datadriven and theory-driven perspectives. However, data-driven and theory-driven research have progressed somewhat independently. In this paper, we develop a framework that articulates important differences between these two perspectives and propose a role for information systems research at their intersection. The framework presents a set of pathways that combine the datadriven and theory-driven perspectives. From these pathways, we derive a set of challenges, and show how they can be addressed by research in information systems. By doing so, we identify an important role that information systems research can play in advancing both data-driven and theory-driven research in the era of big data.

[1]  Roger H. L. Chiang,et al.  Database Submission - Market Dynamics and User-Generated Content About Tablet Computers , 2014, Mark. Sci..

[2]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[3]  Veda C. Storey,et al.  Conceptual Modeling for Financial Investment with Text Mining , 2015, ER.

[4]  Lakshmi S. Iyer,et al.  Big Data & Analytics for Societal Impact: Recent Research and Trends , 2018, Information Systems Frontiers.

[5]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[6]  A. Budden,et al.  Big data and the future of ecology , 2013 .

[7]  Hany Farid,et al.  The accuracy, fairness, and limits of predicting recidivism , 2018, Science Advances.

[8]  Irina Astrova,et al.  Reverse Engineering of Relational Databases to Ontologies , 2004, ESWS.

[9]  Jeffrey Parsons,et al.  An Information Model Based on Classification Theory , 1996 .

[10]  Michael Batty,et al.  Big data, smart cities and city planning , 2013, Dialogues in human geography.

[11]  Benedikt Fecher,et al.  Open Science: One Term, Five Schools of Thought , 2013 .

[12]  R. Groves Three Eras of Survey Research , 2011 .

[13]  Katharine Armstrong,et al.  Big data: a revolution that will transform how we live, work, and think , 2014 .

[14]  Veda C. Storey,et al.  A Framework for Managing Complexity in Information Systems , 2017, J. Database Manag..

[15]  Vasant Dhar,et al.  Data science and prediction , 2012, CACM.

[16]  Anindya Ghose,et al.  The Internet and Racial Hate Crime: Offline Spillovers from Online Access , 2015, MIS Q..

[17]  Timothy P. McNamara,et al.  Systems of Spatial Reference in Human Memory , 2001, Cognitive Psychology.

[18]  Cristina Baroglio,et al.  From Tags to Emotions: Ontology-driven Sentimental Analysis in the Social Semantic Web , 2011, DART@AI*IA.

[19]  Carson C. Woo,et al.  The Role of Conceptual Modeling in Managing and Changing the Business , 2011, ER.

[20]  Xindong Wu,et al.  A logical framework for identifying quality knowledge from different data sources , 2006, Decis. Support Syst..

[21]  Roman Lukyanenko,et al.  Artifact Sampling: Using Multiple Information Technology Artifacts to Increase Research Rigor , 2018, HICSS.

[22]  Reginald G. Golledge,et al.  HUMAN WAYFINDING AND COGNITIVE MAPS , 2003 .

[23]  Ian Sommerville,et al.  Requirements Engineering: Processes and Techniques , 1998 .

[24]  R. Kitchin,et al.  Big Data, new epistemologies and paradigm shifts , 2014, Big Data Soc..

[25]  John Mylopoulos,et al.  Strategic business modeling: representation and reasoning , 2014, Software & Systems Modeling.

[26]  Paulo B. Góes,et al.  Editor's comments: big data and IS research , 2014 .

[27]  C. Bard,et al.  Reference systems for coding spatial information in normal subjects and a deafferented patient , 2004, Experimental Brain Research.

[28]  Gunnar Rätsch,et al.  Opening the Black Box: Revealing Interpretable Sequence Motifs in Kernel-Based Learning Algorithms , 2015, ECML/PKDD.

[29]  Sandeep Purao,et al.  Action Design Research , 2011, MIS Q..

[30]  Veda C. Storey,et al.  An ontological analysis of the relationship construct in conceptual modeling , 1999, TODS.

[31]  R. Karsh The Two Cultures and The Scientific Revolution , 1961 .

[32]  Wolfgang Maass,et al.  Big Data and Theory , 2017 .

[33]  Stuart E. Madnick,et al.  Improving data quality through effective use of data semantics , 2006, Data Knowl. Eng..

[34]  Heikki Topi,et al.  Big Data, Big Decisions for Science, Society, and Business: Report on a Research Agenda Setting Workshop , 2015 .

[35]  H. V. Jagadish Big Data and Science: Myths and Reality , 2015, Big Data Res..

[36]  A. Wandersman,et al.  The importance of neighbors: The social, cognitive, and affective components of neighboring , 1985 .

[37]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[38]  Veda C. Storey,et al.  Mining social media data from sparse text: an application to diplomacy , 2016 .

[39]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[40]  Veda C. Storey,et al.  Domain design principles for managing complexity in conceptual modeling , 2017 .

[41]  Wenli Zhang,et al.  Predicting Asthma-Related Emergency Department Visits Using Big Data , 2015, IEEE Journal of Biomedical and Health Informatics.

[42]  Yair Wand,et al.  Theoretical foundations for conceptual modelling in information systems development , 1995, Decis. Support Syst..

[43]  D. Sandy Staples,et al.  The use of collaborative electronic media for information sharing: an exploratory study of determinants , 2000, J. Strateg. Inf. Syst..

[44]  Izak Benbasat,et al.  Designing Warning Messages for Detecting Biased Online Product Recommendations: An Empirical Investigation , 2015, Inf. Syst. Res..

[45]  Arun Rai,et al.  Synergies Between Big Data and Theory , 2016 .

[46]  Roger H. L. Chiang,et al.  Big Data Research in Information Systems: Toward an Inclusive Research Agenda , 2016, J. Assoc. Inf. Syst..

[47]  Tran Quoc Bao,et al.  Trends in adult body-mass index in 200 countries from 1975 to 2014: a pooled analysis of 1698 population-based measurement studies with 19·2 million participants , 2016, The Lancet.

[48]  Yair Wand,et al.  Extending Classification Principles from Information Modeling to Other Disciplines , 2013, J. Assoc. Inf. Syst..

[49]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[50]  Victor Pankratius,et al.  Computing in Astronomy: To See the Unseen , 2014, Computer.

[51]  Galit Shmueli,et al.  Predictive Analytics in Information Systems Research , 2010, MIS Q..

[52]  Bart Baesens,et al.  Call for Papers MISQ Special Issue on Transformational Issues of Big Data and Analytics in Networked Business , 2014 .

[53]  K. Eisenhardt Building theories from case study research , 1989, STUDI ORGANIZZATIVI.

[54]  Object-Oriented Analysis Computational Earth Science: Big Data Transformed Into Insight , 2013 .

[55]  Elizabeth Gilbert,et al.  Reproducibility Project: Results (Part of symposium called "The Reproducibility Project: Estimating the Reproducibility of Psychological Science") , 2014 .

[56]  Joerg Evermann,et al.  Associations and Mutual Properties - An Experimental Assessment , 2008, AMCIS.

[57]  Robert M. Davison,et al.  The Roles of Theory in Canonical Action Research , 2012, MIS Q..

[58]  Sandeep Purao,et al.  Educating reflective systems developers , 2002, Inf. Syst. J..

[59]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[60]  Stéphane Bressan,et al.  Context Interchange: New Features and Formalisms for the Intelligent Integration of Information Context Interchange: New Features and Formalisms for the Intelligent Integration of Information , 1997 .

[61]  M. J. O’Brien,et al.  Mapping collective behavior in the big-data era , 2014, Behavioral and Brain Sciences.

[62]  Vijayan Sugumaran,et al.  Ontologies for conceptual modeling: their creation, use, and management , 2002, Data Knowl. Eng..

[63]  Chulwoo Han,et al.  Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies , 2017, Expert Syst. Appl..

[64]  Ron Weber,et al.  Editor's comment: still desperately seeking the IT artifact , 2003 .

[65]  Ritu Agarwal,et al.  Matching Platforms and HIV Incidence: An Empirical Investigation of Race, Gender, and Socio-Economic Status , 2015, Manag. Sci..

[66]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[67]  Tanja Hueber,et al.  Information Systems Development Methodologies Techniques And Tools , 2016 .

[68]  Gina Neff,et al.  Talking to Bots: Symbiotic Agency and the Case of Tay , 2016 .

[69]  Marleen Huysman,et al.  Debating big data: A literature review on realizing value from big data , 2017, J. Strateg. Inf. Syst..

[70]  Mathias Weske,et al.  Action patterns in business process model repositories , 2012, Comput. Ind..

[71]  John Mylopoulos,et al.  Can Goal Reasoning Techniques Be Used for Strategic Decision-Making? , 2016, ER.

[72]  Mariel M. Finucane,et al.  Bayesian Estimation of Population-Level Trends in Measures of Health Status , 2014, 1405.4682.

[73]  Charles Anderson,et al.  The end of theory: The data deluge makes the scientific method obsolete , 2008 .

[74]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[75]  Edward N. Zalta,et al.  The Stanford Encyclopedia of Philosophy: A Developed Dynamic Reference Work , 2002 .