Grounded reality meets machine learning: A deep-narrative analysis framework for energy policy research

Text-based data sources like narratives and stories have become increasingly popular as critical insight generator in energy research and social science. However, their implications in policy application usually remain superficial and fail to fully exploit state-of-the-art resources which digital era holds for text analysis. This paper illustrates the potential of deep-narrative analysis in energy policy research using text analysis tools from the cutting-edge domain of computational social sciences, notably topic modelling. We argue that a nested application of topic modelling and grounded theory in narrative analysis promises advances in areas where manual-coding driven narrative analysis has traditionally struggled with directionality biases, scaling, systematisation and repeatability. The nested application of the topic model and the grounded theory goes beyond the frequentist approach of narrative analysis and introduces insight generation capabilities based on the probability distribution of words and topics in a text corpus. In this manner, our proposed methodology deconstructs the corpus and enables the analyst to answer research questions based on the foundational element of the text data structure. We verify theoretical compatibility through a meta-analysis of a state-of-the-art bibliographic database on energy policy, narratives and computational social science. Furthermore, we establish a proof-of-concept using a narrative-based case study on energy externalities in slum rehabilitation housing in Mumbai, India. We find that the nested application contributes to the literature gap on the need for multidisciplinary methodologies that can systematically include qualitative evidence into policymaking.

[1]  Shion Guha,et al.  Comparing grounded theory and topic modeling: Extreme divergence or unlikely convergence? , 2017, J. Assoc. Inf. Sci. Technol..

[2]  William L. Weber,et al.  A directional slacks-based measure of technical inefficiency , 2009 .

[3]  Marie-Francine Moens,et al.  Identifying Word Translations from Comparable Corpora Using Latent Topic Models , 2011, ACL.

[4]  Arnab Jana,et al.  Mumbai slums since independence: Evaluating the policy outcomes , 2015 .

[5]  A. Strauss,et al.  Grounded Theory in Practice , 1997 .

[6]  Heiner Stuckenschmidt,et al.  Multidimensional topic analysis in political texts , 2014, Data Knowl. Eng..

[7]  R. Bardhan,et al.  How does slum rehabilitation influence appliance ownership? A structural model of non-income drivers , 2019, Energy policy.

[8]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[9]  Renata Tyszczuk,et al.  Gathering around stories: Interdisciplinary experiments in support of energy system transitions , 2017 .

[10]  Sergey I. Nikolenko,et al.  Topic modelling for qualitative studies , 2017, J. Inf. Sci..

[11]  Cecilia Mascolo,et al.  Talking Places: Modelling and Analysing Linguistic Content in Foursquare , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[12]  Antony Bryant,et al.  The SAGE Handbook of Current Developments in Grounded Theory , 2019 .

[13]  Thomas Jacobs,et al.  Topic models meet discourse analysis: a quantitative tool for a qualitative approach , 2019, International Journal of Social Research Methodology.

[14]  Melanie Birks,et al.  Grounded theory research: A design framework for novice researchers , 2019, SAGE open medicine.

[15]  Michael Williams,et al.  The Art of Coding and Thematic Exploration in Qualitative Research , 2019 .

[16]  P. Gill,et al.  Methods of data collection in qualitative research: interviews and focus groups , 2008, BDJ.

[17]  D. Blei,et al.  Exploiting affinities between topic modeling and the sociological perspective on culture: Application to newspaper coverage of U.S. government arts funding , 2013 .

[18]  Margaret E. Roberts,et al.  Navigating the Local Modes of Big Data: The Case of Topic Models , 2016, Computational Social Science.

[19]  R. Trotter Qualitative research sample design and sample size: resolving and unresolved issues and inferential imperatives. , 2012, Preventive medicine.

[20]  P. Bazeley Issues in Mixing Qualitative and Quantitative Approaches to Research , 2004 .

[21]  Matthew L. Jockers Macroanalysis: Digital Methods and Literary History , 2013 .

[22]  Arthur Petersen,et al.  Exploring the Impact of the IPCC Assessment Reports on Science , 2011 .

[23]  L. Hermwille The role of narratives in socio-technical transitions—Fukushima and the energy regimes of Japan, Germany, and the United Kingdom , 2016 .

[24]  Kathryn B. Janda,et al.  Telling tales: using stories to remake energy policy , 2015 .

[25]  Ronita Bardhan,et al.  India nudges to contain COVID-19 pandemic: A reactive public policy analysis using machine-learning based topic modelling , 2020, PloS one.

[26]  A. Corner,et al.  Using Narrative Workshops to socialise the climate debate: Lessons from two case studies – centre-right audiences and the Scottish public , 2017 .

[27]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[28]  Massimo Aria,et al.  bibliometrix: An R-tool for comprehensive science mapping analysis , 2017, J. Informetrics.

[29]  Rob Procter,et al.  Mapping Consumer Sentiment Toward Wireless Services Using Geospatial Twitter Data , 2019, IEEE Access.

[30]  Tiago P. Peixoto,et al.  A network approach to topic models , 2017, Science Advances.

[31]  Ronita Bardhan,et al.  Gender, domestic energy and design of inclusive low-income habitats: A case of slum rehabilitation housing in Mumbai, India , 2019, Energy Research & Social Science.

[32]  A. Strauss,et al.  The discovery of grounded theory: strategies for qualitative research aldine de gruyter , 1968 .

[33]  Noah A. Smith,et al.  Predicting Response to Political Blog Posts with Topic Models , 2009, NAACL.

[34]  Justin Grimmer,et al.  Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts , 2013, Political Analysis.

[35]  L. Norford,et al.  Indoor air quality among Mumbai's resettled populations: Comparing Dharavi slum to nearby rehabilitation sites , 2020 .

[36]  Colin Fay,et al.  Text Mining with R: A Tidy Approach , 2018 .

[37]  Mithra Moezzia,et al.  Narratives and Storytelling in Energy and Climate Change Research , 2017 .

[38]  R. Lertzman Environmental Melancholia: Psychoanalytic dimensions of engagement , 2015 .

[39]  G. Evans,et al.  Housing Quality and Health: An Evaluation of Slum Rehabilitation in India , 2017 .

[40]  R. Lamberts,et al.  Energy Justice in Slum Rehabilitation Housing: An Empirical Exploration of Built Environment Effects on Socio-Cultural Energy Demand , 2020, Sustainability.

[41]  Yan Wang,et al.  DUET: Data-Driven Approach Based on Latent Dirichlet Allocation Topic Modeling , 2019, J. Comput. Civ. Eng..

[42]  C. Brodsky The Discovery of Grounded Theory: Strategies for Qualitative Research , 1968 .

[43]  K. Janda,et al.  Using stories, narratives, and storytelling in energy and climate change research , 2017 .

[44]  Kurt Hornik,et al.  topicmodels : An R Package for Fitting Topic Models , 2016 .

[45]  Y. Laouris,et al.  A Systemic Evaluation of the State of Affairs Following the Negative Outcome of the Referendum in Cyprus Using the Structured Dialogic Design Process , 2009, Systemic Practice and Action Research.

[46]  Cedric E. Ginestet ggplot2: Elegant Graphics for Data Analysis , 2011 .

[47]  C. Howarth Informing decision making on climate change and low carbon futures: Framing narratives around the United Kingdom’s fifth carbon budget , 2017 .

[48]  Barry Goodchild,et al.  Once Upon a Time...How to tell a good energy efficiency story that 'sticks' , 2015 .

[49]  Yan Wang,et al.  Tracking urban geo-topics based on dynamic topic model , 2020, Comput. Environ. Urban Syst..

[50]  Peng Lin,et al.  A topic modeling based bibliometric exploration of hydropower research , 2016 .

[51]  Anonymous Authors,et al.  Modeling Polarizing Topics: When Do Different Political Communities Respond Differently to the Same News? , 2012 .

[52]  Ray Pawson,et al.  Evidence-based Policy: The Promise of `Realist Synthesis' , 2002 .

[53]  L. Connelly Grounded theory. , 2013, Medsurg nursing : official journal of the Academy of Medical-Surgical Nurses.

[54]  Min Song,et al.  Analyzing the Political Landscape of 2012 Korean Presidential Election in Twitter , 2014, IEEE Intelligent Systems.

[55]  M. Workman,et al.  Strategic narratives in climate change: Towards a unifying narrative to address the action gap on climate change , 2017 .

[56]  David J. C. MacKay Sustainable Energy - Without the Hot Air , 2008 .

[57]  Y. Chandra,et al.  Topic Modeling the Research‐Practice Gap in Public Administration , 2019, Public Administration Review.

[58]  B. Rapkin,et al.  Leveraging Latent Dirichlet Allocation in processing free-text personal goals among patients undergoing bladder cancer surgery , 2019, Quality of Life Research.

[59]  Daniel Kifer,et al.  What Is an Opinion About? Exploring Political Standpoints Using Opinion Scoring Model , 2010, AAAI.

[60]  Olivier Toubia,et al.  Extracting Features of Entertainment Products: A Guided Latent Dirichlet Allocation Approach Informed by the Psychology of Media Consumption , 2018, Journal of Marketing Research.

[61]  Timothy Baldwin,et al.  Automatic Detection and Language Identification of Multilingual Documents , 2014, TACL.

[62]  David A. Clausi,et al.  A Multiscale Latent Dirichlet Allocation Model for Object-Oriented Clustering of VHR Panchromatic Satellite Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[63]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[64]  R. Bardhan,et al.  Discomfort and distress in slum rehabilitation: Investigating a rebound phenomenon using a backcasting approach , 2019, Habitat international.

[65]  Xia Feng,et al.  Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey , 2017, Multimedia Tools and Applications.

[66]  Rosemary Randall,et al.  Loss and Climate Change: The Cost of Parallel Narratives , 2009 .