Ready for data analytics?: data collection and creation in local governments

A good understanding of local government data, particularly the contexts in which the data are collected and created, is essential in order for data guardians and users to effectively manage that data and to draw accurate and rich information from them. However, our knowledge of both local government data and data contexts is very limited, which poses a wide range of challenges to data initiatives inside and outside local governments, from open data to data analytics. This paper explores the mechanisms of data generation and the determinants of data contexts in local governments. To do so, we conducted an in-depth case study of data collection and creation in a city government in New York State in the U.S., focusing on administrative data. We found that local government data are collected or created in three ways: original raw data collection, original raw data creation, and second or higher-order data creation. Both the internal and external environments of local governments add complexity to data management by influencing who, what, where, when and how to collect and create data for administrative purposes through these processes.

[1]  R. Yin Case Study Research: Design and Methods , 1984 .

[2]  M. Zeleny Management support systems: Towards integrated knowledge management , 1987 .

[3]  I. Nonaka A Dynamic Theory of Organizational Knowledge Creation , 1994 .

[4]  Jean-Paul A. Barthès,et al.  Knowledge Management , 1994, Encyclopedia of Database Systems.

[5]  Abby Smith Why Digitize? , 1999 .

[6]  J. Wyatt Decision support systems. , 2000, Journal of the Royal Society of Medicine.

[7]  Erik Duval,et al.  Metadata Principles and Practicalities , 2002, D Lib Mag..

[8]  Frederick E. Petry,et al.  Extraction and representation of contextual information for knowledge discovery in texts , 2003, Inf. Sci..

[9]  U. Benz,et al.  Multi-resolution, object-oriented fuzzy analysis of remote sensing data for GIS-ready information , 2004 .

[10]  Matthew Jones,et al.  Maximizing the Value of Ecological Data with Structured Metadata: An Introduction to Ecological Metadata Language (EML) and Principles for Metadata Creation , 2005 .

[11]  Andrew McCallum,et al.  Information Extraction , 2005, ACM Queue.

[12]  Chaim Zins,et al.  Conceptual approaches for defining data, information, and knowledge , 2007, J. Assoc. Inf. Sci. Technol..

[13]  Chaim Zins Conceptual approaches for defining data, information, and knowledge: Research Articles , 2007 .

[14]  Z. Zainal Case Study As a Research Method , 2007 .

[15]  W. David Schwaderer,et al.  Data Lifecycles: Managing Data for Strategic Advantage , 2007 .

[16]  Hans-Georg Kemper,et al.  Management Support with Structured and Unstructured Data—An Integrated Business Intelligence Framework , 2008, Inf. Syst. Manag..

[17]  Beng Chin Ooi,et al.  EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data , 2008, SIGMOD Conference.

[18]  Sharon S. Dawes,et al.  Stewardship and usefulness: Policy principles for information-based transparency , 2010, Gov. Inf. Q..

[19]  Paul T. Jaeger,et al.  Using ICTs to create a culture of transparency: E-government and social media as openness and anti-corruption tools for societies , 2010, Gov. Inf. Q..

[20]  Jay H. Bernstein,et al.  The Data-Information-Knowledge-Wisdom Hierarchy and its Antithesis , 2011 .

[21]  Boris Otto,et al.  Toward a functional reference model for master data quality management , 2011, Information Systems and e-Business Management.

[22]  Yannis Charalabidis,et al.  Benefits, Adoption Barriers and Myths of Open Data and Open Government , 2012, Inf. Syst. Manag..

[23]  Maxat Kassen,et al.  A promising phenomenon of open data: A case study of the Chicago open data project , 2013, Gov. Inf. Q..

[24]  Boris Otto,et al.  Management of the master data lifecycle: a framework for analysis , 2013, J. Enterp. Inf. Manag..

[25]  Sunil Choenni,et al.  On the barriers for local government releasing open data , 2014, Gov. Inf. Q..

[26]  Jignesh M. Patel,et al.  Big data and its technical challenges , 2014, CACM.

[27]  R. Ackoff From Data to Wisdom , 2014 .

[28]  Brian A. Harris-Kojetin,et al.  Innovations in Federal Statistics: Combining Data Sources While Protecting Privacy , 2017 .