Paper information - What makes Big Data, Big Data? Exploring the ontological characteristics of 26 datasets

Big Data has been variously defined in the literature. In the main, definitions suggest that Big Data possess a suite of key traits: volume, velocity and variety (the 3Vs), but also exhaustivity, resolution, indexicality, relationality, extensionality and scalability. However, these definitions lack ontological clarity, with the term acting as an amorphous, catch-all label for a wide selection of data. In this paper, we consider the question ‘what makes Big Data, Big Data?’, applying Kitchin’s taxonomy of seven Big Data traits to 26 datasets drawn from seven domains, each of which is considered in the literature to constitute Big Data. The results demonstrate that only a handful of datasets possess all seven traits, and some lack volume, variety, or both. Rather than a single kind of Big Data, there are multiple forms. Our analysis reveals that the key definitional boundary markers are the traits of velocity and exhaustivity. We contend that Big Data as an analytical category needs to be unpacked, with the genus of Big Data further delineated and its various species identified. It is only through such ontological work that we will gain conceptual clarity about what constitutes Big Data, formulate how best to make sense of it, and identify how it might best be used to make sense of the world.

Rob Kitchin | Gavin McArdle
