Big data impact on society: a research roadmap for Europe

With its rapid growth and increasing adoption, big data is producing a substantial impact in society. Its usage is opening both opportunities such as new business models and economic gains and risks such as privacy violations and discrimination. Europe is in need of a comprehensive strategy to optimise the use of data for a societal benefit and increase the innovation and competitiveness of its productive activities. In this paper, we contribute to the definition of this strategy with a research roadmap to capture the economic, social and ethical, legal and political benefits associated with the use of big data in Europe. The present roadmap considers the positive and negative externalities associated with big data, maps research and innovation topics in the areas of data management, processing, analytics, protection, visualisation, as well as non-technical topics, to the externalities they can tackle, and provides a time frame to address these topics in order to deliver social impact, skills development and standardisation. Finally, it also identifies what sectors will be most benefited by each of the research efforts. The goal of the roadmap is to guide European research efforts to develop a socially responsible big data economy, and to allow stakeholders to identify and meet big data challenges and proceed with a shared understanding of the societal impact, positive and negative externalities and concrete problems worth investigating in future programmes.

[1]  Rolf Johansson,et al.  Case Study Methodology , 2003 .

[2]  Costantino Thanos,et al.  A Vision for Open Cyber-Scholarly Infrastructures , 2016, Publ..

[3]  Rachel L. Finn,et al.  Exploring big 'crisis' data in action: potential positive and negative externalities , 2015, ISCRAM.

[4]  Ashok N. Srivastava,et al.  Advances in Machine Learning and Data Mining for Astronomy , 2012 .

[5]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[6]  Shu-Ling Lu,et al.  Case Study Methodology , 2019, New Teaching Resources for Management in a Globalised World.

[7]  John Gantz,et al.  The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East , 2012 .

[8]  Robert Phaal,et al.  Technology roadmapping—A planning framework for evolution and revolution , 2004 .

[9]  Jeffrey Perkel,et al.  MAKING SENSE OF BIG DATA. , 2016, BioTechniques.

[10]  Vitaly Shmatikov,et al.  Towards a Privacy Research Roadmap for the Computing Community , 2016, ArXiv.

[11]  Latanya Sweeney,et al.  k-Anonymity: A Model for Protecting Privacy , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[12]  André Freitas,et al.  Big Data Curation , 2016, New Horizons for a Data-Driven Economy.

[13]  Michael L. Brodie Understanding Data Science: An Emerging Discipline for Data Intensive Discovery , 2015, DAMDID/RCDL.

[14]  Wolfgang Wahlster,et al.  New Horizons for a Data-Driven Economy , 2016, Springer International Publishing.

[15]  D. Boyd,et al.  Perspectives on Big Data, Ethics, and Society , 2016 .

[16]  Silvio Peroni,et al.  Setting our bibliographic references free: towards open citation data , 2015, J. Documentation.

[17]  Charles Anderson,et al.  The end of theory: The data deluge makes the scientific method obsolete , 2008 .

[18]  Tilman Becker,et al.  Big Data Usage , 2016, New Horizons for a Data-Driven Economy.

[19]  Kasper Hornbæk,et al.  Subjunctive interfaces: Extending applications to support parallel setup, viewing and control of alternative scenarios , 2008, TCHI.

[20]  Chris Mattmann,et al.  Computing: A vision for data science , 2013, Nature.

[21]  Z. Popovic,et al.  Crystal structure of a monomeric retroviral protease solved by protein folding game players , 2011, Nature Structural &Molecular Biology.

[22]  Nelia Lasierra,et al.  Big Data Analysis , 2016, New Horizons for a Data-Driven Economy.

[23]  Axel-Cyrille Ngonga Ngomo,et al.  Big Data Acquisition , 2016, New Horizons for a Data-Driven Economy.

[24]  Paul T. Groth,et al.  The anatomy of a nanopublication , 2010, Inf. Serv. Use.

[25]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[26]  Kristin E. Lauter,et al.  Cryptographic Cloud Storage , 2010, Financial Cryptography Workshops.

[27]  G Koren,et al.  Adherence and tolerability of iron-containing prenatal multivitamins in pregnant women with pre-existing gastrointestinal conditions , 2009, Journal of obstetrics and gynaecology : the journal of the Institute of Obstetrics and Gynaecology.

[28]  Jordan Raddick,et al.  Galaxy Zoo: Morphological Classification and Citizen Science , 2011, 1104.5513.

[29]  Jörg Daubert,et al.  Big Data Storage , 2021, New Horizons for a Data-Driven Economy.

[30]  Nicolas Le Novère,et al.  MIRIAM Resources: tools to generate and resolve robust cross-references in Systems Biology , 2007, BMC Systems Biology.

[31]  Raymond Heatherly,et al.  A Game Theoretic Framework for Analyzing Re-Identification Risk , 2015, PloS one.

[32]  Li Qin,et al.  Concept-level access control for the Semantic Web , 2003, XMLSEC '03.

[33]  Anna Fensel,et al.  A European research roadmap for optimizing societal impact of big data on environment and energy efficiency , 2017, 2017 Global Internet of Things Summit (GIoTS).