Big Data—Technologies and Potential

Recently, the term Big Data has gained tremendous popularity in business and academic discussions and is now prominently used in scientific publications (Jacobs, Communications of the ACM—A Blind person’s interaction with technology, 2009), business literature (Mayer-Schonberger and Cukier, Big Data. A revolution that will transform how we live, work, and think, 2013; McAfee and Brynjolfsson, Harvard Business Review 90, 2012), whitepapers and analyst reports (Brown et al., Big Data. The next frontier for innovation, competition, and productivity, 2011b; Economist Intelligence Unit 2012; Schroeck et al., Analytics: The real-world use of Big Data, 2012), as well as in popular magazines (Cukier 2010). While all these references somewhat associate the term with a new paradigm for data processing and analytics, the perception of what exactly it refers to are very diverse. The gap in the understanding of the phenomenon of Big Data is highlighted by the results of a recent study – in which respondents were asked to choose descriptions of the term Big Data – resulting in diverse characterizations such as, e.g., “A greater scope of information”, “New kinds of data and analysis” or “Real-time information” (Schroeck et al. 2012).

[1]  Michael Stonebraker,et al.  The Case for Shared Nothing , 1985, HPTS.

[2]  L. Nelson Data, data everywhere. , 1997, Critical care medicine.

[3]  Jennifer Widom,et al.  Continuous queries over data streams , 2001, SGMD.

[4]  Jonathan Rodden,et al.  Strength in Numbers? , 2002 .

[5]  The numbers of our lives. , 2003, Pediatric critical care medicine : a journal of the Society of Critical Care Medicine and the World Federation of Pediatric Intensive and Critical Care Societies.

[6]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[7]  Michael Stonebraker,et al.  The 8 requirements of real-time stream processing , 2005, SGMD.

[8]  T. Davenport Competing on analytics. , 2006, Harvard business review.

[9]  Deciding factor. , 2006, Managed care.

[10]  Jim Gray,et al.  2020 Computing: Science in an exponential world , 2006, Nature.

[11]  Jeanne G. Harris,et al.  Competing on Analytics: The New Science of Winning , 2007 .

[12]  David Luckham,et al.  The power of events - an introduction to complex event processing in distributed enterprise systems , 2002, RuleML.

[13]  Michael Stonebraker,et al.  A comparison of approaches to large-scale data analysis , 2009, SIGMOD Conference.

[14]  Adam Jacobs,et al.  The pathologies of big data , 2009, Commun. ACM.

[15]  Alain Biem,et al.  Real-Time Traffic Information Management using Stream Computing , 2010, IEEE Data Eng. Bull..

[16]  Jennifer Chu-Carroll,et al.  Building Watson: An Overview of the DeepQA Project , 2010, AI Mag..

[17]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[18]  Michael Stonebraker,et al.  MapReduce and parallel DBMSs: friends or foes? , 2010, CACM.

[19]  Lorin M. Hitt,et al.  Strength in Numbers: How Does Data-Driven Decisionmaking Affect Firm Performance? , 2011, ICIS 2011.

[20]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[21]  D. Luckham Event Processing for Business: Organizing the Real-Time Enterprise , 2011 .

[22]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[23]  Shirish Tatikonda,et al.  SystemML: Declarative machine learning on MapReduce , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[24]  Doug Patterson,et al.  Ethics of big data , 2012 .

[25]  Dominic Barton,et al.  Making advanced analytics work for you. , 2012, Harvard business review.

[26]  Thomas J. Steenburgh,et al.  Motivating Salespeople: What Really Works , 2012, Harvard business review.

[27]  D. Kiron,et al.  Sustainability nears a tipping point , 2012 .

[28]  Martin Fowler,et al.  NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence , 2012 .

[29]  Omer Tene,et al.  Big Data for All: Privacy and User Control in the Age of Analytics , 2012 .

[30]  Omer Tene Jules Polonetsky,et al.  Privacy in the Age of Big Data: A Time for Big Decisions , 2012 .

[31]  Christopher Kuner,et al.  The Challenge of "Big Data" for Data Protection , 2012 .

[32]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[33]  John Gantz,et al.  The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East , 2012 .

[34]  Viktor Mayer-Schnberger,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2013 .

[35]  Michelle Cheatham,et al.  Privacy in the age of big data , 2015, 2015 International Conference on Collaboration Technologies and Systems (CTS).