Big Data Framework

We are constantly being told that we live in the Information Era - the Age of BIG data. It is clearly apparent that organizations need to employ data-driven decision making to gain competitive advantage. Processing, integrating and interacting with more data should make it better data, providing both more panoramic and more granular views to aid strategic decision making. This is made possible via Big Data exploiting affordable and usable Computational and Storage Resources. Many offerings are based on the Map-Reduce and Hadoop paradigms and most focus solely on the analytical side. Nonetheless, in many respects it remains unclear what Big Data actually is, current offerings appear as isolated silos that are difficult to integrate and/or make it difficult to better utilize existing data and systems. Paper addresses this lacunae by characterising the facets of Big Data and proposing a framework in which Big Data applications can be developed. The framework consists of three Stages and seven Layers to divide Big Data application into modular blocks. The aim is to enable organizations to better manage and architect a very large Big Data application to gain competitive advantage by allowing management to have a better handle on data processing.

[1]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[2]  B. Prabavathy,et al.  A novel indexing scheme for efficient handling of small files in Hadoop Distributed File System , 2013, 2013 International Conference on Computer Communication and Informatics.

[3]  A Comparison of the Top Four Enterprise-Architecture Methodologies , 2010 .

[4]  Laurent Amsaleg,et al.  Distributed high-dimensional index creation using Hadoop, HDFS and C++ , 2012, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI).

[5]  Thomas J. Steenburgh,et al.  Motivating Salespeople: What Really Works , 2012, Harvard business review.

[6]  Anne Laurent,et al.  Reduce, You Say: What NoSQL Can Do for Data Aggregation and BI in Large Repositories , 2011, 2011 22nd International Workshop on Database and Expert Systems Applications.

[7]  Ian T. Foster,et al.  Software as a service for data scientists , 2012, Commun. ACM.

[8]  Anthony K. H. Tung,et al.  Efficient and Scalable Processing of String Similarity Join , 2013, IEEE Transactions on Knowledge and Data Engineering.

[9]  M. Courtney Puzzling out big data , 2012 .

[10]  Wes Nichols,et al.  Advertising Analytics 2.0. (cover story) , 2013 .

[11]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[12]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[13]  Volker Markl,et al.  Big Data Analytics on Modern Hardware Architectures: A Technology Survey , 2012, eBISS.

[14]  Sophia Ananiadou,et al.  Parallel Text Mining for Large Text Processing , 2013 .

[15]  Dominic Barton,et al.  Making advanced analytics work for you. , 2012, Harvard business review.

[16]  T. Davenport,et al.  Data scientist: the sexiest job of the 21st century. , 2012, Harvard business review.

[17]  Brian M. Gaff,et al.  Privacy and Big Data , 2014, Computer.

[18]  John Sadowsky Historias de liderazgo y branding , 2011 .

[19]  Michael Stonebraker,et al.  Researchers' big data crisis; understanding design and functionality , 2012, Commun. ACM.

[20]  Terence Craig,et al.  Privacy and Big Data , 2011 .

[21]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[22]  Craig MacDonald,et al.  MapReduce indexing strategies: Studying scalability and efficiency , 2012, Inf. Process. Manag..

[23]  Robert M. French Moving beyond the Turing test , 2012, CACM.

[24]  Roger L. Martin Los catalizadores de la innovación , 2011 .

[25]  Peter Mork,et al.  From Data to Decisions: A Value Chain for Big Data , 2013, IT Professional.

[26]  Piyush Malik,et al.  Governing Big Data: Principles and practices , 2013, IBM J. Res. Dev..

[27]  Kurt Fanning,et al.  Big Data: Implications for Financial Managers , 2013 .