Overview on Performance Testing Approach in Big Data

Big data is defined as large amount of data which requires new technologies and architectures so that it becomes possible to extract value from it by capturing and analysis process. Big data due to its various properties like volume, velocity, variety, variability, value, complexity and performance put forward many challenges. Many organizations are facing challenges in facing test strategies for structured and unstructured data validation, setting up optimal test environment, working with non relational database and performing non functional testing. These challenges cause poor quality of data in production, delay in implementation and increase in cost. Map Reduce provides a parallel and scalable programming model for data-intensive business and scientific applications. To obtain the actual performance of big data applications, such as response time, maximum online user data capacity size, and a certain maximum processing capacity. Keywords: Bigdata, Testing strategies, MapReduce, Hadoop, performance testing

[1]  Zhenyu Liu,et al.  Research of performance test technology for big data applications , 2014, 2014 IEEE International Conference on Information and Automation (ICIA).

[2]  Zibin Zheng,et al.  Service-Generated Big Data and Big Data-as-a-Service: An Overview , 2013, 2013 IEEE International Congress on Big Data.

[3]  Marcos Lordello Chaim,et al.  Data-Flow Testing in the Large , 2014, ICST.

[4]  Jun Fan,et al.  Diagnosing Virtualized Hadoop Performance from Benchmark Results: An Exploratory Study , 2014, 2014 IEEE International Congress on Big Data.

[5]  Avita Katal,et al.  Big data: Issues, challenges, tools and Good practices , 2013, 2013 Sixth International Conference on Contemporary Computing (IC3).

[6]  Mahesh Gudipati,et al.  Big Data : Testing Approach to Overcome Quality Challenges , 2013 .

[7]  Jie Li,et al.  The overview of big data storage and management , 2014, 2014 IEEE 13th International Conference on Cognitive Informatics and Cognitive Computing.

[8]  Joseph K. Liu,et al.  Toward efficient and privacy-preserving computing in big data era , 2014, IEEE Network.

[9]  Rosangela de Fatima Pereira,et al.  Exploiting Hadoop Topology in Virtualized Environments , 2014, 2014 IEEE World Congress on Services.

[10]  Kostas Katrinis,et al.  Pythia: Faster Big Data in Motion through Predictive Software-Defined Network Optimization at Runtime , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[11]  Martin Hilbert,et al.  The World’s Technological Capacity to Store, Communicate, and Compute Information , 2011, Science.

[12]  Judy Qiu,et al.  Supporting Queries and Analyses of Large-Scale Social Media Data with Customizable and Scalable Indexing Techniques over NoSQL Databases , 2014, 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.