A Study on MapReduce: Challenges and Trends

Nowadays we all are surrounded by Big data. The term ‘Big Data’ itself indicates huge volume, high velocity, variety and veracity i.e. uncertainty of data which gave rise to new difficulties and challenges. Big data generated may be structured data, Semi Structured data or unstructured data. For existing database and systems lot of difficulties are there to process, analyze, store and manage such a Big Data. The Big Data challenges are Protection, Curation, Capture, Analysis, Searching, Visualization, Storage, Transfer and sharing. Map Reduce is a framework using which we can write applications to process huge amount of data, in parallel, on large clusters of commodity hardware in a reliable manner. Lot of efforts have been put by different researchers to make it simple, easy, effective and efficient. In our survey paper we emphasized on the working of Map Reduce, challenges, opportunities and recent trends so that researchers can think on further improvement.

[1]  Noh Kyoo-sung,et al.  Bigdata Platform Design and Implementation Model , 2015 .

[2]  Jie Wu,et al.  Dache: A data aware caching for big-data applications using the MapReduce framework , 2013, 2013 Proceedings IEEE INFOCOM.

[3]  Miriam A. M. Capretz,et al.  Challenges for MapReduce in Big Data , 2014, 2014 IEEE World Congress on Services.

[4]  Zeba Khanam,et al.  Map Reduce: A Survey Paper on Recent Expansion , 2015 .

[5]  Du Zhang,et al.  Inconsistencies in big data , 2013, 2013 IEEE 12th International Conference on Cognitive Informatics and Cognitive Computing.

[6]  Vimla Jethani,et al.  Distributed Metadata Management Scheme in HDFS , 2013 .

[7]  Sungyoung Lee,et al.  Context‐aware scheduling in MapReduce: a compact review , 2015, Concurr. Comput. Pract. Exp..

[8]  Priyaneet Bhatia,et al.  Correlated Appraisal of Big Data, Hadoop and MapReduce , 2015 .

[9]  Jun Qu,et al.  The Optimization and Improvement of MapReduce in Web Data Mining , 2015 .

[10]  Gabriel Antoniu,et al.  Optimizing intermediate data management in MapReduce computations , 2011, CloudCP '11.

[11]  Ananthi Sheshasayee,et al.  Comparison of Machine Learning Algorithm on Map Reduction for Performance Improvement in Big Data , 2015 .

[12]  P. Kannan Review Paper on Data-aware Caching for Big Data Applications , 2015 .

[13]  Wei Fan,et al.  Mining big data: current status, and forecast to the future , 2013, SKDD.

[14]  G. Vadivu,et al.  MapReduce: A Technical Review , 2016 .

[15]  Atul Negi,et al.  Performance Improvement of MapReduce Framework in Heterogeneous Context using Reinforcement Learning , 2015 .

[16]  Jun Liu,et al.  An Efficient Job Scheduling for MapReduce Clusters , 2015 .

[17]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[18]  Bo Zhang,et al.  Self-Configuration of the Number of Concurrently Running MapReduce Jobs in a Hadoop Cluster , 2015, 2015 IEEE International Conference on Autonomic Computing.