Paper Information - Enhancing NameNode Fault Tolerance in Hadoop Distributed File System

Enhancing NameNode Fault Tolerance in Hadoop Distributed File System

In today's cloud computing environment, Hadoop is used to handle huge data sets, from tens of terabytes to petabytes, with HDFS for storage on commodity hardware and MapReduce for parallel data processing. In Hadoop version 1.0.3, there is a single metadata server, the NameNode, which stores the entire file system metadata in main memory, and most I/O operations depend on that metadata. Hadoop goes out of service if the NameNode crashes, which can happen because it works in memory that becomes exhausted under many concurrent accesses (3). The NameNode is therefore a single point of failure (SPOF) in Hadoop, and it must be made fault tolerant. To address this issue, a proactive predictive solution is proposed for enhancing NameNode fault tolerance. The solution proactively calculates the predicted time to crash of the NameNode due to resource exhaustion, using a back-propagation neural network. The proposed approach gives predictions with minimal error compared to the actual results. The NameNode's single point of failure can therefore be mitigated by proactively predicting the time to crash caused by memory resource exhaustion.
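As a rough illustration of the idea in the abstract, the sketch below trains a tiny back-propagation network to estimate time to memory exhaustion. It is not the authors' model: the synthetic data, the two input features (fraction of memory used and growth rate), the 180-minute scaling constant, the network size, and all names such as `TinyMLP` and `make_samples` are assumptions made only for this example.

```python
import math
import random

MAX_MINUTES = 180.0  # assumed scaling constant for the synthetic targets


def make_samples(n=200, seed=0):
    """Build synthetic (hypothetical) samples, not real NameNode traces.

    Features: fraction of memory used, scaled growth rate per minute.
    Target: minutes until memory is exhausted, scaled into (0, 1].
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        used = rng.uniform(0.1, 0.9)
        rate = rng.uniform(0.005, 0.02)      # fraction of memory per minute
        minutes_left = (1.0 - used) / rate   # at most 0.9 / 0.005 = 180
        samples.append(([used, rate * 50.0], minutes_left / MAX_MINUTES))
    return samples


class TinyMLP:
    """One hidden tanh layer and a linear output, trained by back-propagation."""

    def __init__(self, n_in, n_hidden, seed=1):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def forward(self, x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        y = sum(w * hj for w, hj in zip(self.w2, h)) + self.b2
        return h, y

    def train_step(self, x, target, lr):
        h, y = self.forward(x)
        err = y - target  # gradient of 0.5 * err**2 with respect to y
        for j, hj in enumerate(h):
            # Propagate the output error back through the tanh unit,
            # using the output weight before it is updated.
            grad_h = err * self.w2[j] * (1.0 - hj * hj)
            self.w2[j] -= lr * err * hj
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * grad_h * xi
            self.b1[j] -= lr * grad_h
        self.b2 -= lr * err
        return 0.5 * err * err


def mean_loss(net, samples):
    return sum(0.5 * (net.forward(x)[1] - t) ** 2
               for x, t in samples) / len(samples)


samples = make_samples()
train, held_out = samples[:160], samples[160:]
net = TinyMLP(n_in=2, n_hidden=6)

loss_before = mean_loss(net, held_out)
for _ in range(200):                 # epochs of plain stochastic gradient descent
    for x, t in train:
        net.train_step(x, t, lr=0.05)
loss_after = mean_loss(net, held_out)

# Convert one scaled prediction back to minutes for a held-out sample.
x0, t0 = held_out[0]
predicted_minutes = net.forward(x0)[1] * MAX_MINUTES
print(f"held-out loss before: {loss_before:.5f}, after: {loss_after:.5f}")
print(f"predicted {predicted_minutes:.1f} min, actual {t0 * MAX_MINUTES:.1f} min")
```

In a real deployment the inputs would come from monitored NameNode resource metrics rather than a formula, and the prediction would be used to trigger a preventive action before the estimated crash time; both steps are outside the scope of this sketch.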

Thandar Thein | Ohnmar Aung

[1] Cristina L. Abad, et al. Metadata Traces and Workload Models for Evaluating Big Storage Systems, 2012, 2012 IEEE Fifth International Conference on Utility and Cloud Computing.

[2] Jordi Torres, et al. Predicting Web Server Crashes: A Case Study in Comparing Prediction Algorithms, 2009, 2009 Fifth International Conference on Autonomic and Autonomous Systems.

[3] Javier Alonso López. Proactive software rejuvenation solution for web enviroments on virtualized platforms, 2011.

[4] Eric Sammer. Hadoop Operations, 2012.

[5] Chuck Lam, et al. Hadoop in Action, 2010.

[6] Anil K. Jain, et al. Artificial Neural Networks: A Tutorial, 1996, Computer.

[7] Rudolf Eigenmann, et al. Prediction of Resource Availability in Fine-Grained Cycle Sharing Systems Empirical Evaluation, 2007, Journal of Grid Computing.

[8] Diane Hatcher, et al. Considerations for Implementing a Highly Available or Disaster Recovery Environment, 2011.

[9] J. Nazuno. Haykin, Simon. Neural networks: A comprehensive foundation, Prentice Hall, Inc. Segunda Edición, 1999, 2000.

[10] Bo Dong, et al. Hadoop high availability through metadata replication, 2009, CloudDB@CIKM.

[11] Jimmy J. Lin, et al. Book Reviews: Data-Intensive Text Processing with MapReduce by Jimmy Lin and Chris Dyer, 2010, CL.

[12] Tom White, et al. Hadoop: The Definitive Guide, 2009.