Training Backpropagation Neural Network in MapReduce

A BP neural network is typically trained serially on a single machine, but massive training data makes this process slow and resource-intensive. One effective solution to these problems is distributed training with the MapReduce framework. Several such methods have been proposed, but they remain slow for neural networks with complex structures. This paper presents MR-TMNN (MapReduce based Training in Mapper Neural Network), a new MapReduce-based method for BP neural network training. It moves most of the training process into the Mappers, which emit only the variations of weights and thresholds to the Reducer for a batch update. This effectively reduces the volume of intermediate data produced by the Mappers, lowering I/O cost and thereby accelerating training. Experimental results show that, compared with the conventional training method, MR-TMNN achieves better convergence without losing much accuracy, and it continues to perform well as the complexity of the network structure increases.
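The mapper-heavy pattern described above can be sketched in plain Python. This is a minimal illustration, not the paper's implementation: it uses a single sigmoid unit instead of a full multilayer BP network, simulates MapReduce shards with ordinary lists, and all function names (`mapper`, `reducer`) are hypothetical. The key point it shows is that each mapper emits only accumulated weight/threshold deltas for its data split, so the intermediate data volume is independent of the shard size.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def mapper(shard, weights, bias, lr):
    # Each mapper runs local BP passes over its data shard and emits
    # only the accumulated weight/threshold deltas (not per-sample
    # records), which is how MR-TMNN shrinks the intermediate data.
    dw = [0.0] * len(weights)
    db = 0.0
    for x, y in shard:
        z = sum(w * xi for w, xi in zip(weights, x)) + bias
        out = sigmoid(z)
        err = (y - out) * out * (1.0 - out)  # BP delta for a sigmoid unit
        for i, xi in enumerate(x):
            dw[i] += lr * err * xi
        db += lr * err
    return dw, db

def reducer(deltas, weights, bias):
    # The reducer batch-updates the global parameters by summing
    # the deltas emitted by all mappers.
    for dw, db in deltas:
        for i in range(len(weights)):
            weights[i] += dw[i]
        bias += db
    return weights, bias

# Toy data: learn logical OR, split into two shards to mimic
# distributed input splits; each epoch corresponds to one MapReduce job.
data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 1)]
shards = [data[:2], data[2:]]
w, b = [0.0, 0.0], 0.0
for epoch in range(2000):
    deltas = [mapper(s, w, b, lr=0.5) for s in shards]
    w, b = reducer(deltas, w, b)

preds = [1 if sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) > 0.5 else 0
         for x, _ in data]
```

In a real Hadoop job the epoch loop would be a chain of MapReduce jobs, with the updated weights broadcast back to the mappers (e.g. via the distributed cache) before each iteration.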