Deploying massive runs of evolutionary algorithms with ECJ and Hadoop: Reducing interest points required for face recognition

In this paper we present a new strategy for deploying massive runs of evolutionary algorithms with the well-known Evolutionary Computation Library (ECJ) tool, which we combine with the MapReduce model so as to allow the deployment of computing intensive runs of evolutionary algorithms on big data infrastructures. Moreover, by addressing a hard real life problem, we show how the new strategy allows us to address problems that cannot be solved with more traditional approaches. Thus, this paper shows that by using the Hadoop framework ECJ users can, by means of a new parameter, choose where the run will be launched, whether in a Hadoop based infrastructure or in a desktop computer. Moreover, together with the performed tests we address the well-known face recognition problem with a new purpose: to allow a genetic algorithm to decide which are the more relevant interest points within the human face. Massive runs have allowed us to reduce the set from about 60 to just 20 points. In this way, recognition tasks based on the solution provided by the genetic algorithm will work significantly quicker in the future, given that just 20 points will be required. Therefore, two goals have been achieved: (a) to allow ECJ users to launch massive runs of evolutionary algorithms on big data infrastructures and also (b) to demonstrate the capabilities of the tool to successfully improve results regarding the problem of face recognition.

[1]  Kalyan Veeramachaneni,et al.  Flex-GP: Genetic Programming on the Cloud , 2012, EvoApplications.

[2]  Juan Humberto Sossa Azuela,et al.  Self organizing natural scene image retrieval , 2013, Expert Syst. Appl..

[3]  Werner Mellis,et al.  Success factors of organizational change in software process improvement , 1998 .

[4]  Hossein Mobahi,et al.  Toward a Practical Face Recognition System: Robust Alignment and Illumination by Sparse Representation , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Leonardo Vanneschi,et al.  An Empirical Study of Multipopulation Genetic Programming , 2003, Genetic Programming and Evolvable Machines.

[6]  Leonardo Trujillo,et al.  Automated Design of Image Operators that Detect Interest Points , 2008, Evolutionary Computation.

[7]  V. Balamurugan,et al.  A Novel Approach for Facial Feature Extraction in Face Recognition , 2014 .

[8]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[9]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[10]  Ho-Hyun Park,et al.  Tagging and classifying facial images in cloud environments based on KNN using MapReduce , 2015 .

[11]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[12]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[13]  Leonardo Trujillo,et al.  ECJ+HADOOP: An Easy Way to Deploy Massive Runs of Evolutionary Algorithms , 2016, EvoApplications.

[14]  Wei Li,et al.  A Fast Face Recognition Algorithm Based on MapReduce , 2014, 2014 Seventh International Symposium on Computational Intelligence and Design.

[15]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  F. Prieto,et al.  Extracción de puntos característicos del rostro para medidas antropométricas , 2010 .

[17]  Miguel E. Ruiz,et al.  Automatic Classification of Medical Images for Content Based Image Retrieval Systems (CBIR) , 2008 .

[18]  Juan Julián Merelo Guervós,et al.  EvAg: a scalable peer-to-peer evolutionary algorithm , 2010, Genetic Programming and Evolvable Machines.

[19]  Leonardo Trujillo,et al.  Automatic Feature Localization in Thermal Images for Facial Expression Recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[20]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[21]  Leonardo Vanneschi,et al.  An MPI-based tool for distributed genetic programming , 2000, Proceedings IEEE International Conference on Cluster Computing. CLUSTER 2000.

[22]  Juan Julián Merelo Guervós,et al.  Increasing GP Computing Power for Free via Desktop GRID Computing and Virtualization , 2009, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[23]  Gustavo Olague,et al.  Genetic programming as strategy for learning image descriptor operators , 2013, Intell. Data Anal..

[24]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Juan Julián Merelo Guervós,et al.  EvoSpace: A Distributed Evolutionary Platform Based on the Tuple Space Model , 2013, EvoApplications.

[26]  Anil K. Jain,et al.  Face Detection in Color Images , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Zhiqiang Yao,et al.  High performance parallel evolutionary algorithm model based on MapReduce framework , 2013, Int. J. Comput. Appl. Technol..

[28]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[29]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[30]  El-Ghazali Talbi,et al.  Grid computing for parallel bioinspired algorithms , 2006, J. Parallel Distributed Comput..

[31]  Weiming Tong,et al.  Research on Fusion Algorithm of Elliptic Curve Cryptography in Advanced Metering Infrastructure Communication , 2014, 2014 Seventh International Symposium on Computational Intelligence and Design.

[32]  Leonardo Trujillo,et al.  Evolutionary-computer-assisted design of image operators that detect interest points using genetic programming , 2011, Image Vis. Comput..

[33]  Erick Cantú-Paz,et al.  Efficient and Accurate Parallel Genetic Algorithms , 2000, Genetic Algorithms and Evolutionary Computation.

[34]  Marco Tomassini,et al.  A Parallel Genetic Programming Tool Based on PVM , 1999, PVM/MPI.

[35]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[36]  Lei Zhang,et al.  Sparse representation or collaborative representation: Which helps face recognition? , 2011, 2011 International Conference on Computer Vision.