GRASP Algorithm for Optimization of Grids for Multiple Classifier System

In recent years the volume of data used in scientific researches and industry has increased significantly. Distributed computing systems including Grids use the public Internet to share computational resources of research institutions around the world in order to process the data. Due to large data volumes being transferred, network aspects of Grids have become important. In this work we introduce a model of an overlay Grid system, which could be used by the distributed recognition system based on the idea of combining classifiers. We formulate an Integer Programming optimization problem with the objective to minimize the overall cost including processing and data transfer. Next, an effective heuristic algorithm is developed to solve the problem. Results of numerical experiments showing the comparison of the heuristic against solutions provided by CPLEX solver are presented.

[1]  Ying Zhu,et al.  Overlay Networks with Linear Capacity Constraints , 2008, IEEE Trans. Parallel Distributed Syst..

[2]  A.C. Campilho,et al.  Combining independent and unbiased classifiers using weighted average , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[3]  Lei Yu,et al.  Grid Resource Management: Toward Virtual and Services Compliant Grid Computing , 2008 .

[4]  Alex Alves Freitas,et al.  Mining Very Large Databases with Parallel Processing , 1997, The Kluwer International Series on Advances in Database Systems.

[5]  Fabio Roli,et al.  Bayesian Analysis of Linear Combiners , 2007, MCS.

[6]  Mauricio G. C. Resende,et al.  Greedy Randomized Adaptive Search Procedures , 1995, J. Glob. Optim..

[7]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[8]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[10]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[11]  Jarek Nabrzyski,et al.  Grid resource management: state of the art and future trends , 2004 .

[12]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[13]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[14]  Panos M. Pardalos,et al.  Handbook of applied optimization , 2002 .

[15]  Mauricio G. C. Resende,et al.  Grasp: An Annotated Bibliography , 2002 .

[16]  Michal Wozniak,et al.  Decision Tree Induction Methods for Distributed Environment , 2009, ICMMI.

[17]  Robert P. W. Duin,et al.  Limits on the majority vote accuracy in classifier fusion , 2003, Pattern Analysis & Applications.