Wsrf-Based Services for Distributed Data Mining

Computational Grids can be effectively used as an infrastructure for distributed data mining and knowledge discovery in large data sets. To utilize Grids for high-performance knowledge discovery, software tools and mechanisms are needed. To this purpose we designed a system called Knowledge Grid and we are implementing its services as WSRF-compliant Grid Services. This chapter describes the composition of distributed knowledge discovery services, according to the service oriented architecture model, by using the Knowledge Grid environment. We discuss Grid Services for searching Grid resources, composing software and data elements, and executing the resulting data mining application on the Knowledge Grid. The chapter focuses in particular on the application modeling. Applications are designed using a UML model, which is translated into a BPEL representation, in turn processed by the Knowledge Grid services for its execution.