A Case Study in Using ADAGE for Compute-Intensive Financial Analysis Processes

The Ad hoc DAta Grid Environment (ADAGE) has been proposed as a framework to support analysis processes for large repositories of ad hoc data. Its use of a service-oriented architecture (SOA) brings the promise of flexibility, as well as enabling domain experts to define their own analysis processes at a high level of abstraction. However, these claims have not been verified empirically and the performance penalty of using additional abstract software layers has not been assessed on complex problems. This chapter describes a case study involving a realistic analysis process conducted by an expert user. It assesses the benefits and drawbacks of using the ADAGE approach versus conventional manual analysis processes. This chapter also outlines some avenues for future research to address existing limitations.

[1]  R. Peng Reproducible Research in Computational Science , 2011, Science.

[2]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .

[3]  Magdalena Balazinska,et al.  Biology and data-intensive scientific discovery in the beginning of the 21st century. , 2011, Omics : a journal of integrative biology.

[4]  F. Hartwig,et al.  Exploratory Data Analysis , 2008, Using Science in Cybersecurity.

[5]  Nancy Wilkins-Diehr,et al.  Special Issue: Science Gateways—Common Community Interfaces to Grid Resources , 2007, Concurr. Comput. Pract. Exp..

[6]  John R. Rice,et al.  GasTurbnLab: a multidisciplinary problem solving environment for gas turbine engine design on a network of nonhomogeneous machines , 2002 .

[7]  Jan M. Zytkow,et al.  Knowledge discovery in databases: the purpose, necessity, and challenges , 2002 .

[8]  Layne T. Watson,et al.  WBCSim: an environment for modeling wood-based composites manufacture , 2006, Engineering with Computers.

[9]  R. Gencay,et al.  An Introduc-tion to High-Frequency Finance , 2001 .

[10]  Douglas Thain,et al.  Biocompute 2.0: an improved collaborative workspace for data intensive bio‐science , 2011, Concurr. Comput. Pract. Exp..

[11]  E. Gallopoulos,et al.  Problem-solving Environments For Computational Science , 1997, IEEE Computational Science and Engineering.

[12]  Dennis Kundisch,et al.  A User-Driven SOA for Financial Market Data Analysis , 2010, Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model..

[13]  Gary J. Nutt,et al.  Open Systems , 2019 .

[14]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[15]  Fethi A. Rabhi,et al.  A data model for processing financial market and news data , 2009 .

[16]  Lawrence Yao,et al.  Modelling Exploratory Analysis Processes for eResearch , 2010 .

[17]  Diego Scardaci,et al.  The GENIUS Grid Portal: Its Architecture, Improvements of Features, and New Implementations about Authentication and Authorization , 2007, 16th IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE 2007).

[18]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[19]  Alexander S. Szalay,et al.  Extreme Data-Intensive Scientific Computing , 2011, Computing in Science & Engineering.

[20]  Fethi A. Rabhi,et al.  ADAGE: a framework for supporting user-driven ad-hoc data analysis processes , 2012, Computing.

[21]  Jan M. Zytkow,et al.  Handbook of Data Mining and Knowledge Discovery , 2002 .

[22]  Paul Watson,et al.  e-Science Central for CARMEN: science as a service , 2010 .