Distributed Technologies for Remote Access of HDF Data

Scientific simulations and experiments use sophisticated data formats to store and access their data. An example of such a format is the hierarchical data format (HDF), commonly used in fusion and plasma physics, geosciences, astronomy and medical research. Most HDF data gets generated remotely (at a remote supercomputer or an experimental site) and is not readily available to the scientists. Transferring the whole data to the users' machines for analysis and visualization might be prohibitive because of the size of the data, bandwidth limitations or local storage limitations. In addition, different subsets of data may be interesting for analysis at different times. Thus, scientists need a solution for querying and accessing subsets of remote data. This paper describes several approaches to provide an access to remote HDF data and compares their performances. It also gives some details about the solution that we consider the winner - a Globus-based Web service.

[1]  Brygg Ullmer,et al.  Progressive Retrieval and Hierarchical Visualization of Large Remote Data , 2001, Scalable Comput. Pract. Exp..

[2]  Steve Vinoski,et al.  Advanced CORBA® Programming with C++ , 1999 .

[3]  Kyle A. Gallivan,et al.  The gSOAP Toolkit for Web Services and Peer-to-Peer Computing Networks , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[4]  John R. Cary,et al.  Grid service for visualization and analysis of remote fusion data , 2004, Proceedings of the Second International Workshop on Challenges of Large Applications in Distributed Environments, 2004. CLADE 2004..