论文信息 - Job Provenance - Insight into Very Large Provenance Datasets

Job Provenance - Insight into Very Large Provenance Datasets

Following the job-centric monitoring concept, Job Provenance (JP) service organizes provenance records on the per-job basis. It is designed to manage very large number of records, as was required in the EGEE project where it was developed originally. The quantitative aspect is also a focus of the presented demonstration. We show JP capability to retrieve data items of interest from a large dataset of full records of more than 1 million of jobs, to perform non-trivial transformation on those data, and organize the results in such a way that repeated interactive queries are possible. The application area of the demo is derived from that of previous Provenance Challenges. Though the topic of the demo -- a computational experiment -- is arranged rather artificially, the demonstration still delivers its main message that JP supports non-trivial transformations and interactive queries on large data sets.

[1] Simon Miles. Electronically Querying for the Provenance of Entities , 2006, IPAW.

[2] Zdenek Salvet,et al. gLite Job Provenance—a job‐centric view , 2008, Concurr. Comput. Pract. Exp..

[3] Luděk Matyska,et al. Experimental evaluation of job provenance in ATLAS environment , 2008 .

[4] D. Head,et al. Frontal-hippocampal double dissociation between normal aging and Alzheimer's disease. , 2005, Cerebral cortex.

[5] Zdenek Sustr,et al. Multiple Ligand Trajectory Docking Study - Semiautomatic Analysis of Molecular Dynamics Simulations using EGEE gLite Services , 2008, 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2008).

[6] Zdenek Salvet,et al. gLite Job Provenance , 2006, IPAW.