DIRAC, LHCb's Grid Workload and Data Management System, utilizes WLCG resources and middleware components to perform distributed computing tasks satisfying LHCb's Computing Model. The Data Management System (DMS) handles data transfer and data access within LHCb. Its scope ranges from the output of the LHCb Online system to Grid-enabled storage for all data types. It supports metadata for these files in replica and bookkeeping catalogues, allowing dataset selection and localization. The DMS controls the movement of files in a redundant fashion whilst providing utilities for accessing all metadata. To do these tasks effectively the DMS requires complete self integrity between its components and external physical storage. The DMS provides highly redundant management of all LHCb data to leverage available storage resources and to manage transient errors in underlying services. It provides data driven and reliable distribution of files as well as reliable job output upload, utilizing VO Boxes at LHCb Tier1 sites to prevent data loss. This paper presents several examples of mechanisms implemented in the DMS to increase reliability, availability and integrity, highlighting successful design choices and limitations discovered.
[1]
Andrei Tsaregorodtsev,et al.
DIRAC optimized workload management
,
2008
.
[2]
Lawrence M. Seiford,et al.
Recent developments in dea : the mathematical programming approach to frontier analysis
,
1990
.
[3]
D Gregori,et al.
LHCb Computing TDR
,
2005
.
[4]
Joel Closier,et al.
DIRAC: a community grid solution
,
2008
.
[5]
Johan Blouw,et al.
The LHCb Computing Data Challenge DC06
,
2008
.
[6]
Marianne Bargiotti,et al.
DIRAC Data Management: consistency, integrity and coherence of data
,
2008
.
[7]
Brian Bockelman,et al.
Scaling CMS data transfer system for LHC start-up
,
2008
.
[8]
Flavia Donno,et al.
Storage Resource Managers: Recent International Experience on Requirements and Multiple Co-Operating Implementations
,
2007,
24th IEEE Conference on Mass Storage Systems and Technologies (MSST 2007).
[9]
Flavia Donno,et al.
Storage Resource Manager Version 2.2: design, implementation, and testing experience
,
2008
.
[10]
Mario Lassnig,et al.
Managing ATLAS data on a petabyte-scale with DQ2
,
2008
.