CASTOR: A Distributed Storage Resource Facility for High Performance Data Processing at CERN

Mass storage systems at CERN have evolved over time to meet growing requirements for both scalability and fault resilience. The CERN Advanced STORage manager (CASTOR) and its new disk cache management layer (CASTOR2) have been developed to meet the challenges raised by the experiments at the new accelerator that CERN is building: the Large Hadron Collider (LHC) [4]. The system must cope with hundreds of millions of files and tens of petabytes of storage, and must sustain a constant throughput of several gigabytes per second. In this paper, we detail CASTOR's architecture and implementation and present some operational aspects. Finally, we report the performance levels achieved by the current version, both in a production environment and during internal tests.

[1]  Olof Bärring, et al. Storage Resource Sharing with CASTOR, 2004, MSST.

[2]  Flavia Donno, et al. Storage Resource Managers: Recent International Experience on Requirements and Multiple Co-Operating Implementations, 2007, 24th IEEE Conference on Mass Storage Systems and Technologies (MSST 2007).

[3]  L. Evans. The Large Hadron Collider, 2007.

[4]  Fons Rademakers, et al. ROOT — An object oriented data analysis framework, 1997.

[5]  Ian T. Foster, et al. Globus: a Metacomputing Infrastructure Toolkit, 1997, Int. J. High Perform. Comput. Appl.

[6]  Ákos Frohner, et al. VOMS, an Authorization System for Virtual Organizations, 2003, European Across Grids Conference.