Rucio – The next generation of large scale distributed system for ATLAS Data Management

Rucio is the next-generation Distributed Data Management (DDM) system benefiting from recent advances in cloud and "Big Data" computing to address HEP experiments scaling requirements. Rucio is an evolution of the ATLAS DDM system Don Quijote 2 (DQ2), which has demonstrated very large scale data management capabilities with more than 140 petabytes spread worldwide across 130 sites, and accesses from 1,000 active users. However, DQ2 is reaching its limits in terms of scalability, requiring a large number of support staff to operate and being hard to extend with new technologies. Rucio will deal with these issues by relying on a conceptual data model and new technology to ensure system scalability, address new user requirements and employ new automation framework to reduce operational overheads. We present the key concepts of Rucio, including its data organization/representation and a model of how to manage central group and user activities. The Rucio design, and the technology it employs, is described, specifically looking at its RESTful architecture and the various software components it uses. We show also the performance of the system.

[1]  Barry Leiba,et al.  OAuth Web Authorization Protocol , 2012, IEEE Internet Computing.

[2]  Roy T. Fielding,et al.  Principled design of the modern Web architecture , 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium.

[3]  Rick Copeland,et al.  Essential SQLAlchemy , 2008 .

[4]  Graeme Stewart,et al.  ATLAS Replica Management in Rucio: Replication Rules and Subscriptions , 2014 .

[5]  Gergely V. Záruba,et al.  Abstract: PanDA: Next Generation Workload Management and Analysis System for Big Data , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[6]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[7]  Graeme Stewart,et al.  The ATLAS Distributed Data Management project: Past and Future , 2012 .

[8]  Graeme Stewart,et al.  DDM Workload Emulation , 2014 .