Enabling data analysis à la PROOF on the Italian ATLAS Tier-2s using PoD

We describe our experience using PROOF for data analysis on the Italian ATLAS-Tier2 in Frascati, Napoli and Roma1. To enable PROOF on the cluster we used PoD, Proof-on-Demand. PoD is a set of tools designed to interact with any Local Resource Management System (LRMS) to start the PROOF daemons. In this way any user can quickly setup its own PROOF cluster on the resources, with the LRMS taking care of scheduling, priorities and accounting. Usage of PoD has steadily increased in the last years, and the product has now reached a production level quality. PoD features an abstract interface to LRMSs and provides plugins for several LRMSs. In our tests we used both the gLite and PBS plug-ins, the latter being the native LRMS handling the resources under test. Data were accessed via XRootD with file discovery provided by the standard ATLAS tools. The Storage Element was Disk Pool Manager (DPM) which traditionally uses RFIO rfio data access protocol; we added XRootD on top of this system so PoD could access the data. We describe the configuration and setup details and the results of some benchmark tests we run on the facility.

[1]  Johannes Elmsheuser,et al.  Functional and large-scale testing of the ATLAS distributed analysis facilities with Ganga , 2010 .

[2]  Dario Barberis,et al.  The ATLAS computing model , 2008 .

[3]  Johannes Elmsheuser,et al.  Ganga: A tool for computational-task management and easy access to Grid resources , 2009, Comput. Phys. Commun..

[4]  Predrag Buncic,et al.  Studying ROOT I/O performance with PROOF-Lite , 2011 .

[5]  T Maeno,et al.  PanDA: distributed production and distributed analysis system for ATLAS , 2008 .

[6]  Lorenzo Moneta,et al.  ROOT - A C++ framework for petabyte data storage, statistical analysis and visualization , 2009, Comput. Phys. Commun..

[7]  David Berge,et al.  SFrame: A high-performance ROOT-based framework for HEP data analysis , 2010 .

[8]  Ricardo Rocha,et al.  Web enabled data management with DPM & LFC , 2012 .

[9]  Rene Brun,et al.  The PROOF Distributed Parallel Analysis Framework based on ROOT , 2003 .

[10]  Predrag Buncic,et al.  Software installation and condition data distribution via CernVM File System in ATLAS , 2012 .

[11]  Dario Barberis,et al.  The Evolution of the ATLAS Computing Model , 2010 .

[12]  J Cranshaw,et al.  The ATLAS ROOT-based data formats: recent improvements and performance measurements , 2012 .

[13]  Amir Farbin,et al.  ATLAS analysis model , 2006 .

[14]  Ricardo Rocha,et al.  DPM: Future Proof Storage , 2012 .

[15]  Claudia Ciocca,et al.  Deployment of job priority mechanisms in the Italian Cloud of the ATLAS experiment , 2010 .

[16]  Cedric Serfon,et al.  Evolving ATLAS Computing For Today's Networks , 2012 .

[17]  A Brunengo,et al.  ATLAS computing activities and developments in the Italian Grid cloud , 2012 .