Distributed Analysis in CMS

The CMS experiment expects to manage several petabytes of data each year during the LHC programme, distributing them over many computing sites around the world and enabling data access at those centers for analysis. CMS has identified the distributed sites as the primary location for physics analysis, in order to support a wide community with thousands of potential users. This represents an unprecedented experimental challenge in terms of both the scale of the distributed computing resources and the number of users. An overview of the computing architecture, the software tools and the distributed infrastructure is reported. Summaries of the experience in establishing efficient and scalable operations in preparation for CMS distributed analysis are presented, followed by the experience of users in their current analysis activities.

Chris Brew | Liang Dong | Ricky Egeland | Giacinto Donvito | Jorgen D'Hondt | Tibor Kurca | Edward Karavakis | Bockjoo Kim | Kenneth Bloom | Daniele Cesini | Daniele Spiga | Kati Lassila-Perini | Carlos Kavka | Alessandra Fanfani | Daniele Bonacorsi | Giulio Eulisse | Federica Fanzago | Daniel Riley | Frank Würthwein | Yuyi Guo | Patricia McBride | Thomas Kress | Tony Wildish | Chih-Hao Huang | Sanjay Padhi | Claudio Grandi | Andrea Sartirana | Hassen Riahi | Stefano Belforte | Julia Andreeva | Dave Evans | L. A. T. Bauerdick | Giuseppe Bagliesi | Valentin Kuznetsov | Kalle Happonen | Tomas Lindén | Giuseppe Codispoti | Lee Lueking | Simon Metson | Ian Fisk | Andrea Sciabà | Eric Vaandering | Ilaria Villella | Stefano Lacaprara | David Dykstra | Gerhild Maier | Mattia Cinquilli | Vincenzo Miccio | Vijay Sekhri | José M. Hernández | Fabio Farina | Akram Khan | Matthias Kasemann | Petra Van Mulders | Christoph Wissing | Lassi A. Tuura | Barry Blumenfeld | Jesper Koivumäki | Pablo Saiz | M. Anzar Afaq | Jukka Klem | Nicolò Magini | Erik Edelmann | Lukas Vanelderen | Derek Feichtinger | Aresh Vedaee | Jose Afonso Sanches | Patricia Bittencourt Sampaio | Marco Calloni | Danilo N. Dongiovanni | Peter Elmer | Josep Flix | Kejing Kang | Peter Kreuzer | James Letts | Joris Maes | Haifeng Pi | Paul Rossman | Eric Wicklund