Issue |
EPJ Web Conf.
Volume 214, 2019
23rd International Conference on Computing in High Energy and Nuclear Physics (CHEP 2018)
|
|
---|---|---|
Article Number | 03004 | |
Number of page(s) | 7 | |
Section | T3 - Distributed computing | |
DOI | https://doi.org/10.1051/epjconf/201921403004 | |
Published online | 17 September 2019 |
https://doi.org/10.1051/epjconf/201921403004
Producing Madgraph5_aMC@NLO gridpacks and using TensorFlow GPU resources in the CMS HTCondor Global Pool
1
University of Nebraska-Lincoln,
Lincoln, NE,
USA
2
University of California San Diego,
La Jolla, CA,
USA
3
Autonomous University of Puebla,
Puebla,
Mexico
4
University of Notre Dame,
Notre Dame, IN,
USA
5
Fermi National Accelerator Laboratory,
Batavia, IL,
USA
6
Port d’Informació Científica,
Barcelona,
Spain
7
Centro de Investigaciones Energéticas Medioambientales y Tecnológicas,
Madrid,
Spain
8
University of Sofia,
Sofia,
Bulgaria
Published online: 17 September 2019
The CMS experiment has an HTCondor Global Pool, composed of more than 200K CPU cores available for Monte Carlo production and the analysis of da.The submission of user jobs to this pool is handled by either CRAB, the standard workflow management tool used by CMS users to submit analysis jobs requiring event processing of large amounts of data, or by CMS Connect, a service focused on final stage condor-like analysis jobs and applications that already have a workflow job manager in place. The latest scenario canbring cases in which workflows need further adjustments in order to efficiently work in a globally distributed pool of resources. For instance, the generation of matrix elements for high energy physics processes via Madgraph5_aMC@NLO and the usage of tools not (yet) fully supported by the CMS software, such as Ten-sorFlow with GPUsupport, are tasks with particular requirements. A special adaption, either at the pool factory level (advertising GPU resources) or at the execute level (e.g: to handle special parameters that describe certain needs for the remote execute nodes during submission) is needed in order to adequately work in the CMS global pool. This contribution describes the challenges and efforts performed towards adaptingsuch workflows so they can properly profit from the Global Pool via CMS Connect.
© The Authors, published by EDP Sciences, 2019
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.