EPJ Web Conf.
Volume 214, 201923rd International Conference on Computing in High Energy and Nuclear Physics (CHEP 2018)
|Number of page(s)||8|
|Section||T3 - Distributed computing|
|Published online||17 September 2019|
Improving the Scheduling Efficiency of a Global Multi-Core HTCondor Pool in CMS
University of Nebraska-Lincoln, Lincoln,
2 Benémerita Universidad Autónoma de Puebla, Puebla, México
3 University of Notre Dame, Notre Dame IN USA
4 University of Sofia, Sofia, Bulgaria
5 Fermi National Accelerator Laboratory, Batavia, IL USA
6 University of Malaya, Kuala Lumpur, Malaysia
7 University of California San Diego, La Jolla, CA USA
8 Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid, Spain
9 Port d'Informació Científica (PIC), Barcelona, Spain
* Corresponding author: firstname.lastname@example.org
Published online: 17 September 2019
Scheduling multi-core workflows in a global HTCondor pool is a multi-dimensional problem whose solution depends on the requirements of the job payloads, the characteristics of available resources, and the boundary conditions such as fair share and prioritization imposed on the job matching to resources. Within the context of a dedicated task force, CMS has increased significantly the scheduling efficiency of workflows in reusable multi-core pilots by various improvements to the limitations of the GlideinWMS pilots, accuracy of resource requests, efficiency and speed of the HTCondor infrastructure, and job matching algorithms.
© The Authors, published by EDP Sciences, 2019
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Initial download of the metrics may take a while.