Issue |
EPJ Web Conf.
Volume 245, 2020
24th International Conference on Computing in High Energy and Nuclear Physics (CHEP 2019)
|
|
---|---|---|
Article Number | 10004 | |
Number of page(s) | 7 | |
Section | 10 - Crossover sessions from online, offline and exascale | |
DOI | https://doi.org/10.1051/epjconf/202024510004 | |
Published online | 16 November 2020 |
https://doi.org/10.1051/epjconf/202024510004
Let’s get our hands dirty: a comprehensive evaluation of DAQDB, key-value store for petascale hot storage
1
European Laboratory for Particle Physics, CERN, Geneva 23, CH-1211, Switzerland
2
Intel Technology Poland, ul. Slowackiego 173, 80-298 Gdansk, Poland
3
Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, IL 60439, USA
4
Fermi National Accelerator Laboratory, Batavia, IL 60510, USA
* e-mail: grzegorz.jereczek@intel.com
Published online: 16 November 2020
Data acquisition systems are a key component for successful data taking in any experiment. The DAQ is a complex distributed computing system and coordinates all operations, from the data selection stage of interesting events to storage elements. For the High Luminosity upgrade of the Large Hadron Collider, the experiments at CERN need to meet challenging requirements to record data with a much higher occupancy in the detectors. The DAQ system will receive and deliver data with a significantly increased trigger rate, one million events per second, and capacity, terabytes of data per second. An effective way to meet these requirements is to decouple real-time data acquisition from event selection. Data fragments can be temporarily stored in a large distributed key-value store. Fragments belonging to the same event can be then queried on demand, by the data selection processes. Implementing such a model relies on a proper combination of emerging technologies, such as persistent memory, NVMe SSDs, scalable networking, and data structures, as well as high performance, scalable software. In this paper, we present DAQDB (Data Acquisition Database) — an open source implementation of this design that was presented earlier, with an extensive evaluation of this approach, from the single node to the distributed performance. Furthermore, we complement our study with a description of the challenges faced and the lessons learned while integrating DAQDB with the existing software framework of the ATLAS experiment.
© The Authors, published by EDP Sciences, 2020
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.