Embedded Continual Learning for High-Energy Physics

Marco Barbone; Christopher Brown; Georgi Gaydadjiev; Thomas Maguire; Mikael Mieskolainen; Benjamin Radburn-Smith; Wayne Luk; Alexander Tapper

doi:10.1051/epjconf/202429509014

Open Access

Issue		EPJ Web of Conf. Volume 295, 2024: 26th International Conference on Computing in High Energy and Nuclear Physics (CHEP 2023)


Article Number		09014
Number of page(s)		8
Section		Artificial Intelligence and Machine Learning
DOI		https://doi.org/10.1051/epjconf/202429509014
Published online		06 May 2024

EPJ Web of Conferences 295, 09014 (2024)
https://doi.org/10.1051/epjconf/202429509014

Marco Barbone¹*, Christopher Brown¹, Georgi Gaydadjiev², Thomas Maguire², Mikael Mieskolainen¹, Benjamin Radburn-Smith¹, Wayne Luk¹ and Alexander Tapper¹

¹ Imperial College London, South Kensington, London, United Kingdom
² Bernoulli Institute, University of Groningen, The Netherlands

* e-mail: m.barbone19@imperial.ac.uk

Abstract

Neural Networks (NN) are often trained offline on large datasets and deployed on specialised hardware for inference, with a strict separation between training and inference. However, in many realistic applications the training environment differs from the real world, or data arrives in a streaming fashion and is continuously changing. In these scenarios, the ability to continuously train and update NN models is desirable. Continual learning (CL) algorithms allow models to be trained on a stream of data. CL algorithms are often designed to work in constrained settings, such as limited memory and computational power, or limitations on the ability to store past data (e.g., due to privacy concerns or memory requirements). High-energy physics experiments are developing intelligent detectors, with algorithms running on computer systems located close to the detector to meet the challenges of increased data rates and occupancies. The use of NN algorithms in this context is limited by changing detector conditions, such as degradation over time or failure of an input signal, which might cause the NNs to lose accuracy, leading in the worst case to the loss of interesting events. CL has the potential to solve this issue, using large amounts of continuously streaming data to allow the network to recognise changes and to learn and adapt to detector conditions. It also has the potential to outperform traditional NN training techniques, as not all possible scenarios can be predicted and modelled in static training data samples. However, NN training is computationally expensive, and given the strict timing requirements of embedded processors deployed close to the detector, current state-of-the-art offline approaches cannot be directly applied to real-time systems.
Alternatives to typical backpropagation-based training that can be deployed on FPGAs for real-time data processing are presented, and their computational and accuracy characteristics are discussed in the context of the High-Luminosity LHC.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
