| Issue |
EPJ Web Conf.
Volume 337, 2025
27th International Conference on Computing in High Energy and Nuclear Physics (CHEP 2024)
|
|
|---|---|---|
| Article Number | 01031 | |
| Number of page(s) | 8 | |
| DOI | https://doi.org/10.1051/epjconf/202533701031 | |
| Published online | 07 October 2025 | |
https://doi.org/10.1051/epjconf/202533701031
Development of machine-learning based app for anomaly detection in CMSWEB
1 CERN, Geneva, Switzerland.
2 Quaid-i-Azam University, Islamabad, Pakistan.
3 National Centre for Physics, Islamabad, Pakistan.
4 Cornell University, New York, USA.
* e-mail: nasir.hussain@cern.ch
** e-mail: muhammad.imran@ncp.edu.pk
*** e-mail: andreas.pfeiffer@cern.ch
**** e-mail: aroosha.pervaiz@cern.ch
† e-mail: vkuznet@protonmail.com
Published online: 7 October 2025
This paper presents an anomaly detection system for CMSWEB services, a critical component of CMS infrastructure supporting over two dozen web services. The system employs deep learning techniques to enhance service performance and reliability by detecting anomalies in real time. Multiple autoencoder-based models such as Hybrid CNN-LSTM, Hybrid CNN-GRU, GRU, LSTM, CNN, and a fully connected autoencoder were trained using data collected from the CMS Monit infrastructure. Hyperparameter tuning and continuous learning were implemented to adapt models to dynamic service behaviors. Anomaly detection thresholds were derived using statistical methods, including mean + 1.5 standard deviations, median absolute deviation, and the 95th and 99th percentiles of reconstruction errors. The Hybrid CNN-LSTM model demonstrated superior performance, achieving the lowest RMSE and MAE and the highest R-squared values. The developed app provides real-time insights, visualizations, and automated alerts for anomalies. This work significantly contributes to improving CMS operational efficiency by reducing downtime and enhancing system stability. Future research will focus on refining the monitoring dashboard, expanding anomaly insights, enabling user interaction for model training, and integrating expert feedback to further optimize detection processes.
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

