Issue |
EPJ Web Conf.
Volume 321, 2025
VII International Conference on Applied Physics, Information Technologies and Engineering (APITECH-VII-2025)
|
|
---|---|---|
Article Number | 02006 | |
Number of page(s) | 10 | |
Section | Condensed Matter Physics, Materials Science, and Nanoscale Phenomena | |
DOI | https://doi.org/10.1051/epjconf/202532102006 | |
Published online | 10 March 2025 |
https://doi.org/10.1051/epjconf/202532102006
Chaos engineering in cloud platforms
University of Haifa, Abba Khoushy Ave 199, Haifa, 3498838, Israel
* Corresponding author: bringomun@rambler.ru
Published online: 10 March 2025
This article explores the implementation and impact of chaos engineering (CE) on cloud platforms, emphasizing its role in enhancing system resilience and reliability. Various CE methodologies, such as fault injection and passive observation, are discussed, alongside case studies from major cloud service providers (AWS, Azure, GCP). The integration of artificial intelligence and machine learning in automating and refining chaos experiments is analyzed, highlighting its growing importance as cloud infrastructures become more complex. The potential of CE for proactive prediction and management of system vulnerabilities is underscored, illustrating its capability to preemptively identify and address weaknesses. Through a comprehensive review of existing research and practical case studies, this article aims to demonstrate how CE can effectively bolster cloud platform resilience. The market for CE tools, valued at two billion dollars in 2022, is projected to grow significantly, driven by increasing investment and the need for robust security models. The historical development of CE, from Netflix's Chaos Monkey to contemporary practices, is also covered, showcasing the evolution of CE and its adoption by tech giants to maintain system robustness and reliability.
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.