Volume 501 - 39th International Cosmic Ray Conference (ICRC2025) - Gamma-Ray Astrophysics
Enhancing CTAO Monitoring and Alarm Subsystems in Distributed Environments Using ServiMon
K. Munari*, A. Costa, F. Incardona, E. Mastriani, S. Spinello, S. Germani, P.G. Bruno  on behalf of the CTAO Consortium
*: corresponding author
Full text: pdf
Pre-published on: September 24, 2025
Published on:
Abstract
<p style="text-align: justify;">

ServiMon is a scalable data collection and auditing pipeline designed for service-oriented, cost-efficient quality control in distributed environments, including the CTAO monitoring, logging, and alarm subsystems. Developed within a Docker-based architecture, it leverages cloud-native technologies and distributed computing principles to enhance system observability and reliability.

<br>
<br>

At its core, ServiMon integrates key technologies such as Prometheus, Grafana, Kafka, and
Cassandra. Prometheus serves as the primary engine for real-time performance metric collection,
enabling efficient monitoring across multiple nodes. Grafana provides interactive, service-oriented
data visualization, facilitating system performance analysis. Additionally, Kafka and Cassandra
expose system metrics via the JMX Exporter, offering critical insights into infrastructure availability
and performance.

<br>
<br>

This contribution exposes how ServiMon could provide an enhancement on scalability, security,
and efficiency in a distributed computing environment, such as the CTAO monitoring, logging,
and alarm subsystems. This integrated approach not only ensures robust real-time monitoring, but
also optimizes operational costs. Furthermore, ServiMon’s ability to generate large volumes of
diverse data over time provides a strong foundation for predictive maintenance. By incorporating
stochastic and approximate computing techniques, it enables proactive failure detection and system
optimization, minimizing downtime and maximizing telescope availability.

</p>
DOI: https://doi.org/10.22323/1.501.0775
How to cite

Metadata are provided both in article format (very similar to INSPIRE) as this helps creating very compact bibliographies which can be beneficial to authors and readers, and in proceeding format which is more detailed and complete.

Open Access
Creative Commons LicenseCopyright owned by the author(s) under the term of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.