arXiv Open Access 2026

A Holistic Framework for Automated Configuration Recommendation for Cloud Service Monitoring

Anson Bastos Shreeya Venneti Anjaly Parayil Ayush Choure Chetan Bansal +1 lainnya
Lihat Sumber

Abstrak

Reliability of large-scale cloud services is critical for user satisfaction and business continuity. Despite significant investments in reliability engineering, production incidents remain inevitable, often leading to customer impact and operational overhead. In large cloud companies, multiple services are deployed across regions necessitating robust health monitoring systems. However, the current monitor configuration process is manual, largely reactive and ad hoc, resulting in gaps in coverage and redundant alerts. In this paper, we present a comprehensive study of monitor creation in Microsoft, identifying key components in the existing process. We further design a modular recommendation framework that processes the graph structured service entities to suggest optimal monitor configurations. Through extensive experimentation on historical data and user study of recommendations for production services at Microsoft, we demonstrate the efficacy of our approach in providing relevant recommendations for monitor configurations.

Topik & Kata Kunci

Penulis (6)

A

Anson Bastos

S

Shreeya Venneti

A

Anjaly Parayil

A

Ayush Choure

C

Chetan Bansal

R

Rujia Wang

Format Sitasi

Bastos, A., Venneti, S., Parayil, A., Choure, A., Bansal, C., Wang, R. (2026). A Holistic Framework for Automated Configuration Recommendation for Cloud Service Monitoring. https://arxiv.org/abs/2603.12268

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓