arXiv Open Access 2024

How Industry Tackles Anomalies during Runtime: Approaches and Key Monitoring Parameters

Monika Steidl Benedikt Dornauer Michael Felderer Rudolf Ramler Mircea-Cristian Racasan +1 lainnya
Lihat Sumber

Abstrak

Deviations from expected behavior during runtime, known as anomalies, have become more common due to the systems' complexity, especially for microservices. Consequently, analyzing runtime monitoring data, such as logs, traces for microservices, and metrics, is challenging due to the large volume of data collected. Developing effective rules or AI algorithms requires a deep understanding of this data to reliably detect unforeseen anomalies. This paper seeks to comprehend anomalies and current anomaly detection approaches across diverse industrial sectors. Additionally, it aims to pinpoint the parameters necessary for identifying anomalies via runtime monitoring data. Therefore, we conducted semi-structured interviews with fifteen industry participants who rely on anomaly detection during runtime. Additionally, to supplement information from the interviews, we performed a literature review focusing on anomaly detection approaches applied to industrial real-life datasets. Our paper (1) demonstrates the diversity of interpretations and examples of software anomalies during runtime and (2) explores the reasons behind choosing rule-based approaches in the industry over self-developed AI approaches. AI-based approaches have become prominent in published industry-related papers in the last three years. Furthermore, we (3) identified key monitoring parameters collected during runtime (logs, traces, and metrics) that assist practitioners in detecting anomalies during runtime without introducing bias in their anomaly detection approach due to inconclusive parameters.

Topik & Kata Kunci

Penulis (6)

M

Monika Steidl

B

Benedikt Dornauer

M

Michael Felderer

R

Rudolf Ramler

M

Mircea-Cristian Racasan

M

Marko Gattringer

Format Sitasi

Steidl, M., Dornauer, B., Felderer, M., Ramler, R., Racasan, M., Gattringer, M. (2024). How Industry Tackles Anomalies during Runtime: Approaches and Key Monitoring Parameters. https://arxiv.org/abs/2408.07816

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓