arXiv Open Access 2022

Active Learning Framework to Automate NetworkTraffic Classification

Jaroslav Pešek Dominik Soukup Tomáš Čejka
Lihat Sumber

Abstrak

Recent network traffic classification methods benefitfrom machine learning (ML) technology. However, there aremany challenges due to use of ML, such as: lack of high-qualityannotated datasets, data-drifts and other effects causing aging ofdatasets and ML models, high volumes of network traffic etc. Thispaper argues that it is necessary to augment traditional workflowsof ML training&deployment and adapt Active Learning concepton network traffic analysis. The paper presents a novel ActiveLearning Framework (ALF) to address this topic. ALF providesprepared software components that can be used to deploy an activelearning loop and maintain an ALF instance that continuouslyevolves a dataset and ML model automatically. The resultingsolution is deployable for IP flow-based analysis of high-speed(100 Gb/s) networks, and also supports research experiments ondifferent strategies and methods for annotation, evaluation, datasetoptimization, etc. Finally, the paper lists some research challengesthat emerge from the first experiments with ALF in practice.

Topik & Kata Kunci

Penulis (3)

J

Jaroslav Pešek

D

Dominik Soukup

T

Tomáš Čejka

Format Sitasi

Pešek, J., Soukup, D., Čejka, T. (2022). Active Learning Framework to Automate NetworkTraffic Classification. https://arxiv.org/abs/2211.08399

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2022
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓