arXiv Open Access 2023

FLASC: A Flare-Sensitive Clustering Algorithm

D. M. Bot J. Peeters J. Liesenborgs J. Aerts
Lihat Sumber

Abstrak

Clustering algorithms are often used to find subpopulations in exploratory data analysis workflows. Not only the clusters themselves, but also their shape can represent meaningful subpopulations. In this paper, we present FLASC, an algorithm that detects branches within clusters to identify such subpopulations. FLASC builds upon HDBSCAN*, a state-of-the-art density-based clustering algorithm, and detects branches in a post-processing step that describes within-cluster connectivity. Two variants of the algorithm are presented, which trade computational cost for noise robustness. We show that both variants scale similarly to HDBSCAN* in terms of computational cost and provide stable outputs using synthetic data sets, resulting in an efficient flare-sensitive clustering algorithm. In addition, we demonstrate the benefit of branch-detection on two real-world data sets.

Topik & Kata Kunci

Penulis (4)

D

D. M. Bot

J

J. Peeters

J

J. Liesenborgs

J

J. Aerts

Format Sitasi

Bot, D.M., Peeters, J., Liesenborgs, J., Aerts, J. (2023). FLASC: A Flare-Sensitive Clustering Algorithm. https://arxiv.org/abs/2311.15887

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓