CrossRef Open Access 2021

Agglomerative likelihood clustering

Lionel Yelibi Tim Gebbie

Abstrak

Abstract We consider the problem of fast time-series data clustering. Building on previous work modeling, the correlation-based Hamiltonian of spin variables we present an updated fast non-expensive agglomerative likelihood clustering algorithm (ALC). The method replaces the optimized genetic algorithm based approach (f-SPC) with an agglomerative recursive merging framework inspired by previous work in econophysics and community detection. The method is tested on noisy synthetic correlated time-series datasets with a built-in cluster structure to demonstrate that the algorithm produces meaningful non-trivial results. We apply it to time-series datasets as large as 20 000 assets and we argue that ALC can reduce computation time costs and resource usage costs for large scale clustering for time-series applications while being serialized, and hence has no obvious parallelization requirement. The algorithm can be an effective choice for state-detection for online learning in a fast non-linear data environment, because the algorithm requires no prior information about the number of clusters.

Penulis (2)

L

Lionel Yelibi

T

Tim Gebbie

Format Sitasi

Yelibi, L., Gebbie, T. (2021). Agglomerative likelihood clustering. https://doi.org/10.1088/1742-5468/ac3661

Akses Cepat

Lihat di Sumber doi.org/10.1088/1742-5468/ac3661
Informasi Jurnal
Tahun Terbit
2021
Bahasa
en
Sumber Database
CrossRef
DOI
10.1088/1742-5468/ac3661
Akses
Open Access ✓