arXiv Open Access 2025

Improving Deep Learning-based Respiratory Sound Analysis with Frequency Selection and Attention Mechanism

Nouhaila Fraihi Ouassim Karrakchou Mounir Ghogho
Lihat Sumber

Abstrak

Accurate classification of respiratory sounds requires deep learning models that effectively capture fine-grained acoustic features and long-range temporal dependencies. Convolutional Neural Networks (CNNs) are well-suited for extracting local time-frequency patterns but are limited in modeling global context. In contrast, transformer-based models can capture long-range dependencies, albeit with higher computational demands. To address these limitations, we propose a compact CNN-Temporal Self-Attention (CNN-TSA) network that integrates lightweight self-attention into an efficient CNN backbone. Central to our approach is a Frequency Band Selection (FBS) module that suppresses noisy and non-informative frequency regions, substantially improving accuracy and reducing FLOPs by up to 50%. We also introduce age-specific models to enhance robustness across diverse patient groups. Evaluated on the SPRSound-2022/2023 and ICBHI-2017 lung sound datasets, CNN-TSA with FBS sets new benchmarks on SPRSound and achieves state-of-the-art performance on ICBHI, all with a significantly smaller computational footprint. Furthermore, integrating FBS into an existing transformer baseline yields a new record on ICBHI, confirming FBS as an effective drop-in enhancement. These results demonstrate that our framework enables reliable, real-time respiratory sound analysis suitable for deployment in resource-constrained settings.

Topik & Kata Kunci

Penulis (3)

N

Nouhaila Fraihi

O

Ouassim Karrakchou

M

Mounir Ghogho

Format Sitasi

Fraihi, N., Karrakchou, O., Ghogho, M. (2025). Improving Deep Learning-based Respiratory Sound Analysis with Frequency Selection and Attention Mechanism. https://arxiv.org/abs/2507.20052

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓