arXiv Open Access 2025

Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification

Peidong Wei Shiyu Miao Lin Li

Lihat Sumber

Abstrak

Deep neural networks have been applied to audio spectrograms for respiratory sound classification, but it remains challenging to achieve satisfactory performance due to the scarcity of available data. Moreover, domain mismatch may be introduced into the trained models as a result of the respiratory sound samples being collected from various electronic stethoscopes, patient demographics, and recording environments. To tackle this issue, we proposed a modified MaskedAutoencoder(MAE) model, named Disentangling Dual-Encoder MAE (DDE-MAE) for respiratory sound classification. Two independent encoders were designed to capture disease-related and disease-irrelevant information separately, achieving feature disentanglement to reduce the domain mismatch. Our method achieves a competitive performance on the ICBHI dataset.

Topik & Kata Kunci

eess.AS cs.SD

Penulis (3)

Peidong Wei

Shiyu Miao

Lin Li

Format Sitasi

APA MLA BibTeX

Wei, P., Miao, S., Li, L. (2025). Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification. https://arxiv.org/abs/2506.10698

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓