arXiv Open Access 2025

Prototypical Contrastive Learning For Improved Few-Shot Audio Classification

Christos Sgouropoulos Christos Nikou Stefanos Vlachos Vasileios Theiou Christos Foukanelis +1 lainnya
Lihat Sumber

Abstrak

Few-shot learning has emerged as a powerful paradigm for training models with limited labeled data, addressing challenges in scenarios where large-scale annotation is impractical. While extensive research has been conducted in the image domain, few-shot learning in audio classification remains relatively underexplored. In this work, we investigate the effect of integrating supervised contrastive loss into prototypical few shot training for audio classification. In detail, we demonstrate that angular loss further improves the performance compared to the standard contrastive loss. Our method leverages SpecAugment followed by a self-attention mechanism to encapsulate diverse information of augmented input versions into one unified embedding. We evaluate our approach on MetaAudio, a benchmark including five datasets with predefined splits, standardized preprocessing, and a comprehensive set of few-shot learning models for comparison. The proposed approach achieves state-of-the-art performance in a 5-way, 5-shot setting.

Topik & Kata Kunci

Penulis (6)

C

Christos Sgouropoulos

C

Christos Nikou

S

Stefanos Vlachos

V

Vasileios Theiou

C

Christos Foukanelis

T

Theodoros Giannakopoulos

Format Sitasi

Sgouropoulos, C., Nikou, C., Vlachos, S., Theiou, V., Foukanelis, C., Giannakopoulos, T. (2025). Prototypical Contrastive Learning For Improved Few-Shot Audio Classification. https://arxiv.org/abs/2509.10074

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓