arXiv Open Access 2024

Dual-Label Learning With Irregularly Present Labels

Mingqian Li Qiao Han Ruifeng Li Yao Yang Hongyang Chen

Abstract

In multi-task learning, labels are often missing irregularly across samples, so that a given sample may be fully labeled, partially labeled, or unlabeled. Such irregular label presence is common in scientific studies because of experimental limitations. It creates a demand for a new training and inference mechanism that can accommodate irregularly present labels and maximize their utility. This work focuses on the two-label learning task and proposes a novel training and inference framework, Dual-Label Learning (DLL). The DLL framework formulates the problem as a dual-function system, in which the two functions should simultaneously satisfy standard supervision, structural duality, and probabilistic duality. DLL features a dual-tower model architecture that allows for explicit information exchange between labels, with the aim of maximizing the utility of partially available labels. During training, missing labels are imputed as part of the forward propagation process, while during inference, labels are predicted jointly as the unknowns of a bivariate system of equations. Our theoretical analysis guarantees the feasibility of DLL, and extensive experiments verify that, by explicitly modeling label correlation and maximizing label utility, our method makes consistently better predictions than baseline approaches, with up to a 9.6% gain in F1-score or a 10.2% reduction in MAPE. Remarkably, DLL maintains robust performance at a label-missing rate of up to 60%, achieving even better results than baseline approaches do at lower missing rates, down to only 10%.
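The joint-inference idea in the abstract (two labels treated as the unknowns of a bivariate system) can be illustrated with a toy sketch. Everything below is an assumption made for illustration only: the abstract does not say how the system is solved, so plain fixed-point iteration is swapped in here, and the names `tower_1`, `tower_2`, `solve_dual_labels`, and `impute_missing` are hypothetical stand-ins, not the authors' code or API.

```python
from typing import Callable, Optional, Tuple

# Hypothetical "towers": each predicts one label from the input x and the
# other label. Toy linear maps stand in for trained networks here.
def tower_1(x: float, y2: float) -> float:
    return 0.5 * x + 0.25 * y2

def tower_2(x: float, y1: float) -> float:
    return 1.0 - 0.5 * y1

def solve_dual_labels(
    f1: Callable[[float, float], float],
    f2: Callable[[float, float], float],
    x: float,
    max_iter: int = 100,
    tol: float = 1e-10,
) -> Tuple[float, float]:
    """Solve y1 = f1(x, y2), y2 = f2(x, y1) by fixed-point iteration.

    Assumption: the composed update is a contraction, so iteration converges.
    This solver is an illustrative choice, not the method from the paper.
    """
    y1, y2 = 0.0, 0.0
    for _ in range(max_iter):
        y1_new = f1(x, y2)
        y2_new = f2(x, y1_new)
        converged = abs(y1_new - y1) < tol and abs(y2_new - y2) < tol
        y1, y2 = y1_new, y2_new
        if converged:
            break
    return y1, y2

def impute_missing(
    x: float, y1: Optional[float], y2: Optional[float]
) -> Tuple[float, float]:
    """Fill in whichever label is absent, mimicking imputation in a forward pass."""
    if y1 is None and y2 is None:
        return solve_dual_labels(tower_1, tower_2, x)  # unlabeled sample
    if y1 is None:
        return tower_1(x, y2), y2  # only label 2 observed
    if y2 is None:
        return y1, tower_2(x, y1)  # only label 1 observed
    return y1, y2  # fully labeled sample

y1_hat, y2_hat = solve_dual_labels(tower_1, tower_2, x=2.0)
```

For these toy towers the system has the closed-form solution y1 = 10/9 and y2 = 4/9, which the iteration approaches within a handful of steps; with real learned towers, convergence would depend on properties the abstract does not state.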

Topics & Keywords

cs.LG

Citation Format

Li, M., Han, Q., Li, R., Yang, Y., Chen, H. (2024). Dual-Label Learning With Irregularly Present Labels. https://arxiv.org/abs/2410.14380

Journal Information

Publication Year: 2024
Language: en
Source Database: arXiv
Access: Open Access ✓