arXiv Open Access 2026

Transfer Learning with Network Embeddings under Structured Missingness

Mengyan Li Xiaoou Li Kenneth D Mandl Tianxi Cai
Lihat Sumber

Abstrak

Modern data-driven applications increasingly rely on large, heterogeneous datasets collected across multiple sites. Differences in data availability, feature representation, and underlying populations often induce structured missingness, complicating efforts to transfer information from data-rich settings to those with limited data. Many transfer learning methods overlook this structure, limiting their ability to capture meaningful relationships across sites. We propose TransNEST (Transfer learning with Network Embeddings under STructured missingness), a framework that integrates graphical data from source and target sites with prior group structure to construct and refine network embeddings. TransNEST accommodates site-specific features, captures within-group heterogeneity and between-site differences adaptively, and improves embedding estimation under partial feature overlap. We establish the convergence rate for the TransNEST estimator and demonstrate strong finite-sample performance in simulations. We apply TransNEST to a multi-site electronic health record study, transferring feature embeddings from a general hospital system to a pediatric hospital system. Using a hierarchical ontology structure, TransNEST improves pediatric embeddings and supports more accurate pediatric knowledge extraction, achieving the best accuracy for identifying pediatric-specific relational feature pairs compared with benchmark methods.

Topik & Kata Kunci

Penulis (4)

M

Mengyan Li

X

Xiaoou Li

K

Kenneth D Mandl

T

Tianxi Cai

Format Sitasi

Li, M., Li, X., Mandl, K.D., Cai, T. (2026). Transfer Learning with Network Embeddings under Structured Missingness. https://arxiv.org/abs/2602.19922

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓