arXiv Open Access 2024

Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen

Alessandro Palma Till Richter Hanyi Zhang Manuel Lubetzki Alexander Tong +2 lainnya
Lihat Sumber

Abstrak

Generative modeling of single-cell RNA-seq data is crucial for tasks like trajectory inference, batch effect removal, and simulation of realistic cellular data. However, recent deep generative models simulating synthetic single cells from noise operate on pre-processed continuous gene expression approximations, overlooking the discrete nature of single-cell data, which limits their effectiveness and hinders the incorporation of robust noise models. Additionally, aspects like controllable multi-modal and multi-label generation of cellular data remain underexplored. This work introduces CellFlow for Generation (CFGen), a flow-based conditional generative model that preserves the inherent discreteness of single-cell data. CFGen generates whole-genome multi-modal single-cell data reliably, improving the recovery of crucial biological data characteristics while tackling relevant generative tasks such as rare cell type augmentation and batch correction. We also introduce a novel framework for compositional data generation using Flow Matching. By showcasing CFGen on a diverse set of biological datasets and settings, we provide evidence of its value to the fields of computational biology and deep generative models.

Penulis (7)

A

Alessandro Palma

T

Till Richter

H

Hanyi Zhang

M

Manuel Lubetzki

A

Alexander Tong

A

Andrea Dittadi

F

Fabian Theis

Format Sitasi

Palma, A., Richter, T., Zhang, H., Lubetzki, M., Tong, A., Dittadi, A. et al. (2024). Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen. https://arxiv.org/abs/2407.11734

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓