arXiv Open Access 2024

A Comparison of SynDiffix Multi-table versus Single-table Synthetic Data

Paul Francis
Lihat Sumber

Abstrak

SynDiffix is a new open-source tool for structured data synthesis. It has anonymization features that allow it to generate multiple synthetic tables while maintaining strong anonymity. Compared to the more common single-table approach, multi-table leads to more accurate data, since only the features of interest for a given analysis need be synthesized. This paper compares SynDiffix with 15 other commercial and academic synthetic data techniques using the SDNIST analysis framework, modified by us to accommodate multi-table synthetic data. The results show that SynDiffix is many times more accurate than other approaches for low-dimension tables, but somewhat worse than the best single-table techniques for high-dimension tables.

Topik & Kata Kunci

Penulis (1)

P

Paul Francis

Format Sitasi

Francis, P. (2024). A Comparison of SynDiffix Multi-table versus Single-table Synthetic Data. https://arxiv.org/abs/2403.08463

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓