Semantic Scholar Open Access 2015 2622 sitasi

Privacy-preserving deep learning

R. Shokri Vitaly Shmatikov

Abstrak

Deep learning based on artificial neural networks is a very popular approach to modeling, classifying, and recognizing complex data such as images, speech, and text. The unprecedented accuracy of deep learning methods has turned them into the foundation of new AI-based services on the Internet. Commercial companies that collect user data on a large scale have been the main beneficiaries of this trend since the success of deep learning techniques is directly proportional to the amount of data available for training. Massive data collection required for deep learning presents obvious privacy issues. Users' personal, highly sensitive data such as photos and voice recordings is kept indefinitely by the companies that collect it. Users can neither delete it, nor restrict the purposes for which it is used. Furthermore, centrally kept data is subject to legal subpoenas and extrajudicial surveillance. Many data owners-for example, medical institutions that may want to apply deep learning methods to clinical records-are prevented by privacy and confidentiality concerns from sharing the data and thus benefitting from large-scale deep learning. In this paper, we present a practical system that enables multiple parties to jointly learn an accurate neural-network model for a given objective without sharing their input datasets. We exploit the fact that the optimization algorithms used in modern deep learning, namely, those based on stochastic gradient descent, can be parallelized and executed asynchronously. Our system lets participants train independently on their own datasets and selectively share small subsets of their models' key parameters during training. This offers an attractive point in the utility/privacy tradeoff space: participants preserve the privacy of their respective data while still benefitting from other participants' models and thus boosting their learning accuracy beyond what is achievable solely on their own inputs. We demonstrate the accuracy of our privacy-preserving deep learning on benchmark datasets.

Topik & Kata Kunci

Computer Science

Penulis (2)

R. Shokri

Vitaly Shmatikov

Format Sitasi

APA MLA BibTeX

Shokri, R., Shmatikov, V. (2015). Privacy-preserving deep learning. https://doi.org/10.1145/2810103.2813687

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →

Lihat di Sumber doi.org/10.1145/2810103.2813687

Informasi Jurnal

Tahun Terbit: 2015
Bahasa: en
Total Sitasi: 2622×
Sumber Database: Semantic Scholar
DOI: 10.1145/2810103.2813687
Akses: Open Access ✓