arXiv Open Access 2024

DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model

Rongqing Li Jiaqi Yu Changsheng Li Wenhan Luo Ye Yuan +1 lainnya
Lihat Sumber

Abstrak

Deep learning models are usually black boxes when deployed on machine learning platforms. Prior works have shown that the attributes (e.g., the number of convolutional layers) of a target black-box model can be exposed through a sequence of queries. There is a crucial limitation: these works assume the training dataset of the target model is known beforehand and leverage this dataset for model attribute attack. However, it is difficult to access the training dataset of the target black-box model in reality. Therefore, whether the attributes of a target black-box model could be still revealed in this case is doubtful. In this paper, we investigate a new problem of black-box reverse engineering, without requiring the availability of the target model's training dataset. We put forward a general and principled framework DREAM, by casting this problem as out-of-distribution (OOD) generalization. In this way, we can learn a domain-agnostic meta-model to infer the attributes of the target black-box model with unknown training data. This makes our method one of the kinds that can gracefully apply to an arbitrary domain for model attribute reverse engineering with strong generalization ability. Extensive experimental results demonstrate the superiority of our proposed method over the baselines.

Topik & Kata Kunci

Penulis (6)

R

Rongqing Li

J

Jiaqi Yu

C

Changsheng Li

W

Wenhan Luo

Y

Ye Yuan

G

Guoren Wang

Format Sitasi

Li, R., Yu, J., Li, C., Luo, W., Yuan, Y., Wang, G. (2024). DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model. https://arxiv.org/abs/2412.05842

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓