arXiv Open Access 2023

On the unreasonable vulnerability of transformers for image restoration -- and an easy fix

Shashank Agnihotri Kanchana Vaishnavi Gandikota Julia Grabinski Paramanand Chandramouli Margret Keuper
Lihat Sumber

Abstrak

Following their success in visual recognition tasks, Vision Transformers(ViTs) are being increasingly employed for image restoration. As a few recent works claim that ViTs for image classification also have better robustness properties, we investigate whether the improved adversarial robustness of ViTs extends to image restoration. We consider the recently proposed Restormer model, as well as NAFNet and the "Baseline network" which are both simplified versions of a Restormer. We use Projected Gradient Descent (PGD) and CosPGD, a recently proposed adversarial attack tailored to pixel-wise prediction tasks for our robustness evaluation. Our experiments are performed on real-world images from the GoPro dataset for image deblurring. Our analysis indicates that contrary to as advocated by ViTs in image classification works, these models are highly susceptible to adversarial attacks. We attempt to improve their robustness through adversarial training. While this yields a significant increase in robustness for Restormer, results on other networks are less promising. Interestingly, the design choices in NAFNet and Baselines, which were based on iid performance, and not on robust generalization, seem to be at odds with the model robustness. Thus, we investigate this further and find a fix.

Topik & Kata Kunci

Penulis (5)

S

Shashank Agnihotri

K

Kanchana Vaishnavi Gandikota

J

Julia Grabinski

P

Paramanand Chandramouli

M

Margret Keuper

Format Sitasi

Agnihotri, S., Gandikota, K.V., Grabinski, J., Chandramouli, P., Keuper, M. (2023). On the unreasonable vulnerability of transformers for image restoration -- and an easy fix. https://arxiv.org/abs/2307.13856

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓