arXiv Open Access 2022

Introducing Vision Transformer for Alzheimer's Disease classification task with 3D input

Zilun Zhang Farzad Khalvati
Lihat Sumber

Abstrak

Many high-performance classification models utilize complex CNN-based architectures for Alzheimer's Disease classification. We aim to investigate two relevant questions regarding classification of Alzheimer's Disease using MRI: "Do Vision Transformer-based models perform better than CNN-based models?" and "Is it possible to use a shallow 3D CNN-based model to obtain satisfying results?" To achieve these goals, we propose two models that can take in and process 3D MRI scans: Convolutional Voxel Vision Transformer (CVVT) architecture, and ConvNet3D-4, a shallow 4-block 3D CNN-based model. Our results indicate that the shallow 3D CNN-based models are sufficient to achieve good classification results for Alzheimer's Disease using MRI scans.

Topik & Kata Kunci

Penulis (2)

Z

Zilun Zhang

F

Farzad Khalvati

Format Sitasi

Zhang, Z., Khalvati, F. (2022). Introducing Vision Transformer for Alzheimer's Disease classification task with 3D input. https://arxiv.org/abs/2210.01177

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2022
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓