arXiv Open Access 2022

Introducing Vision Transformer for Alzheimer's Disease classification task with 3D input

Zilun Zhang Farzad Khalvati

Lihat Sumber

Abstrak

Many high-performance classification models utilize complex CNN-based architectures for Alzheimer's Disease classification. We aim to investigate two relevant questions regarding classification of Alzheimer's Disease using MRI: "Do Vision Transformer-based models perform better than CNN-based models?" and "Is it possible to use a shallow 3D CNN-based model to obtain satisfying results?" To achieve these goals, we propose two models that can take in and process 3D MRI scans: Convolutional Voxel Vision Transformer (CVVT) architecture, and ConvNet3D-4, a shallow 4-block 3D CNN-based model. Our results indicate that the shallow 3D CNN-based models are sufficient to achieve good classification results for Alzheimer's Disease using MRI scans.

Topik & Kata Kunci

eess.IV cs.CV cs.LG

Penulis (2)

Zilun Zhang

Farzad Khalvati

Format Sitasi

APA MLA BibTeX

Zhang, Z., Khalvati, F. (2022). Introducing Vision Transformer for Alzheimer's Disease classification task with 3D input. https://arxiv.org/abs/2210.01177

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2022
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓