Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.

Abstract

In the rapidly evolving field of medical image analysis utilizing artificial intelligence (AI), the selection of appropriate computational models is critical for accurate diagnosis and patient care. This literature review provides a comprehensive comparison of vision transformers (ViTs) and convolutional neural networks (CNNs), the two leading techniques in the field of deep learning in medical imaging. We conducted a survey systematically. Particular attention was given to the robustness, computational efficiency, scalability, and accuracy of these models in handling complex medical datasets. The review incorporates findings from 36 studies and indicates a collective trend that transformer-based models, particularly ViTs, exhibit significant potential in diverse medical imaging tasks, showcasing superior performance when contrasted with conventional CNN models. Additionally, it is evident that pre-training is important for transformer applications. We expect this work to help researchers and practitioners select the most appropriate model for specific medical image analysis tasks, accounting for the current state of the art and future trends in the field.© 2024. The Author(s).

Show Full Text

Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.

Researchers

Journal

Modalities

Models

Abstract

A cyber-physical system to design 3D models using mixed reality technologies and deep learning for additive manufacturing.

Masked Face Emotion Recognition Based on Facial Landmarks and Deep Learning Approaches for Visually Impaired People.

Core and penumbra estimation using deep learning-based AIF in association with clinical measures in computed tomography perfusion (CTP).

Influence of training and expertise on deep neural network attention and human attention during a medical image classification task.

Deep Learning in the Ubiquitous Human-Computer Interactive 6G Era: Applications, Principles and Prospects.

Convolutional neural network-based metal and streak artifacts reduction in dental CT images with sparse-view sampling scheme.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply