Comparative Evaluation of Vision Transformers and Convolutional Networks for Breast Ultrasound Image Classification

Naral S.; Cakmak Y.; Pacal I.; Pacal, Ishak; Cakmak, Yigitcan; Naral, Suleyman

doi:10.37349/emed.2026.1001382

Comparative Evaluation of Vision Transformers and Convolutional Networks for Breast Ultrasound Image Classification

Date

2026

Authors

Publisher

Open Exploration Publishing Inc

Abstract

Aim: Interobserver variability continues to limit the consistency of breast ultrasound interpretation. This study compares two Vision Transformer (ViT) models and two Convolutional Neural Network (CNN) models for automated three-class breast ultrasound classification, with a specific focus on the tradeoff between predictive performance and computational efficiency. Methods: Swin Transformer Base and DeiT Base were evaluated alongside InceptionV3 and MobileNetV3 Large using the public Breast Ultrasound Images (BUSI) dataset, which contains 780 images labeled as benign, malignant, and normal. A consistent on-the-fly augmentation pipeline was applied during training to promote robustness and reduce sensitivity to incidental image variations. Results: Swin Transformer Base achieved the highest test accuracy (0.9167) and F1 score (0.8981). MobileNetV3 Large reached an accuracy of 0.8583 with substantially lower computational demand. The efficiency contrast was pronounced, with Swin requiring 30.33 GFLOPs versus 0.43 GFLOPs for MobileNetV3 Large. Conclusions: On this benchmark, ViT models can yield higher classification performance, while lightweight CNNs offer a strong efficiency profile that may better match deployment-constrained settings. These results suggest that model selection should be guided by both predictive accuracy and operational feasibility within the target clinical workflow. © The Author(s) 2026.

Keywords

Breast Cancer, Computer-Aided Diagnosis, Deep Learning, Ultrasound Images

WoS Q

N/A

Scopus Q

Q4

Source

Exploration of Medicine

Volume

7

URI

https://doi.org/10.37349/emed.2026.1001382
https://hdl.handle.net/20.500.14627/1463

Collections

Scopus İndeksli Yayınlar Koleksiyonu

PlumX Metrics

Citations

Scopus : 0

Full item page

Comparative Evaluation of Vision Transformers and Convolutional Networks for Breast Ultrasound Image Classification

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

Description

Keywords

Fields of Science

Citation

WoS Q

Scopus Q

Source

Volume

Issue

Start Page

End Page

URI

Collections

PlumX Metrics

Citations

OpenAlex FWCI

0.0

Sustainable Development Goals

SDG data could not be loaded because of an error. Please refresh the page or try again later.