Transforming Retinal Diagnostics: Advanced Detection of Diabetic Retinopathy Using Vision Transformers and Capsule Networks

Vishal Sharma; Rishu; Vinay Kukreja; Ayush Dogra; Bhawna Goyal

doi:10.3844/jcssp.2025.304.321

Research Article Open Access

Vishal Sharma¹, Rishu¹, Vinay Kukreja¹, Ayush Dogra¹ and Bhawna Goyal²,³

¹ Centre for Research Impact and Outcome, Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India
² Marwadi University Research Centre, Department of Engineering, Rajkot, Gujarat, India
³ Faculty of Engineering, Sohar University, Sohar, Oman

Abstract

Diabetic Retinopathy (DR), a severe complication of diabetes mellitus that affects the blood vessels of the retina, is one of the leading causes of blindness worldwide. Accurate diagnosis depends on detecting DR early. This study develops a hybrid model that combines a Vision Transformer and a Capsule Network (ViT-CapsNet) to detect and classify DR from retinal images at an early stage. The public EyePACS dataset is used. During preprocessing, the images are resized and augmented to improve the quality and increase the diversity of the data. The Vision Transformer then extracts global features from the retinal fundus image, while the Capsule Network preserves the spatial relationships and hierarchies within the data and classifies each image into one of five classes: No DR, Mild DR, Moderate DR, Severe DR, and Proliferative DR. The ViT-CapsNet model achieves a precision, recall, and F1-score of 0.92, 0.91, and 0.91, respectively. It reaches an accuracy of 94%, compared with 88% for CNN, 90% for ResNet, and 92% for EfficientNet. The AUC-ROC scores for the classes No DR, Mild DR, Moderate DR, Severe DR, and Proliferative DR are 0.56, 0.48, 0.44, 0.45, and 0.51, respectively.
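The abstract reports precision, recall, F1-score, and accuracy for a five-class problem without stating how the per-class values are aggregated. As a minimal sketch, assuming macro averaging over the five DR grades, these quantities could be derived from a confusion matrix as follows. The function names and the confusion matrix `toy_cm` are hypothetical illustrations, not the authors' code or data.

```python
# Illustrative sketch only: toy_cm is made-up data, not a result from the paper.
CLASSES = ["No DR", "Mild DR", "Moderate DR", "Severe DR", "Proliferative DR"]


def per_class_metrics(cm):
    """Return {class_index: (precision, recall, f1)} for a square confusion matrix.

    cm[i][j] counts samples whose true class is i and predicted class is j.
    """
    n = len(cm)
    metrics = {}
    for k in range(n):
        tp = cm[k][k]
        predicted_k = sum(cm[i][k] for i in range(n))  # column sum: predicted as k
        actual_k = sum(cm[k])                          # row sum: truly class k
        precision = tp / predicted_k if predicted_k else 0.0
        recall = tp / actual_k if actual_k else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[k] = (precision, recall, f1)
    return metrics


def macro_average(metrics):
    """Unweighted mean of (precision, recall, f1) across classes (an assumption)."""
    n = len(metrics)
    return tuple(sum(m[i] for m in metrics.values()) / n for i in range(3))


def accuracy(cm):
    """Fraction of samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[k][k] for k in range(len(cm)))
    return correct / total if total else 0.0


# Hypothetical confusion matrix with 100 samples per true class.
toy_cm = [
    [90, 5, 3, 1, 1],
    [6, 80, 10, 3, 1],
    [2, 8, 82, 6, 2],
    [1, 2, 7, 85, 5],
    [0, 1, 2, 6, 91],
]

if __name__ == "__main__":
    scores = per_class_metrics(toy_cm)
    for idx, name in enumerate(CLASSES):
        p, r, f = scores[idx]
        print(f"{name}: precision={p:.2f} recall={r:.2f} f1={f:.2f}")
    print("macro (P, R, F1):", macro_average(scores))
    print("accuracy:", accuracy(toy_cm))
```

Macro averaging weights every grade equally, which matters for DR datasets where the No DR class usually dominates; a support-weighted average would be an equally plausible reading of the reported figures.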

Journal of Computer Science

Volume 21 No. 2, 2025, 304-321

DOI: https://doi.org/10.3844/jcssp.2025.304.321

Submitted On: 25 September 2024 Published On: 14 January 2025

How to Cite: Sharma, V., R., Kukreja, V., Dogra, A. & Goyal, B. (2025). Transforming Retinal Diagnostics: Advanced Detection of Diabetic Retinopathy Using Vision Transformers and Capsule Networks. Journal of Computer Science, 21(2), 304-321. https://doi.org/10.3844/jcssp.2025.304.321

Copyright: © 2025 Vishal Sharma, Rishu, Vinay Kukreja, Ayush Dogra and Bhawna Goyal. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

Vision Transformers
Capsule Networks
Diabetic Retinopathy
Retinal Images
Deep Learning