Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture

Aydın, Ömer Faruk; Tek, Faik Boray; Turkan, Yasemin

Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture

Dosyalar

Retinal_Disease_Classification_from_Bimodal_OCT_and_OCTA_Using_a_CNN_ViT_Hybrid_Architecture.pdf (1.93 MB)

Tarih

2025-09-21

Yazarlar

Aydın, Ömer Faruk

Tek, Faik Boray

Turkan, Yasemin

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Retinal diseases are the leading cause of vision impairment and blindness worldwide. Early and accurate diagnosis is critical for effective treatment, and recent advances in imaging technologies such as Optical Coherence Tomography (OCT) and OCT Angiography (OCTA), have enabled detailed visualization of the retinal structure and vasculature. By leveraging these modalities, this study proposes an advanced deep learning architecture called MultiModalNet for automated multi-class retinal disease classification. MultiModalNet employs a dual-branch design, where OCTA projection maps are processed through a ResNet101 encoder, and cross-sectional slices from the OCT volume (B-scans) are analyzed using a Vision Transformer (ViT-Large). The extracted features from both branches were fused and passed through the fully connected layers for the final classification. Evaluated on the 3-class OCTA-500 dataset, which includes Age-related Macular Degeneration (AMD), Diabetic Retinopathy (DR), and Normal cases, the proposed model achieved state-of-the-art classification accuracy of 94.59 percent, significantly o utperforming single-modality baselines. This result highlights the effectiveness of integrating vascular and structural information to improve the diagnostic performance. The findings suggest that hybrid multi-modal deep learning approaches can play a transformative role in computer-aided ophthalmology, enhancing both clinical decision-making and screening workflows.

Açıklama

This study was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) under Grant Number 122E509.

Anahtar Kelimeler

Convolutional Neural Networks (CNN), Deep learning, Multi-modal, Optical Coherence Tomography Angiography (OCTA), Retinal disease classification, Vision Transformer (ViT), Architecture, Classification (of information), Computer aided diagnosis, Convolutional neural networks, Decision making, Deep neural networks, Eye protection, Learning systems, Ophthalmology, Coherence tomography, Convolutional neural network, Disease classification, Retinal disease, Angiography

Kaynak

International Conference on Computer Science and Engineering, UBMK

Scopus Q Değeri

N/A

Sayı

2025

Künye

Aydın, Ö. F., Tek, F. B. & Turkan, Y. (2025). Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture. Paper presented at the International Conference on Computer Science and Engineering, UBMK, 2025, 260-264. doi:https://doi.org/10.1109/UBMK67458.2025.11206835

Bağlantı

https://hdl.handle.net/11729/7105
https://doi.org/10.1109/UBMK67458.2025.11206835

Koleksiyon

Öğrenci Yayınları Bildiri Koleksiyonu
Lisansüstü Eğitim Enstitüsü Diğer Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Retinal disease classification from bimodal OCT and OCTA using a CNN-ViT hybrid architecture

Dosyalar

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Araştırma projeleri

Organizasyon Birimleri

Dergi sayısı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon