
Enhanced OCT GA segmentation achieved with new multistage model
Researchers introduce a multistage dual-branch network to improve accuracy and efficiency.
A novel
Their so-called multistage dual-branch image projection network (DIPN) can learn feature information in B-scan images to assist with GA segmentation via the introduction of additional components, that is, Convolutional Long-Short-Term Memory Networks (ConvLSTM), a projection attention module, an adaptive pooling module, and a contrastive learning enhancement (CLE) module, according to the researchers, who reported their work in Science Reports.1
The challenge with other existing GA segmentation tasks is that they can use only 3-dimensional (3D) data, which ignores the fact that a large number of B-scan images contain lesion information, the researchers explained.
There is an increasing prevalence of GA worldwide with the aging of the population,2 and lesion development and enlargement can result in irreversible loss of visual function, thus underscoring the importance of accurate segmentation of the lesion area for preventing progression and guiding subsequent treatment.3
Building on OCT capabilities
OCT is a valuable, noninvasive technology that provides high-resolution, 3D, cross-sectional images and rapid biomedical imaging technology.4 It can image biologic tissues at the micron level and generate high-resolution, 3D, cross-sectional images, which are widely used in clinical ophthalmology and are vital to diagnosing and monitoring retinal diseases.5-9
Previous efforts into GA segmentation have attempted to use traditional methods. The authors cited a study10 that proposed a method to create OCT projection images by applying constrained subvolume projection to 3D OCT data and another11 that used U-Net, a deep-learning network, to automatically segment GA lesions. A third study12 used U-Net and Y-Net to automatically segment GA lesions on fundus autofluorescence images. However, the authors pointed out that those studies used only the features of the en face images and did not use the spatial information in the volumetric data.
In response to this, the new method being described used a 2D network framework while incorporating ConvLSTM to capture adjacent information between slices of volumetric data. However, that approach may cause mis-segmentation when segmenting GA edges (low contrast of edge pixels), and it is difficult for the network to classify such samples. The researchers then introduced a projection attention module that was proposed to focus the network attention on the projection direction to capture the contextual relationships. However, because the current projection network uses a unidirectional pooling operation to achieve feature projection, multiscale features and channel information are ignored, and an adaptive pooling module was used to reduce feature dimensions when grasping multiscale features and channel information. Finally, they introduced a CLE module to mitigate the effect of image contrast on the network segmentation performance.
They summarized their efforts as follows: “Specifically, we proposed a multi-stage DIPN that can obtain pretraining weights using many B-scan images during the pretraining stage. In addition, inspired by Liu et al13 we proposed using a projection attention module to integrate long-range dependencies by calculating the affinity between 2 different pixels on each projection column in the B-scan. An adaptive pooling module focused on the channels while extracting and fusing multi-scale features, thus effectively improving the feature utilization. Finally, to ensure that the spatial information in the volumetric data is fully utilized during the segmentation process, we incorporated ConvLSTM to capture the neighborhood information between images in the fine-tuning stage. Utilizing a contrastive learning module enhanced the network’s ability to distinguish boundary features.”
Method applied
To validate the effectiveness of their proposed method, the researchers conducted experiments on 2 data sets. They explained, “The first was a retinal geographic atrophy data set containing 44 OCT volumes and 2823 GA B-scan images that were used in the pretraining phase. To explore the cross-domain generalizability of our method,14 the second data set is the public data set OCTA50015,16 that included 3D FAZ segmentation labels and retinal vessel segmentation labels.”
They reported the success of their method as follows: “Our network effectively combines the incorporated components, using a large number of individual B-scans images to pretrain the network, and experimental validation on 2 data sets demonstrates the soundness and effectiveness of our approach. The segmentation results show that our method is more effective than other methods in the GA segmentation task and the FAZ segmentation task.”
They will continue to fine-tune the method to reduce the labeling time while ensuring the quality of segmentation. At the same time, they will continue to collect larger retinal OCT data sets.
Xiaoming Liu, PhD
E: lxmspace@gmail.com
Liu and Li are from the Wuhan University of Science and Technology, Wuhan, China. The authors have no financial interest in this subject matter. Liu and Li were joined in this study by Ying Zhang and Junping Yao, who are from the Wuhan Aier Eye Hospital of Wuhan University, Wuhan, and the Tianyou Hospital, affiliated with Wuhan University of Science and Technology, Wuhan, respectively.
References:
- Liu X, Li J, Zhang Y, Yao J. Dual-branch image projection network for geographic atrophy segmentation in retinal OCT images. Sci Rep. 2025;15(1):6535. doi:10.1038/s41598-025-90709-6
- Wong WL, Su X, Li X, et al. Global prevalence of age-related macular degeneration and disease burden projection for 2020 and 2040: a systematic review and meta-analysis. Lancet Glob Health. 2014;2(2):e106-e116. doi:10.1016/S2214-109X(13)70145-1
- Holz FG, Sadda SR, Staurenghi G, et al; CAM group. Imaging protocols in clinical studies in advanced age-related macular degeneration: recommendations from classification of atrophy consensus meetings. Ophthalmology. 2017;124(4):464-478. doi:10.1016/j.ophtha.2016.12.002
- Fazekas B, Lachinov D, Aresta G, Mai J, Schmidt-Erfurth U, Bogunovic H. Segmentation of Bruch’s membrane in retinal OCT with AMD using anatomical priors and uncertainty quantification. IEEE J Biomed Health Inform. 2023;27(1):41-52. doi:10.1109/JBHI.2022.3217962
- Hassan B, Raja G, Hassan T, Usman Akram M. Structure tensor based automated detection of macular edema and central serous retinopathy using optical coherence tomography images. J Opt Soc Am A Opt Image Sci Vis. 2016;33(4):455-463. doi:10.1364/JOSAA.33.000455
- Wang M, Zhu W, Yu K, et al. Semi-supervised capsule cGAN for speckle noise reduction in retinal OCT images. IEEE Trans Med Imaging. 2021;40(4):1168-1183. doi:10.1109/TMI.2020.3048975
- Fang J, Zhang Y, Xie K, Yuan S, Chen Q. In: Ophthalmic Medical Image Analysis: 6th International Workshop, OMIA 2019, Held in Conjunction with MICCAI 2019, Shenzhen, China, October 17, Proceedings 6. 130-138 (Springer).
- Yang J, Ji Z, Niu S, Chen Q, Yuan S, Fan W. RMPPNet: residual multiple pyramid pooling network for subretinal fluid segmentation in SD-OCT images. OSA Contin. 2020;3(7):1751-1769. doi:10.1364/osac.387102
- Shi F, Chen X, Zhao H, et al. Automated 3-D retinal layer segmentation of macular optical coherence tomography images with serous pigment epithelial detachments. IEEE Trans Med Imaging. 2015;34(2):441-452. doi:10.1109/TMI.2014.2359980
- Wu M, Cai X, Chen Q, et al. Geographic atrophy segmentation in SD-OCT images using synthesized fundus autofluorescence imaging. Comput Methods Programs Biomed. 2019;182:105101. doi:10.1016/j.cmpb.2019.105101
- Patil J, Kawczynski M, Gao SS, Coimbra AF. Geographic atrophy lesion segmentation using a deep learning network (U-net). Invest Ophthalmol Vis Sci. 2019;60:1459.
- Spaide T, Jiang J, Patil J, et al. Geographic atrophy segmentation using multimodal deep learning. Transl Vis Sci Technol. 2023;12(7):10. doi:10.1167/tvst.12.7.10
- Liu X, Cao J, Wang S, Zhang Y, Wang M. Confidence-guided topology-preserving layer segmentation for optical coherence tomography images with focus-column module. IEEE Trans Instrum Meas. 2020;70:1-12. doi:10.1109/tim.2020.3047430
- Shao HC, Chen CY, Chang MH, Yu CH, Lin CW, Yang JW. Retina-TransNet: a gradient-guided few-shot retinal vessel segmentation net. IEEE J Biomed Health Inform. 2023;27(10):4902-4913. doi:10.1109/JBHI.2023.3298710
- Li M, Chen Y, Ji Z, et al. Image projection network: 3D to 2D image segmentation in OCTA images. IEEE Trans Med Imaging. 2020;39(11):3343-3354. doi:10.1109/TMI.2020.2992244
- Li M, Huang K, Xu Q, et al. OCTA-500: a retinal dataset for optical coherence tomography angiography study. Med Image Anal. 2024;93:103092. doi:10.1016/j.media.2024.103092
Newsletter
Keep your retina practice on the forefront—subscribe for expert analysis and emerging trends in retinal disease management.