Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
11institutetext: Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China 11email: hwang007@connect.hkust-gz.edu.cn 22institutetext: University of Electronic Science and Technology of China, Chengdu, China
33institutetext: West China Hospital, Sichuan University, Chengdu, China
44institutetext: Chengdu First People’s Hospital
55institutetext: Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
66institutetext: The Hong Kong University of Science and Technology

Advancing UWF-SLO Vessel Segmentation with Source-Free Active Domain Adaptation and a Novel Multi-Center Dataset

Hongqiu Wang 11    Xiangde Luo 22    Wu Chen 22    Qingqing Tang 33    Mei Xin 44    Qiong Wang 55    Lei Zhu(🖂) 1166
Abstract

Accurate vessel segmentation in Ultra-Wide-Field Scanning Laser Ophthalmoscopy (UWF-SLO) images is crucial for diagnosing retinal diseases. Although recent techniques have shown encouraging outcomes in vessel segmentation, models trained on one medical dataset often underperform on others due to domain shifts. Meanwhile, manually labeling high-resolution UWF-SLO images is an extremely challenging, time-consuming and expensive task. In response, this study introduces a pioneering framework that leverages a patch-based active domain adaptation approach. By actively recommending a few valuable image patches by the devised Cascade Uncertainty-Predominance (CUP) selection strategy for labeling and model-finetuning, our method significantly improves the accuracy of UWF-SLO vessel segmentation across diverse medical centers. In addition, we annotate and construct the first Multi-center UWF-SLO Vessel Segmentation (MU-VS) dataset to promote this topic research, comprising data from multiple institutions. This dataset serves as a valuable resource for cross-center evaluation, verifying the effectiveness and robustness of our approach. Experimental results demonstrate that our approach surpasses existing domain adaptation and active learning methods, considerably reducing the gap between the Upper and Lower bounds with minimal annotations, highlighting our method’s practical clinical value. We will release our dataset and code to facilitate relevant research (Git).

Keywords:
Vessel segmentation Ultra-Wide-Field source free active domain adaptation multi-center dataset.
H. Wang and X. Luo contributed equally to this work.

1 Introduction

Accurate segmentation of retinal vessels in fundus images is critical in aiding ophthalmologists with quantitative analysis and treatment [3, 11]. For instance, Retinal Vein Occlusion (RVO) is identified by increased retinal vessel tortuosity, enlarged vessel caliber, and retinal non-perfusion [16]. A variety of deep learning models for automated vessel segmentation have emerged, showing promising results [4, 10, 27, 28, 29]. These segmentation models are mainly adapted to Narrow Field (NF) Fundus Photography (FP), since NF FP is the most common format and modality in clinical practice, and previous relevant segmentation datasets are also mainly based on NF FP [6, 18, 30].

Recently, Ultra-Wide-Field Scanning Laser Ophthalmoscopy (UWF-SLO) imaging has gained popularity due to its ability to provide extensive retinal coverage and superior imaging of peripheral lesions over NF FP, thereby enhancing diagnostic precision [21]. Generally, UWF-SLO images provide an expansive 200° field-of-view (FOV), far exceeding the typical 30°-50° FOV of NF FP. This broader view grants ophthalmologists access to more comprehensive information for more accurate diagnoses [2, 12]. There are also some efforts in the field. For example, Li et al. proposed a weakly-supervised iterative learning method and the PRIME-P20 dataset to segment vessels in UWF images[2]. Qiu et al. introduced a dual-stream super-resolution network for this task[15].

Table 1: Quantitative analysis of the existing datasets, including categories and image resolution. Center A: Hospital A. Center B: Hospital B.
Dataset Amount of data Categories Resolution Public available
In-house [9] 65 Normal,VO 3900×\times×3072 no
PRIME-FP20 [2] 15 DR 4000×\times×4000 yes
Center A (Ours) 30 Normal, RVO 3900×\times×3072 yes
Center B (Ours) 30 DR, RVO, RP, RAO, CSC 3900×\times×3072 yes
Refer to caption
Figure 1: Several data visualization examples from the proposed Multi-center UWF-SLO Vessel Segmentation (MU-VS) dataset, illustrate that various centers encompass different disease categories.

Although there has been a notable improvement in UWF vessel segmentation approaches, the above models are all developed within the single-center setting, lacking multi-center, cross-domain studies. In real-world clinical scenarios, domain shifts commonly occur among datasets [5, 32] because of variations in imaging equipment and patient populations, potentially leading to suboptimal performance in new domains [13, 26]. The simplest approach involves fully supervised training on target data, yet annotating high-resolution UWF-SLO images is extremely expensive, requiring approximately 18 hours of an expert’s time to annotate a single image [14]. Therefore, unsupervised domain adaptation (UDA) techniques are widely explored, designed to reduce the domain discrepancy between the labeled source and the unlabeled target domain [8]. Although UDA approaches yield better outcomes, their performance still significantly falls short of that achieved by fully supervised models [8, 26]. Moreover, accessing source medical datasets raises privacy and security concerns [31].

To alleviate these above issues, we propose a novel patch-based Source-Free Active Domain Adaptation (SFADA) method for advancing UWF-SLO cross-center vessel segmentation. Our approach offers three advantages: First, it eliminates the need to access source domain data, thereby enhancing data security and privacy protection. Second, we introduce the Cascade Uncertainty-Predominance (CUP) selection strategy, which efficiently identifies a small subset of valuable image patches for annotation, substantially reducing the annotation burden. Lastly, by integrating our method with a minimal number of patch annotations, we can significantly boost the model’s performance. Meanwhile, by integrating the existing dataset and our newly collected and labeled datasets from two distinct centers (as detailed in Table 1 and Fig. 1), we construct the first Multi-center UWF-SLO Vessel Segmentation (MU-VS) dataset to explore the cross-center segmentation study. The main contributions are summarized as follows:

  • To our knowledge, this marks a pioneering exploration on an essential application of cross-center vessel segmentation using UWF-SLO, and we propose a patch-based SFADA framework to enhance segmentation performance.

  • We design the Cascade Uncertainty Predominance (CUP) selection strategy to select a small number of patches with high uncertainty and dominance to recommend for manual annotation.

  • We establish the first multi-center UWF-SLO vessel segmentation dataset consisting of 60 UWF-SLO images from two hospitals, named MU-VS, to support relevant studies.

  • Experimental results show that our method significantly surpasses other state-of-the-art domain adaptation and active learning methods, effectively enhancing segmentation accuracy.

2 Methodology

2.1 Problem setting

The goal of medical image segmentation is to construct a model \mathcal{M}caligraphic_M that links an image sample x𝑥xitalic_x from the space X𝑋Xitalic_X to its predictive label y𝑦yitalic_y within the space Y𝑌Yitalic_Y. In the SFADA setting, direct access to the source dataset and its annotations (xs,ys)superscript𝑥𝑠superscript𝑦𝑠{(x^{s},y^{s})}( italic_x start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ) is avoided, thereby safeguarding data privacy and security. Instead, we employ a model ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT pre-trained in the source domain alongside unlabeled data Xtsuperscript𝑋𝑡X^{t}italic_X start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT from the target domain to guide the recommendation of annotations. The quantity of target patches chosen for manual annotation is denoted by NtAL=αNtsuperscriptsubscript𝑁𝑡𝐴𝐿𝛼subscript𝑁𝑡N_{t}^{AL}=\alpha\cdot N_{t}italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_A italic_L end_POSTSUPERSCRIPT = italic_α ⋅ italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, with NtALNtmuch-less-thansuperscriptsubscript𝑁𝑡𝐴𝐿subscript𝑁𝑡N_{t}^{AL}\ll N_{t}italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_A italic_L end_POSTSUPERSCRIPT ≪ italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT indicating that the actively selected patches are significantly fewer than the total target patches, where α𝛼\alphaitalic_α signifies the selection ratio and Ntsubscript𝑁𝑡N_{t}italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the count of all target patches. Concurrently, the labels for these selected patches are symbolized as YLtsuperscript𝑌𝐿𝑡Y^{Lt}italic_Y start_POSTSUPERSCRIPT italic_L italic_t end_POSTSUPERSCRIPT. Our goal is to refine the performance of the model tsuperscript𝑡\mathcal{M}^{t}caligraphic_M start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT in the target domain, striving to keep the parameter α𝛼\alphaitalic_α as small as possible.

Refer to caption
Figure 2: Overview of the proposed patch-based SFADA framework including the CUP selection strategy. The first row of gray represents the training of source model; the second row of light blue represents the recommendation and annotation of the valuable image patches based on the CUP strategy; and the third row of green represents the target model’s fine-tuning under the supervision of enhanced pseudo-labels.

2.2 Pipeline of Patch-Based SFADA Framework

Considering the high-resolution UWF-SLO images used in our segmentation task, fully annotating the image is extremely expensive and time-consuming. To address this, we propose a method that focuses on selecting and annotating a few valuable image patches and finally integrating them into enhanced pseudo-labels (YEtsuperscript𝑌𝐸𝑡Y^{Et}italic_Y start_POSTSUPERSCRIPT italic_E italic_t end_POSTSUPERSCRIPT) for target model tsuperscript𝑡\mathcal{M}^{t}caligraphic_M start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT fine-tuning.

Fig. 2 depicts the structural design of our patch-based SFADA framework. The initial row illustrates the pipeline’s first phase, where we utilize images with their corresponding annotations (Xs,Ys)superscript𝑋𝑠superscript𝑌𝑠{(X^{s},Y^{s})}( italic_X start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT , italic_Y start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ) from the source domain to train the segmentation model, obtaining the source domain segmentation model ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT. The second step is depicted in the second row of Fig. 2. Here, we freeze the parameters of ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT to infer the target domain UWF-SLO image Xtsuperscript𝑋𝑡X^{t}italic_X start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, subsequently deriving its prediction masks Ytsuperscript𝑌𝑡Y^{t}italic_Y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and uncertainty maps Utsuperscript𝑈𝑡U^{t}italic_U start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT (detailed computational methods are explained in Section 2.3). Subsequently, we divide Ytsuperscript𝑌𝑡Y^{t}italic_Y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and Utsuperscript𝑈𝑡U^{t}italic_U start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT into numerous small patches and recommend the most valuable ones to ophthalmologists for annotation, employing our CUP strategy. The third step, illustrated in the third row of Fig. 2, involves merging the actively annotated real patch labels YLtsuperscript𝑌𝐿𝑡Y^{Lt}italic_Y start_POSTSUPERSCRIPT italic_L italic_t end_POSTSUPERSCRIPT with the network’s prediction masks Ytsuperscript𝑌𝑡Y^{t}italic_Y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT to produce the enhanced pseudo labels YEtsuperscript𝑌𝐸𝑡Y^{Et}italic_Y start_POSTSUPERSCRIPT italic_E italic_t end_POSTSUPERSCRIPT. The specific operation is to modify the corresponding image patches using YLtsuperscript𝑌𝐿𝑡Y^{Lt}italic_Y start_POSTSUPERSCRIPT italic_L italic_t end_POSTSUPERSCRIPT according to the position index of these patches. Finally, the source domain model ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT undergoes fine-tuning guided by the enhanced pseudo-label YEtsuperscript𝑌𝐸𝑡Y^{Et}italic_Y start_POSTSUPERSCRIPT italic_E italic_t end_POSTSUPERSCRIPT to develop the target domain model tsuperscript𝑡\mathcal{M}^{t}caligraphic_M start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT. This process involves minimizing the loss function Ls(θs,Ωs)subscript𝐿𝑠subscript𝜃𝑠subscriptΩ𝑠L_{s}(\theta_{s},\Omega_{s})italic_L start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , roman_Ω start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) (a combined loss function of cross-entropy and Dice) in relation to the network parameters ΩssubscriptΩ𝑠\Omega_{s}roman_Ω start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT.

2.3 Cascade Uncertainty-Predominance selection

In the domain adaptation task, the source domain model ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT has already acquired some fundamental knowledge of vessel segmentation, enabling it to generate a preliminary vessel mask from target domain data Xtsuperscript𝑋𝑡X^{t}italic_X start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT. However, the variances between domains result in several regions within the ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT predicted vessel mask exhibiting high uncertainty. To address this challenge, we design a CUP selection strategy to prioritize patches with higher uncertainty, where ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT may lack related knowledge. Building on this premise, we further identify patches with substantial vessel prediction by ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT, denoting regions of predominance. This approach underscores that, despite significant uncertainties in ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT’s predictions, where model predictions are prone to errors. Certain regions still predict a large number of target vessels, necessitating ground truth annotations for precise model training and refinement.

As illustrated in the CUP selection box of Fig. 2, the CUP strategy comprises two cascades: C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT for uncertainty and C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for predominance. First, we employ the source domain model ssuperscript𝑠\mathcal{M}^{s}caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT to generate the prediction masks Ytsuperscript𝑌𝑡Y^{t}italic_Y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and the uncertainty maps Utsuperscript𝑈𝑡U^{t}italic_U start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT. The procedure for prediction masks is detailed as follows:

Prediction_mask=argmax𝑐(softmax𝑐(s(xt))),𝑃𝑟𝑒𝑑𝑖𝑐𝑡𝑖𝑜𝑛_𝑚𝑎𝑠𝑘𝑐𝑐𝑠𝑜𝑓𝑡𝑚𝑎𝑥superscript𝑠superscript𝑥𝑡Prediction\_mask=\underset{c}{\arg\max}\left(\underset{c}{softmax}(\mathcal{M}% ^{s}(x^{t}))\right),italic_P italic_r italic_e italic_d italic_i italic_c italic_t italic_i italic_o italic_n _ italic_m italic_a italic_s italic_k = underitalic_c start_ARG roman_arg roman_max end_ARG ( underitalic_c start_ARG italic_s italic_o italic_f italic_t italic_m italic_a italic_x end_ARG ( caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) ) , (1)

where xtsuperscript𝑥𝑡x^{t}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT represents a target image from Xtsuperscript𝑋𝑡X^{t}italic_X start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, and c𝑐citalic_c denotes the predicted category, here refers to the foreground vessels. The process for uncertainty maps is outlined in the following manner:

Uncertainty=1×c=1Csoftmax𝑐(s(xt))log(softmax𝑐(s(xt))),𝑈𝑛𝑐𝑒𝑟𝑡𝑎𝑖𝑛𝑡𝑦1superscriptsubscript𝑐1𝐶𝑐𝑠𝑜𝑓𝑡𝑚𝑎𝑥superscript𝑠superscript𝑥𝑡𝑐𝑠𝑜𝑓𝑡𝑚𝑎𝑥superscript𝑠superscript𝑥𝑡Uncertainty=-1\times\sum_{c=1}^{C}\underset{c}{softmax}(\mathcal{M}^{s}(x^{t})% )\log\left(\underset{c}{softmax}(\mathcal{M}^{s}(x^{t}))\right),italic_U italic_n italic_c italic_e italic_r italic_t italic_a italic_i italic_n italic_t italic_y = - 1 × ∑ start_POSTSUBSCRIPT italic_c = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_C end_POSTSUPERSCRIPT underitalic_c start_ARG italic_s italic_o italic_f italic_t italic_m italic_a italic_x end_ARG ( caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) roman_log ( underitalic_c start_ARG italic_s italic_o italic_f italic_t italic_m italic_a italic_x end_ARG ( caligraphic_M start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) ) , (2)

where c=1Csuperscriptsubscript𝑐1𝐶\sum_{c=1}^{C}∑ start_POSTSUBSCRIPT italic_c = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_C end_POSTSUPERSCRIPT represents the sum of c𝑐citalic_c classes, both the foreground vessels and background. This measure reflects the entropy of the model’s predictive probability distribution: higher values indicate greater uncertainty (i.e., the model’s predictions are spread out across different classes).

Next, as depicted in Fig. 2, the prediction masks Ytsuperscript𝑌𝑡Y^{t}italic_Y start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and the uncertainty maps Utsuperscript𝑈𝑡U^{t}italic_U start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT are divided into multiple small patches. For each patch, we calculate the total number of predicted vessel pixels and the aggregate uncertainty values, denoted as vesp={P0,,Pt}𝑣𝑒subscript𝑠𝑝subscript𝑃0subscript𝑃𝑡ves_{p}=\{P_{0},...,P_{t}\}italic_v italic_e italic_s start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = { italic_P start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } and vesu={U0,,Ut}𝑣𝑒subscript𝑠𝑢subscript𝑈0subscript𝑈𝑡ves_{u}=\{U_{0},...,U_{t}\}italic_v italic_e italic_s start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = { italic_U start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_U start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT }, respectively. Based on these statistical results, our cascade selection strategy is operated as follows:

step1: SelectC1%=Top(vesu)[C1%],step1: 𝑆𝑒𝑙𝑒𝑐subscript𝑡𝐶percent1Topsubscriptves𝑢delimited-[]𝐶percent1\displaystyle\text{step1: }Select_{C1\%}=\text{Top}(\text{ves}_{u})[C1\%],step1: italic_S italic_e italic_l italic_e italic_c italic_t start_POSTSUBSCRIPT italic_C 1 % end_POSTSUBSCRIPT = Top ( ves start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) [ italic_C 1 % ] , (3)
step2: Selected=Top(vespinSelectC1%)[C2%],step2: 𝑆𝑒𝑙𝑒𝑐𝑡𝑒𝑑Topsubscriptves𝑝𝑖𝑛𝑆𝑒𝑙𝑒𝑐subscript𝑡𝐶percent1delimited-[]𝐶percent2\displaystyle\text{step2: }Selected=\text{Top}(\text{ves}_{p}\ in\ Select_{C1% \%})[C2\%],step2: italic_S italic_e italic_l italic_e italic_c italic_t italic_e italic_d = Top ( ves start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_i italic_n italic_S italic_e italic_l italic_e italic_c italic_t start_POSTSUBSCRIPT italic_C 1 % end_POSTSUBSCRIPT ) [ italic_C 2 % ] ,

where C1%𝐶percent1C1\%italic_C 1 % and C2%𝐶percent2C2\%italic_C 2 % denote the ratio of patches selected based on the highest uncertainty and predicted vessel pixels, respectively.

3 Experiments and Results

3.1 Data Description

We collected 30 UWF-SLO images each from two distinct medical centers, utilizing Optos California and 200Tx cameras (Optos plc, Dunfermline, UK) for capture. The datasets from each center comprised varied categories (refer to Table 1 for details) and were annotated with vessel masks by their respective ophthalmologists. The ophthalmologists utilized Photoshop software for the manual annotation of vessels within the UWF-SLO images. They precisely labeled the vessels across different regions by iteratively fine-tuning the image’s brightness and contrast, adopting a layered approach, and leveraging the software’s outlining tools for accurate delineation. Combined with the currently existing publicly available data PRIME-FP20 [2], we established the first multi-center vessel segmentation of UWF-SLO, with domain shifts potentially attributed to different annotators and different annotation approaches and disease categories.

3.2 Implementation Details and Evaluation Metrics

Implementation Details. For objective evaluation, each dataset is randomly split into three subsets (training, validation, testing) with a ratio of 6:2:2. The model that performs best on the validation set is then chosen for reporting its results on the test set. The PRIME-FP20 [2] dataset is used as the source domain, with centers A and B serving as the target domains. All experiments are carried out on an NVIDIA RTX 3090 GPU with 24 GB memory. The original image size is 3900×\times×3072 and the patch size is set to 260×\times×256. C1%𝐶percent1C1\%italic_C 1 % and C2%𝐶percent2C2\%italic_C 2 % are set to 10% and 50% respectively, which means that a total of α=5%𝛼percent5\alpha=5\%italic_α = 5 % of the patches are selected for annotation. All input images resized to 1024×\times×1024 for uniform training. The SGD optimizer and a batch size of 5 are employed for training. For original training with all labels, models undergo 6000 iterations, while fine-tuning with pseudo labels involves 3000 iterations. An initial learning rate of 0.03 is set, undergoing exponential decay at a rate of 0.9 per iteration. For consistency, comparison methods are re-implemented using the same U-Net backbone [17] and executed under identical conditions.

Evaluation Metrics. Following previous work [15], We employ the Dice score (Dice), Intersection over Union (IoU), Matthews Correlation Coefficient (MCC), and Bookmaker Informedness (BM) as metrics to assess the performance of these models. Higher values indicate superior model performance.

Table 2: Quantitative comparison on Dice and IoU of our method and other state-of-the-art domain adaptation and active learning methods on the MU-VS dataset.
Dice (mean±plus-or-minus\pm±std, %) IoU (mean±plus-or-minus\pm±std, %)
Methods Center A Center B Overall Center A Center B Overall
Lower bound 54.76±2.73plus-or-minus54.762.7354.76\pm 2.7354.76 ± 2.73 51.32±3.25plus-or-minus51.323.2551.32\pm 3.2551.32 ± 3.25 53.04±2.99plus-or-minus53.042.9953.04\pm 2.9953.04 ± 2.99 37.75±2.56plus-or-minus37.752.5637.75\pm 2.5637.75 ± 2.56 34.58±2.91plus-or-minus34.582.9134.58\pm 2.9134.58 ± 2.91 36.17±2.74plus-or-minus36.172.7436.17\pm 2.7436.17 ± 2.74
Upper bound 62.00±1.32plus-or-minus62.001.3262.00\pm 1.3262.00 ± 1.32 57.36±2.99plus-or-minus57.362.9957.36\pm 2.9957.36 ± 2.99 59.68±2.16plus-or-minus59.682.1659.68\pm 2.1659.68 ± 2.16 44.94±1.38plus-or-minus44.941.3844.94\pm 1.3844.94 ± 1.38 40.27±2.96plus-or-minus40.272.9640.27\pm 2.9640.27 ± 2.96 42.61±2.17plus-or-minus42.612.1742.61\pm 2.1742.61 ± 2.17
AdvEnt [23] 56.29±1.35plus-or-minus56.291.3556.29\pm 1.3556.29 ± 1.35 51.95±2.96plus-or-minus51.952.9651.95\pm 2.9651.95 ± 2.96 54.12±2.16plus-or-minus54.122.1654.12\pm 2.1654.12 ± 2.16 39.18±1.30plus-or-minus39.181.3039.18\pm 1.3039.18 ± 1.30 35.14±2.72plus-or-minus35.142.7235.14\pm 2.7235.14 ± 2.72 37.16±2.01plus-or-minus37.162.0137.16\pm 2.0137.16 ± 2.01
DPL [1] 56.50±2.20plus-or-minus56.502.2056.50\pm 2.2056.50 ± 2.20 52.58±2.92plus-or-minus52.582.9252.58\pm 2.9252.58 ± 2.92 54.54±2.56plus-or-minus54.542.5654.54\pm 2.5654.54 ± 2.56 39.40±2.12plus-or-minus39.402.1239.40\pm 2.1239.40 ± 2.12 35.72±2.67plus-or-minus35.722.6735.72\pm 2.6735.72 ± 2.67 37.56±2.40plus-or-minus37.562.4037.56\pm 2.4037.56 ± 2.40
CBMT [20] 57.51±1.42plus-or-minus57.511.4257.51\pm 1.4257.51 ± 1.42 52.95±2.79plus-or-minus52.952.7952.95\pm 2.7952.95 ± 2.79 55.23±2.10plus-or-minus55.232.1055.23\pm 2.1055.23 ± 2.10 40.38±1.39plus-or-minus40.381.3940.38\pm 1.3940.38 ± 1.39 36.06±2.57plus-or-minus36.062.5736.06\pm 2.5736.06 ± 2.57 38.22±1.98plus-or-minus38.221.9838.22\pm 1.9838.22 ± 1.98
CPR [7] 57.79±2.01plus-or-minus57.792.0157.79\pm 2.0157.79 ± 2.01 53.28±2.93plus-or-minus53.282.9353.28\pm 2.9353.28 ± 2.93 55.54±2.47plus-or-minus55.542.4755.54\pm 2.4755.54 ± 2.47 40.66±1.96plus-or-minus40.661.9640.66\pm 1.9640.66 ± 1.96 36.36±2.71plus-or-minus36.362.7136.36\pm 2.7136.36 ± 2.71 38.51±2.34plus-or-minus38.512.3438.51\pm 2.3438.51 ± 2.34
Adversarial [22] 58.79±1.73plus-or-minus58.791.7358.79\pm 1.7358.79 ± 1.73 53.32±2.76plus-or-minus53.322.7653.32\pm 2.7653.32 ± 2.76 56.06±2.25plus-or-minus56.062.2556.06\pm 2.2556.06 ± 2.25 41.66±1.71plus-or-minus41.661.7141.66\pm 1.7141.66 ± 1.71 36.40±2.56plus-or-minus36.402.5636.40\pm 2.5636.40 ± 2.56 39.03±2.14plus-or-minus39.032.1439.03\pm 2.1439.03 ± 2.14
AADA [19] 58.92±1.46plus-or-minus58.921.4658.92\pm 1.4658.92 ± 1.46 53.38±2.84plus-or-minus53.382.8453.38\pm 2.8453.38 ± 2.84 56.15±2.15plus-or-minus56.152.1556.15\pm 2.1556.15 ± 2.15 41.78±1.46plus-or-minus41.781.4641.78\pm 1.4641.78 ± 1.46 36.45±2.66plus-or-minus36.452.6636.45\pm 2.6636.45 ± 2.66 39.12±2.06plus-or-minus39.122.0639.12\pm 2.0639.12 ± 2.06
MHPL [24] 59.32±1.22plus-or-minus59.321.2259.32\pm 1.2259.32 ± 1.22 53.58±2.99plus-or-minus53.582.9953.58\pm 2.9953.58 ± 2.99 56.45±2.11plus-or-minus56.452.1156.45\pm 2.1156.45 ± 2.11 42.18±1.23plus-or-minus42.181.2342.18\pm 1.2342.18 ± 1.23 36.66±2.84plus-or-minus36.662.8436.66\pm 2.8436.66 ± 2.84 39.42±2.04plus-or-minus39.422.0439.42\pm 2.0439.42 ± 2.04
STDR [25] 59.51±1.54plus-or-minus59.511.5459.51\pm 1.5459.51 ± 1.54 53.96±2.80plus-or-minus53.962.8053.96\pm 2.8053.96 ± 2.80 56.73±2.17plus-or-minus56.732.1756.73\pm 2.1756.73 ± 2.17 42.38±1.56plus-or-minus42.381.5642.38\pm 1.5642.38 ± 1.56 37.00±2.67plus-or-minus37.002.6737.00\pm 2.6737.00 ± 2.67 39.69±2.11plus-or-minus39.692.1139.69\pm 2.1139.69 ± 2.11
Ours 60.92±0.94plus-or-minus60.920.9460.92\pm 0.9460.92 ± 0.94 54.92±2.81plus-or-minus54.922.8154.92\pm 2.8154.92 ± 2.81 57.92±1.88plus-or-minus57.921.8857.92\pm 1.8857.92 ± 1.88 43.81±0.98plus-or-minus43.810.9843.81\pm 0.9843.81 ± 0.98 37.91±2.70plus-or-minus37.912.7037.91\pm 2.7037.91 ± 2.70 40.86±1.84plus-or-minus40.861.8440.86\pm 1.8440.86 ± 1.84

3.3 Experimental Results

This section provides an overview of experimental results across various medical centers in Table 2 and 3, including the lower bound (model without finetuning), upper bound (model finetuned with all labels), and comparisons with other state-of-the-art domain adaptation and active learning methods. Fig. 3 exhibits some visualizations of the segmentation results. By analyzing the data from Table 2 and Table 3, it becomes evident that significant performance gaps exist between the lower and upper bounds across various evaluation metrics. For instance, in the case of the BM metric, the overall gap widens from 49.61% to 58.29%.

Table 3: Quantitative comparison on MCC and BM of our method and other state-of-the-art domain adaptation and active learning methods on the MU-VS dataset.
MCC (mean±plus-or-minus\pm±std, %) BM (mean±plus-or-minus\pm±std, %)
Methods Center A Center B Overall Center A Center B Overall
Lower bound 54.03±2.54plus-or-minus54.032.5454.03\pm 2.5454.03 ± 2.54 50.33±3.25plus-or-minus50.333.2550.33\pm 3.2550.33 ± 3.25 52.18±2.90plus-or-minus52.182.9052.18\pm 2.9052.18 ± 2.90 48.95±4.91plus-or-minus48.954.9148.95\pm 4.9148.95 ± 4.91 50.27±4.74plus-or-minus50.274.7450.27\pm 4.7450.27 ± 4.74 49.61±4.83plus-or-minus49.614.8349.61\pm 4.8349.61 ± 4.83
Upper bound 61.48±1.29plus-or-minus61.481.2961.48\pm 1.2961.48 ± 1.29 56.58±2.80plus-or-minus56.582.8056.58\pm 2.8056.58 ± 2.80 59.03±2.05plus-or-minus59.032.0559.03\pm 2.0559.03 ± 2.05 55.71±3.98plus-or-minus55.713.9855.71\pm 3.9855.71 ± 3.98 60.87±4.23plus-or-minus60.874.2360.87\pm 4.2360.87 ± 4.23 58.29±4.10plus-or-minus58.294.1058.29\pm 4.1058.29 ± 4.10
AdvEnt [23] 55.40±1.38plus-or-minus55.401.3855.40\pm 1.3855.40 ± 1.38 50.94±2.85plus-or-minus50.942.8550.94\pm 2.8550.94 ± 2.85 53.17±2.11plus-or-minus53.172.1153.17\pm 2.1153.17 ± 2.11 51.18±3.63plus-or-minus51.183.6351.18\pm 3.6351.18 ± 3.63 52.92±4.64plus-or-minus52.924.6452.92\pm 4.6452.92 ± 4.64 52.05±4.14plus-or-minus52.054.1452.05\pm 4.1452.05 ± 4.14
DPL [1] 55.69±2.13plus-or-minus55.692.1355.69\pm 2.1355.69 ± 2.13 51.60±2.79plus-or-minus51.602.7951.60\pm 2.7951.60 ± 2.79 53.65±2.46plus-or-minus53.652.4653.65\pm 2.4653.65 ± 2.46 51.07±4.39plus-or-minus51.074.3951.07\pm 4.3951.07 ± 4.39 51.39±4.59plus-or-minus51.394.5951.39\pm 4.5951.39 ± 4.59 51.23±4.49plus-or-minus51.234.4951.23\pm 4.4951.23 ± 4.49
CBMT [20] 56.79±1.42plus-or-minus56.791.4256.79\pm 1.4256.79 ± 1.42 51.99±2.67plus-or-minus51.992.6751.99\pm 2.6751.99 ± 2.67 54.39±2.05plus-or-minus54.392.0554.39\pm 2.0554.39 ± 2.05 51.25±2.84plus-or-minus51.252.8451.25\pm 2.8451.25 ± 2.84 52.34±4.68plus-or-minus52.344.6852.34\pm 4.6852.34 ± 4.68 51.79±3.76plus-or-minus51.793.7651.79\pm 3.7651.79 ± 3.76
CPR [7] 57.21±1.93plus-or-minus57.211.9357.21\pm 1.9357.21 ± 1.93 52.35±2.82plus-or-minus52.352.8252.35\pm 2.8252.35 ± 2.82 54.78±2.38plus-or-minus54.782.3854.78\pm 2.3854.78 ± 2.38 50.88±3.67plus-or-minus50.883.6750.88\pm 3.6750.88 ± 3.67 55.33±4.73plus-or-minus55.334.7355.33\pm 4.7355.33 ± 4.73 53.11±4.20plus-or-minus53.114.2053.11\pm 4.2053.11 ± 4.20
Adversarial [22] 57.99±1.79plus-or-minus57.991.7957.99\pm 1.7957.99 ± 1.79 52.39±2.64plus-or-minus52.392.6452.39\pm 2.6452.39 ± 2.64 55.19±2.22plus-or-minus55.192.2255.19\pm 2.2255.19 ± 2.22 53.27±3.27plus-or-minus53.273.2753.27\pm 3.2753.27 ± 3.27 55.60±4.31plus-or-minus55.604.3155.60\pm 4.3155.60 ± 4.31 54.44±3.79plus-or-minus54.443.7954.44\pm 3.7954.44 ± 3.79
AADA [19] 58.03±1.54plus-or-minus58.031.5458.03\pm 1.5458.03 ± 1.54 52.43±2.70plus-or-minus52.432.7052.43\pm 2.7052.43 ± 2.70 55.23±2.12plus-or-minus55.232.1255.23\pm 2.1255.23 ± 2.12 53.99±3.00plus-or-minus53.993.0053.99\pm 3.0053.99 ± 3.00 54.63±4.88plus-or-minus54.634.8854.63\pm 4.8854.63 ± 4.88 54.31±3.94plus-or-minus54.313.9454.31\pm 3.9454.31 ± 3.94
MHPL [24] 58.38±1.31plus-or-minus58.381.3158.38\pm 1.3158.38 ± 1.31 52.72±2.89plus-or-minus52.722.8952.72\pm 2.8952.72 ± 2.89 55.55±2.10plus-or-minus55.552.1055.55\pm 2.1055.55 ± 2.10 55.00±2.74plus-or-minus55.002.7455.00\pm 2.7455.00 ± 2.74 56.91±5.00plus-or-minus56.915.0056.91\pm 5.0056.91 ± 5.00 55.96±3.87plus-or-minus55.963.8755.96\pm 3.8755.96 ± 3.87
STDR [25] 58.58±1.64plus-or-minus58.581.6458.58\pm 1.6458.58 ± 1.64 53.07±2.67plus-or-minus53.072.6753.07\pm 2.6753.07 ± 2.67 55.83±2.16plus-or-minus55.832.1655.83\pm 2.1655.83 ± 2.16 55.08±2.97plus-or-minus55.082.9755.08\pm 2.9755.08 ± 2.97 56.56±4.78plus-or-minus56.564.7856.56\pm 4.7856.56 ± 4.78 55.82±3.88plus-or-minus55.823.8855.82\pm 3.8855.82 ± 3.88
Ours 59.94±0.99plus-or-minus59.940.9959.94\pm 0.9959.94 ± 0.99 54.09±2.67plus-or-minus54.092.6754.09\pm 2.6754.09 ± 2.67 57.02±1.83plus-or-minus57.021.8357.02\pm 1.8357.02 ± 1.83 57.82±3.06plus-or-minus57.823.0657.82\pm 3.0657.82 ± 3.06 57.95±5.15plus-or-minus57.955.1557.95\pm 5.1557.95 ± 5.15 57.89±4.10plus-or-minus57.894.1057.89\pm 4.1057.89 ± 4.10
Table 4: Ablation experiments on the MU-VS dataset with Dice and IoU.
Patch-based Methods Dice (mean±plus-or-minus\pm±std, %) IoU (mean±plus-or-minus\pm±std, %)
Methods Random C1 C2 Center A Center B Overall Center A Center B Overall
M1 \checkmark - - 58.17±1.89plus-or-minus58.171.8958.17\pm 1.8958.17 ± 1.89 52.87±2.78plus-or-minus52.872.7852.87\pm 2.7852.87 ± 2.78 55.52±2.34plus-or-minus55.522.3455.52\pm 2.3455.52 ± 2.34 41.04±1.86plus-or-minus41.041.8641.04\pm 1.8641.04 ± 1.86 35.99±2.56plus-or-minus35.992.5635.99\pm 2.5635.99 ± 2.56 38.52±2.21plus-or-minus38.522.2138.52\pm 2.2138.52 ± 2.21
M2 - \checkmark - 59.13±1.42plus-or-minus59.131.4259.13\pm 1.4259.13 ± 1.42 53.52±2.82plus-or-minus53.522.8253.52\pm 2.8253.52 ± 2.82 56.33±2.12plus-or-minus56.332.1256.33\pm 2.1256.33 ± 2.12 41.99±1.43plus-or-minus41.991.4341.99\pm 1.4341.99 ± 1.43 36.58±2.62plus-or-minus36.582.6236.58\pm 2.6236.58 ± 2.62 39.29±2.03plus-or-minus39.292.0339.29\pm 2.0339.29 ± 2.03
Ours - \checkmark \checkmark 60.92±0.94plus-or-minus60.920.9460.92\pm 0.9460.92 ± 0.94 54.92±2.81plus-or-minus54.922.8154.92\pm 2.8154.92 ± 2.81 57.92±1.88plus-or-minus57.921.8857.92\pm 1.8857.92 ± 1.88 43.81±0.98plus-or-minus43.810.9843.81\pm 0.9843.81 ± 0.98 37.91±2.70plus-or-minus37.912.7037.91\pm 2.7037.91 ± 2.70 40.86±1.84plus-or-minus40.861.8440.86\pm 1.8440.86 ± 1.84
Table 5: Ablation experiments on the MU-VS dataset with MCC and BM.
Patch-based Methods MCC (mean±plus-or-minus\pm±std, %) BM (mean±plus-or-minus\pm±std, %)
Methods Random C1 C2 Center A Center B Overall Center A Center B Overall
M1 \checkmark - - 57.46±1.88plus-or-minus57.461.8857.46\pm 1.8857.46 ± 1.88 51.93±2.66plus-or-minus51.932.6651.93\pm 2.6651.93 ± 2.66 54.69±2.27plus-or-minus54.692.2754.69\pm 2.2754.69 ± 2.27 51.97±3.51plus-or-minus51.973.5151.97\pm 3.5151.97 ± 3.51 54.86±4.80plus-or-minus54.864.8054.86\pm 4.8054.86 ± 4.80 53.42±4.15plus-or-minus53.424.1553.42\pm 4.1553.42 ± 4.15
M2 - \checkmark - 58.17±1.52plus-or-minus58.171.5258.17\pm 1.5258.17 ± 1.52 52.68±2.66plus-or-minus52.682.6652.68\pm 2.6652.68 ± 2.66 55.42±2.09plus-or-minus55.422.0955.42\pm 2.0955.42 ± 2.09 54.93±3.03plus-or-minus54.933.0354.93\pm 3.0354.93 ± 3.03 56.93±4.64plus-or-minus56.934.6456.93\pm 4.6456.93 ± 4.64 55.93±3.83plus-or-minus55.933.8355.93\pm 3.8355.93 ± 3.83
Ours - \checkmark \checkmark 59.94±0.99plus-or-minus59.940.9959.94\pm 0.9959.94 ± 0.99 54.09±2.67plus-or-minus54.092.6754.09\pm 2.6754.09 ± 2.67 57.02±1.83plus-or-minus57.021.8357.02\pm 1.8357.02 ± 1.83 57.82±3.06plus-or-minus57.823.0657.82\pm 3.0657.82 ± 3.06 57.95±5.15plus-or-minus57.955.1557.95\pm 5.1557.95 ± 5.15 57.89±4.10plus-or-minus57.894.1057.89\pm 4.1057.89 ± 4.10
Refer to caption
Figure 3: Visual comparisons of our method and other leading domain adaptation and active learning methods. Our method can more accurately segment the vessels (please zoom in for more details).

Domain adaptation analysis. We benchmark our method against the latest state-of-the-art domain adaptation techniques under identical backbone architectures and experimental conditions. This comparison encompasses methods requiring access to source data, such as AdvEnt [23], and source-free approaches like DPL [1], CBMT [20], and CPR [7]. The experimental outcomes indicate that various methods have led to improvements in accuracy. As illustrated in Table 2, the overall Dice scores for these techniques vary from 54.12% to 55.54%, surpassing the lower bound of 53.04%. However, perhaps due to the absence of supervised training with real labels, these improvements are relatively limited, and our method achieved 57.92% in this indicator with few labels (5%).

Active learning analysis. Given that SFADA incorporates elements of active learning, we also compare our approach with the recent leading active learning methods, all evaluated under the same experimental setup with 5% labeled data. This comparison includes methods such as Adversarial [22], AADA [19], MHPL [24], and STDR [25], ensuring a comprehensive analysis under uniform conditions. Merging the data from Table 2 and Table 3 reveals that, overall, active learning approaches outperform domain adaptation methods. Notably, our strategy yields the highest scores across all four metrics, underscoring the efficacy of our patch-based approach augmented by the CUP selection strategy.

Ablation Studies. To verify the effectiveness of our method, we conduct corresponding ablation experiments (the results are shown in Table 4 and Table 5), including three configurations as follows: (1) M1: randomly select patches combined with our patch-based framework. (2) M2: based on C1𝐶1C1italic_C 1 uncertainty, the top 5% patches are selected for annotation and then integrated into the overall framework. (3) Ours: performing cascade selection, first select C1𝐶1C1italic_C 1 uncertainty and then C2𝐶2C2italic_C 2 predominance, named the CUP strategy. The experimental results show that M2 outperforms M1 overall, while Ours achieves greater performance gains compared to M2, e.g., the overall Dice from 55.52% to 56.33%, and finally to 57.92% in Table 4, which proves the effectiveness of our CUP strategy.

4 Conclusion

In this paper, we explore the task of vessel segmentation of UWF-SLO images across different centers. Considering the high cost of labeling high-resolution UWF-SLO images, we propose a patch-based SFADA approach to significantly save labeling resources while boosting segmentation performance. We also devise a CUP strategy to cascade the selection of valuable patches with high uncertainty and dominance for annotation. Meanwhile, we construct the first public multi-center UWF-SLO vessel segmentation (MU-VS) dataset to facilitate related research. Experimental results demonstrate that our method achieves optimal results compared to other domain adaptation and active learning methods. In the future, we plan to extend our method to other similar high-resolution medical image segmentation tasks.

References

  • [1] Chen, C., Liu, Q., Jin, Y., Dou, Q., Heng, P.A.: Source-free domain adaptive fundus image segmentation with denoised pseudo-labeling. In: MICCAI. pp. 225–235. Springer (2021)
  • [2] Ding, L., Kuriyan, A.E., Ramchandran, R.S., Wykoff, C.C., Sharma, G.: Weakly-supervised vessel detection in ultra-widefield fundus photography via iterative multi-modal registration and learning. IEEE Transactions on Medical Imaging 40(10), 2748–2758 (2020)
  • [3] Fraz, M.M., Remagnino, P., Hoppe, A., Uyyanonvara, B., Rudnicka, A.R., Owen, C.G., Barman, S.A.: Blood vessel segmentation methodologies in retinal images–a survey. Computer methods and programs in biomedicine 108(1), 407–433 (2012)
  • [4] Galdran, A., Costa, P., Bria, A., Araújo, T., Mendonça, A.M., Campilho, A.: A no-reference quality metric for retinal vessel tree segmentation. In: International conference on medical image computing and computer-assisted intervention. pp. 82–90. Springer (2018)
  • [5] Guan, H., Liu, M.: Domain adaptation for medical image analysis: a survey. IEEE Transactions on Biomedical Engineering 69(3), 1173–1185 (2021)
  • [6] Hoover, A., Kouznetsova, V., Goldbaum, M.: Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response. IEEE Transactions on Medical imaging 19(3), 203–210 (2000)
  • [7] Huai, Z., Ding, X., Li, Y., Li, X.: Context-aware pseudo-label refinement for source-free domain adaptive fundus image segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 618–628. Springer (2023)
  • [8] Kumari, S., Singh, P.: Deep learning for unsupervised domain adaptation in medical imaging: Recent advancements and future perspectives. Computers in Biology and Medicine p. 107912 (2023)
  • [9] Li, X., Hao, H., Fu, H., Zhang, D., Chen, D., Qiao, Y., Liu, J., Zhao, Y., Zhang, J.: Privileged modality guided network for retinal vessel segmentation in ultra-wide-field images. In: International Workshop on Ophthalmic Medical Image Analysis. pp. 82–91. Springer (2023)
  • [10] Menten, M.J., Paetzold, J.C., Dima, A., Menze, B.H., Knier, B., Rueckert, D.: Physiology-based simulation of the retinal vasculature enables annotation-free segmentation of oct angiographs. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 330–340. Springer (2022)
  • [11] Mookiah, M.R.K., Hogg, S., MacGillivray, T.J., Prathiba, V., Pradeepa, R., Mohan, V., Anjana, R.M., Doney, A.S., Palmer, C.N., Trucco, E.: A review of machine learning methods for retinal blood vessel segmentation and artery/vein classification. Medical Image Analysis 68, 101905 (2021)
  • [12] Nagiel, A., Lalane, R.A., Sadda, S.R., Schwartz, S.D.: Ultra-widefield fundus imaging: a review of clinical applications and future trends. Retina 36(4), 660–678 (2016)
  • [13] Niu, Z., Ouyang, S., Xie, S., Chen, Y.w., Lin, L.: A survey on domain generalization for medical image analysis. arXiv preprint arXiv:2402.05035 (2024)
  • [14] Pellegrini, E., Robertson, G., Trucco, E., MacGillivray, T.J., Lupascu, C., van Hemert, J., Williams, M.C., Newby, D.E., van Beek, E.J., Houston, G.: Blood vessel segmentation and width estimation in ultra-wide field scanning laser ophthalmoscopy. Biomedical optics express 5(12), 4329–4337 (2014)
  • [15] Qiu, Z., Hu, Y., Chen, X., Zeng, D., Hu, Q., Liu, J.: Rethinking dual-stream super-resolution semantic learning in medical image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence (2023)
  • [16] Rogers, S., McIntosh, R.L., Cheung, N., Lim, L., Wang, J.J., Mitchell, P., Kowalski, J.W., Nguyen, H., Wong, T.Y., Consortium, I.E.D., et al.: The prevalence of retinal vein occlusion: pooled data from population studies from the united states, europe, asia, and australia. Ophthalmology 117(2), 313–319 (2010)
  • [17] Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: MICCAI. pp. 234–241 (2015)
  • [18] Staal, J., Abràmoff, M.D., Niemeijer, M., Viergever, M.A., Van Ginneken, B.: Ridge-based vessel segmentation in color images of the retina. IEEE transactions on medical imaging 23(4), 501–509 (2004)
  • [19] Su, J.C., Tsai, Y.H., Sohn, K., Liu, B., Maji, S., Chandraker, M.: Active adversarial domain adaptation. In: WACV. pp. 739–748 (2020)
  • [20] Tang, L., Li, K., He, C., Zhang, Y., Li, X.: Source-free domain adaptive fundus image segmentation with class-balanced mean teacher. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 684–694. Springer (2023)
  • [21] Tang, Q.Q., Yang, X.G., Wang, H.Q., Wu, D.W., Zhang, M.X.: Applications of deep learning for detecting ophthalmic diseases with ultrawide-field fundus images. International Journal of Ophthalmology 17(1),  188 (2024)
  • [22] Tsai, Y.H., Hung, W.C., Schulter, S., Sohn, K., Yang, M.H., Chandraker, M.: Learning to adapt structured output space for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 7472–7481 (2018)
  • [23] Vu, T.H., Jain, H., Bucher, M., Cord, M., Pérez, P.: Advent: Adversarial entropy minimization for domain adaptation in semantic segmentation. In: CVPR. pp. 2517–2526 (2019)
  • [24] Wang, F., Han, Z., Zhang, Z., He, R., Yin, Y.: Mhpl: Minimum happy points learning for active source free domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 20008–20018 (2023)
  • [25] Wang, H., Chen, J., Zhang, S., He, Y., Xu, J., Wu, M., He, J., Liao, W., Luo, X.: Dual-reference source-free active domain adaptation for nasopharyngeal carcinoma tumor segmentation across multiple hospitals. IEEE Transactions on Medical Imaging (2024)
  • [26] Wang, H., Zhang, S., Luo, X., Liao, W., Zhu, L.: Advancing delineation of gross tumor volume based on magnetic resonance imaging by performing source-free domain adaptation in nasopharyngeal carcinoma. In: International Workshop on Computational Mathematics Modeling in Cancer Analysis. pp. 71–80. Springer (2023)
  • [27] Xing, Z., Wan, L., Fu, H., Yang, G., Zhu, L.: Diff-unet: A diffusion embedded network for volumetric segmentation. arXiv preprint arXiv:2303.10326 (2023)
  • [28] Xing, Z., Ye, T., Yang, Y., Liu, G., Zhu, L.: Segmamba: Long-range sequential modeling mamba for 3d medical image segmentation. arXiv preprint arXiv:2401.13560 (2024)
  • [29] Xu, R., Liu, T., Ye, X., Lin, L., Chen, Y.W.: Boosting connectivity in retinal vessel segmentation via a recursive semantics-guided network. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 786–795. Springer (2020)
  • [30] Zhang, J., Dashtbozorg, B., Bekkers, E., Pluim, J.P., Duits, R., ter Haar Romeny, B.M.: Robust retinal vessel segmentation via locally adaptive derivative frames in orientation scores. IEEE transactions on medical imaging 35(12), 2631–2644 (2016)
  • [31] Zhang, N., Lu, J., Li, K., Fang, Z., Zhang, G.: Source-free unsupervised domain adaptation: Current research and future directions. Neurocomputing p. 126921 (2023)
  • [32] Zhao, H., Dong, W., Yu, R., Zhao, Z., Bo, D., Xu, Y.: Morestyle: Relax low-frequency constraint of fourier-based image reconstruction in generalizable medical image segmentation. arXiv preprint arXiv:2403.11689 (2024)