Accelerated Proton Resonance Frequency-based Magnetic Resonance Thermometry by Optimized Deep Learning Method

Sijie Xu^1,†
sijie.x@sjtu.edu
&Shenyan Zong^2,†
shenyanzong@fudan.edu.cn
&Chang-Sheng Mei^3,4
mei.changsheng@gmail.com
&Guofeng Shen^1,∗
shenguofeng@sjtu.edu.cn
&Yueran Zhao¹
shenguofeng@sjtu.edu.cn
&He Wang^2,5,∗
shenguofeng@sjtu.edu.cn

Abstract

Background: Proton resonance frequency (PRF)–based magnetic resonance (MR) thermometry is essential in thermal ablation therapies through focused ultrasound (FUS). The clinical treatments require temperature feedback must be rapid and accurate. Purpose: This work aims to enhance temporal resolution in dynamic MR temperature map reconstruction with an improved deep learning method, to ensure the safety and effectiveness of FUS treatments.

Methods: The training-optimized methods and five classical neural networks were applied on the 2-fold and 4-fold under-sampling k-space data to reconstruct the temperature maps. The used neural networks were cascade net, complex valued U-Net, shift window transformer for MRI, real valued U-Net and U-Net with residual block. The enhanced training modules included offline/online data augmentations, knowledge distillation, and the amplitude-phase decoupling loss function. The heating experiments were performed by a FUS transducer on phantom and ex vivo tissues, respectively. In datasets, the ground-truth was the complex MR images with accurate temperature increases. These data were also manually under-sampled to imitate acceleration procedures and trained in our method to get the reconstruction model. The additional dozen or so testing datasets were separately obtained for evaluating the real-time performance and temperature accuracy.

Results: Acceleration factors of 1.9 and 3.7 were found for $2\times$ and $4\times$ k-space under-sampling strategies and the ResUNet-based deep learning reconstruction performed exceptionally well. In 2-fold acceleration scenario, the RMSE of temperature map patches provided the values of 0.888 ℃ and 1.145 ℃ on phantom and ex vivo testing datasets. The DICE value of temperature areas enclosed by $43$ ℃ isotherm was 0.809, and the Bland-Altman analysis showed a bias of $-0.253$ ℃ with the apart of $\pm 2.16$ ℃. In $4\times$ under-sampling case, these evaluating values decreased by approximately 10%.

Conclusion: This study demonstrates that the application of deep learning-based reconstruction significantly enhances the accuracy and efficiency of MR thermometry, particularly benefiting the clinical thermal therapies for uterine fibroid, essential tremor, and prostate cancer by FUS.

The source code for our optimizing methods and neural networks is available at: https://github.com/minipuding/FastMRT.

¹Biomedical Instrument Institute, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai.
²Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai.
³Department of Radiology, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachu-setts.
⁴Department of Physics, Soochow University, Taipei
⁵Department of Radiology, Shanghai Fourth People’s Hospital Affiliated to Tongji University School of Medicine, Shanghai
^†Co-First Author, ^∗Corresponding Author

1 Introduction

Magnetic resonance (MR) thermometry is widely used in noninvasive surgical treatments by focused ultrasound (FUS)[1]. However, achieving real-time temperature measurement that preserves temperature information using magnetic resonance imaging (MRI) is highly challenging due to the underlying imaging principles. Such surgeries require the continuous acquisition of approximately ten frames for each ablation, with each frame taking approximately two seconds. A complete uterine fibroid ablation procedure typically lasts between two and four hours, with a considerable portion of the time dedicated to image acquisition. The use of temperature monitoring intervals lasting seconds still poses a certain degree of safety risk to patients[2]. Throughout the procedure, patients must remain motionless to prevent any hazardous circumstances, which can be highly distressing for them.

Fast imaging based on the reconstruction of under-sampled magnetic resonance images has been in development for several years and has shown great potential to significantly increase the speed of temperature measurement. Most studies in the area of MRI temperature measurement use conventional methods, such as parallel imaging and compress sensing[3, 4]. The accelerated MR thermometry through coil-sensitivity encoding was generally accompanied by a decrease in temperature accuracy[5]. Besides that, the reduced field of view (FOV) method might be an alternative for fast temperature measurements, but the absence of a full-FOV monitor increased the risk of ablation treatments. More advanced non-cartesian readout strategies such as spiral and radial MR thermometry proposed by Kisoo and Pooja et al, can achieve volumetric and motion-immune temperature measurements with a temporal resolution of 100 300 ms for each slice[6, 7]. However, our novel deep learning-based rapid reconstruction method was non-conflicting with these sampling strategies. The spiral and radial k-space data can also be under-sampled to reach higher imaging speeds, and accelerated imaging via reconstruction in this study was also applicable. The work to be done is to train a specialized reconstruction model for them through our proposed method. In addition, the rapid echo planar imaging (EPI) sequence was validated for temperature measurements by Andrew and Henrik et al[8, 9]. Nevertheless, the segmented or single-shot EPI was always vulnerable to B0 inhomogeneities. The irresistible susceptibility led to a significant reduction in the clinical acceptability of using this sequence for temperature measurements[10, 11]. Furthermore, the temperature increase-induced focus shift remains a difficult and not well-resolved problem. In contrast, the rapid reconstruction method we proposed for cartesian-based gradient echo sequences was more robust.

Since the inception of the fast MRI challenge[12, 13], numerous deep learning-based MRI reconstruction techniques[14, 15] have been proposed. Following the emergence of Vision Transformer (ViT)[16], several transformer-based reconstruction methods have also been proposed[17, 18, 19]. However, magnetic resonance thermometry encounters two major issues when attempting to apply existing fast imaging algorithms. Firstly, the most commonly used temperature measurement method, proton resonance frequency (PRF) shift[20], relies on the phase discrepancy of complex images. However, current fast MRI methods primarily focus on reconstructing amplitude images, with little emphasis on phase[21]. As a result, these methods are suboptimal for preserving phase information in magnetic resonance temperature measurement, lacking specific design or improvement for this purpose. Secondly, current undersampling reconstruction methods prioritize image quality restoration over time preservation, leading to increasingly larger and more complex models with insufficient attention to the impact of model inference time on actual acceleration rates[22]. Furthermore, insufficient datasets may result in limited model’s performance due to overfitting and underutilization.

This work improves the performance of deep learning by adopting network structure-independent methods without increasing the number of network parameters or computational complexity to achieve fast and accurate measurement. The proposed approach involves several techniques to improve the performance of neural network models; it utilizes offline diffusion model augmentation, online complex-valued data augmentation techniques, knowledge distillation, and an amplitude-phase decoupled loss function. The first two modules are utilized for data augmentation to prevent overfitting and unleash the potential of the model. The knowledge distillation module enables a smaller model to learn capabilities several times greater than its parameter capacity. The decoupled loss function separates amplitude and phase differences, allowing the model to adjust weights and focus on the image phase. Based on these training strategies, the cascade net (CasNet)[23], complex valued U-Net (CUNet)[24], shift window transformer for MRI (SwinMR)[19, 25, 26], real valued U-Net (RUNet)[27] and U-Net with residual block (ResUNet) were involved in this deep learning method to improve the speed of MR temperature measurements.

2 Theories

2.1 MR Reconstruction via Deep Learning

The speed of MRI is determined by the number of sampled k-space lines for a regular gradient echo sequence, and the acceleration can be achieved through reducing phase encoding numbers. In the case of single-channel MRI signal sampling, it can be mathematically represented by the formula 1:

y=\mathcal{M}\cdot\mathscr{F}(x)+\epsilon

(1)

where $x\in C^{(N_{1}\times N_{2})}$ denotes the MR images reconstructed from fully sampled k-space, while $\mathcal{M}\in C^{(N_{1}\times N_{2})}$ represents the mask lines selected from the phase encoding direction of $y\in C^{(N_{1}\times N_{2})}$ , and $\mathscr{F}(\cdot)$ is the Fourier transform.

A typical function for estimating the MR image $x$ from measurements is given by:

x=\mathop{\arg\min}\limits_{x}||y-\mathcal{M}\cdot\mathscr{F}(x)||_{2}^{2}+% \lambda\cdot R(x)

(2)

where $R(x)$ denotes the regularizer, which is dependent on the reconstruction algorithm used. For deep learning training processes, a function $G_{DL}(\cdot)$ can be utilized as the regularizer:

\begin{split}\hat{x}=arg\min\limits_{x}||y-\mathcal{M}\cdot\mathscr{F}(x)||_{2% }^{2}+\lambda R(x,\theta^{*}),\\ where\ \theta^{*}=arg\min\limits_{\theta}E||x-G_{DL}(\mathcal{M}\cdot\mathscr{% F}(x);\theta)||,x\sim S\end{split}

(3)

where $S$ is the dataset and $x$ is the complex image sampled from $S$ . We train a model to minimize the expected difference between sampled and fully sampled images[15].

2.2 Proton Resonance Frequency Shift

At present, proton resonance frequency (PRF) shift thermometry is a widely used technique for temperature measurement via MRI. PRF shift thermometry shows a persistent linear correlation with temperature and is largely tissue-type agnostic (excluding adipose tissue), while providing a simple and robust real-time measurement method through the regular sequences. To determine temperature changes it calculates the phase difference between the magnetic resonance images with heating and the baseline images. The temperature alteration can be expressed as a linear function of the phase difference, as shown by formula 4:

\Delta T=\frac{\phi-\phi_{ref}}{\alpha\cdot\gamma\cdot t_{TE}\cdot B_{0}}

(4)

where $\phi$ represents the phase of the current image, $\phi_{ref}$ represents the phase of the image acquired at time 0, $\alpha$ denotes the PRF (Proton Resonance Frequency) change coefficient of water tissue, which is -0.01 ppm/℃, $\gamma$ represents the magnetic moment ratio of hydrogen atoms, $t_{TE}$ denotes the echo time, and $B_{0}$ represents the main magnetic field strength.

2.3 Actual Acceleration Ration

As stated above, it was assumed that the under-sampling rate represented the time saved, without considering the inference time required by the reconstruction algorithm itself. However, for magnetic resonance temperature measurement tasks, we need real-time imaging. Therefore, it is essential to compute the effective acceleration rate of the reconstruction model, which is defined as : reconstruction model. We define it as follows:

E_{N=n}=\frac{t_{a}}{\frac{t_{a}}{n}+t_{m}}=\frac{n\cdot t_{a}}{t_{a}+n\cdot t% _{m}}

(5)

where $t_{a}$ denotes the acquisition time of the fully sampled image, $t_{m}$ , $E_{N=n}$ and $n$ represent the inference time of the model, the effective acceleration rate, and the theoretical acceleration rate, respectively. When computing this metric in practice, we approximate $t_{a}$ by $t_{TR}\times num_{pe}$ where the $num_{pe}$ denotes the number of phase encoding and we obtain $t_{m}$ via the model’s CPU forward inference time.

3 Methods

3.1 Deep Learning Training and Models

Refer to caption — Figure 1: Algorithm Structure Diagram showcasing four optimizing modules: offline diffusion augments (DA), online complex augments (CA), knowledge distillation (KD), and decoupled loss (DL). The DA module employs a trained diffusion model to generate new data samples offline, thereby enhancing data diversity and complexity. The CA module combines augmented amplitude and phase maps of complex data. The KD module extracts knowledge from a larger pretrained teacher model and transfers it to a smaller model, thereby enhancing performance using a compact model. The teacher model is pretrained from FastMRI dataset and fine-tuned on our dataset. The DL module separates the amplitude and phase components of a signal, assigning distinct weights to each, to enhance the reconstruction capability of the phase component. The study incorporates five classical models.

As depicted in Figure 1, the proposed deep learning method incorporates data augments, teacher model, decoupled loss function, and classical network models. The offline diffusion augment and online data augment were able to expand the amount of MR images obtained in the heating experiments. This preprocessing procedure was designed to improve the performance of network model. Furthermore, the cascade net, complex net, swin-transformer, real-unet, and resunet were used in the training to get five different reconstruction models. The decoupling loss function was modified to adapt complex-valued MR images used for temperature measurements. During the training process, we leverage a sizable teacher model pre-trained on the FastMRI dataset and further fine-tuned on our own dataset. This teacher model serves as a guiding influence for the student model, enabling us to achieve a compact model with a performance comparable to that of the teacher model.

3.1.1 Offline Diffusion Augmentations

In medical image-related tasks, there is a growing trend toward the adoption of model-based data augmentation techniques[28]. As demonstrated by Trabucco Brandon et al, the diffusion model is a more effective means of generating diverse and realistic images[29]. Therefore, the diffusion model called Denoising Diffusion Probabilistic Model (DDPM)[30] was utilized here to generate a substantial amount of similar data for our training process. Additionally, we set the time step to 600 and increased the amount of data by a factor of five for the phantom and ex vivo sub-datasets, respectively. Since the diffusion model requires a considerable amount of time for processing, we opted for an offline method to generate augmented data before the training phase.

3.1.2 Online Complex Augmentations

Before inputting data into the model, we perform conventional data augmentation, which involves random cropping, flipping, rotation (0°, 90°, 180°, 270°), and Gaussian blurring. We would like to high-light that we extended real-valued data augmentation to complex-valued data by separately augmenting the magnitude and phase of complex images before combining them. This approach significantly increases data diversity by orders of squares, thereby enhancing the model’s robustness and generalizability. To avoid introducing undesirable bias into the model training due to the disruption of spatial consistency between magnitude and phase, we apply complex-valued data augmentation with a specific probability. We set the optimal probability for triggering complex-valued data augmentation to 0.3.

3.1.3 Base Models

To achieve faster temperature map reconstruction, we implemented our proposed method on the Naive-Real-UNet (RUNet) and ResUNet, which has state-of-the-art (SOTA) performance and is lightweight, making it an ideal choice for MR temperature map reconstruction. Compared to RUNet, ResUNet replaces ordinary convolutional layers with residual blocks and adds self-attention modules, which can be considered an improved version of RUNet. We also compared our method with several SOTA MR reconstruction methods, such as the cascade network with data consistency (CasNet)[23], complex-valued convolutional network with UNet structure (CUNet)[24, 31], Swin-Transformer (SwinMR)[19], and the RUNet and ResUNet without any structure modifications.

3.1.4 Knowledge Distillation

Knowledge distillation[32] is a technique that can accelerate inference times by transferring the knowledge learned by a larger and more complex model to a smaller and simpler one while preserving or even improving performance[33, 34]. Larger models usually exhibit stronger generalization capabilities, but their inference speeds may be lower. Thus, the knowledge acquired by the larger “teacher” model network is transferred to the smaller “student” model in the form of soft labels. Initially, we pre-train a teacher model with an identical structure to that of the student model and extend the channels by a factor of four using both online and offline augmentation techniques. To ensure comprehensive training of the teacher network, we adopt a two-step approach. We perform pre-training on the FastMRI dataset, followed by full parameter fine-tuning on our research dataset. During each forward pass of student model, the output is compared to both the ground truth and the soft labels generated by the pre-trained teacher model. The losses are then weighted and calculated accordingly. The weights are dynamically adjusted to decay over time during training so that the student model can primarily learn from the teacher network in the initial stages and gradually transition to learning from the ground truth. The loss function for knowledge distillation is:

\begin{split}L_{total}=(1-w)\cdot L_{gt}+w\cdot L_{soft},\\ where\ w=(1-\frac{E_{curr}}{E_{total}})\cdot\gamma\end{split}

(6)

where $l_{gt}$ and $l_{soft}$ denote the loss values calculated with the ground truth and the soft labels generated by the teacher network, respectively; $E_{curr}$ and $E_{total}$ denote the current epoch and the total number of epochs, respectively; $\gamma$ is the only hyperparameter that adjusts the weight of the teacher network guidance.

3.1.5 Decoupled Loss

Our investigation revealed that most reconstruction algorithms rely on the L1 loss function[35, 36] which may not be optimal for temperature measurement tasks that emphasize phases. More specifically, the L1 loss function tends to couple the magnitude and phase, resulting in identical loss values that correspond to different phases. Consequently, it becomes challenging to specifically optimize the phase component.

Therefore, we attempted to decouple the loss function, as illustrated in Figure 2. We partitioned the loss into two components: magnitude loss (computed as the absolute error of the amplitude values) and phase loss (computed as the error in radians), with the former quantifying the difference in magnitude and the latter quantifying the phase difference. In contrast to the decoupled loss functions proposed by Zhang et al.[37] and other researchers, our decoupled loss function is simpler and has a more straightforward geometric interpretation:

loss_{dc}=d+\alpha\cdot l=||\hat{y}|-|y||+\alpha\cdot|\hat{y}|\cdot\mathcal{A}% (\hat{y}\times\bar{y})

(7)

where $\mathcal{A}$ represents the angle calculation function and $\alpha$ is a parameter that controls the degree of bias applied to the phase loss. The variables $y$ , $\hat{y}$ , and $\bar{y}$ represent the predicted complex output, ground truth, and the complex conjugate of $y$ , respectively. In some other fields related to phase maps, different types of specialized phase loss functions are used[37, 38], but the rationale for their design and mathematical basis are not provided.

3.2 Heating Experiments and Implementation

3.2.1 FUS heating

Our dataset was acquired through the application of a 128-element high-intensity focused ultrasound transducer (with a frequency of $1.1MHz$ , a focal length of $150mm$ , and a focal radius of $120mm$ ), followed by imaging with a 3T MR system (Discovery MR750; GE Healthcare, Milwaukee, WI). The images were obtained using the Fast Spoiled Gradient Echo (FSPGR) sequence, with 96 phase encoding steps, a TR/TE of $12/16ms$ , a flip angle of $30^{\circ}$ , a slice thickness of $3mm$ , a field of view (FOV) of $28\times 28cm^{2}$ , a Number of Excitation (NEX) of 1, and a bandwidth of $\pm 62.5kHz$ . The dataset comprises two distinct parts: phantom heating data and ex vivo heating data. For each part, there are 96 heating samples (consisting of 2186 slices) and 105 samples (consisting of 1623 slices), respectively, with each sample containing either one or three layers. The temperature change at the focus was approximately 30 degrees Celsius, and the focus position was consistently located at the center of the image. To enhance the speed of temperature measurement, we employed a smaller TR and fewer phase encoding steps, which led to a lower signal-to-noise ratio and resolution of the acquired images. This underscores the significance of utilizing fast temperature measurement algorithms to compensate for the reduced image quality.

3.2.2 Model Metrics

Temperature Metrics.

After deriving the PRF temperature map from the complex images generated by the model and the reference images, we obtain a common metric that characterizes the reconstruction error of the entire image by calculating the average pixel-wise error compared to the original temperature map (represented as $T_{err}$ ). However, as we use the HIFU device to focus heat on a very small area, only a small region undergoes significant temperature changes. Therefore, it is also necessary to consider local metrics for the heating focus.

Specifically, we evaluate the temperature using metrics such as root mean square error (RMSE), standard deviation (STD), and Dice coefficient (DICE). These metrics are calculated within a pixel block that is cropped around the focal area, covering one-fourth of the width and height of the image. Furthermore, we also assess the agreement between the reconstructed temperature values and the reference values using Bland-Altman analysis and examine the linear relationship between these two sets of values using linear regression analysis, both of which are commonly used in previous related works[10]. The temperature map is calculated from the phase difference, and noise can be present in areas with very low signal intensity. To ensure accurate evaluation of the temperature image, a mask is applied before calculating temperature metrics.

Computation Quantity Metrics.

We evaluate the efficiency of the models by computing their Floating-point Operations per Second (FLOPs), number of parameters (Params), CPU inference time (CPU-T), and effective acceleration ratio ( $E_{N=n}$ ) at a certain undersampling rate, which is calculated using formula 5. It is worth noticing that we place greater emphasis on the effective acceleration ratio, as it can intuitively reflect the acceleration ratio that the model can achieve while considering the model inference time. By combining it with the model’s performance for comparison, we can more effectively assess the cost-effectiveness of each model. Additionally, we estimated the total acquisition time (Cost- $N\times$ ) for magnetic resonance imaging using the $T_{TR}\cdot num_{pe}/N+$ CPU-T formula. As the Fourier inverse transform and PRF temperature measurement have extremely short processing times, they were not included in the time calculation.

3.2.3 Training Computation

We employed a mask like the one used in FastMRI to simulate the undersampling process. Specifically, we fully adopted the low-frequency part and uniformly under-sampled the other high-frequency parts, and the proportion of the fully adopted low-frequency part was set to 15%. We used the AdamW optimizer with a learning rate of 5e-4 and decayed it using a cosine scheduler, and the batch size was set to 8. We conducted experiments separately on both phantom and ex vivo datasets and trained the models for approximately 200 epochs on an NVIDIA RTX A6000.

4 Results

This section presents the experimental results of our study. Firstly, we compare the performance of various deep learning models in the reconstruction of temperature using comparative experiments. Secondly, through ablation experiments, we validate the effectiveness of our proposed method. In addition, we conduct a comprehensive analysis of a long sequence sample, including both time series and consistency analyses. Finally, we demonstrate the resource utilization of different models by presenting parameters and effective acceleration rates.

4.1 Comparison Study

The comparison results under $2\times$ and $4\times$ undersampling on both phantom and ex vivo datasets are presented in Table 1. In our study, we have included the zero-filling (ZF) and compressive sensing (CS) algorithms for comparative analysis. Zero-filling involves filling the under-sampled k-space region with zeros after undersampling, while the compressive sensing algorithm employed is Total Variation Minimization, which makes 200 iterations. It can be observed from the results that the reconstruction performance of RUNet and ResUNet with our optimized methods are superior to that of other methods. In addition, we present the temperature map reconstruction results on specific example samples in Figure 3. It can be observed that the sample using ResUNet+all retains more temperature information in the reconstructed sample, particularly in the ex vivo $4\times$ case, where it can display the heated focal temperature, a feature does not present in the results obtained from other methods.

Table 1: Temperature error metrics of different reconstruction methods on phantom and ex vivo data sets with different undersampling rates

		phantom				ex vivo
N	Net	$T_{err}$	DICE	STD	RMSE	$T_{err}$	DICE	STD	RMSE
$2\times$	zf	0.310	0.402	1.219	1.286	0.542	0.507	2.403	2.751
	CS	0.310	0.597	1.182	1.279	0.498	0.524	2.092	2.493
	CasNet	0.291	0.424	1.194	1.276	0.550	0.48	2.323	2.685
	CUNet	0.304	0.576	1.086	1.120	0.531	0.446	2.397	2.753
	SwinMR	0.259	0.759	0.935	0.950	0.523	0.519	2.218	2.566
	RUNet	0.302	0.669	1.055	1.080	0.559	0.44	2.505	2.876
	ResUNet	0.251	0.732	0.923	0.937	0.482	0.524	2.058	2.364
	RUNet+all	0.258	0.779	0.903	0.915	0.448	0.537	1.909	2.163
	ResUNet+all	0.245	0.809	0.877	0.888	0.429	0.567	1.802	2.045
$4\times$	zf	0.383	0.082	1.557	1.646	0.740	0.335	3.261	3.679
	CS	0.398	0.168	1.635	1.788	0.703	0.336	3.004	3.558
	CasNet	0.341	0.132	1.514	1.586	0.699	0.328	3.012	3.419
	CUNet	0.372	0.221	1.443	1.511	0.770	0.279	3.284	3.690
	SwinMR	0.322	0.466	1.291	1.338	0.754	0.312	3.378	3.820
	RUNet	0.376	0.256	1.484	1.554	0.824	0.272	3.604	4.073
	ResUNet	0.319	0.459	1.272	1.312	0.712	0.329	3.078	3.520
	RUNet+all	0.321	0.488	1.217	1.242	0.635	0.345	2.815	3.167
	ResUNet+all	0.301	0.653	1.126	1.145	0.610	0.365	2.674	3.033

To visually compare the efficacy of various methods for rapid temperature measurement, we calculated the mean temperature error using pixels above 43 ºC within the temperature map patches reconstructed by each model. As an example, we utilized the phantom sub-dataset with $4\times$ undersampling. Subsequently, we generated box plots for all samples in the test set, as depicted in Figure 4. The results indicate that the proposed methods are superior to the ZF method, with the ResUNet+all method demonstrating the lowest average temperature error.

4.2 Ablation Study

To showcase the efficacy of our method, we conducted a series of ablation experiments on the RUNet model using phantom validation datasets that were subjected to $4\times$ undersampling. As shown in Table 2, we incorporated four individual modules (DA for diffusion model augmentation, CA for complex-valued data augmentation, KD for knowledge distillation, and DL for decoupled loss) into the baseline model across all four temperature metrics, resulting in varying degrees of improvement. In addition, we combined the four modules in all possible combinations and generated a $4\times 4$ heat map matrix to visualize the resulting temperature indicators; different combinations of the modules have varying effects on specific indicators, as illustrated in Figure 5. However, as an overall observation, it can be inferred that complex-valued data augmentation had the most substantial contribution.

Table 2: Temperature error metrics of RUNet with different modules on

4\times

undersampled phantom dataset

Modules				phantom				ex vivo
DA	CA	KD	DL	$T_{err}$	DICE	STD	RMSE	$T_{err}$	DICE	STD	RMSE
				0.505	0.327	1.473	1.538	0.822	0.266	2.750	2.971
✓				0.503	0.323	1.448	1.529	0.749	0.231	2.575	2.788
	✓			0.454	0.403	1.346	1.386	0.612	0.311	2.248	2.454
		✓		0.494	0.338	1.428	1.494	0.775	0.266	2.727	2.974
			✓	0.495	0.334	1.438	1.497	0.793	0.224	2.692	2.892
✓	✓	✓	✓	0.447	0.415	1.337	1.382	0.594	0.318	2.227	2.420

4.3 Time-Consuming Study

To evaluate the resource utilization of each network we used four key indicators: number of parameters (Params), number of floating-point operations (FLOPs), CPU inference time (CPU-T), and total cost and the effective acceleration rate under $2\times$ (Cost- $2\times$ , $E_{N=2}$ ) and $4\times$ (Cost- $4\times$ , $E_{N=4}$ ) undersampling. Here, the performance of CPU-T is evaluated by conducting 1000 forward processing runs and calculating the average processing time on an Intel(R) Xeon(R) Gold 6248R CPU. These indicators were selected to provide a comprehensive assessment of the network’s resource utilization in terms of its computational complexity, memory usage, and inference speed. Through our evaluation of these indicators, we were able to gain insights into the efficiency and effectiveness of each network.

Our evaluation results are presented in Table 3. Compared with SwinMR and CS methods, RUNet and ResUNet exhibit the shortest CPU running time and the highest effective acceleration rate among all the evaluated networks. These findings suggest that RUNet and ResUNet may be particularly well-suited for resource constrained applications that require low-latency and high-throughput processing. Furthermore, in conjunction with our evaluation of temperature indicators, our results show that the addition of the ResUNet+all network yields a remarkably high cost-effectiveness ratio. This suggests that the proposed optimizing modules may serve as a valuable augmentation technique for improving the performance and efficiency of the UNet network, especially in applications where resource utilization is a critical consideration.

Table 3: Resource utilization metrics of different models under different under-sampling rates

Methods	Metrics
Methods	Params(M)	FLOPs(G)	CPU-T(s)	Cost- $2\times$ (s)	$E_{N=2}$	Cost- $4\times$ (s)	$E_{N=4}$
ZF	-	-	-	0.768	2.0	0.384	4.0
CS	-	-	22.653	23.421	0.1	23.037	0.0
CasNet	0.10	4.36	0.0431	0.811	1.9	0.427	3.6
CUNet	3.87	1.68	0.0976	0.866	1.8	0.482	3.2
SwinMR	11.45	105.74	0.4370	1.205	1.3	0.821	1.9
RUNet-Tea	123.70	26.67	0.1112	0.879	1.7	0.495	3.1
ResUNet-Tea	32.83	51.61	0.0687	0.837	1.8	0.453	3.4
RUNet	7.74	1.69	0.0282	0.796	1.9	0.412	3.7
ResUNet	2.06	3.23	0.0277	0.796	1.9	0.412	3.7

4.4 Long sequence sample study

Applying rapid temperature measurement to practical devices improves the temporal resolution of the temperature measurement process. This improvement is reflected in an increased number of frames when the HIFU heating power and ablation duration remain constant. To examine the potential impact of changes in temporal resolution on temperature measurement, we analyzed temperature image samples from simulated long-sequence data of a phantom model with improved temporal resolution.

In this study we selected 32-frame long-sequence samples from the test set of the phantom model and extracted every other frame to simulate a sequence of 16 frames representing full sampling conditions for a given heating duration. The remaining 32 frames simulated a sequence obtained under $2\times$ undersampling. Using ResUNet+all, we recorded temperatures of the $3\times 3$ pixel block at the center of the original reconstructed temperature images.

Temperature-time curves were plotted and analyzed using Bland-Altman and linear regression methods. The calculation of time on the horizontal axis followed the same method as Cost- $N\times$ . Additionally, we incorporated the inference time of the model into the plotting of the temperature-time curves, where a longer inference time resulted in a rightward shift compared to the fully sampled temperature curve. A larger shift indicated a greater impact on temporal resolution improvement, reflecting a poor performance, while a smaller shift indicated a better performance.

The results, presented in Figure 6, demonstrate that the reconstructed focus closely aligns with the temperature-time curve of the fully sampled images. There is no significant deviation observed in the curve, suggesting that the model exhibits strong real-time inference capability. Most of the data points fall within the 95% confidence interval, with upper and lower limits within $\pm 3$ ℃, indicating a strong linear relationship between the reconstructed and fully-sampled temperature maps, suggesting that the reconstructed temperature map is highly consistent with the fully-sampled temperature map.

5 Discussion

In this article, we present an introduction to the application of deep learning methods for rapid magnetic resonance thermometry. Previously, the fast readout patterns have been studied for temperature measurements, such as spiral and radial strategies, EPI sequence[6, 7, 8]. The deep learning-based rapid reconstruction here was appropriate for them. In the case of under-sampling, these sequences can achieve a higher temporal resolution, combining our proposed methods. Theoretically, our approach can be applied by retraining whenever the acceleration is realized via undersampling. Specifically, we focus on enhancing the effective acceleration rate and improving model performance within a short inference time. To achieve this, we propose a series of model-agnostic techniques and validate the effectiveness of each module through experimental verification. The assembly of under-sampling and deep learning reconstruction for fast temperature measurements has quite a few benefits. For example, Undersampling means fewer phase encoding and less B0 field drift[39]. Without considering the loss of signal-to-noise ratio (SNR), the measured temperature should be more accurate. On the subject of motion-induced artifacts, our proposed deep-learning reconstruction module can also be improved to make it insensitive to respiration and other movement by modifying the neural network module. Once the fast thermometry can be realized, the volumetric temperature monitoring covering the whole focal area will be easier.

In our experiments, we employed only a maximum undersampling rate of $4\times$ , instead of the up to $10\times$ rate used by fastMRI. This choice was influenced by the limitations of the image resolution and signal-to-noise ratio in our acquired images. The number of phase encodings in k-space was also relatively low, and the signal was not highly concentrated at the center. These factors resulted in significant signal loss when using excessively high undersampling rates, making reconstruction difficult. Interestingly, the compromised resolution and signal-to-noise ratio in the acquired dataset are flaws due to equipment design, which prioritized faster temperature mapping at the expense of spatial resolution; this highlights the importance of rapid thermometry. With the same temperature mapping speed, higher spatial resolution and signal-to-noise ratio can be achieved, leading to higher-quality images. Consequently, these higher-quality images can then be subjected to higher undersampling rates.

To address this, we conducted an experiment using a phantom dataset and performed interpolation to simulate the acquisition of high-resolution images. Subsequently, we applied undersampling rates of $6\times$ , $8\times$ and $10\times$ to these images and compared the results with those obtained from the original resolution images. The obtained RMSE values for the temperature map patches were 0.610℃, 0.704℃, and 0.724℃, respectively. These values closely align with the results obtained for $2\times$ and $4\times$ resolutions in a $96\times 96$ format.

Furthermore, our study only utilized phantom and ex vivo tissue datasets. The temperature distribution in actual human tissue is more complex. Therefore, in the future, we plan to conduct testing and research on live animal models and specific human tissue datasets to further investigate and validate our findings.

6 Conclusion

This paper presents the first formal investigation into the application of deep learning methods for magnetic resonance temperature measurement. To the best of our knowledge, this is the first comprehensive study to explore the use of deep learning methods for this purpose. We have made our code publicly available and have proposed the use of four optimizing modules to enhance model performance without increasing the number of parameters or computational complexity. We compared various existing MR re-construction models and demonstrated the effectiveness of our proposed method, as well as its resource-saving characteristics. We hope that our research will serve as inspiration for further investigations related to MRI temperature measurement. Moving forward, we plan to explore end-to-end approaches that incorporate temporal information, as well as investigate the feasibility of adopting reference-free imaging techniques for rapid temperature measurement.

References

[1] Kullervo Hynynen. Mri-guided focused ultrasound treatments. Ultrasonics, 50(2):221–229, 2010.
[2] Jing Yuan, Chang-Sheng Mei, Lawrence P Panych, Nathan J McDannold, and Bruno Madore. Towards fast and accurate temperature mapping with proton resonance frequency-based mr thermometry. Quantitative imaging in medicine and surgery, 2(1):21, 2012.
[3] Zhipeng Cao, John C Gore, and William A Grissom. Low-rank plus sparse compressed sensing for accelerated proton resonance frequency shift mr temperature imaging. Magnetic resonance in medicine, 81(6):3555–3566, 2019.
[4] Efrat Shimron, William Grissom, and Haim Azhari. Temporal differences (TED) compressed sensing: A method for fast MRgHIFU temperature imaging. NMR in Biomedicine, 33(9):e4352, 2020.
[5] Chang-Sheng Mei, Lawrence P. Panych, Jing Yuan, Nathan J. McDannold, Lisa H. Treat, Yun Jing, and Bruno Madore. Combining two-dimensional spatially selective RF excitation, parallel imaging, and UNFOLD for accelerated MR thermometry imaging. Magnetic Resonance in Medicine, 66(1):112–122, 2011.
[6] Pooja Gaur and William A. Grissom. Accelerated MRI thermometry by direct estimation of temperature from undersampled k-space data. Magnetic Resonance in Medicine, 73(5):1914–1925, 2015.
[7] Kisoo Kim, Chris Diederich, Kazim Narsinh, and Eugene Ozhinsky. Motion-robust, multi-slice, real-time mr thermometry for mr-guided thermal therapy in abdominal organs. International Journal of Hyperthermia, 40(1):2151649, 2023.
[8] Henrik Odéen, Nick Todd, Mahamadou Diakite, Emilee Minalga, Allison Payne, and Dennis L Parker. Sampling strategies for subsampled segmented epi prf thermometry in mr guided high intensity focused ultrasound. Medical physics, 41(9):092301, 2014.
[9] Andrew B Holbrook, Juan M Santos, Elena Kaye, Viola Rieke, and Kim Butts Pauly. Real-time mr thermometry for monitoring hifu ablations of the liver. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine, 63(2):365–373, 2010.
[10] Shenyan Zong, Guofeng Shen, Chang-Sheng Mei, and Bruno Madore. Improved prf-based mr thermometry using k-space energy spectrum analysis. Magnetic resonance in medicine, 84(6):3325–3332, 2020.
[11] Chang-Sheng Mei, Renxin Chu, W Scott Hoge, Lawrence P Panych, and Bruno Madore. Accurate field mapping in the presence of b0 inhomogeneities, applied to mr thermometry. Magnetic resonance in medicine, 73(6):2142–2151, 2015.
[12] Jure Zbontar, Florian Knoll, Anuroop Sriram, Tullie Murrell, Zhengnan Huang, Matthew J. Muckley, Aaron Defazio, Ruben Stern, Patricia Johnson, Mary Bruno, Marc Parente, Krzysztof J. Geras, Joe Katsnelson, Hersh Chandarana, Zizhao Zhang, Michal Drozdzal, Adriana Romero, Michael Rabbat, Pascal Vincent, Nafissa Yakubova, James Pinkerton, Duo Wang, Erich Owens, C. Lawrence Zitnick, Michael P. Recht, Daniel K. Sodickson, and Yvonne W. Lui. fastMRI: An Open Dataset and Benchmarks for Accelerated MRI, December 2019.
[13] Florian Knoll, Jure Zbontar, Anuroop Sriram, Matthew J. Muckley, Mary Bruno, Aaron Defazio, Marc Parente, Krzysztof J. Geras, Joe Katsnelson, Hersh Chandarana, Zizhao Zhang, Michal Drozdzalv, Adriana Romero, Michael Rabbat, Pascal Vincent, James Pinkerton, Duo Wang, Nafissa Yakubova, Erich Owens, C. Lawrence Zitnick, Michael P. Recht, Daniel K. Sodickson, and Yvonne W. Lui. fastMRI: A Publicly Available Raw k-Space and DICOM Dataset of Knee Images for Accelerated MR Image Reconstruction Using Machine Learning. Radiology. Artificial Intelligence, 2(1):e190007, January 2020.
[14] Matthew J. Muckley, Bruno Riemenschneider, Alireza Radmanesh, Sunwoo Kim, Geunu Jeong, Jingyu Ko, Yohan Jun, Hyungseob Shin, Dosik Hwang, Mahmoud Mostapha, Simon Arberet, Dominik Nickel, Zaccharie Ramzi, Philippe Ciuciu, Jean-Luc Starck, Jonas Teuwen, Dimitrios Karkalousos, Chaoping Zhang, Anuroop Sriram, Zhengnan Huang, Nafissa Yakubova, Yvonne W. Lui, and Florian Knoll. Results of the 2020 fastMRI Challenge for Machine Learning MR Image Reconstruction. IEEE Transactions on Medical Imaging, 40(9):2306–2317, September 2021.
[15] Arghya Pal and Yogesh Rathi. A review and experimental evaluation of deep learning methods for MRI reconstruction. arXiv:2109.08618 [cs, eess], March 2022.
[16] Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, June 2021.
[17] Kang Lin and Reinhard Heckel. Vision Transformers Enable Fast and Robust Accelerated MRI. In Medical Imaging with Deep Learning, June 2022.
[18] Pengfei Guo, Yiqun Mei, Jinyuan Zhou, Shanshan Jiang, and Vishal M. Patel. ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer, January 2022.
[19] Jiahao Huang, Yingying Fang, Yinzhe Wu, Huanjun Wu, Zhifan Gao, Yang Li, Javier Del Ser, Jun Xia, and Guang Yang. Swin transformer for fast MRI. Neurocomputing, 493:281–304, July 2022.
[20] John De Poorter, Carlos De Wagter, Yves De Deene, Carsten Thomsen, Freddy Ståhlberg, and Eric Achten. Noninvasive MRI Thermometry with the Proton Resonance Frequency (PRF) Method: In Vivo Results in Human Muscle. Magnetic Resonance in Medicine, 33(1):74–81, 1995.
[21] Kim Jong-Min, Jeong You-Jin, Cheong Han-Jae, Yoo Jae-Won, Kim Jeong-Hee, and Lee Chulhyun. Real-Time T1/PRF-Based MR Thermometry Using Deep Learning and VFA-mFFE for Guidance of HIFU Treatment. https://cds.ismrm.org/protected/19MPresentations/abstracts/0970.html, 2019.
[22] Fahad Shamshad, S. Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, F. Khan, and H. Fu. Transformers in Medical Imaging: A Survey. undefined, 2022.
[23] Jo Schlemper, Jose Caballero, Joseph V. Hajnal, Anthony Price, and Daniel Rueckert. A Deep Cascade of Convolutional Neural Networks for MR Image Reconstruction. In Marc Niethammer, Martin Styner, Stephen Aylward, Hongtu Zhu, Ipek Oguz, Pew-Thian Yap, and Dinggang Shen, editors, Information Processing in Medical Imaging, Lecture Notes in Computer Science, pages 647–658, Cham, 2017. Springer International Publishing.
[24] Muneer Ahmad Dedmari, Sailesh Conjeti, Santiago Estrada, Phillip Ehses, Tony Stöcker, and Martin Reuter. Complex Fully Convolutional Neural Networks for MR Image Reconstruction. In Florian Knoll, Andreas Maier, and Daniel Rueckert, editors, Machine Learning for Medical Image Reconstruction, Lecture Notes in Computer Science, pages 30–38, Cham, 2018. Springer International Publishing.
[25] Mevan Ekanayake, Kamlesh Pawar, Mehrtash Harandi, Gary Egan, and Zhaolin Chen. Multi-head Cascaded Swin Transformers with Attention to k-space Sampling Pattern for Accelerated MRI Reconstruction, July 2022.
[26] Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows, August 2021.
[27] Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Nassir Navab, Joachim Hornegger, William M. Wells, and Alejandro F. Frangi, editors, Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, volume 9351 of Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, pages 234–241. Springer International Publishing, Cham, 2015.
[28] Aghiles Kebaili, Jérôme Lapuyade-Lahorgue, and Su Ruan. Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review. Journal of Imaging, 9(4):81, April 2023.
[29] Brandon Trabucco, Kyle Doherty, Max Gurinas, and Ruslan Salakhutdinov. Effective Data Augmentation With Diffusion Models, May 2023.
[30] Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising Diffusion Probabilistic Models, 2020.
[31] Chiheb Trabelsi, Olexa Bilaniuk, Ying Zhang, Dmitriy Serdyuk, Sandeep Subramanian, João Felipe Santos, Soroush Mehri, Negar Rostamzadeh, Yoshua Bengio, and Christopher J Pal. Deep Complex Networks. page 19, 2018.
[32] Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. Distilling the Knowledge in a Neural Network, March 2015.
[33] Balamurali Murugesan, Sricharan Vijayarangan, Kaushik Sarveswaran, Keerthi Ram, and Mohanasankar Sivaprakasam. KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow. In Proceedings of the Third Conference on Medical Imaging with Deep Learning, pages 515–526. PMLR, September 2020.
[34] Yuanyuan Tan and Jun Lyu. Semi-supervised Distillation Learning Based on Swin Transformer for MRI Reconstruction. In Shiqi Yu, Zhaoxiang Zhang, Pong C. Yuen, Junwei Han, Tieniu Tan, Yike Guo, Jianhuang Lai, and Jianguo Zhang, editors, Pattern Recognition and Computer Vision, Lecture Notes in Computer Science, pages 67–76, Cham, 2022. Springer Nature Switzerland.
[35] Elizabeth Cole, Joseph Cheng, John Pauly, and Shreyas Vasanawala. Analysis of deep complex-valued convolutional neural networks for MRI reconstruction and phase-focused applications. Magnetic Resonance in Medicine, 86(2):1093–1109, 2021.
[36] Dongwook Lee, Jaejun Yoo, Sungho Tak, and Jong Chul Ye. Deep Residual Learning for Accelerated MRI Using Magnitude and Phase Networks. IEEE Transactions on Biomedical Engineering, 65(9):1985–1995, September 2018.
[37] Jingshu Zhang, Mark D. Plumbley, and Wenwu Wang. Weighted Magnitude-Phase Loss for Speech Dereverberation. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 5794–5798, June 2021.
[38] G. E. Spoorthi, Rama Krishna Sai Subrahmanyam Gorthi, and Subrahmanyam Gorthi. PhaseNet 2.0: Phase Unwrapping of Noisy Data Based on Deep Learning Approach. IEEE Transactions on Image Processing, 29:4862–4872, 2020.
[39] Yanfei Wang, Yan Kang, Jinzhu Yang, and Yanfa He. A method to correct magnetic resonance imaging magnetic field drift. Journal of Medical Imaging and Health Informatics, 8(7):1519–1525, 2018.