Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI

m.j.ehrhardt. m.mauritz
(Date: January 2024)
Abstract.

Reconstructing high-quality images from undersampled dynamic MRI data is a challenging task and important for the success of this imaging modality. To remedy the naturally occurring artifacts due to measurement undersampling, one can incorporate a motion model into the reconstruction so that information can propagate across time frames. Current models for MRI imaging are using the optical flow equation. However, they are based on real-valued images. Here, we generalise the optical flow equation to complex-valued images and demonstrate, based on two real cardiac MRI datasets, that the new model is capable of improving image quality.

1. Introduction

Magnetic resonance imaging (MRI) is a powerful medical imaging modality that allows for noninvasive visualisations of the human body. In particular, for dynamic MRI, there is a variety of applications, e.g.  cardiac imaging [25, 2], dynamic contrast-enhanced magnetic resonance imaging [13], [9] and flow imaging [24]. However, the imaging times in MRI are inherently slow, leading to motion artifacts that degrade image quality. This issue can be addressed from a software perspective where one aims for acquisition speedups by decreasing the number of measurements and using mathematical modelling to improve image quality. In this article, we follow this approach and solve the ill-posed inverse problem Atρt=ytsubscript𝐴𝑡subscript𝜌𝑡subscript𝑦𝑡A_{t}\rho_{t}=y_{t}italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (the index t𝑡titalic_t denotes the t𝑡titalic_t-th motion state of the considered subject; the MRI forward operator Atsubscript𝐴𝑡A_{t}italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT includes subsampling of measurements and hence may depend on t𝑡titalic_t) of undersampled dynamic MRI by considering the variational problem

minρ,vtAtρtyt2+(ρ,v),subscript𝜌𝑣subscript𝑡superscriptnormsubscript𝐴𝑡subscript𝜌𝑡subscript𝑦𝑡2𝜌𝑣\displaystyle\min_{\rho,v}\sum_{t}\|A_{t}\rho_{t}-y_{t}\|^{2}+\mathcal{R}(\rho% ,v),roman_min start_POSTSUBSCRIPT italic_ρ , italic_v end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + caligraphic_R ( italic_ρ , italic_v ) ,

where \mathcal{R}caligraphic_R is a regularisation term that incorporates prior knowledge about the images into the reconstruction. More precisely, we aim to jointly reconstruct the temporal image sequence ρ𝜌\rhoitalic_ρ and the velocity field v𝑣vitalic_v associated with the temporal evolution. To this end, we model the underlying motion using the optical flow equation [6]. This was already done for MRI and real-valued images [1]. Here, we generalise this idea by explicitly taking into account that images are complex-valued, i.e.  we formulate an optical flow equation for complex-valued images. This is important since omitting the phase may lead to reconstruction artifacts. This problem can be seen in fig. 1. The figure shows reconstructions from fully sampled MRI measurements. The images on the right show a nonnegative real-valued reconstruction where some structures, marked by red boxes, of the complex-valued reconstructions on the left are missing. The top row shows the magnitude of the reconstructions, while the bottom row depicts complex values: The hue represents the normalised phase and the brightness represents the magnitude. In particular, one can see that structures where the phase is different are missing in the real-valued images.

1.1. Related work

Dynamic inverse problems in general have been studied extensively. An overview of the topic can be found in [14, 23]. The particular topic of undersampled (dynamic) MRI has also been widely studied and many different approaches have been proposed to enhance reconstruction quality. Many works leveraged compressed sensing making use of the fact that MRI images can be sparsely represented in suitable domains [17, 16, 10]. In dynamic imaging, the temporal relation between motion states should be taken into account to improve the results. One approach is to decompose the (temporal) sequence of images into a low-rank part (background between time frames) and a sparse part (modelling dynamics between frames) [12, 18] . This way, the image sequence is represented suitably for the undersampled MRI problem. Another approach is taken in [26], where the motion between different motion states is estimated based on the frame-wise undersampled reconstruction of each state. Subsequently, this motion information is used to get a final reconstruction incorporating all motion states into a single reconstruction. Joint reconstruction and motion estimation approaches are also promising. This was investigated in [1, 29] where the underlying motion was modelled using (locally affine) optical flow. Besides classical image reconstruction, learning-based approaches have been considered in dynamic MRI. Those aim at learning spatiotemporal dependencies within the image sequence, and/or mimic the iterative nature of classical reconstruction algorithms [21, 20, 28].

Besides MRI, dynamic inverse problems arise in other fields of imaging. For instance, in dynamic PET imaging the continuity equation was used as the motion model to describe temporal changes between time points [22]. In tomography, an approach based on a template and its temporal deformation was taken [7].

When it comes to working with complex-valued images, two approaches are conceivable: First, one can parameterise the images using real and imaginary part. This is the standard approach that is basically always followed. However, in MRI imaging often the magnitude of the image is of most interest and one would like to pose prior assumptions on the magnitude. This is difficult to incorporate using real and imaginary part because calculating the magnitude is a nonlinear operation, hence the overall regularisation would involve a nonlinear operator. Therefore, one can work with a magnitude and phase parameterisation of the complex-valued images. This makes regularising the magnitude easier but comes at the cost that now the MRI forward operator is nonlinear. This was considered in [11, 27].

2. Methods

2.1. MRI modelling

Solving the inverse problem in MRI imaging in its simplest form leads to inverting the discrete Fourier transform. Given enough measurements (i.e.  enough coverage of Fourier space/k-space) this problem is well-posed by the stable invertibility of the Fourier transform. In practice, however, multiple receiver coils (here are the measurements detected) and undersampling lead to an ill-posed inverse problem. The MRI forward operator maps a complex-valued image ρtsubscript𝜌𝑡\rho_{t}italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (which in our case is a time slice of a temporal sequence of images) to some complex space Ytsubscript𝑌𝑡Y_{t}italic_Y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, where Ytsubscript𝑌𝑡Y_{t}italic_Y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT depends on the undersampling factor and the number of receiver coils. For instance, in a continuous setting one could choose ρLp((0,T);BV(Ω))𝜌superscript𝐿𝑝0𝑇𝐵𝑉Ω\rho\in L^{p}((0,T);BV(\Omega))italic_ρ ∈ italic_L start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ( ( 0 , italic_T ) ; italic_B italic_V ( roman_Ω ) ), hence ρtBV(Ω)subscript𝜌𝑡𝐵𝑉Ω\rho_{t}\in BV(\Omega)italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_B italic_V ( roman_Ω ), or similar spaces with more regularity [6]. In a discrete setting (which will be considered in this article) we take ρX=Nt×Nx×Ny𝜌𝑋superscriptsubscript𝑁𝑡subscript𝑁𝑥subscript𝑁𝑦\rho\in X=\mathbb{C}^{N_{t}\times N_{x}\times N_{y}}italic_ρ ∈ italic_X = blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, where Ntsubscript𝑁𝑡N_{t}italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, Nxsubscript𝑁𝑥N_{x}italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT and Nysubscript𝑁𝑦N_{y}italic_N start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT denote the number of dimensions in the temporal, x𝑥xitalic_x and y𝑦yitalic_y direction, respectively. Therefore, it is ρtXt=Nx×Nysubscript𝜌𝑡subscript𝑋𝑡superscriptsubscript𝑁𝑥subscript𝑁𝑦\rho_{t}\in X_{t}=\mathbb{C}^{N_{x}\times N_{y}}italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT for each time slice. We next introduce the forward operator describing the measurements. Denoting by Ncsubscript𝑁𝑐N_{c}italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT the number of receiver coils, we have Yt=Nc×Nk,tsubscript𝑌𝑡superscriptsubscript𝑁𝑐subscript𝑁𝑘𝑡Y_{t}=\mathbb{C}^{N_{c}\times N_{k,t}}italic_Y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT italic_k , italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, t=1,,Nt𝑡1subscript𝑁𝑡t=1,\ldots,N_{t}italic_t = 1 , … , italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, as the measurement space. Here, Nk,tsubscript𝑁𝑘𝑡N_{k,t}italic_N start_POSTSUBSCRIPT italic_k , italic_t end_POSTSUBSCRIPT denotes the number of measured k-space points at time t𝑡titalic_t after the undersampling has been applied. With the Fourier operator F𝐹Fitalic_F, the coil sensitivity operators Ct,isubscript𝐶𝑡𝑖C_{t,i}italic_C start_POSTSUBSCRIPT italic_t , italic_i end_POSTSUBSCRIPT and the undersampling operators Stsubscript𝑆𝑡S_{t}italic_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (see section 4.4 for details) we can find for the i𝑖iitalic_i-th component (i.e.  coil) of the MRI forward operator Atsubscript𝐴𝑡A_{t}italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT at time point t𝑡titalic_t

(1) At:XtYt,(Atρt)i=StFCt,iρt.:subscript𝐴𝑡formulae-sequencesubscript𝑋𝑡subscript𝑌𝑡subscriptsubscript𝐴𝑡subscript𝜌𝑡𝑖subscript𝑆𝑡𝐹subscript𝐶𝑡𝑖subscript𝜌𝑡\displaystyle A_{t}\colon X_{t}\to Y_{t},\quad(A_{t}\rho_{t})_{i}=S_{t}FC_{t,i% }\rho_{t}.italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT → italic_Y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , ( italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_F italic_C start_POSTSUBSCRIPT italic_t , italic_i end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

Hence, we model MRI measurements as

yt=Atρt+εtsubscript𝑦𝑡subscript𝐴𝑡subscriptsuperscript𝜌𝑡subscript𝜀𝑡\displaystyle y_{t}=A_{t}\rho^{\dagger}_{t}+\varepsilon_{t}italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT

where ρtXtsubscriptsuperscript𝜌𝑡subscript𝑋𝑡\rho^{\dagger}_{t}\in X_{t}italic_ρ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the ground truth image that we want to reconstruct and εtsubscript𝜀𝑡\varepsilon_{t}italic_ε start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is some additive noise.

Refer to caption
Figure 1. Comparison between real-valued and complex-valued reconstructions in MRI. Left: Complex-valued reconstruction. Right: Real-valued reconstruction. Top row: Magnitude representation. The bottom row shows the same images but as a complex-valued image: The hue represents the normalised phase and the brightness represents the magnitude. The red boxes point to some missing structures in the real-valued image.

2.2. Modelling motion for complex-valued images

In this work, we use the optical flow equation as a motion model to regularise the dynamic inverse problem. The optical flow equation is often used in computer vision and has proven valuable for 2D images [1, 5, 6]. Its defining property is constant image intensity along a trajectory with ddtρ=vdd𝑡𝜌𝑣\frac{\mathrm{d}}{\mathrm{d}t}\rho=vdivide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_ρ = italic_v [14]. Thus, the underlying PDE reads

tρ+v,ρ=0subscript𝑡𝜌subscript𝑣𝜌0\displaystyle\partial_{t}\rho+\langle v,\nabla\rho\rangle_{\mathbb{R}}=0∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ + ⟨ italic_v , ∇ italic_ρ ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT = 0

and during the reconstruction, both the image ρ𝜌\rhoitalic_ρ as well as the velocity field v𝑣vitalic_v have to be reconstructed. Note, that since velocity and density are multiplied by each other, using the optical flow equation makes the problem nonconvex. To be able to apply the optical flow equation in MRI imaging (where complex-valued images are reconstructed), we propose to generalise this equation by allowing both the image ρ𝜌\rhoitalic_ρ and the velocity v𝑣vitalic_v to be complex-valued. The inner product is therefore replaced by the standard inner product for complex vector spaces. This leads us to the model

(2) 0=t(ρ1+iρ2)+v1+iv2,(ρ1+iρ2)0subscript𝑡superscript𝜌1𝑖superscript𝜌2subscriptsuperscript𝑣1𝑖superscript𝑣2superscript𝜌1𝑖superscript𝜌2\displaystyle 0=\partial_{t}(\rho^{1}+i\rho^{2})+\langle v^{1}+iv^{2},\nabla(% \rho^{1}+i\rho^{2})\rangle_{\mathbb{C}}0 = ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + ⟨ italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∇ ( italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ⟩ start_POSTSUBSCRIPT blackboard_C end_POSTSUBSCRIPT
{0=tρ1+v1,ρ1+v2,ρ20=tρ2+v2,ρ1v1,ρ2.absentcases0subscript𝑡superscript𝜌1subscriptsuperscript𝑣1superscript𝜌1subscriptsuperscript𝑣2superscript𝜌2otherwise0subscript𝑡superscript𝜌2subscriptsuperscript𝑣2superscript𝜌1subscriptsuperscript𝑣1superscript𝜌2otherwise\displaystyle\Longleftrightarrow\begin{cases}0=\partial_{t}\rho^{1}+\langle v^% {1},\nabla\rho^{1}\rangle_{\mathbb{R}}+\langle v^{2},\nabla\rho^{2}\rangle_{% \mathbb{R}}\\ 0=\partial_{t}\rho^{2}+\langle v^{2},\nabla\rho^{1}\rangle_{\mathbb{R}}-% \langle v^{1},\nabla\rho^{2}\rangle_{\mathbb{R}}.\end{cases}⟺ { start_ROW start_CELL 0 = ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + ⟨ italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT + ⟨ italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 = ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ⟨ italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT - ⟨ italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT . end_CELL start_CELL end_CELL end_ROW

for ρ=ρ1+iρ2X𝜌superscript𝜌1𝑖superscript𝜌2𝑋\rho=\rho^{1}+i\rho^{2}\in Xitalic_ρ = italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∈ italic_X and v=v1+iv2V2𝑣superscript𝑣1𝑖superscript𝑣2superscript𝑉2v=v^{1}+iv^{2}\in V^{2}italic_v = italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∈ italic_V start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (note, that the velocity is a 2D vector). Here, V𝑉Vitalic_V is a suitable space for the velocities such that the derivatives needed for the priors are well-defined. In the discrete setting, we may choose V=X𝑉𝑋V=Xitalic_V = italic_X. Associated with the above two equations we introduce the optical flow operator

(3) M(ρ,v)=(tρ1+v1,ρ1+v2,ρ2tρ2+v2,ρ1v1,ρ2).𝑀𝜌𝑣matrixsubscript𝑡superscript𝜌1subscriptsuperscript𝑣1superscript𝜌1subscriptsuperscript𝑣2superscript𝜌2subscript𝑡superscript𝜌2subscriptsuperscript𝑣2superscript𝜌1subscriptsuperscript𝑣1superscript𝜌2\displaystyle M(\rho,v)=\begin{pmatrix}\partial_{t}\rho^{1}+\langle v^{1},% \nabla\rho^{1}\rangle_{\mathbb{R}}+\langle v^{2},\nabla\rho^{2}\rangle_{% \mathbb{R}}\\ \partial_{t}\rho^{2}+\langle v^{2},\nabla\rho^{1}\rangle_{\mathbb{R}}-\langle v% ^{1},\nabla\rho^{2}\rangle_{\mathbb{R}}\end{pmatrix}.italic_M ( italic_ρ , italic_v ) = ( start_ARG start_ROW start_CELL ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + ⟨ italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT + ⟨ italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ⟨ italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT - ⟨ italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) .

For our discrete space X=Nt×Nx×Ny𝑋superscriptsubscript𝑁𝑡subscript𝑁𝑥subscript𝑁𝑦X=\mathbb{C}^{N_{t}\times N_{x}\times N_{y}}italic_X = blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT the temporal derivative t:XX:subscript𝑡𝑋𝑋\partial_{t}\colon X\to X∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : italic_X → italic_X and the inner product ,:X×XX:subscript𝑋𝑋𝑋\langle\cdot,\cdot\rangle_{\mathbb{R}}\colon X\times X\to X⟨ ⋅ , ⋅ ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT : italic_X × italic_X → italic_X are to be understood pointwise.

We use the same discrete versions of the differential operators as in [6]. This means we use forward differences in time and central differences in space with Neumann boundary conditions for the differential operators involved in the optical flow equation. For the differential operators of other parts of the optimization function we use forward differences with Neumann boundary conditions (see [6] for more details).

Remark (Choices on optical flow model).

There are other possible generalisations of the optical flow equation to complex-valued images. For instance, one could introduce separate velocities for real and imaginary part, hence decoupling both components. Also, a single velocity for both the real and imaginary part would be an option. Finally, only one of the two parts could be regularised by the optical flow equation and the other one evolves freely. However, these variants do not explicitly model interchange between real and imaginary part. To be able to model such an interchange, we used the above approach.

Remark (Alternative choice for motion model).

An alternative motion model would be to use the continuity equation in density and momentum variables instead of density and velocity variables (see, for example, [4]). This way, the motion model does not incorporate a nonconvexity into the optimisation problem. Unfortunately, finding meaningful regularisers for the momentum variables is more challenging than for the velocities. For instance, an object moving along the x𝑥xitalic_x axis with constant velocity will have a zero gradient of the corresponding velocities. The momentum, however, is nonconstant as it also changes when the mass of the underlying object changes. Therefore, we decided to use the optical flow equation.

3. Optimisation and Implementation

Based on [1, 6] we minimise

(4) F(ρ,v)=tAtρtyt2+α11(ρ)+α22(v)+α3𝐹𝜌𝑣subscript𝑡superscriptnormsubscript𝐴𝑡subscript𝜌𝑡subscript𝑦𝑡2subscript𝛼1subscript1𝜌subscript𝛼2subscript2𝑣subscript𝛼3\displaystyle F(\rho,v)=\sum_{t}||A_{t}\rho_{t}-y_{t}||^{2}+\alpha_{1}\mathcal% {R}_{1}(\rho)+\alpha_{2}\mathcal{R}_{2}(v)+\alpha_{3}italic_F ( italic_ρ , italic_v ) = ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ ) + italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_v ) + italic_α start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT 3(M(ρ,v))subscript3𝑀𝜌𝑣\displaystyle\mathcal{R}_{3}(M(\rho,v))caligraphic_R start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_M ( italic_ρ , italic_v ) )

with respect to ρ=ρ1+iρ2,v=(vx1,vy1)+i(vx2,vy2)formulae-sequence𝜌superscript𝜌1𝑖superscript𝜌2𝑣superscriptsubscript𝑣𝑥1superscriptsubscript𝑣𝑦1𝑖superscriptsubscript𝑣𝑥2superscriptsubscript𝑣𝑦2\rho=\rho^{1}+i\rho^{2},\ v=(v_{x}^{1},v_{y}^{1})+i(v_{x}^{2},v_{y}^{2})italic_ρ = italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_v = ( italic_v start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_v start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) + italic_i ( italic_v start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_v start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). Here, ytsubscript𝑦𝑡y_{t}italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the measured MRI signal at time point t𝑡titalic_t, and Atsubscript𝐴𝑡A_{t}italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and M𝑀Mitalic_M are operators related to the MRI measurements and the optical flow equation, respectively (see (1) and (3)). The regularisers for the variables ρ𝜌\rhoitalic_ρ and v𝑣vitalic_v are chosen to promote smoothness (the “level of smoothness” can be controlled via the εisubscript𝜀𝑖\varepsilon_{i}italic_ε start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT parameters)

1(ρ)=ε1(ρ1)+ε1(ρ2)subscript1𝜌subscriptsubscript𝜀1superscript𝜌1subscriptsubscript𝜀1superscript𝜌2\displaystyle\mathcal{R}_{1}(\rho)=\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{1% })+\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{2})caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) + caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )
2(v)=ε2(vx1)+ε2(vy1)+ε2(vx2)+ε2(vy2),subscript2𝑣subscriptsubscript𝜀2subscriptsuperscript𝑣1𝑥subscriptsubscript𝜀2subscriptsuperscript𝑣1𝑦subscriptsubscript𝜀2subscriptsuperscript𝑣2𝑥subscriptsubscript𝜀2subscriptsuperscript𝑣2𝑦\displaystyle\mathcal{R}_{2}(v)=\mathcal{H}_{\varepsilon_{2}}(\nabla v^{1}_{x}% )+\mathcal{H}_{\varepsilon_{2}}(\nabla v^{1}_{y})+\mathcal{H}_{\varepsilon_{2}% }(\nabla v^{2}_{x})+\mathcal{H}_{\varepsilon_{2}}(\nabla v^{2}_{y}),caligraphic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_v ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) + caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ) + caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) + caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ) ,

where \nabla denotes the spatial gradient and εsubscript𝜀\mathcal{H}_{\varepsilon}caligraphic_H start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT is the Huber loss with parameter ε𝜀\varepsilonitalic_ε. Defining for x2𝑥superscript2x\in\mathbb{R}^{2}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT

hε(x)={x222εif x2ε,x2ε2else,subscript𝜀𝑥casessubscriptsuperscriptnorm𝑥222𝜀if subscriptnorm𝑥2𝜀subscriptnorm𝑥2𝜀2else\displaystyle h_{\varepsilon}(x)=\begin{cases}\frac{\|x\|^{2}_{2}}{2% \varepsilon}\ \ &\text{if }\|x\|_{2}\leq\varepsilon,\\ \|x\|_{2}-\frac{\varepsilon}{2}\ \ &\text{else},\end{cases}italic_h start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_x ) = { start_ROW start_CELL divide start_ARG ∥ italic_x ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_ε end_ARG end_CELL start_CELL if ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_ε , end_CELL end_ROW start_ROW start_CELL ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - divide start_ARG italic_ε end_ARG start_ARG 2 end_ARG end_CELL start_CELL else , end_CELL end_ROW

we have ε(x)=i=1Nhε(xi)subscript𝜀𝑥superscriptsubscript𝑖1𝑁subscript𝜀subscript𝑥𝑖\mathcal{H}_{\varepsilon}(x)=\sum_{i=1}^{N}h_{\varepsilon}(x_{i})caligraphic_H start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_h start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for x(2)N𝑥superscriptsuperscript2𝑁x\in(\mathbb{R}^{2})^{N}italic_x ∈ ( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT. Finally, we use a soft constraint

3(M(ρ,v))=ε3(M(ρ,v))subscript3𝑀𝜌𝑣subscriptsubscript𝜀3𝑀𝜌𝑣\displaystyle\mathcal{R}_{3}(M(\rho,v))=\mathcal{H}_{\varepsilon_{3}}(M(\rho,v))caligraphic_R start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_M ( italic_ρ , italic_v ) ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_M ( italic_ρ , italic_v ) )

since we cannot expect the optical flow equation to be satisfied perfectly. Compared to the classical L1superscriptL1\mathrm{L}^{1}roman_L start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT loss the Huber loss is differentiable, and the parameter ε𝜀\varepsilonitalic_ε controls the extent of smoothness within the Huber loss, i.e. it controls whether the Huber loss is more similar to the norm 2\|\cdot\|_{2}∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT or the squared norm 22\|\cdot\|_{2}^{2}∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Tuning this parameter allows us to adapt to the considered problem and find a better model.

The considered optimisation problem (4) is nonconvex due to the optical flow equation. To deal with the nonconvexity we follow the same strategy as in [1, 6] and apply a block coordinate descent (see algorithm 2). This means we iteratively optimise the minimisation function only for ρ𝜌\rhoitalic_ρ or v𝑣vitalic_v while keeping the other variable fixed. Each resulting subproblem is solved using the FISTA algorithm [3] (applied to f+g𝑓𝑔f+gitalic_f + italic_g with f𝑓fitalic_f smooth and g=0𝑔0g=0italic_g = 0, see algorithm 1) with early stopping (iterations where stopped if the relative L2superscriptL2\mathrm{L}^{2}roman_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT error was below δ=105𝛿superscript105\delta=10^{-5}italic_δ = 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT). Furthermore, motivated by the coarse-to-fine approach in image registration, we smoothed the density ρ𝜌\rhoitalic_ρ and the velocity v𝑣vitalic_v after each sub-optimisation using a Gaussian smoothing with standard deviation σ/iouter𝜎subscript𝑖outer\sigma/{i_{\mathrm{outer}}}italic_σ / italic_i start_POSTSUBSCRIPT roman_outer end_POSTSUBSCRIPT. Here, σ𝜎\sigmaitalic_σ is again a tunable parameter, and iouter{1,,nouter}subscript𝑖outer1subscript𝑛outeri_{\mathrm{outer}}\in\{1,\ldots,n_{\mathrm{outer}}\}italic_i start_POSTSUBSCRIPT roman_outer end_POSTSUBSCRIPT ∈ { 1 , … , italic_n start_POSTSUBSCRIPT roman_outer end_POSTSUBSCRIPT } denotes the current outer iteration, i.e. the smoothing decays with growing iterations. For the maximal number of iterations we used nouter=200subscript𝑛outer200n_{\mathrm{outer}}=200italic_n start_POSTSUBSCRIPT roman_outer end_POSTSUBSCRIPT = 200, nρ=1400subscript𝑛𝜌1400n_{\rho}=1400italic_n start_POSTSUBSCRIPT italic_ρ end_POSTSUBSCRIPT = 1400 and nv=3200subscript𝑛𝑣3200n_{v}=3200italic_n start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT = 3200. The optimisation algorithm was implemented in Python using SIRF [19] for reading the MR data and applying the associated measurement operators, and CIL [15] to implement the actual optimisation algorithm. The code will be made available upon publication and is attached for the review process.

Algorithm 1 FISTA with early stopping for minimizing smooth function f𝑓fitalic_f
1:Inputs:
2:      smooth function f𝑓fitalic_f with Lipschitz constant L𝐿Litalic_L, initialisation xinitsubscript𝑥initx_{\mathrm{init}}italic_x start_POSTSUBSCRIPT roman_init end_POSTSUBSCRIPT Algorithm parameters n𝑛nitalic_n, δ𝛿\deltaitalic_δ
3:Initialize:
4:      x0xinitsuperscript𝑥0subscript𝑥initx^{0}\leftarrow x_{\mathrm{init}}italic_x start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ← italic_x start_POSTSUBSCRIPT roman_init end_POSTSUBSCRIPT, x^1xinitsuperscript^𝑥1subscript𝑥init\hat{x}^{1}\leftarrow x_{\mathrm{init}}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ← italic_x start_POSTSUBSCRIPT roman_init end_POSTSUBSCRIPT, t11superscript𝑡11t^{1}\leftarrow 1italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ← 1
5:for j=1𝑗1j=1italic_j = 1 to n𝑛nitalic_n do
6:     xjsuperscript𝑥𝑗absentx^{j}\leftarrowitalic_x start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ← x^j1Lf(x^j)superscript^𝑥𝑗1𝐿𝑓superscript^𝑥𝑗\hat{x}^{j}-\frac{1}{L}\nabla f(\hat{x}^{j})over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_L end_ARG ∇ italic_f ( over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT )
7:     tj+11+1+4(tj)22superscript𝑡𝑗1114superscriptsuperscript𝑡𝑗22t^{j+1}\leftarrow\frac{1+\sqrt{1+4(t^{j})^{2}}}{2}italic_t start_POSTSUPERSCRIPT italic_j + 1 end_POSTSUPERSCRIPT ← divide start_ARG 1 + square-root start_ARG 1 + 4 ( italic_t start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG start_ARG 2 end_ARG
8:     x^j+1xj+(tj1tj+1)(xjxj1)superscript^𝑥𝑗1superscript𝑥𝑗superscript𝑡𝑗1superscript𝑡𝑗1superscript𝑥𝑗superscript𝑥𝑗1\hat{x}^{j+1}\leftarrow x^{j}+(\frac{t^{j}-1}{t^{j+1}})(x^{j}-x^{j-1})over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_j + 1 end_POSTSUPERSCRIPT ← italic_x start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + ( divide start_ARG italic_t start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT - 1 end_ARG start_ARG italic_t start_POSTSUPERSCRIPT italic_j + 1 end_POSTSUPERSCRIPT end_ARG ) ( italic_x start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT )
9:     if xjxj12/xj12<δsubscriptnormsuperscript𝑥𝑗superscript𝑥𝑗12subscriptnormsuperscript𝑥𝑗12𝛿\|x^{j}-x^{j-1}\|_{2}/\|x^{j-1}\|_{2}<\delta∥ italic_x start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT / ∥ italic_x start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_δ then
10:         break      
11:return xjsuperscript𝑥𝑗x^{j}italic_x start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT
Algorithm 2 Optimization of F(ρ,v)𝐹𝜌𝑣F(\rho,v)italic_F ( italic_ρ , italic_v )
1:Inputs:
2:      Measurement y𝑦yitalic_y Model parameters α1,α2,α3,ε1,ε2,ε3subscript𝛼1subscript𝛼2subscript𝛼3subscript𝜀1subscript𝜀2subscript𝜀3\alpha_{1},\alpha_{2},\alpha_{3},\varepsilon_{1},\varepsilon_{2},\varepsilon_{3}italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_α start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT Algorithm parameters σ,nouter,nρ,nv𝜎subscript𝑛outersubscript𝑛𝜌subscript𝑛𝑣\sigma,n_{\mathrm{outer}},n_{\rho},n_{v}italic_σ , italic_n start_POSTSUBSCRIPT roman_outer end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT italic_ρ end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT, δ𝛿\deltaitalic_δ
3:Initialize:
4:      v10superscript𝑣10v^{1}\leftarrow 0italic_v start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ← 0, v^10superscript^𝑣10\hat{v}^{1}\leftarrow 0over^ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ← 0, ρ10superscript𝜌10\rho^{1}\leftarrow 0italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ← 0, ρ^10superscript^𝜌10\hat{\rho}^{1}\leftarrow 0over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ← 0
5:for i=1𝑖1i=1italic_i = 1 to noutersubscript𝑛outern_{\mathrm{outer}}italic_n start_POSTSUBSCRIPT roman_outer end_POSTSUBSCRIPT do
6:     ρi+1FISTA(ρF(ρ,v^i),ρ^i)superscript𝜌𝑖1FISTAmaps-to𝜌𝐹𝜌superscript^𝑣𝑖superscript^𝜌𝑖\rho^{i+1}\leftarrow\text{FISTA}(\rho\mapsto F(\rho,\hat{v}^{i}),\hat{\rho}^{i})italic_ρ start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT ← FISTA ( italic_ρ ↦ italic_F ( italic_ρ , over^ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) , over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) with nρsubscript𝑛𝜌n_{\rho}italic_n start_POSTSUBSCRIPT italic_ρ end_POSTSUBSCRIPT iterations
7:     ρ^i+1smoothσ/i(ρi+1)superscript^𝜌𝑖1subscriptsmooth𝜎𝑖superscript𝜌𝑖1\hat{\rho}^{i+1}\leftarrow\mathrm{smooth}_{\sigma/i}(\rho^{i+1})over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT ← roman_smooth start_POSTSUBSCRIPT italic_σ / italic_i end_POSTSUBSCRIPT ( italic_ρ start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT )
8:     vi+1FISTA(vF(ρ^i+1,v),v^i)superscript𝑣𝑖1FISTAmaps-to𝑣𝐹superscript^𝜌𝑖1𝑣superscript^𝑣𝑖v^{i+1}\leftarrow\text{FISTA}(v\mapsto F(\hat{\rho}^{i+1},v),\hat{v}^{i})italic_v start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT ← FISTA ( italic_v ↦ italic_F ( over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT , italic_v ) , over^ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) with nvsubscript𝑛𝑣n_{v}italic_n start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT iterations
9:     v^i+1smoothσ/i(vi+1)superscript^𝑣𝑖1subscriptsmooth𝜎𝑖superscript𝑣𝑖1\hat{v}^{i+1}\leftarrow\mathrm{smooth}_{\sigma/i}(v^{i+1})over^ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT ← roman_smooth start_POSTSUBSCRIPT italic_σ / italic_i end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT )
10:     if 12vi+1vi2/vi2+12ρi+1ρi2/ρi2<δ12subscriptnormsuperscript𝑣𝑖1superscript𝑣𝑖2subscriptnormsuperscript𝑣𝑖212subscriptnormsuperscript𝜌𝑖1superscript𝜌𝑖2subscriptnormsuperscript𝜌𝑖2𝛿\frac{1}{2}\|v^{i+1}-v^{i}\|_{2}/\|v^{i}\|_{2}+\frac{1}{2}\|\rho^{i+1}-\rho^{i% }\|_{2}/\|\rho^{i}\|_{2}<\deltadivide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ italic_v start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT - italic_v start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT / ∥ italic_v start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ italic_ρ start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT - italic_ρ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT / ∥ italic_ρ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_δ then
11:         break      
12:return ρi+1,vi+1superscript𝜌𝑖1superscript𝑣𝑖1\rho^{i+1},v^{i+1}italic_ρ start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT

3.1. Models for comparison

We compare our proposed method (4) (we call this model OF  and label corresponding reconstructions OF, too) to three slightly different variants:

  • (a)

    Frame-wise model (FW): For each time point a separate reconstruction is computed and no correlation between time points is assumed. This corresponds to optimising (4) for the density ρ𝜌\rhoitalic_ρ only and setting α2subscript𝛼2\alpha_{2}italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and α3subscript𝛼3\alpha_{3}italic_α start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT to zero. Thus, we minimise

    F(ρ)=tAtρtyt2+α11(ρ)𝐹𝜌subscript𝑡superscriptnormsubscript𝐴𝑡subscript𝜌𝑡subscript𝑦𝑡2subscript𝛼1subscript1𝜌\displaystyle F(\rho)=\sum_{t}||A_{t}\rho_{t}-y_{t}||^{2}+\alpha_{1}\mathcal{R% }_{1}(\rho)italic_F ( italic_ρ ) = ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ )

    with respect to ρ=ρ1+iρ2𝜌superscript𝜌1𝑖superscript𝜌2\rho=\rho^{1}+i\rho^{2}italic_ρ = italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT applying algorithm 1 with nρsubscript𝑛𝜌n_{\rho}italic_n start_POSTSUBSCRIPT italic_ρ end_POSTSUBSCRIPT iterations, where

    1(ρ)=ε1(ρ1)+ε1(ρ2)subscript1𝜌subscriptsubscript𝜀1superscript𝜌1subscriptsubscript𝜀1superscript𝜌2\displaystyle\mathcal{R}_{1}(\rho)=\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{1% })+\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{2})caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) + caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )

    as above.

  • (b)

    Time derivative model (DT): The term containing the optical flow equation is replaced by a term containing time derivatives only. This means we fix the velocities to zero and optimise (4) for the density ρ𝜌\rhoitalic_ρ only. Thus, we minimise

    F(ρ)=tAtρtyt2+α11(ρ)+α33(M(ρ,0))𝐹𝜌subscript𝑡superscriptnormsubscript𝐴𝑡subscript𝜌𝑡subscript𝑦𝑡2subscript𝛼1subscript1𝜌subscript𝛼3subscript3𝑀𝜌0\displaystyle F(\rho)=\sum_{t}||A_{t}\rho_{t}-y_{t}||^{2}+\alpha_{1}\mathcal{R% }_{1}(\rho)+\alpha_{3}\mathcal{R}_{3}(M(\rho,0))italic_F ( italic_ρ ) = ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ ) + italic_α start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_M ( italic_ρ , 0 ) )

    with respect to ρ1+iρ2superscript𝜌1𝑖superscript𝜌2\rho^{1}+i\rho^{2}italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT applying algorithm 1 with nρsubscript𝑛𝜌n_{\rho}italic_n start_POSTSUBSCRIPT italic_ρ end_POSTSUBSCRIPT iterations, where

    1(ρ)=ε1(ρ1)+ε1(ρ2).subscript1𝜌subscriptsubscript𝜀1superscript𝜌1subscriptsubscript𝜀1superscript𝜌2\displaystyle\mathcal{R}_{1}(\rho)=\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{1% })+\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{2}).caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) + caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

    and

    3(M(ρ,0))=ε3([tρ1,tρ2]).subscript3𝑀𝜌0subscriptsubscript𝜀3subscript𝑡superscript𝜌1subscript𝑡superscript𝜌2\displaystyle\mathcal{R}_{3}(M(\rho,0))=\mathcal{H}_{\varepsilon_{3}}([% \partial_{t}\rho^{1},\partial_{t}\rho^{2}]).caligraphic_R start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_M ( italic_ρ , 0 ) ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( [ ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ) .

    as above promote spatial and temporal regularity, respectively.

  • (c)

    Optical flow using ground truth velocities (Cheat-OF): With this model we want to investigate how our method would perform if we had access to the velocities underlying the motion, i.e. this model gives us the best possible reconstruction within our framework. To this end, we use our model (4) with fully sampled data to determine ground truth velocities vGTsubscript𝑣GTv_{\mathrm{GT}}italic_v start_POSTSUBSCRIPT roman_GT end_POSTSUBSCRIPT. Then, we optimise (4) for the density ρ𝜌\rhoitalic_ρ only while fixing the velocity to be vGTsubscript𝑣GTv_{\mathrm{GT}}italic_v start_POSTSUBSCRIPT roman_GT end_POSTSUBSCRIPT. Thus, we minimise

    F(ρ)=tAtρtyt2+α11(ρ)+α33(M(ρ,vGT))𝐹𝜌subscript𝑡superscriptnormsubscript𝐴𝑡subscript𝜌𝑡subscript𝑦𝑡2subscript𝛼1subscript1𝜌subscript𝛼3subscript3𝑀𝜌subscript𝑣GT\displaystyle F(\rho)=\sum_{t}||A_{t}\rho_{t}-y_{t}||^{2}+\alpha_{1}\mathcal{R% }_{1}(\rho)+\alpha_{3}\mathcal{R}_{3}(M(\rho,v_{\mathrm{GT}}))italic_F ( italic_ρ ) = ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ ) + italic_α start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_M ( italic_ρ , italic_v start_POSTSUBSCRIPT roman_GT end_POSTSUBSCRIPT ) )

    with respect to ρ=ρ1+iρ2𝜌superscript𝜌1𝑖superscript𝜌2\rho=\rho^{1}+i\rho^{2}italic_ρ = italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_i italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT applying algorithm 1 with nρsubscript𝑛𝜌n_{\rho}italic_n start_POSTSUBSCRIPT italic_ρ end_POSTSUBSCRIPT iterations, where

    1(ρ)=ε1(ρ1)+ε1(ρ2)subscript1𝜌subscriptsubscript𝜀1superscript𝜌1subscriptsubscript𝜀1superscript𝜌2\displaystyle\mathcal{R}_{1}(\rho)=\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{1% })+\mathcal{H}_{\varepsilon_{1}}(\nabla\rho^{2})caligraphic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ρ ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) + caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ∇ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )

    and

    3(M(ρ,vGT))=ε3(M(ρ,vGT)\displaystyle\mathcal{R}_{3}(M(\rho,v_{\mathrm{GT}}))=\mathcal{H}_{\varepsilon% _{3}}(M(\rho,v_{\mathrm{GT}})caligraphic_R start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_M ( italic_ρ , italic_v start_POSTSUBSCRIPT roman_GT end_POSTSUBSCRIPT ) ) = caligraphic_H start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_M ( italic_ρ , italic_v start_POSTSUBSCRIPT roman_GT end_POSTSUBSCRIPT )

    as above.

3.2. Metrics

We consider two metrics to compare the quality of two images: PSNR (which is also used as the metric for the hyperparameter optimisation) and SSIM (which is the secondary alternative metric to compare image quality). To compute them, we use the respective functions provided by scikit-image where the ”data_range“ parameter is obtained from the ground truth image. Also, we compute the metrics for each time point and compute the final metric as the average over all time points.

3.3. Hyperparameter optimisation

Hyperparameter optimisation was carried out using scikit-optimize to find good sets of parameters for each model. We chose PSNR (computed on the dynamic mask, see section 4.2) as the metric to be minimised by the Bayesian optimisation. For the Bayesian optimisation we used the Optimizer class with a Gaussian process as the base estimator and expected improvement as the acquisition function. For the model in (4) and its optimisation we used the following hyperparameters. The regularisation parameters α1,α2,α3subscript𝛼1subscript𝛼2subscript𝛼3\alpha_{1},\alpha_{2},\alpha_{3}italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_α start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, the smoothness parameters ε1,ε2,ε3subscript𝜀1subscript𝜀2subscript𝜀3\varepsilon_{1},\varepsilon_{2},\varepsilon_{3}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT and the algorithm parameter σ𝜎\sigmaitalic_σ which gives the initial amount of smoothing of the density and velocity after each outer optimisation step. The number of tested parameters depends on the model. The more model parameters exist, the more combinations of parameters were considered within the Bayesian optimisation. More details can be found in the code.

4. Data

We work with two datasets taken from the OCMR dataset [8] containing cardiac cine series (long-axis view). The data consists of multi-coil k-space data acquired with Cartesian sampling on a Siemens MAGNETOM Avanto scanner (1.5T) (see [8] for more details). We use fully sampled data and apply the subsampling manually. A fully sampled reconstruction of both datasets can be found in fig. 2. Based on subject 1, we additionally consider a simulated dataset that resembles a real-world MRI image but satisfies the optical flow equation perfectly. This way, we can assess the model capacities and our optimisation procedure in an idealised setting.

4.1. Ground truth images

To assess the reconstruction quality, we need a suitable “ground truth” image to compare the results to. One possibility would be to use the fully sampled measurements to get reconstructions for each time frame separately using the FW  model. In doing so, however, one gets time trajectories that are slightly irregular and do not resemble realistic motion (see fig. 2 second column on the right) due to, e.g., in-frame motion. Therefore, we use a “dynamic ground truth” that results from minimising (4) with fully sampled data and high dynamic regularisation parameters. This results in reconstructions that are more regular in time (see fig. 2 first column on the right). This approach also has the advantage that we have “ground truth” velocities available that describe the dynamics within the ground truth image sequence. These can be used to assess the quality of our reconstructed velocities.

In the discussion section, we will briefly comment on the results being similar when using the frame-wise “ground-truth”.

4.2. Mask focusing on dynamic parts in images

We are primarily interested in the dynamic parts of the image and want to improve image quality in these regions particularly. Since large parts of the images are basically constant in time, computing loss metrics between ground truth images and reconstructions over the whole image would consequently result in focusing on the wrong parts. Therefore, we computed masks focusing on the dynamic parts of the images (see fig. 2 right column), and computed the image metrics on these masks only.

Refer to caption
Refer to caption
Figure 2. Each row shows a reconstruction for a different data set. The first column shows a dynamic ground truth (obtained as a reconstruction using (4) and fully-sampled data). The second column shows the same for a frame-wise ground truth (obtained from reconstructing each time frame separately (FW model) using fully-sampled data). The last column shows an overlay of the reconstructions with the dynamic masks. Within each column, the left side shows one time slice of the reconstructions, the right side shows time-space images along the red lines.

4.3. Simulated data

We test our proposed model on simulated data. To this end, we used a dataset from [8] (subject 1 in fig. 2) and solved (4) for fully sampled data. Then, we smoothed the obtained velocities in space and time and used the resulting velocities to simulate a sequence of images starting with the image at the first time point from solving (4). The thus obtained images and velocities are therefore guaranteed to satisfy the optical flow equation and the velocities are smooth enough to be reconstructable. Based on these simulated ground truth images we generated measurements by applying the MRI forward operator Atsubscript𝐴𝑡A_{t}italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. Finally, Gaussian noise (zero mean) was added where the standard deviation was chosen such that unregularised reconstructions yielded qualitatively similar images to unregularised reconstructions based on the original measured data.

4.4. Undersampling

We next describe the used undersampling. The data we are working with has been acquired with a standard Cartesian sampling scheme, and we consider four times undersampling in the phase encoding direction with full sampling of the rows in the readout direction. More precisely, we use the same x=15𝑥15x=15italic_x = 15 percent central rows in each time frame and randomly select 25x25𝑥25-x25 - italic_x percent of the outer rows (same number of rows on each side of the central rows) in such a way that the selected outer rows between consecutive time frames do not overlap.

5. Experiments and results

5.1. Simulated data

The results for the simulated data are summarised in table 1 and can visually be inspected in fig. 3. The top row in that figure shows the magnitude of the simulated images. In the second row, the same images are represented as complex-valued images where the hue represents the normalised phase and the brightness represents the magnitude. From this row of images, we see that the phase varies spatially, but its temporal changes are small.

The other rows depict the difference between the reconstructions using the respective model and the simulated images (magnitude images are compared). We can clearly see that DT  improves over the frame-wise approach FW, and that OF  improves over DT. The improved reconstruction qualities can also be seen in PSNR values in table 1 (first column), where OF  improves over FW  and DT  by 10dB and 4dB, respectively. Moreover, we see, that Cheat-OF , for which ground truth velocities are available in the reconstruction, is basically not improving over OF. Similar observations can be made by comparing the SSIM values in table 1.

This good reconstruction result of the OF  model is backed by the reconstructed velocities depicted in fig. 4 (bottom row). We see that the general form of the velocities has been recovered compared to the ground truth velocities (top row), hence a good reconstruction quality is expected.

Refer to caption
Figure 3. Results for the simulated data. The top row shows the evolution (magnitude) of the simulated ground truth images for the time points t0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, t2subscript𝑡2t_{2}italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, t4subscript𝑡4t_{4}italic_t start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, t6subscript𝑡6t_{6}italic_t start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT (out of 8888 time points). The second row shows the same images but as a complex-valued image: The hue represents the normalised phase and the brightness represents the magnitude. The other rows depict the difference between the magnitudes of the reconstructions and the magnitude of the simulated images on the dynamic mask. Rows 3-5: FW, DT  and OF.
Refer to caption
Figure 4. Reconstructed velocities of one time frame for the simulated data using our proposed complex-valued optical flow method. First row: Ground truth velocities in x𝑥xitalic_x and y𝑦yitalic_y direction. Second row: Reconstructed velocities in x𝑥xitalic_x and y𝑦yitalic_y direction

5.2. Subject 1

The summarised results of subject 1 can be found in table 1 and fig. 5. We see that our proposed method OF  performs better than FW  or DT. The comparison between OF  and the frame-wise approach yields PSNR improvements (a similar pattern can be observed in the SSIM values) of almost 3dB, showing that the complex optical flow equation helps the reconstruction by describing the underlying motion. The gains of OF  over DT  are not very pronounced. However, they improve to over 1dB when looking at finer structures in more detail as can be seen in fig. 5 in the third row. Further, the results in table 1 and fig. 5 also show that the method Cheat-OF  including ground truth velocities outperforms all the other methods by a lot (which is no surprise since this method basically has access to the fully sampled measurements via the ground truth velocities obtained from a fully sampled reconstruction using (4)). This improvement can be explained by comparing the reconstructed to the “ground truth” velocities in fig. 7. We see that the reconstruction quality is decent for the real part, whereas the reconstructed imaginary part differs significantly from the “ground truth” velocities’ imaginary part. Overall, the velocities are not as well reconstructed as for the simulated data, which explains the improved results using Cheat-OF.

5.3. Subject 2

The summarised results of subject 2 can be found in table 1 and fig. 6. We see that OF  improves PSNR over FW  and DT  by 2dB and 1.4dB, respectively. Therefore, the proposed optical flow model helps with reconstructing dynamic MRI images. Again, a similar conclusion can be drawn from the SSIM values. Looking at the reconstructed phase (fig. 6 rows 3 and 4), we see that all methods reconstruct the phase well on an overall level (row 3). However, the magnified area in row 4 shows differences in the reconstructed phases in the detailed structures. Here, OF  gives a better result over FW  and DT. This improved phase reconstruction is associated with a better reconstruction of details, as can be seen in the second row. Again, Cheat-OF, as expected, improves over all other methods.

 simulation

 subject 1 GT(dyn)

 subject 2 GT(dyn)

 subject 1 GT(FW)

 subject 2 GT(FW)

FW 34.1±1.2plus-or-minus34.11.234.1\pm 1.234.1 ± 1.2 27.3±0.7plus-or-minus27.30.727.3\pm 0.727.3 ± 0.7 25.6±0.6plus-or-minus25.60.625.6\pm 0.625.6 ± 0.6 27.7±0.8plus-or-minus27.70.827.7\pm 0.827.7 ± 0.8 26.4±0.9plus-or-minus26.40.926.4\pm 0.926.4 ± 0.9
DT 40.1±0.9plus-or-minus40.10.940.1\pm 0.940.1 ± 0.9 29.6±0.3plus-or-minus29.60.329.6\pm 0.329.6 ± 0.3 26.2±0.3plus-or-minus26.20.326.2\pm 0.326.2 ± 0.3 29.8±0.4plus-or-minus29.80.429.8\pm 0.429.8 ± 0.4 27.5±0.5plus-or-minus27.50.527.5\pm 0.527.5 ± 0.5
OF 44.1±0.6plus-or-minus44.10.644.1\pm 0.644.1 ± 0.6 30.2±0.5plus-or-minus30.20.530.2\pm 0.530.2 ± 0.5 27.6±0.4plus-or-minus27.60.427.6\pm 0.427.6 ± 0.4 30.2±0.5plus-or-minus30.20.530.2\pm 0.530.2 ± 0.5 28.0±0.5plus-or-minus28.00.528.0\pm 0.528.0 ± 0.5
PSNR Cheat-OF 44.2±0.3plus-or-minus44.20.344.2\pm 0.344.2 ± 0.3 34.4±0.7plus-or-minus34.40.734.4\pm 0.734.4 ± 0.7 32.0±0.7plus-or-minus32.00.732.0\pm 0.732.0 ± 0.7 33.1±0.4plus-or-minus33.10.433.1\pm 0.433.1 ± 0.4 28.7±1.0plus-or-minus28.71.028.7\pm 1.028.7 ± 1.0
FW 0.943±0.010plus-or-minus0.9430.0100.943\pm 0.0100.943 ± 0.010 0.828±0.014plus-or-minus0.8280.0140.828\pm 0.0140.828 ± 0.014 0.812±0.015plus-or-minus0.8120.0150.812\pm 0.0150.812 ± 0.015 0.847±0.014plus-or-minus0.8470.0140.847\pm 0.0140.847 ± 0.014 0.852±0.018plus-or-minus0.8520.0180.852\pm 0.0180.852 ± 0.018
DT 0.978±0.004plus-or-minus0.9780.0040.978\pm 0.0040.978 ± 0.004 0.869±0.009plus-or-minus0.8690.0090.869\pm 0.0090.869 ± 0.009 0.819±0.012plus-or-minus0.8190.0120.819\pm 0.0120.819 ± 0.012 0.880±0.009plus-or-minus0.8800.0090.880\pm 0.0090.880 ± 0.009 0.869±0.011plus-or-minus0.8690.0110.869\pm 0.0110.869 ± 0.011
OF 0.991±0.001plus-or-minus0.9910.0010.991\pm 0.0010.991 ± 0.001 0.892±0.010plus-or-minus0.8920.0100.892\pm 0.0100.892 ± 0.010 0.873±0.012plus-or-minus0.8730.0120.873\pm 0.0120.873 ± 0.012 0.888±0.011plus-or-minus0.8880.0110.888\pm 0.0110.888 ± 0.011 0.880±0.011plus-or-minus0.8800.0110.880\pm 0.0110.880 ± 0.011
SSIM Cheat-OF 0.992±0.001plus-or-minus0.9920.0010.992\pm 0.0010.992 ± 0.001 0.959±0.009plus-or-minus0.9590.0090.959\pm 0.0090.959 ± 0.009 0.942±0.009plus-or-minus0.9420.0090.942\pm 0.0090.942 ± 0.009 0.934±0.004plus-or-minus0.9340.0040.934\pm 0.0040.934 ± 0.004 0.872±0.018plus-or-minus0.8720.0180.872\pm 0.0180.872 ± 0.018
Table 1. Summary of the results. The columns show the metrics for the respective data sets (with dynamic ground truth GT(dyn) and frame-wise ground truth GT(FW)). The displayed values are (average)±(standard deviation)plus-or-minusaveragestandard deviation(\text{average})\pm(\text{standard deviation})( average ) ± ( standard deviation ) taken over the considered time points. The metric for each time point is computed on a dynamic mask (see section 4.2, fig. 2).
Refer to caption
Figure 5. Reconstructions for subject 1. Left to right columns: Ground truth image, FW, DT, OF and Cheat-OF. First row shows magnitude of reconstructions. Second row: Difference of reconstruction to ground truth on the mask. Third row: Magnified area and corresponding metrics on magnification.
Refer to caption
Figure 6. Reconstructions for subject 2. Left to right columns: Ground truth image, FW, DT, OF and Cheat-OF. First and second row show magnitude of reconstructions with respective metrics. Third and fourth row show the phase of the reconstructions. Second and last row are the indicated magnified areas.
Refer to caption
Figure 7. Comparison between ground truth velocities (top row) and reconstructed velocities (bottom row) of subject 1 for a single time frame.

6. Discussion

Overall, our experiments demonstrated that explicitly considering the complex-valued optical flow equation (2) is useful for modelling motion between complex-valued images. In particular, our simulated experiment showed large reconstruction improvements. This, of course, is to be expected as the underlying ground truth motion perfectly satisfies the optical flow equation and the corresponding velocities follow the posed prior assumptions. Still, this simplified example shows that the optimisation procedure is suitable for the problem, and meaningful results are obtained. Moreover, we have mentioned already in section 5.1 that Cheat-OF  is basically not improving over OF  in the simulation, suggesting that OF  is able to produce nearly optimal reconstructions in case the underlying images satisfy the considered prior assumptions.

Comparing the ground truth velocities of the simulation (fig. 4) and of the measured subject 1 (fig. 7) we see that they are of different quality. In particular, the simulated velocity appears to be much smoother in space, hence they are more in accordance with our prior modelling. On the other hand, the ground truth velocities of subject 1 are much rougher in space which is not reflected in the reconstructed velocity. This of course influences reconstruction quality and partly explains the differences in reconstruction quality compared to the simulation. In future work, it will be interesting to investigate different priors that better resemble the underlying velocities’ structure. For instance, one could use Haar wavelets, include regularisation in time, or follow a learning-based approach that can derive very specialised and problem-specific priors.

For our “ground truth” images we argued in section 4.2 that using “dynamic ground truth” images is more appropriate in our setting. However, we repeated the procedures for subjects 1 and 2 with “frame-wise ground truth” images (for the simulation both versions of ground truth images coincide) with similar results, where PSNR improvements are slightly less pronounced (2.5dB over FW  for subject 1 and 1.6dB over FW  for subject 2, see table 1 right two columns). This suggests that the overall results do not depend on the choice of “ground truth” images.

7. Conclusions

Dynamic MRI (like cardiac MRI) is an important clinical imaging modality for which measurement undersampling is vital to get reasonably fast acquisitions. To remedy the image reconstruction artifacts naturally appearing due to this undersampling, motion models such as the optical flow equation can be used to propagate information across time frames. Current models have followed this approach for real-valued images. In this work, we have generalised the procedure to complex-valued images by proposing a complex version of the optical flow equation that can account for phase changes between time frames. We tested the new model on two real cardiac datasets and saw reconstruction improvements of the complex optical flow model over the comparison methods.

We used central differences for spatial derivatives in the optical flow equation. In future work, one could analyse which discrete differentiation approach works best in the context of the complex optical flow equation. Related is the question of whether a CFL condition changes when transitioning from the real to the complex optical flow equation.

We also saw that including “ground truth” velocities into the model further improved reconstruction quality implying that the currently chosen regularisation for the velocities is not optimal. In future work, it will be interesting to further investigate how to best choose regularisers for the velocities to achieve better performance (such an optimal choice is not even clear for the standard real-valued optical flow motion model).

Overall, we observed that the complex optical flow model is well-suited for cardiac MRI data. Of course, other options are conceivable. For instance, the continuity equation is a typical way of describing motion (especially in three dimensions). Also, a learning-based approach could be vital to adapt the model to the underlying problem so that problem-specific features can be taken into account. It will be interesting to compare different approaches with each other.

Acknowledgements

MM would like to express gratitude to Carola-Bibiane Schönlieb for generously hosting a research stay and supporting this work. MM acknowledges support from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC 2044 –, Mathematics Münster: Dynamics – Geometry – Structure, and under the Collaborative Research Centre 1450–431460824, InSight, University of Münster.

MJE acknowledges support from the EPSRC (EP/S026045/1, EP/T026693/1, EP/V026259/1, EP/Y037286/1) and the European Union Horizon 2020 research and innovation programme under the Marie Skodowska-Curie grant agreement REMODEL.

Calculations for this publication were performed on the HPC cluster PALMA II of the University of Münster, subsidised by the DFG (INST 211/667-1).

Special thanks to Christoph Kolbitsch and Kris Thielemans for their insightful discussions on dynamic MR, which greatly enriched the research process.

References

  • [1] Angelica I Aviles-Rivero, Noémie Debroux, Guy Williams, Martin J Graves, and Carola-Bibiane Schönlieb. Compressed sensing plus motion (cs + m): A new perspective for improving undersampled mr image reconstruction. Med Image Anal., 2021.
  • [2] Elwin C. Bassett, Eugene G. Kholmovski, Brent D. Wilson, Edward V. R. DiBella, Derek J. Dosdall, Ravi Ranjan, Christopher J. McGann, and Daniel Kim. Evaluation of highly accelerated real-time cardiac cine mri in tachycardia. NMR in Biomedicine, 27(2):175–182, 2014.
  • [3] Amir Beck and Marc Teboulle. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences, 2(1):183–202, 2009.
  • [4] Jean-David Benamou and Yann Brenier. A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem. Numer. Math., 84(3):375–393, 2000.
  • [5] Martin Burger, Hendrik Dirks, Lena Frerking, Andreas Hauptmann, Tapio Helin, and Samuli Siltanen. A variational reconstruction method for undersampled dynamic x-ray tomography based on physical motion models. Inverse Problems, 33(12):124008, nov 2017.
  • [6] Martin Burger, Hendrik Dirks, and Carola-Bibiane Schönlieb. A variational model for joint motion estimation and image reconstruction. SIAM Journal on Imaging Sciences, 11(1):94–128, 2018.
  • [7] Chong Chen, Barbara Gris, and Ozan Öktem. A new variational model for joint image reconstruction and motion estimation in spatiotemporal imaging. SIAM Journal on Imaging Sciences, 12(4):1686–1719, 2019.
  • [8] Chong Chen, Yingmin Liu, Philip Schniter, Matthew Tong, Karolina Zareba, Orlando Simonetti, Lee Potter, and Rizwan Ahmad. Ocmr (v1.0)–open-access multi-coil k-space dataset for cardiovascular magnetic resonance imaging. 2020. arXiv:2008.03410.
  • [9] Richard Kinh Gian Do, Henry Rusinek, and Bachir Taouli. Dynamic contrast-enhanced mr imaging of the liver: current status and future directions. Magnetic resonance imaging clinics of North America, 17(2):339–49, 2009.
  • [10] Li Feng, Monvadi B. Srichai, Ruth P. Lim, Alexis Harrison, Wilson King, Ganesh Adluru, Edward V. R. Dibella, Daniel K. Sodickson, Ricardo Otazo, and Daniel Kim. Highly accelerated real-time cardiac cine mri using k–t sparse-sense. Magnetic Resonance in Medicine, 70(1):64–74, 2013.
  • [11] J.A. Fessler and D.C. Noll. Iterative image reconstruction in mri with separate magnitude and phase regularization. In 2004 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX821), pages 209–212 Vol. 1, 2004.
  • [12] Hao Gao, Stanislas Rapacchi, Da Wang, John Moriarty, Conor Meehan, James Sayre, Gerhard Laub, Paul Finn, and Peng Hu. Compressed sensing using prior rank, intensity and sparsity model (prism): applications in cardiac cine mri. In Proceedings of the 20th Annual Meeting of ISMRM, Melbourne, Australia, volume 2242, 2012.
  • [13] Yaron Gordon, Sasan Partovi, and Matthias et al. Müller-Eschner. Dynamic contrast-enhanced magnetic resonance imaging: fundamentals and application to the evaluation of the peripheral perfusion. Cardiovascular diagnosis and therapy, 4(2):147–164, 2014.
  • [14] Andreas Hauptmann, Ozan Öktem, and Carola Schönlieb. Image Reconstruction in Dynamic Inverse Problems with Temporal Models, pages 1–31. Springer International Publishing, Cham, 2021.
  • [15] J. S. Jørgensen, E. Ametova, G. Burca, G. Fardell, E. Papoutsellis, E. Pasca, K. Thielemans, M. Turner, R. Warr, W. R. B. Lionheart, and P. J. Withers. Core imaging library - part i: a versatile python framework for tomographic imaging. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 379(2204):20200192, 2021.
  • [16] Dong Liang, Bo Liu, JiunJie Wang, and Leslie Ying. Accelerating sense using compressed sensing. Magnetic Resonance in Medicine, 62(6):1574–1584, 2009.
  • [17] Michael Lustig, David Donoho, and John M. Pauly. Sparse mri: The application of compressed sensing for rapid mr imaging. Magnetic Resonance in Medicine, 58(6):1182–1195, 2007.
  • [18] Ricardo Otazo, Emmanuel Candès, and Daniel K. Sodickson. Low-rank plus sparse matrix decomposition for accelerated dynamic mri with separation of background and dynamic components. Magnetic Resonance in Medicine, 73(3):1125–1136, 2015.
  • [19] Evgueni Ovtchinnikov, Richard Brown, Christoph Kolbitsch, Edoardo Pasca, Casper da Costa-Luis, Ashley G. Gillman, Benjamin A. Thomas, Nikos Efthimiou, Johannes Mayer, Palak Wadhwa, Matthias J. Ehrhardt, Sam Ellis, Jakob S. Jørgensen, Julian Matthews, Claudia Prieto, Andrew J. Reader, Charalampos Tsoumpas, Martin Turner, David Atkinson, and Kris Thielemans. Sirf: Synergistic image reconstruction framework. Computer Physics Communications, 249:107087, 2020.
  • [20] Chen Qin, Jo Schlemper, Jose Caballero, Anthony N. Price, Joseph V. Hajnal, and Daniel Rueckert. Convolutional recurrent neural networks for dynamic mr image reconstruction. IEEE Transactions on Medical Imaging, 38(1):280–290, 2019.
  • [21] Jo Schlemper, Jose Caballero, Joseph V. Hajnal, Anthony N. Price, and Daniel Rueckert. A deep cascade of convolutional neural networks for dynamic MR image reconstruction. IEEE Trans. Medical Imaging, 37(2):491–503, 2018.
  • [22] Bernhard Schmitzer, Klaus P. Schäfers, and Benedikt Wirth. Dynamic cell imaging in pet with optimal transport regularization. IEEE Transactions on Medical Imaging, 39(5):1626–1635, 2020.
  • [23] Thomas Schuster, Bernadette Hahn, and Martin Burger. Dynamic inverse problems: modelling—regularization—numerics. Inverse Problems, 34(4):040301, mar 2018.
  • [24] Zoran Stankovic, Bradley D. Allen, Julio Garcia, Kelly B. Jarvis, and Michael Markl. 4d flow imaging with mri. Cardiovascular Diagnosis and Therapy, 4(2), 2014.
  • [25] Sonja Sudarski, Thomas Henzler, and Holger et al. Haubenreisser. Free-breathing sparse sampling cine mr imaging with iterative reconstruction for the assessment of left ventricular function and mass at 3.0 t. Radiology, 282(1):74–83, 2017.
  • [26] Muhammad Usman, David Atkinson, Freddy Odille, Christoph Kolbitsch, Ghislain Vaillant, Tobias Schaeffter, Philip G. Batchelor, and Claudia Prieto. Motion corrected compressed sensing for free-breathing dynamic cardiac mri. Magnetic Resonance in Medicine, 70(2):504–516, 2013.
  • [27] Tuomo Valkonen. A primal–dual hybrid gradient method for nonlinear operators with applications to mri. Inverse Problems, 30(5):055012, may 2014.
  • [28] Junwei Yang, Thomas Küstner, and Peng et al. Hu. End-to-end deep learning of non-rigid groupwise registration and reconstruction of dynamic mri. Frontiers in Cardiovascular Medicine, 9, 2022.
  • [29] Ningning Zhao, Daniel O’Connor, and Basarab Adrian et al. Motion compensated dynamic mri reconstruction with local affine optical flow estimation. EEE transactions on bio-medical engineering, 66(11):3050–3059, 2019.