Research paper
Denoising of diffusion magnetic resonance images using a modified and differentiable Monge–Kantorovich distance for measure-valued functions

https://doi.org/10.1016/j.cnsns.2020.105456Get rights and content

Abstract

We report on the implementation of a novel total-variation denoising method for diffusion spectrum images (DSI). Our method works on the raw signal obtained from dMRI. From the Stejskal-Tanner equation [6], the signals S(x, sk), 1 ≤ k ≤ K, at a given voxel location x may be considered as samplings of a measure supported on the unit sphere S2R3 at locations sk=(θk,ϕk)S2 which quantify the ease/difficulty of diffusion in these directions. We consider the entire signal S as a measure-valued function in a complete metric space which employs the Monge–Kantorovich (MK) metric. A total variation (TV) for measures and measure-valued functions is also defined. A major advance in this paper is the use of a modification of the standard MK distance which permits rapid computation in higher dimensions. An added bonus is that this modified metric is differentiable. The resulting TV-based denoising problem is a convex optimization problem.

Introduction

In [11], a novel framework for the representation of images obtained from diffusion magnetic resonance imaging (dMRI) [8] and their denoising using total variation was presented. In this framework, the (digital) images obtained from dMRI are represented by measure-valued functions (MVFs): If we let X ⊂ Rn denote the base space, i.e., the support of an image (e.g., a human brain), then the dMR image is a mapping μ:XM(Y), where YRm, m=1,2 or 3, denotes an appropriate range space and M(Y) the set of Borel probability measures on Y. For example, if Y=S2={uR3|u2=1}, the unit sphere in R3, then the measure μ(x)M(S2) can characterize the relative ease or difficulty of diffusion of water molecules in the voxel located at x ∈ X in the direction s=Ω=(θ,ϕ)S2.

Let us contrast this framework with the conventional way in which images from dMRI, in particular, high angular diffusion resolution imaging (HARDI), are represented, namely, as vector-valued functions, i.e., u(x)=(u1(x),u2(x),uK(x)), composed of signals, uk(x), which are obtained by aligning the linear diffusion gradient vector with gradient gR3 in K different directions, sk=(θk,ϕk)S2, 1 ≤ k ≤ K. Most image processing algorithms simply work with such vector-valued functions in a standard manner, treating them as “cubes”. What we are proposing is to work with dMRI signals in a manner that attempts to transcend such approaches in which the net signal u is simply regarded as a “stack” of component signals uk.

In an even earlier paper [13], we examined the use of function-valued mappings (FVMs) for HARDI. In this case, a HARDI signal is represented by the function-valued mapping u:XL2(S2). In all of these works – including this paper – our philosophy has been, as described in our more recent paper on FVMs and their use in hyperspectral imaging [16], to investigate if “vector-valued images be better understood, and perhaps better algorithms be developed, if their range were a space of continuously defined functions instead of RN.” Indeed, with continued improvements in technology, the number of components K in dMRI, as well as in hyperspectral imaging for remote sensing [1], is steadily increasing, essentially approaching the “continuum limit” of FVM and MVF.

In [11], our total variation-based denoising problem was formulated as follows: Given a noisy (measure-valued) image μ˜, find a solution to the following optimization problem,minνF(X)dF(X)(μ˜,ν)+νTV.Here, F(X) denotes an appropriate space of measure-valued functions suppported on X, with metric dF(X), and ‖νTV denotes the total variation of the measure-valued function νF(X). Both of these will be reviewed in the next section.

As discussed in [11], the metric dF(X) requires an appropriate metric between measures in the space M(Y). A natural choice is the so-called Monge-Kantorovich (MK) metric for measures, also to be reviewed below. Unfortunately, the computation of the traditional MK distance between (discrete) probability measures is computationally inexpensive only in one dimension, i.e., measures on R. The determination of efficient algorithms to compute MK distances in R2 or S2 and R3 is still an open problem. In [11], we showed how to solve the optimization problem of denoising using total variation minimization for the rather artificial – but still mathematically interesting – one-dimensional case, exploiting the fact that the MK distance between two one-dimensional measures is the L1 distance between their respective cumulative distribution functions. For the much more difficult higher-dimensional case, as we wrote in [11], “an alternative is to replace the Monge-Kantorovich metric on measures with another metric.” This is indeed one of the major accomplishments of this paper, representing a major advance over [11]: We propose a modification of the MK metric which is computationally inexpensive not only in one dimension but also in higher-dimensional spaces. An added bonus is that this modified metric is differentiable. The resulting total variation denoising scheme for measure-valued functions becomes a convex optimization problem which can be applied to real data, as we show below.

In closing this section, we review a few basic ideas from dMRI which provide the basis of our measure-valued approach. The reader may recall that in standard magnetic resonance imaging (MRI), a linear three-dimensional magnetic field is used so that the MRI signal is a Fourier transform. Inversion of this signal produces an “image” of the object being examined. In dMRI, a linear magnetic field is also used to define the direction of diffusion which is being examined. By means of an appropriate pulsing of a radio-frequency field in a spin-echo measurement, the relative ease/difficulty of the diffusion of “active” molecules, principally water, is measured. The strength of the signal obtained at a voxel located at x ∈ X in the direction sS2 is given by the following form of the so-called Stejskal-Tanner equation [8],S(x,s)=S0(x)exp(bg^sTD(x)g^s),where

  • S0(x) is the strength of the signal obtained in the absence of diffusion,

  • b > 0 is the so-called b-value which is determined by the physics behind the measurement, i.e., the gyromagnetic ratio, γ, along with the pulsing time T1,

  • g^s is the unit vector with direction sS2, the direction of the gradient of the linear diffusion gradient field,

  • D(x) is the 3 × 3 diffusion tensor at x. If the medium is isotropic in a neighbourhood of x, then D=dI, where d is a constant (the diffusion coefficient) and I is the identity matrix.

In HARDI imaging, the signal attenuation can also be modelled as follows [9],S(x,s)=S0(x)exp((bd(x,s)),where d(x, s), the spherical Apparent Diffusion Coefficient(sADC), characterizes the rate of diffusion in the direction sS2. The sADC d(x, s) may be viewed as a function defined on the unit sphere S2, which provides a local map of the rates of diffusion of water molecules from location x in all (or at least experimentally realizable) directions. A related quantity is the diffusion orientation probability distribution(dODF) O(x, s), which is defined to be the probability that a diffusing water molecule at x moves in direction s [8]. (As is well known – and we simply mention it here to keep our discussion brief – the method of tractography [2], [12] uses this information to determine the anatomical structure of white matter in the brain by means of connectomes, visual representations of neural fibre connections. This information can then be used for diagnostic purposes [4].)

In this study, we choose to work with the raw experimental data S(x, sk), 1 ≤ k ≤ K, which is contaminated – usually quite highly – with instrument noise. Our goal is to denoise this data. From Eqs. (1.2) and (1.3), we see that diffusion attenuates the (theoretically noiseless) signal S. For a given x ∈ X, a direction for which d(x, s) (or O(x, s)) is large (small), i.e., a large diffusion rate, corresponds to a small (large) value of the signal strength S(x, s). This implies that the signal strength S(x, s) is (exponentially) inversely related to the diffusion rate at x in the direction s – essentially a measure of the resistance to diffusion. Even though, for each x ∈ X, the signal S(x, s), considered as a function of sS2 does not define a measure over S2 (in contrast to the ODF defined earlier), we propose to treat is as a probability measure (after suitable normalization). The motivation is that measure-based distance functions, such as the Monge-Kantorovich metric, should work better than simple metrics for functions such as Lp to characterize differences in information – in this case anisotropic diffusion – that is contained in the dMRI data. Indeed, there has been interest in the use of metrics derived from information theory (e.g., Fisher-Rao metric, von Mises-Fisher distribution) to measure the overlap/difference between ODFs – see the discussion and references contained in [5], [9] – but, to the best of our knowledge, no use of such metrics in any denoising algorithm has yet been performed, apart from [11] and this paper.

Section snippets

Measure-valued images

As mentioned in Section 1, we let XRn denote our “base space,” the physical space which contains the object being imaged (e.g., a human brain). Also let YRm be a compact range space and M(Y) the set of probability measures on Borel subsets of Y. In our particular applications below, m=n=3, with X=[0,1]3 and Y=S2, the unit sphere in R3, which represents the set of all directions from any point x ∈ X.

For any two measures α,βM(Y), the Monge–Kantorovich distance [10] is defined as follows,dMK(α,β

Discretization

Our numerical experiments were performed using data from the Stanford Digital Repository (https://purl.stanford.edu/ng782rw8378) – see also [17]. This data was acquired from two human subjects and was measured using 150 different directions (the directions are shown in Fig. 1). Our discussion of the discretization below is specific to this data set but the generalization to another is clear.

For the purposes of computation the domain space of the image, X, is partitioned into a 3D grid of voxels

Numerical experiments

We performed (using a mixture of MATLAB and python) preliminary numerical experiments both with small images and large images. The small images were extracted from the large ones and were used extensively in tuning the optimization process (in particular the weight λ).

For our preliminary experiments we used a variant of gradient descent with a careful control of the step size. Since we are using a modified Monge-Kantorovich distance, it is important that the total mass at each voxel remains

Concluding remarks

In this paper, we consider the raw signal obtained from dMRI as defining a measure-valued function. From the Stejskal-Tanner equation, at each voxel x ∈ X, the signals S(x, sk), 1 ≤ k ≤ K, are considered to be samplings of a measure supported on the unit sphere S2. This measure characterizes the ease/difficulty of diffusion of water molecules from x. A modified version of the classical Monge-Kantorovich metric – a metric between measures – is employed in an effort to characterize the

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This research was supported in part by Discovery Grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) (ERV and FM: 2019-05237). Financial support from Acadia University (JM) is also gratefully acknowledged.

References (20)

  • Airborne Visible/Infrared Imaging Spectrometer, NASA Jet Propulsion Laboratory, California Institute of Technology...
  • P.J. Basser et al.

    MR diffusion tensor spectroscopy and imaging

    Biophys J

    (1994)
  • A. Ben-Israel et al.

    Generalized inverses: theory and applications

    CMS books in mathematics

    (2003)
  • D.L. Bihan et al.

    MR imaging of intravoxel incoherent motions: application to diffusion and perfusion in neurological disorders

    Radiology

    (1986)
  • V. Brion et al.

    Noise correction for HARDI and HYDI data obtained with multi-channel coils and sum of squares reconstruction: an anisotropic extension of the LMMSE

    Magn Reson Imaging

    (2013)
  • P.T. Callahan

    Principles of nuclear magnetic resonance microscopy

    (1991)
  • B. Goldluecke et al.

    The natural vectorial total variation which arises from geometric measure theory

    SIAM J Imaging Sci

    (2012)
  • H. Johansen-Berg et al.

    Diffusion MRI: from quantitative measurements to in-vivo neuroanatomy

    (2009)
  • Y. Kim et al.

    HARDI data denoising using vectorial total variation and logarithmic barrier

    Inverse Prob Imaging

    (2010)
  • H. Kunze et al.

    Fractal-based methods in analysis

    (2012)
There are more references available in the full text version of this article.

Cited by (0)

View full text