Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Jul 31.
Published in final edited form as: Magn Reson Med. 2024 Feb 2;92(1):319–331. doi: 10.1002/mrm.30047

Self-supervised multi-contrast super-resolution for diffusion-weighted prostate MRI

Batuhan Gundogdu 1, Milica Medved 1, Aritrick Chatterjee 1, Roger Engelmann 1, Avery Rosado 1, Grace Lee 1, Nisa C Oren 1, Aytekin Oto 1, Gregory S Karczmar 1
PMCID: PMC11288973  NIHMSID: NIHMS2009418  PMID: 38308149

Abstract

Purpose:

This study addresses the challenge of low resolution and signal-to-noise ratio (SNR) in diffusion-weighted images (DWI), which are pivotal for cancer detection. Traditional methods increase SNR at high b-values through multiple acquisitions, but this results in diminished image resolution due to motion-induced variations. Our research aims to enhance spatial resolution by exploiting the global structure within multi-contrast DWI scans and millimetric motion between acquisitions.

Methods:

We introduce a novel approach employing a ”Perturbation Network” to learn sub-voxel-size motions between scans, trained jointly with an implicit neural representation (INR) network. INR encodes the DWI as a continuous volumetric function, treating voxel intensities of low-resolution acquisitions as discrete samples. By evaluating this function with a finer grid, our model predicts higher-resolution signal intensities for intermediate voxel locations. The Perturbation Network’s motion-correction efficacy was validated through experiments on biological phantoms and in-vivo prostate scans.

Results:

Quantitative analyses revealed significantly higher structural similarity measures of super-resolution images to ground truth high-resolution images compared to high-order interpolation (p < 0.005). In blind qualitative experiments, 96.1% of super-resolution images were assessed to have superior diagnostic quality compared to interpolated images.

Conclusion:

High-resolution details in DWI can be obtained without the need for high-resolution training data. One notable advantage of the proposed method is that it does not require a super-resolution training set. This is important in clinical practice because the proposed method can easily be adapted to images with different scanner settings or body parts, whereas the supervised methods do not offer such an option.

Keywords: super-resolution for MRI, DWI, prostate MRI, implicit neural representation

1 |. INTRODUCTION

Diffusion-weighted images (DWI) are sensitive to the molecular displacement of water within each voxel. By acquiring DWI at varying diffusion-weighting values (b-values), we can calculate apparent diffusion coefficient (ADC) maps. These maps reveal tissue diffusivity levels and are particularly important because the increased cell density typically present in cancer significantly reduces the ADC values, which allows cancer detection. However, DWI often exhibits lower image resolution and signal-to-noise ratio (SNR) compared to other imaging sequences like T1 and T2-weighted images. To improve the SNR, multiple high-b DWI acquisitions are performed. These are then averaged to create a single image. However, this approach is susceptible to significant artifacts1. Artifacts in these images mainly arise from two types of motion or magnetic field fluctuations: (1) gross motion or field changes between acquisitions (e.g., due to breathing, rectal gas motions), and (2) localized motions or field changes during diffusion encoding gradients (e.g., arterial pulse or peristalsis). The non-rigid gross motion occurring between the separate signal acquisitions makes the image blurry, consequently reducing the resolution whereas the localized motion or B0 field fluctuations during the acquisition can reduce or completely suppress the signal2.

This presents a challenging situation with no clear solution. Specifically, when dealing with small voxel sizes, the common approach of signal averaging (obtaining multiple acquisitions and averaging them) can enhance the SNR but leads to decreased image quality and resolution due to motion-induced variations between acquisitions. In clinical practice, high SNR, reduced scanning times, and patient comfort typically outweigh the need for high resolution. Consequently, DWI images often have low resolution, affecting their specificity and the accuracy of targeted biopsies3. Thus, resolution is considered crucial to the ‘diagnostic image quality’ in prostate MRI. This is measured with the PI-QUAL score, which reflects the extent to which the image assists in ruling in and ruling out clinically significant cancer4,5. Studies using the PI-QUAL score have shown that DWI image quality, including SNR and resolution, significantly impacts the accuracy of cancer detection6,7,8,9.

This study introduces a super-resolution model for diffusion-weighted prostate imaging that potentially reduces scan times and improves patient comfort, particularly for those with claustrophobia or Parkinson’s disease. The method enables high-quality imaging from lower-resolution inputs, mitigating the need for a large number of averages needed for obtaining high-resolution images with adequate SNR. Our study aims to improve prostate DWI resolution through self-supervised deep learning, utilizing anatomical patterns across varying b-values and the non-linear inter-acquisition motion. Its efficacy was assessed using in-vivo prostate MR images with biopsy-confirmed cancers and carefully designed biological phantom experiments.

1.1 |. Super-resolution in Medical Images

Super-resolution in medical images has been a topic of interest over the past two decades. Early methods concentrated on multi-image registration and stitching, utilizing shifted, orthogonal, anisotropic, or rotated scans10,11,12,13,14. However, the effectiveness of methods using linear sub-voxel shifts has been debated, as they provide minimal or no in-plane improvement. This is because MRI is a Fourier imaging technique, here rigid spatial shifts only introduce a linear phase gradient without yielding a more densely sampled k-space15. Consequently, these techniques have only shown success in achieving through-plane super-resolution.

The success of deep learning-based super-resolution technologies in real-world imaging has inspired the application of similar supervised methods in medical imaging, drawing from techniques proven effective in digital photography16. Recent supervised deep learning methods assume that a low-resolution image is a ‘degraded’ version of its high-resolution counterpart, aiming to reverse this ‘degradation process’ to recover the high-resolution image17. While these methods are increasingly applied and show promise, it’s crucial to note that, as with other medical deep learning applications, their effectiveness is often limited by the scarcity of available training data. Consequently, much of the super-resolution training in medical imaging depends on synthetically degraded input-output pairs.

However, it’s important to approach these supervised methods cautiously. In medical imaging, high- and low-resolution image pairs are not readily available, and synthetically degrading images to create these pairs for training can lead to artifacts and hallucinations. The emerging self-supervised super-resolution paradigm presents a promising solution to overcome these limitations. Self-supervised learning utilizes the inherent information in the data, eliminating the need for explicit high- and low-resolution pairs. Table 1 offers a concise summary of the relevant literature in this domain.

TABLE 1.

Literature Review of Super-resolution in Medical Imaging.

Reference Year Modality Body Part Method

Peled et.al10 2001 DWI brain registration of shifted scans
Carmi et al.11 2006 T2W phantom registration of rotated scans
Kennedy et al.12 2007 PET/CT lung registration of shifted scans
Mayer et.al.13 2007 T2W phantom registration of shifted scans
Lu et.al.14 2010 T2W body registration of parallel scans from different coils
Scherrer et.al.15 2011 DWI brain registration of rotated scans
Tieng et.al.18 2011 T2W phantom registration of shifted scans
Van Reeth et.al.19 2012 T2W brain registration of rotated/shifted scans (review article)
Scherrer et.al.20 2012 DWI brain registration of orthogonal scans
Wu et.al.21 2016 DWI brain High order singular value decomposition
Jurek et.al.22 2017 T2W prostate registration of orthogonal scans
Alexander et.al.23 2017 DWI brain patch regression
Chaudhari et.al.16 2018 T2W knee supervised CNN
Zeng et.al.24 2018 T1W brain supervised CNN
Chen et.al.25 2018 T2W brain supervised CNN
Sood et.al.26 2019 T2W prostate supervised GAN
Liu et.al.27 2019 T2W prostate supervised CNN
Hong et.al.28 2019 DWI brain supervised CNN
Dar et.al.29 2019 T1, T2 brain supervised GAN
He et.al.30 2020 T2W prostate supervised deep attention networks
Zhou et.al.31 2020 T1, T2, FLAIR brain supervised multimodal fusion-net
Park et.al.32 2021 T2W lung supervised autoencoder-inspired CNN
Sood et.al.33 2021 T2W prostate supervised GAN
Chatterjee et.al.34 2021 DWI brain supervised Unet
Wu et.al.35 2021 T1W brain self-supervised IRN
Jiang et.al.36 2022 T1W brain supervised CNN with real 0.35T vs 3T scans
Molahasani et.al.37 2022 T2W prostate supervised GAN
Mahaptra et.al.38 2022 PD, T1 and T2 brain supervised CNN and vision transformers
Guo et.al.39 2023 DWI brain semi-supervised GAN
This work 2023 DWI prostate self-supervised INR

CNN: Convolutional Neural Networks, GAN: Generative Adversarial Networks, IRN: Implicit Neural Representation

This paper presents a self-supervised super-resolution model* for DWI utilizing the Implicit Neural Representation (INR) framework40. Unlike supervised methods that assume a low-resolution image is simply a degraded high-resolution image and seek to learn the inverse degradation function, our model adopts a more realistic assumption. It posits that the high-resolution image is a continuous function mapping the continuous domain of x-space to the range of signal intensities, and the low-resolution voxels are merely samples of this continuous function. Therefore, the objective is to learn this continuous function via the INR network, using the sparse samples (voxels) from the low-resolution image in the training set. In other words, this framework uses the voxel locations as input, and the acquired signal intensities within these voxels as the output.

While the INR framework was initially not designed for super-resolution, Wu et. al. showed that using INR is an effective means of super-resolution on single-contrast T1W brain images35. We introduce two novelties to the INR approach and extend its application to DWI of the prostate: Firstly, we incorporate multiple degrees of diffusion weighting (quantified by the b-value) along with the standard 3D spatial inputs. This modification transforms the INR into a multi-contrast solution, thereby broadening its scope of application. Second, we introduce the input Perturbation Network to correct voxel mismatches observed between different signal acquisitions and potentially to harvest important morphological details that might exist in individual acquisitions but can be lost within the large voxel sizes and during aggregation of different acquisitions. This Perturbation Network learns to move the input voxel locations slightly for each acquisition while minimizing the reconstruction loss of the INR.

An analysis of the literature in Table 1 reveals that most super-resolution research focuses on brain imaging, with prostate MRI studies being relatively scarce, even though prostate cancer is the second deadliest cancer in men41. Moreover, the majority of super-resolution applications in MRI have been conducted using T1 or T2-weighted images. However, applying super-resolution to DWI which typically has lower resolution, could be more impactful. To our knowledge, existing super-resolution studies in DWI are limited to brain imaging, and the few prostate studies focus exclusively on T2-weighted images, with no attempts made on prostate DWI. Our study underscores the significance of high-resolution prostate DWI in (1) identifying extra-prostatic tumor extension, (2) pinpointing precise tumor locations, and (3) generating accurate coronal DWI views for enhanced cancer localization.

2 |. METHODS

The foundational assumption of our model is that a full-resolution analog image can be represented as a continuous function, with low-resolution acquisitions being just discrete and noisy samples from this function. An INR network is used to approximate this continuous function. As previously mentioned, DWI are typically acquired at various b-values, including multiple acquisitions at high b-values. Each acquisition can be represented as a discrete 3-D volume:

Sk(b)RMx×My×Mzk=1K, (1)

where b is the diffusion-weighting value, Mx,My and Mz denote the number of voxels along the x,y and z directions, respectively, with K representing the number of independent acquisitions. In standard practice, these acquisitions, denoted Sk are averaged to produce a single mean image for radiological examination:

Smean(b)=1Kk=1KSk(b). (2)

The INR network models the full-resolution image as a function f:R4R This 4D input space comprises the continuous 3D voxel locations and the b-value, which serves as the fourth dimension. The network’s output is the signal intensity corresponding to a specific voxel location and b-value. The INR is structured as a deep neural network with fully-connected layers and sinusoidal activation functions, following the initialization guidelines from Sitzmann et al.40. Training set inputs are quadruplets mx,my,mz,b representing the normalized voxel location and b-value, scaled to range between −1 and 1 along each axis. The model’s output labels are the intensity values of the image observed at the input location. Thus, the INR is trained to minimize the error between the predicted diffusion values at input locations xi=mx,my,mz and the measured diffusion values Skxi,b:

=i,kSk(xi,b)INR(xi,b)2. (3)

Essentially, INR’s objective is to estimate the signal for any given input location, even for those not included in the training set, i.e “intermediate voxels” with higher resolution. Take, for instance, our practically discrete input space: while input samples like x1=(0,0,0,0) and x2=1Mx/2,0,0,0 are part of the training set, an intermediate sample along the first axis, such as x3=1Mx,0,0,0, is not. Once trained with x1,x2 and other location indices, the INR is capable of predicting diffusion values for the ‘intermediate voxel locations’, like x3. Importantly, it should be emphasized that the INR functions beyond the capabilities of a simple sinc interpolator. This distinction arises for three reasons: (1) INR integrates information across various b-values within shared layers (Eq. (3)), (2) the Perturbation Network applies non-linear motion correction across acquisitions and b-values (§2.1), and (3) the training process incorporates data-driven learning weights for each voxel’s acquisitions, addressing inter-acquisition variability (§2.2).

2.1 |. Input Perturbation Network for Motion Correction

The motion-induced intra- and inter-acquisition variabilities are often non-linear due to B0 inhomogeneities and bowel-enclosed gas. We represent the warping effect of millimetric motion as a perturbation to the INR input. We assume that the reason Sk1(x,b)Sk2(x,b) is the millimetric motion between acquisitions k1 and k2 and that Sk1(x,b)=Sk2(x+ϵ,b). This implies that when an acquisition’s signal value differs from the network output, the method considers whether the discrepancy is due to the acquisition originating not from the designated voxel location but from a nearby sub-voxel location, affected by small inter-acquisition motions. A 1-D example of the idea behind the proposed methodology is visualized in Figure 1. In this example, when the point spread function is applied on each voxel, very small motions that occur between acquisitions cause the sharp edge to be lost even when it is sampled at the Nyquist rate. In the proposed approach, the discrete input space is generalized as a continuous domain, and the input perturbation network is trained in an unsupervised fashion to estimate a warping function for each acquisition to minimize the reconstruction loss at the output.

FIGURE 1.

FIGURE 1

A 1D example of the zero-shot super-resolution methodology. The supervised block uses the low-resolution voxel intensities as labels, hence effectively there is no need for high-resolution ground truth labels.

We propose that this perturbation value can be learned jointly with the INR parameters, on a separate neural network. The error that the signal for such an acquisition creates at the output is backpropagated to assign this value to a better location near the center of the input voxel such that the total loss is minimized for all voxels and all acquisitions. This warping is done via the Perturbation Network (PN) which is modeled as another fully connected neural network that takes the acquisition number and the b-value as input, in addition to the voxel location x The network then estimates a perturbation value ϵ via a scaled <monospace>tanh</monospace> activation function such that the perturbation is constrained to vary within physically feasible motion limits by enforcing the following cost:

=i,kSk(xi,b)INR(xi+ϵmaxtanh(PN(xi,b,k)),b)2. (4)

The perturbation upper-bound ϵmax was set as half a voxel size along each direction, i.e. ϵmax=1Mx,1My,1Mz. Note that the acquisition number k and the b-value used as inputs to the Perturbation Network take discriminative roles, and no temporal order along acquisitions is assumed. This creates an opportunity to capture sharp edges within the image, even if the voxel size is too large to capture it, and this edge is captured in different acquisitions for the same voxel due to the motion. The flowchart of the proposed network and the role of the Perturbation Network is demonstrated in Figure 2.

FIGURE 2.

FIGURE 2

The flowchart of the implicit neural representation learning, augmented by the input perturbation network.

2.2 |. Weighted Learning for Suppressing Anomalously Low Signals

Besides the inter-acquisition bulk motion of the prostate as discussed in §2.1, the disparity in Sk1(x,b) and Sk2(x,b) for k1k2 may also stem from the molecular motion of water or the fluctuation in local magnetic field during application of the diffusion-sensitizing gradients. These factors lead to variations that are too large to be explained by thermal Rician noise42. This type of variation is commonly referred to as physiological noise. Physiological noise typically leads to an anomalous reduction in signal intensity, resulting in a right-skewed per-voxel signal intensity distribution. This aspect is crucial, as conventional signal averaging tends to yield ADC values significantly higher than the actual ADC values, which are key for evaluating tissue composition and diagnosing cancers.

Representing a diffusion-weighted image S, derived from multiple acquisitions Sk, as a single INR (as proposed in this study), effectively addresses such signal mismatches. The INR framework’s operation within continuous spaces for both domain and range grants it an awareness of proximal voxels. This characteristic allows the INR to potentially favor intensity values of the acquisitions that are more consistent with the neighboring voxels, while naïve-averaging merely calculates the mean of different signal intensities for each voxel.

The proposed methodology also presents an opportunity to assign weight values to each acquisition such that the motion-corrupted signals can be automatically suppressed during training. For this, we follow the recipe that was recently proposed by Gundogdu et. al.42. Accordingly, we alter the loss function in Equation (3) to incorporate learning weights wk as thus:

=i,kwkxi,bSkxi-INRxi+ϵmaxtanhPNxi,b,k2. (5)

Detailed derivation and further interpretation of the learning weights are provided in the Supplement.

2.3 |. Evaluation of Super-Resolution Performance and Accuracy

In the inference phase, to generate the super-resolution image, the INR network employs a finer input grid than the resolution of the voxels used in training. This approach yields an image with enhanced detail and improved resolution. However, to be able to assert that the new details introduced by the network correspond to actual anatomical features, it is crucial to establish reliable ground truth images for evaluation. In the case of super-resolution examples in digital photography, such datasets are readily available43. On the other hand, for MRI, especially DWI, acquiring high-resolution images to this degree is not feasible. In-vivo acquisitions with higher resolution would result in unacceptably low SNR. Furthermore, the long acquisition times would increase vulnerability to motion artifacts. Therefore, in addition to the in-vivo prostate scans, we employed a biological phantom to evaluate our model’s performance.

2.3.1 |. Evaluation on Phantom

This specifically designed phantom experiment aimed to test the unsupervised motion-correction efficacy of the Perturbation Network. In this experiment, we emulated both the presence and absence of inter-acquisition motion in MRI scans of a biological phantom. Kiwifruit was selected as the biological phantom, as various studies have demonstrated its effectiveness as a proxy for prostate MRI, DWI included44,45.

Diffusion-weighted images of kiwifruit were acquired under three distinct settings. Firstly, we acquired images with a resolution of 1.5mm×1.5mm×3.0mm in nine repetitions. To simulate rigid motion in the axial plane, we simulated transitive motion to each repetition by shifting the center of the field-of-view in-plane by −0.5, 0.0, or +0.5mm in the readout or phase-encoding directions. In some cases, the shifting of the center of the field-of-view is achieved in post-processing by the scanner, rather than by changing the sequence parameters, and thus the underlying k-space data may not have changed due to the shift. However, our algorithm operates on the produced magnitude images without relying on the complex k-space information and the main objective was to assess if the Perturbation Network corrects this motion with unsupervised training, thus this was deemed acceptable.

The second image was acquired as a ground truth with the same resolution and the same number of acquisitions as the first one, but with no motion. We recognize that the synthetically introduced motion is rigid and might not fully replicate the non-rigid warping often encountered in prostate diffusion imaging. However, we opted to apply motion during the MR scan rather than inducing artifacts in post-processing, to more accurately mirror real-world conditions.

Additionally, we acquired a third image with a high-resolution of 0.75mm × 0.75mm × 3mm, serving as a reference to evaluate the detail captured in the INR output and the first two images (with and without motion). All scans were conducted on a dStream Philips 3T Ingenia scanner with digital coil technology, a 70 cm bore, dual RF transmission, and maximum gradient strength of 45 mT/m, and a maximum slew rate of 200 T/m/s. DWI images were acquired at b = 1000s/mm2, with TR/TE = 4000/90 ms, 4 EPI segments, in-plane resolution of 1.5 × 1.5mm2 (acquired) and 1.4 × 1.4mm2 (reconstructed), and slice thickness of 3 mm. The phase encoding direction is along the main axis of the kiwi fruit, parallel to its striated structural pattern.

To see the correcting impact of the Perturbation Network, we trained one INR by setting setting ϵmax=0 and another by setting ϵmax to half a voxel size. We used the scans with the motion as input to the network and compared the output to the no-motion ground truth, as well as the very high-resolution ground truth.

2.3.2 |. Evaluations on In-Vivo Prostate Images

We evaluated the performance of the proposed method on diffusion-weighted prostate images of patients with biopsy-verified prostate cancers. This was conducted following the approval of the Institutional Review Board (IRB) (Approval number: IRB17–1694). We ran both qualitative and quantitative experiments. To evaluate the resolution improvement, we created a training set of low-resolution prostate images by applying low-pass filtering and sub-sampling to the original images. Another approach could have involved acquiring a separate set of low-resolution images with larger voxel sizes during a second scan of the patient. However, this would have introduced a different SNR, rendering the original images suboptimal as ground truth. No additional noise was introduced to the training set. The network was trained with diffusion-weighted images with 4 b-values: 0, 150, 1000 and 1500sec/mm2, so that it can yield a super-resolved and motion-corrected ADC map. Orthogonal gradients were in the readout, phase encoding, and slice directions. At high b-values multiple acquisitions at each orthogonal direction were used as independent acquisitions to train INR. The INR framework addresses the inter-acquisition motion directly via the Perturbation Network, and intra-acquisition motion indirectly by suppressing the resulting low signals via weighted learning.

The network was trained with the voxels of the subsampled low-resolution image. Then the trained network was used to generate the 2x super-resolution images. It is noteworthy that the network has the potential to produce outputs at multiple resolutions, not limited to the 2x factor. Two fellowship-trained and board-certified attending radiologists, with 12+ and 6+ years of experience in prostate MRI (NCO and GL respectively), evaluated the 2x super-resolution images, 2x bicubic-interpolated images (serving as an approximation to sinc interpolation), and ground truth (GT) images. They were tasked with ranking these three sets of images – super-resolution, bicubic-interpolated, and GT – based on their perceptual quality. This experiment involved 65 cases with high-b images, and the order of presentation for the three test images was randomized for each subject during evaluation. Hence, the radiologists were unaware of the identity of the specific image they were evaluating.

The same set of 65 cases was utilized for a quantitative analysis, comparing the super-resolution and bicubic-interpolated images against their respective ground truth counterparts. For this analysis, we used established quantitative image similarity metrics common in digital image analysis, such as the Structural Similarity Index Measure (SSIM), Multiscale SSIM (MS-SSIM), Feature Similarity Index Measure (FSIM), and Spectral Residual-based Similarity Index Measure (SR-SIM)46,47,48. Each metric was used to compare the 2x super-resolution and 2x bicubic-interpolated images with their corresponding high-resolution ground truth images. Statistical significance was evaluated using paired t-tests with N=65.

3 |. RESULTS

Model training was carried out utilizing the PyTorch software package on a workstation equipped with Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz, Tesla V100 GPU with 32GB memory.

3.1 |. Unsupervised Motion Correction with the Perturbation Network

Figure 3 displays representative images from the phantom experiment. Since MRI is a Fourier technique, the calculated shifts resulted in a linear phase change in k-space without contributing to increased resolution. The purpose of this experiment was to demonstrate that MRI scans can be modeled using an INR and that the Perturbation Network can effectively mitigate motion effects. The Perturbation Network successfully inferred the direction and magnitude of voxel movements (ϵ) for each acquisition and applied the necessary corrections.

FIGURE 3.

FIGURE 3

Proposed network on kiwifruit DWI. (a) image acquired without simulated motion, (b) image acquired with simulated motion, (c) model output with ϵmax is set to zero, i.e. Perturbation Network is not in effect, (d) model output with Perturbation Network turned on and (e) very high-resolution “ground truth” image used for visual validation. The vertical detail enclosed in the red circle is lost as a result of motion as seen in (b). The model output at (d) does not only address the effects of motion, but it also provides a better image than the “no-motion” image shown in (a), as can be verified from the very high-res image in (e).

In the kiwifruit experiment, the perturbation vector ϵ remained consistent across all voxels but varied for each acquisition. To assess the Perturbation Network’s ability to decipher completely random perturbations for all voxels, we created a motion-corrupted image by slightly displacing each voxel in random directions. This was achieved by slightly perturbing the location of each voxel in each acquisition towards a random direction. This in-silica experiment simulated a scenario significantly more severe than the actual motion effects typically observed in DWI scans. We conducted this test to determine whether the Perturbation Network could learn such nonlinear motion effects despite being trained in an unsupervised manner. Figure 4 provides a comparison between the image without motion corruption, the motion-corrupted average image, and the INR output.

FIGURE 4.

FIGURE 4

Comparison between the in-silica image with no motion corruption (left), the motion-corrupted average image (middle), and the output of the INR (right). The input to the INR model was the middle image.

3.2 |. Results on In-vivo Prostate Images

In the observer study, 96.1% of the super-resolution images were rated as having superior perceptual quality compared to the interpolated images. Moreover, 40.7% of the super-resolution images were voted to have better quality than the ground truth images, despite the ground truth images having double the resolution and quadruple the voxel count of the model’s input. This latter finding is particularly noteworthy as it indicates that the Perturbation Network significantly contributes to the improvement in image quality in these cases. The most likely reason for this outcome is that the model is trained by K=12 individual acquisitions despite seeing only a quarter of the number of voxels as input (compared to ground truth). It is worth noting that the interpolated and ground truth images were also generated using the mean of the same number of acquisitions, implying no expected SNR gain with the algorithm. When the Perturbation Network is not in effect, the main INR learns to average these acquisitions for the observed locations in the training set and attempts to predict the intermediate voxels based on the continuity of the INR function. However, considering that the model can produce outputs that visually outperform the high-resolution ground truth, we can argue that the Perturbation Network learns to reposition certain features away from the voxel center, thereby providing higher resolution information and accounting for sub-voxel motion. This effect is most prominent when dealing with sharp features that exhibit relatively high SNR.

In quantitative analyses, super-resolution INR outputs exhibited significantly higher similarity to the ground truth images compared to the bicubic-interpolated images (p<0.005) across all quantitative similarity metrics (SSIM, MS-SSIM, FSIM, and SR-SIM). Naturally, image areas are smooth regions and consist of low-resolution components that are similar in super-resolution, bicubic-interpolated and ground truth images. Consequently, these measurements yield very high similarity values in the aforementioned measurements. However, our primary focus is on determining whether the high-resolution details introduced by the super-resolution model are present in the ground truth images. To investigate this, we applied a high-pass filter to all super-resolution, ground truth, and bicubic images, effectively removing the low spatial frequency components, and focused solely on the high spatial frequency details. This analysis revealed that most of the gain obtained by super resolution is in high spatial resolution components, e.g., edges etc. Table 2 presents the similarity scores of the super-resolution and bicubic-interpolated mean images (baseline) relative to the high-resolution ground truth. Figure 5 showcases examples from this evaluation set. Additional details on the similarity scores and further image examples can be found in the Supplement.

TABLE 2.

Image similarity measures of super-resolution compared to bicubic interpolation.

All Bands High Frequency Bands
Metric bicubic vs GT SR vs GT Gain (%) p-value bicubic vs GT SR vs GT Gain (%) p-value
SSIM 88.5 89.3 0.9 <0.001 52.5 56.3 7.5 <0.001
MS-SSIM 88.6 90.1 1.8 <0.001 72.0 75.8 5.4 <0.001
SR-SIM 87.8 89.0 1.5 <0.005 87.6 89.1 1.8 <0.005
FSIM 90.2 91.9 1.9 <0.001 88.5 90.5 2.3 <0.001

SR: super-resolution, GT: ground truth

FIGURE 5.

FIGURE 5

Performance of the proposed method on two cases: (a) LR image, (b) 2x bi-cubic interpolated image, (c) the output of the model with 2x super-resolution, and (d) the HR ground truth image. The red arrows show anatomic details that INR enhances. Arrow 1 shows the enhancement in the border of the cancer along the margin of the prostate, arrow 2 shows the rectal wall, and arrow 3 shows that the boundaries of the urethra and an adjacent suspicious lesion can be seen better in the super-resolution image.

3.3 |. Super-resolution Performance Along Slice-select Direction

The proposed methodology treats the DWI as 3D volumetric objects, aiming to enhance super-resolution not only in the axial view but also along the slice select direction. Typically, the z-axis slices are taken thicker than the axial resolution to improve SNR gain. Consequently, the coronal or sagittal views of DWI, despite their potential significance, are often overlooked in clinical practice. Reconstructing INR with finer details along the slice-select direction enables the visualization of coronal or sagittal DWI views, derived from axial slices. Figure 6 demonstrates the super-resolution reconstructed coronal view of a sample diffusion-weighted image alongside the corresponding slice of the T2W coronal image.

FIGURE 6.

FIGURE 6

Super-resolution reconstructed coronal view of one sample image compared to the interpolated image. T2W coronal image that corresponds to the same slice is given on the right for reference. Arrow 1 shows the boundaries and extent of the cancer whereas Arrow 2 shows the prostatic urethra which was not visible in interpolation but visible in super-resolution image.

3.4 |. Contrast Enhancement via Weighted Training

To assess the contrast enhancement due to weighted training, we trained the network with the high-resolution GT image and applied the learning weights as detailed in the Supplement. ADC maps calculated using the model output are compared to the ADC maps of GT images. The analyses were done on a cohort of 12 patients with biopsy-verified cancers by measuring both the SNR on cancer and the cancer-to-healthy-prostate contrast ratio (CR). Statistical analysis was conducted using paired t-tests. The CR significantly improved on average from 1.9 to 2.4 (p < 0.01) when comparing the model output with the input image. Similarly, the SNR showed a significant average increase from 1.87 to 1.92 (p < 0.01), despite no noise suppression being applied.

4 |. DISCUSSION AND CONCLUSIONS

This study validates the feasibility of zero-shot super-resolution in diffusion-weighted prostate imaging and underscores the potential of using individual acquisitions for self-supervised motion mitigation to some extent. Although introducing synthetic shifts and obtaining super-resolution is a well-established implementation, we provided a novel unsupervised method to utilize the underlying non-rigid motion to obtain super-resolution. We showed this by comparing the model output with the motion-free ground truth, using the settings with the Perturbation Network turned on and off. The new detail introduced by the network was also compared with a very-high-resolution image, which was acquired in an idealistic setting obtained solely for this experiment and would not have been practically possible in actual patient scans.

As discussed earlier, pre-calculated linear shifts during imaging provide negligible gains in spatial resolution. However, the “shifting” observed between subsequent in-vivo acquisitions is non-rigid49. Moreover, local B0 inhomogeneities, arising from poor shimming, tissue boundaries, or proximity to bowel-enclosed gas, contribute additional non-linear alterations to the MRI signal. Thus, we argue that additional information may, in fact, be present in the repeated acquisitions of the same image. Much of the existing literature on super-resolution to DWI focuses on brain imaging, where motion is typically minimal compared to prostate imaging, thus not leading to significant non-linear changes in the MRI signal.

It’s important to note that phantom experiments (Figures 3 and 4 ) and in-vivo experiments (Figures 5 and 6 ) are designed to showcase two distinct features of the proposed model. All kiwifruit images in Figure 3 (ad) share the same resolution and voxel count. This figure effectively demonstrates the motion-correction capability of our model, especially the role of the Perturbation Network. The Perturbation Network aims not only to correct signal mismatches in different acquisitions of inter- and intra-b-value scans but also to recover details potentially lost in naive averaging. Conversely, Figures 5 and 6 showcase the super-resolution capability of the proposed method highlighting how the model, trained exclusively with LR image voxels, compares with the HR image. The model’s estimation of ‘intermediate voxels’ is noticeably superior to bi-cubic interpolation, a difference attributed to our method functioning globally, in contrast to interpolation’s local operation. The quantitative structural similarity results in Table 2 further substantiate this observation.

Naturally, this study has certain constraints and limitations that should be acknowledged. For example, we selected bi-cubic interpolation as a competitor for evaluation, whereas other studies employing synthetically generated low-resolution inputs could compare their approaches directly. However, it is important to note that the challenge we undertook in the context of ”real image super-resolution” does not allow for a uniform or universally accepted ground truth. If we had acquired images with smaller voxel sizes, it would have resulted in lower SNR, or longer acquisition times at lower voxel sizes would have introduced variations in motion exposure. As a result, direct comparisons or supervised training with other CNN-based super-resolution methods would not have been feasible. In essence, we intentionally adopted this self-supervised setting to address these challenges. Key advantages of the proposed model over the supervised deep learning-based methods are enumerated below:

  1. Input/Output: Supervised methods rely on processing low-resolution images as input, utilizing local kernel operations. At the output, these models expect the high-resolution versions of the images. In contrast, the INR method takes only the voxel locations (x,y,z) as input and emit the voxel intensity that corresponds to the input location.

  2. Resolution Factor: Supervised models are trained with predesigned low/high-resolution pairs and learn to increase the resolution by the factor dictated in the difference of high- and low-resolution training images. INR models the continuous space and can provide improvement for any factor, even non-integer ones.

  3. Training Set: Supervised models typically require a large training set of input/output pairs. Deep models that are trained with fewer images than required possess the risk of hallucination, i.e., carrying some information/detail seen on the training set to a test image. INR doesn’t need a training set of images from other patients. It merely uses the existing voxel intensities for each scan to infer a motion-corrected and higher-resolution image.

  4. Risk of Artifacts: Acquiring low/high-resolution training pairs for medical imaging is impractical, often leading to the synthetic generation of low-resolution samples. However, INR bypasses such complexities, as each image individually trains its own model, eliminating the risk of artifact generation due to incorrect modeling.

  5. Practicality: Supervised models, trained for specific imaging settings (TE, TR, b-value, etc.), or data from a certain vendor (Phillips, GE, Siemens etc.) or images from a certain modality (T2W, T1W, DWI, etc.), require fine-tuning for new settings, posing challenges for inter-institutional operation and clinical practice. Conversely, INR is free from such constraints; it can train and super-resolve a new scan within minutes without a pre-existing model for inference.

A particularly promising implication of this study is the potential for reduced scan times, which is especially beneficial for patients experiencing claustrophobia or those with Parkinson’s disease. By employing the proposed model for super-resolution, it becomes possible to obtain high-quality images from lower-resolution inputs, thus mitigating the need for a large number of averages needed for obtaining high-resolution images with adequate SNR. This has significant benefits in terms of patient comfort and convenience. In addition, shortened echo train durations will help reduce the spatial distortion often present in EPI-based images.

From the clinical perspective, reconstructed coronal DWI images of the prostate, as we proposed in this paper, could be helpful in more accurate localization of the tumor to the appropriate level of apex, midgland, or base; and distinguishing a lesion location in the peripheral zone of the prostate from normal periprostatic tissue such as the surgical capsule or the periprostatic plexus by lesion location confirmation with an additional view, or distinguishing lesion location in the central gland, where BPH nodules could mimic cancer, from the peripheral zone, where malignant lesions are more prevalent.

Supplementary Material

supplement

Section 1.1 Details of weighted learning and enhancement in restricted diffusion (complete derivation)

Section 1.2 Design of the Temperature Parameter

Section 1.3 Similarity Comparisons

Section 1.4 More in-vivo examples

Figure S1. Design of the τ parameter.

Figure S2. Frequency response of the high pass filter applied to suppress low resolution components and focus on high resolution components

Figure S3. Scatter plot that compares SR against bicubic interpolation in several structural similarity metrics. It can be seen that the superiority of SR is more 7 emphasized especially for the images with lower LR-HR similarity scores.

Figure S4. An example of through-plane super-resolution of DWI (coronal view)

Figure S5. An example of through-plane super-resolution of DWI (coronal view)

Figure S6. An example of through-plane super-resolution of DWI (coronal view)

Figure S7. An example of in-plane super-resolution of DWI (axial view)

Figure S8. An example of in-plane super-resolution of DWI (axial view)

Figure S9. An example of in-plane super-resolution of DWI (axial view)

Figure S10. An example of in-plane super-resolution of DWI (axial view)

Funding Information

Supported by the National Institutes of Health; (R01 CA227036, 1R41CA244056-01A1, R01 CA17280, and 1S10OD018448-01), Sanford J. Grossman Charitable Trust

Footnotes

Without loss of generality, this example assumes that Mx is an even number, which is in fact fair since FFT in DWI reconstruction typically calls for sizes with powers of 2.

REFERENCES

  • 1.Denis Le Bihan, Cyril Poupon, Alexis Amadon, Franck Lethimonnier. Artifacts and pitfalls in diffusion MRI. Journal of Magnetic Resonance Imaging: An Official Journal of the International Society for Magnetic Resonance in Medicine. 2006;24(3):478–488. [DOI] [PubMed] [Google Scholar]
  • 2.Joy Liau, Jimmy Lee, Schroeder Michael E, Sirlin Claude B Mark Bydder. Cardiac motion in diffusion-weighted MRI of the liver: artifact and a method of correction. Journal of Magnetic Resonance Imaging. 2012;35(2):318–327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Baxter Gabrielle C, Patterson Andrew J, Ramona Woitek, Iris Allajbeu, Graves Martin J, Fiona Gilbert. Improving the image quality of DWI in breast cancer: comparison of multi-shot DWI using multiplexed sensitivity encoding to conventional single-shot echo-planar imaging DWI. The British Journal of Radiology. 2021;94(1119):20200427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Francesco Giganti, Alex Kirkham, Veeru Kasivisvanathan, et al. Understanding PI-QUAL for prostate MRI quality: a practical primer for radiologists. Insights into Imaging. 2021;12(1):1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Francesco Giganti, Alexander Ng, Aqua Asif, et al. Global Variation in Magnetic Resonance Imaging Quality of the Prostate. Radiology. 2023;309(1):e231130. [DOI] [PubMed] [Google Scholar]
  • 6.Eoin Dinneen, Clare Allen, Tom Strange, et al. Negative mpMRI rules out extra-prostatic extension in prostate cancer before robot-assisted radical prostatectomy. Diagnostics. 022;12(5):1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Eleftherios Karanasios, Iztok Caglic, Zawaideh Jeries P, Tristan Barrett. Prostate MRI quality: clinical impact of the PI-QUAL score in prostate cancer diagnostic work-up. The British Journal of Radiology. 2022;95(1133):20211372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Olivier Windisch, Daniel Benamran, Charles Dariane, et al. Role of the Prostate Imaging Quality PI-QUAL Score for Prostate Magnetic Resonance Image Quality in Pathological Upstaging After Radical Prostatectomy: A Multicentre European Study. European Urology Open Science. 2023;47:94–101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Sebastiaan Reijnen Jeroen, Una Ryg, Marthinsen Jon B, Irina Schönhardt, Therese Seierstad, Hole Knut H Monoparametric high-resolution diffusion weighted MRI as a possible first step in an MRI-directed diagnostic pathway for men with suspicion of prostate cancer. Frontiers in Oncology. 2023;13:1102860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Sharon Peled, Yehezkel Yeshurun. Superresolution in MRI: application to human white matter fiber tract visualization by diffusion tensor imaging. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine. 2001;45(1):29–35. [DOI] [PubMed] [Google Scholar]
  • 11.Eyal Carmi, Siuyan Liu, Noga Alon, Amos Fiat, Daniel Fiat. Resolution enhancement in MRI. Magnetic resonance imaging. 2006;24(2):133–154. [DOI] [PubMed] [Google Scholar]
  • 12.Kennedy John A, Ora Israel, Alex Frenkel, Rachel Bar-Shalom, Haim Azhari. Improved image fusion in PET/CT using hybrid image reconstruction and super-resolution. International journal of biomedical imaging. 2007;2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mayer Gregory S, Vrscay Edward R. Measuring information gain for frequency-encoded super-resolution MRI. Magnetic resonance imaging. 2007;25(7):1058–1069. [DOI] [PubMed] [Google Scholar]
  • 14.Yanhong Lu, Ran Yang, Jingxin Zhang, Cishen Zhang. Super resolution image reconstruction in parallel magnetic resonance imaging. In: :761–766IEEE ICCA 2010; 2010. [Google Scholar]
  • 15.Benoit Scherrer, Ali Gholipour, Warfield Simon K. Super-resolution in diffusion-weighted imaging. In: :124–132Medical Image Computing and Computer-Assisted Intervention–MICCAI 2011: 14th International Conference, Toronto, Canada, September 18–22, 2011, Proceedings, Part II 14; 2011. [Google Scholar]
  • 16.Akshay Chaudhari, Zhongnan Fang, Jin Hyung Lee, Garry Gold, Brian Hargreaves. Deep learning super-resolution enables rapid simultaneous morphological and quantitative magnetic resonance imaging. In: :3–11International Workshop on Machine Learning for Medical Image Reconstruction; 2018. [Google Scholar]
  • 17.Mehmet Yamac, Baran Ataman, Aakif Nawaz. Kernelnet: A blind super-resolution kernel estimation network. 2021;:453–462.
  • 18.Tieng Quang M, Cowin Gary J, Reutens David C, Galloway Graham J, Vegh Viktor. MRI resolution enhancement: How useful are shifted images obtained by changing the demodulation frequency?. Magnetic resonance in medicine. 2011;65(3):664–672. [DOI] [PubMed] [Google Scholar]
  • 19.Eric Van Reeth, Tham Ivan WK, Heng Tan Cher, Loo Poh Chueh. Super-resolution in magnetic resonance imaging: a review. Concepts in Magnetic Resonance Part A. 2012;40(6):306–325. [Google Scholar]
  • 20.Benoit Scherrer, Ali Gholipour, Warfield Simon K. Super-resolution reconstruction to increase the spatial resolution of diffusion weighted images from orthogonal anisotropic acquisitions. Medical image analysis. 2012;16(7):1465–1476. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Xi Wu, Zhipeng Yang, Jinrong Hu, Jing Peng, Peiyu He, Jiliu Zhou. Diffusion-weighted images super-resolution using high-order SVD. Computational and Mathematical Methods in Medicine. 2016;2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Jakub Jurek, Marek Kociński, Andrzej Materka, et al. Reconstruction of high-resolution t2W mr images of the prostate using maximum a posteriori approach and Markov random field regularization. In: :96–992017 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA); 2017. [Google Scholar]
  • 23.Alexander Daniel C, Darko Zikic, Aurobrata Ghosh, et al. Image quality transfer and applications in diffusion MRI. NeuroImage. 2017;152:283–298. [DOI] [PubMed] [Google Scholar]
  • 24.Kun Zeng, Hong Zheng, Congbo Cai, Yu Yang, Kaihua Zhang, Zhong Chen. Simultaneous single-and multi-contrast super-resolution for brain MRI images based on a convolutional neural network. Computers in biology and medicine. 2018;99:133–141. [DOI] [PubMed] [Google Scholar]
  • 25.Yuhua Chen, Yibin Xie, Zhengwei Zhou, Feng Shi, Christodoulou Anthony G, Debiao Li. Brain MRI super resolution using 3D deep densely connected neural networks. In: :739–7422018 IEEE 15th international symposium on biomedical imaging (ISBI 2018); 2018. [Google Scholar]
  • 26.Rewa Sood, Mirabela Rusu. Anisotropic super resolution in prostate mri using super resolution generative adversarial networks. In: :1688–16912019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019); 2019. [Google Scholar]
  • 27.Kewen Liu, Yuan Ma, Hongxia Xiong, et al. Medical image super-resolution method based on dense blended attention network. arXiv preprint arXiv:1905.05084. 2019;. [Google Scholar]
  • 28.Yoonmi Hong, Geng Chen, Pew-Thian Yap, Dinggang Shen. Reconstructing high-quality diffusion MRI data from orthogonal slice-undersampled data using graph convolutional neural networks. In: :529–537International Conference on Medical Image Computing and Computer-Assisted Intervention; 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Dar Salman UH, Mahmut Yurt, Levent Karacan Aykut Erdem, Erkut Erdem, Tolga Cukur. Image synthesis in multi-contrast MRI with conditional generative adversarial networks. IEEE transactions on medical imaging. 2019;38(10):2375–2388. [DOI] [PubMed] [Google Scholar]
  • 30.Xiuxiu He, Yang Lei, Yabo Fu, et al. Super-resolution magnetic resonance imaging reconstruction using deep attention networks. 2020;11313:642–647. [Google Scholar]
  • 31.Tao Zhou, Huazhu Fu, Geng Chen, Jianbing Shen, Ling Shao. Hi-net: hybrid-fusion network for multi-modal MR image synthesis. IEEE transactions on medical imaging. 2020;39(9):2772–2781. [DOI] [PubMed] [Google Scholar]
  • 32.Seonyeong Park, Michael Gach H, Siyong Kim, Jin Lee Suk, Yuichi Motai. Autoencoder-inspired convolutional network-based super-resolution method in MRI. IEEE Journal of Translational Engineering in Health and Medicine. 2021;9:1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sood Rewa R, Wei Shao, Christian Kunder, et al. 3D Registration of pre-surgical prostate MRI and histopathology images via super-resolution volume reconstruction. Medical image analysis. 2021;69:101957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Soumick Chatterjee, Alessandro Sciarra, Max Dünnwald, et al. ShuffleUNet: Super resolution of diffusion-weighted MRIs using deep learning. In: :940–9442021 29th European Signal Processing Conference (EUSIPCO); 2021. [Google Scholar]
  • 35.Qing Wu, Yuwei Li, Lan Xu, et al. Irem: High-resolution magnetic resonance image reconstruction via implicit neural representation. In: :65–74International Conference on Medical Image Computing and Computer-Assisted Intervention; 2021. [Google Scholar]
  • 36.Jialiang Jiang, Fulang Qi, Huiyu Du, et al. Super-Resolution Reconstruction of 3T-Like Images From 0.35 T MRI Using a Hybrid Attention Residual Network. IEEE Access. 2022;10:32810–32821. [Google Scholar]
  • 37.Mahdiyar Molahasani Majdabadi, Younhee Choi, Deivalakshmi S, Seokbum Ko. Capsule GAN for prostate MRI super-resolution. Multimedia Tools and Applications. 2022;81(3):4119–4141. [Google Scholar]
  • 38.Dwarikanath Mahapatra. Improved Super Resolution of MR Images Using CNNs and Vision Transformers. arXiv preprint arXiv:2207.11748. 2022;. [Google Scholar]
  • 39.Haotian Guo, Lihui Wang, Yulong Gu, Jian Zhang, Yuemin Zhu. Semi-Supervised Super-Resolution of Diffusion-Weighted Images Based on Multiple References. NMR in Biomedicine. 2023;:e4919. [DOI] [PubMed] [Google Scholar]
  • 40.Vincent Sitzmann, Julien Martel, Alexander Bergman, David Lindell, Gordon Wetzstein. Implicit neural representations with periodic activation functions. Advances in Neural Information Processing Systems. 2020;33:7462–7473. [Google Scholar]
  • 41.Giaquinto Angela N, Miller Kimberly D, Tossas Katherine Y, Winn Robert A, Ahmedin Jemal, Siegel Rebecca L Cancer statistics for African American/Black People 2022. CA: A Cancer Journal for Clinicians. 2022;72(3):202–229. [DOI] [PubMed] [Google Scholar]
  • 42.Batuhan Gundogdu, Pittman Jay M, Aritrick Chatterjee, et al. Directional and inter-acquisition variability in diffusion-weighted imaging and editing for restricted diffusion. Magnetic Resonance in Medicine. 2022;. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chao Dong, Change Loy Chen, Xiaoou Tang. Accelerating the super-resolution convolutional neural network. In: :391–407European conference on computer vision; 2016. [Google Scholar]
  • 44.Mueller-Lisse Ullrich G, Sophie Murer, Mueller-Lisse Ulrike L, Marissa Kuhn, Juergen Scheidler, Michael Scherr. Everyman’s prostate phantom: kiwi-fruit substitute for human prostates at magnetic resonance imaging, diffusion-weighted imaging and magnetic resonance spectroscopy. European Radiology. 2017;27(8):3362–3371. [DOI] [PubMed] [Google Scholar]
  • 45.Tatsuya Hayashi, Shimpei Yano, Shinya Kojima, Toshimune Ito. Variability in contrast and apparent diffusion coefficient of kiwifruit used as prostate MRI phantom: 1-week validation. Radiological Physics and Technology. 2022;:1–6. [DOI] [PubMed] [Google Scholar]
  • 46.Zhou Wang, Bovik Alan C, Sheikh Hamid R, Simoncelli Eero P. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing. 2004;13(4):600–612. [DOI] [PubMed] [Google Scholar]
  • 47.Lin Zhang, Lei Zhang, Xuanqin Mou, David Zhang. FSIM: A feature similarity index for image quality assessment. IEEE transactions on Image Processing. 2011;20(8):2378–2386. [DOI] [PubMed] [Google Scholar]
  • 48.Lin Zhang, Hongyu Li. SR-SIM: A fast and high performance IQA index based on spectral residual. In: :1473–14762012 19th IEEE international conference on image processing; 2012. [Google Scholar]
  • 49.Meredith Sadinski, Milica Medved, Ibrahim Karademir, et al. Short-term reproducibility of apparent diffusion coefficient estimated from diffusion-weighted MRI of the prostate. Abdominal imaging. 2015;40:2523–2528. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplement

Section 1.1 Details of weighted learning and enhancement in restricted diffusion (complete derivation)

Section 1.2 Design of the Temperature Parameter

Section 1.3 Similarity Comparisons

Section 1.4 More in-vivo examples

Figure S1. Design of the τ parameter.

Figure S2. Frequency response of the high pass filter applied to suppress low resolution components and focus on high resolution components

Figure S3. Scatter plot that compares SR against bicubic interpolation in several structural similarity metrics. It can be seen that the superiority of SR is more 7 emphasized especially for the images with lower LR-HR similarity scores.

Figure S4. An example of through-plane super-resolution of DWI (coronal view)

Figure S5. An example of through-plane super-resolution of DWI (coronal view)

Figure S6. An example of through-plane super-resolution of DWI (coronal view)

Figure S7. An example of in-plane super-resolution of DWI (axial view)

Figure S8. An example of in-plane super-resolution of DWI (axial view)

Figure S9. An example of in-plane super-resolution of DWI (axial view)

Figure S10. An example of in-plane super-resolution of DWI (axial view)

RESOURCES