Skip to main content
RSNA Journals logoLink to RSNA Journals
. 2023 Oct 10;309(1):e231092. doi: 10.1148/radiol.231092

Repeatability of MRI Biomarkers in Nonalcoholic Fatty Liver Disease: The NIMBLE Consortium

Kathryn J Fowler 1,, Sudhakar K Venkatesh 1, Nancy Obuchowski 1, Michael S Middleton 1, Jun Chen 1, Kay Pepin 1, Jessica Magnuson 1, Kathy J Brown 1, Danielle Batakis 1, Walter C Henderson 1, Sudha S Shankar 1, Tania N Kamphaus 1, Alex Pasek 1, Roberto A Calle 1, Arun J Sanyal 1, Rohit Loomba 1, Richard Ehman 1, Anthony E Samir 1, Claude B Sirlin 1,#, Sarah P Sherlock 1,#
PMCID: PMC10625902  PMID: 37815451

Abstract

Background

There is a need for reliable noninvasive methods for diagnosing and monitoring nonalcoholic fatty liver disease (NAFLD). Thus, the multidisciplinary Non-invasive Biomarkers of Metabolic Liver disease (NIMBLE) consortium was formed to identify and advance the regulatory qualification of NAFLD imaging biomarkers.

Purpose

To determine the different-day same-scanner repeatability coefficient of liver MRI biomarkers in patients with NAFLD at risk for steatohepatitis.

Materials and Methods

NIMBLE 1.2 is a prospective, observational, single-center short-term cross-sectional study (October 2021 to June 2022) in adults with NAFLD across a spectrum of low, intermediate, and high likelihood of advanced fibrosis as determined according to the fibrosis based on four factors (FIB-4) index. Participants underwent up to seven MRI examinations across two visits less than or equal to 7 days apart. Standardized imaging protocols were implemented with six MRI scanners from three vendors at both 1.5 T and 3 T, with central analysis of the data performed by an independent reading center (University of California, San Diego). Trained analysts, who were blinded to clinical data, measured the MRI proton density fat fraction (PDFF), liver stiffness at MR elastography (MRE), and visceral adipose tissue (VAT) for each participant. Point estimates and CIs were calculated using χ2 distribution and statistical modeling for pooled repeatability measures.

Results

A total of 17 participants (mean age, 58 years ± 8.5 [SD]; 10 female) were included, of which seven (41.2%), six (35.3%), and four (23.5%) participants had a low, intermediate, or high likelihood of advanced fibrosis, respectively. The different-day same-scanner mean measurements were 13%–14% for PDFF, 6.6 L for VAT, and 3.15 kPa for two-dimensional MRE stiffness. The different-day same-scanner repeatability coefficients were 0.22 L (95% CI: 0.17, 0.29) for VAT, 0.75 kPa (95% CI: 0.6, 0.99) for MRE stiffness, 1.19% (95% CI: 0.96, 1.61) for MRI PDFF using magnitude reconstruction, 1.56% (95% CI: 1.26, 2.07) for MRI PDFF using complex reconstruction, and 19.7% (95% CI: 15.8, 26.2) for three-dimensional MRE shear modulus.

Conclusion

This preliminary study suggests that thresholds of 1.2%–1.6%, 0.22 L, and 0.75 kPa for MRI PDFF, VAT, and MRE, respectively, should be used to discern measurement error from real change in patients with NAFLD.

ClinicalTrials.gov registration no. NCT05081427

© RSNA, 2023

Supplemental material is available for this article.

See also the editorial by Kozaka and Matsui in this issue.


graphic file with name radiol.231092.VA.jpg


Summary

NIMBLE 1.2 was a prospective study that evaluated repeatability of MRI biomarkers of liver fat, fibrosis, and body composition in participants with nonalcoholic fatty liver disease to inform their longitudinal use.

Key Results

  • ■ This prospective study included 17 participants at risk for nonalcoholic fatty liver disease who underwent multiple MRI examinations to determine the different-day same-scanner repeatability coefficients of potential imaging biomarkers for nonalcoholic steatohepatitis.

  • ■ The repeatability coefficients were 0.22 L (95% CI: 0.17, 0.29) for visceral adipose tissue; 0.75 kPa (95% CI: 0.6, 0.99) and 19.7% (95% CI: 15.8, 26.2) for two- and three-dimensional MR elastography measures, respectively; and 1.19% (95% CI: 0.96, 1.61) and 1.56% (95% CI: 1.26, 2.07) for MRI proton density fat fraction using magnitude or complex reconstruction, respectively.

Introduction

Approximately 25% of the general population in many parts of the world have nonalcoholic fatty liver disease (NAFLD) (1) and 1.5%–6.5% of the population are affected by the more advanced form of the disease, nonalcoholic steatohepatitis (NASH) (2). Patients with NASH are at risk for liver-related and cardiovascular morbidity (3). The current reference standard for diagnosis of NAFLD/NASH is liver biopsy with histologic assessment of hepatic fat, inflammation, ballooning, and fibrosis. However, longitudinal monitoring for disease using liver biopsy is not feasible due to the associated risk of complication and cost (4,5).

To overcome these limitations and to facilitate therapeutic trials, there is strong interest in noninvasive methods for identifying patients at risk for NASH. The Foundation for the National Institutes of Health, Non-invasive Biomarkers of Metabolic Liver Disease (NIMBLE) consortium is a multistakeholder public-private partnership that aims to standardize, compare, and validate circulating and imaging biomarkers for the noninvasive diagnosis and monitoring of NAFLD/NASH (6).

NIMBLE 1.2 was designed to assess the potential of using liver fat, liver stiffness, and body composition as imaging biomarkers for NASH. Modalities for measuring these biomarkers include MRI proton density fat fraction (PDFF) estimation, which occurs in two varieties (magnitude and complex reconstruction, where the former is a method that can be employed on any scanner and the latter requires commercial software and is not always available on all clinical scanners) and has been validated against pathology and spectroscopy as a highly accurate marker of liver fat (79). Additionally, MR elastography (MRE) can be used to estimate liver stiffness and has shown promise for diagnosing significant fibrosis (METAVIR score ≥F2) when compared with histopathology and other noninvasive methods (1014). Due to the interplay between obesity, insulin resistance, and NAFLD in patients with metabolic syndrome (15), there has also been increasing interest in measuring body composition metrics, such as visceral adipose tissue (VAT).

While these noninvasive MRI markers show promise, further validation of their repeatability and reproducibility in patients with NAFLD is needed to qualify them as biomarkers for clinical trial use.

The primary objective of NIMBLE 1.2 was to evaluate the pooled different-day repeatability coefficient of MRI PDFF, MRE stiffness, and VAT at 1.5 T and 3 T. The study was designed around a central hypothesis that the upper 95% confidence bound of the pooled different-day repeatability coefficient of the imaging biomarkers would be less than 25%, on the basis of pre-existing knowledge and literature (16). A secondary aim of this study was to evaluate additional repeatability and reproducibility indexes for these imaging biomarkers and determine factors that may impact variability in measurements.

Materials and Methods

This Health Insurance Portability and Accountability Act–compliant study was conducted following institutional review board approval of the protocol, and written informed consent was obtained from all participants.

Study Design and Participants

NIMBLE 1.2 is a single-center, prospective cross-sectional short-term study of adult participants who received a suspected or confirmed diagnosis of NAFLD at Mayo Clinic in Rochester, Minnesota from October 2021 to June 2022 (ClinicalTrials.gov registration no. NCT05081427). The study followed a protocol approved by the NIMBLE project team and in concordance with the Quantitative Imaging Biomarker Alliance (17).

Inclusion criteria were as follows: (a) age greater than or equal to 18 years; (b) probable NAFLD and NASH based on prior clinically performed biopsy within 36 months, or clinical or laboratory data within 3 months, consistent with NAFLD (alanine transaminase levels >30 U/L for male or >19 U/L for female participants, without other causes such as viral infection or alcohol); (c) able to have fibrosis status classified using the fibrosis based on four factors (FIB-4) index (18); (d) able and willing to participate, including maintaining diet, physical activity, alcohol consumption, and medication use unchanged from their usual regimen during the study period; and (e) meeting Adult Treatment Panel III criteria for metabolic syndrome (19). Exclusion criteria were as follows: (a) liver disease other than NAFLD; (b) excess alcohol consumption (>2 units per day for female or >3 units per day for male); (c) current diagnosis of drug-induced liver injury; (d) receiving a drug or placebo in a treatment trial during or within 30 days of the study period; (e) weight loss or gain of greater than 5 kg in the prior 3 months; and (f) other factors that might preclude study completion, including pregnancy, presence of implants such as pacemakers or defibrillators, or any other contraindication to MRI or vibration-controlled transient elastography (Fig 1).

Figure 1:

Flowchart shows study enrollment. ATP = adult treatment panel, BMI = body mass index, NAFLD = nonalcoholic fatty liver disease, NASH = nonalcoholic steatohepatitis.

Flowchart shows study enrollment. ATP = adult treatment panel, BMI = body mass index, NAFLD = nonalcoholic fatty liver disease, NASH = nonalcoholic steatohepatitis.

The FIB-4 index, which considers patient age (years), aspartate aminotransferase levels (units per liter), alanine transaminase levels (units per liter), and platelet count (109 cells per liter), was used to ensure recruitment of participants across the spectrum of disease severity (17). Participants were categorized as follows: low likelihood of advanced fibrosis (FIB-4 index ≤1.3), intermediate likelihood of advanced fibrosis (FIB-4 index >1.3 to <2.67), or high likelihood of advanced fibrosis (FIB-4 index ≥2.67). In addition to imaging, enrolled participants completed questionnaires, anthropometric measurements, and blood collection. The biomarkers assessed were MRI PDFF, MRE stiffness, and whole-body VAT.

Imaging Procedures

MRI was performed with six different MRI scanners (HDxt 1.5 T and HDxt 3 T, GE HealthCare; Ingenia 1.5 T and Ingenia 3 T, Philips; and MAGNETOM Aera 1.5 T and Skyra 3 T, Siemens Healthineers), and vibration-controlled transient elastography was preformed using FibroScan (Echosens). Examinations were carried out as shown in Figure 2. Four MRI scans were acquired for each participant on day one, and two to three MRI scans were acquired for each participant on day two and performed around the same time of day within a 7-day interval. The acquired data were transmitted via AG Mednet Portal (AG Mednet) for central analysis, which was performed at the central reading center (University of California, San Diego) and by AMRA Medical. MRI protocol details are provided in Table S1. Participants were assigned to vendor and field strength according to a block randomization scheme. The site qualification process is described in Appendix S1.

Figure 2:

Schematic shows study procedures, including randomization to different blocks to ensure spread of participant data across all vendors and field strengths relatively equally. Each participant also underwent vibration-controlled transient elastography (VCTE) as part of the study. Imaging examinations were performed at approximately the same time each day and within 7 days of each other. B0 = field strength.

Schematic shows study procedures, including randomization to different blocks to ensure spread of participant data across all vendors and field strengths relatively equally. Each participant also underwent vibration-controlled transient elastography (VCTE) as part of the study. Imaging examinations were performed at approximately the same time each day and within 7 days of each other. B0 = field strength.

Biomarker Analysis

MRI PDFF and MRE data were evaluated using Horos (Purview) Digital Imaging and Communications in Medicine image viewer software and MRE Quant (Resoundant) software. The three-dimensional MRE data underwent multimodal direct inversion reconstruction by Resoundant prior to transmission to the central reading center for analysis (20). All trained image analysts (D.B., W.H.), with at least 1 year of experience, were blinded to participant clinical information, including the FIB-4 index. Analysis was performed in accordance with the NIMBLE 1.2 Imaging Review Charter by placing nine regions of interest in each liver segment for PDFF and a large region of interest on four axial sections while avoiding the outer 1 cm of liver and including regions of parallel wave propagation within the 95% CI maps for MRE, and according to good clinical practice. After each image set was analyzed (including mean liver stiffness measured in kilopascals and mean liver PDFF percentage), a second analyst performed a quality inspection over-read to confirm the results. VAT (liters) and other body composition metrics were analyzed by AMRA Medical (21), with anonymization of other biomarker and participant data.

Statistical Analysis

The statistical analysis plan was developed by the study statistician (N.O.) and approved by the NIMBLE project team prior to study initiation. The study was designed around a central hypothesis that the upper 95% confidence bound of the pooled different-day repeatability coefficient of the imaging biomarkers would be less than 25%, on the basis of pre-existing knowledge and literature (16). For each performance estimate, 95% upper bounds were calculated based on a χ2 distribution and, to account for clustered data at the participant level when appropriate, by both a generalized linear model with generalized estimating equations and bootstrapping. A Monte Carlo simulation study was conducted to determine the sample size for paired test and retest observations to achieve an 80% power with 5% type 1 error rate assuming normally distributed data and, conservatively, assuming interscanner variability in the different-day repeatability coefficients of 14%, 18%, and 22% across scanners. An adaptive enrollment scheme was used in which an interim analysis was conducted after the enrollment of 12 participants, the results of which would be used to determine if more participants needed to be included in the final analysis (Fig 3).

Figure 3:

Schematic shows adaptive enrollment for determining the stopping rule for the interim analysis after enrollment of 12 participants, the results of which are used to determine if more participants need to be included in the final analysis. If the a priori upper bound 95% confidence threshold is met for each biomarker, no additional participants would be recruited. If a single biomarker or multiple biomarkers failed to meet the threshold, additional participants would be enrolled with another interim analysis planned upon enrollment of 21 and 24 participants. CSE = chemical shift encoding, MRE = MR elastography, PDFF = proton density fat fraction, RCdiff-day = different-day repeatability coefficient, 3D = three-dimensional, 2D = two-dimensional, VAT = visceral adipose tissue.

Schematic shows adaptive enrollment for determining the stopping rule for the interim analysis after enrollment of 12 participants, the results of which are used to determine if more participants need to be included in the final analysis. If the a priori upper bound 95% confidence threshold is met for each biomarker, no additional participants would be recruited. If a single biomarker or multiple biomarkers failed to meet the threshold, additional participants would be enrolled with another interim analysis planned upon enrollment of 21 and 24 participants. CSE = chemical shift encoding, MRE = MR elastography, PDFF = proton density fat fraction, RCdiff-day = different-day repeatability coefficient, 3D = three-dimensional, 2D = two-dimensional, VAT = visceral adipose tissue.

The different-day repeatability coefficient for the imaging biomarkers of interest was calculated by pooling data for each field strength across all scanners, details of which can be found in Appendix S1. The stopping point for the interim analysis was an upper bound for different-day repeatability coefficient values less than or equal to 24.1% for each biomarker. For the final analysis, if the upper 95% confidence bound was less than 24.1%, then it was concluded that the biomarker met the prespecified precision criterion of greater than 25%. Similar methods were used for the additional repeatability and reproducibility metrics (22). Linear regression of within-participant SD was also performed for each biomarker and is reported in Appendix S1 and Figures S1 and S2. Finally, a multivariable linear regression model was constructed to assess the effects of FIB-4 index, sex, body mass index (calculated as weight in kilograms divided by height in meters squared), and waist circumference on precision metrics, with results reported in Appendix S1 and Figure S3. Generalized estimating equations were used to handle the clustered data.

P < .05 was considered indicative of a statistically significant difference. All descriptive and inferential statistics were performed using SAS version 9.4 (SAS Institute) by the statistician, who was not part of the data acquisition or curation process. Descriptive statistics were expressed as counts and frequencies or means and SDs.

Results

Participant Characteristics

A total of 17 participants (mean age, 58 years ± 8.5 [SD]; 10 female, seven male) were included. Participants had a mean body mass index of 32.7 (range, 27.5–43.6) and a mean FIB-4 index of 3 (range, 0.49–7.0) (Table 1). All participants completed all study procedures. Although the interim analysis (which included only the first 12 participants enrolled) met the stopping rule based on the primary end point analysis, an additional five participants were enrolled while awaiting interim analysis results. Thus, data presented herein include all 17 participants. During the study period, two participants were identified as having overt cirrhosis, defined as cross-sectional imaging evidence of cirrhosis and portal hypertension (23). To evaluate the potential impact of overt cirrhosis on repeatability, a separate analysis was performed from which these participants were excluded. There were no reported changes in medication, alcohol use, or exercise routines between the two visits for any of the study participants.

Table 1:

Characteristics of All Study Participants and Stratified according to Likelihood of Advanced Fibrosis

graphic file with name radiol.231092.tbl1.jpg

Primary End Point Analysis

The relative precision across all biomarkers is shown in Figure 4, and a sample placement of regions of interest is shown in Figure 5 for PDFF and MRE analyses. Figure 6 shows an example case of a high FIB-4 index at MRE. The results of the analysis without the two participants demonstrating overt cirrhosis were similar, with repeatability coefficients of 0.22 L, 0.65 kPa, and 1.3%–1.5% for VAT, MRE, and PDFF, respectively (Table S2). The repeatability coefficient percentages for hypothesis testing are provided in Table 2. Point estimates for pooled different-day repeatability coefficients for the imaging biomarkers were as follows: 1.19% (95% CI: 0.96, 1.61) for MRI PDFF using magnitude reconstruction, 1.56% (95% CI: 1.26, 2.07) for MRI PDFF using complex reconstruction, 0.75 kPa (95% CI: 0.60, 0.99) for liver stiffness measured with two-dimensional MRE, and 0.22 L (95% CI: 0.17, 0.29) for VAT (Table 2). For three-dimensional MRE, the repeatability coefficient percentage was 19.7% (95% CI: 15.8, 26.2).

Figure 4:

Point estimate graph of the upper 95% confidence bounds for different-day repeatability coefficient percentages (%RC) for all imaging biomarkers shows the relative spread between biomarkers. MRE = MR elastography, PDFF = proton density fat fraction, 3D = three-dimensional, 2D = two-dimensional, VAT = visceral adipose tissue.

Point estimate graph of the upper 95% confidence bounds for different-day repeatability coefficient percentages (%RC) for all imaging biomarkers shows the relative spread between biomarkers. MRE = MR elastography, PDFF = proton density fat fraction, 3D = three-dimensional, 2D = two-dimensional, VAT = visceral adipose tissue.

Figure 5:

MRI in a 65-year-old male participant with low likelihood of advanced fibrosis according to the fibrosis based on four factors (FIB-4) index. (A) Axial MRI proton density fat fraction map shows regions of interest (green circles) drawn in each Couinaud segment while avoiding any vessels or lesions, as per the study protocol. (B) Coronal image shows body composition with autosegmented visceral (pink) and subcutaneous (blue) adipose tissue. (C) Left to right: Axial elastogram, magnitude, and wave images obtained at two-dimensional MR elastography (MRE) show placement of the region of interest (dotted outline) in the region of parallel wave propagation, avoiding the outer edge of the liver. (D) Left to right: Axial elastogram, magnitude, and wave images obtained at three-dimensional MRE similarly show the region of interest (dotted outline) in a region of parallel wave propagation, avoiding the outer margin of the liver. Color bars alongside elastograms indicate a range of kilopascals from low (purple) to high (red).

MRI in a 65-year-old male participant with low likelihood of advanced fibrosis according to the fibrosis based on four factors (FIB-4) index. (A) Axial MRI proton density fat fraction map shows regions of interest (green circles) drawn in each Couinaud segment while avoiding any vessels or lesions, as per the study protocol. (B) Coronal image shows body composition with autosegmented visceral (pink) and subcutaneous (blue) adipose tissue. (C) Left to right: Axial elastogram, magnitude, and wave images obtained at two-dimensional MR elastography (MRE) show placement of the region of interest (dotted outline) in the region of parallel wave propagation, avoiding the outer edge of the liver. (D) Left to right: Axial elastogram, magnitude, and wave images obtained at three-dimensional MRE similarly show the region of interest (dotted outline) in a region of parallel wave propagation, avoiding the outer margin of the liver. Color bars alongside elastograms indicate a range of kilopascals from low (purple) to high (red).

Figure 6:

Axial MRI in a 57-year-old female participant with a high likelihood of advanced fibrosis according to the fibrosis based on four factors (FIB-4) index. (A) Left to right: Elastogram, magnitude, and wave images obtained at three-dimensional MR elastography (MRE) show the region of interest (dotted outline) in a region of parallel wave propagation, avoiding the outer margin of the liver. (B) Left to right: Elastogram, magnitude, and wave images obtained at two-dimensional MRE show placement of the region of interest (dotted outline) in the region of parallel wave propagation, avoiding the outer edge of the liver. (C) MRI proton density fat fraction map shows placement of regions of interest (green circles) within different segments of the liver. Color bars alongside elastograms indicate a range of kilopascals from low (purple) to high (red).

Axial MRI in a 57-year-old female participant with a high likelihood of advanced fibrosis according to the fibrosis based on four factors (FIB-4) index. (A) Left to right: Elastogram, magnitude, and wave images obtained at three-dimensional MR elastography (MRE) show the region of interest (dotted outline) in a region of parallel wave propagation, avoiding the outer margin of the liver. (B) Left to right: Elastogram, magnitude, and wave images obtained at two-dimensional MRE show placement of the region of interest (dotted outline) in the region of parallel wave propagation, avoiding the outer edge of the liver. (C) MRI proton density fat fraction map shows placement of regions of interest (green circles) within different segments of the liver. Color bars alongside elastograms indicate a range of kilopascals from low (purple) to high (red).

Table 2:

Primary and Secondary End Point Analyses for Imaging Biomarkers in All Participants

graphic file with name radiol.231092.tbl2.jpg

Discussion

There is need for reliable noninvasive methods for diagnosing and monitoring nonalcoholic fatty liver disease (NAFLD). Thus, the multidisciplinary Non-invasive Biomarkers of Metabolic Liver Disease (NIMBLE) consortium was formed to identify and advance the regulatory qualification of NAFLD imaging biomarkers. The purpose of NIMBLE 1.2 was to determine the different-day same-scanner repeatability coefficient of liver MRI biomarkers in participants with NAFLD at risk for steatohepatitis. The point estimates for repeatability coefficients were 1.19% (95% CI: 0.96, 1.61) for MRI proton density fat fraction (PDFF) using magnitude reconstruction, 1.56% (95% CI: 1.26, 2.07) for MRI PDFF using complex reconstruction, 0.75 kPa (95% CI: 0.60, 0.99) for two-dimensional MR elastography (MRE) stiffness, and 0.22 L (95% CI: 0.17, 0.29) for visceral adipose tissue. The repeatability coefficient percentage for three-dimensional MRE was 19.7% (95% CI: 15.8, 26.2).

The present study extends the existing knowledge of repeatability and reproducibility for MRI biomarkers. Prior studies have evaluated the test-retest repeatability of MRI PDFF across vendors and field strengths in phantoms (24), using different technique parameters (25), and across different times of day to capture diurnal variations (26). Our results agree with prior reports that relative measurement differences up to 10% may occur due to measurement variance, with repeatability coefficients of 1.2%–1.6% in our study. Similarly, prior studies evaluating the test-retest repeatability of liver stiffness measurements have shown that up to 37% of true changes may be masked by measurement variance in patients with liver disease (27,28), with a smaller study in healthy volunteers and patients with hepatitis C demonstrating within-subject coefficients of variation ranging 6%–11% (29). Of note, the MRE studies referenced were performed in patients with other etiologies of liver disease, primarily hepatitis C, and not NAFLD/NASH. Alternatively, our study provides rigorous data on the reproducibility of quantitative MRI noninvasive tests in the relevant population of participants with NAFLD/NASH across different days, scanners, and field strengths. While PDFF has already been proposed as an end point for NASH clinical trials that assess changes in liver fat as a primary measure of drug efficacy (30), our results suggest that changes of 1.2%–1.6% are the thresholds that may be considered for estimating true change and power sample size for future studies.

Few studies have focused on the reproducibility or repeatability of body composition metrics. A study of 18 healthy volunteers assessed using a single MRI scanner reported a repeatability coefficient for VAT of 0.13 L (31). In another prospective cross-sectional study of 20 participants who were assessed using automated MRI from a single vendor, VAT measurements showed intra- and interexamination coefficients of variation of 3.3% and 3.6%, respectively (32). We found that VAT measurements are reproducible across vendors, field strengths, and temporal assessments, with a repeatability coefficient of 0.22 L, which will further support large-scale clinical trial use (33).

In addition, we evaluated the potential impact of participant-level factors of sex, body mass index, and severity of liver fibrosis (estimated according to the FIB-4 index) on measurement variation and found associations of BMI with repeatability coefficients for VAT, PDFF, and MRE. The impact of obesity on diagnostic accuracy of imaging biomarkers in NAFLD/NASH has been studied previously, with comparative studies showing that MRI may be less impacted by obesity than other imaging modalities, such as vibration-controlled transient elastography (34). Another study showed that as body mass index increased, discordance in fibrosis stage increased between MRE and vibration-controlled transient elastography (35). While it appears that body mass index may impact the performance of some noninvasive tests, our study was not powered to examine this relationship or the underlying biologic mechanisms.

Our study had several limitations. First, we recruited participants with NAFLD who were at risk for NASH but had not been diagnosed with NASH. These participants have relatively low disease severity and so our results should be applied with caution in different patient populations, such as those with known advanced disease. Second, while we assessed potential predictors of variability, our study was not powered for this analysis. Finally, our sample was not diverse with regards to race and ethnicity and may be prone to selection bias. Future studies that include more individuals are needed to examine potentially modifiable predictors of variability, such as the alteration of specific sequence parameters, different operators, diurnal variation, or fasting state. Finally, other MRI noninvasive tests that have shown potential value in assessing NAFLD/NASH, such as R2*, additional three-dimensional MRE markers, and additional body composition metrics, were not analyzed in the present study. These data have been collected and will be reported in a future study.

In conclusion, our study assessing potential MRI biomarkers in participants with nonalcoholic fatty liver disease/nonalcoholic steatohepatitis showed that a difference of at least 0.22 L for visceral adipose tissue volume, 1.2%–1.6% for MRI proton density fat fraction, and 0.75 kPa for MR elastography may constitute real change not attributable to measurement error. Future studies to establish clinically relevant cutoff values for clinical trial enrichment and to further validate the biomarkers in combination with circulating markers are needed to advance the field, and are planned as next stages in the NIMBLE project plan.

Acknowledgments

Acknowledgments

We acknowledge key collaborators who provide in-kind support to NIMBLE, including AMRA Medical, Canon Medical Systems, Echosens, GENFIT, GE HealthCare, Nordic Bioscience, OWL Metabolomics, Philips Ultrasound, P-Value, Hologic SuperSonic Imagine, Siemens Healthineers, and Siemens Medical Solutions.

*

K.J.F. and S.K.V. contributed equally to this work.

**

C.B.S. and S.P.S. are co-senior authors.

Supported in part by the Altman Clinical and Translational Research Institute at the University of California, San Diego, which received funding from the National Center for Advancing Translational Sciences–National Institutes of Health (UL1TR001442); Global Liver Institute; and U.S. Federal Drug Administration. The NIMBLE project is sponsored by the Foundation for the National Institutes of Health and is a public-private partnership supported by multiple entities, including AbbVie, AstraZeneca, Boehringer Ingelheim, Bristol-Myers Squibb, Echosens, GE HealthCare, Genentech, Gilead Sciences, Intercept Pharmaceuticals, Novo Nordisk, Pfizer, and Takeda Development Center Americas.

Data sharing: Data generated or analyzed during the study are available from the corresponding author by request.

Disclosures of conflicts of interest: K.J.F. Grants or contracts from Bayer, GE HealthCare, Median; consulting fees from Bayer, Ascelia Pharma, GE HealthCare; lecture payment from CME science; payment for expert testimony; travel and/or meeting support from Bayer; data safety monitoring board for Ascelia Pharma; chair, American College of Radiology Appropriateness panel; director, Society of Abdominal Radiology education portfolio; senior deputy editor for Radiology. S.K.V. Grants from NIH (R01 EB001981), U.S. Department of Defense (W81XWH-19-1-0583-01); textbook royalties from Springer; patent for nonalcoholic fatty liver disease activity score system using MRE (licensee, Mayo Foundation for Medical Education and Research). N.O. Institutional contracts for statistical support with the NIMBLE and Quantitative Imaging Biomarker Alliance (QIBA) studies. M.S.M. No relevant relationships. J.C. Salaries from Mayo Clinic, Resoundant; royalties from Resoundant; patents planned, issued, or pending via Mayo Clinic; stockholder, Resoundant; financial interest related to the intellectual properties of MRE. K.P. Employee, Resoundant. J.M. No relevant relationships. K.J.B. No relevant relationships. D.B. No relevant relationships. W.C.H. No relevant relationships. S.S.S. No relevant relationships. T.N.K. No relevant relationships. A.P. No relevant relationships. R.A.C. Member of the American Diabetes Association Community Leaders Board for New England; co-chair, Metabolic Disorders Steering Committee of the Foundation of the NIH Biomarkers Consortium; stockholder, Regeneron Pharmaceuticals, Pfizer. A.J.S. Grants or contracts from Intercept, Pfizer, Merck, Bristol-Myers Squibb, Eli Lilly, Novo Nordisk, Boehringer Ingelheim, AstraZeneca, Novartis; institutional grant from Madrigal Research and institutional collaborative agreement with Avant Sante; royalties from Elsevier, UpToDate; consulting fees from Path AI, Histoindex, Fibronest, BioCellvia, Merck, Pfizer, Eli Lily, Novo Nordisk, Boehringer Ingelheim, AstraZeneca, Akero, Intercept, Madrigal, Northsea, Takeda, Regeneron, Genetich, Alnylam, Roche, Glaxo Smith Kline, Novartis, Tern, Fractyl, Invventiva, Gilead, Target Pharmasolutions. R.L. Grants or contracts from Arrowhead Pharmaceuticals, AstraZeneca, Boehringer-Ingelheim, Bristol-Myers Squibb, Eli Lilly, Galectin Therapeutics, Galmed Pharmaceuticals, Gilead, Hanmi, Intercept, Inventiva, Ionis, Janssen, Madrigal Pharmaceuticals, Merck, NGM Biopharmaceuticals, Novo Nordisk, Pfizer, Sonic Incytes, Terns Pharmaceuticals; consulting fees from Aardvark Therapeutics, Altimmune, Anylam/Regeneron, Amgen, Arrowhead Pharmaceuticals, AstraZeneca, Bristol-Myers Squibb, CohBar, Eli Lilly, Galmed, Gilead, Glympse bio, Hightide, Inipharma, Intercept, Inventiva, Ionis, Janssen, Madrigal, Metacrine, NGM Biopharmaceuticals, Novartis, Novo Nordisk, Merck, Pfizer, Sagimet, Theratechnologies, 89 bio, Terns Pharmaceuticals, Viking Therapeutics. R.E. Institutional grants from NIH as principal investigator (R37 EB001981) and other NIH grants as co-investigator; sponsored research contract with Resoundant; shared royalties from Mayo Clinic; patents planned, issued, or pending via Mayo Clinic; Mayo Clinic–assigned role as CEO of Resoundant, a Mayo Clinic owned company; stockholder, Resoundant; financial interest related to the intellectual properties of MRE. A.E.S. Grant and consultant payment from GE HealthCare; patent pending for US beamforming and shear wave elastography. C.B.S. Institutional research grants from ACR, Bayer, FNIH, GE HealthCare, Gilead, Pfizer, Philips, Siemens Healthineers; lab service agreements with Enanta, Gilead, ICON, Intercept, Nusirt, Shire, Synageva, Takeda; royalty payments to institution from Medscape, Wolters Kluwer; personal consulting payments from Altimmune, Ascelia Pharma, Blade, Boehringer, Epigenomics, Guerbet; institutional consulting representative for AMRA, BMS, Exact Sciences, IBM-Watson, Pfizer; lecture payments from Japanese Society of Radiology, Stanford, MD Anderson; meeting and/or travel support from Fundacion Santa Fe, CADI, Stanford, Jornada Paulista de Radiologia, Ascelia Pharma; unpaid advisor for Quantix Bio; stockholder, Livivos; equipment loan to institution from GE HealthCare; institutional payments for academic co-chair position of Imaging Workstream, NIMBLE; volunteer in working groups, QIBA. S.P.S. Employee, Pfizer; unpaid leadership position as co-chair for the FNIH consortium that led to this work; stockholder, Pfizer.

Abbreviations:

FIB-4
fibrosis based on four factors
MRE
MR elastography
NAFLD
nonalcoholic fatty liver disease
NASH
nonalcoholic steatohepatitis
NIMBLE
Non-invasive Biomarkers of Metabolic Liver Disease
PDFF
proton density fat fraction
VAT
visceral adipose tissue

References

  • 1. Younossi ZM , Koenig AB , Abdelatif D , Fazel Y , Henry L , Wymer M . Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes . Hepatology 2016. ; 64 ( 1 ): 73 – 84 . [DOI] [PubMed] [Google Scholar]
  • 2. Bondini S , Kleiner DE , Goodman ZD , Gramlich T , Younossi ZM . Pathologic assessment of non-alcoholic fatty liver disease . Clin Liver Dis 2007. ; 11 ( 1 ): 17 – 23, vii . [DOI] [PubMed] [Google Scholar]
  • 3. Younossi Z , Anstee QM , Marietti M , et al . Global burden of NAFLD and NASH: trends, predictions, risk factors and prevention . Nat Rev Gastroenterol Hepatol 2018. ; 15 ( 1 ): 11 – 20 . [DOI] [PubMed] [Google Scholar]
  • 4. Khalifa A , Rockey DC . The utility of liver biopsy in 2020 . Curr Opin Gastroenterol 2020. ; 36 ( 3 ): 184 – 191 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Seeff LB , Everson GT , Morgan TR , et al . Complication rate of percutaneous liver biopsies among persons with advanced chronic liver disease in the HALT-C trial . Clin Gastroenterol Hepatol 2010. ; 8 ( 10 ): 877 – 883 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Sanyal AJ , Shankar SS , Calle RA , et al . Non-invasive biomarkers of nonalcoholic steatohepatitis: the FNIH NIMBLE project . Nat Med 2022. ; 28 ( 3 ): 430 – 432 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Heba ER , Desai A , Zand KA , et al . Accuracy and the effect of possible subject-based confounders of magnitude-based MRI for estimating hepatic proton density fat fraction in adults, using MR spectroscopy as reference . J Magn Reson Imaging 2016. ; 43 ( 2 ): 398 – 406 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Noureddin M , Lam J , Peterson MR , et al . Utility of magnetic resonance imaging versus histology for quantifying changes in liver fat in nonalcoholic fatty liver disease trials . Hepatology 2013. ; 58 ( 6 ): 1930 – 1940 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Kim HJ , Cho HJ , Kim B , et al . Accuracy and precision of proton density fat fraction measurement across field strengths and scan intervals: A phantom and human study . J Magn Reson Imaging 2019. ; 50 ( 1 ): 305 – 314 . [DOI] [PubMed] [Google Scholar]
  • 10. Cui J , Heba E , Hernandez C , et al . Magnetic resonance elastography is superior to acoustic radiation force impulse for the Diagnosis of fibrosis in patients with biopsy-proven nonalcoholic fatty liver disease: A prospective study . Hepatology 2016. ; 63 ( 2 ): 453 – 461 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Cui J , Ang B , Haufe W , et al . Comparative diagnostic accuracy of magnetic resonance elastography vs. eight clinical prediction rules for non-invasive diagnosis of advanced fibrosis in biopsy-proven non-alcoholic fatty liver disease: a prospective study . Aliment Pharmacol Ther 2015. ; 41 ( 12 ): 1271 – 1280 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Taouli B , Serfaty L . Magnetic resonance imaging/elastography is superior to transient elastography for detection of liver fibrosis and fat in nonalcoholic fatty liver disease . Gastroenterology 2016. ; 150 ( 3 ): 553 – 556 . [DOI] [PubMed] [Google Scholar]
  • 13. Kim D , Kim WR , Talwalkar JA , Kim HJ , Ehman RL . Advanced fibrosis in nonalcoholic fatty liver disease: noninvasive assessment with MR elastography . Radiology 2013. ; 268 ( 2 ): 411 – 419 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Loomba R , Wolfson T , Ang B , et al . Magnetic resonance elastography predicts advanced fibrosis in patients with nonalcoholic fatty liver disease: a prospective study . Hepatology 2014. ; 60 ( 6 ): 1920 – 1928 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Tejani S , McCoy C , Ayers CR , et al . Cardiometabolic health outcomes associated with discordant visceral and liver fat phenotypes: insights from the Dallas Heart Study and UK Biobank . Mayo Clin Proc 2022. ; 97 ( 2 ): 225 – 237 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Serai SD , Obuchowski NA , Venkatesh SK , et al . Repeatability of MR Elastography of Liver: A Meta-Analysis . Radiology 2017. ; 285 ( 1 ): 92 – 100 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. QIBA MR Biomarker Committee . MR Elastography of the Liver, Quantitative Imaging Biomarkers Alliance. Profile Stage: Technically Confirmed . https://qibawiki.rsna.org/images/5/54/MRE-QIBAProfile-2022-02-14-TECHNICALLY-CONFIRMED.pdf. Published February 14, 2022. Accessed February 15, 2023 .
  • 18. Sterling RK , Lissen E , Clumeck N , et al . Development of a simple noninvasive index to predict significant fibrosis in patients with HIV/HCV coinfection . Hepatology 2006. ; 43 ( 6 ): 1317 – 1325 . [DOI] [PubMed] [Google Scholar]
  • 19. Grundy SM , Cleeman JI , Daniels SR , et al . Diagnosis and management of the metabolic syndrome: an American Heart Association/National Heart, Lung, and Blood Institute Scientific Statement . Circulation 2005. ; 112 ( 17 ): 2735 – 2752 . [DOI] [PubMed] [Google Scholar]
  • 20. Silva AM , Grimm RC , Glaser KJ , et al . Magnetic resonance elastography: evaluation of new inversion algorithm and quantitative analysis method . Abdom Imaging 2015. ; 40 ( 4 ): 810 – 817 . [DOI] [PubMed] [Google Scholar]
  • 21. Borga M , Thomas EL , Romu T , et al . Validation of a fast method for quantification of intra-abdominal and subcutaneous adipose tissue for large-scale human studies . NMR Biomed 2015. ; 28 ( 12 ): 1747 – 1753 . [DOI] [PubMed] [Google Scholar]
  • 22. Raunig DL , McShane LM , Pennello G , et al . Quantitative imaging biomarkers: a review of statistical methods for technical performance assessment . Stat Methods Med Res 2015. ; 24 ( 1 ): 27 – 67 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Siddiqui MS , Harrison SA , Abdelmalek MF , et al . Case definitions for inclusion and analysis of endpoints in clinical trials for nonalcoholic steatohepatitis through the lens of regulatory science . Hepatology 2018. ; 67 ( 5 ): 2001 – 2012 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Zhao R , Hernando D , Harris DT , et al . Multisite multivendor validation of a quantitative MRI and CT compatible fat phantom . Med Phys 2021. ; 48 ( 8 ): 4375 – 4386 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Johnson BL , Schroeder ME , Wolfson T , et al . Effect of flip angle on the accuracy and repeatability of hepatic proton density fat fraction estimation by complex data-based, T1-independent, T2*-corrected, spectrum-modeled MRI . J Magn Reson Imaging 2014. ; 39 ( 2 ): 440 – 447 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Colgan TJ , Van Pay AJ , Sharma SD , Mao L , Reeder SB . Diurnal variation of proton density fat fraction in the liver using quantitative chemical shift encoded MRI . J Magn Reson Imaging 2020. ; 51 ( 2 ): 407 – 414 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Hines CDG , Bley TA , Lindstrom MJ , Reeder SB . Repeatability of magnetic resonance elastography for quantification of hepatic stiffness . J Magn Reson Imaging 2010. ; 31 ( 3 ): 725 – 731 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Lee Y , Lee JM , Lee JE , et al . MR elastography for noninvasive assessment of hepatic fibrosis: reproducibility of the examination and reproducibility and repeat-ability of the liver stiffness value measurement . J Magn Reson Imaging 2014. ; 39 ( 2 ): 326 – 331 . [DOI] [PubMed] [Google Scholar]
  • 29. Shire NJ , Yin M , Chen J , et al . Test-retest repeatability of MR elastography for noninvasive liver fibrosis assessment in hepatitis C . J Magn Reson Imaging 2011. ; 34 ( 4 ): 947 – 955 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Caussy C , Reeder SB , Sirlin CB , Loomba R . Noninvasive, quantitative assessment of liver fat by MRI-PDFF as an endpoint in NASH trials . Hepatology 2018. ; 68 ( 2 ): 763 – 772 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Borga M , Ahlgren A , Romu T , Widholm P , Dahlqvist Leinhard O , West J . Reproducibility and repeatability of MRI-based body composition analysis . Magn Reson Med 2020. ; 84 ( 6 ): 3146 – 3156 . [DOI] [PubMed] [Google Scholar]
  • 32. Middleton MS , Haufe W , Hooker J , et al . Quantifying Abdominal Adipose Tissue and Thigh Muscle Volume and Hepatic Proton Density Fat Fraction: Repeatability and Accuracy of an MR Imaging-based, Semiautomated Analysis Method . Radiology 2017. ; 283 ( 2 ): 438 – 449 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. West J , Dahlqvist Leinhard O , Romu T , et al . Feasibility of MR-based body composition analysis in large scale population studies . PLoS One 2016. ; 11 ( 9 ): e0163332 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Nogami A , Yoneda M , Iwaki M , et al . Diagnostic comparison of vibration-controlled transient elastography and MRI techniques in overweight and obese patients with NAFLD . Sci Rep 2022. ; 12 ( 1 ): 21925 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Caussy C , Chen J , Alquiraish MH , et al . Association Between Obesity and Discordance in Fibrosis Stage Determination by Magnetic Resonance vs Transient Elastography in Patients With Nonalcoholic Liver Disease . Clin Gastroenterol Hepatol 2018. ; 16 ( 12 ): 1974 – 1982.e7 . [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Radiology are provided here courtesy of Radiological Society of North America

RESOURCES