Abstract
Objectives
This study investigated the influence of hepatic vessels on the quantification of magnetic resonance imaging (MRI) proton density fat fraction (PDFF) and R2* using automated whole-liver segmentation.
Materials and methods
This prospective multicenter study included patients with chronic liver disease having paired liver biopsy and MR exams with a standardized multiecho chemical-shift gradient echo sequence. Automated whole-liver segmentation was performed, generating two masks per patient, one including and the other excluding the major hepatic vessels. PDFF and R2* were quantified and graded for both masks. Histological grading of hepatic steatosis and iron overload severity was used as a reference standard.
Results
A total of 377 patients were evaluated, of whom 54% had hepatic steatosis and 20% had iron overload on biopsy readings. Stratified by histological grades, there were no statistically significant differences in the distribution of PDFF or R2* between the two segmentation masks. Overall, PDFF and R2* values were minimally lower when vessels were included, with a bias of -0.06% for PDFF and -0.25 s-1 for R2*. A lower coefficient of variation was obtained for both imaging parameters after excluding vessels. Patients were classified in the same PDFF grades despite the segmentation approach, and only 7 cases (1.9% of the study population) were reclassified for R2* grading, all being upgraded after vessel exclusion.
Conclusion
Excluding hepatic vessels entails nonsignificant differences in PDFF and R2* quantification. Although with limited impact, vessel exclusion improves biomarker precision in research settings demanding high accuracy and increases clinicians’ confidence when using automatic tools in clinical practice.
Relevance statement
Fat and iron quantification on MRI are key imaging biomarkers for the accurate non-invasive assessment of patients with chronic liver disease. Proton density, fat fraction, and R2* quantification show minimal differences if hepatic vessels are included or excluded from the liver segmentation mask.
Key Points
The effect of hepatic vessels on proton density, fat fraction, and R2* quantification was evaluated.
No significant differences were found, excluding hepatic vessels, although their inclusion showed a small negative bias.
Vessel exclusion may improve clinicians’ confidence and precision in high-sensitivity applications.
Graphical Abstract
Keywords: Artificial intelligence, Blood vessels, Fatty liver, Iron overload, Magnetic resonance imaging
Introduction
Quantitative liver magnetic resonance imaging (MRI) has become an essential tool in both clinical assessment and research involving patients with chronic liver diseases. The growing burden of conditions such as metabolically-dysfunction-associated steatotic liver disease (MASLD) and alcohol-related liver disease, combined with the limitations of liver biopsy, has fostered the use of noninvasive imaging biomarkers [1, 2]. Multiecho chemical shift-encoded (MECSE) MR acquisitions have enabled accurate and simultaneous quantification of two relevant imaging biomarkers: proton density fat fraction (PDFF), a surrogate for hepatic steatosis, and R2*, a surrogate for iron content [3, 4]. These biomarkers have shown strong associations with histopathological grades, making them attractive tools for diagnosis, disease monitoring, and evaluation of treatment response [4‒6].
Despite their strengths, the reliability of PDFF and R2* quantification mainly depends on the standardization and reproducibility of the liver areas being measured. The first clinical studies relied on regions of interest manually placed in different liver segments to derive PDFF and R2* mean values [7, 8]. Although this approach is relatively straightforward, it has several limitations: (i) it samples only a small portion of the liver; (ii) it is susceptible to operator bias; (iii) it is time-consuming; and (iv) it fails to adequately capture spatial heterogeneity, especially relevant in patients with MASLD where fat and iron distribution may vary significantly across hepatic segments [9, 10]. Furthermore, region-of-interest-based approaches often exclude regions near vascular and biliary structures, underestimating disease burden in those regions.
To overcome these limitations, automated whole-liver segmentation methods have emerged, leveraging image processing techniques and artificial intelligence to delineate the entire liver parenchyma in an efficient and reproducible manner [11]. These methods facilitate comprehensive analysis of the entire parenchyma, and allow large-scale data extraction, which is essential for population studies and longitudinal imaging biomarker analysis. However, the effect of including or excluding the intrahepatic vessels when calculating PDFF and R2* from the entire organ remains unclear.
Hepatic vessels have inherently different image signal characteristics than the parenchyma. Although their inclusion might simplify the clinical workflow by eliminating the need to review vessel exclusion in segmentation masks, it might bias PDFF and R2* measurements, leading to systematic errors. This bias can introduce changes in the quantification of imaging biomarkers, having an impact on clinical management and therapeutic trials [12].
While some segmentation tools attempt to exclude vessels through thresholding or anatomical priors [13, 14], the degree to which their inclusion affects biomarker quantification in real-world patient populations is unknown. While most prior studies have validated PDFF and R2* against histology using whole-liver or region-of-interest-based methods [15, 16], this is the first to assess the impact of intrahepatic large vessels on biomarker precision and variability.
This study aims to address this knowledge gap by evaluating a large, multicenter cohort of patients with chronic liver disease who underwent both liver biopsy and MR imaging. Specifically, we assessed whether the inclusion of hepatic vessels within the segmented liver parenchyma significantly affects PDFF and R2* measurements. We hypothesize that vessel exclusion improves the accuracy of clinical assessment and supports the establishment of standardized imaging protocols and post-processing methodologies.
Materials and methods
Study design and population
This prospective multicenter study enrolled adult patients with chronic liver disease who underwent both liver biopsy and MRI within a one-month interval (median time difference of 19, range 13‒26 days). Recruitment occurred between 2017 and 2024 (study registry: 2017/0031/PI) at three medical centers in Valencia, Spain. Inclusion criteria were: ≥ 18-years-old, adequate specimen of liver biopsy for histological evaluation (> 15 mm, > 6 portal tracts), and written informed consent, as approved by the institutional review boards of all participating hospitals. Patients with prior liver transplantation, contraindications to MRI, severe motion or technical artifacts on MRI, decompensated cirrhosis, and the presence of hepatocellular carcinoma, were excluded.
MRI protocol and image analysis
All participants underwent a standard non-enhanced liver MRI examination with the same scanner (3-T Tx Achieva, Philips Healthcare) at ICTS-ReDIB Imaging La Fe (Valencia, Spain) A two-dimensional transversal MECSE gradient-echo sequence with 12 echoes (echo time 0.9–7.9 ms; delta echo time 0.7 ms; repetition time 9 ms), and low flip angle of 10° to minimize T1 bias, was used for the simultaneous PDFF and R2* quantification analysis. Imaging was performed during breath-hold, with whole-liver coverage; 24‒34 slices were acquired per volume with a slice thickness of 6‒7 mm.
Postprocessing was performed using QP-Liver® (Quibim, Valencia, Spain), an artificial intelligence-based software certified (Conformité Européene‒CE mark/UK conformity assessment class IIa) for automated liver segmentation and quantitative biomarker extraction. For each subject, two segmentation strategies were applied (Fig. 1):
whole-liver segmentation, including hepatic vessels [11];
whole-liver segmentation, excluding hepatic vessels. Vessels were automatically detected through an artificial intelligence-based algorithm and excluded from the liver mask. The vessel's automatic segmentation algorithm was based on a convolutional neural network with the same architecture as already proposed [11]. This algorithm yielded high accuracy when validated against expert manual annotations, with Dice coefficients of 0.97 in the internal test set, 0.93 in an external Siemens 3-T dataset, and 0.85 in an external Philips 1.5-T dataset (details in Supplementary Material 1). This ensures reproducibility and robustness across scanners.
Fig. 1.
Pipeline followed for image analysis on MECSE images. Whole-liver segmentation, including (top row) and excluding (bottom row) hepatic vessels, was automatically computed. For each, PDFF and R2* parametric maps were calculated. MECSE, Multiecho chemical shift encoded gradient-echo; MRI, Magnetic resonance imaging; PDFF, Proton density fat fraction
Voxel-wise PDFF (%) and R2* (s⁻¹) maps were computed, and statistical descriptors (median, mean, interquartile range) were extracted for both segmentation masks.
Histological analysis
Histological evaluation was centrally performed by an experienced liver pathologist (C.A.C, with > 10 years of experience), blinded to clinical data and MRI results. After formalin fixation, samples were embedded in paraffin, and tissue sections were cut and stained with hematoxylin and eosin for steatosis detection and Perls’ stain for iron assessment. Steatosis was graded according to the Nonalcoholic Steatohepatitis‒NASH Clinical Research Network scoring system (S0‒S3) [17]. The grading of iron storage was assessed using the Scheuer scoring system (Fe0‒Fe4) [18].
Statistical analysis
The primary analysis was planned as a paired two-sided test for equivalence on the mean difference. Because PDFF and R2* were considered co-primary endpoints, a Bonferroni correction was applied (αtotal = 0.05, αper-endpoint = 0.025). With 90% power (z1−β = 1.282) and z1−α = 1.96, the required sample size per endpoint was estimated as:
Estimates of the standard deviation of paired differences (sd) were obtained from reproducibility data in the literature. The Quantitative Imaging Biomarker Alliance (QIBA) PDFF profile reports a repeatability coefficient of 4.7% [19], corresponding to sd ≈ 2.4%. For R2*, a sd ≈ 5.1 s⁻¹ was assumed based on the repeatability coefficient for confounder-corrected R2* (10 s⁻¹ at 3 T) reported in [4]. With equivalence margins of Δ = 0.5% for PDFF and Δ = 2 s⁻¹ for R2*, the resulting sample size requirements were n ≈ 101 for PDFF and n ≈ 69 for R2*. Selecting the maximum across endpoints and scenarios yields a conservative requirement of n = 101.
Given non-Gaussian distributions, differences between the PDFF and R2* values with the two segmentation methods were assessed using the nonparametric Mann–Whitney U-test. The Spearman rank correlation coefficient (ρ) was used to explore the correlation between quantitative metrics computed with both segmentation methods. Statistical significance was set at p < 0.05. Median PDFF and R2* values were compared by linear regression and Bland–Altman analyses. Also, the coefficient of variation (CoV) of both PDFF and R2* parametric maps was measured by dividing the standard deviation by the mean value.
Reclassification analysis was conducted by categorizing patients into steatosis and iron overload grades based on PDFF [20] and R2* validated thresholds [15]. We assessed the number of patients who modified their grade severity when using each segmentation approach. The following imaging thresholds were used:
Steatosis (PDFF): S0 (< 5.75%), S1 (5.75‒15.5%), S2 (15.5–21.35%), and S3 (> 21.35%);
Iron overload (R2*): Fe0 (< 55 s-1), Fe1 (55‒56 s-1), Fe2 (56‒64 s-1), Fe3 (64‒69 s-1), and Fe4 (> 69 s-1).
Analyses were performed using Python 3.11.
Results
Patient characteristics
A total of 377 patients with different liver etiologies were finally included in the analysis, exceeding the required minimum sample size of 101 patients, ensuring robust statistical power for equivalence testing. Patient characteristics are described in Table 1.
Table 1.
Patient demographics and histological characteristics
| Characteristics | Sample (n = 377) | |
|---|---|---|
| Sex, female | 232 (61.5%) | |
| Age, years (range) | 55 (17‒80) | |
| Body mass index (kg/m2) | 27.9 ± 5.0 | |
| Liver disease etiology | MASLD | 205 (54.4%) |
| Autoimmune hepatitis | 121 (32.1%) | |
| Alcohol consumption | 25 (6.6%) | |
| Viral hepatitis | 18 (4.8%) | |
| Hemochromatosis | 4 (1.1%) | |
| Other | 4 (1.1%) | |
| Histological fibrosis stage | F0 | 132 (35.0%) |
| F1 | 83 (22.0%) | |
| F2 | 79 (21.0%) | |
| F3 | 48 (12.7%) | |
| F4 | 35 (9.3%) | |
| Histological steatosis grade | S0 | 174 (46.2%) |
| S1 | 70 (18.6%) | |
| S2 | 59 (15.6%) | |
| S3 | 74 (19.6%) | |
| Histological iron grade | Fe0 | 303 (80.4%) |
| Fe1 | 22 (5.8%) | |
| Fe2 | 17 (4.5%) | |
| Fe3 | 20 (5.3%) | |
| Fe4 | 15 (4.0%) | |
MASLD Metabolic dysfunction-associated steatotic liver disease
Segmentation masks
All cases were automatically segmented with both segmentation methods. Figure 2 shows the corresponding PDFF parametric map in a patient with severe steatosis. Figure 3 shows the corresponding R2* parametric map in a patient with high iron overload. Overall, the population mean liver volume was 1.39 ± 0.37 L, considering the entire liver mask and 1.36 ± 0.36 after excluding the hepatic vessels. The mean vessel volume was 25.52 ± 12.76 mL (1.81 ± 0.64% of the entire liver mask). Across the entire cohort, PDFF and R2* showed minimal absolute variations between both segmentation strategies, with no statistically significant differences (Table 2) and a very strong correlation between segmentation methods (ρ ≥ 0.99).
Fig. 2.
PDFF (a) and R2* (b) parametric maps from a patient with high steatosis (MASLD S3 histological steatosis grade). For each, the results of the automatic segmentation on three representative slices, including (upper row) and excluding (bottom row) the hepatic vessels, can be observed. Median PDFF: 29.46% with vessels, 29.72% without vessels. MASLD, Metabolic dysfunction-associated steatotic liver disease; PDFF, Proton density fat fraction
Fig. 3.
PDFF (a) and R2* (b) parametric maps from a patient with high iron overload (hemochromatosis Fe4 histological iron grade). For each, the results of the automatic segmentation on three representative slices, including (upper row) and excluding (bottom row) the hepatic vessels, can be observed. Median R2*: 277.42 s-1 with vessels, 279.03 s-1 without vessels. PDFF, Proton density fat fraction
Table 2.
PDFF and R2* values (median and interquartile range) extracted from the whole-liver segmentation, including and excluding the hepatic vessels
| Metric | With vessels | Without vessels | Absolute difference | p-value | ρ |
|---|---|---|---|---|---|
| PDFF (%) | 3.49 (0.37–10.77) | 3.56 (0.38–10.88) | 0.06 [0–0.27] | 0.90 | 0.99 |
| R2* (s-1) | 49.07 (41.59–59.14) | 49.27 (41.73–59.41) | 0.25 [0–2.93] | 0.76 | 0.99 |
The absolute difference is represented as the mean [minimum–maximum]
PDFF Proton density fat fraction
Linear regression analysis showed a slope of 0.99 for both PDFF and R2*. Bland–Altman analysis showed a minor negative bias when including the hepatic vessels, with a bias of -0.06% for PDFF and -0.25 s-1 for R2* (Fig. 4). The CoV analysis at a case level (Fig. 5) showed a lower mean CoV in both PDFF (120.31% versus 120.91%) and R2* (49.89% versus 50.86%) metrics derived when hepatic vessels were excluded from the segmentation mask.
Fig. 4.
Linear regression (left graphs) and Bland–Altman (right graphs) analysis comparing PDFF (upper row) and R2* (bottom row) values, including the hepatic vessels versus excluding hepatic vessels. For regression plots, the solid line represents the fitted regression line, and the dashed line represents the unity line. For Bland–Altman plots, the solid line represents the mean differences (bias), and the dashed lines represent 95% confidence intervals. PDFF, Proton density fat fraction
Fig. 5.
CoV at a case level of PDFF (upper row) and R2* (bottom row) values, including (left graphs) and excluding (right graphs) the hepatic vessels. The dashed line represents the mean CoV value. The outlier present in the R2* CoV analysis corresponds to a case with air in the colon, a small portion was incorrectly included in the automatic segmentation, significantly increasing the R2* standard deviation for both masks. CoV, Coefficient of variation; PDFF, Proton density fat fraction
Steatosis and iron grades
After excluding vessels, median PDFF values increased marginally across all histological steatosis grades (S0-S3) (Table 3). Differences between segmentation strategies were not statistically significant, with a very strong correlation between both methods (ρ ≥ 0.99) across each histological grade. Similarly, median R2* values showed minimal changes across histological iron scores, with no statistical differences between the two segmentation masks (Table 3). There was a very strong correlation between both methods (ρ ≥ 0.99) stratified by histological iron overload grades. Figure 6 represents PDFF and R2* values stratified by histological grades with both segmentation masks, showing similar distributions across each grade of steatosis and iron overload.
Table 3.
PDFF and R2* values (median and interquartile range) extracted from the whole-liver segmentation, including and excluding the hepatic vessels, for each steatosis and iron grade
| Histological steatosis grade | PDFF with vessels (%) | PDFF without vessels (%) | p-value | ρ |
|---|---|---|---|---|
| S0 (n = 174) | 0.36 (0.11–1.67) | 0.38 (0.10–1.70) | 0.95 | 0.99 |
| S1 (n = 70) | 4.82 (3.05–6.78) | 4.90 (3.10–6.85) | 0.81 | 0.99 |
| S2 (n = 59) | 9.48 (7.27–12.41) | 9.53 (7.33–12.54) | 0.91 | 0.99 |
| S3 (n = 74) | 16.15 (13.20–21.41) | 16.26 (13.29–21.55) | 0.89 | 0.99 |
| Histological iron grade | R2* with vessels (s-1) | R2* without vessels (s-1) | p-value | ρ |
| Fe0 (n = 303) | 47.02 (40.83–54.96) | 47.12 (41.03–55.26) | 0.76 | 0.99 |
| Fe1 (n = 22) | 60.43 (50.15–72.78) | 60.84 (50.43–73.04) | 0.95 | 0.99 |
| Fe2 (n = 17) | 58.38 (52.81–68.11) | 58.54 (52.98–68.42) | 0.73 | 1.00 |
| Fe3 (n = 20) | 90.83 (61.81–105.90) | 91.33 (62.07–106.37) | 0.98 | 0.99 |
| Fe4 (n = 15) | 111.40 (75.52–193.74) | 111.77 (75.86–196.00) | 0.98 | 0.99 |
PDFF Proton density fat fraction
Fig. 6.
Median PDFF (upper row) and R2* (bottom row) distributions across different histological steatosis and iron overload grades using the liver segmentation, including and excluding the hepatic vessels. PDFF, Proton density fat fraction
Considering PDFF grading cut-offs, no patients were reclassified into different PDFF grades (PDFF-S0 – PDFF-S3) using the segmentation approach that excludes the vessels. On the other hand, considering R2* grading cut-offs, only seven patients were reclassified, increasing by one grade the iron overload severity after vessel exclusion (Fig. 7). The etiologies of the discordant cases were MASLD (n = 5), alcohol related liver disease (n = 1), and autoimmune hepatitis (n = 1). Supplementary Table S1 shows the median R2* values of these discordant cases before and after vessel exclusion. The difference in liver iron content was very small (≤ 0.55 s⁻¹), and all cases presented R2* values close to the established MRI-derived grade cutoffs. No cases with severe iron overload (Fe4) were misclassified.
Fig. 7.
Confusion matrix comparing the PDFF-derived steatosis grades (left) and the R2*-derived iron scores (right) when using the whole-liver segmentation, including and excluding the hepatic vessels. PDFF, Proton density fat fraction
Discussion
This study provides comprehensive evidence that the inclusion of hepatic vessels in the segmentation mask of liver parenchyma has a low impact on PDFF and R2* quantification. Across a large and diverse cohort of patients with chronic liver disease, we observed no statistically significant differences in PDFF or R2* values when comparing segmentation strategies with versus without vessels. There was a strong correlation between segmentation methods (ρ = 0.99) and a minimal non-significant absolute difference (≤ 0.27% for PDFF and ≤ 2.93 s⁻¹ for R2*). In addition, the linear regression analysis showed a slope of 0.99 for both imaging biomarkers.
These results have several important clinical implications. First, they provide further validation to current automated whole-liver quantification tools that do not explicitly exclude vessels [11, 15]. In routine clinical practice, where processing speed is crucial and computational algorithms are not uniformly available, our findings support the use of whole-liver masks without complex vessel-exclusion filters as a reliable approach. Second, we demonstrate that vessel inclusion has minimal clinical impact on PDFF and R2* quantification. For PDFF, the absolute differences between segmentation strategies were consistently below the repeatability coefficient of 1.2–1.6% reported by the NIMBLE consortium, which defines the threshold for distinguishing measurement error from true biological change in patients with liver disease [21]. Importantly, no cases showed a change in PDFF-derived steatosis grade regardless of the segmentation method. For liver iron content, the absolute differences between segmentation strategies were below the mean reproducibility bias reported for liver R2* quantification [4]. No cases with severe iron overload were misclassified, and only 7 patients (1.9% of the study population) were reclassified across R2*-derived iron grade. Notably, all reclassified cases had R2* values very close to the thresholds used for categorization into iron overload grades. Unlike PDFF grading, where thresholds span wider ranges, R2* grading is constrained by narrow cut-offs between categories. This explains our findings and highlights the need for future research to establish clinically driven cut-points for liver iron quantification, ideally based on patient outcomes rather than conventional histology. Finally, the hepatic vessel-to-liver volume ratio in chronic liver disease was around 2%, consistent with previous studies [22]. Because this vessel volumetry is small—and even decreases in advanced liver disease [22]—its contribution to whole-liver MRI biomarker quantification is expected to be negligible.
Nonetheless, subtle trends were observed and must be critically considered. Excluding vessels consistently produced slightly higher PDFF and R2* values across all histological grades of steatosis and iron overload. Although these differences were small and not statistically significant, they were directionally consistent, suggesting a systematic effect. In addition, the exclusion of hepatic vessels yielded a lower mean CoV for both PDFF (-0.06%) and R2* (-0.97%). Again, although the magnitude of the change was small, vessel inclusion systematically introduces greater variability relative to the mean (Fig. 3). Bland–Altman analysis further supported this observation, showing a small negative bias when hepatic vessels were included (-0.06% for PDFF and -0.25 s-1 for R2*). These differences may be due to the intrinsic properties of blood vessels, since they lack intracellular fat and may have altered magnetic susceptibility due to blood oxygen transportation, leading to distinct signal decay and R2* behavior compared to liver parenchyma [14].
Despite lower mean values and increased CoV, the clinical impact of hepatic vessels inclusion remains negligible, as nearly all cases were classified in the same severity grades determined by MRI. Thus, in hepatology care settings, all patients with moderate-to-severe steatosis or iron overload, will be managed identically regardless of the segmentation strategy used. However, in research settings where maximal measurement precision is required, such as clinical trials for drug development with strict inclusion criteria and longitudinal monitoring, minimizing even small sources of bias becomes desirable [5, 23]. In such contexts, even small reductions in measurement noise can enhance sensitivity to detect changes over time, which is particularly relevant in trials targeting the reduction of MRI biomarkers as a surrogate endpoint [24]. Therefore, while vessel exclusion may not alter routine clinical decisions, it could offer incremental value in research applications where analytical sensitivity is critical.
From a technical perspective, implementing vessel exclusion is feasible and increasingly accessible. Segmentation models based on convolutional neural networks can be trained to distinguish parenchyma from vessels using MRI [25]. Alternatively, intensity-based filtering or shape priors can approximate vascular regions [13, 14]. However, additional processing steps introduce new sources of variability, particularly in patients with aberrant vasculature, large cysts, or tumor involvement. Therefore, segmented regions should always be edited if needed and approved by a radiologist.
From a practical perspective, vessel exclusion may require additional computational steps, which modestly increase processing complexity compared to simple whole-liver segmentation. While modern artificial intelligence tools make this feasible with limited additional time, the balance between improved precision and workflow simplicity should be considered when implementing these methods in routine clinical practice.
A major strength of this study is its generalizability. The patient cohort covers a wide range of liver disease etiologies, and disease severity of histological steatosis and iron grades. The use of close histological reference standards provides a robust anchor for interpreting imaging results. Several limitations merit consideration. First, the study was conducted using a single post-processing platform, although its methodology has been validated across multiple scanners and regions [11]. Second, while we evaluated PDFF and R2*, other MRI biomarkers, such as T1 mapping, extracellular volume, fibrosis, or elastography, may be differentially affected by vascular inclusion and warrant individual analysis. Future studies will be conducted to extend the analysis to a broader set of imaging biomarkers to assess whether the patterns observed for PDFF and R2* generalize to other quantitative MRI parameters, with special consideration to elastography. Third, the absence of pediatric patients in our cohort limits the extrapolation of our findings to children. However, our results are generalizable to adult patients with chronic liver disease, who represent the primary target population for the clinical application of PDFF and R2*. A strength of our study is the inclusion of patients with diverse etiologies and a broad range of disease severity, which enhances the robustness and applicability of the results. Fourth, the slice thickness of 6–7 mm may have introduced partial volume effects, particularly at vessel boundaries, potentially attenuating differences between segmentation strategies.
Finally, the evolving regulatory landscape for imaging biomarkers must be considered. As PDFF and R2* move toward qualification as surrogate endpoints in clinical trials and potentially regulatory approval, standardizing image acquisition and analysis, including decisions around vessel inclusion, will become increasingly important. Consensus guidelines from professional societies such as the European Imaging Biomarkers Alliance—EIBALL and Quantitative Imaging Committee—QUIC may benefit from incorporating evidence from studies like ours to guide best practices.
In conclusion, this study demonstrates that major hepatic vessels have a minimal influence on automated PDFF and R2* quantification using whole-liver segmentation. Additionally, the exclusion of hepatic vessels may enhance clinicians’ trustworthiness and confidence when using these automatic tools in clinical practice.
Supplementary information
Acknowledgements
This work acknowledges the use of ICTS-ReDIB, supported by the Ministry of Science, Innovation and Universities (MICIU) at Imaging La Fe. The authors affirm that large language models (LLMs) were applied for language editing to enhance clarity or rephrase certain sections. No scientific content creation or data analysis relied on LLM technologies. The authors take full responsibility for the content of this manuscript.
Abbreviations
- CoV
Coefficient of variation
- MASLD
Metabolically-dysfunction-associated steatotic liver disease
- MECSE
Multiecho chemical shift-encoded
- MRI
Magnetic resonance imaging
- PDFF
Proton density fat fraction
Author contributions
AJ-P analyzed and interpreted the patient data and was the main contributor in writing the manuscript. DM-A led patient recruitment and study design and was a key contributor in writing the manuscript. BP developed the algorithm for automatic vessel segmentation. CA-C performed the patients’ histological evaluation. AP-G reviewed all MR exams to guarantee inclusion criteria fulfillment. AA-B guaranteed the technical accuracy of the analyses and actively participated during manuscript writing. LM-B guaranteed the scientific accuracy of the study and actively participated during manuscript writing. All authors read and approved the final manuscript.
Funding
DM-A is supported by a Joan Rodés contract (JR22/00002), Instituto de Salud Carlos III (Spanish Ministry of Science and Innovation), and he also received a grant Grupo de Investigación Emergente (CIGE/2022/37) from Conselleria de Educación, Cultura, Universidades y Empleo, Generalitat Valenciana, Spain.
Data availability
The data that support the findings of this study are available upon reasonable request to the authors (registry number: 2017/0031/PI).
Declarations
Ethics approval and consent to participate
The study conforms to the ethical guidelines of the 1975 Declaration of Helsinki and had the a priori approval by the institutional review boards of the three participating hospitals (2016/209, 2017/0031/PI, and 13/2019).
Consent for publication
All participants signed informed consent.
Competing interests
AJ-P is an employee of Quibim SL. AA-B is the CEO and shareholder of Quibim SL. All the other authors have no competing interests.
Footnotes
Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Ana Jimenez-Pastor and David Marti-Aguado contributed equally to this work.
Supplementary information
The online version contains supplementary material available at 10.1186/s41747-025-00663-1.
References
- 1.Younossi ZM, Koenig AB, Abdelatif D, Fazel Y, Henry L, Wymer M (2016) Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology 64:73–84. 10.1002/hep.28431 [DOI] [PubMed] [Google Scholar]
- 2.Brunt EM (2005) Nonalcoholic steatohepatitis: pathologic features and differential diagnosis. Semin Diagn Pathol 22:330–338. 10.1053/j.semdp.2006.04.002 [DOI] [PubMed] [Google Scholar]
- 3.Reeder SB, Cruite I, Hamilton G, Sirlin CB (2011) Quantitative assessment of liver fat with magnetic resonance imaging and spectroscopy. J Magn Reson Imaging 34:729–749. 10.1002/jmri.22775 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gandon Y, Henninger B, Hillenbrand C et al (2023) Quantification of liver iron overload with MRI: review and guidelines from the ESGAR and SAR. Radiology 307:e221856. 10.1148/radiol.221856 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Noureddin M, Lam J, Peterson MR et al (2013) Utility of magnetic resonance imaging versus histology for quantifying changes in liver fat in nonalcoholic fatty liver disease trials. Hepatology 58:1930–1940. 10.1002/hep.26455 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Leoni S, Tovoli F, Napoli L, Serio I, Ferri S, Bolondi L (2018) Current guidelines for the management of non-alcoholic fatty liver disease: a systematic review with comparative analysis. World J Gastroenterol 24:3361–3373. 10.3748/wjg.v24.i30.3361 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yokoo T, Serai SD, Pirasteh A et al (2017) Linearity, bias, and precision of hepatic proton density fat fraction measurements by using MR imaging: a meta-analysis. Radiology 286:486–498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mazzola M, Cannella R, Pilato G et al (2024) Quantitative liver MRI biomarkers: what the radiologist should know. J Med Imaging Intervent Radiol 79:12–22. 10.1007/s44326-024-00015-0 [Google Scholar]
- 9.Middleton MS, Van Natta ML, Heba ER et al (2018) Diagnostic accuracy of magnetic resonance imaging hepatic proton density fat fraction in pediatric nonalcoholic fatty liver disease. Hepatology 67:858–872 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Rafique MS, Ayub A, Malik AK, Malik T, Kundi S, Saeed A (2023) Assessment of segmental hepatic fat distribution using magnetic resonance proton density fat fraction MR-PDFF in non-alcoholic fatty liver disease (NAFLD). Nat J Health Sci 8:57–61. 10.21089/njhs.82.0057 [Google Scholar]
- 11.Jimenez-Pastor A, Alberich-Bayarri A, Lopez-Gonzalez R et al (2021) Precise whole liver automatic segmentation and quantification of PDFF and R2* on MR images. Eur Radiol 31:7876–7887. 10.1007/s00330-021-07838-5 [DOI] [PubMed] [Google Scholar]
- 12.Jang EC, Jun DW, Lee SM, Cho YK, Ahn SB (2017) Comparison of efficacy of low-carbohydrate and low-fat diet education programs in non-alcoholic fatty liver disease: a randomized controlled study. Hepatol Res 48:e22–e29. 10.1111/hepr.12918 [DOI] [PubMed] [Google Scholar]
- 13.Wantanajittikul K, Saiviroonporn P, Saekho S, Krittayaphong R, Viprakasit V (2021) An automated liver segmentation in liver iron concentration map using fuzzy c-means clustering combined with anatomical landmark data. BMC Med Imaging 21:138. 10.1186/s12880-021-00669-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tipirneni-Sajja A, Song R, McCarville MB, Loeffler RB, Hankins JS, Hillenbrand CM (2018) Automated vessel exclusion technique for quantitative assessment of hepatic iron overload by R2*-MRI. J Magn Reson Imaging 47:1542–1551. 10.1002/jmri.25880 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Martí-Aguado D, Jiménez-Pastor A, Alberich-Bayarri Á et al (2021) Automated whole-liver MRI segmentation to assess steatosis and iron quantification in chronic liver disease. Radiology 47:1542–1551. 10.1148/radiol.2021211027 [DOI] [PubMed] [Google Scholar]
- 16.França M, Alberich-Bayarri Á, Martí-Bonmatí L et al (2017) Accurate simultaneous quantification of liver steatosis and iron overload in diffuse liver diseases with MRI. Abdom Radiol (NY) 42:1434–1443. 10.1007/s00261-017-1048-0 [DOI] [PubMed] [Google Scholar]
- 17.Kleiner DE, Brunt EM, Van Natta M et al (2005) Design and validation of a histological scoring system for nonalcoholic fatty liver disease. Hepatology 41:1313–1321. 10.1002/hep.20701 [DOI] [PubMed] [Google Scholar]
- 18.Salomao MA (2021) Pathology of hepatic iron overload. Clin Liver Dis 17:232–237. 10.1002/cld.1051 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Radiological Society of North America () QIBA profile: MRI-based proton density fat fraction (PDFF) of the liver. Available via https://qibawiki.rsna.org/images/b/bb/QIBA_MRI-Based_PDFF_of_the_Liver_Profile-Stage_1-Public_Comment.pdf
- 20.Qadri S, Vartiainen E, Lahelma M et al (2023) Marked difference in liver fat measured by histology vs. magnetic resonance-proton density fat fraction: a meta-analysis. JHEP Rep 6:100928. 10.1016/j.jhepr.2023.100928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fowler KJ, Venkatesh SK, Obuchowski N et al (2023) Repeatability of MRI biomarkers in nonalcoholic fatty liver disease: the NIMBLE consortium. Radiology 309:e231092. 10.1148/radiol.231092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Herold A, Sobotka D, Beer L et al (2025) MRI-derived quantification of hepatic vessel-to-volume ratios in chronic liver disease using a deep learning approach. Eur Radiol Exp 9:75. 10.1186/s41747-025-00612-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ratziu V, Harrison SA, Francque S et al (2016) Elafibranor, an agonist of the peroxisome proliferator-activated receptor-α and -δ, induces resolution of nonalcoholic steatohepatitis without fibrosis worsening. Gastroenterology 150:1147–1159.e5. 10.1053/j.gastro.2016.01.038 [DOI] [PubMed] [Google Scholar]
- 24.Caussy C, Reeder SB, Sirlin CB, Loomba R (2018) Noninvasive, quantitative assessment of liver fat by MRI-PDFF as an endpoint in NASH trials. Hepatology 68:763–772. 10.1002/hep.29797 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zbinden L, Catucci D, Suter Y et al (2022) Convolutional neural network for automated segmentation of the liver and its vessels on non-contrast T1 vibe Dixon acquisitions. Sci Rep 12:22059. 10.1038/s41598-022-26328-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support the findings of this study are available upon reasonable request to the authors (registry number: 2017/0031/PI).








