Abstract
The objective of this study was to assess the diagnostic value of plasma neutrophil gelatinase-associated lipocalin (pNGAL) for the early diagnosis of acute kidney injury (AKI) in adult patients following cardiac surgery requiring cardiopulmonary bypass (CPB). Electronic databases and other resources were systematically searched for relevant studies. Risk of bias was assessed using the Quality Assessment for Diagnostic Accuracy Studies 2 (QUADAS-2) tool. Studies were assigned to a sub-group based on the timing of the pNGAL sample in relation to the cessation of CPB. These were < 4 h, 4–8 h, 12 h or 24 h post-cessation of CPB. Summary values for sensitivity and specificity were estimated using the hierarchical summary receiver operator characteristic (ROC) curve model. A random-effects meta-analysis of each pair of sensitivity and specificity estimates from each included study was performed. In total, 3131 patients from 16 studies were included. When taken at 4–8 h following CPB, pNGAL had superior performance for the diagnosis of AKI in the defined population when compared to earlier and later time points. Prediction regions and confidence intervals, however, demonstrated significant variability in pooled estimates of sensitivity and specificity. This is likely due to population and study design heterogeneity, lack of standardisation of assays and thresholds, and inability to distinguish the different molecular forms of NGAL. In conclusion, the diagnostic utility of pNGAL in this clinical setting is inconclusive and large individual studies of representative populations of cardiac surgery patients using assays that specifically detect NGAL in its monomeric form are required.
Subject terms: Biochemistry, Cardiology, Health care, Nephrology
Introduction
Acute kidney injury (AKI) is a common complication of cardiac surgery1 and is associated with significant mortality, morbidity and increased health care costs2. Studies have reported that AKI, and more specifically AKI requiring renal replacement therapy (RRT) is associated with further increased mortality and longer intensive care unit (ICU) and total hospital length of stay (LOS)3–7. Patients undergoing cardiac surgery requiring cardiopulmonary bypass (CPB) are at even greater risk of developing AKI. The mechanism of kidney injury following CPB is multifactorial. Contributing factors include ischemia–reperfusion injury, reduced cardiac output, renal vasoconstriction, CPB hypothermia and rewarming, CPB-induced systemic inflammatory response leading to interstitial inflammation and coagulopathy, which in turn can lead to CPB-related embolization8–12. The reported incidence of post-cardiac surgery AKI varies depending on the definition of AKI and the clinical profile of patients studied. A large meta-analysis reported a rate of CPB associated AKI of 18.2% which is associated with a greater than twofold increase in mortality13.
In clinical practice, the diagnosis of AKI largely relies on the detection of an acute rise in serum creatinine and/or a reduction in urine output, both of which are considered suboptimal markers of AKI. Serum creatinine is affected by many factors including age, sex, race, body surface area, diet, diabetes, liver disease, drugs and laboratory analytical methods which compromise its performance as a diagnostic test for AKI14. Serum creatinine is a relatively late marker of AKI and up to 50% of glomerular function is lost before the serum creatinine is elevated above the reference range15. Neutrophil gelatinase-associated lipocalin (NGAL) is expressed and secreted by kidney cells soon after renal insult, including post-operative ischemia following CPB. NGAL concentrations rise rapidly, within 2–6 h, and are detectable in plasma when AKI is potentially limitable or reversible16,17 A meta-analysis of 307 studies involving 1200 patients, reported an area under the receiver operator characteristic curve (AUROC) of plasma NGAL (pNGAL) for predicting AKI post-cardiac surgery as 0.78 (95% confidence interval 0.67 to 0.87). Plasma NGAL is therefore considered to be a sensitive and specific early marker of AKI and can be detected up to 48 h before a diagnostic rise in serum creatinine17. A meta-analysis of 53 studies with over 7000 patients undergoing cardiac surgery reported that pNGAL was predictive of AKI and its severity, with an AUROC of 0.82–0.8318. This meta-analysis did not specifically review a patient cohort undergoing CPB.
In 2018, the AKI-diagnostics project reviewed the diagnostic performance of several potential tests including pNGAL for AKI in an ICU setting. Whilst a subgroup of post cardiac surgery patients was considered, the number of studies included was small and was not specifically defined as cardiac surgery requiring CPB. The review concluded that whilst pNGAL has the potential to add value, results of the analysis were highly uncertain, largely due to heterogeneity between studies19. A health technology assessment of pNGAL (and other biomarkers) in AKI in critically ill patients in 2019 similarly advised cautious interpretation of results due to heterogeneity between studies but concluded that future studies should evaluate the targeted use of biomarkers such as pNGAL within specific patient populations20. We therefore sought to quantitatively summarize published studies to evaluate the diagnostic accuracy of pNGAL for AKI in a specific clinical setting of adult patients who have undergone cardiac surgery requiring CPB.
Methods
The systematic review was registered with the International Database of Prospectively Registered Systematic Reviews (PROSPERO; registration number CRD42021261676) and conducted in accordance with the PRISMA guideline for reporting of systematic reviews (Supplementary Information)21.
Identification of studies
A comprehensive electronic search was carried out through PubMed, Cochrane database, Scopus, and ClinicalTrials.gov (http://clinicaltrials.gov/) up until June 2021. The search terms used in PubMed were: (heart OR cardiac OR aort* OR valv* OR thoracic) AND surg*) OR ‘valve replacement*’ OR ‘bypass*’ OR ‘CABG’ OR ‘extracorporeal circulation’ OR ‘on pump’ OR ‘Cardiac Surgical Procedures’) AND (‘neutrophil gelatinase-associated lipocalin’ OR NGAL OR ‘LCN2 protein, human’) AND (‘diagnostic accuracy’ OR ‘sensitivity’ OR ‘specificity’ OR ‘PPV’ OR ‘NPV’ OR ‘positive predictive value’ OR ‘negative predictive value’) . In the Cochrane library, Scopus and ClinicalTrials.gov, a similar strategy was used. In addition, abstracts from meetings and reference lists of eligible papers or related reviews were searched manually to identify additional relevant studies.
Inclusion and exclusion criteria
The inclusion criteria for studies were: (i) adult cardiac surgery cohort requiring CPB; (ii) measurement of pNGAL for the early diagnosis of AKI (within 24 h) after cardiac surgery; (iii) provision of data from which true-positive (TP), false-positive (FP), false-negative (FN) and true-negative (TN) could be identified or calculated; (iv) AKI clearly defined by acceptable methods- preferably by KDIGO, RIFLE or AKIN criteria22–24 and (v) those published in English. Exclusion criteria were: (i) studies with duplicate data reported in other studies; (ii) sample size less than 25; (iii) timing of pNGAL measurement not clearly defined; (iv) inclusion of paediatric patients within the cohort; (v) more than 20% ‘off-pump’ patients included in the cohort; (vi) insufficient diagnostic accuracy data available.
Study selection and data extraction
One reviewer (HSC) screened the titles and abstracts of all citations to judge eligibility based on the inclusion and exclusion criteria. For citations that could not be evaluated through the titles and abstracts, full texts were retrieved for thorough evaluation.
A second reviewer (JF) second checked all prospective citations for eligibility. Full-text copies of all potentially relevant reports were retrieved and assessed for inclusion by both reviewers (HSC and JF).
One reviewer (HSC) extracted the data from each study. This was checked for accuracy by the second reviewer (JF). Any discordance was then checked by the first reviewer. The following information was recorded from each selected study (i) basic characteristics of studies: name of the first author, year of publication, sample size, country; (ii) characteristics of cohort: AKI diagnosis criteria, number of ‘off-pump’ patients, number of patients who developed AKI; (iii) measurement of pNGAL: specimen type, analytical method, pNGAL test cut-off and the timing of sample collection; (iv) the criteria for the diagnosis of AKI; (v) study outcomes: test sensitivity and specificity and or true positive (TP), false positive (FP), false negative (FN), true negative (TN), positive predictive value (PPV), negative predictive value (NPV) and AUROC.
Assessment of the risk of bias
The Quality Assessment of Diagnostic Accuracy Studies version 2 (QUADAS-2) tool was used to assess the risk of bias25. The following items were evaluated: patient selection, interpretation of the index test, appropriateness and interpretation of the reference standard, flow of patients and timing of tests. The applicability of each study to the question under review was also assessed to consider whether the procedures employed in a study would differ significantly from those employed in real clinical practice. Each item was scored as either low risk of bias, unclear risk of bias or high risk of bias. Two reviewers (HSC and JF) assessed the risk of bias.
Studies identified to pose a high risk of bias were not excluded from the meta-analysis, but the findings were instead interpreted in light of the bias, which is in keeping with good practice.
Data analysis
For each study, sensitivity, specificity, prevalence, PPV, NPV, TP, FP, FN and TN cases were recorded. If a study lacked the mandatory diagnostic accuracy data, the TP/FP/FN/TN according to the following formulae: sensitivity = TP/ (TP + FN), specificity = TN/(FP + TN), AKI + non-AKI = TP + FP + TN + FN were calculated and entered into a 2 × 2 table.
The diagnostic data were entered into Review Manager software (RevMan version 5.4, Nordic Cochrane Centre, Copenhagen) to generate forest plots of sensitivity and specificity. The odds ratio was used for the synthesis and presentation of results.
To estimate the summary values for sensitivity and specificity, and their 95% confidence and prediction regions, a random-effects meta-analysis odds ratio (OR) was performed using the hierarchical summary ROC (hSROC) model implemented in STATA® software version 16.1 (StataCorp LP, College Station, TX, USA) using the METANDI command. This model is described in the Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy for comparisons of test accuracy when there is variability in threshold between studies and is preferred as it takes into account both sensitivity and specificity measures and the correlation between them, assumes that thresholds vary between studies and incorporates variability within and between studies26. In the presence of heterogeneity, which was expected in this review, a random-effects meta-analysis weights the studies relatively more equally than a fixed-effect analysis.
In accordance with the STATA requirements, meta-analyses were performed only when data from four or more studies were available. For studies that reported multiple time points, each was assigned to a group based on the timing of the pNGAL sample in relation to the cessation of CPB. These were < 4 h, 4–8 h 12 h or 24 h post-cessation of CPB. A separate meta-analysis was performed for each time point. Subgroup meta-analysis was also performed based on whether a point of care test (POCT) or laboratory-based method was used.
Heterogeneity was assessed by visual inspection of the forest plots and of the size of the prediction region in the hSROC plots. The index of variability (I2) statistic and chi-squared test statistic were used to approximate proportion of total variability in point estimates that could be attributed to heterogeneity, although the limitations of this approach are discussed. An I2 > 50% with a p-value < 0.05 from the chi-squared test was indicative of moderate heterogeneity27.
Sensitivity analysis
Studies identified to have a high risk of bias were individually removed from the meta-analysis and data to determine the effect on the summary points and heterogeneity.
Results
Summary of included studies
The literature search yielded 360 records. There were 321 records remaining following removal of duplicates. Following removal of studies that did not meet the inclusion criteria, 39 full text articles were reviewed, and 16 individual studies were included in the final meta-analysis (Fig. 1). The characteristics of the selected studies are presented in Table 1.
Table 1.
First author, year | Assigned time since CPB cessation (h) | Sample size | AKI Prevalence | TP | FP | FN | TN | pNGAL cut-off (ng/mL) | Sensitivity | Specificity | PPV | NPV | AUROC (95% CI) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Merkle, 201942 | < 4 | 58 | 0.36 | 15 | 3 | 6 | 34 | 226.98 | 0.71 | 0.92 | 0.83 | 0.85 | 0.846 (0.727–0.927) |
24 | 58 | 0.36 | 18 | 8 | 3 | 29 | 115.87 | 0.86 | 0.78 | 0.69 | 0.91 | 0.891 (0.781–0.957) | |
Introcaso, 201843 | 4–8 | 69 | 0.35 | 18 | 18 | 6 | 27 | 154 | 0.76 | 0.59 | 0.5 | 0.81 | 0.71 (0.60–0.82) |
Guerci, 201844 | 4–8 | 50 | 0.36 | 15 | 5 | 3 | 27 | 112 | 0.83 | 0.84 | 0.75 | 0.9 | 0.9 (0.81–0.98) |
Schley, 201545 | < 4 | 110 | 0.34 | 28 | 18 | 9 | 55 | 173.43 | 0.76 | 0.75 | 0.609 | 0.859 | 0.81 (0.73–0.90) |
4–8 | 110 | 0.34 | 29 | 18 | 8 | 55 | 178.20 | 0.78 | 0.75 | 0.617 | 0.873 | 0.83 (0.75–0.91) | |
24 | 110 | 0.34 | 32 | 18 | 5 | 55 | 137.61 | 0.86 | 0.75 | 0.640 | 0.916 | 0.84 | |
Park, 201546 | 4–8 | 112 | 0.12 | 8 | 11 | 5 | 88 | 168.5 | 0.62 | 0.89 | 0.421 | 0.946 | 0.81 (0.68–0.95) |
Kidher, 201447 | < 4 | 53 | 0.30 | 13 | 4 | 3 | 33 | 150 | 0.80 | 0.89 | 0.765 | 0.917 | 0.83 (0.70–0.95) |
Ghonemy, 201430 | 4–8 | 50 | 0.34 | 17 | 3 | 0 | 30 | 62 | 0.98 | 0.92 | 0.842 | 0.967 | Not stated |
< 4 | 50 | 0.34 | 16 | 2 | 1 | 31 | 62 | 0.94 | 0.94 | 0.888 | 0.968 | Not stated | |
Parikh, 201129 | < 4 | 1219 | 0.05 | 30 | 209 | 30 | 950 | 293 | 0.50 | 0.82 | 0.126 | 0.969 | 0.7 (SE 0.04) |
Perry, 201048 | < 4 | 879 | 0.09 | 29 | 149 | 46 | 655 | 353.5 | 0.39 | 0.82 | 0.163 | 0.934 | 0.641 (0.58–0.71) |
24 | 879 | 0.09 | 42 | 242 | 33 | 562 | 353.5 | 0.55 | 0.70 | 0.148 | 0.945 | 0.670 (0.60–0.74) | |
Haase, 200949 | 4–8 | 100 | 0.46 | 34 | 14 | 12 | 40 | 150.0 | 0.73 | 0.74 | 0.708 | 0.769 | 0.77 (0.63–0.91) |
Haase-Fielitz, 200918 | 4–8 | 100 | 0.23 | 18 | 17 | 5 | 60 | 150.0 | 0.79 | 0.78 | 0.52 | 0.93 | 0.80 (0.63–0.96) |
24 | 100 | 0.23 | 21 | 18 | 2 | 59 | 150.0 | 0.91 | 0.76 | 0.53 | 0.97 | 0.87 (0.78–0.96) | |
Perrotti, 201550 | 24 | 166 | 0.31 | 31 | 27 | 21 | 87 | 258 | 0.60 | 0.76 | 0.534 | 0.806 | 0.7 |
12 | 166 | 0.31 | 43 | 48 | 9 | 66 | 173 | 0.82 | 0.58 | 0.473 | 0.88 | 0.7 | |
4–8 | 166 | 0.31 | 41 | 48 | 11 | 66 | 155 | 0.79 | 0.58 | 0.461 | 0.857 | 0.7 | |
< 4 | 166 | 0.31 | 28 | 33 | 24 | 81 | 178 | 0.54 | 0.71 | 0.459 | 0.771 | 0.6 | |
Gaipov, 201551 | 12 | 60 | 0.67 | 21 | 5 | 19 | 15 | 199 | 0.52 | 0.73 | 0.808 | 0.441 | 0.649 (0.507–0.774) |
24 | 60 | 0.67 | 32 | 11 | 8 | 9 | 175.4 | 0.79 | 0.46 | 0.744 | 0.529 | 0.646 (0.504–0.772) | |
Prabhu, 201052 | 4–8 | 30 | 0.27 | 8 | 2 | 0 | 20 | 229 | 1.00 | 0.91 | 0.794 | 1 | 0.98 |
Tuladhar, 200953 | < 4 | 50 | 0.18 | 7 | 14 | 2 | 27 | 426 | 0.80 | 0.67 | 0.333 | 0.931 | 0.85 (0.73–0.97) |
Othman, 202128 | 4–8 | 25 | 0.28 | 7 | 2 | 0 | 16 | 145 | 1.00 | 0.89 | 0.778 | 1 | 0.965 |
A total of 3131 individual patients were included. The studies were split into groups based on the timing of the pNGAL sample; these were < 4 h, 4–8 h 12 h or 24 h post-cessation of CPB. Where this was not clearly stated in the study, a pragmatic approach was taken and agreed by three reviewers (HSC, JF and HL). In the subsequent QUADAS-2 assessment the applicability of the index test to review the question was then judged to be unclear. A total of 8 studies were classified as < 4 h and included 2585 patients, 10 studies were classified as 4–8 h and included 812 patients, two studies classified as 12 h, and six at 24 h comprising 1373 patients (Table 2).
Table 2.
Timing of pNGAL sample (hrs since CPB) | Number of studies | Number of patients |
---|---|---|
< 4 | 8 | 2585 |
4–8 | 10 | 812 |
12 | 2 | 226 |
24 | 6 | 1373 |
Study quality
Risk of bias was assessed using the QUADAS-2 tool and summarised in Figs. 2 and 3. Most studies (94%) were deemed to be at low risk of bias due to patient selection. One study cohort included only patients who had undergone coronary artery bypass graft (CABG) which may not be representative of all cardiac surgery patients28. The main potential source of bias related to blinding. Most studies were assessed as being either at high risk (44%) or unclear risk (31%) of bias for the conduct or interpretation of the index test largely due to the interpretation of the index test (pNGAL) occurring with prior knowledge of the reference standard results. One study was deemed to be at high risk of bias due to the classification of AKI as a doubling of serum creatinine which would exclude early AKI and explains the lower study prevalence of AKI (4.9%) compared to other studies29. There were no concerns regarding risk of bias introduced by the flow and timing of samples.
The applicability of patient selection was deemed to be unclear in two studies. In one study, it was not explicitly stated that patients were on CPB, but this was implied in the discussion and conclusions of the paper30. The applicability of the index test to review the question was judged to be unclear in most studies (81%) due to the variation in pNGAL thresholds or lack of clarity of timing of sample collections.
Diagnostic accuracy of pNGAL for identifying AKI in the defined cohort
The 12 h group was excluded from further analysis due to insufficient study numbers. Individual study sensitivities ranged from 0.39–0.94, 0.62–1.00 to 0.56–0.91 in the < 4 h, 4–8 h, and 24 h groups respectively. Forest plots of sensitivity and specificity for all studies are presented in Fig. 4. The summary estimates of sensitivity were 0.68 (95% CI 0.52–0.8), 0.81 (95% CI 0.74–0.87), 0.78 (95% CI 0.64–0.87) and specificity 0.82 (95% CI 0.75–0.88), 0.8 (95% CI 0.71–0.87), 0.73 (95% CI 0.69–0.77) in the < 4 h, 4–8 h, and 24 h groups respectively (Table 3).
Table 3.
Time since CPB (h) | Description of studies | No studies | Sensitivity | 95% CI | Specificity | 95% CI |
---|---|---|---|---|---|---|
< 4 | All studies | 8 | 0.676 | 0.521–0.800 | 0.822 | 0.753–0.875 |
4–8 | All studies | 10 | 0.814 | 0.743–0.869 | 0.801 | 0.713–0.868 |
4–8 | Othman, 2021 study removed | 9 | 0.802 | 0.734–0.856 | 0.791 | 0.698–0.860 |
4–8 | Haase, 2009 study removed | 9 | 0.831 | 0.752–0.888 | 0.810 | 0.712–0.880 |
24 | All studies | 6 | 0.776 | 0.638–0.871 | 0.729 | 0.685–0.769 |
24 | Perry, 2010 study removed | 5 | 0.810 | 0.686–0.893 | 0.744 | 0.692–0.790 |
4–8 | NGAL ELISA methods only | 4 | 0.986 | 0.362–0.999 | 0.869 | 0.753–0.935 |
4–8 | Triage NGAL methods only | 6 | 0.755 | 0.680–0.818 | 0.751 | 0.642–0.836 |
Summary ROC curves (Fig. 5) suggest pNGAL when taken at 4–8 h post-cessation of CPB to be the optimal test. The 4–8 h group was, therefore, subjected to in depth analysis. When the method type (Triage NGAL versus NGAL ELISA) was added as a covariate the shape of the summary ROC further improved. However, when the hSROC analysis was applied, the prediction regions indicated a large degree of heterogeneity in both sensitivity and specificity estimates (Table 3). It is worth noting the individual sample sizes in the four studies in this group were small.
Figure 6a–c show the hSROC for each time point with 95% confidence region for the summary operating point and 95% prediction region. At < 4 h, the confidence and prediction regions indicate a greater degree of heterogeneity in sensitivity estimates than in specificity estimates between studies. Specificity estimates were reasonably homogeneous. At 4–8 h, the confidence and prediction regions indicate a greater degree of heterogeneity in specificity estimates than in sensitivity estimates between studies. At 24 h, specificity estimates were reasonably homogeneous but there was a considerable degree of heterogeneity in the sensitivity estimates.
Heterogeneity and covariates
The between-study variation of the effect sizes is evident from visual inspection of the forest plot. In the 4–8 h group the I2 statistic was 40.6%, indicating mild heterogeneity27. Subgroup analysis based on method type showed that there was no observed heterogeneity in the NGAL ELISA group (I2 = 0%) and moderate heterogeneity (I2 = 62.1%) in the Triage NGAL test group. The test of homogeneity of study-specific effect sizes was also rejected in the Triage NGAL test subgroup, with a chi-squared test statistic of 13.02 and a p-value of 0.02 (Fig. 7).
Publication bias
Funnel plot asymmetry was evident in the 4–8 h group. There was a clear absence of studies in the lower left portion of the plot (Fig. 8). Although it should be noted that the number of studies is small and therefore funnel plot asymmetry does not necessarily indicate publication bias.
Discussion
Our meta-analysis of sensitivity and specificity estimates indicates that pNGAL taken 4–8 h following cessation of CPB in cardiac surgery patients is superior to pNGAL taken at < 4 h or 24 h for the early diagnosis of AKI. This is supported by data from the TRIBE-AKI study which reported that pNGAL concentrations peak at 6 h post cardiac surgery29. It was expected that the pooled estimates for sensitivity and specificity would be greater at 24 h, however the number of studies in this group was small. In addition, one study in the 24 h group was identified as high risk of bias due to the classification of AKI as a doubling of serum creatinine which would exclude early AKI defined by other standard criteria. The bias is supported by the lower prevalence of AKI (4.9%) compared to reported prevalence of AKI post-cardiac surgery (18.2%)13. Exclusion of this study from the meta-analysis improved summary points for sensitivity and specificity but the confidence and prediction regions increased.
The prediction regions and confidence limits at all time points were large due to considerable clinical and statistical heterogeneity observed across studies and the limited number of studies available for subgroup analyses. Surgery type, and patient related risk factors including age, sex, diabetes mellitus, basal renal function and congestive heart failure, contribute to the complex relationship between co-morbid pathophysiology and CPB as major causes of AKI in the population studied5,24,31. There was, however, significant variation between individual studies in patient inclusion criteria and risk factors that were included or excluded from the analysis. One study included only CABG procedures whereas others excluded this procedure which is of significance as it is associated with the lowest incidence of AKI post-cardiac surgery32. The importance of comorbidities as confounders is further supported by superior predictive performance in paediatric cohorts, in which co-morbidities are almost invariably absent17.
The threshold level for pNGAL varied considerably across studies. Despite this limitation, results across similar studies were pooled since a standardised cut-off for pNGAL has not yet been defined. The variability in thresholds and diagnostic accuracy in the population may in part be due to the complex origin of NGAL. NGAL exists in at least three different molecular forms; a 25 kDa monomer, a 45 kDa homodimer, and a 135 kDa NGAL/matrix metalloproteinase-9 (MMP-9) covalently complexed heterodimer. The renal cells predominantly produce the monomeric form and to lesser extent the heterodimeric form, whereas neutrophils contain all molecular forms. The homodimer is, therefore, specific for neutrophils33–35. The systemic inflammatory response triggered by CPB will activate circulating neutrophils to release their granular contents, including NGAL. Indeed, NGAL concentrations have been shown to be correlated with CPB duration and furthermore it is the homodimeric form that predominates suggesting that neutrophils as opposed to renal cells are the main source36. Various commercially available NGAL assays were utilised in the studies included in this review, several of which were marketed as ‘research use only’ assays. The NGAL assays cannot distinguish between the molecular forms released by different tissues. There is also currently no standardisation of NGAL assays and the specificity of the assays for the monomeric form was not stated in manufacturer’s instructions for use.
The diagnostic utility of pNGAL is inherently flawed when assessed against an imperfect reference standard. Serum creatinine is diluted in fluid-loaded patients; therefore, the incidence of AKI may be underestimated in this cohort37. Elevated NGAL in the absence of creatinine-based criteria for AKI is associated with an increased requirement for RRT and mortality, but it is unclear whether this represents subclinical AKI or severity of the systemic inflammatory response38,39.
A large value of I2 was interpreted as meaning that the effect size varies substantively across studies. The I2 statistic merely designates the extent of inconsistency of findings across studies in the meta-analysis and reflects the extent to which confidence intervals from the different studies overlap with each other. Univariate tests for heterogeneity in sensitivity and specificity and the estimates of the I2 statistic are not recommended as they do not account for heterogeneity explained by phenomena such as positivity threshold effects27. A degree of heterogeneity is inevitable, and it could be argued that any degree of heterogeneity is acceptable provided the protocol is clearly defined and risk of bias has been assessed and findings interpreted considering this.
Tests for funnel plot asymmetry are designed primarily for use in randomized trials and should not be used in systematic reviews of diagnostic test accuracy as there is potential to incorrectly indicate publication bias26. A more appropriate method for detecting funnel plot asymmetry in reviews of diagnostic studies has been developed but also has low power when there is heterogeneity in the diagnostic odds ratio, as is present in this study40.
The hSROC curves appear to show optimisation of sensitivity at the expense of specificity at 4–8 h possibly in attempt to rule-in AKI earlier whereas at 24 h there is considerably more variability in sensitivity, perhaps indicating optimisation of specificity i.e., a rule out approach. It is noted that in the individual studies there is seldom an explanation of how the optimal cut-off point for pNGAL was reached. We suggest that it may be more appropriate to assess the utility of pNGAL as a rule out test in this population. The NICE diagnostics assessment programme manual however does not recommend the separate analysis of negative and positive predictive values as this approach fails to take into account the correlation between the two parameters. In addition, pooled analysis of NPV or PPV is not recommended because of the impact of disease prevalence on these parameters, which is likely to vary between studies41. Therefore, larger prospective studies or randomised controlled trials (RCTs) in a representative population are required.
Limitations
The limitations of the meta-analysis are largely due to between study heterogeneity. Although risk of bias was assessed, and results of the meta-analysis were interpreted accordingly, the risk of bias in many cases was classified as unclear. Often measurement procedures performed within the studies differed significantly from those employed in routine practice, typically for reasons of pragmatism or cost. Many studies, for example, reported that samples were frozen and analysed as a single batch. The impact of analysing samples within a single batch is that it reduces variance increasing the likelihood of a significant finding. However, the results then may not translate to clinical practice, where samples are measured over many days and using different batches of reagents. The effect of freeze–thawing is also seldom stated, and this may be a potential source of a systematic increase or decrease in biomarker concentration. This could render clinical cut-off points invalid and lead to a higher FP or FN rate when introduced into routine practice. There was also, as previously discussed, significant heterogeneity in the characteristics of patients included in the studies.
Although the use of QUADAS-2 addresses the methodological issues concerning diagnostic accuracy, it does not address the issues associated with measurement. There are currently no guidelines available for evaluating the quality of measurement procedures in diagnostic accuracy studies. Therefore, this is an accepted limitation of the meta-analysis.
Conclusions
Whilst there is a potential role for the diagnostic utility of pNGAL in this clinical setting, because of the limited number of studies, substantial heterogeneity between studies and large 95% confidence and prediction regions, reliable conclusions cannot be drawn. There is currently no standardisation of assays or thresholds, and the assays included in this analysis cannot distinguish between the various molecular forms of NGAL released by different tissues. Larger prospective studies or RCTs, ideally distinguishing the monomeric form of NGAL, in a population truly representative of those undergoing cardiac surgery requiring CPB, are required.
Supplementary Information
Abbreviations
- AKI
Acute kidney injury
- AKIN
Acute kidney injury network
- AUC
Area under curve
- CI
Confidence interval
- CABG
Coronary artery bypass graft
- CPB
Cardio-pulmonary bypass
- DAP
Diagnostics Assessment Programme
- FN
False negative
- FP
False positive
- hSROC
Hierarchical summary receiver operator characteristic
- ICU
Intensive care unit
- KDIGO
Kidney Disease: Improving Global Outcomes
- LOS
Length of stay
- NGAL
Neutrophil gelatinase-associated lipocalin
- NICE
National Institute for Health and Care Excellence
- NPV
Negative predictive value
- PPV
Positive predictive value
- QUADAS 2
Quality assessment of diagnostic accuracy studies version 2
- ROC
Receiver operator characteristic
- RIFLE
Risk, injury, failure, loss of kidney function and End-stage kidney disease
- RRT
Renal replacement therapy
- TN
True negative
- TP
True positive
Author contributions
H.S-C. and J.F. reviewed all articles included in this meta-analysis and performed the data analysis. H.L. acted as an additional reviewer where stated. H.S-C. drafted the initial manuscript. All authors read and approved the final manuscript.
Data availability
All relevant data generated or analysed during this study are included in this published article.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-022-10477-5.
References
- 1.Hoste EA, Cruz DN, Davenport A, et al. The epidemiology of cardiac surgery-associated acute kidney injury. Int. J. Artif. Organs. 2008;31(2):158–165. doi: 10.1177/039139880803100209. [DOI] [PubMed] [Google Scholar]
- 2.Hoste EA, Bagshaw SM, Bellomo R, et al. Epidemiology of acute kidney injury in critically ill patients: The multinational AKI-EPI study. Intens. Care Med. 2015;41(8):1411–1423. doi: 10.1007/s00134-015-3934-7. [DOI] [PubMed] [Google Scholar]
- 3.Mishra PK, Luckraz H, Nandi J, et al. Long-term quality of life postacute kidney injury in cardiac surgery patients. Ann. Card. Anaesth. 2018;21(1):41–45. doi: 10.4103/aca.ACA_104_17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Luckraz H, Gravenor MB, George R, et al. Long and short-term outcomes in patients requiring continuous renal replacement therapy post cardiopulmonary bypass. Eur. J. Cardiothorac. Surg. 2005;27(5):906–909. doi: 10.1016/j.ejcts.2005.01.057. [DOI] [PubMed] [Google Scholar]
- 5.Chertow GM, Burdick E, Honour M, Bonventre JV, Bates DW. Acute kidney injury, mortality, length of stay, and costs in hospitalized patients. J. Am. Soc. Nephrol. 2005;16(11):3365–3370. doi: 10.1681/asn.2004090740. [DOI] [PubMed] [Google Scholar]
- 6.Coca SG, Yusuf B, Shlipak MG, Garg AX, Parikh CR. Long-term risk of mortality and other adverse outcomes after acute kidney injury: A systematic review and meta-analysis. Am. J. Kidney Dis. 2009;53(6):961–973. doi: 10.1053/j.ajkd.2008.11.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hobson CE, Yavas S, Segal MS, et al. Acute kidney injury is associated with increased long-term mortality after cardiothoracic surgery. Circulation. 2009;119(18):2444–2453. doi: 10.1161/circulationaha.108.800011. [DOI] [PubMed] [Google Scholar]
- 8.Boldt J, Brenner T, Lehmann A, Suttner SW, Kumle B, Isgro F. Is kidney function altered by the duration of cardiopulmonary bypass? Ann. Thorac. Surg. 2003;75(3):906–912. doi: 10.1016/s0003-4975(02)04559-9. [DOI] [PubMed] [Google Scholar]
- 9.Ranucci M, Pavesi M, Mazza E, et al. Risk factors for renal dysfunction after coronary surgery: The role of cardiopulmonary bypass technique. Perfusion. 1994;9(5):319–326. doi: 10.1177/026765919400900503. [DOI] [PubMed] [Google Scholar]
- 10.Boodhwani M, Rubens FD, Wozny D, Nathan HJ. Effects of mild hypothermia and rewarming on renal function after coronary artery bypass grafting. Ann. Thorac Surg. 2009;87(2):489–495. doi: 10.1016/j.athoracsur.2008.10.078. [DOI] [PubMed] [Google Scholar]
- 11.Paparella D, Yau TM, Young E. Cardiopulmonary bypass induced inflammation: pathophysiology and treatment. An update. Eur. J. Cardiothorac. Surg. 2002;21(2):232–244. doi: 10.1016/s1010-7940(01)01099-5. [DOI] [PubMed] [Google Scholar]
- 12.Sreeram GM, Grocott HP, White WD, Newman MF, Stafford-Smith M. Transcranial Doppler emboli count predicts rise in creatinine after coronary artery bypass graft surgery. J. Cardiothorac. Vasc. Anesth. 2004;18(5):548–551. doi: 10.1053/j.jvca.2004.07.010. [DOI] [PubMed] [Google Scholar]
- 13.Pickering JW, James MT, Palmer SC. Acute kidney injury and prognosis after cardiopulmonary bypass: A meta-analysis of cohort studies. Am. J. Kidney Dis. 2015;65(2):283–293. doi: 10.1053/j.ajkd.2014.09.008. [DOI] [PubMed] [Google Scholar]
- 14.Delanaye P, Cavalier E, Pottel H. Serum creatinine: Not so simple! Nephron. 2017;136(4):302–308. doi: 10.1159/000469669. [DOI] [PubMed] [Google Scholar]
- 15.Liu KD, Brakeman PR. Renal repair and recovery. Crit. Care Med. 2008;36(4 Suppl):S187–S192. doi: 10.1097/CCM.0b013e318168ca4a. [DOI] [PubMed] [Google Scholar]
- 16.Mori K, Nakao K. Neutrophil gelatinase-associated lipocalin as the real-time indicator of active kidney damage. Kidney Int. 2007;71(10):967–970. doi: 10.1038/sj.ki.5002165. [DOI] [PubMed] [Google Scholar]
- 17.Haase M, Bellomo R, Devarajan P, Schlattmann P, Haase-Fielitz A. Accuracy of neutrophil gelatinase-associated lipocalin (NGAL) in diagnosis and prognosis in acute kidney injury: A systematic review and meta-analysis. Am. J. Kidney Dis. 2009;54(6):1012–1024. doi: 10.1053/j.ajkd.2009.07.020. [DOI] [PubMed] [Google Scholar]
- 18.Haase-Fielitz A, Bellomo R, Devarajan P, et al. Novel and conventional serum biomarkers predicting acute kidney injury in adult cardiac surgery—A prospective cohort study. Crit. Care Med. 2009;37(2):553–560. doi: 10.1097/CCM.0b013e318195846e. [DOI] [PubMed] [Google Scholar]
- 19.Hall PS, Mitchell ED, Smith AF, et al. The future for diagnostic tests of acute kidney injury in critical care: Evidence synthesis, care pathway analysis and research prioritisation. Health Technol. Assess. 2018;22(32):1–274. doi: 10.3310/hta22320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Brazzelli, M. et al. The Architect and Alinity Urine NGAL Assays, Urine NephroCheck Test, and Urine and Plasma NGAL Tests to Help Assess the Risk of Acute Kidney Injury for People Who are Being Considered for Admission to Critical Care (2019).
- 21.Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. Int. J. Surg. 2021;88:105906. doi: 10.1016/j.ijsu.2021.105906. [DOI] [PubMed] [Google Scholar]
- 22.Khwaja A. KDIGO clinical practice guidelines for acute kidney injury. Nephron Clin. Pract. 2012;120(4):c179–c184. doi: 10.1159/000339789. [DOI] [PubMed] [Google Scholar]
- 23.Bellomo R, Ronco C, Kellum JA, Mehta RL, Palevsky P. Acute renal failure—Definition, outcome measures, animal models, fluid therapy and information technology needs: The Second International Consensus Conference of the Acute Dialysis Quality Initiative (ADQI) Group. Crit. Care. 2004;8(4):R204–R212. doi: 10.1186/cc2872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Mehta RL, Kellum JA, Shah SV, et al. Acute Kidney Injury Network: Report of an initiative to improve outcomes in acute kidney injury. Crit. Care. 2007;2:R31. doi: 10.1186/cc5713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Whiting PF, Rutjes AW, Westwood ME, et al. QUADAS-2: A revised tool for the quality assessment of diagnostic accuracy studies. Ann. Intern. Med. 2011;155(8):529–536. doi: 10.7326/0003-4819-155-8-201110180-00009. [DOI] [PubMed] [Google Scholar]
- 26.Macaskill P, Gatsonis C, Deeks JJ, Harbord RMT. Chapter 10: Analysing and presenting results. In: Deeks JJBP, Gatsonis C, editors. Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy Version 10. The Cochrane Collaboration; 2010. [Google Scholar]
- 27.Higgins JP, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. BMJ. 2003;327(7414):557–560. doi: 10.1136/bmj.327.7414.557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Othman HMEDM, Hassan AEM, Elsersi MH, Soliman AKMA, Emam DF. Neutrophil gelatinase-associated lipocalin as a biomarker for predicting acute kidney injury after coronary artery bypass grafting. Ain-Shams J. Anesthesiol. 2021;13(1):5. doi: 10.1186/s42077-020-00123-5. [DOI] [Google Scholar]
- 29.Parikh CR, Coca SG, Thiessen-Philbrook H, et al. Postoperative biomarkers predict acute kidney injury and poor outcomes after adult cardiac surgery. J. Am. Soc. Nephrol. 2011;22(9):1748–1757. doi: 10.1681/asn.2010121302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ghonemy TA, Amro GM. Plasma neutrophil gelatinase-associated lipocalin (NGAL) and plasma cystatin C (CysC) as biomarker of acute kidney injury after cardiac surgery. Saudi J. Kidney Dis. Transpl. 2014;25(3):582–588. doi: 10.4103/1319-2442.132194. [DOI] [PubMed] [Google Scholar]
- 31.Thakar CV, Arrigain S, Worley S, Yared JP, Paganini EP. A clinical score to predict acute renal failure after cardiac surgery. J. Am. Soc. Nephrol. 2005;16(1):162–168. doi: 10.1681/asn.2004040331. [DOI] [PubMed] [Google Scholar]
- 32.Seabra VF, Alobaidi S, Balk EM, Poon AH, Jaber BL. Off-pump coronary artery bypass surgery and acute kidney injury: A meta-analysis of randomized controlled trials. Clin. J. Am. Soc. Nephrol. 2010;5(10):1734–1744. doi: 10.2215/cjn.02800310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kjeldsen L, Johnsen AH, Sengeløv H, Borregaard N. Isolation and primary structure of NGAL, a novel protein associated with human neutrophil gelatinase. J. Biol. Chem. 1993;268(14):10425–10432. doi: 10.1016/S0021-9258(18)82217-7. [DOI] [PubMed] [Google Scholar]
- 34.Kjeldsen L, Bainton DF, Sengeløv H, Borregaard N. Identification of neutrophil gelatinase-associated lipocalin as a novel matrix protein of specific granules in human neutrophils. Blood. 1994;83(3):799–807. doi: 10.1182/blood.V83.3.799.799. [DOI] [PubMed] [Google Scholar]
- 35.Xu SY, Carlson M, Engström A, Garcia R, Peterson CG, Venge P. Purification and characterization of a human neutrophil lipocalin (HNL) from the secondary granules of human neutrophils. Scand. J. Clin. Lab Investig. 1994;54(5):365–376. doi: 10.3109/00365519409088436. [DOI] [PubMed] [Google Scholar]
- 36.Cai L, Rubin J, Han W, Venge P, Xu S. The origin of multiple molecular forms in urine of HNL/NGAL. Clin. J. Am. Soc. Nephrol. 2010;5(12):2229–2235. doi: 10.2215/cjn.00980110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Liu KD, Thompson BT, Ancukiewicz M, et al. Acute kidney injury in patients with acute lung injury: Impact of fluid accumulation on classification of acute kidney injury and associated outcomes. Crit. Care Med. 2011;39(12):2665–2671. doi: 10.1097/CCM.0b013e318228234b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Haase M, Devarajan P, Haase-Fielitz A, et al. The outcome of neutrophil gelatinase-associated lipocalin-positive subclinical acute kidney injury: A multicenter pooled analysis of prospective studies. J. Am. Coll. Cardiol. 2011;57(17):1752–1761. doi: 10.1016/j.jacc.2010.11.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Nickolas TL, Schmidt-Ott KM, Canetta P, et al. Diagnostic and prognostic stratification in the emergency department using urinary biomarkers of nephron damage: A multicenter prospective cohort study. J. Am. Coll. Cardiol. 2012;59(3):246–255. doi: 10.1016/j.jacc.2011.10.854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Deeks JJ, Macaskill P, Irwig L. The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. J. Clin. Epidemiol. 2005;58(9):882–893. doi: 10.1016/j.jclinepi.2005.01.016. [DOI] [PubMed] [Google Scholar]
- 41.NICE . Diagnostics Assessment Programme Manual. NICE; 2011. [PubMed] [Google Scholar]
- 42.Merkle J, Daka A, Deppe AC, Wahlers T, Paunel-Görgülü A. High levels of cell-free DNA accurately predict late acute kidney injury in patients after cardiac surgery. PLoS ONE. 2019;14(6):e0218548. doi: 10.1371/journal.pone.0218548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Introcaso G, Nafi M, Bonomi A, et al. Improvement of neutrophil gelatinase-associated lipocalin sensitivity and specificity by two plasma measurements in predicting acute kidney injury after cardiac surgery. Biochem. Med. (Zagreb) 2018;28(3):030701. doi: 10.11613/bm.2018.030701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Guerci P, Claudot JL, Novy E, Settembre N, Lalot JM, Losser MR. Immediate postoperative plasma neutrophil gelatinase-associated lipocalin to predict acute kidney injury after major open abdominal aortic surgery: A prospective observational study. Anaesth. Crit. Care Pain Med. 2018;37(4):327–334. doi: 10.1016/j.accpm.2017.09.006. [DOI] [PubMed] [Google Scholar]
- 45.Schley G, Köberle C, Manuilova E, et al. Comparison of plasma and urine biomarker performance in acute kidney injury. PLoS ONE. 2015;10(12):e0145042. doi: 10.1371/journal.pone.0145042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Park CM, Kim JS, Moon HW, et al. Usefulness of plasma neutrophil gelatinase-associated lipocalin as an early marker of acute kidney injury after cardiopulmonary bypass in Korean cardiac patients: 2 prospective observational study. Clin. Biochem. 2015;48(1–2):44–49. doi: 10.1016/j.clinbiochem.2014.09.019. [DOI] [PubMed] [Google Scholar]
- 47.Kidher E, Harling L, Ashrafian H, et al. Pulse wave velocity and neutrophil gelatinase-associated lipocalin as predictors of acute kidney injury following aortic valve replacement. J Cardiothorac. Surg. 2014;9:89. doi: 10.1186/1749-8090-9-89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Perry TE, Muehlschlegel JD, Liu KY, et al. Plasma neutrophil gelatinase-associated lipocalin and acute postoperative kidney injury in adult cardiac surgical patients. Anesth Analg. 2010;110(6):1541–1547. doi: 10.1213/ANE.0b013e3181da938e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Haase M, Bellomo R, Devarajan P, et al. Novel biomarkers early predict the severity of acute kidney injury after cardiac surgery in adults. Ann. Thorac. Surg. 2009;88(1):124–130. doi: 10.1016/j.athoracsur.2009.04.023. [DOI] [PubMed] [Google Scholar]
- 50.Perrotti A, Miltgen G, Chevet-Noel A, et al. Neutrophil gelatinase-associated lipocalin as early predictor of acute kidney injury after cardiac surgery in adults with chronic kidney failure. Ann. Thorac. Surg. 2015;99(3):864–869. doi: 10.1016/j.athoracsur.2014.10.011. [DOI] [PubMed] [Google Scholar]
- 51.Gaipov A, Solak Y, Turkmen K, et al. Serum uric acid may predict development of progressive acute kidney injury after open heart surgery. Ren. Fail. 2015;37(1):96–102. doi: 10.3109/0886022x.2014.976130. [DOI] [PubMed] [Google Scholar]
- 52.Prabhu A, Sujatha DI, Ninan B, Vijayalakshmi MA. Neutrophil gelatinase associated lipocalin as a biomarker for acute kidney injury in patients undergoing coronary artery bypass grafting with cardiopulmonary bypass. Ann. Vasc. Surg. 2010;24(4):525–531. doi: 10.1016/j.avsg.2010.01.001. [DOI] [PubMed] [Google Scholar]
- 53.Tuladhar SM, Püntmann VO, Soni M, Punjabi PP, Bogle RG. Rapid detection of acute kidney injury by plasma and urinary neutrophil gelatinase-associated lipocalin after cardiopulmonary bypass. J. Cardiovasc. Pharmacol. 2009;53(3):261–266. doi: 10.1097/FJC.0b013e31819d6139. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data generated or analysed during this study are included in this published article.