Abstract
Background
Adherence to antiretroviral treatment (ART) among postpartum women with HIV is essential for optimal health and prevention of perinatal transmission. However suboptimal adherence with subsequent viremia is common and adherence challenges are often under-reported. We aimed to predict viremia to facilitate targeted adherence support in sub-Saharan Africa during this critical period.
Methods
Data are from PROMISE 1077BF/FF which enrolled perinatal women between 2011–2014. This analysis includes postpartum women receiving ART per study randomization or country-specific criteria to continue from pregnancy. We aimed to predict viremia (single and confirmed events) after 3 months on ART at >50, >400 and >1000 copies/mL within 6-month intervals through 24 months. We built models with routine clinical and demographic data using the least absolute shrinkage and selection operator (LASSO) and SuperLearner (which incorporates multiple algorithms).
Results
Among 1321 women included, the median age was 26 years and 96% were WHO Stage 1. Between 0–24 months postpartum, 42%, 31% and 28% experienced viremia >50, >400, and >1000 copies/mL at least once, respectively. Across models, the cross-validated area under the receiver operating curve (AUC) ranged from 0.74 (95% confidence interval [CI]: 0.72–0.76) to 0.78 (95% CI: 0.76–0.80). To achieve 90% sensitivity predicting confirmed viremia >50 copies/mL, 64% would be classified as high risk.
Conclusions
Using routinely collected data to predict viremia in >1300 postpartum women with HIV, we achieved moderate model discrimination, but insufficient to inform targeted adherence support. Psychosocial characteristics or objective adherence metrics may be required for improved prediction of viremia in this population.
Keywords: HIV, viral load, postpartum period, medication adherence, risk prediction, differentiated service delivery
INTRODUCTION
Roughly 1.2 million women living with HIV in sub-Saharan Africa gave birth in 2020, and nearly 90% had access to antiretroviral treatment (ART).1 With adequate adherence, ART during pregnancy and postpartum optimizes maternal health2 and virtually eliminates perinatal and sexual HIV transmission.3–9 However, adherence to daily medication can be particularly challenging during the postpartum period, given the disruption of routine with a newborn. Indeed, suboptimal adherence and subsequent viremia are frequently observed in postpartum women.2,10–15 Evidence-based approaches to improve ART adherence, such as peer support, pill boxes, and enhanced counselling,16–18 can be resource-intensive, and are not always appropriately targeted to those in need of support, in part because inadequate adherence is not readily identified. Both recall bias and social desirability bias can lead to under-reporting of “unhealthy” behaviors, and, as such, self-reported adherence can be unreliable.19,20 Scalable strategies to identify postpartum women in need of adherence support, before virologic failure, antiretroviral resistance,21 or HIV transmission during breastfeeding occurs, are urgently needed.
A risk score using readily collected information to predict which women are most likely to experience viremia could facilitate timely and targeted enhanced adherence interventions to those most in need. In addition to reaching the correct individuals to prevent viremia, appropriately targeting services could improve the quality of delivery of resource-intensive interventions which may be challenging to deliver well to a wider population. Indeed, differentiated service delivery according to patient need is increasingly important as universal ART places high demands on constrained health facilities in resource-limited settings such as sub-Saharan Africa.22–24 Risk scores predicting virologic failure in non-pregnant or postpartum populations have been developed to inform targeted virologic monitoring with moderate prediction performance,25–30 although their implementation has not been reported. The postpartum period presents unique challenges to adherence with new responsibilities and disruptions in routines, often including travel to stay with relatives post-delivery who may be unaware of the mother’s HIV status. Simultaneously, adherence is especially critical during this time for prevention of HIV transmission to infants while breastfeeding. Thus, factors linked to the postpartum experience, such as disclosure of HIV status to partner or relatives, breastfeeding status, or pregnancy outcomes, may help predict viremia during this unique period.
We therefore aimed to develop a risk score to predict viremia using routinely collected clinical and demographic data, including pregnancy-related predictors, to support the identification of women who might benefit most from enhanced adherence interventions. This analysis leverages data from a large cohort of postpartum women on ART across sub-Saharan Africa enrolled in the Promoting Maternal-Infant Survival Everywhere (PROMISE) Study,3,5,31 and uses machine learning methods to formulate a score to predict viremic events within 6 months, applicable from delivery through 2 years postpartum.
METHODS
Study setting, population, and procedures.
Data are from the International Maternal Pediatric Adolescent AIDS Clinical Trials Network (IMPAACT) PROMISE 1077BF/1077FF Study (NCT01061151), which was conducted between 2011–2015 at 14 sites where breastfeeding was standard (in Zimbabwe, Malawi, South Africa, Uganda, Zambia, Tanzania, and India) to examine optimal strategies for prevention of perinatal transmission of HIV and improving maternal health, prior to universal treatment guidelines.32 The study enrolled pregnant and postpartum women living with HIV who were not yet eligible for ART at the time. Thus, women with CD4 counts <350 cells/mm (or a higher threshold, according to individual country guidelines) and women with any clinical signs or symptoms that designated WHO clinical Class IV status33 were excluded. Study procedures and primary results have been reported.3,5,31 Briefly, PROMISE 1077BF included three components with three separate randomizations: during pregnancy (antepartum), during breastfeeding up to 18 months (postpartum), and after breastfeeding cessation (maternal health). PROMISE 1077FF enrolled women who chose not to breastfeed, and therefore only included antepartum and maternal health components. In the antepartum component, women were randomized to either tenofovir disoproxil fumarate (TDF)/emtricitabine (FTC)/lopinavir/ritonavir (LPV/r), zidovudine/lamivudine/LPV/r, or zidovudine plus single-dose nevirapine at delivery followed by a tenofovir/emtricitabine “tail” (for 6 to 14 days after delivery). In the postpartum component, women were randomized to either maternal TDF/FTC/LPV/r or daily infant nevirapine during breastfeeding. In the maternal health component, women already on ART in the prior component were randomized to stop (consistent with temporal guidelines) or continue ART. Participants who met country-specific guidelines for treatment during any component were started or maintained on ART. In July 2015, in response to the START trial results,34 all participants not currently receiving ART were given the recommendation to initiate ART.
All participants provided informed consent prior to PROMISE study participation. The study was approved by local and collaborating institutional review boards (Supplement, Section I) and other relevant regulatory authorities and was reviewed for safety and efficacy by an independent Data and Safety Monitoring Board.
This analysis includes all women from African sites on ART postpartum (per randomization or meeting country-specific guidelines to continue from pregnancy). Participants attended study visits according to protocol with viral load assessed at least every 6 months with more frequent assessments early in each component. We included viral loads collected after 3 months on ART through 2 years postpartum. Follow-up was censored when new pregnancies occurred (not all potential predictors, e.g., breastfeeding status, were collected in subsequent pregnancies) or if ART was stopped per the PROMISE protocol.
Study outcomes
Our study outcome was viremia (plasma HIV-1 RNA) at thresholds of >50, >400, and >1000 copies/mL. We considered these thresholds for the following reasons: 50 because breastfeeding transmission has been observed with low-level viremia35–37 and low-level viremia has been associated with subsequent disease progression;38 400 because this is the lower limit of detection in many assays used in resource-limited settings and was the outcome in PROMISE; 1000 as this is what the WHO defines as virologic failure. We developed models to predict viremic events within six-month intervals (from delivery to six months postpartum, and from 6–12, 12–18, and 18–24 months postpartum), using data collected prior to or at the baseline of each interval as predictors. We assessed both “any” viremia (at least one measure) as well as “confirmed” viremia (2 consecutive measures) during each six-month interval.
Candidate predictors
We considered readily collected demographic predictors including maternal age, education level, employment status, gestational age at first antenatal care visit, and gravidity. Pregnancy-related predictors included time since delivery (modeled as indicators for each 6-month interval postpartum), ART initiation in pregnancy vs. at delivery, breastfeeding status, preterm delivery, twin pregnancy, infant death (neonatal or in follow-up), and infant HIV acquisition. Routine clinical predictors included: nadir CD4 count, hemoglobin in pregnancy/at delivery, recent viremia (>50 copies/mL in the prior 6 months), history of viremia (>6 months prior and >3 months after ART initiation), WHO stage, self-reported ART adherence, missed visit history, and prior hospitalizations. We also considered variables that may not be routinely collected in some settings for potential inclusion in “enhanced” models: food insecurity, alcohol use, HIV status disclosure to male partners or others in the household, absolute neutrophil count, white blood cells, lymphocyte count, and creatinine.
Model development
We first described the unadjusted association between each candidate predictor and viremia (>50 copies, confirmed) using generalized estimating equation logistic models.. All continuous variables were standardized prior to analyses and highly skewed variables were log transformed prior to standardization. Because complete data is required for some machine learning methods,, we handled missing data in model building as follows: for categorical predictors we created a “missing” category; for continuous variables we set missing values to the mean and created a missing indicator variable.
We applied two approaches to model building. First, we used the least absolute shrinkage and selection operator (LASSO) method, which shrinks coefficients to optimize calibration (comparable frequencies of predicted and observed risks), and performs variable selection by shrinking some coefficients to zero.40 This approach ensures a parsimonious final model which could be translated into a simple point-based risk calculator. We selected the shrinkage parameter with the smallest cross-validated prediction error. Second, we used SuperLearner41 which optimizes prediction performance by combining predictions from multiple algorithms (“learners”), weighting each algorithm’s prediction according to its cross-validated prediction error. SuperLearner does not perform variable selection and would require a computer or tablet to calculate predictions. As candidate learners, we included standard logistic regression, LASSO, random forests, and gradient boostingRandom forests and gradient boosting both combine results from a large number of decision trees, each randomly defined based on designated parameters (e.g., the number of trees, the number of variables used in each tree for prediction). In random forests, trees are built in bootstrapped samples; in gradient boosting trees are built sequentially, each fit to the residuals from the prior tree.42
To consider model performance in the context of clinical care, we conducted two secondary analyses. First, we built models following the same approach described above with LASSO and SuperLearner, but excluded participant observations with recent viremia (>50 copies within 6 months prior to the baseline of each interval,after ≥3 months on ART) as women would already be identified as at risk at these time points, not requiring a risk assessment tool. We then assigned a predicted probability of 1 to those excluded observations. Second, we built simple logistic models using recent viremia (>50 copies/mL in the prior 6 months) alone to classify persons as high risk, as a reference point for risk assessment without a novel scoring tool.
Assessment of model performance
We summarized performance in terms of internally cross-validated discrimination (the degree to which the model assigns a high predicted risk to those who actually have the outcome, and low predicted risk to those who do not), quantified as the area under the receiver operating curve (AUC), and calibration (by comparing predicted and observed risks by decile of the predicted risk).43 Cross-validation folds (n=10) were defined by study site. Small sites were included in the same fold of a neighboring site to avoid small numbers in a single fold (two sites represented Gauteng Province, South Africa, two represented KwaZulu-Natal Province, South Africa, and Tanzania and Zambia both had small numbers and were combined). Splitting the data by region, rather than randomly selecting observations, provides a more stringent internal validation.44 Moreover, this approach ensures that participants contributing >1 observation have all of their observations in the same fold.This avoids overly optimistic estimates of performance which would occur when evaluating prediction in the same individuals among whom the model was developed. To visualize calibration, we plotted the predicted risks from the model against observed events in the sample. To further assess the clinical utility of the models, we estimated sensitivity, specificity, positive and negative predictive values (PPV, NPV), and report these values when requiring minimum sensitivities of 80% and 90%, and report the proportion classified as “high risk” at these thresholds. We did not specify minimum thresholds across each of these parameters, but estimated them at 80 and 90% sensitivity with an aim to reach the vast majority of women at high risk. Additionally, for appropriate targeting of resource-intensive adherence support that minimizes unnecessary burden for both patients and clinicians, PPVs >50% would be desirable.
Analyses were conducted in STATA (StataCorp, College Station, TX) version 16 and R (R Foundation for Statistical Computing, Vienna, Austria) version 4.0.2.
RESULTS
Among 1486 PROMISE participants on ART postpartum (via randomization or continued from pregnancy), 1321 were included in this analysis; those excluded did not have any viral load data after 3 months on ART, largely due to early termination of ART (Supplement, Section II). The median age of included participants was 26 years (interquartile range [IQR] 23, 30) and 831 (63%) initiated ART in pregnancy (Table 1). In this early treatment initiation trial, nadir CD4 was relatively high (median 526, IQR 433–664) and 96% were classified clinically as WHO Stage 1. Median follow-up time in this analysis (which was censored at 2 years, ART stop, or a new pregnancy) was 23 months (IQR 15–24).
Table 1.
Demographic and clinical characteristics of 1321 postpartum women on ART in the PROMISE 1077BF/FF Study
Characteristics | n (%) or median (IQR) N=1321 |
---|---|
Country | |
Malawi | 454 (34.4%) |
South Africa | 307 (23.2%) |
Tanzania | 25 (1.9%) |
Uganda | 212 (16.0%) |
Zambia | 30 (2.3%) |
Zimbabwe | 293 (22.2%) |
Age | 26 (23–30) |
Gestational age at first antenatal care visit | 23.4 (18.6–28.0) |
Gravidity | 3 (2–4) |
Education | |
None or some primary | 202 (19.8%) |
Completed primary but not secondary | 461 (45.2%) |
Completed secondary | 357 (35.0%) |
Missing | 301 |
Employment status | |
Not working | 1,066 (83.1%) |
Working part-time | 63 (4.9%) |
Working full-time | 154 (12.0%) |
Missing | 38 |
Food insecurity | |
None/mild | 1,012 (83.9%) |
Moderate | 98 (8.1%) |
Severe | 96 (8.0%) |
Missing | 115 |
HIV status not disclosed (to partner/household members) | 77 (6.6%) |
Still birth or infant death within 7 days | 23 (1.7%) |
Gestational age at delivery in weeks | |
< 34 | 24 (1.8%) |
34 – <37 | 150 (11.5%) |
37+ | 1,130 (86.7%) |
WHO clinical classification | |
Clinical stage I | 1,274 (96.4%) |
Clinical stage II | 46 (3.5%) |
Clinical stage III | 1 (0.1%) |
Nadir CD4+ count | 526 (433–664) |
Treatment assignment during antepartum component | |
Enrolled on/after delivery | 49 (3.7%) |
Triple-drug ART | 831 (62.9%) |
Zidovudine | 441 (33.3%) |
Treatment assignment during postpartum component | |
ART continued from antepartum for maternal health | 241 (18.2%) |
Maternal ART | 1,080 (81.8%) |
Treatment assignment during maternal health component | |
Continue maternal ART | 409 (31.0%) |
Discontinue triple ART | 233 (17.6%) |
Not enrolled | 679 (51.4%) |
Viremia frequency
Between delivery and 24 months postpartum, 42%, 31% and 28% of women experienced at least one viremic episode >50, >400, and >1000 copies/mL, respectively (Table 2). Confirmed viremic episodes (2 consecutive measures) were observed in 26%, 20%, and 17% at thresholds of 50, 400, and 1000 copies/mL, respectively. The median time between a first viremic event (>50 copies/mL) and a confirmatory measure was 56 days (IQR 25–84), with similar times at higher thresholds. The frequency of viremia was similar across 6-monthly intervals, with slightly lower frequencies after 6 months postpartum (e.g., 27% experienced viremia >50 copies/mL at least once between 0–6 months, while 21%, 23%, and 24% did so in each subsequent interval).
Table 2.
Viremia frequency by six-monthly intervals and overall, from 0–24 months postpartum among 1321 women on ART in the PROMISE Study
0–6 months | 6–12 months | 12–18 months | 18–24 months | |||
---|---|---|---|---|---|---|
Full sample | ||||||
N=1,299 | N=1,198 | N=998 | N=682 | Ever, 0–24 months, N=1321 participants | ||
Viral load threshold (copies/mL) | ||||||
50 | single | 352 (27%) | 253 (21%) | 226 (23%) | 163 (24%) | 550 (42%) |
confirmed | 211 (16%) | 163 (14%) | 134 (13%) | 79 (12%) | 337 (26%) | |
400 | single | 228 (18%) | 203 (17%) | 165 (17%) | 131 (19%) | 414 (31%) |
confirmed | 146 (11%) | 126 (11%) | 96 (10%) | 55 (8%) | 261 (20%) | |
1000 | single | 199 (15%) | 173 (14%) | 139 (14%) | 113 (17%) | 376 (28%) |
confirmed | 116 (9%) | 101 (8%) | 74 (7%) | 43 (6%) | 220 (17%) |
Notes: Total Ns within each interval represent the number of participants with VL assessed after 3 months on ART within the interval
Model building and performance
Consistent with prior work, we observed several strong predictors of viremia in bivariate analysis, for example, younger age (standardized; OR 0.69, 95% CI 0.61–0.79), lower nadir CD4 (standardized, OR 0.83 95% CI 0.73–0.96), and recent viremia (OR 6.78, 95% CI 5.60–8.21; Table 3). In LASSO models developed with routine data, the cross-validated area under the receiver operating curve (AUC) ranged from 0.76 to 0.78 (Table 4). LASSO shrunk very few coefficients to zero; nearly all were retained (Supplemental Table 2). Performance was comparable in SuperLearner (Figure 1) and when incorporating non-routine data (additional lab values, food insecurity, alcohol use) in “enhanced” models (Supplemental Table 3). Across SuperLearner models, standard logistic regression and LASSO predictions were weighted most heavily (Supplemental Table 3).
Table 3.
Bivariate associations between candidate predictors and confirmed viral load >50 copies/mL; all continuous variables are standardized.
Baseline Characterisitics | OR (95% CI) | p-value |
---|---|---|
Age | 0.69 (0.61–0.79) | <0.0001 |
Education | ||
None or some primary | (ref) | (0.40 overall) |
Completed primary but not secondary | 1.04 (0.71–1.51) | 0.86 |
Completed secondary | 0.83 (0.55–1.24) | 0.36 |
Gravidity | 0.90 (0.80–1.01) | 0.07 |
Gestational age at first antenatal care visit | 1.10 (0.97–1.25) | 0.16 |
Nadir CD4 | 0.83 (0.73–0.96) | 0.01 |
Hemoglobin | 0.91 (0.81–1.03) | 0.12 |
Initiated ART in pregnancy (vs. at delivery) | 1.28 (0.98–1.68) | 0.07 |
Preterm | ||
37 weeks | (ref) | (0.21 overall) |
34 up to <37 | 0.80 (0.54–1.18) | 0.26 |
<34 weeks | 1.61 (0.78–3.32) | 0.20 |
Twins (vs. singleton) | 3.88 (1.85–8.14) | <0.001 |
Time-varying characteristics | ||
Time since delivery | ||
0–6 months | (ref) | |
6–12 months | 0.81 (0.69–0.95) | 0.01 |
12–18 months | 0.80 (0.66–0.97) | 0.03 |
18–24 months | 0.68 (0.52–0.87) | 0.003 |
WHO clinical classification | ||
1 | (ref) | (0.19 overall) |
2 | 0.74 (0.42–1.32) | 0.31 |
3/4 | 2.86 (0.73–11.20) | 0.13 |
Recent viral load >50 (prior 6 months) | 6.78 (5.60–8.21) | <0.0001 |
History of viral load >50 (before the prior 6 months) | 2.48 (1.93–3.18) | <0.0001 |
Stopped breastfeeding prior to current visit | 1.08 (0.86–1.37) | 0.51 |
Baby acquired HIV prior to current visit | 5.51 (1.96–15.50) | 0.001 |
Infant death any time prior to current visit | 1.73 (0.78–3.84) | 0.18 |
Any missed visits in prior interval | 1.33 (0.81–2.18) | 0.27 |
Any hospitalizations in prior interval | 1.23 (0.62–2.43) | 0.56 |
Any alcohol use* | 0.65 (0.33–1.29) | 0.22 |
Absolute neutrophil count* | 1.11 (1.02–1.20) | 0.01 |
White blood cell count (log transformed)* | 1.12 (1.02–1.22) | 0.01 |
Lymphocytes (log transformed)* | 0.93 (0.84–1.03) | 0.17 |
Creatinine (log transformed)* | 0.90 (0.82–1.00) | 0.05 |
Adherence, any missed in past 4 weeks | 1.24 (1.04–1.47) | 0.02 |
Employment status | ||
Not working | (ref) | (0.63 overall) |
Working part-time | 0.96 (0.63–1.45) | 0.83 |
Working full-time | 0.85 (0.62–1.18) | 0.33 |
Food insecurity* | ||
None/mild | (ref) | (0.10 overall) |
Moderate | 0.91 (0.62–1.36) | 0.66 |
Severe | 1.49 (1.02–2.19) | 0.04 |
HIV status not disclosed to partner or household members | 0.91 (0.57–1.47) | 0.72 |
non-routine data considered in “enhanced” models
Table 4.
Model performance (with 95% confidence intervals) and proportion classified as “high risk” at selected viremia outcomes
Viremia outcome | ||||||
---|---|---|---|---|---|---|
50 copies/mL | 400 copies/mL | 1000 copies/mL | ||||
single | confirmed | single | confirmed | single | confirmed | |
Viremia incidence (across 4177 observations) | 550 (42%) | 337 (26%) | 414 (31%) | 261 (20%) | 376 (28%) | 220 (17%) |
LASSO models | ||||||
AUC | 0.77 (0.76–0.79) | 0.78 (0.76–0.80) | 0.78 (0.76–0.80) | 0.78 (0.75–0.80) | 0.76 (0.74–0.78) | 0.76 (0.74–0.79) |
Requiring 80% sensitivity | ||||||
Proportion classified high risk | 0.51 | 0.44 | 0.48 | 0.42 | 0.48 | 0.43 |
Specificity | 0.59 (0.55–0.63) | 0.62 (0.55–0.67) | 0.59 (0.54–0.63) | 0.62 (0.56–0.68) | 0.58 (0.53–0.63) | 0.60 (0.53–0.68) |
PPV | 0.38 (0.36–0.40) | 0.26 (0.22–0.29) | 0.29 (0.27–0.31) | 0.19 (0.17–0.22) | 0.25 (0.23–0.27) | 0.15 (0.13–0.18) |
NPV | 0.90 (0.90–0.91) | 0.95 (0.94–0.95) | 0.93 (0.93–0.94) | 0.96 (0.96–0.97) | 0.94 (0.94–0.95) | 0.97 (0.97–0.98) |
Requiring 90% sensitivity | ||||||
Proportion classified high risk | 0.69 | 0.64 | 0.68 | 0.62 | 0.67 | 0.61 |
Specificity | 0.38 (0.33–0.44) | 0.39 (0.33–0.47) | 0.37 (0.31–0.44) | 0.41 (0.34–0.48) | 0.37 (0.30–0.44) | 0.42 (0.36–0.50) |
PPV | 0.31 (0.29–0.33) | 0.20 (0.18–0.22) | 0.23 (0.21–0.25) | 0.15 (0.13–0.16) | 0.20 (0.19–0.22) | 0.12 (0.11–0.14) |
NPV | 0.92 (0.91–0.93) | 0.96 (0.95–0.97) | 0.95 (0.94–0.95) | 0.97 (0.97–0.98) | 0.95 (0.95–0.96) | 0.98 (0.98–0.98) |
Super Learner models AUC | 0.76 (0.75–0.78) | 0.77 (0.75–0.79) | 0.76 (0.74–0.78) | 0.76 (0.74–0.78) | 0.74 (0.72–0.76) | 0.74 (0.72–0.77) |
Logistic regression (sole predictor: viremia >50 in prior 6 months) | ||||||
AUC | 0.66 (0.61–0.70) | 0.66 (0.62–0.70) | 0.68 (0.64–0.72) | 0.66 (0.62–0.71) | 0.66 (0.62–0.71) | 0.65 (0.60–0.70) |
Proportion classified high risk | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 |
Sensitivity | 0.40 (0.37–0.43) | 0.44 (0.40–0.48) | 0.46 (0.43–0.50) | 0.47 (0.42–0.52) | 0.46 (0.41–0.50) | 0.46 (0.41–0.52) |
Specificity | 0.93 (0.92–0.94) | 0.90 (0.89–0.91) | 0.92 (0.91–0.92) | 0.89 (0.87–0.90) | 0.90 (0.89–0.91) | 0.88 (0.87–0.89) |
PPV | 0.62 (0.52–0.66) | 0.40 (0.30–0.44) | 0.52 (0.40–0.57) | 0.31 (0.22–0.35) | 0.44 (0.33–0.48) | 0.24 (0.16–0.27) |
NPV | 0.83 (0.83–0.83) | 0.91 (0.90–0.91) | 0.89 (0.88–0.89) | 0.94 (0.93–0.94) | 0.90 (0.90–0.91) | 0.95 (0.94–0.95) |
AUC = Area under the receiver operating curve; PPV = positive predictive value; NPV = negative predictive value
Figure 1.
Receiver operating characteristic curves for LASSO and SuperLearner models predicting viremia at thresholds of 50, 400, and 1000.
In the LASSO model predicting confirmed viremia >50 copies/mL, with a predicted probability cutoff achieving 90% sensitivity, the PPV was 0.20 (95% CI: 0.18–0.22) and 64% would be classified as high risk. To achieve 80% sensitivity, the PPV was 0.26 (95% CI 0.22–0.29) and 44% would be classified as high risk. Across viremia outcomes, 42%−50% would be classified as high risk to achieve 80% sensitivity, reflecting only moderate gains in efficiency while missing 20% of those at risk.
In secondary analyses, first, we assigned a predicted risk of 1 for all persons with recent viremia (>50 copies/mL in the prior 6 months) and only modeled risk predictions in those without recent viremia, i.e., those who might otherwise be considered clinically stable. Model performance (accounting for both those assumed to be high risk and those with model estimated risk) was nearly identical to the primary results reported in Table 4 (data not shown). When we used recent viremia (>50 copies/mL in the prior 6 months) alone to classify persons as high risk, AUCs ranged from 0.65–0.66. This approach would identify 40% who would go on to have at least one viremic event >50 copies/mL, or 47% who would experience confirmed viremia >400 copies/mL. Notably, only 15% would be classified as high risk. At the same sensitivities achieved with recent viremia alone, the specificities from LASSO and SuperLearner models were nearly the same as those from the simple logistic models (data not shown).
In the model predicting confirmed viremia >50 copies/mL, calibration was suboptimal, with a non-linear association between predicted risk and observed outcomes (Supplemental Figure).
DISCUSSION
In a cohort of over 1,300 women across six countries in sub-Saharan Africa who initiated ART in pregnancy or at delivery with asymptomatic HIV and high CD4 counts, approximately one third experienced at least one episode of postpartum viremia and about one in five had confirmed viremia on two consecutive specimens during two years of follow-up. Using rigorously collected clinical and socioeconomic data and machine learning methods, we developed models to identify women most likely to experience viremia during this critical period of breastfeeding transmission risk and achieved moderate model discrimination (AUC). Our aim was to develop a simple tool that would enable efficient delivery of enhanced ART adherence interventions to those who stand to gain the most benefit.. However, to achieve high sensitivity (90%), these models would require targeting support to roughly two-thirds of all women, fewer than a third of whom who would go on to experience viremia. Still, in contrast to targeting support based on recent viremia alone, through which we cannot achieve 80–90% sensitivity, the multivariable setting allows for the selection of risk thresholds across a range of higher levels of sensitivity, despite declining specificity, reflected in the higher estimated AUCs in our machine learning models. These results suggest that modeling has the potential to improve risk classification with the incorporation of additional, perhaps non-routine, data.
Prior risk scores to predict virologic failure have been developed in the context of limited access to viral load assays as a potential strategy to inform selective monitoring.25–30 Some of these models performed moderately better than ours; most were developed with logistic regression while one was developed with SuperLearner29 Aside from being developed in non-pregnant/postpartum populations, three factors may explain differences in performance. First, prior studies included late presenters and persons with advanced disease or long histories of ART use. These characteristics, reflecting a broad spectrum of HIV disease, could be strong predictors of poor adherence and viremia. Our study sample included generally asymptomatic young women who recently initiated ART with high CD4 counts who were not yet eligible for treatment according to country guidelines at the time. Thus, our cohort had little variability in disease progression and we were unable to incorporate variables such as opportunistic infections or long duration of ART use. Second, most prior risk scores included changes in CD4 counts over time as predictors of viremia. Because CD4 counts are no longer routinely collected for treatment monitoring in sub-Saharan Africa, these values would not be available to calculate risk in these settings today. For this reason, we only included nadir CD4 in our models, though we acknowledge that not all settings collect even this baseline measure. Third, one study29 achieved an AUC of 0.88 by incorporating electronically captured pill container openings as an objective adherence metric.
Several psychosocial factors associated with the postpartum period may be important drivers of adherence, yet many of these characteristics are not commonly assessed in routine care in resource-limited settings, nor were they systematically collected in the PROMISE study. Some notable examples include depression, which is a risk factor for suboptimal ART adherence45 and is common postpartum;46 the impact of pregnancy and breastfeeding on relationships and social support networks may influence adherence; stigma and attitudes about ART use, particularly in the context of the perinatal transmission risk, could affect adherence during this time. Additionally, staying with extended family for postpartum support could interrupt care engagement, prompt hiding pills from family members due to fear of stigma, or otherwise impact pill-taking routines.47 Many of these more nuanced psychosocial factors are not routinely collected but may be critical predictors of adherence or viremia in postpartum women.
The PROMISE dataset has both strengths and limitations pertinent to this analysis. A major strength is the systematic and rigorous data collection of key predictors and viral loads. In many resource-limited settings, electronic medical records are still in development, and viral load data is frequently missing. This strength also carries two limitations: (1) although we prioritized models built with routine data, in many resource-limited settings even these variables are not always systematically collected due to resource constraints, and (2) the nature of clinical trial participation is inherently different from routine clinical care. Motivations to adhere may differ in clinical trial settings, where follow-up, tracing, and access to health care are generally improved beyond routine care. Nonetheless, we expect that the variables we considered would have a similar relationship with adherence both within and outside of clinical trial settings. Moreover, the high frequency of viremia that we observed is comparable to what has been observed in other cohorts of pregnant and postpartum women.2,10–15 We also note that PROMISE was conducted prior to the availability of more modern regimens in sub-Saharan Africa, specifically integrase inhibitors. Dolutegravir, which is associated with higher rates of viral suppression compared to other medications,48 may be more “forgiving” of suboptimal adherence. Whether dolutegravir is more forgiving in the context of breastfeeding transmission risk is not known, but prediction of plasma viral loads may be more challenging with increasingly rare viremic events.
CONCLUSIONS
In conclusion, a third of postpartum women on ART in sub-Saharan Africa in this study experienced viremia in the first two years following delivery, which carries implications for both perinatal transmission and lifelong maternal health. We used machine learning methods and routine clinical data to develop models to predict postpartum viremia in order to facilitate timely and targeted adherence support to postpartum women most at risk of viremia in sub-Saharan Africa. However, despite achieving moderate model discrimination, prediction performance was insufficient to correctly identify the majority of women at risk without also classifying an even larger number of low-risk women as high-risk. Nonetheless, our moderate model discrimination was substantially better than prediction with recent viremia alone, suggesting that this approach, with the incorporation of additional variables, could improve prediction. Factors such as depression, partner support, HIV-related stigma, or objective metrics of adherence, for example, may be needed to better inform targeted support. Given the high frequency of viremia during breastfeeding in this cohort and others, and the critical importance of virologic suppression for prevention of perinatal HIV transmission, there remains an urgent need to identify and evaluate approaches for early identification of suboptimal adherence.
Supplementary Material
ACKNOWLEDGEMENTS
The authors would like to acknowledge all of the participants in the study, all of the PROMISE Study Investigators, and the IMPAACT Statistical and Data Analysis Center, in particular Patricia DeMarrais and Sean Brummel. In addition, we appreciate the invaluable critical feedback on early drafts from members of the UCSF Bixby Center for Global Reproductive Health Early Career Investigator works-in-progress seminars, as well as the comments and suggestions from the anonymous peer reviewers.
FUNDING
This work was supported by the National Institute of Mental Health (NIMH) under award number K01MH119910 (PMM). Overall support for the International Maternal Pediatric Adolescent AIDS Clinical Trials Network (IMPAACT) was provided by the National Institute of Allergy and Infectious Diseases (NIAID) with co-funding from the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) and NIMH, all components of the National Institutes of Health (NIH), under Award Numbers UM1AI068632 (IMPAACT LOC), UM1AI068616 (IMPAACT SDMC) and UM1AI106716 (IMPAACT LC), and by NICHD contract number HHSN275201800001I. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. Study products were provided free of charge by AbbVie, Gilead Sciences, Boehringer Ingelheim, and ViiV/GlaxoSmithKline.
Footnotes
Conflicts of interest: None declared.
REFERENCES
- 1.UNAIDS. Start Free, Stay Free, AIDS Free: Final report on 2020 targets. Geneva, Switzerland, 2021. [Google Scholar]
- 2.Currier JS, Britto P, Hoffman RM, et al. Randomized trial of stopping or continuing ART among postpartum women with pre-ART CD4 >/= 400 cells/mm3. PLoS One. 2017;12(5):e0176009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Fowler MG, Qin M, Fiscus SA, et al. Benefits and risks of antiretroviral therapy for perinatal HIV prevention. N Engl J Med. 2016;375(18):1726–1737. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Jamieson DJ, Chasela CS, Hudgens MG, et al. Maternal and infant antiretroviral regimens to prevent postnatal HIV-1 transmission: 48-week follow-up of the BAN randomised controlled trial. Lancet. 2012;379(9835):2449–2458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Flynn PM, Taha TE, Cababasay M, et al. Prevention of HIV-1 transmission through breastfeeding: efficacy and safety of maternal antiretroviral therapy versus infant nevirapine prophylaxis for duration of breastfeeding in HIV-1-infected women with high CD4 cell count (IMPAACT PROMISE): A randomized, open-label, clinical trial. J Acquir Immune Defic Syndr. 2018;77(4):383–392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Donnell D, Baeten JM, Kiarie J, et al. Heterosexual HIV-1 transmission after initiation of antiretroviral therapy: a prospective cohort analysis. Lancet. 2010;375(9731):2092–2098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cohen MS, Chen YQ, McCauley M, et al. Prevention of HIV-1 infection with early antiretroviral therapy. N Engl J Med. 2011;365(6):493–505. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rodger AJ, Cambiano V, Bruun T, et al. Sexual activity without condoms and risk of HIV transmission in serodifferent couples when the HIV-positive partner is using suppressive antiretroviral therapy. JAMA. 2016;316(2):171–181. [DOI] [PubMed] [Google Scholar]
- 9.Centers for Disease Control and Prevention. Evidence of HIV treatment and viral suppression in preventing the sexual transmission of HIV. Atlanta, GA: 2017. [Google Scholar]
- 10.Chetty T, Newell ML, Thorne C, Coutsoudis A. Viraemia before, during and after pregnancy in HIV-infected women on antiretroviral therapy in rural KwaZulu-Natal-South Africa, 2010–2015. Trop Med Int Health. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hosseinipour M, Nelson JAE, Trapence C, et al. Viral suppression and HIV drug resistance at 6 months among women in Malawi’s Option B+ Program: results from the PURE Malawi Study. J Acquir Immune Defic Syndr. 2017;75 Suppl 2:S149–S155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ruperez M, Noguera-Julian M, Gonzalez R, et al. HIV drug resistance patterns in pregnant women using next generation sequence in Mozambique. PLoS One. 2018;13(5):e0196451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Myer L, Redd AD, Mukonda E, et al. Antiretroviral adherence, elevated viral load and drug resistant mutations in HIV-infected women initiating treatment in pregnancy: a nested case-control study. Clin Infect Dis. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hoffman RM, Warshaw MG, Amico KR, et al. Predictors of Viremia in Postpartum Women on Antiretroviral Therapy. J Acquir Immune Defic Syndr. 2020;83(1):72–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Yotebieng M, Mpody C, Ravelomanana NL, et al. HIV viral suppression among pregnant and breastfeeding women in routine care in the Kinshasa province: a baseline evaluation of participants in CQI-PMTCT study. J Int AIDS Soc. 2019;22(9):e25376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Sam-Agudu NA, Ramadhani HO, Isah C, et al. The impact of structured mentor mother programs on 6-month postpartum retention and viral suppression among HIV-positive women in rural Nigeria: A prospective paired cohort study. J Acquir Immune Defic Syndr. 2017;75 Suppl 2:S173–S181. [DOI] [PubMed] [Google Scholar]
- 17.Mills EJ, Lester R, Thorlund K, et al. Interventions to promote adherence to antiretroviral therapy in Africa: a network meta-analysis. Lancet HIV. 2014;1:e104–111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Bonner K, Mezochow A, Roberts T, Ford N, Cohn J. Viral load monitoring as a tool to reinforce adherence: a systematic review. J Acquir Immune Defic Syndr. 2013;64(1):74–78. [DOI] [PubMed] [Google Scholar]
- 19.Berg KM, Arnsten JH. Practical and conceptual challenges in measuring antiretroviral adherence. J Acquir Immune Defic Syndr. 2006;43 Suppl 1:S79–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kagee A, Nel A. Assessing the association between self-report items for HIV pill adherence and biological measures. AIDS Care. 2012;24(11):1448–1452. [DOI] [PubMed] [Google Scholar]
- 21.Beyrer C, Pozniak A. HIV Drug Resistance - An Emerging Threat to Epidemic Control. N Engl J Med. 2017;377(17):1605–1607. [DOI] [PubMed] [Google Scholar]
- 22.The Global Fund. A new toolkit for differentiated care in HIV and TB programs. https://www.theglobalfund.org/en/news/2015-12-04-new-toolkit-for-differentiated-care-in-hiv-and-tb-programs/. Published 2015. Accessed March 14, 2018.
- 23.McNairy ML, Abrams EJ, Rabkin M, El-Sadr WM. Clinical decision tools are needed to identify HIV-positive patients at high risk for poor outcomes after initiation of antiretroviral therapy. PLoS Med. 2017;14(4):e1002278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Srivastava M, Sullivan D, Phelps BR, Modi S, Broyles LN. Boosting ART uptake and retention among HIV-infected pregnant and breastfeeding women and their infants: the promise of innovative service delivery models. J Int AIDS Soc. 2018;21(1). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mungwira RG, Divala TH, Nyirenda OM, et al. A targeted approach for routine viral load monitoring in Malawian adults on antiretroviral therapy. Trop Med Int Health. 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Phan V, Thai S, Koole O, et al. Validation of a clinical prediction score to target viral load testing in adults with suspected first-line treatment failure in resource-constrained settings. J Acquir Immune Defic Syndr. 2013;62(5):509–516. [DOI] [PubMed] [Google Scholar]
- 27.van Griensven J, Phan V, Thai S, Koole O, Lynen L. Simplified clinical prediction scores to target viral load testing in adults with suspected first line treatment failure in Phnom Penh, Cambodia. PLoS One. 2014;9(2):e87879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Meya D, Spacek LA, Tibenderana H, et al. Development and evaluation of a clinical algorithm to monitor patients on antiretrovirals in resource-limited settings using adherence, clinical and CD4 cell count criteria. J Int AIDS Soc. 2009;12:3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Petersen ML, LeDell E, Schwab J, et al. Super Learner analysis of electronic adherence data improves viral prediction and may provide strategies for selective HIV RNA monitoring. J Acquir Immune Defic Syndr. 2015;69(1):109–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Robbins GK, Johnson KL, Chang Y, et al. Predicting virologic failure in an HIV clinic. Clin Infect Dis. 2010;50(5):779–786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Taha TE, Brummel S, Angelidou K, et al. PROMISE trial: results of continued vs discontinued ART after end of breastfeeding. Conference on Retroviruses and Opportunistic Infections; Mar 4–7, 2018; Boston, MA. [Google Scholar]
- 32.World Health Organization. Consolidated guideline on the use of antiretroviral drugs for treating and preventing HIV infection: recommendations for a public health approach. Geneva, Switzerland: World Health Organization;2016. [PubMed] [Google Scholar]
- 33.World Health Organization. WHO case definitions of HIV for surveillance and revised clinical staging and immunological classification of HIV-related disease in adults and children. Geneva, Switzerland: World Health Organization;2007. [Google Scholar]
- 34.INSIGHT START Study Group, Lundgren JD, Babiker AG, et al. Initiation of antiretroviral therapy in early asymptomatic HIV infection. N Engl J Med. 2015;373(9):795–807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Davis NL, Miller WC, Hudgens MG, et al. Maternal and breastmilk viral load: impacts of adherence on peripartum HIV infections averted-The Breastfeeding, Antiretrovirals, and Nutrition Study. J Acquir Immune Defic Syndr. 2016;73(5):572–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Waitt C, Low N, Van de Perre P, Lyons F, Loutfy M, Aebi-Popp K. Does U=U for breastfeeding mothers and infants? Breastfeeding by mothers on effective treatment for HIV infection in high-income settings. Lancet HIV. 2018;5(9):e531–e536. [DOI] [PubMed] [Google Scholar]
- 37.Flynn P, Taha T, Cababasay M, et al. Association of maternal viral load and CD4 count with perinatal HIV-1 transmission risk during breastfeeding in the PROMISE postpartum component. 22nd International AIDS Conference; 2018; Amsterdam, Netherlands. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Bernal E, Gomez JM, Jarrin I, et al. Low-level viremia Is associated with clinical progression in HIV-infected patients receiving antiretroviral treatment. J Acquir Immune Defic Syndr. 2018;78(3):329–337. [DOI] [PubMed] [Google Scholar]
- 39.Hosmer DW, Lemeshow S, Sturdivant RX. Model-building strategies and methods for logistic regression. In: Applied logistic regression. Hoboken, NJ: Wiley; 2013:89–151. [Google Scholar]
- 40.Tibshirani R Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B (Methodological). 1996;58(1):267–288. [Google Scholar]
- 41.van der Laan MJ, Polley EC, Hubbard AE. Super learner. Stat Appl Genet Mol Biol. 2007;6:Article25. [DOI] [PubMed] [Google Scholar]
- 42.James G, Witten D, Hastie T, Tibshirani R. An Introduction to Statistical Learning with Applications in R. New York, NY: Springer; 2013. [Google Scholar]
- 43.Harrell FE Jr., Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15(4):361–387. [DOI] [PubMed] [Google Scholar]
- 44.Altman DG, Royston P. What do we mean by validating a prognostic model? Stat Med. 2000;19(4):453–473. [DOI] [PubMed] [Google Scholar]
- 45.Gonzalez JS, Batchelder AW, Psaros C, Safren SA. Depression and HIV/AIDS treatment nonadherence: a review and meta-analysis. J Acquir Immune Defic Syndr. 2011;58(2):181–187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Tuthill EL, Pellowski JA, Young SL, Butler LM. Perinatal Depression Among HIV-Infected Women in KwaZulu-Natal South Africa: Prenatal Depression Predicts Lower Rates of Exclusive Breastfeeding. AIDS Behav. 2017;21(6):1691–1698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Phillips TK, Clouse K, Zerbe A, Orrell C, Abrams EJ, Myer L. Linkage to care, mobility and retention of HIV-positive postpartum women in antiretroviral therapy services in South Africa. J Int AIDS Soc. 2018;21 Suppl 4:e25114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Kanters S, Vitoria M, Zoratti M, et al. Comparative efficacy, tolerability and safety of dolutegravir and efavirenz 400mg among antiretroviral therapies for first-line HIV treatment: A systematic literature review and network meta-analysis. EClinicalMedicine. 2020;28:100573. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.