Abstract
To control the Human Immunodeficiency Virus (HIV) pandemic, the World Health Organization (WHO) set the 90-90-90 target to be reached by 2020. One major threat to those goals is late presentation, which is defined as an individual presenting a TCD4+ count lower than 350 cells/mm3 or an AIDS-defining event. The present study aims to identify determinants of late presentation in Europe based on the EuResist database with HIV-1 infected patients followed-up between 1981 and 2019. Our study includes clinical and socio-demographic information from 89851 HIV-1 infected patients. Statistical analysis was performed using RStudio and SPSS and a Bayesian network was constructed with the WEKA software to analyze the association between all variables. Among 89,851 HIV-1 infected patients included in the analysis, the median age was 33 (IQR: 27.0–41.0) years and 74.4% were males. Of those, 28,889 patients (50.4%) were late presenters. Older patients (>56), heterosexuals, patients originated from Africa and patients presenting with log VL >4.1 had a higher probability of being late presenters (p < 0.001). Bayesian networks indicated VL, mode of transmission, age and recentness of infection as variables that were directly associated with LP. This study highlights the major determinants associated with late presentation in Europe. This study helps to direct prevention measures for this population.
Keywords: HIV-1 infection, late presentation, Europe
1. Introduction
At the end of 2019, there were 38.0 million people living with the Human Immunodeficiency Virus (HIV) and 1.7 million people were newly infected worldwide. However, 7.1 million people were still unaware of their HIV status [1].
For the control of the HIV pandemic, the World Health Organization (WHO) had set a 90-90-90 target until 2020. 90% of people living with HIV know their status, of those 90% are receiving antiretroviral therapy (ART) and of those 90% achieve viral suppression. These targets had been successful in some countries. Globally, by the end of 2019, there were 81% of people living with HIV who knew their status. Of those, 67% were receiving antiretroviral therapy and of those 59% had reached HIV viral suppression. The success of these goals is dependent on the region of origin, the vulnerability of populations and on the national HIV programs that are implemented. Yet, between 2010 and 2019, the percentage of new infections dropped by 31% [2].
New goals were set to end the pandemic by 2030, the 95-95-95 targets, based on the same definition of the previous targets. In order to attain the WHO goals by 2030, early diagnosis is essential [3].
One major concern threatening those goals is late presentation. Late presentation can have consequences in the health and treatment of infected individuals, leading to poorer outcomes and increased health care costs, since it has been shown that late presenters, especially those aged above 50 years old, are at higher risk for developing non-infectious co-morbidities and complex multimorbidity [4]. In addition, late presentation can have a negative impact on the control of the pandemic, increasing the risk of onward HIV transmission in individuals that are not aware of their HIV status. Besides, late presentation to HIV care was shown to be the main reason for virological failure [5,6].
Late presentation is defined as an individual presenting a TCD4+ count lower than 350 cells/mm3 or an AIDS-defining event, regardless of TCD4+ cell count. This is the definition according to the European Late Presenter Consensus working group [7]. It is estimated that Late Presenters (LP) account for 40–60% of HIV cases in Europe, in Asia the percentage of LP range from 72 to 83%, in Africa range from 35 to 89% and in Brazil, it is estimated that the percentage is near 40% [8,9,10]. For prevention and treatment of HIV, timely diagnosis and linkage to health care are essential tools [11].
The present study has the objective of identifying determinants of late presentation in Europe. To achieve this goal, we analyzed a population of patients from the EuResist database, a European database.
2. Results
2.1. Characteristics of European Population
Among 89851 HIV-1 infected patients included in the analysis, the median age was 33 (IQR: 27.0–41.0) years and 74.4% were males. From those 28889 patients (50.4%) were LP and 28388 (49.6%) were non-late presenters (NLP). The majority of patients with information about treatment status were naïve, 11487 (58.6%). 41.9% of patients were men who have sex with men (MSM) and 78.5% originated from Western Europe. The most prevalent subtype in this population was subtype B (64.4%), followed by Subtype G (20.4%), CRF 02_AG (15.9%) and Subtype A (13.5%). Most of the patients included in this study were classified as Native (75.4%) and as having Chronic Infection (59.8%) based on the ambiguity rate of the first genomic sequence. CD4 count at diagnosis and viral load at diagnosis (log10) presented a median of 348 cells/mm3 (IQR 170-548) and 4.4 copies/mL (IQR 3.4–5.1), respectively.
50.4% of patients were classified as LP (CD4 < 350 cells/mm3). Males accounted for the higher proportion of LP (74.9%). The median age of LPs was 34 years (IQR 28.0–43.0; p < 0.001). LPs were mainly from Western Europe and the HIV exposure category was mainly heterosexuals (77.4 and 37.1%; p < 0.001, respectively) (Table 1).
Table 1.
Patient Characteristics | Total | Late Presenters | Non-Late Presenters |
p-Value |
---|---|---|---|---|
Total, n (%) | 89,851 (100%) | 28,889 (50.4%) | 28,388 (49.6%) | |
Sex, n (%) | 81,777 (91.0%) | 27,972 (50.6%) | 27,315 (49.4%) | <0.001 |
Male | 60,852 (74.4%) | 20,955 (74.9%) | 20,969 (76.8%) | |
Female | 20,925 (25.6%) | 7017 (25.1%) | 6346 (23.2%) | |
Treatment status, n (%) | 19,605 (21.8%) | 10,905 (55.6%) | 8700 (44.4%) | <0.001 |
Naïve | 11,487(58.6%) | 6040 (55.4%) | 5447 (62.6%) | |
Treated | 8118 (41.4%) | 4865 (44.6%) | 3253 (37.4%) | |
Median age at diagnosis in years IQR, n (%) | 25,530 (28.4%) | 11,929 (52.3%) | 10,897 (47.7%) | <0.001 |
33.0 (27.0–41.0) | 34.0 (28.0–43.0) | 31.0 (26.0–39.0) | ||
≤18 | 700 (2.7%) | 241 (2.0%) | 340 (3.1%) | <0.001 |
19–30 | 9767 (38.3%) | 4002 (33.5%) | 4823 (44.3%) | |
31–55 | 13,815 (54.1%) | 6920 (58.0%) | 5384 (49.4%) | |
≥56 | 1248 (4.9%) | 766 (6.4%) | 350 (3.2%) | |
Mode of transmission, n (%) | 47,007 (52.3%) | 21,283 (49.5%) | 21,677 (50.5%) | <0.001 |
Heterosexual | 15,165 (32.3%) | 7894 (37.1%) | 6071 (28.0%) | |
MSM | 19,696 (41.9%) | 7657 (36.0%) | 10,693 (49.3%) | |
IDU | 9532 (20.3%) | 4453 (20.9%) | 3896 (18.0%) | |
Other | 2614 (5.6%) | 1279 (6.0%) | 1017 (4.7%) | |
Region of origin, n (%) | 54,529 (60.7%) | 21,584 (50.1%) | 21,495 (49.9%) | <0.001 |
Western Europe | 42,790 (78.5%) | 16,693 (77.4%) | 17,398 (81.0%) | |
Eastern Europe | 1862 (3.4%) | 693 (3.2%) | 672 (3.1%) | |
Africa | 5349 (9.8%) | 2250 (10.4%) | 1422 (6.6%) | |
South America | 3233 (5.9%) | 1341 (6.2%) | 1460 (6.8%) | |
Other | 1286 (2.4%) | 607 (2.8%) | 543 (2.5%) | |
Subtype, n (%) | 54,176 (60.3%) | 17,449 (52.7%) | 15,638 (47.3%) | <0.001 |
HIV-1 Subtype B | 35,454 (64.4%) | 11,966 (68.6%) | 11,745 (75.1%) | |
HIV-1 Subtype non-B | 18,722 (34.6%) | 5483 (31.4%) | 3893 (24.9%) | |
Distribution of non-B Subtypes | ||||
HIV-1 CRF 01_AE | 447 (2.4%) | 183 (3.3%) | 108 (2.8%) | |
HIV-1 CRF 02_AG | 2973 (15.9%) | 871 (15.9%) | 556 (14.3%) | |
HIV-1 CRF 06_cpx | 248 (1.3%) | 81 (1.5%) | 58 (1.5%) | |
HIV-1 CRF 14_BG | 1106 (5.9%) | 337 (6.1%) | 203 (5.2%) | |
HIV-1 Subtype A | 2521 (13.5%) | 626 (11.4%) | 527 (13.5%) | |
HIV-1 Subtype C | 1943 (10.4%) | 550 (10.0%) | 400 (10.3%) | |
HIV-1 Subtype D | 307 (1.6%) | 102 (1.9%) | 74 (1.9%) | |
HIV-1 Subtype F | 1619 (8.6%) | 444 (8.1%) | 362 (9.3%) | |
HIV-1 Subtype G | 3815 (20.4%) | 1156 (21.1%) | 701 (28.0%) | |
Others | 3743 (20.0%) | 1133 (20.7%) | 3893 (23.2%) | |
Migrant status, n (%) | 54,520 (60.7%) | 21,584 (50.1%) | 21,495 (49.9%) | <0.001 |
Migrant | 13,408 (24.6%) | 5588 (25.9%) | 4895 (22.8%) | |
Native | 41,112 (75.4%) | 15,996 (74.1%) | 16,600 (77.2%) | |
Recentness of infection, n (%) | 50,132 (55.8%) | 15,897 (52.6%) | 14,304 (47.4%) | <0.001 |
Chronic | 29,972 (59.8%) | 11,069 (69.6%) | 7803 (54.6%) | |
Recent | 20,160 (40.2%) | 4828 (30.4%) | 6501 (45.4%) | |
Median CD4 count at diagnosis (cells/mL) IQR, n (%) | 57,277 (63.7%) | 28,889 (50.4%) | 28,388 (49.6%) | <0.001 |
348.0 (170.0–548.0) | 172.0 (69.0–264.0) | 550.0 (442.0–720.0) | ||
Viral Load at diagnosis (log10 copies/mL) IQR, n (%) | 34,046 (37.9%) | 15,106 (50.8%) | 14,605 (49.2%) | <0.001 |
4.4 (3.4–5.1) | 4.7 (3.8–5.3) | 4.1 (3.1–4.8) | ||
≤4.0 | 12,994 (38.2%) | 4485 (29.7%) | 6819 (46.7%) | <0.001 |
4.1–5.0 | 11,715 (34.4%) | 5034 (33.3%) | 5295 (36.3%) | |
≥5.1 | 9337 (27.4%) | 5587 (37.0%) | 2491 (17.1%) |
2.2. Determinants Associated with Late Presentation
In the unadjusted model (Table 2), sex was associated with LP. In the HIV exposure category, significant differences were found for MSM and Intravenous Drug Users (IDU) compared with heterosexuals. Significantly more LP were from Africa and other regions compared to Western Europe. In addition, the variables age at diagnosis, viral load, subtype, recentness of infection and migrant status were significantly associated with LP.
Table 2.
Late Presenters/ Non-Late Presenters |
Unadjusted | Final Model | ||
---|---|---|---|---|
Variable | OR (95%CI) | p-Value | aOR (95%CI) | p-Value |
Sex | ||||
Female | Ref | Ref | Ref | Ref |
Male | 0.90 (0.87–0.94) | <0.001 | 1.05 (0.91–1.21) | 0.522 |
Age at diagnosis | 1.03 (1.02–1.03) | <0.001 | ||
Age groups | ||||
<18 | 0.55 (0.47–0.65) | <0.001 | 0.48 (0.33–0.69) | <0.001 |
19–30 | 0.65 (0.61–0.68) | <0.001 | 0.70 (0.63–0.79) | <0.001 |
31–55 | Ref | Ref | Ref | Ref |
>56 | 1.70 (1.49–1.94) | <0.001 | 1.54 (1.15–2.06) | 0.004 |
Mode of transmission | ||||
Heterosexual | Ref | Ref | Ref | Ref |
MSM | 0.55 (0.53–0.58) | <0.001 | 0.74 (0.64–0.86) | <0.001 |
IDU | 0.88 (0.83–0.93) | <0.001 | 1.12 (0.96–1.31) | 0.137 |
Other | 0.97 (0.88–1.06) | 0.462 | 1.29 (0.99–1.70) | 0.062 |
Region of Origin | ||||
Western Europe | Ref | Ref | Ref | Ref |
Eastern Europe | 1.08 (0.97–1.20) | 0.191 | 1.07 (0.78–1.48) | 0.683 |
Africa | 1.65 (1.54–1.77) | <0.001 | 1.76 (1.37–2.26) | <0.001 |
South America | 0.96 (0.89–1.03) | 0.267 | 1.41 (1.07–1.87) | 0.015 |
Other | 1.17 (1.04–1.31) | 0.011 | 1.39 (0.92–2.09) | 0.118 |
Subtype | ||||
HIV-1 Subtype B | Ref | Ref | ||
HIV-1 Subtype non-B | 1.38 (1.32–1.45) | <0.001 | ||
Migrant Status | ||||
Migrant | Ref | Ref | ||
Native | 0.84 (0.81–0.88) | <0.001 | ||
Recentness of infection | ||||
Chronic | Ref | Ref | Ref | Ref |
Recent | 0.52 (0.50–0.55) | <0.001 | 0.61 (0.55–0.68) | <0.001 |
Log Viral load at diagnosis | 1.45 (1.42–1.48) | <0.001 | ||
Log Viral load groups | ||||
<4.0 | Ref | Ref | Ref | Ref |
4.1–5.0 | 1.45 (1.37–1.53) | <0.001 | 1.37 (1.22–1.55) | <0.001 |
>5.1 | 3.41 (3.21–3.62) | <0.001 | 3.41 (2.96–3.91) | <0.001 |
Determinants associated with late presentation were age at diagnosis (Table 2): patients with less than 30yo had lower probability of being late presenters and patients aged above 56yo had higher probability of being late presenters when compared with patients aged between 31 and 55yo (>18yo: aOR 0.31 (0.20–0.49), p < 0.001; 19–30yo: aOR 0.46 (0.34–0.62), p < 0.001; >56: aOR 1.70 (1.49–1.94), p = 0.004), transmission via MSM had lower probability when compared with heterosexuals (aOR 0.74 (0.64–0.86); p < 0.001). Patients originating from Africa and South America had 1.76 and 1.41 more probability, respectively, of presenting late than those from Western Europe (aOR 1.76 (1.37–2.26), p < 0.001; aOR 1.41 (1.07–1.87), p = 0.015, respectively) and patients presenting with a viral load between 4.1 and 5.0 and higher than 5.1 had a higher probability of being LP than those with a viral load lower than 4.0 (aOR 1.45 (1.37–1.53) and aOR 3.41 (3.21–3.62); p < 0.001 and p < 0.001, respectively). As expected, but confirming the reliability of our classification of recentness of infection based on the ambiguity rate, patients with a recent infection—as classified based on the ambiguity rate of the genomic sequence from the first drug resistance test—had a lower probability of being LP than those classified as being chronically infected (aOR 0.61 (0.55–0.68); p < 0.001).
2.3. Bayesian Network
For the bayesian network, we used the HillClimber algorithm with nine as the maximum number of parents a node in BN can have. This algorithm is based on a “hill climbing adding and deleting arcs with no fixed ordering of variables” [12]. The BN had a LogScore Bayes of −35615.94 and an accuracy of 61%. In the BN (Figure 1), LPs are directly associated with the viral load, recentness of infection, mode of transmission and age, as we can see in the figure below, those were direct links between the nodes. The indirectly associated links were between LP and region of origin. As we can also see in the figure, there was no direct link between those two nodes. We can see that the mode of transmission is the variable with more direct associations and the variable sex is the only one that is not associated with LP. This BN is in accordance with our logistic regression model.
The variables Subtype and Migrant status had been removed from the logistic regression model due to the conflict with the variable region of origin. As we can see in Figure 1, the region of origin is directly associated with those two variables and that the migrant status is only associated with region of origin.
2.4. Ambiguity Rate and CD4 Analysis
We performed an analysis to understand the association between CD4 count and the ambiguity rate overall and on subtype B, non-B and G. This association was inversely proportional in all correlations, this means that for higher values of CD4 count the ambiguity rate is lower. In this study, the LP population had higher ambiguity rates in their sequences, since their CD4 count is lower. We also performed a linear regression in order to explain how much of the CD4 count could the ambiguity rate explain. We divided that analysis in the same categories as mentioned above and the higher result was from only individuals with non-B subtype, in which the ambiguity rate explained 5% of the variation from CD4 count (Table A1, Table A2, Table A3 and Table A4).
2.5. Analysis of Late Presenters Rate over Time
We also constructed a graph to evaluate the evolution in time of the rate of LP (Figure 2). The confidence intervals were also calculated for each point. We did not include in the analysis the first three years (1981–1983) since the total number of patients in those years was low and the confidence intervals had high values. In 2019 the sample size was also small, but we included this year in the analysis to see the trend that LPs in Europe will have. As we can see in the graph, LPs have had constant values through the years. In 1984 we had 57.5% LPs, in 1991 we had the lowest value of LPs (45.1%). The evolution through the years maintained between 45 and 60% the rate of LP. Since 2017 the rate of LPs was growing until 2019 that peaked, beyond 60%.
3. Discussion
This study had the goal of explaining the determinants of late presentation for HIV-1 infection in Europe.
In our population, late presenters represented 50.4% of the patients. A study in Georgia, using the same definition of late presentation as we used, reported 63.4% of late presenters. Another study analyzing late presentation in different settings indicated a rate of late presentation ranging between 40 and 67%, depending on the region of study. This study corresponded to the Swiss data incorporated in the COHERE study, a Collaboration of Observational HIV Epidemiological Research Europe Study. Our results are concordant with the results reported in these studies [13,14,15].
In our study, late presenters were more frequently males, with heterosexual transmission, from Western Europe and aged between 31 and 55 years old. In a study in East of England, the percentage of late presenters was higher in older patients and patients with heterosexual contact, when compared with homosexual and bisexual contact. Furthermore, according to other studies in Poland and the Netherlands, males were also more prevalent in the late presenters’ population. These results are consistent with our study [16,17,18].
Patients originated from Africa had a higher probability of being LPs when compared to patients originated from Western Europe. This percentage of African migrants in the LP population can be explained by the lower access to health care. Furthermore, African migrants have a higher probability of being in conditions of unemployment, poverty and poorer household, which further increase their barriers to access to health care. A positive status for HIV also stigmatizes individuals, and they fear the reactions of their communities, since HIV is mostly associated among these communities with inappropriate and promiscuous sexual behavior [19]. The migrants of our study from South America were mainly from Brazil and the LP rate was lower than the NLPs. This can be explained in two ways: Brazil has a concentrated HIV epidemic among MSM population and that population is frequently tested [20]. These results are in accordance with HIV studies about the migrant population [21,22,23].
The results from a previous study showed a statistically significant correlation between late presentation and IDUs [24]. In our study, we found this significant association between LP and IDU in our univariate analysis, but in the logistic regression analysis, we only found significant the association between MSM when compared to heterosexuals. The prevalence of HIV-positive IDU population is mainly from Eastern Europe. In our study, the IDU group maybe underrepresented since the larger proportion of cases are from Western Europe, in which the major mode of transmission is through heterosexual and MSM contacts [25,26].
We also studied the association between CD4 count and the ambiguity rate of the sequences included in this study. Our results show a negative correlation between CD4 count and the ambiguity rate, for lower values of CD4 we had higher values of the ambiguity rate. There is still little information regarding this topic, but our results were in accordance with a study about sequence ambiguity and HIV incidence trends [27]. In fact, the ambiguity rate could be an alternative variable to be used for the definition of Late Presentation. As we know, the initial drop of CD4 count in the acute phase of HIV infection can be a cause of bias when we define Late Presentation based on a CD4 count lower than 350 cells/mm3.
The results from the graph showed stable and high values for LPs rate. This indicates that LPs were and remain a big part of the HIV epidemic and represent a major threat to treatment and prevention strategies.
The main goal of this study was to identify determinants associated with late presentation. Those determinants included age at diagnosis, mode of transmission, region of origin, recentness of infection and viral load at diagnosis (Figure 3). Our results were in concordance with other previously published studies [13,28,29].
The last study about late presentation in Europe was published in 2015 and the timeline of the study was between 2010 and 2013. This was an update from the first study published in 2013, with a timeline of analysis between 2000 and 2011 [29,30]. Our study analyzes a European database with a timeline between 1981 and 2019. The main strength of our study was the database used, which is one of the largest datasets and integrates clinical, socio-demographic and viral genotypic information from HIV-1 patients from all over Europe. This large dataset allows for a robust analysis of the data, and up to date information regarding late presentation. In addition, we can analyze trends in the evolution of late presentation in Europe.
The major limitation of our study was the lack of information about the stage of HIV infection and AIDS-defining events. While we used the ambiguity rate to minimize this problem, we only used the definition of a CD4 count below 350 cells/mm3 to define an individual as LP or NLP.
Yet, this study is the most recent update on the HIV epidemic of late presentation in Europe, since the last one was published in 2015.
Since late presentation is a major obstacle to the 95-95-95 targets, it is necessary to reinforce the follow-up of this population. Increased HIV testing is key to reduce late presentation since it results in earlier HIV diagnosis. Prevention measures like targeting the vulnerable populations and increasing screening programs for those populations are the most urgent strategies to halt and decrease the percentage of late presenters. In low- and middle-income countries, point-of-care testing would be a major advance to stop the spread of the virus by those who do not know their serological status and therefore decreasing late presentation at diagnosis.
4. Materials and Methods
4.1. Study Group
Our study includes clinical and socio-demographic information from 89851 HIV-1 infected patients from the EuResist Integrated Database (EIDB) between 1981 and 2019. The EuResist integrated database (EIDB) is one of the largest existing datasets which integrate clinical, socio-demographic and viral genotypic information from HIV-1 patients. It integrates longitudinal, periodically updated data mainly from Italy (ARCA database), Germany (AREVIR database) Spain (CoRIS and IRISCAIXA), Sweden, Belgium, Portugal and Luxembourg [31,32,33].
In this study, information from the ARCA, AREVIR, Luxembourg, IRISCAIXA, Portugal, Russia, United Kingdom and CoRIS databases were used.
4.2. Subtyping
The genomic data included HIV-1 protease and reverse transcriptase sequences, generated through routine drug resistance testing and as stored in the EuResist database. Only the first HIV genomic sequence per patient was considered.
HIV-1 subtyping was performed using the consensus of the result obtained through three different tools: Rega HIV Subtyping Tool (https://www.genomedetective.com/app/typingtool/hiv, accessed on 1 July 2021) [34], COMET: adaptive context-based modeling for HIV-1 (https://comet.lih.lu, accessed on 1 July 2021) [35] and SCUEAL (http://classic.datamonkey.org/dataupload_scueal.php, accessed on 1 July 2021).
4.3. Study Variables
We used the information from the EuResist database regarding the following variables: Country of follow-up, Year of Birth, Gender, Country of Origin, Mode of transmission, Date of the first HIV Positive test, Date and value of the first CD4 count, Date and value of the first Viral Load count, first genomic sequence and sample collection date, Date of therapy initiation.
With this information we created new variables such as:
Migrant/Native-Based on Country of Origin and Country of Follow-up (if country of origin and country of follow-up is the same, then patient is native; if country of origin and country of follow-up is not the same, then patient is migrant)
Age at Diagnosis-Based on the difference between Year of Birth and Date of the first HIV Positive test;
Region of Origin- Based on Country of Origin;
Treatment Status at date of first CD4 count-Based on the difference between sample collection date of first CD4 count and first therapy date; for purposes of classification of Late Presentation, only patients naïve at date of first CD4 count were considered;
Treatment Status at date of first Drug Resistance Test-based on the difference between sample collection date for first drug resistance test and date of start of first therapy;
After creating these two variables, for quality control purposes, we only included in the analysis patients for which treatment status at date of first CD4 count and Treatment Status at date of first Drug Resistance test were consistent.
Recentness of infection-Based on ambiguity rate of genomic sequences. We defined Chronic infection as an ambiguity value higher than 0.45% and Recent infection as an ambiguity value equal or below 0.45% [36]. Additionally, only genomic sequences larger than 500 nucleotides and with ambiguity rate lower than 2.5% were considered.
LP vs. NLP- Based on CD4 count, LP were defined as patients with CD4 count lower than 350 cells/mm3 and NLP were defined as patients with CD4 count higher than 350 cells/mm3.
4.4. Statistical Analysis
The proportion and median (interquartile range, IQR) of LP and non- Late presenters (NLP) were calculated for every categorical and continuous variable, respectively. Our interest variables were compared with the categorical variables with Chi-square test, and continuous variables with Mann–Whitney U test.
To study the relationship between our dependent variable (LP or NLP) and the independent variables, logistic regression models were calculated. We first presented the logistic regression with the unadjusted odds ratios (uOR) and confidence intervals at 95% (95% CI), in order to see the probability of our event, the dependent variable, (late presentation vs non-late presentation) on the occurrence of each independent variable, individually, e.g., the probability of a woman being late presenter. Variables with a p-value < 0.05 were considered to enter the model since it is the most used threshold. The final model for LP vs. NLP was adjusted for sex, this variable was forced into the model regardless of its significance and the reference class was women. The final model included only the variables that were considered statistically significant (p < 0.05) and the variables that suited the best regression model according to the backward stepwise regression analysis through SPSS. The odds ratio and 95% confidence intervals were calculated for those variables as well. Data were analyzed using RStudio (Version 1.2.5033) and SPSS (Version 26.0.0.0).
4.5. Bayesian Networks
A Bayesian network (BN) is a tool that consists of a directed acyclic graph (DAG), made of nodes and directed links between the nodes, which allows us to understand the representation of a probabilistic distribution. Each node is a representation of a variable, and the links indicate that one node is directly influencing another. The lack of a direct link does not mean that one variable is not associated with another. These networks are able to intuitively create causal links between variables since they are built from probability distributions and for prediction [37].
We constructed a BN to analyze the association between all variables, specifically, we wanted to see how the variables were associated with one another. With the different levels and connections between variables, it is possible to see if they are directly or indirectly associated. We used the WEKA software version 3.8.5. WEKA stands for Waikato Environment for Knowledge Analysis. After the upload of the dataset, the first step is to choose a classifier to start the analysis. We used a statistical-based learning scheme, the Bayes classifier, specifically the BayesNet [38]. After choosing the classifier, we used different search algorithms as a local score structure learning. Our final choice of algorithm was based on the LogScore Bayes value and the percentage of correctly classified instances.
5. Conclusions
In summary, late presentation still accounts for 50% of the new diagnosis in Europe. Its most important determinants are age at diagnosis, mode of transmission, region of origin and viral load at diagnosis (Figure 3). In addition, the evolution of the rate of late presentation through the years was stable, except for the last two years analyzed (2018 and 2019) when that rate showed an increase. This study highlights the major determinants associated with late presenters in Europe, and this will help to strengthen some prevention measures.
Appendix A
Table A1.
CD4 Count | R2 | Unstandardized B | Standardized Coefficients Beta | p-Value | 95% Confidence Interval for B | Spearman’s Correlation | ||
---|---|---|---|---|---|---|---|---|
Lower Bound | Upper Bond | p-Value | Correlation Coefficient | |||||
Ambiguity Rate | 0.023 | −69.52 | −0.152 | <0.001 | −74.61 | −64.44 | <0.001 | −0.190 |
Table A2.
CD4 Count | R2 | Unstandardized B | Standardized Coefficients Beta | p-Value | 95% Confidence Interval for B | Spearman’s Correlation | ||
---|---|---|---|---|---|---|---|---|
Lower Bound | Upper Bond | p-Value | Correlation Coefficient | |||||
Ambiguity Rate | 0.015 | −57.11 | −0.123 | <0.001 | −63.27 | −50.95 | <0.001 | −0.159 |
Table A3.
CD4 Count | R2 | Unstandardized B | Standardized Coefficients Beta | p-Value | 95% Confidence Interval for B | Spearman’s Correlation | ||
---|---|---|---|---|---|---|---|---|
Lower Bound | Upper Bond | p-Value | Correlation Coefficient | |||||
Ambiguity Rate | 0.051 | −96.72 | −0.226 | <0.001 | −105.54 | −87.91 | <0.001 | −0.265 |
Table A4.
CD4 Count | R2 | Unstandardized B | Standardized Coefficients Beta | p-Value | 95% Confidence Interval for B | Spearman’s Correlation | ||
---|---|---|---|---|---|---|---|---|
Lower Bound | Upper Bond | p-Value | Correlation Coefficient | |||||
Ambiguity Rate | 0.014 | −52.78 | −0.120 | <0.001 | −73.38 | −32.17 | <0.001 | −0.144 |
Author Contributions
Conceptualization, M.N.S.M., M.P. and A.A.; Methodology, M.N.S.M., M.P., V.P., A.-M.V., M.d.R.O.M. and A.A.; Software, M.N.S.M., V.P.; Validation, M.N.S.M., M.P., F.I. and A.A.; Formal Analysis, M.N.S.M., V.P., M.P. and A.A.; Investigation, M.N.S.M., M.P., V.P., A.-M.V. and M.d.R.O.M.; Resources, M.B. (Marina Bobkova), M.B. (Michael Böhm), C.S.-D., R.P., R.R., M.Z. and F.I.; Data Curation, M.B. (Marina Bobkova), M.B. (Michael Böhm), C.S.-D., R.P., R.R., M.Z. and F.I.; Writing—Original Draft Preparation, M.N.S.M., M.P. and A.A.; Writing—Review and Editing, M.N.S.M., M.P., M.Z., F.I. and A.A.; Visualization, M.N.S.M., M.P., V.P., A.-M.V., M.d.R.O.M. and A.A.; Supervision, A.A.; Project Administration, A.A.; Funding Acquisition, A.A. All authors have read and agreed to the published version of the manuscript.
Funding
This study was financed by FCT through the following projects: GHTM- UID/04413/2020, INTEGRIV (PTDC/SAU-INF/31990/2017) and the scholarship PD/BD/135714/2018 and Gilead Geénese HIVLatePresenters.
Institutional Review Board Statement
The protocol was in accordance with the declaration of Helsinki. This database contains anonymized patients’ information, including demographic and clinical data from patients from the EuResist Integrated Database (Date of approval: 15 January 2021).
Data Availability Statement
Restrictions apply to the availability of these data. Data was obtained from the EuResist Network and is available for request through a study application form at https://www.euresist.org/become-a-partner with the permission of the EuResist Network.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.WHO HIV/AIDS. [(accessed on 4 January 2021)]; Available online: https://www.who.int/news-room/fact-sheets/detail/hiv-aids.
- 2.Global HIV & AIDS Statistics — 2020 Fact Sheet | UNAIDS. [(accessed on 4 January 2021)]; Available online: https://www.unaids.org/en/resources/fact-sheet.
- 3.Understanding Fast-Track. [(accessed on 18 May 2020)]; Available online: https://www.unaids.org/sites/default/files/media_asset/201506_JC2743_Understanding_FastTrack_en.pdf.
- 4.Guaraldi G., Zona S., Menozzi M., Brothers T.D., Carli F., Stentarelli C., Dolci G., Santoro A., Da Silva A.R.D., Rossi E., et al. Late presentation increases risk and costs of non-infectious comorbidities in people with HIV: An Italian cost impact study. AIDS Res. Ther. 2017;14:1–7. doi: 10.1186/s12981-016-0129-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Conway A.S., Esteve A., Fernández-Quevedo M., Casabona J. Determinants and Outcomes of Late Presentation of HIV Infection in Migrants in Catalonia, Spain: PISCIS Cohort 2004–2016. J. Immigr. Minor. Health. 2018;21:920–930. doi: 10.1007/s10903-018-0834-2. [DOI] [PubMed] [Google Scholar]
- 6.Gesesew H.A., Ward P., Woldemichael K., Mwanri L. Late presentation for HIV care in Southwest Ethiopia in 2003–2015: Prevalence, trend, outcomes and risk factors. BMC Infect. Dis. 2018;18:59. doi: 10.1186/s12879-018-2971-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Antinori A., Coenen T., Costagiola D., Dedes N., Ellefson M., Gatell J., Girardi E., Johnson M., Kirk O., Lundgren J., et al. Late presentation of HIV infection: A consensus definition. HIV Med. 2010;12:61–64. doi: 10.1111/j.1468-1293.2010.00857.x. [DOI] [PubMed] [Google Scholar]
- 8.The Late Presentation Working Groups in EuroSIDA and COHERE Estimating the burden of HIV late presentation and its attributable morbidity and mortality across Europe 2010–2016. BMC Infect. Dis. 2020;20:1–11. doi: 10.1186/s12879-020-05261-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hu X., Liang B., Zhou C., Jiang J., Huang J., Ning C., Liu J., Zhou B., Zang N., Lai J., et al. HIV late presentation and advanced HIV disease among patients with newly diagnosed HIV/AIDS in Southwestern China: A large-scale cross-sectional study. AIDS Res. Ther. 2019;16:6. doi: 10.1186/s12981-019-0221-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Luma H.N., Jua P., Donfack O.-T., Kamdem F., Ngouadjeu E., Mbatchou H.B., Doualla M.-S., Mapoure Y.N. Late presentation to HIV/AIDS care at the Douala general hospital, Cameroon: Its associated factors, and consequences. BMC Infect. Dis. 2018;18:298. doi: 10.1186/s12879-018-3204-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wilton J., Light L., Gardner S., Rachlis B., Conway T., Cooper C., Cupido P., Kendall C.E., Loutfy M., McGee F., et al. Late diagnosis, delayed presentation and late presentation among persons enrolled in a clinical HIV cohort in Ontario, Canada (1999–2013) HIV Med. 2019;20:110–120. doi: 10.1111/hiv.12686. [DOI] [PubMed] [Google Scholar]
- 12.Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., Witten I.H. The WEKA Data Mining Software: An Update. SIGKDD Explor. 2009;11:10–18. doi: 10.1145/1656274.1656278. [DOI] [Google Scholar]
- 13.Chkhartishvili N., Chokoshvili O., Bolokadze N., Tsintsadze M., Sharvadze L., Gabunia P., Dvali N., Abutidze A., Tsertsvadze T. Late presentation of HIV infection in the country of Georgia: 2012–2015. PLOS ONE. 2017;12:e0186835. doi: 10.1371/journal.pone.0186835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Infographic: HIV Infection - Late Diagnosis. [(accessed on 4 January 2021)]; Available online: https://www.ecdc.europa.eu/en/publications-data/infographic-hiv-infection-late-diagnosis.
- 15.Darling K.E., Hachfeld A., Cavassini M., Kirk O., Furrer H., Wandeler G. Late presentation to HIV care despite good access to health services: Current epidemiological trends and how to do better. Swiss Med. Wkly. 2016;146:14348. doi: 10.4414/smw.2016.14348. [DOI] [PubMed] [Google Scholar]
- 16.Wójcik-Cichy K., Jabłonowska O., Piekarska A., Jabłonowska E. The high incidence of late presenters for HIV/AIDS infection in the Lodz province, Poland in the years 2009–2016: We are still far from the UNAIDS 90% target. AIDS Care. 2018;30:1538–1541. doi: 10.1080/09540121.2018.1470306. [DOI] [PubMed] [Google Scholar]
- 17.Van Opstal S.E.M., Van Der Zwan J.S., Wagener M.N., Been S.K., Miedema H.S., Roelofs P., Van Gorp E.C.M. Late Presentation of HIV Infection in the Netherlands: Reasons for Late Diagnoses and Impact on Vocational Functioning. AIDS Behav. 2018;22:2593–2603. doi: 10.1007/s10461-018-2082-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Bath R.E., Emmett L., Verlander N.Q., Reacher M. Risk factors for late HIV diagnosis in the East of England: Evidence from national surveillance data and policy implications. Int. J. STD AIDS. 2018;30:37–44. doi: 10.1177/0956462418793327. [DOI] [PubMed] [Google Scholar]
- 19.Fakoya I., Arco D., Álvarez D., Copas A.J., Teixeira B., Block K., Gennotte A.-F., Volny-Anne A., Bil J.P., Touloumi G., et al. Factors Associated With Access to HIV Testing and Primary Care Among Migrants Living in Europe: Cross-Sectional Survey. JMIR Public Health Surveill. 2017;3:84. doi: 10.2196/publichealth.7741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.MacCarthy S., Brignol S., Reddy M., Nunn A., Dourado I. Late presentation to HIV/AIDS care in Brazil among men who self-identify as heterosexual. Revista de Saúde Pública. 2016;50:54. doi: 10.1590/S1518-8787.2016050006352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ross J., Cunningham C.O., Hanna D.B. HIV outcomes among migrants from low-income and middle-income countries living in high-income countries. Curr. Opin. Infect. Dis. 2018;31:25–32. doi: 10.1097/QCO.0000000000000415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Pimentel V., Pingarilho M., Alves D., Diogo I., Fernandes S., Miranda M., Pineda-Pena A.-C., Libin P., Martins M.R.O., Vandamme A.-M., et al. Molecular Epidemiology of HIV-1 Infected Migrants Followed Up in Portugal: Trends between 2001–2017. Viruses. 2020;12:268. doi: 10.3390/v12030268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Hachfeld A., Darling K., Calmy A., Ledergerber B., Weber R., Battegay M., Wissel K., Di Benedetto C., Fux C., E Tarr P., et al. Why do sub-Saharan Africans present late for HIV care in Switzerland? HIV Med. 2019;20:418–423. doi: 10.1111/hiv.12727. [DOI] [PubMed] [Google Scholar]
- 24.Suárez-García I., Sobrino-Vegas P., Dalmau D., Rubio R., Iribarren J.A., Blanco J.R., Gutierrez F., Alonso M.M., Bernal E., García D.V., et al. Clinical outcomes of patients infected with HIV through use of injected drugs compared to patients infected through sexual transmission: Late presentation, delayed anti-retroviral treatment and higher mortality. Addiction. 2016;111:1235–1245. doi: 10.1111/add.13348. [DOI] [PubMed] [Google Scholar]
- 25.Degenhardt L., Peacock A., Colledge S., Leung J., Grebely J., Vickerman P., Stone J., Cunningham E.B., Trickey A., Dumchev K., et al. Global prevalence of injecting drug use and sociodemographic characteristics and prevalence of HIV, HBV, and HCV in people who inject drugs: A multistage systematic review. Lancet Glob. Health. 2017;5:1192–1207. doi: 10.1016/S2214-109X(17)30375-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Balayan T., Oprea C., Yurin O., Jevtović Đ., Begovac J., Lakatos B., Sedlacek D., Karpov I., Horban A., Kowalska J.D., et al. People who inject drugs remain hard-to-reach population across all HIV continuum stages in Central, Eastern and South Eastern Europe – data from Euro-guidelines in Central and Eastern Europe Network. Infect. Dis. 2019;51:277–286. doi: 10.1080/23744235.2019.1565415. [DOI] [PubMed] [Google Scholar]
- 27.Lunar M.M., Lepej Židovec S., Poljak M. Sequence ambiguity determined from routine pol sequencing is a reliable tool for real-time surveillance of HIV incidence trends. Infect. Genet. Evol. 2019;69:146–152. doi: 10.1016/j.meegid.2019.01.015. [DOI] [PubMed] [Google Scholar]
- 28.Darcis G., Lambert I., Sauvage A.-S., Frippiat F., Meuris C., Uurlings F., LeComte M., Léonard P., Giot J.-B., Fombellida K., et al. Factors associated with late presentation for HIV care in a single Belgian reference center: 2006–2017. Sci. Rep. 2018;8:8594. doi: 10.1038/s41598-018-26852-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Mocroft A., Lundgren J.D., Sabin M., Monforte A.D., Brockmeyer N.N., Casabona J.J., Castagna A., Costagliola D., Dabis F., De Wit S., et al. Risk Factors and Outcomes for Late Presentation for HIV-Positive Persons in Europe: Results from the Collaboration of Observational HIV Epidemiological Research Europe Study (COHERE) PLoS Med. 2013;10:e1001510. doi: 10.1371/journal.pmed.1001510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Mocroft A., Lundgren J.D., Antinori A., Ad M. The late presenters working group in COHERE in EuroCoord Late presentation for HIV care across Europe: Update from the Collaboration of Observational HIV Epidemiological Research Europe (COHERE) study, 2010 to 2013. Eurosurveillance. 2015;20:30070. doi: 10.2807/1560-7917.es.2015.20.47.30070. [DOI] [PubMed] [Google Scholar]
- 31.Euresist Data Analysis—Database. [(accessed on 4 January 2021)]; Available online: http://engine.euresist.org/database/
- 32.Lawyer G., Altmann A., Thielen A., Zazzi M., Sönnerborg A., Lengauer T. HIV-1 mutational pathways under multidrug therapy. AIDS Res. Ther. 2011;8:26. doi: 10.1186/1742-6405-8-26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zazzi M., Incardona F., Rosen-Zvi M., Prosperi M., Lengauer T., Altmann A., Sönnerborg A., Lavee T., Schülter E., Kaiser R. Predicting Response to Antiretroviral Treatment by Machine Learning: The EuResist Project. Intervirology. 2012;55:123–127. doi: 10.1159/000332008. [DOI] [PubMed] [Google Scholar]
- 34.Pineda-Peña A.-C., Faria N.R., Imbrechts S., Libin P., Abecasis A., Deforche K., Gómez-López A., Camacho R., de Oliveira T., Vandamme A.-M. Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: Performance evaluation of the new REGA version 3 and seven other tools. Infect. Genet. Evol. 2013;19:337–348. doi: 10.1016/j.meegid.2013.04.032. [DOI] [PubMed] [Google Scholar]
- 35.Struck D., Lawyer G., Ternes A.-M., Schmit J.-C., Bercoff D.P. COMET: Adaptive context-based modeling for ultrafast HIV-1 subtype identification. Nucleic Acids Res. 2014;42:e144. doi: 10.1093/nar/gku739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Andersson E., Shao W., Bontell I., Cham F., Cuong D.D., Wondwossen A., Morris L., Hunt G., Sönnerborg A., Bertagnolio S., et al. Evaluation of sequence ambiguities of the HIV-1 pol gene as a method to identify recent HIV-1 infection in transmitted drug resistance surveys. Infect. Genet. Evol. 2013;18:125–131. doi: 10.1016/j.meegid.2013.03.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Sesen M.B., Nicholson A.E., Banares-Alcantara R., Kadir T., Brady M. Bayesian Networks for Clinical Decision Support in Lung Cancer Care. PLoS ONE. 2013;8:e82349. doi: 10.1371/journal.pone.0082349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kumar Y., Sahoo G. Analysis of Parametric & Non Parametric Classifiers for Classification Technique using WEKA. Int. J. Inf. Technol. Comput. Sci. 2012;4:43–49. doi: 10.5815/ijitcs.2012.07.06. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Restrictions apply to the availability of these data. Data was obtained from the EuResist Network and is available for request through a study application form at https://www.euresist.org/become-a-partner with the permission of the EuResist Network.