Dynamic modeling of mortality risk factors in Ebola virus disease using logistic regression on unbalanced panel data from a randomized controlled trial in the Democratic Republic of Congo

Leader Lawanga Ontshick; Jepsy Yango; Ange Mubiala Yaya; Olivier Tshiani Mbaya; Joule Madinga Twan; Jean-Michel Nsengi Ntamabyaliro; Rosine Ali; Patrick Mutombo Lupola; Joseph-Desiré Bukweli; Sifa Marie-joelle Muchanga; Gaston Tona Lutete; Placide Mbala Kiangebeni; Sabue Mulangu; Rostin Mabela Makengo Matendo

doi:10.1371/journal.pgph.0004901

2025 Jul 11;5(7):e0004901. doi: 10.1371/journal.pgph.0004901

Leader Lawanga Ontshick ¹,²,*, Jepsy Yango ¹,*,#, Ange Mubiala Yaya ³, Olivier Tshiani Mbaya ³,⁴, Joule Madinga Twan ¹, Jean-Michel Nsengi Ntamabyaliro ⁵, Rosine Ali ⁶,⁷, Patrick Mutombo Lupola ¹, Joseph-Desiré Bukweli ², Sifa Marie-joelle Muchanga ⁸,⁹, Gaston Tona Lutete ⁵, Placide Mbala Kiangebeni ¹,⁴, Sabue Mulangu ³,⁴,¹⁰,#, Rostin Mabela Makengo Matendo ²,#

Editor: Vishal Goyal¹¹

¹Department of Epidemiology and Global Health, National Institute of Biomedical Research, Kinshasa, Democratic Republic of Congo

²Department of Mathematics, Statistics and Computer Science, University of Kinshasa, Kinshasa, Democratic Republic of Congo

³Immunology Department, National Institute of Biomedical Research, Kinshasa, Democratic Republic of Congo

⁴Department of Medical Biology, University of Kinshasa, Kinshasa, Democratic Republic of Congo

⁵Department of Pharmacology and Therapeutics, University of Kinshasa, Kinshasa, Democratic Republic of Congo

⁶Department of Parasitology, National Institute of Biomedical Research, Kinshasa, Democratic Republic of Congo

⁷Department of Biology, University of Kinshasa, Kinshasa, Democratic Republic of Congo

⁸Department of Obstetrics and Gynecology, Kinshasa, Democratic Republic of the Congo

⁹Department of International Trials, National Center for Global Health and Medicine, Tokyo, Japan

¹⁰Ridgeback Biotherapeutics, Miami, Florida, United States of America

¹¹Ex DNDi, UNITED STATES OF AMERICA

The authors have declared that no competing interests exist.

* E-mail: leader.ontshick@gmail.com (LLO); jepsyango@gmail.com (JY)

Contributed equally.

Roles

Leader Lawanga Ontshick: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

Jepsy Yango: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

Ange Mubiala Yaya: Data curation, Formal analysis, Methodology, Software

Olivier Tshiani Mbaya: Writing – review & editing

Joule Madinga Twan: Writing – review & editing

Jean-Michel Nsengi Ntamabyaliro: Validation, Writing – original draft, Writing – review & editing

Rosine Ali: Conceptualization, Writing – original draft, Writing – review & editing

Patrick Mutombo Lupola: Data curation, Methodology, Software

Joseph-Desiré Bukweli: Supervision, Writing – review & editing

Sifa Marie-joelle Muchanga: Methodology, Writing – original draft, Writing – review & editing

Gaston Tona Lutete: Validation, Writing – original draft, Writing – review & editing

Placide Mbala Kiangebeni: Supervision, Writing – original draft, Writing – review & editing

Sabue Mulangu: Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

Rostin Mabela Makengo Matendo: Formal analysis, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

Vishal Goyal: Editor

PMCID: PMC12250521 PMID: 40644455

Abstract

Ebola Virus Disease (EVD) remains a significant public health threat, particularly in sub-Saharan Africa. During the 10th Ebola outbreak in the Democratic Republic of Congo (DRC), the Pamoja Tulinde Maisha clinical trial (PALM-RCT) provided a unique opportunity to evaluate new therapeutic interventions. Despite these advances, limited knowledge exists regarding the dynamic evolution of mortality risk factors in EVD patients. This study aimed to model risk factors associated with mortality using logistic regression on unbalanced panel data from patients enrolled in this trial.

We conducted a retrospective secondary analysis of longitudinal data from 617 EVD patients included in the PALM-RCT. Data were collected at five time points: Day0 (admission), Day7, Day14, Day21, and Day28. A binary logistic regression model was applied at each time point to identify significant predictors of mortality. The Hosmer-Lemeshow test was used to assess model calibration and internal validation.

At Day0 (admission), six significant predictors of mortality were identified: viral load (RT-PCR cycle threshold value), creatinine, alanine aminotransferase (ALAT), haemorrhage, shortness of breath, and conjunctivitis. By Day7, five predictors emerged: sodium, aspartate aminotransferase (ASAT), coma, abdominal pain, and shortness of breath. At Day14, two predictors remained significant: ASAT and mental state changes. No significant predictors were identified at Day21 and Day28. The dynamic nature of these risk factors highlights the importance of continuous monitoring throughout the clinical course of EVD.

Our study demonstrates that mortality risk factors in EVD patients evolve over time, suggesting that a dynamic approach to patient monitoring is critical. Early risk factors such as viral load and renal function should guide initial interventions, while neurological symptoms and electrolyte imbalances require attention in later stages. These findings support a personalized approach to EVD management, where clinical care is adjusted based on real-time clinical data to improve patient outcomes.

Introduction

Ebola virus disease (EVD) remains a major threat to global public health, particularly in sub-Saharan Africa. The 10th Ebola epidemic in the Democratic Republic of Congo (DRC), which began in 2018, was one of the most serious, both in terms of the number of cases and the complexity of the response to the crisis [1]. Clinical trials conducted during this period have assessed the efficacy of new treatments and vaccines, offering promising prospects for controlling this highly lethal disease [2].

While several studies have explored risk factors associated with mortality in EVD outbreaks, most of these analyses rely on data collected at a single time point, typically at patient admission. This static approach limits the understanding of how risk evolves throughout the course of illness. There remains a critical need for analytical methods that can capture the dynamic progression of disease and changes in risk factors over time, especially in clinical trial settings where longitudinal data are available but often underutilized [2,3].

Analysis of risk factors for Ebola virus-related mortality in clinical trials is essential to optimize clinical care, including therapeutic interventions, and to improve survival rates [4]. However, most studies focus only on data from patient inclusion (Day₀), which limits the ability to understand the impact of changes in patients’ clinical status over time [2,5–8]. Given the longitudinal data collected during phase II clinical trials, it is crucial to develop analytical methods capable of considering the temporal dynamics of these data. Such an approach could reveal risk factors that are not apparent in a conventional static analysis, thus providing a more comprehensive and integrated perspective of the factors influencing Ebola mortality during the follow-up period of participants in a clinical trial [9–11].

The primary challenge addressed by our study lies in the limitations of current analytical approaches to comprehensively assess mortality risk factors in the context of longitudinal data that are often unbalanced, that is, characterized by varying numbers of observations per patient due to differences in follow-up durations or incomplete data collection during Ebola clinical trials [12].

Many studies have employed traditional logistic regression models that are typically based on a single data collection point (often Day₀), overlooking dynamic changes in patients’ clinical conditions and variations in data panels over time [13]. For instance, Loubet et al. (2016) developed a predictive model for EVD mortality using retrospective data from N’Zérékoré, Guinea, based on a single time-point analysis. Similarly, Levine et al. (2015) created a risk prediction score in Liberia using clinical and laboratory data at admission only. Yango et al. (2024) and Tshomba et al. (2022) also employed single-point predictive models in the DRC context, which constrained their ability to capture evolving risk patterns. While foundational, these models were unable to account for the temporal evolution of disease severity [8,14–17].

To address these gaps, our study employs a dynamic modeling framework that uses unbalanced panel data to evaluate changes in mortality predictors across multiple follow-up time points. This approach allows for a more detailed understanding of the disease trajectory and supports the implementation of timely and personalized interventions. It also accommodates the variability in follow-up observations common in clinical trials, where data are often incomplete or collected at irregular intervals [12,18].

The proposed statistical framework applies logistic regression iteratively at key follow-up visits, allowing us to identify mortality risk factors as they evolve over time. By integrating the temporal dimension, our approach maximizes the clinical insights gained from ongoing patient monitoring and offers a significant improvement over static models [18].

The primary objective of this study is to model mortality risk factors associated with Ebola Virus Disease (EVD) using binary logistic regression applied to unbalanced panel data from a phase II clinical trial. The specific aims are:

To develop a statistical model tailored for the analysis of unbalanced panel data, accounting for variability across individuals, variables, and time in clinical trials.
To apply this model to sequentially evaluate mortality risk factors in EVD patients enrolled in the PALM-RCT during the 10th epidemic in the Democratic Republic of Congo (DRC).
To identify key mortality predictors that vary across follow-up visits, enabling the optimization of clinical care strategies and improving patient survival rates.

Methods

Ethics statement

The PALM-RCT clinical trial (ClinicalTrials.gov ID: NCT03719586) received ethical approval from the Ethics Committee of the Kinshasa School of Public Health (Approval No. ESP/CE/129/2018, issued November 20, 2018). Our secondary analysis of anonymized data from the PALM-RCT trial obtained ethical approval from the same committee (Approval No. ESP/CE/088/2023, issued January 11, 2023). An extension for data access was subsequently approved on January 15, 2024 (Approval No. ESP/CE/047/2024), allowing data use from January 15, 2024, to January 15, 2026.

The original data were collected during the 10th Ebola virus disease outbreak in the DRC (2018–2020) following ethical guidelines. Since the data were fully anonymized before analysis, the Ethics Committee waived the requirement for individual patient consent.

Study design

This study is a retrospective secondary analysis of data from the PALM-RCT phase II clinical trial (ClinicalTrials.gov ID: NCT03719586) conducted during the 10th Ebola virus epidemic in the DRC, which evaluated the efficacy of several experimental treatments for the Ebola virus. We used a retrospective cohort approach based on data collected during longitudinal visits of patients included in the PALM-RCT trial. These data include clinical, demographic, and biological variables collected at different times during the follow-up of Ebola-infected patients [2].

Study population

The study population includes all patients enrolled in the PALM-RCT clinical trial, who were followed longitudinally during the epidemic. The study was carried out in localities in the east of the Democratic Republic of Congo, in the provinces of Ituri and North Kivu, specifically Beni, Butembo, Mangina and Katwa.

Inclusion and exclusion criteria

The primary inclusion criterion was a confirmed diagnosis of Ebola virus infection.
Patients of any age, including pregnant women, with a positive RT-PCR result within 3 days prior to screening and a history of infection were eligible.

Data collection

The data used in this analysis were obtained from the PALM-RCT clinical trial databases. These data include:

Demographic information (age, gender, place of residence, contact details, Ebola vaccination status)
Clinical data (symptoms, severity score, co-morbidities)
Biological data (haematological and biochemical markers, viral load)
Longitudinal follow-up data (clinical status at each follow-up visit).

Each visit date was treated as a panel, with the number of participants varying across panels, resulting in unbalanced panel data. Participants were followed up at multiple time points in accordance with the PALM-RCT trial protocol: at admission (Day 0), and subsequently on Days 7, 14, and 28, with an optional visit on Day 21. The primary endpoint for the study was mortality on Day 28.
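To make the notion of an unbalanced panel concrete, the following minimal sketch tabulates how many participants contribute data at each visit, using the panel sizes reported in the Data description of this study. The names `panel_sizes`, `coverage` and `is_balanced` are illustrative and do not come from the trial database.

```python
# Panel sizes as reported in the study's data description
# (Day21 is the optional visit in the trial protocol).
panel_sizes = {"Day0": 617, "Day7": 389, "Day14": 274, "Day21": 81, "Day28": 142}

baseline = panel_sizes["Day0"]

# Share of the enrolled cohort observed at each visit; unequal shares
# are what make the panel "unbalanced".
coverage = {visit: n / baseline for visit, n in panel_sizes.items()}

for visit, share in coverage.items():
    print(f"{visit}: n = {panel_sizes[visit]:3d} ({share:.1%} of enrolled participants)")

# A balanced panel would have the same number of participants at every visit.
is_balanced = len(set(panel_sizes.values())) == 1
```

Because each visit is analyzed as its own panel, a per-visit model only uses the participants observed at that visit, which is why the sample size differs from one model to the next.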

Clinical, demographic, and biological data collected at these time points were used to analyze changes in mortality risk factors throughout the follow-up period. Data for this analysis were accessed for research purposes on 04/02/2024.

Statistical analysis

Data description.

The data for this study were initially collected and organized in Microsoft Excel 2020 sheets. For statistical analysis, we used IBM SPSS Statistics version 27.
The variable of interest was death (binary variable). Chi-square tests were used to test the association between each categorical potential predictor and the dependent variable; for quantitative variables, Student’s t-test was performed. The Kruskal-Wallis test was used to compare mortality across the different groups (panels), and the significance level used for all analyses was 5% (p < 0.05).
The area under the curve (AUC) was calculated at each visit to assess each model’s discriminative ability, and the Hosmer-Lemeshow test was used to assess model calibration and internal validation.
The study included 617 participants, spread over the different follow-up periods: Day₀ = 617 (inclusion), Day₇ = 389, Day₁₄ = 274, Day₂₁ = 81, and Day₂₈ = 142.
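As an illustration of the bivariate screening step, the sketch below computes a Pearson chi-square test for one clinical sign, using the haemorrhage counts at inclusion reported in Table 2. It is a hedged reconstruction, not the SPSS procedure itself: the function name `chi_square_2x2` is hypothetical, no continuity correction is applied, and the counts for patients without haemorrhage are derived from the column totals under the assumption that no values were missing.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 degree of freedom, P(chi-square > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# Haemorrhage at inclusion (Table 2): 17 survivors and 75 deaths had the sign.
survived_with, died_with = 17, 75
# Derived from the column totals (352 survivors, 265 deaths); assumes no missing data.
survived_without, died_without = 352 - 17, 265 - 75

chi2, p_value = chi_square_2x2(survived_with, died_with, survived_without, died_without)
print(f"chi-square = {chi2:.2f}, p = {p_value:.2e}")
```

The very small p-value is in line with the p-value printed as 0.00 for haemorrhage in Table 2.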

Methods development

Logistic regression is a widely utilized statistical method in health research for identifying risk factors associated with specific outcomes and estimating the probability of an event based on independent variables [13,19]. For this study, we employed a backward (top-down) elimination approach, specifically the backward stepwise Wald method, to build the predictive model. This iterative process selects the best-fit model by evaluating the statistical significance of each variable at each step, ensuring that only the most relevant predictors are retained [20].

The backward elimination process begins by including all available explanatory variables in the model. At each step, the Wald statistic is used to assess the significance of each variable, and the variable with the least statistical significance (p-value exceeding a predefined threshold, typically 0.05) is removed. This iterative evaluation and elimination continues until all remaining variables meet the significance criterion (p < 0.05).

The Wald statistic quantifies the relative importance of each variable, ensuring a rigorous selection process that results in a final model comprising only variables with substantial contributions to predicting the outcome. This method enhances model precision and interpretability by excluding non-influential variables and focusing on the strongest predictors.

The process stops when all the remaining variables in the model are significant and no further variables can be removed [19,21].
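A single elimination step can be sketched as follows. The Wald statistic for one coefficient is (B / S.E.)², compared with a chi-square distribution with one degree of freedom. The sketch uses the rounded B and S.E. values printed in Table 3 for the Day₀ model, so the results only approximate the Wald values in that table; the names `wald_test` and `day0_model` are hypothetical, and ALAT is left out because its coefficient and standard error are both rounded to 0.00.

```python
import math

def wald_test(b, se):
    """Return the Wald chi-square (1 df) and its p-value for one coefficient."""
    wald = (b / se) ** 2
    p_value = math.erfc(math.sqrt(wald / 2))  # chi-square survival function, 1 df
    return wald, p_value

# (B, S.E.) pairs as printed (rounded) in Table 3 for the Day0 model.
day0_model = {
    "Nucleoprotein Ct value": (-0.25, 0.04),
    "Creatinine": (0.17, 0.05),
    "Headache": (-0.39, 0.22),
    "Haemorrhage": (1.34, 0.34),
    "Shortness of breath": (1.09, 0.46),
    "Hiccups": (1.34, 0.71),
    "Conjunctival injection": (0.67, 0.29),
}

p_values = {name: wald_test(b, se)[1] for name, (b, se) in day0_model.items()}

# In a backward step, the variable with the largest p-value is the removal candidate.
candidate = max(p_values, key=p_values.get)
print(candidate, round(p_values[candidate], 3))
```

Whether the candidate is actually dropped depends on the removal threshold configured in the software, which is an analysis setting rather than something this sketch can determine.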

In our case, we apply this method to panel data, where each patient visit constitutes a separate panel. We analyzed a total of five panels corresponding to the visits (panel₁ = visit₀ = Day₀ = day of inclusion, panel₂ = visit₁ = Day₇, panel₃ = visit₂ = Day₁₄, panel₄ = visit₃ = Day₂₁ and panel₅ = visit₄ = Day₂₈). The outcome $Y_{t}$ takes values in {0,1} and is modeled as a function of the explanatory variables $X_{i, t}$ at each time point t (visit time). The logistic model describes the distribution of $Y_{t} \mid X_{i, t} = x_{t}$ by a Bernoulli distribution with parameter:

$$p_{\beta}(x_{t}) = P(Y_{t} = 1 \mid X_{i, t} = x_{t})$$

(1.1)

such that:

$$\log\left(\frac{p_{\beta}(x_{t})}{1 - p_{\beta}(x_{t})}\right) = \beta_{0} + \beta_{1} x_{1, t} + \dots + \beta_{i} x_{i, t} = x_{t}' \beta$$

that is, $\operatorname{logit} p_{\beta}(x_{t}) = x_{t}' \beta$, where $\operatorname{logit}$ denotes the bijective and differentiable function from $] 0, 1 [$ to $\mathbb{R}$, $p \mapsto \log (p / (1 - p))$.

The equality in (1.1) can therefore be written as:

$$p_{\beta}(x_{t}) = P(Y_{t} = 1 \mid X_{i, t} = x_{t}) = \frac{\exp(x_{t}' \beta)}{1 + \exp(x_{t}' \beta)}$$

(1.2)

The coefficients $\beta$ are used to estimate the odds ratios, $\exp(\beta)$.

In our study, the exogenous variables are not always binary and change over time. With the other variables held fixed, we obtain the odds ratio (OR) for a variable $X_{i, t}$ using the following formula:

$$OR = \frac{\dfrac{P(Y_{t} = 1 \mid X_{i, t} = 1)}{1 - P(Y_{t} = 1 \mid X_{i, t} = 1)}}{\dfrac{P(Y_{t} = 1 \mid X_{i, t} = 0)}{1 - P(Y_{t} = 1 \mid X_{i, t} = 0)}}$$

(1.3)

By (1.2), the odds are equal to $\exp(x_{t}' \beta)$, so that:

$$OR = \frac{\exp(\beta_{0} + \beta_{i})}{\exp(\beta_{0})} = \exp(\beta_{i})$$

(1.4)

The exponential of the estimated coefficient, $\exp(\beta_{i})$, gives the odds ratio associated with a one-unit increase in $X_{i, t}$.
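As a worked example of equation (1.4), the sketch below converts a coefficient and its standard error into an odds ratio with a Wald-type 95% confidence interval, exp(B ± 1.96 × S.E.). The helper name `odds_ratio_ci` is hypothetical, and the inputs are the rounded values printed for haemorrhage in the Day₀ model of Table 3, so the output only approximately reproduces the interval reported there.

```python
import math

def odds_ratio_ci(b, se, z=1.96):
    """Odds ratio exp(b) with a Wald confidence interval exp(b ± z·se)."""
    return math.exp(b), math.exp(b - z * se), math.exp(b + z * se)

# Haemorrhage in the Day0 model (Table 3): B = 1.34, S.E. = 0.34 (rounded values).
or_, lower, upper = odds_ratio_ci(1.34, 0.34)
print(f"OR = {or_:.2f} (95% CI {lower:.2f} to {upper:.2f})")
```

The small differences from the table (3.81, 1.97 to 7.35) are what one would expect when starting from coefficients rounded to two decimals.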

(a) Calibration and adjustment:

We used the Hosmer-Lemeshow test to assess the fit and calibration of the logistic regression model. It compares observed and predicted values to determine whether there are significant differences between them, which indicates goodness of fit and internal model validation. Statistically, the test consists of dividing the sample into groups (usually 10) according to estimated probabilities, then calculating a χ² statistic from the differences between observed and expected results in each group [19,20].

The χ² statistic is calculated by the following formula:

$$\chi^{2} = \sum_{i = 1}^{g} \frac{(O_{i} - E_{i})^{2}}{E_{i}}$$

(1.5)

where $O_{i}$ are the observed counts, $E_{i}$ are the expected counts, and g is the number of groups. In our case, we carry out the test at each visit time, and we obtain:

$$\chi^{2}_{t} = \sum_{i = 1}^{g} \frac{(O_{i t} - E_{i t})^{2}}{E_{i t}}$$

(1.6)

where $t = \text{Day}_{0}, \text{Day}_{7}, \text{Day}_{14}, \text{Day}_{21}, \text{Day}_{28}$.

The p-value returned by the test must be greater than 5% (the significance level) to indicate acceptable agreement.
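The test can be sketched in a few lines of Python. This is an illustrative implementation under stated assumptions, not the SPSS routine: it uses the usual form of the statistic, in which both events and non-events contribute an (O − E)²/E term in each risk group, with g − 2 degrees of freedom; the function names and the toy data are hypothetical. The last two lines recompute p-values from the chi-square values given in Table 4 for Day₇ and Day₁₄.

```python
import math

def chi2_sf_even_df(x, df):
    """P(chi-square_df > x) for even df, using the closed-form finite sum."""
    if df <= 0 or df % 2:
        raise ValueError("closed form implemented for positive even df only")
    half = x / 2
    return math.exp(-half) * sum(half ** j / math.factorial(j) for j in range(df // 2))

def hosmer_lemeshow(probs, outcomes, groups=10):
    """Hosmer-Lemeshow statistic and p-value (df = groups - 2)."""
    pairs = sorted(zip(probs, outcomes))  # order patients by predicted risk
    n = len(pairs)
    chi2 = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        size = len(chunk)
        observed = sum(y for _, y in chunk)   # observed deaths in the group
        expected = sum(p for p, _ in chunk)   # expected deaths in the group
        # Contributions of events and of non-events in this risk group.
        chi2 += (observed - expected) ** 2 / expected
        chi2 += ((size - observed) - (size - expected)) ** 2 / (size - expected)
    return chi2, chi2_sf_even_df(chi2, groups - 2)

# Hypothetical predicted risks and outcomes, for illustration only.
probs = [(i + 0.5) / 200 for i in range(200)]
outcomes = [1 if ((i * 37) % 200) / 200 < p else 0 for i, p in enumerate(probs)]
chi2, p_value = hosmer_lemeshow(probs, outcomes)
print(f"HL chi-square = {chi2:.2f}, p = {p_value:.2f}")

# Recomputing p-values from chi-square values in Table 4 (df = 8).
p_day7 = chi2_sf_even_df(2.60, 8)
p_day14 = chi2_sf_even_df(2.68, 8)
print(round(p_day7, 2), round(p_day14, 2))
```

The closed-form tail probability is only valid for an even number of degrees of freedom, which covers the common case of ten groups (df = 8).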

Results

In our study, 617 participants were recruited, all of whom met the inclusion criteria, defined as a positive PCR test result for EVD. These participants were subsequently allocated to different visit dates for follow-up (Fig 1).

The demographic, biochemical, vital, and clinical characteristics of the participants are summarized in Table 1. The mean age of participants at inclusion was 29 ± 18 years and remained close to 29 years across the visits, except at Day21 (22 ± 16 years). The proportion of male participants decreased slightly from 55.58% on Day0 to 46.29% on Day21. Biochemically, the mean nucleoprotein Ct value increased from 24.22 on Day0 to 37.61 on Day21, with a slight decrease to 34.54 on Day28; because a higher Ct value corresponds to a lower viral load, this indicates declining viremia. Creatinine levels fell from 2.64 mg/dL on Day0 to 0.62 mg/dL on Day14, reflecting improved renal function. Sodium levels rose from 132.21 mEq/L on Day0 and then remained stable at approximately 137–138 mEq/L, while the liver enzymes aspartate aminotransferase (ASAT) and alanine aminotransferase (ALAT) showed marked declines, indicating enhanced liver function.

Table 1. Demographic, biochemical and clinical characteristics.

Characteristic	All, Day₀ (n = 617)	Day₇ (n = 389)	Day₁₄ (n = 274)	Day₂₁ (n = 82)	Day₂₈ (n = 142)
Age-yr (Mean ±standard-deviation)	29 ± 18	28 ± 18	29 ± 17	22 ± 16	29 ± 20
Sex (n (%))
Male	343(55.58)	210(53.96)	150(54.68)	38(46.29)	71(50.39)
Female	274(44.42)	179(46.40)	124(44.32)	44(53.71)	70(49.59)
Biochemical parameters (Mean ±standard-deviation)
Nucleoprotein Ct value	24.22 ± 4.21	31.83 ± 10.29	33.62 ± 9.21	37.61 ± 6.87	34.54 ± 5.29
Creatinine-mg/dl	2.64 ± 2.80	1.01 ± 1.31	0.62 ± 0.33	0.62 ± 0.32	0.63 ± 0.21
Potassium-mmol/liter	4.33 ± 0.91	4.33 ± 1.14	4.72 ± 0.92	4.71 ± 1.03	4.34 ± 0.71
Sodium-mEq/liter	132.21 ± 5.74	138.23 ± 5.14	137.52 ± 4.91	137.05 ± 5.31	137.50 ± 3.71
ASAT-U/liter	639.32 ± 532.91	129.72 ± 228.93	57.40 ± 33.91	88.40 ± 21.34	44.41 ± 46.73
ALAT-U/liter	383.53 ± 435.82	86.25 ± 70.03	50.22 ± 36.84	54.20 ± 84.23	33.50 ± 25.71
Vital signs (Mean ±standard-deviation)
Blood pressure-mm Hg
Systolic	100.14 ± 32.40	114.33 ± 17.03	113.11 ± 13.81	112.60 ± 14.03	117.73 ± 15.72
Diastolic	63.55 ± 21.90	72 ± 17.03	73.74 ± 12.11	71.60 ± 12.91	75.44 ± 12.73
Pulse-beats/min	97.20 ± 22.90	92.30 ± 12.50	93.4 ± 16.9	99.50 ± 23.30	95.82 ± 16.60
Body temperature-°C	37.39 ± 1.79	36.69 ± 0.92	36.53 ± 0.71	36.51 ± 0.74	36.32 ± 0.52
Respiratory rate-breaths/min	26.52 ± 8.70	23.83 ± 7.04	22.12 ± 5.12	220 ± 3.92	21.31 ± 3.13
Oxygen saturation	94.92 ± 7.91	97.32 ± 2.81	97.63 ± 2.42	98.42 ± 1.50	97.66 ± 1.83
Clinical signs (n, %)
Fever	312(50.56)	62(15.93)	17(6.20)	5(6.09)	4(2.81)
Cough	61(9.88)	20(5.14)	8(2.91)	1(1.22)	3(2.11)
Headache	275(44.57)	23(5.91)	9(3.28)	2(2.44)	16(11.27)
Vomiting	237(38.41)	23(5.91)	2(0.72)	1(1.22)	0(0.00)
Diarrhea	324(52.51)	73(18.76)	14(5.11)	1(1.22)	1(0.70)
Haemorrhage	92(14.91)	14(3.59)	0(0.00)	0(0.00)	1(0.70)
Convulsions	10(1.62)	4(1.03)	0(0.00)	0(0.00)	2(1.41)
Coma	30(4.86)	15(3.85)	4(1.46)	0(0.00)	2(1.41)
Abdominal pain	283(38.57)	31(7.96)	2(0.72)	2(2.44)	6(4.23)
Shortness of breath	44(7.13)	26(6.63)	4(1.46)	0(0.00)	0(0.00)
Hiccups	19(3.08)	4(1.03)	0(0.00)	0(0.00)	0(0.00)
Rash	11(1.78)	5(1.28)	2(0.72)	0(0.00)	1(0.70)
Conjunctival injection	110(17.82)	19(4.88)	5(1.82)	0(0.00)	1(0.70)

ASAT: Aspartate aminotransferase.

ALAT: Alanine aminotransferase.

Vital signs revealed a gradual increase in systolic blood pressure (from 100.14 to 117.73 mmHg) and diastolic blood pressure (from 63.55 to 75.44 mmHg) between Day 0 and Day 28. Heart rate stabilized at around 95 beats per minute, with a slight dip on Day 7. Body temperature and respiratory rate showed slight decreases over the course of the study. Clinically, symptoms such as fever (from 312 cases on Day 0 to 4 cases on Day 28), cough, headache, vomiting, diarrhea, abdominal pain, and conjunctival injection decreased markedly, demonstrating clear clinical improvement in participants by Day 28.

Data on mortality according to patient follow-up, presented in Fig 2, show a decreasing trend over time. A total of 265 deaths were recorded among the participants included at Day₀, which represents 43% of the 617 participants. At Day₇, mortality fell sharply to 27 deaths (7.4%), indicating a rapid stabilization of the survival status. Less than 1.82% of deaths occurred on and after Day₁₄, suggesting a consolidation of the positive clinical results. The Kruskal-Wallis test (p-value = 0.002) showed that the reduction in mortality across the different groups (panels) was significant at the 5% threshold.

We used data from inclusion (n = 617) to examine the relationships between the individual variables and the outcome. Table 2 shows that certain clinical and biochemical characteristics are strongly associated with an increased risk of death. Signs such as haemorrhage and coma, together with laboratory test results, are important indicators that require special attention in patient management (see Table 2 for detailed results).

Table 2. Relationship between sociodemographic, clinical and biochemical factors and participant outcome.

	Deaths
Characteristic	No (n = 352)	Yes (n = 265)	p-value
Socio-demographic characteristics
Sex (n (%))			0.78^*
Male	194(56.59)	149(43.41)
Female	158(57.68)	116(42.32)
Age (Mean ±standard-deviation)	29.03 ± 18.01	30.04 ± 18.03	0.41^**
Clinical sign present (n (%))
Fever	178(57.12)	134(42.88)	0.99^*
Cough	30(49.19)	31(50.81)	0.19^*
Headache	167(60.64)	108(39.26)	0.98^*
Vomiting	118(49.56)	119(50.44)	0.004^*
Diarrhea	167(51.56)	157(48.44)	0.004^*
Haemorrhage	17(18.77)	75(81.23)	0.00^*
Convulsions	3(30.05)	7(69.95)	0.81^*
Coma	7(23.31)	23(76.09)	0.00^*
Abdominal pain	156(55.15)	127(44.95)	0.37^*
Shortness of breath	11(25.04)	33(74.96)	0.00^*
Hiccups	3(15.65)	16(84.35)	0.00^*
Rash	5(45.57)	6(54.43)	0.43^*
Conjunctival injection	38(34.43)	72(65.57)	0.00^*
Biochemical parameters (Mean ±standard-deviation)
Nucleoprotein Ct value	25.76 ± 4.02	22.10 ± 3.42	0.00^**
Creatinine	1.74 ± 2.14	3.67 ± 3.10	0.00^**
Potassium	4.18 ± 0.88	4.53 ± 0.96	0.00^**
Sodium	132.83 ± 536.12	131.48 ± 5.88	0.004^**
ASAT	510.64 ± 33.91	810.07 ± 478.89	0.00^**
ALAT	197.07 ± 226.14	632.12 ± 517.17	0.00^**

^** Student’s T test

^* Chi-square test

The risk factors associated with mortality at different follow-up periods, presented in Table 3, vary according to the time of assessment (values below are odds ratios with 95% confidence intervals). At Day₀ (Model 1), six factors were associated with death: nucleoprotein Ct value 0.78 (0.72–0.85), creatinine 1.18 (1.08–1.29), ALAT 1.00 (1.01–1.03), haemorrhage 3.81 (1.97–7.35), shortness of breath 2.99 (1.21–7.39) and conjunctival injection 1.95 (1.09–3.50). Because a higher Ct value corresponds to a lower viral load, the odds ratio below 1 for the Ct value means that a higher viral load was associated with higher odds of death. By Day7 (Model 2), five predictors were identified: sodium 1.09 (1.03–1.17), ASAT 1.00 (1.00–1.01), coma 8.40 (2.22–31.77), abdominal pain 4.30 (1.33–13.90), and shortness of breath 4.88 (1.55–15.38). Notably, shortness of breath persisted as a significant factor from Day0, while the other predictors either newly emerged or became significant at this later stage. At Day14 (Model 3), two predictors remained associated with mortality: ASAT 1.03 (1.01–1.05) and coma 57.05 (1.25–2612.16). These findings highlight the dynamic evolution of risk factors over time, with some predictors persisting while others vary across different stages of disease progression.

Table 3. Assessment of risk factors using logistic regression.

Variables in the equation	B	S.E.	Wald	df	p-value	Odds ratio (Exp(B))	95% confidence interval for Exp(B)
Model₁ = Day₀ = Inclusion							Lower	Upper
Nucleoprotein Ct value	-0.25	0.04	38.13	1	0.00^*	0.78	0.72	0.85
Creatinine	0.17	0.05	13.25	1	0.00^*	1.18	1.08	1.29
ALAT	0.00	0.00	31.3	1	0.00^*	1.00	1.01	1.03
Headache	-0.39	0.22	3.11	1	0.08	0.68	0.44	1.04
Haemorrhage	1.34	0.34	15.84	1	0.00^*	3.81	1.97	7.35
Shortness of breath	1.09	0.46	5.61	1	0.02^*	2.99	1.21	7.39
Hiccups	1.34	0.71	3.62	1	0.057	3.83	0.96	15.31
Conjunctival injection	0.67	0.29	4.99	1	0.03^*	1.95	1.09	3.50
Constant	4.15	0.98	17.876	1	0.00	63.16
Model₂ = Day₇
Sodium	0.09	0.03	7.36	1	0.01^*	1.09	1.03	1.17
ASAT	0.004	0.00	14.58	1	0.00^*	1.00	1.00	1.01
ALAT	-0.01	0.01	3.53	1	0.06	0.99	0.98	1
Coma	2.13	0.67	9.84	1	0.00^*	8.40	2.22	31.77
Abdominal pain	1.46	0.598	5.95	1	0.02^*	4.30	1.33	13.90
Shortness of breath	1.59	0.586	7.32	1	0.01^*	4.88	1.55	15.38
Constant	-15.63	4.665	11.22	1	0.001	0.00
Model₃ = Day₁₄
ASAT	0.03	0.01	10.43	1	0.00^*	1.03	1.01	1.05
Fever	2.40	1.35	3.14	1	0.08	11.02	0.78	156.72
Coma	4.04	1.95	4.29	1	0.04^*	57.05	1.25	2612.16
Constant	-7.69	1.56	24.29	1	0.00	0.00

B: Regression coefficient.

S.E. (standard error): A measure of the precision of the estimate of the B coefficient.

Wald: The Wald test statistic, used to determine whether the B coefficient is statistically significant.

df (degrees of freedom): The number of free values in a calculation.

95% confidence interval for Exp(B): The range within which we can be 95% confident that the true value of Exp(B) lies.

*: Significant p-value.

The performance of the models, illustrated in Fig 3, varies across the follow-up periods. Model 1 shows good discrimination, with an area under the curve (AUC) of 0.88: in 88% of pairs formed by one patient who died and one who survived, the model assigned the higher predicted risk to the patient who died. Model 2 also performs well, with an AUC of 0.85, and Model 3 performs exceptionally well, with an AUC of 0.97. In contrast, Models 4 and 5, with AUCs of 0.50 each, discriminate no better than chance and fail to distinguish between individuals at risk of death and those not at risk.
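The AUC can be read as a ranking probability: the chance that a randomly chosen patient who died received a higher predicted risk than a randomly chosen survivor. The sketch below computes it by direct pair counting on hypothetical predicted risks (the function name `auc` and all numbers are illustrative and are not trial data).

```python
def auc(scores_died, scores_survived):
    """Share of (death, survivor) pairs ranked correctly; ties count one half."""
    wins = 0.0
    for d in scores_died:
        for s in scores_survived:
            if d > s:
                wins += 1.0
            elif d == s:
                wins += 0.5
    return wins / (len(scores_died) * len(scores_survived))

# Hypothetical predicted risks, for illustration only.
died = [0.9, 0.8, 0.4]
survived = [0.7, 0.3, 0.2, 0.1]
print(auc(died, survived))  # 11 of the 12 pairs are ordered correctly

# A model that gives every patient the same risk cannot rank anyone: AUC = 0.5.
constant = auc([0.3, 0.3], [0.3, 0.3, 0.3])
print(constant)
```

An AUC of 0.50, as obtained for Models 4 and 5, is what such a constant-risk model would produce.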

The evaluation of models at different time points, as shown in Fig 4, highlights the evolution of predictors influencing mortality. Model 1, based on data at inclusion, identified six significant predictors: elevated viremia (low nucleoprotein Ct value), renal dysfunction (elevated creatinine), liver dysfunction (elevated alanine aminotransferase (ALAT)), hemorrhage, shortness of breath, and conjunctival injection. At Day7, Model 2 identified five predictors of mortality: electrolyte imbalances (sodium), liver dysfunction (elevated aspartate aminotransferase (ASAT)), central nervous system dysfunction (coma), abdominal pain, and shortness of breath. By Day14, Model 3 retained two key predictors: persistent liver dysfunction (elevated ASAT) and central nervous system dysfunction (coma).

The calibration and fit of our logistic regression models were assessed using the Hosmer-Lemeshow test (see Table 4 for detailed results). The results indicate that for Day0, Day7, and Day14, the models demonstrated a good fit to the data, as evidenced by p-values exceeding the threshold of 0.05. These findings suggest that the three models are adequately calibrated. Conversely, no Hosmer-Lemeshow statistic is reported in Table 4 for the Day21 and Day28 models, which retained no significant predictors; these models therefore cannot support reliable conclusions.

Table 4. Hosmer-Lemeshow Test Results for Model Calibration Across Time Points.

Model	Chi-square	df	p-value
Day₀	18.44	8	0.82
Day₇	2.60	8	0.96
Day₁₄	2.68	8	0.95
Day₂₁	–	–	–
Day₂₈	–	–	–

df (degrees of freedom): The number of free values in a calculation.

Discussion

The aim of our study was to model the risk factors associated with mortality in patients with Ebola virus disease, using logistic regression applied to unbalanced longitudinal data. In the epidemiological context of Ebola virus infection, this approach captures variations in predictors of mortality over time, reinforcing the importance of dynamic risk assessment to improve care. We identified key predictors at different time points during follow-up (Day₀, Day₇, and Day₁₄), highlighting the need for an evolutionary approach to assessing risk and adapting care optimally.

On Day₀, six factors proved significant: a high level of viremia (low nucleoprotein Ct value), deterioration in renal function (high creatinine level), deterioration in liver function (high ALAT level), hemorrhage, shortness of breath and conjunctival injection. These results point to critical markers on admission that could justify aggressive management from the time of diagnosis. At Day₇, five predictive factors were identified: electrolyte imbalances (sodium), deterioration in liver function (elevated ASAT), alterations in the central nervous system (coma), abdominal pain and shortness of breath. Finally, at Day₁₄, two predictive factors were identified: deterioration in liver function (elevated ASAT) and central nervous system alterations (coma).

The results of our study are consistent with those of previous research [2,7,8,16,21,22], which also observed the importance of viral load and liver function markers at the start of the disease but did not take account of changes in risk factors over time.

Our study, on the other hand, has shown that the initial predictors at Day₀ evolve, and that parameters such as electrolyte imbalances (sodium) and neurological symptoms (coma) become more relevant after the first week of follow-up.
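The idea of visit-specific predictors on an unbalanced panel can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' analysis: the records are hypothetical, `coma` stands in for any time-varying predictor, one logistic model is fitted per visit using only the patients still observed at that visit, and `fit_logistic` is a hypothetical helper that performs plain Newton-Raphson for an intercept and a single predictor.

```python
import math

# Hypothetical long-format records: (patient_id, visit, coma, died).
# Patients no longer followed at a visit contribute no row to it,
# so the number of rows differs between visits (unbalanced panel).
records = [
    (1, "Day0", 0, 0), (2, "Day0", 0, 0), (3, "Day0", 0, 0), (4, "Day0", 0, 1),
    (5, "Day0", 1, 0), (6, "Day0", 1, 1), (7, "Day0", 1, 1), (8, "Day0", 1, 1),
    (1, "Day7", 0, 0), (2, "Day7", 0, 0), (4, "Day7", 0, 1),
    (5, "Day7", 1, 0), (6, "Day7", 1, 1), (7, "Day7", 1, 1),
]


def fit_logistic(x, y, iterations=25):
    """Fit logit P(y=1) = b0 + b1*x by Newton-Raphson; returns (b0, b1)."""
    b0 = b1 = 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            w = p * (1.0 - p)
            g0 += yi - p            # score for the intercept
            g1 += (yi - p) * xi     # score for the slope
            h00 += w                # information matrix entries
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1


# Group rows by visit, then fit one model per visit.
by_visit = {}
for _pid, visit, coma, died in records:
    by_visit.setdefault(visit, []).append((coma, died))

results = {}
for visit, rows in by_visit.items():
    x = [c for c, _ in rows]
    y = [d for _, d in rows]
    _, b1 = fit_logistic(x, y)
    results[visit] = (len(rows), math.exp(b1))  # (sample size, odds ratio)
    print(f"{visit}: n={len(rows)}, odds ratio for coma={math.exp(b1):.2f}")
```

With these toy data the odds ratio falls from 9 at Day0 (8 patients) to 4 at Day7 (6 patients), which shows how both the sample and the strength of a predictor can change from one visit to the next.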

In contrast to static analyses, such as the studies by Loubet et al. (2016), Levine et al. (2015), Yango et al. (2024), Tshomba et al. (2022), and Lawanga et al. (2023), which focus solely on data at inclusion (Day₀), our dynamic analysis shows that risk factors evolve over time and that recognizing this evolution is critical for effective management. Previous work [23] has emphasized the importance of patient-centered approaches, and our results reinforce this idea by demonstrating that personalizing care according to patients’ clinical evolution is crucial for improving outcomes [2,7,8,22]. The results of this study have important implications for the clinical management of patients with Ebola virus disease.

The identification of different predictors at each stage of follow-up means that therapeutic interventions can be better targeted. For example, viral load and markers of renal function (creatinine) are critical indicators on admission, justifying aggressive treatment from the moment of diagnosis. On Day₇, the appearance of factors such as changes in mental status and sodium underlines the importance of neurological and biochemical monitoring to anticipate potential deterioration. Finally, on Day₁₄, the persistence of elevated liver enzymes (ASAT) and neurological symptoms may require more intensive management of vital functions.

These results suggest that dynamic assessment of Ebola patients could lead to more personalized management, where treatments are adjusted according to changes in risk over time, and not just based on initial patient characteristics. It could also help optimize the use of medical resources, particularly in resource-poor settings, by targeting higher-risk patients at different stages of the disease. A major strength of our study is the use of unbalanced longitudinal data, which has allowed us to capture the dynamics of risk factors throughout the course of the disease. This approach offers a better understanding of the clinical changes that occur over time in Ebola-infected patients, and we believe that this reinforces the clinical value of our results.

Conclusion

Our study successfully modeled mortality risk factors in patients with EVD using logistic regression on unbalanced longitudinal data. The results identified significant predictors at different stages of clinical follow-up (Day0, Day7, and Day14), underscoring the critical importance of monitoring changes in risk factors over time to adapt therapeutic strategies. For instance, viral load and renal and hepatic function emerged as key predictors at admission, whereas changes in mental status and electrolyte imbalances became more prominent at later stages of the disease. These findings highlight the potential of personalized care tailored to patients’ evolving clinical profiles to significantly improve therapeutic outcomes and reduce mortality. Continuous monitoring and real-time adjustments in clinical management could help optimize care and mitigate severe complications effectively.

Finally, prospective multicenter studies are essential to validate these findings and support the integration of this dynamic risk assessment model into routine clinical practice, further enhancing patient outcomes in future EVD outbreaks.

Limitations

This study has several limitations. First, as a retrospective secondary analysis, the findings may not be directly generalizable to future outbreaks without further validation. Additionally, the dynamic nature of EVD and variability across clinical settings require cautious interpretation and underscore the importance of external prospective validation in diverse geographic and epidemiologic contexts. Finally, the unbalanced nature of the dataset due to differences in follow-up durations and patient outcomes may affect statistical power at later time points, potentially limiting the generalizability of visit-specific results despite the use of appropriate analytical methods.

Future studies should consider

Conducting prospective, multicenter cohort studies to externally validate these findings.
Integrating dynamic risk prediction models into routine clinical workflows for real-time decision-making.
Investigating the longitudinal evolution of clinical and biochemical markers to refine disease staging and therapeutic strategies.

Supporting information

S1 Data. PALM Dataset.

(XLSX)

pgph.0004901.s001.xlsx (114.5 KB, xlsx)

Acknowledgments

We extend our heartfelt gratitude to the researchers from the Department of Epidemiology and Global Health and the Department of Immunology at the National Institute for Biomedical Research for their invaluable contributions to the discussions during the scientific days. We also sincerely thank the PALM-RCT consortium authorities, particularly Dr. Jean-Luc Biampata, for generously providing the data that made this article possible.

Data Availability

All relevant data supporting the findings of this study are included in the manuscript and its Supporting Information files. These materials are available for download and review.

Funding Statement

The author(s) received no specific funding for this work.

References

1. WHO. Ebola virus disease. Fact sheet. Geneva: World Health Organization; [Internet]. 2018. [cited 2024 Nov 13]. Available from: https://www.who.int/news-room/fact-sheets/detail/ebola-virus-disease
2. Mulangu S, Dodd LE, Davey RT Jr, Tshiani Mbaya O, Proschan M, Mukadi D, et al. A Randomized, Controlled Trial of Ebola Virus Disease Therapeutics. N Engl J Med. 2019;381(24):2293–303. doi: 10.1056/NEJMoa1910993
3. Henao-Restrepo AM, Camacho A, Longini IM, Watson CH, Edmunds WJ, Egger M, et al. Efficacy and effectiveness of an rVSV-vectored vaccine in preventing Ebola virus disease: final results from the Guinea ring vaccination, open-label, cluster-randomised trial (Ebola Ça Suffit!). Lancet. 2017;389(10068):505–18. doi: 10.1016/S0140-6736(16)32621-6
4. Towner JS, Rollin PE, Bausch DG, Sanchez A, Crary SM, Vincent M, et al. Rapid diagnosis of Ebola hemorrhagic fever by reverse transcription-PCR in an outbreak setting and assessment of patient viral load as a predictor of outcome. J Virol. 2004;78(8):4330–41. doi: 10.1128/jvi.78.8.4330-4341.2004
5. Richardson ET, Kelly JD, Barrie MB, Mesman AW, Karku S, Quiwa K, et al. Minimally Symptomatic Infection in an Ebola “Hotspot”: A Cross-Sectional Serosurvey. PLoS Negl Trop Dis. 2016;10(11):e0005087. doi: 10.1371/journal.pntd.0005087
6. Caleo G, Theocharaki F, Lokuge K, Weiss HA, Inamdar L, Grandesso F, et al. Clinical and epidemiological performance of WHO Ebola case definitions: a systematic review and meta-analysis. Lancet Infect Dis. 2020;20(11):1324–38. doi: 10.1016/S1473-3099(20)30193-6
7. Lawanga OL, Mulangu Sabue J-C, Mbala KP, Tshiani MO, Nsengi Ntamabyaliro J-M. Comparison of the performance of linear discriminant analysis and binary logistic regression applied to risk factors for mortality in Ebola virus disease patients. J Electromed Eng Med Inform. 2023 Jul 29;5(3):205–210. Available from: https://jeeemi.org/index.php/jeeemi/article/view/303
8. Yango J, Tshomba AO, Kwete P, Madinga J, Mulangu S, Mbala-Kingebeni P, et al. Development of a clinical prediction score for Ebola virus disease screening at triage centers in the Democratic Republic of the Congo. PLOS Glob Public Health. 2024;4(8):e0003583. doi: 10.1371/journal.pgph.0003583
9. Kaiwan O, Sethi Y, Khehra N, Padda I, Chopra H, Chandran D, et al. Emerging and re-emerging viral diseases, predisposing risk factors, and implications of international travel: a call for action for increasing vigilance and imposing restrictions under the current threats of recently emerging multiple Omicron subvariants. Int J Surg. 2023;109(3):589–91. doi: 10.1097/JS9.0000000000000176
10. Craiu RV, Duchesne T, Fortin D. Inference methods for the conditional logistic regression model with longitudinal data. Biom J. 2008;50(1):97–109. doi: 10.1002/bimj.200610379
11. Irimata KM, Broatch J, Wilson JR. Partitioned GMM logistic regression models for longitudinal data. Stat Med. 2019;38(12):2171–83. doi: 10.1002/sim.8099
12. Dietz K. The estimation of the basic reproduction number for infectious diseases. Stat Methods Med Res. 1993;2(1):23–41. doi: 10.1177/096228029300200103
13. Breslow NE, Day NE. Statistical methods in cancer research. Volume I - The analysis of case-control studies. IARC Sci Publ. 1980;(32):5–338.
14. Kratz T, Roddy P, Tshomba Oloma A, Jeffs B, Pou Ciruelo D, de la Rosa O, et al. Ebola virus disease outbreak in Isiro, Democratic Republic of the Congo, 2012: signs and symptoms, management and outcomes. PLoS One. 2015;10(6):e0129333. doi: 10.1371/journal.pone.0129333
15. Loubet P, Palich R, Kojan R, Peyrouset O, Danel C, Nicholas S, et al. Development of a prediction model for Ebola Virus disease: a retrospective study in Nzérékoré Ebola treatment center, Guinea. Am J Trop Med Hyg. 2016;95(6):1362–7. doi: 10.4269/ajtmh.16-0026
16. Levine AC, Shetty PP, Burbach R, Cheemalapati S, Glavis-Bloom J, Wiskel T, et al. Derivation and Internal Validation of the Ebola Prediction Score for Risk Stratification of Patients With Suspected Ebola Virus Disease. Ann Emerg Med. 2015;66(3):285-293.e1. doi: 10.1016/j.annemergmed.2015.03.011
17. Fitzgerald F, Wing K, Naveed A, Gbessay M, Ross JCG, Checchi F, et al. Development of a pediatric ebola predictive score, Sierra Leone. Emerg Infect Dis. 2018;24(2):311–9. doi: 10.3201/eid2402.171018
18. Collett D. Modelling Survival Data in Medical Research. Chapman and Hall/CRC. 2014. doi: 10.1201/b18041
19. Hosmer DWJ, Lemeshow S, Sturdivant RX. Applied logistic regression. 3rd ed. Hoboken (NJ): Wiley. 2013.
20. IBM Corp. IBM SPSS Regression 26 [Internet]. Armonk (NY): IBM Corp; 2019. [cited 2025 Jun 18]. Available from: https://www.ibm.com/docs/en/SSLVMB_26.0.0/pdf/fr/IBM_SPSS_Regression.pdf
21. Bu F, Deng X-H, Zhan N-N, Cheng H, Wang Z-L, Tang L, et al. Development and validation of a risk prediction model for frailty in patients with diabetes. BMC Geriatr. 2023;23(1):172. doi: 10.1186/s12877-023-03823-3
22. Tshomba AO, Mukadi-Bamuleka D-R, De Weggheleire A, Tshiani OM, Kitenge RO, Kayembe CT, et al. Development of Ebola virus disease prediction scores: Screening tools for Ebola suspects at the triage-point during an outbreak. PLoS One. 2022;17(12):e0278678. doi: 10.1371/journal.pone.0278678
23. Tshomba AO, Mukadi-Bamuleka D, De Weggheleire A, Tshiani OM, Kayembe CT, Mbala-Kingebeni P, et al. Cost-effectiveness of incorporating Ebola prediction score tools and rapid diagnostic tests into a screening algorithm: A decision analytic model. PLoS One. 2023;18(10):e0293077. doi: 10.1371/journal.pone.0293077

PLOS Glob Public Health. doi: 10.1371/journal.pgph.0004901.r001

Decision Letter 0

Vishal Goyal

PGPH-D-24-02895

Dynamic Modeling of Mortality Risk Factors in Ebola Virus Disease Using Logistic Regression on Unbalanced Panel Data from a Randomized Controlled Trial in the Democratic Republic of Congo

PLOS Global Public Health

Dear Mr. Leader Lawanga Ontshick,

Thank you for submitting your manuscript to PLOS Global Public Health. After careful consideration, we feel that it has merit but does not fully meet PLOS Global Public Health’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by 30 April 2025. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at globalpubhealth@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pgph/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Dr Vishal Goyal

Academic Editor

PLOS Global Public Health

Journal Requirements:

1. We have noticed that you have uploaded Supporting Information files, but you have not included a list of legends. Please add a full list of legends for your Supporting Information files after the references list.

Comments to the Author

**********

Reviewer #1:

Reviewer’s Comments on PGPH-D-24-02895

The authors studied “Dynamic Modeling of Mortality Risk Factors in Ebola Virus Disease Using Logistic Regression on Unbalanced Panel Data from a Randomized Controlled Trial in the Democratic Republic of Congo.” The findings highlight the potential of personalized care tailored to patients' evolving clinical profiles to significantly improve therapeutic outcomes and reduce mortality. Continuous monitoring and real-time adjustments in clinical management could help optimize care and mitigate severe complications effectively. The work seems to be good.

However, in my view, the authors need to consider the following suggestions (comments) to improve this article.

1. There are some grammatical errors in the manuscript. I think it needs to be polished further, and some typos need to be revised. Further, punctuation marks should be checked throughout the paper, especially after the equations. Authors need to correct them throughout the manuscript. For instance,

i. In line 232, there should be a full stop behind the validation. And all other typos should be corrected

2. All equations must be well punctuated, centered and typed well. For instance, equations in lines 214, 216, 218 should be well. The fraction in line 222 must be typed in the form A=D/C.

3. Authors should clearly state their motivation for this research in the introduction section.

4. According to the authors at lines 118 and 119, “Many studies have employed traditional logistic regression models that are typically based on a single data collection point…” Yet, they cited only two papers at the review or related work section. Authors should expand the review of literature, compare their current research to the existing literature, and clearly state the innovation in this research in the introduction section.

5. Authors should indicate the source of the parameter values used for their simulations. If taken from literature, then they should cite the papers in the current study.

6. Authors should indicate limitations of their research

7. Authors should indicate the future direction of their research.

8. Authors should state the implication of their findings/results

9. Titles/ Labels of all figures(graphs) should be given.

Reviewer #2: At D0, six significant predictors of mortality were determined, although seven are listed.

At Day7, reference is made to five new predictors. Not all of these are entirely new, as two were previously identified at D0. Would suggest rewording the use of “new predictors”

Line 155: Not sure the significance of it. It is confusing

Line 206: Shouldn’t read “further variables can be removed” because of the top-down elimination process?

Line 245: Update PALM to “PALM-RCT”

Line 265: Consider using “age was consistent across the visits…” vs “consistent distribution between 22 and 29 years of age”

Ensure consistent use of AST vs ASAT, ALT vs ALAT, Haemorrhage (check consistent and spelling throughout the document), Difficulty breathing vs SoB, …

Line 268: Creatinine levels remained at 0.6 at D21 and D28.

Lines 283 – 285: Re-calculate the percentages. For example, 265 deaths of 617 = 43%?

Table 2: Please show the number of participants associated with the Biochemical parameters.

Was Mental state captured at D0? Would be good to know what its inclusion means.

Was Generalized Linear Mixed Model considered for the modelling?

I think we could describe the statistical methods clearly without the formulas.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: Response to Reviewers.docx

pgph.0004901.s002.docx (21.4 KB, docx)

PLOS Glob Public Health. doi: 10.1371/journal.pgph.0004901.r002

Decision Letter 1

Vishal Goyal

PGPH-D-24-02895R1

Dynamic Modeling of Mortality Risk Factors in Ebola Virus Disease Using Logistic Regression on Unbalanced Panel Data from a Randomized Controlled Trial in the Democratic Republic of Congo

PLOS Global Public Health

Dear Mr Leader Lawanga Ontshick,

Please submit your revised manuscript by June 26 2025. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at globalpubhealth@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pgph/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Dr Vishal Goyal

Academic Editor

PLOS Global Public Health

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Reviewers Comments to the Author

Reviewer #1: Reviewer’s Comments (PGPH-D-24-02895R1)

The authors studied the Dynamic Modeling of Mortality Risk Factors in Ebola Virus Disease Using Logistic Regression on Unbalanced Panel Data from a Randomized Controlled Trial in the Democratic Republic of Congo.

I have checked the revised version of the manuscript according to the comments given, but I still have a few suggestions to be addressed by the authors. These are:

Comment 1: Keywords

Authors should use either “EVD” or “Ebola Virus Disease”, but not “Ebola Virus Disease (EVD)”, within the Keywords.

Comment 2: Equation formatting

All equations involving fractions in the manuscript should be written in the form A= B/C (B/C should be vertical), but not (A=B)/C. Authors should please address that.

Comment 3: Limitations and Future direction

a. Limitations and Future directions should be part of the conclusion. Authors should summarize them and add them to the ending section of the conclusion, just before the references.

b. Limitations should be stated first before suggestions for future considerations.

c. Suggestions for future considerations should be stated in points just before the references.

Conclusion: The manuscript can be accepted after these minor revisions have been made to be published in PLOS Global Public Health.

Reviewer #2: There are some occurrences of both PALM001-RCT and PALM-RCT. Please review this for consistency.

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Attachment

Submitted filename: The Editor Plos Global Health R1.docx

pgph.0004901.s003.docx (12.7 KB, docx)

Attachment

Submitted filename: The Editor Plos Global Health R1 (1).pdf

pgph.0004901.s004.pdf (398 KB, pdf)

PLOS Glob Public Health. doi: 10.1371/journal.pgph.0004901.r003

Decision Letter 2

Vishal Goyal

Dynamic Modeling of Mortality Risk Factors in Ebola Virus Disease Using Logistic Regression on Unbalanced Panel Data from a Randomized Controlled Trial in the Democratic Republic of Congo

PGPH-D-24-02895R2

Dear Mr Leader Lawanga Ontshick,

We are pleased to inform you that your manuscript 'Dynamic Modeling of Mortality Risk Factors in Ebola Virus Disease Using Logistic Regression on Unbalanced Panel Data from a Randomized Controlled Trial in the Democratic Republic of Congo' has been provisionally accepted for publication in PLOS Global Public Health.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they'll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact globalpubhealth@plos.org.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Global Public Health.

Best regards,

Vishal Goyal

Academic Editor

PLOS Global Public Health

Attachment

Submitted filename: Response to Reviewers.pdf

pgph.0004901.s005.pdf (145.6 KB, pdf)


