Skip to main content
Gates Open Research logoLink to Gates Open Research
. 2024 Aug 30;6:143. Originally published 2022 Nov 11. [Version 2] doi: 10.12688/gatesopenres.14041.2

Characterization of longitudinal nasopharyngeal microbiome patterns in maternally HIV-exposed Zambian infants

Aubrey R Odom 1,a, Christopher J Gill 2, Rachel Pieciak 2, Arshad Ismail 3,4, Donald Thea 2, William B MacLeod 2, W Evan Johnson 1,5, Rotem Lapidot 6,7
PMCID: PMC11427455  PMID: 39345284

Version Changes

Revised. Amendments from Version 1

Several updates have been made to the text overall, including clarifications on study limitations and more descriptive language throughout the text and captions. Some portions of text have been rearranged to more appropriate sections.

Abstract

Background

Previous studies of infants born to HIV-positive mothers have linked HIV exposure to poor outcomes from gastrointestinal and respiratory illnesses, and to overall increased mortality rates. The mechanism behind this is unknown, but it is possible that differences in the nasopharyngeal (NP) microbiome between infants who are HIV-unexposed or HIV-exposed could play a role in perpetuating some outcomes.

Methods

We conducted a longitudinal analysis of 170 NP swabs of healthy infants who are HIV-exposed (n=10) infants and their HIV(+) mothers, and infants who are HIV-unexposed, uninfected (HUU; n=10) .and their HIV(-) mothers. These swabs were identified from a sample library collected in Lusaka, Zambia between 2015 and 2016. Using 16S rRNA gene sequencing, we characterized the maturation of the microbiome over the first 14 weeks of life to determine what quantifiable differences exist between HIV-exposed and HUU infants, and what patterns are reflected in the mothers' NP microbiomes.

Results

In both HIV-exposed and HUU infants, Staphylococcus and Corynebacterium began as primary colonizers of the NP microbiome but were in time replaced by Dolosigranulum, Streptococcus, Moraxella and Haemophilus. When evaluating the interaction between HIV exposure status and time of sampling among infants, the microbe Staphylococcus haemolyticus showed a distinctive high association with HIV exposure at birth. When comparing infants to their mothers with paired analyses, HIV-exposed infants’ NP microbiome composition was only slightly different from their HIV(+) mothers at birth or 14 weeks, including in their carriage of S. pneumoniae, H. influenzae, and S. haemolyticus.

Conclusions

Our analyses indicate that the HIV-exposed infants in our study exhibit subtle differences in the NP microbial composition throughout the sampling interval. Given our results and the sampling limitations of our study, we believe that further research must be conducted in order to confidently understand the relationship between HIV exposure and infants’ NP microbiomes.

Keywords: nasopharyngeal microbiome, longitudinal cohort study, microbial communities, HIV exposure, children

Introduction

Currently, more than one million infants are born to women with HIV [HIV(+)] worldwide every year 1 . Fortunately, with antiretroviral treatment for mothers and prophylaxis for their infants, the vast majority of infants with HIV exposure will not become infected with HIV 2 . However, prevention of mother-to-child transmission (PMTCT) does not eradicate health disparities between infants who are HIV-exposed, uninfected (HIV-exposed) or HIV-unexposed, uninfected (HUU) by eliminating HIV transmission. Data suggests these children are still directly or indirectly affected by their mother’s HIV status. For example, recent meta-analyses published by members of our team reported a 60% increased risk of death 3 and an increased risk of pneumonia and diarrhea 4 among HIV-exposed compared with HUU children, thereby supporting the observed phenomenon that infants are vulnerable not only to morbidity, but also to increased mortality. These findings have also been observed in numerous other studies that have shown the linkage between poor outcomes and gastrointestinal and respiratory illnesses 59 . Hypothesized explanations include dysregulation of passive immunity via maternal antibodies, changes in the maturation of infant lymphocytes, exposure to microbes present in the mother’s birth canal at delivery, and/or social factors, which may have an impact on morbidity and mortality rates in the early stages of life.

Alternatively, it remains possible that many of these reported health differences are merely artifacts of various selection biases. Previous studies on the effects of HIV exposure were cross sectional studies based on convenience sampling and/or lacked precision due to small sample sizes 3, 4 . Further, few such studies were longitudinal, making it difficult to observe changes over time 3, 4 . When studying the microbiome, integral to the early development of the immune system, the lack of longitudinal structure is a crucial limitation given that it evolves dynamically over the first weeks of life, as both we and others have previously demonstrated 10, 11 . Thus, it remains possible that much of the HIV-exposed ‘phenomenon’ of increased morbidity and/or mortality rates as currently described could be due to sociological factors, or simply selection bias. To better understand the potential biological basis for this phenomenon, unbiased, systematic data are required to build confidence in its existence.

In the gut, interactions between the microbial community and the host influence the development of the immune system and, consequently, the development of diseases 12 . While much is known about the evolution of the intestinal microbiome, far less is known about the dynamics surrounding the microbiome of the upper respiratory tract, which plays an important role in respiratory health. If the increased rates of respiratory disease observed among HIV-exposed children have a biological explanation, we hypothesized that the respiratory microbiomes of HIV-exposed infants would differ systematically from HUU infants over the first few months of life, potentially explaining their greater susceptibility to certain respiratory diseases. Previous studies on the impact of HIV exposure have highlighted strong correlations between HIV exposure and increased risk of pneumococcal colonization and disease 7, 13, 14 , whereas others have demonstrated no differences in pathogen carriage between children with HIV infection and control groups 1517 . The current analysis seeks to address this important knowledge gap.

Given the complex dynamics of interactions between the host, microbes, and environment beginning at birth, we conducted an exploratory longitudinal comparison of the nasopharyngeal (NP) microbiomes of HIV-exposed and HUU infants and their mothers during the first 14 weeks of life. We reasoned that by characterizing quantifiable differences in NP microbiota distinguishing these two groups, this pilot study could indicate differences encouraging future immunology-focused studies.

Methods

Cohort characteristics

A subset of 170 NP swabs of 20 infants and their mothers were identified from a sample library collected in Lusaka, Zambia between 2015 and 2016. The sample library was part of a nested time-series case comparator study within the prospective longitudinal Southern Africa Infant-mother Pertussis study (SAMIPS) 18 . From 1,981 total mother infant pairs, we selected a subset that had 3 or more study visits, had no siblings under the age of five years, and who enrolled in the study from April to July 2015. From this subset, we randomly selected 10 mother infant pairs with an HIV-positive [HIV(+)] mother who started antiretroviral therapy (ART) before pregnancy. These 10 mother infant pairs were randomly matched to 10 mother infant pairs with an HIV negative mother by education, month of entry into the study, and maternal age. For the cohort, infants were included if they were otherwise ‘healthy’ when screened at one week of age. Healthy infants were born at term (>37 weeks); not underweight (>2500 grams); had no acute or chronic conditions known at the time of enrollment; were not born via cesarean section; and had no known complications during pregnancy or labor and delivery. None of the mothers in the cohort experienced obstructed labor, sepsis, or hemorrhage complications during the labor and delivery period.

The institutional review boards at Boston Medical Center and Excellence in Research Ethics and Science Converge in Lusaka jointly provided ethical oversight (The ERES Converge, Lusaka. REF# 2015-Jan-002, Date: 01/02/2015; BUMC IRB, Boston. # H-33521, Date: 12/12/2014). All mothers provided written informed consent, with consent forms presented in English and the two dominant vernacular languages spoken in Lusaka: Bemba and Nyanja. The present analysis uses HIV-exposed ( n=10) infants and their HIV(+) mothers alongside HUU healthy control ( n=10) infants and their mothers who are HIV-negative [HIV(-)] collected as part of the SAMIPS study. Infant-mother pairs were recruited during their first scheduled postpartum well-child visit at approximately one week of age. Infants and their mothers were enrolled from the Chawama Primary Health Clinic (PHC) in Chawama compound, a densely populated peri-urban area near central Lusaka. Chawama PHC is the only government-supported clinic in this community and is the primary source of medical care for Chawama residents, allowing for maximal study reach. NP swabs were obtained from infants at enrollment and approximately every two to three weeks thereafter through 14 weeks of age for a total of seven scheduled time points each. Mothers’ samples were gathered at all time points at which the infants were swabbed, but only t=0 and t=6 at weeks 0–2 and 12–14 were sequenced for analysis for a total of 40 samples.

HIV(+) mothers enrolled in the SAMIPS study were required to be on ART to prevent mother-child transmission. Among the mothers in our immediate cohort, 50% (10/20) were HIV-infected, of whom 100% had initiated ART prior to conception. Given that the SAMIPS study was designed to be focused on Pertussis incidence, CD4 counts were not collected from study subjects, and assessment of mother’s HIV status relied upon previous testing done at the clinic. Neither viral load nor CD4 testing were routinely performed in Lusaka at this time and neither of these were deemed essential for the purposes of the original study beyond noting the mothers’ HIV status. Although data on maternal HIV status was available, final HIV status could not be ascertained on the infants themselves, which typically is not possible until the infant is four to six months of age. The study also did not collect information on breastfeeding or consumption of other foods in the study timespan, although formula is rarely used in Zambia and breastfeeding is nearly universal at such young ages 19 . As all mothers received ART during pregnancy, pooled transmission rates of breastfeeding mothers would be about 3.54% (95% CI: 1.15–5.93%) at the six month mark 20 . Therefore, it can be assumed that infants in this study becoming HIV(+) would be rare, and so all infants born to an HIV(+) mother were classified here as being HIV-exposed with two unknown subpopulations of HIV-exposed and HIV(+) infants. All enrolled infants received the pentavalent and pneumococcal vaccines at ages six, 10 and 14 weeks, which offer protection against 10 pneumococcal serotypes and Haemophilus influenzae type B. Additional information about the study structure and sampling methods can be found in Gill et al. (2016) 18 .

Sample processing and storage

NP swabs were obtained from the posterior nasopharynx using a sterile flocked tipped nylon swab (Copan Diagnostics, Merrieta, California). The swabs were then placed in universal transport media, put on ice and transferred to our onsite lab on the same campus, where they were aliquoted and stored at -80°C until DNA extraction. DNA was extracted using the NucliSENS EasyMagG System (bioMérieux, Marcy l’Etoile, France). Extracted DNA was stored at our lab located at the University Teaching Hospital in Lusaka at -80°C. Sample collection, processing and storage were previously described (Gill et al., 2016) 18 .

16S ribosomal DNA amplification and MiSeq sequencing

For 16S library preparations, two PCR reactions were completed on the template DNA. Initially the DNA was amplified using universal bacterial primers 21 specific to the V3–V4 region of the 16S rRNA gene 21 . Library preparation was performed according to the standard instructions of the 16S Metagenomic Sequencing Library Preparation protocol (Illumina, USA). The 16S primer pairs incorporated the Illumina overhang adaptor (16S forward primer 5’-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCTACGGGNGGCWGCAG-3’; 16S reverse primer 5’-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACTACHVGGGTATCTAATCC-3’)

Each PCR reaction contained DNA template (~12 ng), 5µℓ forward primer (1μM), 5 µℓ reverse primer (1μM), 12.5 µℓ 2 X Kapa HiFi Hotstart ready mix (KAPA Biosystems Woburn, MA), and PCR grade water to a final volume of 25µℓ. PCR amplification was carried out as follows: heated lid 110°C, 95°C for 3 min, 25 cycles of 95°C for 30s, 55°C for 30s, 72°C for 30s, then 72°C for 5 min and held at 4°C. Negative control reactions without any template DNA were carried out simultaneously.

The size of the amplicons was then visualized using the 4200 TapeStation (Agilent Technologies, Germany). Successful PCR products were cleaned using AMPure XP magnetic bead-based purification (Beckman Coulter, IN). The IDT for Illumina Nextera DNA UD Indexes kit (Illumina, San Diego, CA) with unique dual index adapters were used to allow for multiplexing. Each PCR reaction contained purified DNA (5 μℓ), 10 μℓ index primer mix, 25 μℓ 2X Kapa HiFi Hot Start Ready mix and 10 μℓ PCR grade water. PCR reactions were performed on a Bio-Rad C1000 Thermal Cycler (Bio-Rad, Hercules, CA) Cycling conditions consisted of one cycle of 95°C for 3 min, followed by eight cycles of 95°C for 30 s, 55°C for 30 s and 72°C for 30 s, followed by a final extension cycle of 72°C for 5 min. PCR products of negative controls were confirmed negative on Agilent TapeStation (no band observed).

Prior to library pooling, the indexed libraries were purified with Ampure XP beads and quantified using the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific, Waltham, MA). Purified amplicons were run on the Agilent TapeStation (Agilent Technologies, Germany) for quality analysis before sequencing. The sample pool (2 nM) was denatured with 0.2N NaOH, then diluted to 4 pM and combined with 10% (v/v) denatured 20 pM PhiX, prepared following Illumina guidelines. Libraries were then sequenced on the Illumina MiSeq sequencing platform (Illumina, USA) at the Sequencing Core Facility, National Institute for Communicable Diseases (NICD) of the National Health Laboratory Service, South Africa, using a 2 x 300 cycle V3 kit, following standard Illumina sequencing protocols. Negative controls were sequenced as well, resulting in extremely low reads that were not further analyzed.

In addition to using negative controls, all samples were processed at random to account for reagent contamination. Lab technicians were blinded to the timing of sample collection and clinical data.

We assessed the quality of the sequencing data using FastQC v0.11.9 22 . Trimmomatic 23 v0.39 was used to trim Illumina adapters and remove low-quality sequences. We performed a sliding window trim, cutting once when the average quality score within a window of six bases falls below 15. We removed both leading and trailing low quality or N bases below quality six. All other parameters used the default settings.

Sequencing data were aligned to bacterial genomes and profiled using the PathoScope 2.0 pipeline 24 . All RefSeq representative bacterial genomes available as of November 2, 2018 were used as a PathoScope reference library. We obtained target read counts delineated by National Center for Biotechnology Information (NCBI) unique identifiers (UIDs) and matched them to the NCBI Taxonomy database to retrieve accurate taxonomic hierarchy information. We then aggregated reads by genera. Data were transformed to relative abundances using counts per million (CPM) and normalized using log CPM for subsequent analyses. In most cases, taxa belonging to genera with average relative abundances of less than 1% were grouped as “Other” in analyses. All code has been made available via Zenodo 25 .

Visualizing longitudinal trends in NP microbiome composition

Microbial abundances across sample groups were visualized using alluvial diagrams and stacked bar plots using the R packages ggplot2 v3.3.5 and alluvial v0.2-0. The alluvial diagrams illustrate individual genera as stream fields that change position at different time points. The height of a stream field represents the relative abundance of that taxon. At a given time point, stream fields are ranked from the highest to lowest abundance (top to bottom). These were plotted for infants according to HIV exposure status over several time points. Stacked bar plots were used to visualize the relative abundance of microbes at a given taxonomic level in each sample, represented as a single bar, labeled by time point, and plotted within each HIV exposure status group for separate mothers and infant comparisons. These plotting and diagramming techniques allow for an efficient overview of the types of differences inherently present in the data at the group level.

Modeling effects of HIV status on infant NP microbiome composition across time

Generalized estimating equations (GEEs) as described in Liang and Zeger (1986) 26 and extended by Agresti (2002) 27 have been widely used for modeling longitudinal data 28 , and more recently for longitudinal microbiome data 29, 30 . For each genus present in the microbial aggregate of samples, we modeled normalized log CPM relative taxon counts, estimating the effects of time point and HIV exposure status and their interaction, while accounting for the underlying structure of clusters formed by individual subjects. We defined the responses Y 1, Y 2,..., Y n as the collection of infant relative abundances for a given taxon in log CPM, with n = 129. We identified the mean model µ ij for the i th infant and j th timepoint. With regression parameters β k representing HIV exposure status and time point, and the AR(1) variance structure V i , we formed the estimating equation:

U(β)=i=1nμiβVi1{Yiμi(β)}.

This then becomes an optimization problem, such that solving U( β) = 0 estimates the parameters β k . We modeled abundances for all 12 genera and nine of the top species. The link function g was chosen to be a Gaussian link. It was assumed that these abundances are correlated within infants for the various sampling time points. As such, we accounted for this with a first-order autoregressive AR(1) working correlation structure with homogenous variances such that the correlation between adjacent time points was assumed to be more similar. Parameter estimates were collected from each model, along with Wald test p-values. A Bonferroni correction was applied to account for multiple hypothesis testing, with an initial α=0.05. Due to the conservative nature of the Bonferroni correction, adjusted p-values between 0.05 and 0.10 were noted as marginally significant as effects that could retain some level of practical significance. Models were created in R using geepack v1.3.10 31 .

Testing differences in NP microbiome composition across time and cohort groups

Hotelling’s T 2 tests 32 were used to determine whether the microbiome profiles exhibited notable differences or trends across time and groups. Student’s t-tests were used to identify which genera contributed most to these differences. Groups of mothers and infants or HIV-exposed and HUU infants are designated as the two sampling units on which the relative abundances of the p most abundant genera were measured. For paired tests, we chose p = 6 variables to ensure that n < p so that singularity could be avoided and T 2 could be properly computed, where n is the number of measurements in a sampling unit. We tested the hypotheses

H0:μy=μxvs.HA:μyμx

which, in the paired case given μ diff = μ y μ x , is equivalent to

H0:μdiff=0vs.HA:μdiff0

to conduct a comparison between groups for the p most abundant genera for testing groups. The population means μ x and μ y represent the two sampling units of a given test, which were generally either μ mothers and μ infants or μ HUU and μ HEU . We treated samples as paired in both cases such that the mothers are paired with their own infants or HIV-exposed and HUU infants were paired according to the pre-analysis matching schema. When testing HIV positive mothers versus HIV negative mothers, we did not pair mothers and instead relied on a standard two-sample test. We assumed that the relevant conditions for testing are met, meaning that the groups are correlated and have a multivariate normal distribution. Normality assumptions are met by using microbe abundances in log CPM units.

We conducted three main groups of testing with the T 2 statistic. The first paired test was between infant-mother pairings at the first or last time point ( t ∈ [0, 6]) and for either the infant/mother HIV-exposed/HIV(+) or HUU/HIV(-) groups for a total of four tests. These tests offer a clear between-group comparison while accounting for similarity in the infant-mother dyads and calculating genera-specific differences in ruling whether the hypotheses are met. The second testing approach consisted of seven tests. We compared relative abundances of the six most highly abundant genera at each of the seven time points ( t ∈ [0, 1,..., 6]) from paired HIV-exposed and HUU infants. The final testing approach compared all HIV(+) and HIV(-) mothers’ samples for all 12 genera across mothers, including the “Other” designation as described previously. All samples were unpaired and therefore utilized the unpaired multivariate T 2 generalization.

Test statistics were calculated to test whether microbiome profiles in paired infants were notably different among the seven time points, and whether HIV-exposed mothers and infants had similar profiles at t=0 and t=6. After calculating the T 2 test statistic, we calculate an equivalent F p,n–p test statistic distributed with p and np degrees of freedom and use this distribution to calculate the test’s p-value:

p-value=Pr{Fp,np+1>np+1npT2}

Paired or two-sample t-tests were used to distinguish which genera are most important to the differences identified with Hotelling’s T 2 tests, and α-levels were adjusted after performing the p tests by using a Bonferroni critical value to reduce Type I error rate inflation.

Beta diversity using the Bray-Curtis dissimilarity metric was compared using a non-parametric Wilcoxon rank-sum test between (1) HIV-exposed and HUU infants, and (2) between HIV(+) and HIV(-) mothers. Tests were then conducted separately for each subset of samples for (3) t=0 and (4) t=6 for a total of four tests. The null hypothesis conjectures that the distributions of both populations are equal under the general assumption that all observations from both groups are independent of each other.

All analyses were performed using R Statistical Software (v4.2.1; R Core Team 2022).

Results

Sample characteristics

Characteristics of the study cohort are delineated in Table 1.

Table 1. Baseline demographic characteristics of the full infant cohort, stratified by HIV exposure status.

IQR refers to the interquartile range, BCG refers to the Bacille Calmette-Guérin vaccine for tuberculosis disease, and OPV refers to oral polio vaccine.

Parameter HIV-Exposed HIV-Unexposed All Subjects
Infant Demographics
Number of Infants 10 10 20
Male Sex % (n) 50.0% (5/10) 40.0% (4/10) 45.0% (9/20)
Median Age in Weeks at Enrollment (IQR) 0.86 (0.86–1) 0.93 (0.86–1) 0.86 (0.86–1)
Median Estimated Gestational Age at Delivery (weeks) (IQR) 40.0 (38–40) 40.0 (39–40) 40.0 (39–40)
Median Birth Weight (IQR) 3050.0 (2800–3200) 3100.0 (3000–3100) 3100.0 (2900–3200)
Maternal Demographics
Number of Mothers 10 10 20
Median Age in Years at Enrollment (IQR) 34.0 (29–36) 34.0 (30–37) 34.0 (30–37)
Married % (n) 100.0% (10/10) 100.0% (10/10) 100.0% (20/20)
Immunizations at birth
BCG % (n) 30.0% (3/10) 50.0% (5/10) 40.0% (8/20)
OPV % (n) 30.0% (3/10) 20.0% (2/10) 25.0% (5/20)
Labor and Delivery Complications
Birth Asphyxia % (n) 10.0% (1/10) 0.0% (0/10) 5.0% (1/20)
Maternal Immunization
Median Number of Tetanus Toxoid Doses (IQR) 5.0 (3–5) 5.0 (3–5) 5.0 (3–5)
Maternal HIV Status
Mother HIV(+) % (n) 100.0% (10/10) 0.0% (0/10) 50.0% (10/20)
Mother on ART % (n) 100.0% (10/10) . % (./0) 100.0% (10/10)
Mother on ART Prior to Pregnancy % (n) 100.0% (10/10) . % (./0) 100.0% (10/10)
Place of Birth
UTH % (n) 40.0% (4/10) 30.0% (3/10) 35.0% (7/20)
Chawama Clinic % (n) 60.0% (6/10) 60.0% (6/10) 60.0% (12/20)
Home Delivery % (n) 0.0% (0/10) 10.0% (1/10) 5.0% (1/20)
Household composition
Median Household Size (IQR) 5.5 (4–7) 6.0 (5–7) 6.0 (5–7)

Some missingness in the data is present; for ten samples, these visits either never occurred or swabs were not collected. An additional three samples were excluded from analysis because fewer than 10,000 reads aligned to RefSeq reference genomes. This left us with 129 infant samples and 38 mother samples for analysis. In total, 16/20 infants had data for all seven time points, 2/20 infants had only six time points, 1/20 infants had three time points, and 1/20 infants had only two time points. All infants had swabs for t=0 and t=1. Two of 20 mothers lacked swabs at t=6 ( Table 2).

Table 2. Counts and percentages of samples available for analysis from both infants and mothers.

These are stratified by time point, infant age in weeks, and HIV status.

Time point HUU Infant HIV(-) Mother HIV-Exposed Infant HIV(+) Mother Overall
(N=68) (N=19) (N=61) (N=19) (N=167)
0 10 (14.7%) 10 (52.6%) 10 (16.4%) 10 (52.6%) 40 (24.0%)
1 10 (14.7%) 0 (0%) 10 (16.4%) 0 (0%) 20 (12.0%)
2 10 (14.7%) 0 (0%) 9 (14.8%) 0 (0%) 19 (11.4%)
3 9 (13.2%) 0 (0%) 8 (13.1%) 0 (0%) 17 (10.2%)
4 9 (13.2%) 0 (0%) 8 (13.1%) 0 (0%) 17 (10.2%)
5 10 (14.7%) 0 (0%) 8 (13.1%) 0 (0%) 18 (10.8%)
6 10 (14.7%) 9 (47.4%) 8 (13.1%) 9 (47.4%) 36 (21.6%)

Although we included 10 HUU and 10 HIV-exposed infants and their mothers in the study, there was some small variation in numbers of samples at the different time points, but they were overall close in evenness. Infants were approximately 1 ± 0.14 (mean ± SD) weeks old at time of first swabbing and were on average 3.14 ± 0.28 weeks, 6 ± 0.42 weeks, 8.29 ± 0.29 weeks, 10.57 ± 0.29 weeks, 12.57 ± 0.29 weeks, and 15 ± 0.29 weeks respectively at subsequent time points t=1 through t=6.

Our FastQC analysis indicated that the overall sequencing quality was excellent, with mean Phred quality scores remaining greater than 25 (99.5% accuracy) for at least 175 bp for both forward and reverse reads. Trimmomatic removed less than 3% of reads in any given sample. The analysis covered 129 infant and 38 mother swab samples, with an average of 124,905 ± 299,120 (mean ± SE) reads per infant sample (max = 3,016,276; min = 13,218) and 70,463 ± 48,730 reads per mother sample (max = 208,029; min = 14,339). The read count for infant samples was significantly higher than that of mothers’ swabs (p = 0.02), which may in part be the result of mothers’ acquired immunity over time and therefore lower overall NP carriage. In our raw data, we uniquely identified 17 phyla, 647 genera, and 758 total species across all samples. Due to the high diversity of taxa with relatively low abundance among individual microbes, we labeled taxa with average relative abundances of less than 1% as “Other” at the genera and species level. Post-grouping, the present taxa were limited to three distinctly identifiable phyla, encompassing 12 genera and 87 species. The most abundant phyla were the Firmicutes (~56.5%), Proteobacteria (~22.4%), and Actinobacteria (~16.1%). The remaining 5% of reads were characterized as “Other.” At the genus level, the most abundant groups were the Dolosigranulum (~23.5%), Staphylococcus (17.2%), Corynebacterium (~16.1%), Streptococcus (~13.5%) and Moraxella (~12.4%) genera.

Visualizing longitudinal trends in NP microbiome composition

Longitudinal trends in NP microbiome composition by HIV exposure status are depicted in Figures 1A and 1B for infants and mothers, respectively. The alluvial plot in Figure 1C graphically depicts the most abundant genera in log CPM present in HIV-exposed and HUU infants. As relative abundances (in log CPM) change over time, the position of a flow stream representing a single genus may change position relative to the other genera. At a given time point, the flow streams are stacked according to the abundance relative to other streams.For both the HEU and HUU infant groups Across all infant samples, the respiratory microbiome during the first months of life is dominated by Staphylococci and Corynebacteria. Early on, we observed the emergence of more typical respiratory bacteria such as Moraxella and Streptococcus sp. Additionally, the commensal Dolosigranulum emerges as a dominant member of the microbiome within the first weeks of life. Haemophilus appeared later at around four to six weeks.

Figure 1.

Figure 1.

The maturation over 17 weeks of the NP microbiomes of A) healthy HIV-exposed (n=10) and HUU infants (n=10) and B) HIV(+) (n=10) and HIV(-) mothers (n=10). These stacked bar plots reveal variation in the relative abundance of microbes between groups of either infants or mothers clustered at the genus level. Each bar represents a single time point binned by age and is the average of ~10 samples. Genera with an average relative abundance of <1% across all samples are labeled as “Other.” The alluvial plot in C) depicts the changing presence of genera across all infant samples by HIV exposure status. As relative abundances change over time, the position of a flow stream representing a single genus may change position relative to the other genera.

Effect of HIV status on infant NP microbiome composition across time

While longitudinal trends were strongly apparent across infant groups, the observed differences between the HIV-exposed and HUU infants were subtle. Figure 1A illustrates higher amounts of Streptococcus and suppression of Dolosigranulum among HIV-exposed vs. HUU infants. Furthermore, Figure 1C suggests increasing amounts of Streptococcus and Haemophilus over time for HIV-exposed infants, whereas Staphylococcus largely declines after t=3 in both groups.

GEEs revealed some differences among genera and species for time point and HIV exposure status when adjusting for subject variation and the interaction effect. Models were created for all 12 genera, including “Other” genera, and for the top nine species that averaged greater than 1% abundance across all infant samples. Of all taxa tested, four of nine species and six of 12 genera exhibited substantial instability across time points ( Table 3). One species, Staphylococcus haemolyticus, was highly associated with HIV exposure in infants (p <0.01, adj. p = 0.01). Additionally, Streptococcus mitis indicated a strong interaction effect between time and HIV exposure status (p <0.01, adj. p = 0.04). Figure 2 shows the estimated marginal means of the time and HIV-exposure status effects for S. haemolyticus, S. mitis, Haemophilus influenzae, and the Dolosigranulum genus. We chose to include these microbes in the figure as a representative selection of microbes with significant and non-significant effects. In Figure 2a, differences in the abundance of S. haemolyticus for HIV-exposed and HUU infants is cleanly pronounced for several time points, reflecting the Wald test result, but this is not the case for the other microbes in Figures 2b, 2c, or 2d. All of these had at least marginally significant p-values when testing the time point effect but lacked strong HIV exposure status effects.

Figure 2. Plots of estimated marginal means of the abundance of a given microbe, given the estimated GEE effects of time point of sampling (x axis) and HIV status (line color).

Figure 2.

A line illustrates the estimated change in microbe abundance over time, with positive or negative slopes illustrating increased or decreased estimated abundance in log CPM (respectively). HIV-exposed infants (denoted as HEU) are depicted in blue, and HIV-unexposed, uninfected infants (HUU) are depicted in red. Given the separate lines for each status and changing slope across time, these plots depict the interaction effect between time point and HIV status. All microbes had at least marginally significant time effects, but only S. haemolyticus had a very strong HIV exposure status effect.

Table 3. Table of P-values from Wald tests on GEE models.

The effects modeled were HIV status, and time point effects, with their interaction. Results are stratified by species and genus. All p-values less than alpha=0.05 are bolded.

Time point Coefficient Estimate Time point Unadjusted p-values Time point Adjusted p-values HIV Status Coefficient Estimate HIV Status Unadjusted p-values HIV Status Adjusted p-values Interaction Coefficient Estimate Interaction Unadjusted p-values Interaction Adjusted p-values
Genus
Acinetobacter -0.09 0.00 0.02 -0.06 0.78 1.00 0.04 0.29 1.00
Alkalihalophilus 0.18 0.01 0.07 0.67 0.15 1.00 -0.26 0.03 0.30
Anaerobacillus -0.11 0.19 1.00 0.77 0.12 1.00 -0.06 0.56 1.00
Corynebacterium -0.18 0.01 0.09 -0.39 0.11 1.00 0.12 0.18 1.00
Dolosigranulum 0.25 0.01 0.13 -0.46 0.35 1.00 0.09 0.46 1.00
Haemophilus 0.43 0.00 0.00 0.69 0.14 1.00 -0.18 0.23 1.00
Moraxella 0.35 0.00 0.00 0.54 0.21 1.00 -0.14 0.07 0.89
Other -0.09 0.03 0.30 -0.04 0.81 1.00 0.06 0.25 1.00
Paracoccus -0.04 0.44 1.00 -0.50 0.08 0.99 0.12 0.09 1.00
Pseudomonas -0.15 0.00 0.00 -0.52 0.19 1.00 0.17 0.03 0.40
Staphylococcus -0.38 0.00 0.00 -0.13 0.53 1.00 0.04 0.35 1.00
Streptococcus 0.13 0.00 0.01 -0.18 0.43 1.00 0.09 0.12 1.00
Species
Corynebacterium accolens -0.27 0.52 1.00 -2.34 0.26 1.00 0.70 0.18 1.00
Corynebacterium pseudodiphtheriticum 0.25 0.58 1.00 -1.91 0.30 1.00 0.51 0.34 1.00
Dolosigranulum pigrum 0.90 0.01 0.06 -1.82 0.26 1.00 0.33 0.42 1.00
Haemophilus influenzae 1.33 0.00 0.00 1.76 0.11 0.96 -0.44 0.30 1.00
Staphylococcus epidermidis -0.57 0.03 0.24 -0.07 0.97 1.00 -0.21 0.56 1.00
Staphylococcus haemolyticus -0.45 0.01 0.07 4.04 0.00 0.01 -0.62 0.07 0.59
Staphylococcus simiae -1.27 0.00 0.01 -4.49 0.11 1.00 0.88 0.08 0.69
Streptococcus mitis -1.24 0.00 0.00 -2.60 0.06 0.53 0.65 0.00 0.04
Streptococcus pneumoniae 1.81 0.00 0.00 1.85 0.16 1.00 -0.20 0.52 1.00

Hotelling’s T 2 test statistics were used to compare relative abundances of the top six most abundant genera between paired HIV-exposed and HUU infants at each time point, namely Dolosigranulum, Streptococcus, Moraxella, Staphylococcus, Corynebacterium, and Haemophilus. Paired infants did not present significantly different microbiome profiles at any of the time points at α = 0.05 ( Table 4). The largest differences seemed to occur at t=5 (p = 0.07).

Table 4. Table of Hotelling’s T 2 test results.

Tests compared log CPM relative genera abundances of paired HIV-exposed and HUU infants at all seven time points. The first and second degrees of freedom for the test are denoted by df 1 and df 2, respectively.

t=0 t=1 t=2 t=3 t=4 t=5 t=6
df 1 6 6 6 6 6 6 6
df 2 4 4 3 1 1 2 2
Critical F value 6.16 6.16 8.94 233.99 233.99 19.33 19.33
F statistic 0.90 1.99 0.47 3.41 22.66 14.56 1.26
p-value 0.57 0.26 0.80 0.39 0.16 0.07 0.50

We examined variations in overall microbiome composition by observing differences in beta diversity between HUU and HIV-exposed infants. We conducted two Wilcoxon rank-sum tests of the Bray-Curtis dissimilarity at t=0 and t=6. We found a non-significant difference in microbiome composition between HUU and HIV-exposed infants at t=0 (p = 0.68), but large differences at t=6 (p <0.01). Performing additional tests within the HIV-exposed and HUU groups themselves, we also identified heavy inter-subject variability at both t=0 and t=6 (all p-values < 0.01). As a result of this inter-subject variability, alpha diversity metrics were not performed for this analysis.

Associations within mother and infant duad NP microbiome composition over time

We also investigated the NP microbiome relationship within infant-mother duads. Paired Hotelling’s T 2 tests were used to test the log CPM of genera as a measure of relative abundance at t=0 and t=6 for the HIV-exposed/HIV(+) and HUU/HIV(-) groups ( Table 5). Genera tested were the same as listed for the infant-specific Hotelling’s tests. HIV-exposed/ HIV(+) infant-mother pairs had marginally significant profile differences at t=0 ( T6,42; p = 0.10) but were similar enough to not present as having significant profiles at t=6 ( T6,12; p = 0.60). HUU/HIV(-) infant-mother pairs also had marginally significant differences at the t=0 ( T6,42; p = 0.10) but varied notably at t=6 ( T6,32; p = 0.02). Paired t-tests for the six genera were separately conducted for HUU/HIV(-) infant-mother pairs to distinguish which genera are most important to the identified difference at that time point. The largest difference in abundance occurred for the Haemophilus ( t 8, p = 0.02; adj. p-value = 0.10) and Staphylococcus genera ( t 9; p-value <0.01; adj. p-value <0.01). HUU Infants were noted as having greater Haemophilus carriage than their HIV(-) mothers, whereas mothers had greater Staphylococcus carriage than their infants.

Table 5. Table of Hotelling’s T 2 test results.

Tests compared log CPM relative genera abundances at t=0 and t=6 for the HIV-exposed/HIV(+) and HUU/HIV(-) groups using infant-mother pairs.

Time point t=0 Time point t=6
HIV-exposed/HIV(+) HUU/HIV(-) HIV-exposed/HIV(+) HUU/HIV(-)
df 1 6 6 6 6
df 2 4 4 1 3
Critical F value 6.16 6.16 233.99 8.94
F Statistic 4.13 4.14 1.20 19.52
p-value 0.10 0.10 0.60 0.02

Although all infants in our cohort were free from illness, we found that certain pathogenic species were present in modest abundance across samples. To ascertain the possibility of HIV-exposed infants acquiring pathogens from their HIV(+) mothers, we conducted paired t-tests for three well-known pathogenic species: Streptococcus pneumoniae; Haemophilus influenzae; and Staphylococcus haemolyticus. Tests compared pairs at the t=0 and t=6 time points and within either HIV(+) or HIV(-) subgroups for a total of four different tests per species ( Table 6). Inequality in pathogen carriage was ascertained by the mean of differences in log CPM; positive values indicate higher abundance in mothers. H. influenzae was more abundant in HUU infants than in their HIV(-) mothers at t=6 (adj. p = 0.1), and S. haemolyticus was likely to be found in the HUU infants at t=0 in a higher concentration than in their HIV(-) mothers (adj. p = 0.1). S. pneumoniae was more likely to be found in HUU infants than their HIV(-) mothers at t=6 (adj. p = 0.1). Each of these findings were marginally significant.

Table 6. Results table from paired t-tests of log CPM relative abundances of well-known pathogenic species.

Tests compared infant-mother pairs at the first and last time points, and within either HIV or control subgroups, for a total of four different tests per species. Lower and upper 95% confidence interval bounds are denoted by the Lower CI and Upper CI columns. A positive mean of differences value indicates higher abundance of a pathogen in mothers. A Bonferroni correction was applied to account for multiple hypothesis testing (see adjusted p-values).

Species HIV Status Time point Mean of differences (log CPM) Lower CI Upper CI p-value Adjusted p-value
Streptococcus pneumoniae HIV-exposed/HIV(+) 0 2.03 0.26 3.80 0.03 0.4
6 -1.37 -3.47 0.73 0.16 1.0
HUU/HIV(-) 0 1.42 -0.43 3.28 0.12 1.0
6 -2.83 -4.86 -0.79 0.01 0.1
Haemophilus influenzae HIV-exposed/HIV(+) 0 0.16 -1.55 1.87 0.84 1.0
6 -1.08 -3.87 1.70 0.38 1.0
HUU/HIV(-) 0 0.09 -1.18 1.36 0.88 1.0
6 -1.57 -2.69 -0.45 0.01 0.1
Staphylococcus haemolyticus HIV-exposed/HIV(+) 0 -2.31 -4.36 -0.27 0.03 0.4
6 0.56 -0.94 2.06 0.40 1.0
HUU/HIV(-) 0 -1.81 -3.02 -0.61 0.01 0.1
6 0.54 -0.52 1.60 0.27 1.0

Effect of HIV status on mother NP microbial community composition

We compared HIV(+) and HIV(-) mothers at the t=0 and t=6 time points. We used the Bray-Curtis dissimilarity metric as a measure of beta diversity to compare compositional dissimilarity between NP microbiomes. We identified strong dissimilarity between HIV(+) and HIV(-) mothers at t=6 (p <0.01), but higher similarity at t=0 (p = 0.70). Additionally, the summed abundances of all twelve genera, including “Other” genera, were used to conduct a two-sample Hotelling’s T 2 test on unpaired mothers. The differences we observed were more noticeable at the latter ( T12,52; p = 0.04). Unpaired t-tests indicated that the “Other” taxa (p = 0.03, adj. p = 0.37), Alkalihalophilus (p = 0.08, adj. p = 1), and Paracoccus (p = 0.10, adj. p=1) were the largest contributors to this difference. These taxa were all more highly abundant in the HIV(-) mothers.

Discussion

Our study explored differences in the NP microbiome among Zambian HIV-exposed and HUU infants and their mothers over a three-month period. Our starting point for this analysis was to better understand the observed excess mortality and increased rates of respiratory disease among HIV-exposed infants. Assuming that such differences are not merely due to sampling biases, the chief hypotheses explaining them are that 1) HIV-exposed infants have subtle immunological deficits, or 2) these are the consequence of confounding due to environmental and sociological factors. These are not mutually exclusive, and we acknowledge that all drivers are likely inter-related and difficult to differentiate. We reasoned that they may be in turn associated with differences in the microbiota which could then serve as a quantifiable indicator of differences in respiratory health between HIV-exposed and HUU infants. Our analyses indicate that the HIV-exposed infants in our study exhibited subtle differences in the NP microbial composition throughout the sampling interval. Given our limited number of samples, it is within reason that these differences are a result of sample variation. Although we cannot exclude true differences between the populations, we are left with uncertain evidence that there is an HIV exposure effect on the NP microbiome in the first 14 weeks after birth.

To date, many studies have examined the effect of the microbiome of HIV(+) mothers on their infants and found noticeable differences. Bender et al. (2016) reported that although very few differences were apparent in the microbiomes of mothers with and without HIV infection, maternal HIV infection was associated with changes in the mouth, skin, and gut microbiome of HIV-exposed infants from Haiti 33 . Higher abundance of Pseudomonadaceae and Thermaceae, along with decreased bacterial diversity in stools of HIV-exposed infants was suggested as one mechanism that accounts for the immunologic derangements and poor growth observed in these children. It has been noted that the microbiome of mother and infant dyads reveals some associations with HIV infection 34 , particularly that infants’ microbiomes reflect the dysbiosis of their mothers, but how this dysbiosis is established in the HIV-exposed infant is poorly understood. Significantly higher bacterial diversities have been found in the fecal matter of HIV-exposed infants, compared to HIV-unexposed infants in an African cohort 35 . The relatively small number of studies looking specifically at the microbiome of the nasopharynx in HIV-exposed infants have found few changes associated with HIV infection 36 , even though increased risk of pneumococcal colonization and disease remains apparent for these infants 7, 13, 14 . Although dysbiosis seems to be a reasonable factor in this risk, the few available data contradict this hypothesis, demonstrating no differences in pathogen carriage between children with HIV infection and control groups 1517 .

Longitudinal trends in infant NP microbiome composition

Initial plots of the microbial relative abundances appeared to show dynamic changes in abundances of certain genera; it is apparent from Figure 1A and 1C that Staphylococcus and Corynebacterium began as primary colonizers of the NP microbiome but were replaced by Dolosigranulum, Streptococcus, Moraxella and Haemophilus over time. Overall, these transitions occurred in an orderly and stepwise pattern over time. These transitional patterns align with those found in a longitudinal East Asian infant cohort 37 .

Given the few studies conducted on the NP microbiome in Zambia, there are no conclusive baseline expectations for a typical microbiome composition. However, we have compiled the results of a published longitudinal study that analyzed NP swabs from a cohort of 234 healthy infants from Washington, D.C. 38 at ~two, ~six, and ~12 months ( Table 7). Slight differences appear to be present, which may be in part due to differences in living conditions, sample size, and choice of analysis database used. The Teo et al. study utilized the Greengenes database, which produces far less sensitive results on 16S amplicon sequencing data when compared to other databases such as RefSeq and Silva 39 . Overall, Corynebacterium, Moraxella, Streptococcus averages seem to be quite similar, with stark differences in Staphylococcus and Dolosigranulum abundances. While this study is not directly comparable to the Teo et al. study, it appears that the NP microbiome profiles observed here are similar to those seen in healthy infants of a similar age.

Table 7. Comparison of average genera relative abundance with an NP longitudinal study of a healthy infant cohort from Washington, D.C.

The number of samples, age group by month (m), sequenced 16S region, patient condition and genus mean relative proportions are enumerated. While the studies are not a one-to-one comparison, the relative abundances between studies appear to be similar. NR denotes statistic not reported in original paper.

Study Teo et al. (2015) 32 Teo et al. (2015) 32 This study This study
Number of samples 1,021 ~177 68 61
Age group 2–12 m 2 m only 0–3.5 m 0–3.5 m
16S Region V4 V4 V3–V4 V3–V4
Patient condition Healthy Healthy Healthy HIV-exposed
Microbe
Dolosigranulum 8.8% 14.0% 32.0% 12.4%
Streptococcus 14.0% 14.0% 8.9% 14.5%
Moraxella 31.2% 9.0% 10.7% 15.7%
Staphylococcus 10.3% 41.0% 19.7% 14.5%
Corynebacterium 13.5% 22.0% 17.9% 14.9%
Haemophilus 9.7% NR 4.9% 9.2%
Anaerobacillus NR NR 0.9% 2.0%
Paracoccus NR NR 1.2% 0.6%
Acinetobacter 13.0% NR 1.1% 0.7%
Pseudomonas NR NR 0.3% 3.6%
Alkahilophilus NR NR 0.2% 0.7%
Other NR NR 2.2% 5.6%

Effect of HIV status on infant NP microbiome composition across time

Our plots appear to show Dolosigranulum may have higher carriage with simultaneous lower carriage of Streptoccocus in HUU when compared to HIV-exposed infants. Some of these observations are supported by the GEE findings ( Table 3). For instance, some taxa appeared to be present in larger proportions at certain time points for HIV-exposed or HUU infants, but in general, taxa seemed to follow the same general trends in both groups. Regardless of HIV exposure status, it was apparent that as the child grows, an increasing amount of respiratory flora emerges. As time went on, we found that HIV-exposed infants diverged in their microflora profiles and diversity from the HUU infants, as was shown by the Wilcoxon test result at t=6. Interestingly, this was not verifiable in the multivariate Hotelling’s T-squared test that collectively tested the top six of the most prevalent genera over time. This indicates that differences may have occurred for individual genera at a given time point yet did not result in holistic trends involving several of the top genera. The largest difference seemed to occur at t=5. It would seem reasonable to suggest that at birth, infants’ microbiome profiles are more similar than different, and that the diversification of these profiles occurs over time; however, the lack of any considerable difference at t=6 disputes this argument.

When comparing HIV-exposed and HUU infants, the only microbe that appeared to have a distinct presence was that of Staphylococcus haemolyticus, one of the most frequent aetiological agents of staphylococcal infections 40 . The microbe indicated a distinctive high association with HIV exposure at birth and across time points, a finding confirmed by separate statistical tests. Staphylococcus haemolyticus has also been commonly found in the hospital setting, with a tendency to become resistant to multiple antibiotics 4143 . Ternes et al. (2013) found that 55.9% of infants harbored multidrug-resistant CNS in their nasal cavity, with S. haemolyticus being the most frequently isolated species (38.3%) 44 . However, it should be noted there is a possibility that our genomic identification process could be misidentifying what are actually S. aureus reads, which is one of the most common pathogens colonizing the nasopharynx and the lower airways 45, 46 . S. aureus carriage has been found to be significantly higher in RSV than infants infected with rhinovirus 47 . In a case series from India, NP carriage densities of Streptococcus pneumoniae and S. aureus were higher in both mothers and children living in HIV-affected households, regardless of the child's HIV status 48 .

From the stacked barplots, we observed that Dolosigranulum seemed to have a higher relative proportion at all time points in HUU infants, but this was not verified as a distinguishable difference in our statistical testing. The microbe Dolosigranulum pigrum is generally accepted as a marker of a healthy microbiome, positively associated with Corynebacterium and potentially protective against colonization by S. aureus and S. pneumoniae 49 . High Dolosigranulum carriage has been found to be correlated with positive outcomes of RSV 50 , COVID-19 51 , HIV exposure 52 , and Bronchiolitis 53 as a mediator in defense against illness, albeit the mechanism by which it does so is unclear.

Associations within mother and infant duad NP microbiome composition over time

When conducting infant-mother duad paired analyses, HIV-exposed infants’ NP microbiome composition was not vastly different from their HIV(+) mothers at birth or 14 weeks, including in their carriage of S. pneumoniae, H. influenzae, and S. haemolyticus. HUU infants were similar to their mothers at birth, but apparently grew apart from their HIV(-) mothers by 14 weeks as infants acquired more Haemophilus ( influenzae) and decreased in Staphylococcus haemolyticus carriage.

Effect of HIV status on mother NP microbial community composition

One of our study objectives was to compare the NP microbiomes of the HIV(+) and HIV(-) mothers at t=0 and t=6. From Figure 1B, we observed that HIV(+) mothers may have had higher Streptococcus abundance than HIV(-) mothers overall. A significant inter-group beta diversity test result at t=6 showed strong differences in taxa among HIV statuses, a finding reflected by the stacked bar plots indicating a larger presence of taxa with relatively small abundances in HIV(-) women (denoted as “Other” throughout our analyses). This finding implies more microbial diversity in the NP microbiomes of the HIV(-) women. In the nasopharynx, lower diversity has been associated with individuals with rhinovirus illness 54 and in children with HIV-associated bronchiectasis 55 , suggesting that greater NP diversity is a sign of a healthy microbiome. This mirrors previous findings in the gut microbiome of healthy individuals 56 .

Study limitations

Based on our results, we see some nuance, but must acknowledge that our study has several limitations. First, and most importantly, given this is both a pilot and a longitudinal study, our sample size is small. Second, given that all samples were collected from participants born in Zambia, these results may not be generalizable to HIV-exposed infants in other countries. Third, as the SAMIPS study was not focused on studying HIV transmission as a main aspect of its structure, several key variables were not measured that would have been informative as potential confounders in our study results. For example, we do not know the viral loads and CD4 counts of the mothers at any point. We also lack information on whether mothers continued to take ART post-pregnancy or were prescribed Bactrim Pneumocystis jirovecii Pneumonia (PJP) prophylaxis, which could affect the microbiome. We are also unable to determine whether infants were infected with HIV as proper testing to determine transmission was not conducted as part of the study. Fourth, one infant was noted as having birth asphyxia as a complication experienced during the labor and delivery process ( Table 1). Follow-up questions about severity or treatment were not asked. Given that the study eligibility criteria limited enrollment to healthy infants with no acute or chronic medical conditions, it is unlikely that the birth asphyxia was severe. However, we cannot rule out the use of respiratory instrumentation that could impact the infant’s respiratory microbiome.

These limitations, and especially the small sample sizes, also negatively affect the power of our testing and modeling efforts. This is intended as a pilot study for learning about what trends may be present in this cohort that merit further research with more samples for comparison. It is also of interest as few microbiome studies are longitudinal, providing us with a new perspective on how the NP microbiome may be characterized across time in the different groups.

Conclusions

Acknowledging these issues, our findings suggest that there are subtle nuances between HIV-exposed and HUU infant populations. The effects we have found here warrant further research and discussion regarding the role HIV exposure plays in infant health before readily affirming or denying that HIV exposure affects infants’ NP microbiomes. The potential effect of the HIV(+) mother’s microbiome on an infant may present further changes in pathogen carriage, or community diversity. If associations hold, identifying HIV exposure as a predisposing factor to illness and poor health in infants would present opportunities for further research and development to support infants’ living situations, especially in areas of higher HIV transmission.

Consent

Written informed consent for publication of the infants’ patient details was obtained from the parents of the infants. Consent for inclusion of mothers’ data was also obtained from the participating mothers themselves.

Funding Statement

This work was supported by the Bill and Melinda Gates Foundation [OPP1105094] who funded the Southern Africa Mother Infant Pertussis Study: Phase I (SAMIPS-1) and also by the NIAID of the NIH under grant R01AI133080 to C.G. W.E.J and A.R.O.M. were supported in part by the NIH under grant R21AI154387. A.R.O.M. was supported in part by the NIGMS of the NIH under grant T32GM100842. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 2; peer review: 1 approved, 2 approved with reservations]

Data availability

Underlying data

Zenodo: Underlying data for ‘Characterization of longitudinal nasopharyngeal microbiome patterns in maternally HIV-exposed Zambian infants.’ https://doi.org/10.5281/zenodo.7255313 25

This project contains the following underlying data:

  • Data file 1: FinalDatOther.rds

  • Data file 2: FinalDatPICRUSt2.RDS

  • Data file 3: animalculesFinalHIV.rds

  • Data file 4: animalcules_data_2021.rds

  • Data file 5: mappingFaits.csv

  • Data file 6: mappingFinalHIV.csv

  • Data file 7: mappingFinalHIV.tsv

  • Data file 8: preclean_MetaData.txt

  • Data file 9: samips_immunization.csv

  • Data files 10–176: *-sam-report.tsv

Data are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

Accession numbers

NCBI BioProject: Characterization of longitudinal nasopharyngeal microbiome patterns in maternally HIV-exposed Zambian infants. Accession number PRJNA874826. https://identifiers.org/NCBI/BioProject:PRJNA874826

Software availability

Source code available from: https://github.com/aubreyodom/HIV_Exposed_Infants

Archived source code at time of publication: https://doi.org/10.5281/zenodo.7255313 25

License: CC0 1.0 Public domain dedication

References

  • 1. Sugandhi N, Rodrigues J, Kim M, et al. : HIV-exposed infants: rethinking care for a lifelong condition. AIDS. 2013;27 Suppl 2(0 2):S187–95. 10.1097/QAD.0000000000000090 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Nielsen-Saines K, Watts DH, Veloso VG, et al. : Three postpartum antiretroviral regimens to prevent intrapartum HIV infection. N Engl J Med. 2012;366(25):2368–79. 10.1056/NEJMoa1108275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Brennan AT, Bonawitz R, Gill CJ, et al. : A meta-analysis assessing all-cause mortality in HIV-exposed uninfected compared with HIV-unexposed uninfected infants and children. AIDS. 2016;30(15):2351–60. 10.1097/QAD.0000000000001211 [DOI] [PubMed] [Google Scholar]
  • 4. Brennan AT, Bonawitz R, Gill CJ, et al. : A meta-analysis assessing diarrhea and pneumonia in HIV-Exposed Uninfected Compared with HIV-Unexposed Uninfected infants and children. J Acquir Immune Defic Syndr. 2019;82(1):1–8. 10.1097/QAI.0000000000002097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Thea DM, St Louis ME, Atido U, et al. : A prospective study of diarrhea and HIV-1 infection among 429 zairian infants. N Engl J Med. 1993;329(23):1696–702. 10.1056/NEJM199312023292304 [DOI] [PubMed] [Google Scholar]
  • 6. Shapiro RL, Lockman S, Kim S, et al. : Infant morbidity, mortality, and breast milk immunologic profiles among breast-feeding HIV-infected and HIV-uninfected women in Botswana. J Infect Dis. 2007;196(4):562–9. 10.1086/519847 [DOI] [PubMed] [Google Scholar]
  • 7. von Mollendorf C, von Gottberg A, Tempia S, et al. : Increased risk for and mortality from invasive pneumococcal disease in HIV-exposed but uninfected infants aged <1 year in South Africa, 2009-2013. Clin Infect Dis. 2015;60(9):1346–56. 10.1093/cid/civ059 [DOI] [PubMed] [Google Scholar]
  • 8. Locks LM, Manji KP, Kupka R, et al. : High burden of morbidity and mortality but not growth failure in infants exposed to but uninfected with human immunodeficiency virus in Tanzania. J Pediatr. 2017;180:191–199. e2. 10.1016/j.jpeds.2016.09.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Pierre RB, Fulford TA, Lewis K, et al. : Infectious disease morbidity and growth among young HIV-exposed uninfected children in Jamaica. Rev Panam Salud Publica. 2016;40(6):401–409. [PubMed] [Google Scholar]
  • 10. Lapidot R, Faits T, Ismail A, et al. : Nasopharyngeal dysbiosis precedes the development of lower respiratory tract infections in young infants: a longitudinal infant cohort study. medRxiv. 2021. 10.1101/2021.10.13.21264939 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Biesbroek G, Bosch AA, Wang X, et al. : The impact of breastfeeding on nasopharyngeal microbial communities in infants. Am J Respir Crit Care Med. 2014;190(3):298–308. 10.1164/rccm.201401-0073OC [DOI] [PubMed] [Google Scholar]
  • 12. Garcia-Carbonell R, Yao SJ, Das S, et al. : Dysregulation of intestinal epithelial cell RIPK pathways promotes chronic inflammation in the IBD gut. Front Immunol. 2019;10:1094. 10.3389/fimmu.2019.01094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Farley JJ, King JC, Nair P, et al. : Invasive pneumococcal disease among infected and uninfected children of mothers with human immunodeficiency virus infection. J Pediatr. 1994;124(6):853–8. 10.1016/s0022-3476(05)83170-1 [DOI] [PubMed] [Google Scholar]
  • 14. Kinabo GD, van der Ven A, Msuya LJ, et al. : Dynamics of nasopharyngeal bacterial colonisation in HIV-exposed young infants in Tanzania. Trop Med Int Health. 2013;18(3):286–295. 10.1111/tmi.12057 [DOI] [PubMed] [Google Scholar]
  • 15. Rusen ID, Fraser-Roberts L, Slaney L, et al. : Nasopharyngeal pneumococcal colonization among Kenyan children: antibiotic resistance, strain types and associations with human immunodeficiency virus type 1 infection. Pediatr Infect Dis J. 1997;16(7):656–662. 10.1097/00006454-199707000-00007 [DOI] [PubMed] [Google Scholar]
  • 16. Leibovitz E, Dragomir C, Sfartz S, et al. : Nasopharyngeal carriage of multidrug-resistant Streptococcus pneumoniae in institutionalized HIV-infected and HIV-negative children in northeastern Romania. Int J Infect Dis. 1999;3(4):211–215. 10.1016/s1201-9712(99)90027-9 [DOI] [PubMed] [Google Scholar]
  • 17. Polack FP, Flayhart DC, Zahurak ML, et al. : Colonization by Streptococcus penumoniae in human immunodeficiency virus-infected children. Pediatr Infect Dis J. 2000;19(7):608–612. 10.1097/00006454-200007000-00005 [DOI] [PubMed] [Google Scholar]
  • 18. Gill CJ, Mwananyanda L, MacLeod W, et al. : Incidence of severe and nonsevere pertussis among HIV-exposed and -unexposed Zambian infants through 14 weeks of age: results from the Southern Africa Mother Infant Pertussis Study (SAMIPS), a longitudinal birth cohort study. Clin Infect Dis. 2016;63(suppl 4):S154–S164. 10.1093/cid/ciw526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Katepa-Bwalya M, Mukonka V, Kankasa C, et al. : Infants and young children feeding practices and nutritional status in two districts of Zambia. Int Breastfeed J. 2015;10: 5. 10.1186/s13006-015-0033-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Bispo S, Chikhungu L, Rollins N, et al. : Postnatal HIV transmission in breastfed infants of HIV-infected women on ART: a systematic review and meta-analysis. J Int AIDS Soc. 2017;20(1): 21251. 10.7448/IAS.20.1.21251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Klindworth A, Pruesse E, Schweer T, et al. : Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 2013;41(1): e1. 10.1093/nar/gks808 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Andrews S: FastQC: a quality control tool for high throughput sequence data.Babraham Bioinformatics, Babraham Institute, Cambridge, United Kingdom; 2010. [Google Scholar]
  • 23. Bolger AM, Lohse M, Usadel B: Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Hong C, Manimaran S, Shen Y, et al. : PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples. Microbiome. 2014;2: 33. 10.1186/2049-2618-2-33 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Aubreyodom/HIV_exposed_infants: initial release. Zenodo. 2022. [Google Scholar]
  • 26. Liang KY, Zeger SL: Longitudinal data analysis using generalized linear models. Biometrika. 1986;73(1):13–22. 10.1093/biomet/73.1.13 [DOI] [Google Scholar]
  • 27. Agresti A: Categorical data analysis.2nd Edition. Edited by: Wiley JSI Hoboken, NJ, 2002. 10.1002/0471249688 [DOI] [Google Scholar]
  • 28. Gibbons RD, Hedeker D, DuToit S: Advances in analysis of longitudinal data. Annu Rev Clin Psychol. 2010;6:79–107. 10.1146/annurev.clinpsy.032408.153550 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Luo D, Liu W, Chen T, et al. : A distribution-free model for longitudinal metagenomic count data. Genes (Basel). 2022;13(7):1183. 10.3390/genes13071183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Sun H, Huang X, Huo B, et al. : Detecting sparse microbial association signals adaptively from longitudinal microbiome data based on generalized estimating equations. Brief Bioinform. 2022;23(5): bbac149. 10.1093/bib/bbac149 [DOI] [PubMed] [Google Scholar]
  • 31. Højsgaard S, Halekoh U, Yan J: The R package geepack for generalized estimating equations. Journal of statistical software. 2006;15(2):1–11. 10.18637/jss.v015.i02 [DOI] [Google Scholar]
  • 32. Rencher A, Christensen W: Methods of multivariate analysis. Third edition ed. Wiley series in probability and statistics. John Wiley & Sons, Inc.;2012. 10.1002/9781118391686 [DOI] [Google Scholar]
  • 33. Bender JM, Li F, Martelly S, et al. : Maternal HIV infection influences the microbiome of HIV-uninfected infants. Sci Transl Med. 2016;8(349): 349ra100. 10.1126/scitranslmed.aaf5103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. González R, Maldonado A, Martín V, et al. : Breast milk and gut microbiota in African mothers and infants from an area of high HIV prevalence. PLoS One. 2013;8(11): e80299. 10.1371/journal.pone.0080299 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Claassen-Weitz S, Gardner-Lubbe S, Nicol P, et al. : HIV-exposure, early life feeding practices and delivery mode impacts on faecal bacterial profiles in a South African birth cohort. Sci Rep. 2018;8(1): 5078. 10.1038/s41598-018-22244-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Kelly MS, Plunkett C, Yu Y, et al. : Non-diphtheriae Corynebacterium species are associated with decreased risk of pneumococcal colonization during infancy. ISME J. 2022;16(3):655–665. 10.1038/s41396-021-01108-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Salter SJ, Turner C, Watthanaworawit W, et al. : A longitudinal study of the infant nasopharyngeal microbiota: the effects of age, illness and antibiotic use in a cohort of South East Asian children. PLoS Negl Trop Dis. 2017;11(10): e0005975. 10.1371/journal.pntd.0005975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Teo SM, Mok D, Pham K, et al. : The infant nasopharyngeal microbiome impacts severity of lower respiratory infection and risk of asthma development. Cell Host Microbe. 2015;17(5):704–715. 10.1016/j.chom.2015.03.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Faits T, Odom-Mabey AR, Castro-Nallar E, et al. : Metagenomic profiling pipelines improve taxonomic classification for 16S amplicon sequencing data. bioRxiv. 2022; 2022.07.27.501757. 10.1101/2022.07.27.501757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Takeuchi F, Watanabe S, Baba T, et al. : Whole-genome sequencing of Staphylococcus haemolyticus uncovers the extreme plasticity of its genome and the evolution of human-colonizing staphylococcal species. J Bacteriol. 2005;187(21):7292–7308. 10.1128/JB.187.21.7292-7308.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Barros EM, Ceotto H, Bastos MCF, et al. : Staphylococcus haemolyticus as an important hospital pathogen and carrier of methicillin resistance genes. J Clin Microbiol. 2012;50(1):166–168. 10.1128/JCM.05563-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Rossi CC, Ahmad F, Giambiagi-deMarval M: Staphylococcus haemolyticus: an updated review on nosocomial infections, antimicrobial resistance, virulence, genetic traits, and strategies for combating this emerging opportunistic pathogen. Microbiol Res. 2024;282: 127652. 10.1016/j.micres.2024.127652 [DOI] [PubMed] [Google Scholar]
  • 43. Eltwisy HO, Twisy HO, Hafez MH, et al. : Clinical infections, antibiotic resistance, and pathogenesis of Staphylococcus haemolyticus. Microorganisms. 2022;10(6): 1130. 10.3390/microorganisms10061130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Ternes YM, Lamaro-Cardoso J, André MCP, et al. : Molecular epidemiology of coagulase-negative Staphylococcus carriage in neonates admitted to an intensive care unit in Brazil. BMC Infect Dis. 2013;13: 572. 10.1186/1471-2334-13-572 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Suárez-Arrabal MC, Mella C, Lopez SM, et al. : Nasopharyngeal bacterial burden and antibiotics: influence on inflammatory markers and disease severity in infants with respiratory syncytial virus bronchiolitis. J Infect. 2015;71(4):458–469. 10.1016/j.jinf.2015.06.010 [DOI] [PubMed] [Google Scholar]
  • 46. Thorburn K, Harigopal S, Reddy V, et al. : High incidence of pulmonary bacterial co-infection in children with severe Respiratory Syncytial Virus (RSV) bronchiolitis. Thorax. 2006;61(7):611–615. 10.1136/thx.2005.048397 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Rosas-Salazar C, Shilts MH, Tovchigrechko A, et al. : Differences in the nasopharyngeal microbiome during acute respiratory tract infection with human rhinovirus and respiratory syncytial virus in infancy. J Infect Dis. 2016;214(12):1924–1928. 10.1093/infdis/jiw456 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Khan T, Das RS, Bhattacharya SD: Increased household exposure to respiratory pathogens in HIV exposed uninfected children due to maternal HIV. J Family Med Prim Care. 2019;8(3):1232–1234. 10.4103/jfmpc.jfmpc_111_19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Brugger SD, Eslami SM, Pettigrew MM, et al. : Dolosigranulum pigrum cooperation and competition in human nasal microbiota. mSphere. 2020;5(5):e00852–20. 10.1128/mSphere.00852-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. de Steenhuijsen Piters WA, Heinonen S, Hasrat R, et al. : Nasopharyngeal microbiota, host transcriptome, and disease severity in children with respiratory syncytial virus infection. Am J Respir Crit Care Med. 2016;194(9):1104–1115. 10.1164/rccm.201602-0220OC [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Islam M, Albarracin L, Melnikov V, et al. : Dolosigranulum pigrum modulates immunity against SARS-CoV-2 in respiratory epithelial cells. Pathogens. 2021;10(6):634. 10.3390/pathogens10060634 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Kelly MS, Surette MG, Smieja M, et al. : The nasopharyngeal microbiota of children with respiratory infections in Botswana. Pediatr Infect Dis J. 2017;36(9):e211–e218. 10.1097/INF.0000000000001607 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Hasegawa K, Mansbach JM, Ajami NJ, et al. : Association of nasopharyngeal microbiota profiles with bronchiolitis severity in infants hospitalised for bronchiolitis. Eur Respir J. 2016;48(5):1329–1339. 10.1183/13993003.00152-2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Allen EK, Koeppel AF, Hendley JO, et al. : Characterization of the nasopharyngeal microbiota in health and during rhinovirus challenge. Microbiome. 2014;2: 22. 10.1186/2049-2618-2-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Bhattacharya SD, Niyogi SK, Bhattacharyya S, et al. : Associations between potential bacterial pathogens in the nasopharynx of HIV infected children. Indian J Pediatr. 2012;79(11):1447–1453. 10.1007/s12098-012-0762-4 [DOI] [PubMed] [Google Scholar]
  • 56. Lozupone CA, Stombaugh JI, Gordon JI, et al. : Diversity, stability and resilience of the human gut microbiota. Nature. 2012;489(7415):220–230. 10.1038/nature11550 [DOI] [PMC free article] [PubMed] [Google Scholar]
Gates Open Res. 2024 Sep 26. doi: 10.21956/gatesopenres.17682.r37923

Reviewer response for version 2

Rupert Kaul 1

Thank you for the opportunity to review the manuscript from Odom and colleagues entitled " Characterization of longitudinal nasopharyngeal microbiome patterns in maternally HIV-exposed Zambian infants". This is a pilot longitudinal study of the nasal microbiota in HIV uninfected and exposed, uninfected Zambian infants. It is an interesting concept, although the small sample size is a major limitation. They find subtle differences and conclude that “further research must be conducted in order to confidently understand the relationship between HIV exposure and infants’ NP microbiomes”.

1) The rationale is that mortality is increased in HEU infants, with increased rates of pneumonia and diarrhea. There are already several well-defined reasons for this, both based on maternal antibody transfer and socioeconomics. The lack of substantial differences in this pilot study could alternatively have led the authors to conclude that larger nasal microbiome comparisons may not be a fruitful area for large future studies.

2) The authors state that “Previous studies on the impact of HIV exposure have highlighted strong correlations between HIV exposure and increased risk of pneumococcal colonization and disease7,13,14, whereas others have demonstrated no differences in pathogen carriage between children with HIV infection and control groups15–17. The current analysis seeks to address this important knowledge gap.” However, it does not seem that this study would actually be powered to address this gap.

3) They also state that “this pilot study could indicate differences encouraging future immunology-focused studies”. Was any thought given to performing some pilot immune studies within the sample set?

4) It would be useful to provide socioeconomic details of cases and controls, since housing conditions and environment may influence the nasal microbiome and might be expected to vary based on maternal HIV status.

5) It is unclear why infant HIV testing could not be done to confirm HUU status. Even if this is not part of routine clinical care, PCR testing might be performed if stored blood is available.

6) “We identified strong dissimilarity between HIV(+) and HIV(-) mothers at t=6 (p <0.01), but higher similarity at t=0 (p = 0.70).” What do the authors hypothesize are the causes for these unexpected variations in maternal microbiome comparability over time?

7) “The largest difference seemed to occur at t=5. It would seem reasonable to suggest that at birth, infants’ microbiome profiles are more similar than different, and that the diversification of these profiles occurs over time; however, the lack of any considerable difference at t=6 disputes this argument.” It does indeed seem to disprove this argument, which perhaps should therefore be removed?

8) “…it should be noted there is a possibility that our genomic identification process could be misidentifying what are actually S. aureus reads, which is one of the most common pathogens colonizing the nasopharynx”. Since the differences in S. hemolyticus are the main difference seen, this seems quite concerning.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Partly

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Partly

Are the conclusions drawn adequately supported by the results?

Partly

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Genital microbiome, genital immunology, HIV transmission.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Gates Open Res. 2024 Sep 24. doi: 10.21956/gatesopenres.17682.r37848

Reviewer response for version 2

Bryan Vonasek 1

All of my previous comments have been adequately addressed.

Is the work clearly and accurately presented and does it cite the current literature?

Partly

If applicable, is the statistical analysis and its interpretation appropriate?

Partly

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Partly

Are the conclusions drawn adequately supported by the results?

No

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Pediatric infectious diseases

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Gates Open Res. 2024 Jul 12. doi: 10.21956/gatesopenres.15340.r36374

Reviewer response for version 1

Bryan Vonasek 1

This is an interesting study that investigates an important question: why do HIV exposed infants have poor health outcomes compared to unexposed infants? The key aspects of this study that are more novel are 1) longitudinal analysis of the microbiome and 2) focus on the NP microbiome. Especially when combined with the paired microbiome data from the mothers and the nice use of controls (HUU infants), this is a good study design that can be scaled up from this exploratory/pilot study. 

As the authors do a good job of describing and acknowledging, it's hard to glean much from these data given the small sample sizes. Especially considering the wide variation within groups that is described, making comparisons between groups then becomes challenging. That said, I think it is important to  Index these findings, but I would first like to see several improvements to allow readers to better understand: 1) the results, 2) some additional key limitations, and 3) coherent interpretations of the results.

General considerations

1. The authors acknowledge the limitation of not actually knowing whether the exposed infants are uninfected. I would be extra-careful with this consideration. Don't define them as HEU, maybe instead just "HIV exposed."

2. From what I can tell, a P-value cutoff to define statistically significant differences was not defined in the Methods. Of course, <0.05 is common and seems to be used in the bolding of some of the P-values in the Results. Especially for a study like this in which there are so many potential confounders that aren't really being addressed and in which the sample size is limited, I would be really careful going too much into differences that aren't statistically significant. Phrases throughout the Results and Discussion like "marginally significant" and "nominally significant" should be de-emphasized or removed. Especially in the Discussion, focus on the significant differences that you did find (including a coherent discussion about these differences), rather than emphasizing too much about differences that weren't significant.

3. The finding of more Staph haemolyticus in the exposed babies at birth is certainly worth highlighting (even more than you already do). But it's confusing as written in the Abstract and in the text: "...even when accounting for the interaction between HIV exposure status and time of sampling." Why "account for" exposure when that is the input variable you're looking at? Why "account for" time of sampling when they are all "at birth." This can be worded more clearly. Also, know that Staph haemolyticus can certainly be pathogenic in humans, especially in immunocompromised individuals (i.e. newborns, HIV exposed or infected)--you can revamp the Discussion with this in mind. In relation to this, also worth commenting on whether the HIV(+) mothers were more likely or not to have S. haemolyticus. 

4. A big factor not addressed is whether the exposed infants and their mothers living with HIV were on Bactrim prophylaxis. If you don't know, this needs to be clearly stated as a limitation. If you do know, please incorporate as a covariate in your analyses. Bactrim would likely have a major impact on the NP microbiome.

5. The manuscript will be much more digestible for the reader describe the various analyses you are doing here in a parallel order in the Methods, Results, and Discussion.

Introduction

1. Overall, this is well written. In fact, the quality of the Introduction compared to the Discussion is so striking to me that I suspect different authors being mainly responsible for these two sections? If I'm correct, then the author that led writing of the Intro should please help polish up the Discussion!

2. "We collected 167 samples...through 14 weeks of life." >> this sentence doesn't belong in the Introduction.

Methods

1. "Characteristics of the study cohort are in Table 1." >> this goes in the Results.

2. Important to give some sense of how the 10 exposed infants were sampled for inclusion in this study from the larger study. Its says it was those entering the study from April to June--all of them from the parent study? Or some sub-set? Random selection?

3. I also think the text related to Table 2 would be more appropriate in the Results.

Results

1. Table 1 -- Remove the rows without any data (Chilenje clinic, father lives with child, all rows are trimester starting HIV)

2. First paragraph: Ages of infants at time of swabbing doesn't match what I see in Table 2. Along these lines, be consistent using either days or weeks.

3. Second paragraph: "...which may in part be the result of..." >> interpretation of data belongs in the Discussion.

4. Fig 1. "120 days" -- This is different than 14 weeks?

5. Third paragraph: Rephrase the first sentence because "heavy presence" and "relatively low" are contradictory. Maybe "diversity" instead of "presence"?

6. "These transitional patterns align with..." >> this belongs in the Discussion.

7. Paragraph starting with "While longitudinal trends..." To me, this should be the first paragraph under the existing heading "Relationship between NP microbiome comp...".

8. The paragraph and data in the table about the Teo study are rather distracting. At the very least, all this needs to be moved to the Discussion. That cohort is very different from the cohort in this study, so I suggest not allocating so much text to that--maybe just a brief comparison as you do with ref 32 (which, again, belongs in the Discussion).

9. Fig 2 -- Define the widths of the lines plotted in the figure description. Double check, and consider rephrasing, your description of the slopes: I'm not sure what "accounted for" means.

10. "As previously noted, we chose p=6..." >> No need to reiterate.

11. "...difficulty in identifying clear differences between populations." >> rephrase (even remove!) as this reads as if you're assuming that there are differences.

12. "Overall microbiome composition varied significantly..." >> Isn't this redundant with what is written at the beginning of the Results? 

13. The paragraph describing beta-diversity needs to be revamped and written more clearly.

14. In the tables and text, just use one of these terms and be consistent throughout: "HEU/HIV(+), etc." vs "HIV(+)/HEU" -- flipping the order around confuses the reader.

15. The sentences about Strep pneumo differences are confusing. Reword to mirror how H. influenzae is described--though in that sentence, write "more abundant" not "more highly abundant." Write your sentence re S. haemolyticus in the same way.

16. Also be consistent with how you write about the time points: "t=6" vs "time point six" vs "last time point" >> just pick one way and stick with it.

17. Table 7 -- Reorder the columns to a more conventional order: Mean, 95% CIs, P-value (left to right). Consider removing 't' and 'df' columns. In the table heading, mention what variables you are adjusting for to get the "adjusted p-value." Why are you now using "HIV" and "Control" to define the groups? I suggest naming the groups the same as for Table 6, unless you have a good reason.

Discussion

Overall, this needs a lot of improvement. 

1. The first paragraph should summarize the keys findings from YOUR data.

2. In the second paragraph, maybe just clearly state microbiome as a third possible hypotheses and acknowledge that all three drivers are likely inter-related and hard to differentiate.

3. Be clear when describing differences in the NP microbiome in the exposed infants 1) longitudinally vs 2) compared to un-exposed infants vs 3) c/t mothers. This is an multiple areas. I can certainly infer your meaning, but it would be easier to read if the comparison is clearly reiterated.

4. The connection between description of Dolosigranulum and the cited studies is not clear. 

5. Rather than describing the diversity of the microbiome for the mothers as having lots of "other" organisms, wouldn't it be more valid to look formally at beta and alpha diversity?

6. You argue pretty clearly that you have good samples. I don't think you need to state "retrospective analysis of samples" as a limitation. 

7. I don't agree with the statements about minimizing the limitation of lack of HIV testing. In this age group, HIV infections can certainly be rapidly progressive. Also, because the underlying causes of HEU having poor health c/t HUU aren't well understood in the first place, one can't assume the effects ("direction of bias") of contaminating the HEU cohort with infected infants.   

8. "This is intended as a case study..." >> Use the term "exploratory" or "pilot" study instead because "case study" has a different, specific meaning.

Conclusions

Focus here on what can be gleaned from your study results. You did not evaluate "sociological differences."

Consent

Because you also report data from the mothers, I hope you also consented them for that (not just the infant's data)?

Is the work clearly and accurately presented and does it cite the current literature?

Partly

If applicable, is the statistical analysis and its interpretation appropriate?

Partly

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Partly

Are the conclusions drawn adequately supported by the results?

No

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Pediatric infectious diseases

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Gates Open Res. 2024 Aug 13.
Aubrey Odom-Mabey 1

Reviewer 2

Comment 2-1: This is an interesting study that investigates an important question: why do HIV exposed infants have poor health outcomes compared to unexposed infants? The key aspects of this study that are more novel are 1) longitudinal analysis of the microbiome and 2) focus on the NP microbiome. Especially when combined with the paired microbiome data from the mothers and the nice use of controls (HUU infants), this is a good study design that can be scaled up from this exploratory/pilot study. 

As the authors do a good job of describing and acknowledging, it's hard to glean much from these data given the small sample sizes. Especially considering the wide variation within groups that is described, making comparisons between groups then becomes challenging. That said, I think it is important to  Index these findings, but I would first like to see several improvements to allow readers to better understand: 1) the results, 2) some additional key limitations, and 3) coherent interpretations of the results.

Response 2-1: We thank the reviewer for providing thorough, thoughtful feedback for this paper.

Comment 2-2: The authors acknowledge the limitation of not actually knowing whether the exposed infants are uninfected. I would be extra-careful with this consideration. Don't define them as HEU, maybe instead just "HIV exposed."

Response 2-2: We thank the reviewer for this feedback. We have changed all instances of “HEU” throughout the manuscript to “HIV-exposed”.

Comment 2-3: From what I can tell, a P-value cutoff to define statistically significant differences was not defined in the Methods. Of course, <0.05 is common and seems to be used in the bolding of some of the P-values in the Results. Especially for a study like this in which there are so many potential confounders that aren't really being addressed and in which the sample size is limited, I would be really careful going too much into differences that aren't statistically significant. Phrases throughout the Results and Discussion like "marginally significant" and "nominally significant" should be de-emphasized or removed. Especially in the Discussion, focus on the significant differences that you did find (including a coherent discussion about these differences), rather than emphasizing too much about differences that weren't significant.

Response 2-3: We have clarified in the methods that α=0.05 (see line 216). Where relevant, a Bonferroni correction is applied to adjust the alpha value (type I error rate) to account for Type I error rate inflation as noted for the GEE and the Hotelling’s T-squared test methods descriptions, and elsewhere in the manuscript (e.g. the table descriptions).

Our rationale for including “marginally significant” results is that Bonferroni corrections are known to be at times overly conservative in deflating Type I error, and thus adjusted p-values that are around the newly calculated alpha value can still be practically interesting. To make this more clear for the reader, we added the following sentence to the methods section:

“Due to the conservative nature of the Bonferroni correction, adjusted p-values between 0.05 and 0.10 were noted as marginally significant as effects that could retain some level of practical significance.”

We removed a few results that were greater than the cutoff of 0.10 and altered the discussion accordingly.

Comment 2-4: The finding of more Staph haemolyticus in the exposed babies at birth is certainly worth highlighting (even more than you already do). But it's confusing as written in the Abstract and in the text: "...even when accounting for the interaction between HIV exposure status and time of sampling." Why "account for" exposure when that is the input variable you're looking at? Why "account for" time of sampling when they are all "at birth." This can be worded more clearly.

Also, know that Staph haemolyticus can certainly be pathogenic in humans, especially in immunocompromised individuals (i.e. newborns, HIV exposed or infected)--you can revamp the Discussion with this in mind. In relation to this, also worth commenting on whether the HIV(+) mothers were more likely or not to have S. haemolyticus.

Response 2-4: We can see how this wording can be confusing to the reader. The inclusion of a variable in a statistical model is by definition to acknowledge or account for the quantifiable effect of that variable on the response. The wording in the abstract is meant to convey that concept. We can understand how this might not be clear to readers lacking a statistical background, and so we rewrote these points:

[Discussion] The microbe indicated a distinctive high association with HIV exposure at birth and across time points, a finding confirmed by separate statistical tests. (lines 452-453)

[Abstract] When evaluating the interaction between HIV exposure status and time of sampling among infants, the microbe Staphylococcus haemolyticus showed a distinctive high association with HIV exposure at birth. (lines 29-31)

Comment 2-5: A big factor not addressed is whether the exposed infants and their mothers living with HIV were on Bactrim prophylaxis. If you don't know, this needs to be clearly stated as a limitation. If you do know, please incorporate as a covariate in your analyses. Bactrim would likely have a major impact on the NP microbiome.

Response 2-5: We thank the reviewer for this observation. Unfortunately, the SAMIPS study did not track Bactrim Pneumocystis jirovecii Pneumonia (PJP) prophylaxis, only acute use of antibiotics in response to infection. This cohort required that infants were healthy as a criterion of limitation, and so antibiotic use in response to infection was not included as a covariate.

We have acknowledged this limitation in the discussion section (lines 494-496).

Comment 2-6: The manuscript will be much more digestible for the reader describe the various analyses you are doing here in a parallel order in the Methods, Results, and Discussion.

Response 2-6: We agree that the current delineation of analyses is difficult to follow, and have implemented the following changes throughout the Methods, Results, and Discussion, with an emphasis on reorganizing text under appropriately titled, standardized subheadings.

Changes to subheadings themselves were made as follows.

  • In the methods section, we replaced the “Statistical analysis” subheading with ones that referenced the type of analysis conducted. The wording was reflected in the Results and Discussion subsections. These are: “Visualizing longitudinal trends in microbiome composition”, “Modeling effects of HIV status on infant microbiome composition across time”, and “Testing differences in microbiome composition across time and cohort groups”. These subheadings are broad enough to allow all analyses to be grouped together more concisely by method.

  • In the Results section, we made the following changes to the previous subheadings:
    • “Longitudinal differences in the nasopharyngeal microbiome composition” was rewritten as “Visualizing longitudinal trends in NP microbiome composition”
    • “Relationship between nasopharyngeal microbiome composition and HIV exposure” was rewritten as “Effects of HIV status on infant NP microbiome composition across time”
    • “Relationship between infant and mother nasopharyngeal microbiome composition across time” was rewritten as “Associations within mother and infant duad NP microbiome composition over time”
    • “Trends in NP microbial community composition between HIV(+) and HIV(-) mothers” was rewritten as “Effect of HIV status on mother NP microbial community composition”
  • In the Discussion, we added subheaders that were nearly identical to the Results subheaders.

Introduction

Comment 2-7: Overall, this is well written. In fact, the quality of the Introduction compared to the Discussion is so striking to me that I suspect different authors being mainly responsible for these two sections? If I'm correct, then the author that led writing of the Intro should please help polish up the Discussion!

Response 2-7: We have updated the wording in the Discussion based on suggestions provided by each reviewer, including the order of results discussed, the relocation of two paragraphs discussing previous findings in the literature, and the addition of literature examining S. haemolyticus.

Comment 2-8: "We collected 167 samples...through 14 weeks of life." >> this sentence doesn't belong in the Introduction.

Response 2-8: We have rewritten the relevant sentences of the introduction to remove specific mentions of sample numbers or exact time of analyses to focus on the context of the study (i.e., the locale, the infant groups, and the NP microbiome sampling). This now reads as follows:

“Given the complex dynamics of interactions between the host, microbes, and environment beginning at birth, we conducted an exploratory longitudinal comparison of the nasopharyngeal (NP) microbiomes of HEU and HUU infants and their mothers during the first 14 weeks of life. We reasoned that by characterizing quantifiable differences in NP microbiota distinguishing these two groups, this pilot study could indicate differences encouraging future immunology-focused studies” (lines 81-83).

Methods

Comment 2-9: "Characteristics of the study cohort are in Table 1." >> this goes in the Results.

Response 2-9: We have moved Table 1 and some accompanying text to the Results section (line 267).

Comment 2-10: Important to give some sense of how the 10 exposed infants were sampled for inclusion in this study from the larger study. Its says it was those entering the study from April to June--all of them from the parent study? Or some sub-set? Random selection?

Response 2-10: We added the following clarification to the first paragraph of the Methods section (see lines 89-94):

From 1,981 total mother infant pairs, we selected a subset that had 3 or more study visits, had no siblings under the age of five years, and who enrolled in the study from April to July 2015.  From this subset, we randomly selected 10 mother infant pairs with an HIV positive mother who started ARV treatment before pregnancy.  These 10 mother infant pairs were randomly matched to 10 mother infant pairs with an HIV negative mother by education, month of entry into the study, and maternal age.

Comment 2-11: I also think the text related to Table 2 would be more appropriate in the Results.

Response 2-11: We have moved Table 2 and some accompanying text to the Results section (line 268-273).

Results

Comment 2-12: First paragraph: Ages of infants at time of swabbing doesn't match what I see in Table 2. Along these lines, be consistent using either days or weeks.

Response 2-12: We note the discrepancy and have removed the delineation of days in the first column of Table 2. We updated Table 1 to reflect weeks at enrollment rather than days. We changed the delineation of days at swabbing time points to weeks.

Comment 2-13: Table 1 -- Remove the rows without any data (Chilenje clinic, father lives with child, all rows are trimester starting HIV)

Response 2-13: These rows have been removed and Table 1 has been restructured to better delineate parameter sections/groups.

Comment 2-14: Second paragraph: "...which may in part be the result of..." >> interpretation of data belongs in the Discussion.

Response 2-14: The paragraph on the Teo et al. study has been moved to the discussion section.

Comment 2-15: Fig 1. "120 days" -- This is different than 14 weeks?

Response 2-15: We believe this was written in error. We have changed this caption to read “14 weeks”.

Comment 2-16: Third paragraph: Rephrase the first sentence because "heavy presence" and "relatively low" are contradictory. Maybe "diversity" instead of "presence"?

Response 2-16: We have altered this sentence to read “Due to the high diversity of taxa with relatively low abundance among individual microbes…” (line 288-289).

Comment 2-17: "These transitional patterns align with..." >> this belongs in the Discussion.

Response 2-17: This sentence has been relocated to the discussion section (lines 423-433).

Comment 2-18: Paragraph starting with "While longitudinal trends..." To me, this should be the first paragraph under the existing heading "Relationship between NP microbiome comp...".

Response 2-18: We agree with the reviewer and have moved that paragraph to the heading titled “Relationship between nasopharyngeal microbiome composition and HIV exposure.” (lines 312-316)

Comment 2-19: The paragraph and data in the table about the Teo study are rather distracting. At the very least, all this needs to be moved to the Discussion. That cohort is very different from the cohort in this study, so I suggest not allocating so much text to that--maybe just a brief comparison as you do with ref 32 (which, again, belongs in the Discussion).

Response 2-19: We relocated the Teo et al. study to the discussion and renumbered the tables accordingly (lines 423-433). The text discussing the study has been shortened to remove some excess discussion of differences while still introducing the cohort and thus retaining the table’s relevance.

Comment 2-20: Fig 2 -- Define the widths of the lines plotted in the figure description. Double check, and consider rephrasing, your description of the slopes: I'm not sure what "accounted for" means.

Response 2-20: We agree with the reviewer that the explanation of the figure according to the caption is difficult to parse, and so we have rewritten the caption for clarity in delineating each visual aspect of the plot grid. We removed the “accounted for” phrasing.

“Plots of estimated marginal means of the abundance of a given microbe, given the estimated GEE effects of time point of sampling (x axis) and HIV status (line color). A line illustrates the estimated change in microbe abundance over time, with positive or negative slopes illustrating increased or decreased estimated abundance in log CPM (respectively). HIV-exposed infants (denoted as HEU) are depicted in blue, and HIV-unexposed, uninfected infants (HUU) are depicted in red. Given the separate lines for each status and changing slope across time, these plots depict the interaction effect between time point and HIV status. All microbes had at least marginally significant time effects, but only S. haemolyticus had a very strong HIV exposure status effect.”

Comment 2-21: "As previously noted, we chose p=6..." >> No need to reiterate.

Response 2-21: We have removed this sentence from the Results section.

Comment 2-22: "...difficulty in identifying clear differences between populations." >> rephrase (even remove!) as this reads as if you're assuming that there are differences.

Response 2-22: We agree with the reviewer and acknowledge that our point may have been difficult to grasp especially for audiences without familiarity with the test in question. We have removed the sentence from the paragraph to avoid confusion.

Comment 2-23: "Overall microbiome composition varied significantly..." >> Isn't this redundant with what is written at the beginning of the Results? 

Response 2-23: We have reread the Results and it is unclear what section the reviewer is referring to as redundant. We have retained this section as is.

Comment 2-24: The paragraph describing beta-diversity needs to be revamped and written more clearly.

Response 2-24: We rewrote the beta diversity paragraph describing infant results as follows:

“We examined variations in overall microbiome composition by observing differences in beta diversity between HUU and HIV-exposed infants. We conducted two Wilcoxon rank-sum tests of the Bray-Curtis dissimilarity at t=0 and t=6. We found a non-significant difference in microbiome composition between HUU and HIV-exposed infants at t=0 (p = 0.68), but large differences at t=6 (p <0.01). Performing additional tests within the HIV-exposed and HUU groups themselves, we also identified heavy inter-subject variability at both t=0 and t=6 (all p-values < 0.01). As a result of this inter-subject variability, alpha diversity metrics were not performed for this analysis.” (lines 339-345)

Comment 2-25: In the tables and text, just use one of these terms and be consistent throughout: "HEU/HIV(+), etc." vs "HIV(+)/HEU" -- flipping the order around confuses the reader.

Response 2-25: We have replaced all instances of status groups with HEU/HIV(+) and HUU/HIV(-), and replaced “mother-infant” with “infant-mother” pairs to match the ordering of the descriptors.

Comment 2-26: The sentences about Strep pneumo differences are confusing. Reword to mirror how H. influenzae is described--though in that sentence, write "more abundant" not "more highly abundant." Write your sentence re S. haemolyticus in the same way.

Response 2-26: We rewrote the test results as follows (lines 369-372):

“H. influenzae was more abundant in HUU infants than in their HIV(-) mothers at t = 6 (adj. p = 0.1), and S. haemolyticus was likely to be found in the HUU infants at t = 0 in a higher concentration than in their HIV(-) mothers (adj. p = 0.1). S. pneumoniae was more likely to be found in HUU infants than their HIV(-) mothers at t = 6 (adj. p = 0.1). Each of these findings were marginally significant.”

Comment 2-27: Also be consistent with how you write about the time points: "t=6" vs "time point six" vs "last time point" >> just pick one way and stick with it.

Response 2-27: We thank the reviewer for this feedback. We decided to change all instances of alternate references to time points to variable notation (e.g., t=0 or t=6).

Comment 2-28: Table 7 -- Reorder the columns to a more conventional order: Mean, 95% CIs, P-value (left to right). Consider removing 't' and 'df' columns. In the table heading, mention what variables you are adjusting for to get the "adjusted p-value." Why are you now using "HIV" and "Control" to define the groups? I suggest naming the groups the same as for Table 6, unless you have a good reason.

Response 2-28: We have reordered the columns to reflect the suggested order and removed the t statistic and degrees of freedom columns to reduce table information. We also relabeled “HIV” and “Control” to HEU/HIV(+) and HUU/HIV(-), which reflects the Table 6 nomenclature. We noted in the table caption that a Bonferroni correction was used to obtain the adjusted p-values.

Table 7 was renamed as Table 6.

Discussion

Comment 2-29: The first paragraph should summarize the keys findings from YOUR data.

Response 2-29: We moved the paragraph discussing previous findings in the literature to after the initial paragraph (line 400). We combined the first two paragraphs to now compose the first paragraph of the discussion, which gives an overview of our findings (that there is nuance).

Comment 2-30: In the second paragraph, maybe just clearly state microbiome as a third possible hypotheses and acknowledge that all three drivers are likely inter-related and hard to differentiate.

Response 2-30: We thank the reviewer for this feedback. Given that the hypotheses we proposed were related to microbiota differences as potential upstream or downstream indicators, we have adjusted our wording to read as follows:

“Assuming that such differences are not merely due to sampling biases, the chief hypotheses explaining them are that 1) HEU infants have subtle immunological deficits, or 2) these are the consequence of confounding due to environmental and sociological factors. These are not mutually exclusive, and we acknowledge that all drivers are likely inter-related and difficult to differentiate. We reasoned that they may be in turn associated with differences in the microbiota which could then serve as a quantifiable indicator of differences in respiratory health between HEU and HUU infants.” (lines 392-394)

Comment 2-31: Be clear when describing differences in the NP microbiome in the exposed infants 1) longitudinally vs 2) compared to un-exposed infants vs 3) c/t mothers. This is an multiple areas. I can certainly infer your meaning, but it would be easier to read if the comparison is clearly reiterated.

Response 2-31: We adjusted unclear wording and added headers throughout the discussion to properly delineate the various analysis types (see Response 2-6).

Comment 2-32: The connection between description of Dolosigranulum and the cited studies is not clear. 

Response 2-32: We have rewritten the paragraph discussing Dolosigranulum in the results section (lines 463-469) to better describe Dolosigranulum pigrum in the literature but refrain from making conclusions based on the data given the lack of statistically significant results.

Comment 2-33: Rather than describing the diversity of the microbiome for the mothers as having lots of "other" organisms, wouldn't it be more valid to look formally at beta and alpha diversity?

Response 2-33: It appears that the interpretation of beta diversity results was only implied at the time of writing, and so we have rewritten this paragraph to explain the results more clearly:

“One of our study objectives was to compare the NP microbiomes of the HIV(+) and HIV(-) mothers at t=0 and t=6. From Figure 1B, we observed that HIV(+) mothers may have had higher Streptococcus abundance than HIV(-) mothers overall. A significant inter-group beta diversity test result at t=6 showed strong differences in taxa among HIV statuses, a finding reflected by the stacked bar plots indicating a larger presence of taxa with relatively small abundances in HIV(-) women (denoted as “Other” throughout our analyses). This finding implies more microbial diversity in the NP microbiomes of the HIV(-) women. In the nasopharynx, lower diversity has been associated with individuals with rhinovirus illness50 and in children with HIV-associated bronchiectasis,51 suggesting that greater NP diversity is a sign of a healthy microbiome. This mirrors previous findings in the gut microbiome of healthy individuals.52” (lines 477-486)

Comment 2-34: You argue pretty clearly that you have good samples. I don't think you need to state "retrospective analysis of samples" as a limitation. 

Response 2-34: We agree with the reviewer and have removed this limitation from the Conclusion section.

Comment 2-35: I don't agree with the statements about minimizing the limitation of lack of HIV testing. In this age group, HIV infections can certainly be rapidly progressive. Also, because the underlying causes of HEU having poor health c/t HUU aren't well understood in the first place, one can't assume the effects ("direction of bias") of contaminating the HEU cohort with infected infants.   

Response 2-35: We have removed the two sentences in question that served to minimize the argument against HIV incidence in the cohort.

Comment 2-36: "This is intended as a case study..." >> Use the term "exploratory" or "pilot" study instead because "case study" has a different, specific meaning.

Response 2-36: We have corrected this oversight by using the term “pilot study” (line 504).

Conclusions

Comment 2-37: Focus here on what can be gleaned from your study results. You did not evaluate "sociological differences."

Response 2-37: We appreciate the reviewer’s remarks and have subsequently removed references to sociological differences in the Conclusion. We have altered the first sentence to read as follows:

“Acknowledging these issues, our findings suggest that there are subtle nuances between HIV-exposed and HUU infant populations.” (lines 509-510)

Consent

Comment 2-38: Because you also report data from the mothers, I hope you also consented them for that (not just the infant's data)?

Response 2-38: We thank the reviewer for noticing this oversight. We did obtain consent from the mothers who were also participants in the study. We have added the following to the Consent section of the manuscript (lines 554-556):

“Written informed consent for publication of the infants’ patient details was obtained from the parents of the infants. Consent for inclusion of mothers’ data was also obtained from the participating mothers themselves.”

Gates Open Res. 2023 Jun 16. doi: 10.21956/gatesopenres.15340.r33429

Reviewer response for version 1

Christiana Smith 1

This is a well-written description of a very small study comparing the nasopharyngeal microbiome of Zambian HEU infants (and their mothers) to HUU infants (and their mothers) over the first 14 weeks of life. The hypothesis that the maternal or infant microbiome plays a key role in the increased morbidity/mortality observed in HEU infants has been under-explored in the literature, justifying the indication for the study. However, the differences identified between HEU and HUU infants in this study were rather minimal (the authors use the word “subtle”), which diminishes the overall impact/significance of the publication.

Strengths of the study include that the participants were followed longitudinally over time, and the HEU and HUU cohorts were well-matched and recruited from the same physical location.

The two major limitations of the study, which are described appropriately by the authors, are: 1) the small sample size results in very limited power to identify differences between HEU and HUU infants, leading to “uncertain evidence that there is an HIV exposure effect on the NP microbiome...” and 2) no information is provided on the immune status of the HIV+ mothers (viral load, CD4) or the HIV infection status of the HIV-exposed infants (i.e. HIV NAAT results after birth), which are likely very important contributors to the NP microbiome. For example – a person with poorly controlled HIV and low CD4 count is at risk of opportunistic respiratory infections (TB, PJP, MAC) which would likely influence the composition of their NP microbiome. Was there really no information available to the study authors that could be included to better describe the health of the mothers and infants (any maternal viral loads or CD4 counts, known maternal comorbidities, infant dried blood spot testing at birth, etc.)?

Person-first language should be used throughout the abstract and manuscript. It is probably fine to use the abbreviations as-is (HIV+, HEU, HUU), but phrases such as “HIV-positive mothers” or “HIV-infected women” should be changed to “women/mothers with HIV” or “women/mothers living with HIV.” The term “mother-to-child transmission” is also outdated and should be changed to “vertical transmission” or “perinatal transmission.” For more information on why this matters, please see: Dilmitis et al. (2012 1 ).

Introduction, paragraph 2: the authors reference “the HEU phenomenon” several times without really explaining what “the phenomenon” is. Are they referring to the increased morbidity and mortality observed in this population? Please be more specific.

Introduction, paragraph 4: The authors state that their study might “provide insight into the presence or absence of immunological factors distinguishing [HEU vs. HUU infants].” However, differences in the microbiome do not necessarily indicate differences in “immunological factors.” Recommend revising the text appropriately.

Methods, paragraph 1: it is necessary to state whether all infants breastfed or whether some infants consumed any foods other than breast milk in the first 14 weeks of life, as nutrition sources have a profound ability to impact the microbiome.

There are several examples where the manuscript is organized in a non-standard way:

  • Tables containing data (such as Table 1 and Table 2) belong in the results section, not the methods section. Recommend moving these tables, and the text describing them, to the beginning of Results.

  • The text in the methods section describing the reasons that some samples were excluded (beginning with “Some missingness in the data is present…”) also belongs in results, not methods.

  • There are some “editorial” comments in the results section, which really belong in the discussion instead. For example, page 7, last paragraph: “…which may in part be the result of the mothers’ acquired immunity over time and therefore lower overall NP carriage;” page 8, 3 rd to last paragraph: “These transitional patterns align with those found in a longitudinal East Asian infant cohort.”

Table 1 is very confusing and poorly organized:

  • The headers often do not apply to the rows beneath them (male sex and median age should not be categorized under “place of birth;” median birth weight and twin births should not be categorized under “gestational age;” number of mothers, mothers age, married, father lives with child should not be categorized under “immunizations at birth;” etc.

  • There are many rows for which the result is 0 in every column, or for which there are apparently no data. These rows should be removed from the table completely. For example, why is the Chilenje clinic described in table when only the Chawama clinic is described in the text? Why is “other place of birth” listed in the table if none of the infants were born in an “other place”? Why are “father lives with child” and “trimester initiated ART” included in the table if there are no data for these variables? The L&D complications that did not occur in any infants (obstructed labor, sepsis, hemorrhage, other) can be removed and a sentence could be added to the text that says “none of these complications occurred: X,Y,Z.”

  • The table shows one infant with birth asphyxia. This infant should be further described in the text. How severe was the birth asphyxia? Did the infant require respiratory support, intubation, oxygen, etc. (since respiratory instrumentation could potentially have impacted this infant’s respiratory microbiome)?

Table 2:

  • The text states that two of 20 mothers lacked swabs at both the t=0 and t=6 time points. However, the table shows all 20 mothers had a swab at t=0 and 2 were missing swabs at t=6. Either the text or table should be revised so that they harmonize.

  • It would be preferred if the table could be stratified by HIV status at each time point, rather than consolidating all of the HEU and HUU results at the bottom. This way it would be easy to see whether either group was over-represented at any one given time point.

Table 3: please standardize the number of decimal points across columns (some columns have 2 decimal points, others have none).

Table 4: several columns describe “adjusted p-values” but there is no description of what variables were adjusted for. The reader should not have to refer back to the methods section to understand the statistical analysis. This is also a problem in Table 7. Please add a description of what variables the analyses were adjusted for to the Table heading or to a footnote for each table.

Table 6: I believe there is a typo in the table; the two time points being compared are t=0 and t=6 according to the text; however, the table shows t=0 and t=1.

Table 7: It is very difficult to understand this table without reading the text. Specifically, what direction is indicated by the positive and negative numbers in the column titled “mean of differences” (higher abundance in infants or higher abundance in mothers)? After a long period of scrutiny, I was able to determine that negative numbers mean higher abundance in infants, but it should be immediately apparent to the reader. Please explain this in the Table heading or in a footnote.

Also related to Table 7, in the results section, page 12, paragraph 2: The authors describe that the purpose of the analysis in Table 7 is to compare the abundance of 3 particular bacterial species between the mothers and infants within each cohort (HIV+/HEU and HIV-/HUU), and that this comparison will help explain why HEU infants may be more susceptible to morbidity/mortality. It is not clear how comparing the abundance of these bacteria between mothers vs. infants WITHIN each cohort would help explain the HEU infants’ increased susceptibility to morbidity/mortality. It seems that to answer this question, it would be more valuable to compare the abundance of bacteria between HEU vs. HUU infants (or between HIV+ vs. HIV- mothers). Please clarify how you expect this analysis to answer the question proposed, or modify/remove the language describing the reason for this analysis.

Discussion: The authors should add to the list of limitations that their results may not be generalizable to HEU infants around the globe, given that all samples were collected from participants living in a single African country.

Discussion, page 14: There is some redundancy between the last 2 sentences of the first paragraph, and the 6 th paragraph, both of which describe differences in bacterial diversity between HIV+ and HIV- mothers.

There are several typos in the manuscript. For example, page 11, paragraph 3: “The same test for all infant samples at time point six was contrastingly showed….” Also Conclusions: “our findings may suggest that that sociological….”

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

I cannot comment. A qualified statistician is required.

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Pediatric infectious diseases clinician, research focused on the immunology of HIV-exposed and HIV-infected infants and children.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

References

  • 1. : Language, identity and HIV: why do we keep talking about the responsible and responsive use of language? Language matters. Journal of the International AIDS Society .2012;15(4(Suppl 2)) : 10.7448/IAS.15.4.17990 10.7448/IAS.15.4.17990 22360788 [DOI] [Google Scholar]
Gates Open Res. 2024 Aug 13.
Aubrey Odom-Mabey 1

Reviewer 1

Comment 1-1: This is a well-written description of a very small study comparing the nasopharyngeal microbiome of Zambian HEU infants (and their mothers) to HUU infants (and their mothers) over the first 14 weeks of life. The hypothesis that the maternal or infant microbiome plays a key role in the increased morbidity/mortality observed in HEU infants has been under-explored in the literature, justifying the indication for the study. However, the differences identified between HEU and HUU infants in this study were rather minimal (the authors use the word “subtle”), which diminishes the overall impact/significance of the publication.

Strengths of the study include that the participants were followed longitudinally over time, and the HEU and HUU cohorts were well-matched and recruited from the same physical location.

Response 1-1: We appreciate the reviewer’s summary and thank the reviewer for providing their feedback for this paper.

Comment 1-2: The two major limitations of the study, which are described appropriately by the authors, are: 1) the small sample size results in very limited power to identify differences between HEU and HUU infants, leading to “uncertain evidence that there is an HIV exposure effect on the NP microbiome...” and 2) no information is provided on the immune status of the HIV+ mothers (viral load, CD4) or the HIV infection status of the HIV-exposed infants (i.e. HIV NAAT results after birth), which are likely very important contributors to the NP microbiome. For example – a person with poorly controlled HIV and low CD4 count is at risk of opportunistic respiratory infections (TB, PJP, MAC) which would likely influence the composition of their NP microbiome. Was there really no information available to the study authors that could be included to better describe the health of the mothers and infants (any maternal viral loads or CD4 counts, known maternal comorbidities, infant dried blood spot testing at birth, etc.)?

Response 1-2: We agree that information on the immune status of the HIV+ mothers is indeed integral to identifying additional factors in the composition of their NP microbiota. We regrettably did not have access to this data as detailed HIV status information was not collected as part of the SAMIPS study, which was focused on Pertussis incidence. It was not optimally designed to look at the impact of HIV exposure as the original study purpose and did not seek to characterize the mothers HIV health status nor collect breastfeeding information. Neither viral load nor CD4 testing were routinely performed in Lusaka at this time and neither of these were deemed essential for the purposes of the original study beyond noting the mothers’ HIV status.

We have clarified this point further in the Methods section on line 118, and in the discussion on lines 492-494 (“several key variables were not measured that would have been informative as potential confounders in our study results”).

Comment 1-3: Person-first language should be used throughout the abstract and manuscript. It is probably fine to use the abbreviations as-is (HIV+, HEU, HUU), but phrases such as “HIV-positive mothers” or “HIV-infected women” should be changed to “women/mothers with HIV” or “women/mothers living with HIV.” The term “mother-to-child transmission” is also outdated and should be changed to “vertical transmission” or “perinatal transmission.” For more information on why this matters, please see: Dilmitis  et al. (2012 1).

References

1. Dilmitis S, Edwards O, Hull B, Margolese S, et al.: Language, identity and HIV: why do we keep talking about the responsible and responsive use of language? Language matters.  Journal of the International AIDS Society. 2012;  15 (4(Suppl 2)).  Publisher Full Text

Response 1-3: We thank the reviewer for their attentiveness to this matter. We have implemented person-first identifying language throughout the manuscript.

Comment 1-4: Introduction, paragraph 2: the authors reference “the HEU phenomenon” several times without really explaining what “the phenomenon” is. Are they referring to the increased morbidity and mortality observed in this population? Please be more specific.

Response 1-4: The reviewer’s inclination is correct; we are referring to the increased morbidity and mortality rates observed in HEU infants. We have altered the manuscript to read more clearly in the introduction, by removing one instance and clarifying in another case the “HEU ‘phenomenon’ of increased morbidity and/or mortality rates” (see lines 51, 64, 66).

In lines 511-513 of the Conclusions, we altered a sentence reading “This warrants further research and discussion regarding the phenomenon” to “The effects we have found here warrant further research and discussion regarding the role HIV exposure plays in infant health before readily affirming or denying that HIV exposure affects infants’ NP microbiomes.”

Comment 1-5: Introduction, paragraph 4: The authors state that their study might “provide insight into the presence or absence of immunological factors distinguishing [HEU vs. HUU infants].” However, differences in the microbiome do not necessarily indicate differences in “immunological factors.” Recommend revising the text appropriately.

Response 1-5: We agree with the reviewer’s reasoning that differences present in the microbiome do not necessarily correlate to immunological differences. We made a jump in reasoning that can only be made with something akin to an analysis of host-microbe interactions, which was not conducted here. We have altered the sentence to read, “We reasoned that by characterizing quantifiable differences in NP microbiota distinguishing these two groups, this pilot study could indicate differences encouraging future immunology-focused studies” (lines 82-83).

Comment 1-6: Methods, paragraph 1: it is necessary to state whether all infants breastfed or whether some infants consumed any foods other than breast milk in the first 14 weeks of life, as nutrition sources have a profound ability to impact the microbiome.

Response 1-6: We agree that information on breastfeeding would be extremely relevant for the purposes of microbiome analysis. After reviewing the case report forms, we found that the study did not assess and collect breast feeding status. For purposes of this study, we can clarify that formula is rarely used and breastfeeding is nearly universal. We have clarified this in the manuscript and added a relevant citation (lines 123-125).

Comment 1-7: There are several examples where the manuscript is organized in a non-standard way:

Tables containing data (such as Table 1 and Table 2) belong in the results section, not the methods section. Recommend moving these tables, and the text describing them, to the beginning of Results.

Response 1-7: We have moved the tables and appropriate text descriptions to the beginning of the Results section (lines 267-273).

Comment 1-8: The text in the methods section describing the reasons that some samples were excluded (beginning with “Some missingness in the data is present…”) also belongs in results, not methods.

Response 1-8: We have moved the description of the excluded samples to the beginning of the Results section (line 268).

Comment 1-9: There are some “editorial” comments in the results section, which really belong in the discussion instead. For example, page 7, last paragraph: “…which may in part be the result of the mothers’ acquired immunity over time and therefore lower overall NP carriage;” page 8, 3 rd to last paragraph: “These transitional patterns align with those found in a longitudinal East Asian infant cohort.”

Response 1-9: We have moved the comments in question to the Discussion section, including the larger paragraph on parallel findings from previous studies (lines 285-287; 421-422; other comments were also moved when relevant).

Comment 1-10: Table 1 is very confusing and poorly organized:

The headers often do not apply to the rows beneath them (male sex and median age should not be categorized under “place of birth;” median birth weight and twin births should not be categorized under “gestational age;” number of mothers, mothers age, married, father lives with child should not be categorized under “immunizations at birth;” etc.

There are many rows for which the result is 0 in every column, or for which there are apparently no data. These rows should be removed from the table completely. For example, why is the Chilenje clinic described in table when only the Chawama clinic is described in the text? Why is “other place of birth” listed in the table if none of the infants were born in an “other place”? Why are “father lives with child” and “trimester initiated ART” included in the table if there are no data for these variables? The L&D complications that did not occur in any infants (obstructed labor, sepsis, hemorrhage, other) can be removed and a sentence could be added to the text that says “none of these complications occurred: X,Y,Z.”

The table shows one infant with birth asphyxia. This infant should be further described in the text. How severe was the birth asphyxia? Did the infant require respiratory support, intubation, oxygen, etc. (since respiratory instrumentation could potentially have impacted this infant’s respiratory microbiome)?

Response 1-10: We thank the reviewer for their attention to this oversight. We have reordered the rows to clearly define parameter sections, and removed all rows for which the parameter did not apply to the infant demographics.

We have added the following sentence in place of the excluded rows: “None of the mothers in this cohort experienced obstructed labor, sepsis, or hemorrhage complications during the labor and delivery period” (lines 97-99).

During the SAMIPS data collection procedures, each mother was asked whether they had any complications with labor and delivery. This was a closed ended question with five possible responses: obstructed labor, birth asphyxia, sepsis, hemorrhage, or other. Follow-up questions about severity or treatment were not asked. Given that the study eligibility criteria limited enrollment to healthy infants with no acute or chronic medical conditions, it is unlikely that the birth asphyxia was severe. However, we cannot rule out the use of respiratory instrumentation that could impact the infant’s respiratory microbiome. We have noted this clarification in our study limitations of the Discussion section (lines 502-503).

Comment 1-11: Table 2:

The text states that two of 20 mothers lacked swabs at both the t=0 and t=6 time points. However, the table shows all 20 mothers had a swab at t=0 and 2 were missing swabs at t=6. Either the text or table should be revised so that they harmonize.

It would be preferred if the table could be stratified by HIV status at each time point, rather than consolidating all of the HEU and HUU results at the bottom. This way it would be easy to see whether either group was over-represented at any one given time point.

Response 1-11: We appreciate the reviewer’s attention to this discrepancy. We verified that the table contains an accurate summary of the data and updated the text to reflect the table (line 273). We have also updated Table 2 to delineate each status/subject group per time point.

Comment 1-12: Table 3: please standardize the number of decimal points across columns (some columns have 2 decimal points, others have none).

Response 1-12: We have adjusted the table to have one decimal point standardized across the columns and right-justified the entries (now Table 7).

Comment 1-13: Table 4: several columns describe “adjusted p-values” but there is no description of what variables were adjusted for. The reader should not have to refer back to the methods section to understand the statistical analysis. This is also a problem in Table 7. Please add a description of what variables the analyses were adjusted for to the Table heading or to a footnote for each table.

Response 1-13: We thank the reviewer for their comment. We acknowledge that there may also be some confusion as to what we mean by adjusted p-values. The concept of adjusting p-values is widely used in ‘omics analysis, and stems from the application of a correction to limit the Type I error (false positive) inflation in multiple hypothesis testing. There are several techniques that can be used for this purpose; we used the Bonferroni correction which is the most conservative. The actual implementation of this correction is to identify a corrected significance level, α, so each p-value is compared to this new value resulting in an adjusted p-value.

We have clarified language throughout the manuscript and caption for Table 7 (now Table 6) by changing to the following language: “A Bonferroni correction was applied to account for multiple hypothesis testing.” (lines 753-754).

We recognize that Table 4 (now Table 3) is especially confusing given the different variables. We have rearranged the columns to list the coefficient estimate followed by the unadjusted and adjusted p-values for each variable included in the model.

Comment 1-14: Table 6: I believe there is a typo in the table; the two time points being compared are t=0 and t=6 according to the text; however, the table shows t=0 and t=1.

Response 1-14: We have adjusted Table 6 accordingly (now Table 5).

Comment 1-15: Table 7: It is very difficult to understand this table without reading the text. Specifically, what direction is indicated by the positive and negative numbers in the column titled “mean of differences” (higher abundance in infants or higher abundance in mothers)? After a long period of scrutiny, I was able to determine that negative numbers mean higher abundance in infants, but it should be immediately apparent to the reader. Please explain this in the Table heading or in a footnote.

Response 1-15: We agree that the mean of differences column is not well explained. We have gone ahead and corrected the table caption and heading to explain the sign directionality for mean of differences values for Table 7 (now Table 6).

Comment 1-16: Also related to Table 7, in the results section, page 12, paragraph 2: The authors describe that the purpose of the analysis in Table 7 is to compare the abundance of 3 particular bacterial species between the mothers and infants within each cohort (HIV+/HEU and HIV-/HUU), and that this comparison will help explain why HEU infants may be more susceptible to morbidity/mortality. It is not clear how comparing the abundance of these bacteria between mothers vs. infants WITHIN each cohort would help explain the HEU infants’ increased susceptibility to morbidity/mortality. It seems that to answer this question, it would be more valuable to compare the abundance of bacteria between HEU vs. HUU infants (or between HIV+ vs. HIV- mothers). Please clarify how you expect this analysis to answer the question proposed or modify/remove the language describing the reason for this analysis.

Response 1-16: We agree that the justification for performing this set of tests is invalid as written, and so we have rewritten the test justification as follows (see lines 363-366):

“Although all infants in our cohort were free from illness, we found that certain pathogenic species were present in modest abundance across samples. To ascertain the possibility of HIV-exposed infants acquiring pathogens from their HIV(+) mothers, we conducted paired t-tests for three well-known pathogenic species: Streptococcus pneumoniae; Haemophilus influenzae; and Staphylococcus haemolyticus.”

Comment 1-17: Discussion: The authors should add to the list of limitations that their results may not be generalizable to HEU infants around the globe, given that all samples were collected from participants living in a single African country.

Response 1-17: We agree with this limitation and have added it to our conclusion accordingly on lines 490-491. (“Second, given that all samples were collected from participants born in Zambia, these results may not be generalizable to HIV-exposed infants in other countries.”)

Comment 1-18: Discussion, page 14: There is some redundancy between the last 2 sentences of the first paragraph, and the 6 th paragraph, both of which describe differences in bacterial diversity between HIV+ and HIV- mothers.

Response 1-18: We have relocated the sentence referring to Figure 1B (“From Figure 1B, we observed that…”) to lines 478-479 and removed the repetitive sentence referring to increased microbial diversity.

Comment 1-19: There are several typos in the manuscript. For example, page 11, paragraph 3: “The same test for all infant samples at time point six was contrastingly showed….” Also Conclusions: “our findings may suggest that that sociological….”

Response 1-19: We have made the appropriate corrections throughout the manuscript and thank the reviewer for their attention to this matter.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Availability Statement

    Underlying data

    Zenodo: Underlying data for ‘Characterization of longitudinal nasopharyngeal microbiome patterns in maternally HIV-exposed Zambian infants.’ https://doi.org/10.5281/zenodo.7255313 25

    This project contains the following underlying data:

    • Data file 1: FinalDatOther.rds

    • Data file 2: FinalDatPICRUSt2.RDS

    • Data file 3: animalculesFinalHIV.rds

    • Data file 4: animalcules_data_2021.rds

    • Data file 5: mappingFaits.csv

    • Data file 6: mappingFinalHIV.csv

    • Data file 7: mappingFinalHIV.tsv

    • Data file 8: preclean_MetaData.txt

    • Data file 9: samips_immunization.csv

    • Data files 10–176: *-sam-report.tsv

    Data are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

    Accession numbers

    NCBI BioProject: Characterization of longitudinal nasopharyngeal microbiome patterns in maternally HIV-exposed Zambian infants. Accession number PRJNA874826. https://identifiers.org/NCBI/BioProject:PRJNA874826


    Articles from Gates Open Research are provided here courtesy of Bill & Melinda Gates Foundation

    RESOURCES