Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Sep 1.
Published in final edited form as: Biomed Signal Process Control. 2023 Aug 21;86(Pt C):105358. doi: 10.1016/j.bspc.2023.105358

Information Based Similarity Analysis of Oxygen Saturation Recordings to Detect Pulmonary Hypertension in Preterm Infants

Pravitha Ramanand a,*, Premananda Indic a, Samuel J Gentle b, Namasivayam Ambalavanan b
PMCID: PMC10487283  NIHMSID: NIHMS1927512  PMID: 37692106

Abstract

Pulmonary hypertension (PH) is a complex cardiovascular condition associated with multiple morbidities and mortality risk in preterm infants. PH often complicates the clinical course of infants who have bronchopulmonary dysplasia (BPD), a more common lung disease in these neonates, causing respiratory deterioration and an even higher risk of mortality. While risk factors and prevalence of PH are not yet well defined, early screening and management of PH in infants with BPD are recommended by consensus guidelines from the American Heart Association. In this study, we propose a screening method for PH by applying a signal analysis technique to oxygen saturation in infants. Oxygen saturation data from infant groups with BPD (41 with and 60 without PH), recorded prior to their clinical PH diagnosis were analyzed in this study. An information-based similarity approach was applied to quantify the regularity of SpO2 fluctuations represented as binary words between adjacent five-minute segments. Similarity indices (SI) were observed to be lower in subjects with PH compared to those with BPD alone (p<0.001). These measures were also assessed for performance in screening for PH. SI of 7-bit words, exhibited 80% detection accuracy, 76% sensitivity and specificity of 83%. This index also exhibited a cross-validated mean (SD) F1-score of 0.80 (0.08) ensuring that sensitivity and recall of the screening were balanced. Similarity analysis of oxygen saturation patterns is a novel technique that can be potentially developed into a signal based early PH detection method to support clinical decision and care in this vulnerable population.

Keywords: Information-based similarity, Oxygen saturation, Preterm Infants, Pulmonary Hypertension, Pulse oximetry

Graphical Abstract

graphic file with name nihms-1927512-f0001.jpg

Introduction

Survival rates of extremely preterm infants, born between 24 and 28 weeks’ gestation have risen over the past decades due to advances in neonatal medicine [1]. The risk of respiratory morbidity and mortality is nevertheless high in these infants, primarily due to neonatal chronic lung disease referred to as bronchopulmonary dysplasia (BPD) [2]. Pulmonary hypertension (PH), increased pressure in the pulmonary arterial system, complicates the clinical course of extremely preterm infants and is associated with cardiovascular and neurocognitive morbidities and increased risk of late-stage mortality, in this vulnerable group [3-5] . Among infants with BPD, PH is a complication in ~ 40% of infants and increases their risk of mortality [3,6] . While risk factors for development of PH in extremely preterm infants are still ill-defined, the need for evaluation and management of this condition is recommended by consensus guidelines from the American Heart Association [3,7-9] . In addition, early detection of PH using screening techniques may lead to early and better treatment [3].

However, infants with BPD regardless of PH presence, frequently exhibit deterioration of respiratory status with persistent or increasing oxygen requirement, making clinical features of PH such as severe oxygen desaturations and poor growth indistinguishable from that of the underlying BPD [10] . Results from recent studies suggest that the frequency and duration of these desaturations, known as intermittent hypoxemia (IH), may provide clinical utility for BPD prediction[11,12] .

We compared the frequency, duration and severity of such events between infants with BPD and PH (BPD-PH) and those with BPD alone (BPD group). This was carried out using oxygen saturation data collected in the week before PH was diagnosed clinically by echocardiogram. We reported that the average duration of IH events were longer in duration in infants with BPD-PH as compared to infants with BPD-alone and, in infants with established BPD-PH, IH event duration differed by survival status [12] . Such differences in desaturation events led us to consider the signal analysis of oxygen saturation data in this population to develop a screening method for PH. From our previous study, we realized that identification of IH events that might differentiate infants with BPD alone from those with BPD-PH would require longitudinal data analysis over the entirety of infants’ hospitalization. However, fluctuations in oxygen saturation may be more severe and persistent in the PH group than in the group with BPD alone. Our objective was hence to derive a time limited signal characteristic to effectively quantify similarities in the fluctuations of short-term oxygen saturation levels to infants with both BPD and PH.

Various linear and nonlinear methods applied to physiological heart rate variability and oxygen saturation are being actively investigated in risk stratification of infant and pediatric subjects for conditions such as BPD, obstructive sleep apnea and sepsis [13-16] . These measures mostly describe the overall static properties over a time interval related to the variability, irregularity or signal complexity, but are inadequate to quantify the dynamic changes to the regularity of signal fluctuations. Hence in this study, SpO2 data was analyzed using an information-based similarity technique, that quantifies similarities in the occurrence of binary sequences representing changes in signal levels. This measure was introduced by Yang et al. to discriminate between heart rate patterns generated from healthy subjects and those with pathological congestive heart failure [17] . In addition to physiological signals, this measure has been applied extensively to analyze data in diverse fields such as genomic sequences and linguistic texts [18,19] . Recently, adjacent segments of inter heart beat data were studied to derive similarity indices that distinguished normal subjects from groups with moderate and severe obstructive sleep apnea (OSA) [20] . Akin to this scenario, BPD with PH may be considered as manifesting a more severe contributor to IH than BPD alone. Persistent patterns of SpO2 fluctuations over neighboring time segments were hence quantified to derive differences in short-term oxygenation patterns between infants with BPD alone and those with BPD and PH. These were finally leveraged to screen for PH presence prior to clinical diagnosis. Such a screening tool based on continuously monitored signals in neonatal intensive care units (NICU) can potentially aid early clinical decision and provide a continuous bedside marker for development of PH in neonates.

1. Methods

1.1. Data and Subjects

Oxygen saturation data was prospectively collected in extremely preterm infants admitted at the University of Alabama at Birmingham (UAB) between 2018 and 2020 as part of the Prematurity-Related Ventilatory Control (Pre-Vent) study. This study approved by the Institutional Review Board (IRB-UAB) at the University of Alabama at Birmingham, had oversight provided by both IRB-UAB and an observational and safety monitoring board, appointed by the NHLBI. Approval from the institutional review board was obtained for physiological and clinical data collection with waiver of consent. At UAB, infants remaining on respiratory support after 28 postnatal days are systematically screened every month for pulmonary hypertension by echocardiogram. Infants were included if born at <29 weeks’ gestation, remained on respiratory support on postnatal day 28, had an echocardiogram performed to screen for PH, and had oxygen saturation data available in the week preceding echocardiographic diagnosis. Infants with major congenital anomalies or genetic syndromes were excluded. Table 1 gives a description of demographic and clinical characteristics of the subjects. Among these, data on some known risk factors for BPD-PH such as, small for gestational age, (congenital) patent ductus arteriosus (persistent opening between the two major blood vessels leading from the heart), sepsis and respiratory support in the study population have been presented.

Table 1:

Demographic and clinical characteristics of the infant groups with BPD and PH (BPD-PH) and BPD alone (BPD). Values are given as median (IQR) and number (%) where applicable.

BPD group
N=60
BPD-PH group
N=41
Gestational age (weeks) 25.9 (24.1,28.1) 24.4 (23.3,25.9)
Birth weight (grams) 700 (604, 939) 587 (522,650)
Sex Male: 32 (53) Male: 21 (51)
Race White: 28/59(47) White: 11 (27)
Small for gestational age 9 (15) 13(32)
Maximum fraction of inspired oxygen at 28 days, FiO2 40 (30,51) 51 (45,75)
Ventilatory support at 28 days 49 (82) 38 (93)
Outcomes
Patent ductus arteriosus 15 (25) 23(56)
Necrotizing enterocolitis 7(12) 4(10)
Sepsis 9 (15) 15 (37)
Death 4 (7) 9 (22)

Bedside cardiorespiratory monitoring was carried out by Philips IntelliVue MP70 or MP50 monitors using Nellcor pulse oximetry sensors, an averaging time of 8 seconds, and recorded using the BedMaster system (Excel Medical Electronics, Jupiter FL, USA) at 125Hz with data analyzed using numeric data at 1 Hz. Forty- one infants with BPD and diagnosed PH (BPD-PH group) were included in the study along with sixty infants without PH (BPD group).

Six hours of SpO2 data acquired seven days prior to PH diagnosis were included in this study. In our previous study [12] , we studied distribution of hypoxemic event statistics on a subset of these subjects and used continuous data from all 7 days leading up to the clinical diagnosis. This analysis is therefore focused on using a short-term recording to seek differences in oxygenation between the groups and to provide PH detection with good diagnostic accuracy metrics. Similarity indices were estimated over short non-overlapping data segments and averaged over the available observed time for each subject. Hence, subjects were not excluded, as enough data segments were available from all to average, even though some subjects had shorter than 6 hours of data. Data were preprocessed prior to analysis as follows. Recording errors caused missing SpO2 values which were identified from the simultaneously recorded time vector. Signal gaps of less than 10 seconds were imputed by the average of previous three seconds. Segments with over 5% of missing values were excluded from the analysis. Missing data over observed time [median (min,max)] for all subjects was 0 (0, 3.9%).

1.2. Information based similarity analysis

An information based approach to quantify the similarity of two symbolic sequences has been developed to classify varied physiological signals including inter heart beat (RR) intervals and DNA sequences [17,18] . This method relies on finding temporal dynamical structures hidden in the complex fluctuations of output signals from biological systems and examining their rank order statistics to differentiate between states, such as healthy and pathological [17] . Noninvasive detection of conditions such as congestive heart failure and obstructive sleep apnea has been achieved previously by applying this measure to heart rate variability signal [20,21] .

This technique quantifies the similarity between two data series by first defining a binary symbolic sequence for each. Here, we converted the oximetric data series, x = {x1,x2,…, xN} to a binary symbolic sequence as, 0 if xi+1 = xi; i=1,2,…N and 1 otherwise. Hence, in the case of SpO2 data, the sequence represents variations or changes from the previous value. Following this, the sequence was divided into m-bit words by sliding the binary number one bit at a time. Hence a sequence of length (N-1) could give 2m, unique m-bit words. Figure 1 illustrates the binary sequence creation and construction of 5-bit words for an SpO2 data snippet. The first four SpO2 values in the snippet are the same and are hence represented by three leading 0 bits in the binary sequence indicating no difference from their respective previous values. The fifth value is different from the fourth and this is represented by 1 in the next position and so on. From the binary sequence, the first m-bits are included to form the first word, the next m-bits starting from the second bit, form the second word etc. as shown for the case of m=5 in Figure 1.

Figure 1:

Figure 1:

Schematic illustration for creation of binary sequence and construction of 5-bit words for SpO2 data. The words generated by sliding one bit at a time, and the decimal equivalent of each word, 3,7, 14 etc. are also shown here.

Word frequencies are enumerated by their occurrence in the sequence and these rank numbers are sorted for words with highest frequency to lowest. The similarity of two data series is quantified by such word ranks generated from the symbolic sequences of each series. For this, the rank of each word in one series is plotted against the corresponding word in the other one. If the two series are similar in their rank order of words, the scattered points will lie close to the diagonal line. The average deviations of these points from the diagonal are considered a measure of ‘distance’ between them. The more scattered the points are, the less similar the two segments and vice-versa [17] . For two data series, S1 and S2, this average deviation for m-bit words is defined as,

D(S1,S2)=1Lk=1LR1(wk)R2(wk)F(wk) (3)

where L denotes the total number of unique m-bit words, Ri(wk), i = 1,2 is the rank of word wk for the binary sequences from S1 and S2. The rank difference is weighted by the factor F(wk) that accounts for contributions from different words to the overall measure and is calculated as

F(wk)=[p1(wk)log(p1(wk))p2(wk)log(p2(wk))]Z (4)

and the normalization factor, Z is,

Z=k=1L[p1(wk)log(p1(wk))p2(wk)log(p2(wk))] (5)

It can be noted that pi(wk)log(pi(wk)), i = 1,2 represents the Shannon entropy of the word wk for each sequence. The weighing factor, F(wk) hence incorporates the information content carried by the m-bit words in the sequences being compared [22]. In this manner, the higher the frequency of occurrence of words, greater will be their contribution to quantifying similarity. D(S1,S2), the information-based similarity index represents the average deviation of the ranks from the diagonal line and in this sense is a measure of the ‘dissimilarity’ between S1 and S2. The greater the scatter from the diagonal, higher the D(S1,S2) or SI value and lower the similarity between the data series being compared. In addition, it has been specified that this similarity index is an empirical measure which does not always obey the triangular inequality criterion for a distance measure [22]. The normalization by the total number of words keeps the index in the range [0,1].

In this analysis, the data series compared were adjacent non-overlapping segments of SpO2 signal series for the two subject groups. This ensured that dynamic changes to signal similarity that might be related to the underlying pathology were studied and quantified within each subject group. Information based similarity index for word lengths 3 to 10 for adjacent segments of 5 minutes, (Si+1,Si), 1 ≤ i ≤ (n − 1) where n is the number of segments in each recording were derived. The final similarity index for a subject, SI was obtained by averaging over all the values computed for each recording. Figure 2 displays a schematic of estimating the SI from the data record of a single subject.

Figure2:

Figure2:

Schematic representing the screening for PH by estimating information-based similarity index for a subject.

1.3. Statistical analysis and validation

Signal preprocessing, estimation of indices as well as statistical analyses were carried out using MATLAB (R2021b, MathWorks, Natick, MA, USA). Similarity indices were calculated using the software in [23] . Kolmogorov–Smirnov test was used to check normality of the variables. The indices for BPD and BPD-PH groups were tested for equality of means by the independent samples t-test in case of normal variables and the Wilcoxon rank test for equality of medians otherwise. The results were expressed as the mean ± standard deviation (SD). Significant difference was indicated by p < 0.05. Univariate logistic regression was used to classify infants with PH using each SpO2 similarity feature as predictor and the performance was evaluated using accuracy, sensitivity, and specificity. In addition, the Mathews correlation coefficient which accounts for agreement between observed and predicted labels by including all elements of the confusion matrix was also estimated [24,25]. This metric ranges from −1 to 1, with 0 corresponding to a random classifier. The receiver operating characteristic (ROC) curve and area under the ROC curve (AUROC) of the screening models were also determined for each classifier. In addition, the bootstrap method with 1000 replications was applied to derive the 95% confidence interval for the ROC.

A hold-out strategy was adopted for cross validating the classifiers. 30 infants from the BPD-PH group (40%) and 45 from the BPD group (60%) were randomly sampled to form a stratified training set with the remaining (25%) as test data. Stratification ensured that the same distribution of positive (BPD-PH) and negative (BPD) classes was retained across the original and test-train sets. Performance was evaluated by calculating the balanced accuracy and F1 score, metrics recommended for evaluating classification on imbalanced data sets [26] . Balanced accuracy denotes mean of the sensitivity and specificity while F1 is the harmonic mean of the sensitivity and positive predictive value of the classifier. This was repeated 100 times and the mean and SD of these metrics were calculated for each feature.

2. Results

2.1. Similarity Index estimation and analysis between BPD-PH and BPD groups

The information-based similarity index (SI) was estimated for adjacent non-overlapping SpO2 data segments. Representative examples from a PH subject are shown in Figure 3. adjacent segments with more constant levels of SpO2 as in subplot (a) showed more ranks for 5-bit words closer to the line of identity, resulting in a lower SI and another pair of segments as in (b) that were more dissimilar had higher SI estimates. Hence the similarity index can track the variations in the regularity of SpO2 fluctuations between adjacent non-overlapping segments.

Figure 3:

Figure 3:

Adjacent SpO2 segments from a BPD-PH subject (a) and (b) representing different regularity in their fluctuations. The rank comparison of 5-bit words for the segments in (a) is plotted in (c). The corresponding plot for the segments in (b) is shown in (d). The line of equality in (c) and (d) helps visualize agreement between ranks for different words between the two segments.

2.2. Comparison of SpO2 similarity indices between BPD-PH and BPD groups

Similarity indices were estimated over all adjacent segments available from the recording for each subject. These were averaged over the observation period to provide a mean similarity index for each subject and compared between groups. 98 subjects had all 6 hours, one BPD-PH infant had 4.63 hours and two BPD infants had 5.31 and 5.64 hours of data respectively. The choice of 5-minute long segments was made to ensure adequate frequency for the binary words of differing bit-lengths from m = 3 to 10 [27,28] .

The dynamic changes of SpO2 measures for the two groups are shown in Figure 4. SI for 7-bit words was well separated between subject groups, with the BPD group’s values topping the BPD-PH group’s indices over the observed periods.

Figure 4:

Figure 4:

Temporal variation of the estimated SI for 7-bit words for the two groups. The lines represent the mean value of the index and the shaded portion, standard error for the BPD and BPD-PH groups at each analyzed segment over the whole recording.

The mean (SD) of these measures are summarized by groups in Table 2. All measures passed the Kolmogorov test for normality. The SI indices were all significantly higher in the BPD group compared to the BPD-PH group (p<0.001). We also conducted this analysis for the shorter recording times of 2 and 4 hours with similar results (Supplementary Table 1). However, the measures averaged over the entire 6 hour period had more 5-minute segments included and consequently gave lower standard deviation estimates and the greatest differences between the BPD-PH and BPD groups.

Table 2:

SpO2 similarity indices (mean (SD)) for the BPD-PH and BPD groups.

Measure BPD
N=60
BPD-PH
N=41
P
SI_m3 0.10(0.03) 0.08(0.01) <0.001
SI_m4 0.12(0.02) 0.10(0.01) <0.001
SI_m5 0.14(0.02) 0.12(0.01) <0.001
SI_m6 0.16(0.03) 0.14(0.02) <0.001
SI_m7 0.20(0.03) 0.17(0.02) <0.001
SI_m8 0.24(0.05) 0.20(0.03) <0.001
SI_m9 0.29(0.06) 0.24(0.03) <0.001
SI_m10 0.33(0.08) 0.27(0.04) <0.001

SI_mi, i = 3,4, …,10: Similarity index of , i-bit word

2.3. Screening Performance of Similarity indices

Univariate logistic regression was used to screen for PH in the subject sample. The estimated coefficients, model equations and p-values are given in Supplementary Table 2. The ROC curve was considered and the area under it (AUROC) computed to provide a measure of the discriminative ability of each SI feature. Screening analysis was also conducted using estimates averaged over a shorter period of 4 hours and the comparison between the computed AUROC metrics is presented in Figure 5. This analysis ascertained whether the discriminative ability differed when the observation time was longer, and secondly, decided which measures exhibited superior classification capability.

Figure 5:

Figure 5:

Area under ROC of screening models for each of the SpO2 features averaged 4 hours and 6 hours.

SI of binary words m ≥4, averaged over 6 hours observation time exhibited AUROC > 0.75. When averaged over 4 hours, only four of these indices had AUROC higher than 0.75. SI_m4 had the highest discriminative ability among all measures considered and close values (~ 0.81) at both averaging periods. This was also observed for SI_m5. SI for all other word lengths had higher AUROC when averaged for 6-hours. This analysis clearly demonstrated that similarity indices discriminated between BPD-PH and BPD groups, and estimates averaged over (at least) 6 hours would likely perform better classification than those over shorter observation periods. However, SI at shorter word lengths of 4 and 5 may still be suitable for PH screening when only short SpO2 data sets are available.

The performance metrics of PH screening by similarity indices of words with 3 ≤ m ≤ 10 bits are presented in Table 3. Sensitivity of the screening is of utmost importance in a clinical setting, so that ideally all PH subjects are correctly classified. With more correctly classified PH subjects, SI for m = 4 to 7 showed incremental sensitivity, specificity, and accuracy. SI_m7 had the best sensitivity, specificity pairing as (0.76, 0.83). Figure 6 displays the ROC curve and confusion matrix of the screening based on this similarity index. The Mathews correlation coefficient (MCC) evaluated for models with these features was found to vary from 0.21 at m=3 to 0.3 at m=10, with a peak value of 0.59 for word length of 7, all of which were higher 0, the value for a random classifier. In addition to the area under ROC values themselves, the lower bound of 95% confidence interval was also more than 0.5 corresponding to a random classifier. The similarity features thus differentiated between the two subject groups considered. Based on all these metrics, the best screening performance for PH was achieved by the information-based similarity indices estimated for adjacent 5-minute segments and averaged over 6 hours and for word lengths of 4 ≤ m ≤ 7.

Table 3:

Screening performance of similarity indices

Classifier
Feature
TP FP TN FN Acc. Sen. Spe. AUROC
(95% CI)
SI_m3 20 17 43 21 0.624 0.488 0.717 0.73(0.62,0.82)
SI_m4 28 15 45 13 0.723 0.683 0.750 0.81(0.71,0.88)
SI_m5 28 15 45 13 0.723 0.683 0.750 0.78(0.69,0.86)
SI_m6 29 13 47 12 0.752 0.707 0.783 0.78(0.68,0.86)
SI_m7 31 10 50 10 0.802 0.756 0.833 0.79(0.69,0.87)
SI_m8 27 13 47 14 0.733 0.659 0.783 0.78(0.66,0.87)
SI_m9 25 13 47 16 0.713 0.610 0.783 0.77(0.64,0.85)
SI_m10 23 14 46 18 0.683 0.561 0.767 0.76(0.66,0.85)

SI_mi, i = 3,4, …,10: Similarity index of i-bit word, TP: true positive, FP: false positive, TN: true negative, FN: false negative, Acc.: accuracy, Sen.: sensitivity, Spe.: specificity, AUROC: area under ROC, CI: confidence interval

Figure 6:

Figure 6:

PH screening performance of the SI_m7 feature. (a) Receiver operating curve (b) confusion matrix

Finally, screening performance of features was validated on 100 trials with held out test sets of ~ 25 subjects. Figure 7 displays the mean (SD) of balanced accuracy and F1 metrics for the classification by SI of m=4 to 9 words. For unbalanced data, with more controls (BPD) than cases (BPD-PH), these metrics are recommended for performance evaluation of prediction models. SI_m7 exhibited the highest F1 and balanced accuracy of 0.80(0.08) and 0.77(0.08) respectively, while other SI had test metrics ~ 70%. With high sensitivity, specificity, and positive predictive values, SpO2 similarity indices can thus be effective in PH screening.

Figure 7:

Figure 7:

Cross-validation metrics (Mean +/− SD) of screening with SpO2 features over 100 trials. (a) Balanced Accuracy, (b) F1 score.

3. Discussion

Information-based similarity analysis of oxygen saturation data was demonstrated to detect pulmonary hypertension (PH), a debilitating cardiorespiratory complication, in a population of preterm infants with BPD. This non-invasive technique could detect PH in infants with good diagnostic accuracy by analyzing adjacent data segments, one week before clinical diagnosis. The similarity index corresponding to 7-bit binary words achieved maximum screening accuracy, sensitivity, and specificity of 80%, 76% and 83% respectively. To the best of our knowledge, this is the first application of a signal analysis technique leveraging routinely monitored oximetric data in the NICU for PH screening.

3.1. Proposed method and screening performance

Information-based similarity analysis quantifies the regularity of SpO2 fluctuations between adjacent segments of data by enumerating patterns of m-bit binary words. The method identified greater similarity between adjacent SpO2 segments for the BPD-PH group when averaged over time (Table 1) and over subjects (Figure 3), supporting our initial hypothesis. Similarity indices of different word lengths were consistently lower for BPD-PH in comparison to the BPD group (p<0.001). Based on area under ROC and diagnostic metrics, these indices also discriminated BPD subjects with PH from those with BPD alone. For segments of 5-minute duration, best screening performance was by 7-bit binary words beyond which performance deteriorated. We can therefore infer that if segment length increased, longer words may likely occur, plausibly with differing frequencies, thus providing different estimates for the similarity index in each group and hence better screening performance. In this study, we used 6 hours of data, but have also shown discrimination between groups with SI estimates averaged over 4 hours, albeit with lower performance metrics (Figure 4). The optimal choice of segment length for similarity analysis, the length m of the binary word and the averaging time will depend on this available data length.

The cross-validation of the univariate screening showed variability (SD: 8%-10%) in the balanced accuracy and F1 metrics. The training set was appropriately stratified based on the subject sample to reflect outcomes that may occur in a realistic setting where more subjects without PH are likelier than those with PH. The observed variability in the cross-validation metrics may therefore stem from two sources. Firstly, the test sample size was small and due to this, even a few misclassifications could affect the performance metrics detrimentally. Secondly and perhaps more importantly, the screening is affected by how well differentiated the groups are, based on the similarity index. Because these infants all have underlying BPD, the cardiorespiratory patterns they exhibit may be similar despite a subset of them also having PH. During the observed 6 hour period, if some BPD-PH infants had similar patterns of oxygenation to that of a BPD infant (or vice-versa), their similarity index may overlap the BPD group and cause them to be misclassified. This is a challenge in developing efficient screening models for this condition that can only be overcome by including large sets of subject data accounting for all sources of variability in cardiorespiratory behavior and identifying signal measures that can quantify subtle differences between groups reliably. Despite this, the lowest validation metrics corresponding to SI_m7 and SI_m8 were over 70%, displaying good sensitivity and PPV balanced with specificity, all important metrics from a clinical perspective.

3.2. Current methods for PH diagnosis and strengths of SpO2 based PH screening

A serious cardiovascular complication in preterm infants, PH is associated with higher risk of mortality, non-optimal growth and developmental outcomes, especially in those with BPD [3-5]. Diagnostic echocardiogram in infants with moderate to severe BPD at a postnatal age of 36 weeks’ postmenstrual age is the current recommendation for PH screening [29]. However, serial electrocardiography even when evaluated by trained neonatologists has limited sensitivity to detect the disease and its severity [30] . Risk factors for this condition suggest early detection of PH recommended by AHA consensus [3,7] . Screening models based on clinical characteristics and physiological signals which provide early and additional predictions of PH presence, may therefore support development of preventive protocols.

A recent model with demographic and known clinical risk factors such as small for gestational age and BPD severity grade was found to predict PH with AUROC of 0.93 [31] . Degree of respiratory support over the first five days of life was used to predict development of future PH with AUROC of 0.76 [32]. We studied distribution characteristics of severe desaturations in a case-control sample and found significantly longer durations per event in the PH group. The odds ratio failed to reach significance in detecting PH after adjustment for baseline and clinical characteristics, though mortality was well predicted within the PH group (AUROC = 0.77 ) [12]. These findings prompted us to explore indices of oxygenation dynamics that can identify characteristics of PH-compromised cardio-vascular functioning in the affected group. Similarity analysis of short term SpO2 fluctuations as presented here is a novel methodology which can potentially be developed into a bedside screening protocol for PH. In addition, as it is based on portable, cost-effective pulse oximetry, this method can be easily adopted in low-resource settings with limited access to NICU hospitals [33].

3.3. Limitations and Future directions of the work

In this study, we focused on the similarity index as a marker of the differential oxygenation dynamics between the BPD and BPD-PH groups of infants with BPD. While this showed good performance, measures characterizing other aspects such as signal irregularity and spectral content may carry complementary information that can differentiate between groups. These could be investigated as potential signal based features along with the current measure, to identify infants with PH. Detecting cardio-respiratory diseases using physiological signals is challenging because disease markers may be spread over time or have signal patterns similar to the underlying condition, in this case BPD[10] . Hence, more effective signal analysis techniques and indices that extract information related to the pathology need to be investigated. An analogous situation arises in applying SpO2 or heart rate variability analysis to distinguish severe from mild cases of OSA in different subject populations [14,34] . The methodology developed here should be validated on more subjects from different centers and its efficacy for early warning tested on data from earlier periods, preferably a few weeks before diagnostic testing. Our future work will pursue these directions as well as incorporate clinical characteristics to risk stratify pre-term infants likely to develop PH.

4. Conclusion

This work proposed information-based similarity analysis of routinely monitored oxygen saturation to screen preterm infants with BPD for PH. Quantifying the similarity in SpO2 fluctuation patterns between adjacent segments from data recorded one week prior to the clinical diagnosis, this method outperformed other measures describing varied signal characteristics, in identifying PH in infants with BPD. The validation F1-score of screening with similarity indices was high, pointing to balanced sensitivity and positive predictive value, both important metrics in clinical screening. In summary, similarity analysis of oxygen saturation is a promising physiological signal-based early screening tool for PH, a complicated cardiovascular condition in preterm infants.

Supplementary Material

1

Highlights.

  • Pulmonary hypertension (PH) complicates the clinical course and outcomes in preterm infants.

  • Early screening for the condition can inform medical decision and neonatal care.

  • Information based similarity analysis of oxygen saturation patterns identified infants with PH

  • Our method using short data sets detected PH prior to diagnostic echocardiography

  • Noninvasive signal-based screening can be a continuous marker of disease development

Acknowledgements

Funding Sources: This work was supported by the National Institutes of Health (LDCC U01 HL133708 and UAB U01 HL133536).

List of Abbreviations

PH

Pulmonary hypertension

BPD

Bronchopulmonary dysplasia

AHA

American Heart Association

SI_mi

Similarity index for a word of i bits

AUROC

Area under receiver operating curve

Acc.

Accuracy

Sen.

Sensitivity

Spe.

Specificity

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Disclaimer:

The views expressed in this article are those of the authors and do not necessarily represent those of the National Institutes of Health or the U.S. Department of Health and Human Services.

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  • [1].Stoll BJ, Hansen NI, Bell EF, Shankaran S, Laptook AR, Walsh MC, Hale EC, Newman NS, Schibler K, Carlo WA, Kennedy KA, Poindexter BB, Finer NN, Ehrenkranz RA, Duara S, Sánchez PJ, O’Shea TM, Goldberg RN, Van Meurs KP, Faix RG, Phelps DL, Frantz ID, Watterberg KL, Saha S, Das A, Higgins RD, Eunice Kennedy Shriver National Institute of Child Health and Human Development Neonatal Research Network, Neonatal outcomes of extremely preterm infants from the NICHD Neonatal Research Network., Pediatrics. 126 (2010) 443–456. 10.1542/peds.2009-2959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Mourani PM, Sontag MK, Younoszai A, Miller JI, Kinsella JP, Baker CD, Poindexter BB, Ingram DA, Abman SH, Early pulmonary vascular disease in preterm infants at risk for bronchopulmonary dysplasia., Am. J. Respir. Crit. Care Med 191 (2015) 87–95. 10.1164/rccm.201409-1594OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Arjaans S, Zwart EAH, Ploegstra M-J, Bos AF, Kooi EMW, Hillege HL, Berger RMF, Identification of gaps in the current knowledge on pulmonary hypertension in extremely preterm infants: A systematic review and meta-analysis., Paediatr. Perinat. Epidemiol 32 (2018) 258–267. 10.1111/ppe.12444. [DOI] [PubMed] [Google Scholar]
  • [4].Choi EK, Shin SH, Kim E-K, Kim H-S, Developmental outcomes of preterm infants with bronchopulmonary dysplasia-associated pulmonary hypertension at 18-24 months of corrected age., BMC Pediatr. 19 (2019) 26. 10.1186/s12887-019-1400-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Nakanishi H, Uchiyama A, Kusuda S, Impact of pulmonary hypertension on neurodevelopmental outcome in preterm infants with bronchopulmonary dysplasia: a cohort study., J. Perinatol 36 (2016) 890–896. 10.1038/jp.2016.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].del Cerro MJ, Sabaté Rotés A, Cartón A, Deiros L, Bret M, Cordeiro M, Verdú C, Barrios MI, Albajara L, Gutierrez-Larraya F, Pulmonary hypertension in bronchopulmonary dysplasia: clinical findings, cardiovascular anomalies and outcomes., Pediatr. Pulmonol 49 (2014) 49–59. 10.1002/ppul.22797. [DOI] [PubMed] [Google Scholar]
  • [7].Abman SH, Hansmann G, Archer SL, Ivy DD, Adatia I, Chung WK, Hanna BD, Rosenzweig EB, Raj JU, Cornfield D, Stenmark KR, Steinhorn R, Thébaud B, Fineman JR, Kuehne T, Feinstein JA, Friedberg MK, Earing M, Barst RJ, Keller RL, Kinsella JP, Mullen M, Deterding R, Kulik T, Mallory G, Humpl T, Wessel DL, American Heart Association Council on Cardiopulmonary, Critical Care, Perioperative and Resuscitation; Council on Clinical Cardiology; Council on Cardiovascular Disease in the Young; Council on Cardiovascular Radiology and Intervention; Council on Cardiovascular Surgery and Anesthesia; and the American Thoracic Society, Pediatric pulmonary hypertension: guidelines from the american heart association and american thoracic society., Circulation. 132 (2015) 2037–2099. 10.1161/CIR.0000000000000329. [DOI] [PubMed] [Google Scholar]
  • [8].Kim D-H, Kim H-S, Choi CW, Kim E-K, Kim BI, Choi J-H, Risk factors for pulmonary artery hypertension in preterm infants with moderate or severe bronchopulmonary dysplasia., Neonatology. 101 (2012) 40–46. 10.1159/000327891. [DOI] [PubMed] [Google Scholar]
  • [9].Krishnan U, Feinstein JA, Adatia I, Austin ED, Mullen MP, Hopper RK, Hanna B, Romer L, Keller RL, Fineman J, Steinhorn R, Kinsella JP, Ivy DD, Rosenzweig EB, Raj U, Humpl T, Abman SH, Pediatric Pulmonary Hypertension Network (PPHNet), Evaluation and Management of Pulmonary Hypertension in Children with Bronchopulmonary Dysplasia., J. Pediatr 188 (2017) 24–34.e1. 10.1016/j.jpeds.2017.05.029. [DOI] [PubMed] [Google Scholar]
  • [10].Ambalavanan N, Mourani P, Pulmonary hypertension in bronchopulmonary dysplasia., Birth Defects Res. Part A Clin. Mol. Teratol 100 (2014) 240–246. 10.1002/bdra.23241. [DOI] [PubMed] [Google Scholar]
  • [11].Jensen EA, Whyte RK, Schmidt B, Bassler D, Vain NE, Roberts RS, Canadian Oxygen Trial Investigators, Association between Intermittent Hypoxemia and Severe Bronchopulmonary Dysplasia in Preterm Infants., Am. J. Respir. Crit. Care Med 204 (2021) 1192–1199. 10.1164/rccm.202105-1150OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Gentle SJ, Travers CP, Nakhmani A, Indic P, Carlo WA, Ambalavanan N, Intermittent Hypoxemia and Bronchopulmonary Dysplasia with Pulmonary Hypertension in Preterm Infants., Am. J. Respir. Crit. Care Med 207 (2023) 899–907. 10.1164/rccm.202203-0580OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Lake DE, Fairchild KD, Moorman JR, Complex signals bioinformatics: evaluation of heart rate characteristics monitoring as a novel risk marker for neonatal sepsis., J. Clin. Monit. Comput 28 (2014) 329–339. 10.1007/s10877-013-9530-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Álvarez D, Alonso-Álvarez ML, Gutiérrez-Tobal GC, Crespo A, Kheirandish-Gozal L, Hornero R, Gozal D, Terán-Santos J, Del Campo F, Automated screening of children with obstructive sleep apnea using nocturnal oximetry: an alternative to respiratory polygraphy in unattended settings., J. Clin. Sleep Med 13 (2017) 693–702. 10.5664/jcsm.6586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Arce-Alvarez A, Melipillán C, Andrade DC, Toledo C, Marcus NJ, Del Rio R, Heart rate variability alterations in infants with spontaneous hypertonia., Pediatr. Res 86 (2019) 77–84. 10.1038/s41390-019-0318-7. [DOI] [PubMed] [Google Scholar]
  • [16].Sullivan BA, Wallman-Stokes A, Isler J, Sahni R, Moorman JR, Fairchild KD, Lake DE, Early pulse oximetry data improves prediction of death and adverse outcomes in a two-center cohort of very low birth weight infants., Am. J. Perinatol 35 (2018) 1331–1338. 10.1055/s-0038-1654712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Yang AC-C, Hseu S-S, Yien H-W, Goldberger AL, Peng CK, Linguistic analysis of the human heartbeat using frequency and rank order statistics., Phys. Rev. Lett 90 (2003) 108103. 10.1103/PhysRevLett.90.108103. [DOI] [PubMed] [Google Scholar]
  • [18].Peng CK, Yang AC-C, Goldberger AL, Statistical physics approach to categorize biologic signals: from heart rate dynamics to DNA sequences., Chaos. 17 (2007) 015115. 10.1063/1.2716147. [DOI] [PubMed] [Google Scholar]
  • [19].Yang AC-C, Peng CK, Yien HW, Goldberger AL, Information categorization approach to literary authorship disputes, Phys. A: Stat. Mech. Appl 329 (2003) 473–483. 10.1016/S0378-4371(03)00622-8. [DOI] [Google Scholar]
  • [20].Wu S, Liang D, Yang Q, Liu G, Regularity of heart rate fluctuations analysis in obstructive sleep apnea patients using information-based similarity, Biomed. Signal Process. Control 65 (2021) 102370. 10.1016/j.bspc.2020.102370. [DOI] [Google Scholar]
  • [21].Liu Z, Chen T, Wei K, Liu G, Liu B, Similarity changes analysis for heart rate fluctuation regularity as a new screening method for congestive heart failure., Entropy (Basel). 23 (2021). 10.3390/e23121669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Yang AC, Tsai S-J, Hong C-J, Wang C, Chen T-J, Liou Y-J, Peng C-K, Clustering heart rate dynamics is associated with β-adrenergic receptor polymorphisms: analysis by information-based similarity index., PLoS ONE. 6 (2011) e19232. 10.1371/journal.pone.0019232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Yang A, Information-based Similarity Toolbox, 2022 (2022). [Google Scholar]
  • [24].Chicco D, Jurman G, The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation., BMC Genomics. 21 (2020) 6. 10.1186/s12864-019-6413-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Gunduz H, An efficient dimensionality reduction method using filter-based feature selection and variational autoencoders on Parkinson’s disease classification, Biomed. Signal Process. Control 66 (2021) 102452. 10.1016/j.bspc.2021.102452. [DOI] [Google Scholar]
  • [26].Bekkar M, Djemaa HK, Alitouche TA, Evaluation Measures for Models Assessment over Imbalanced Data Sets, Journal of Information Engineering and Applications. 3 (2013) 27–38. [Google Scholar]
  • [27].Yentes JM, Hunt N, Schmid KK, Kaipust JP, McGrath D, Stergiou N, The appropriate use of approximate entropy and sample entropy with short data sets., Ann. Biomed. Eng 41 (2013) 349–365. 10.1007/s10439-012-0668-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Álvarez D, Hornero R, Marcos JV, Del Campo F, Feature selection from nocturnal oximetry using genetic algorithms to assist in obstructive sleep apnoea diagnosis., Med. Eng. Phys 34 (2012) 1049–1057. 10.1016/j.medengphy.2011.11.009. [DOI] [PubMed] [Google Scholar]
  • [29].Abman SH, Ivy DD, Archer SL, Wilson K, AHA/ATS Joint Guidelines for Pediatric Pulmonary Hypertension Committee, Executive summary of the american heart association and american thoracic society joint guidelines for pediatric pulmonary hypertension., Am. J. Respir. Crit. Care Med 194 (2016) 898–906. 10.1164/rccm.201606-1183ST. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Mourani PM, Sontag MK, Younoszai A, Ivy DD, Abman SH, Clinical utility of echocardiography for the diagnosis and management of pulmonary vascular disease in young children with chronic lung disease., Pediatrics. 121 (2008) 317–325. 10.1542/peds.2007-1583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].Wang C, Ma X, Xu Y, Chen Z, Shi L, Du L, A prediction model of pulmonary hypertension in preterm infants with bronchopulmonary dysplasia., Front. Pediatr 10 (2022) 925312. 10.3389/fped.2022.925312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Seo YM, Yum SK, Sung IK, Respiratory Severity Score with Regard to Birthweight during the Early Days of Life for Predicting Pulmonary Hypertension in Preterm Infants., J. Trop. Pediatr 66 (2020) 561–568. 10.1093/tropej/fmaa013. [DOI] [PubMed] [Google Scholar]
  • [33].Garde A, Zhou G, Raihana S, Dunsmuir D, Karlen W, Dekhordi P, Huda T, Arifeen SE, Larson C, Kissoon N, Dumont GA, Ansermino JM, Respiratory rate and pulse oximetry derived information as predictors of hospital admission in young children in Bangladesh: a prospective observational study., BMJ Open. 6 (2016) e011094. 10.1136/bmjopen-2016-011094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Del Campo F, Crespo A, Cerezo-Hernández A, Gutiérrez-Tobal GC, Hornero R, Álvarez D, Oximetry use in obstructive sleep apnea., Expert Rev. Respir. Med 12 (2018) 665–681. 10.1080/17476348.2018.1495563. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES