Abstract
The presented collection of data comprises of a set of 12 linear and nonlinear indices computed at different time scales and extracted from Fetal Heart Rate (FHR) traces acquired through Hewlett Packard CTG fetal monitors (series 1351A), connected to a PC. The sampling frequency of the recorded FHR signal is equal 2 Hz. The recorded populations consist of two groups of fetuses: 60 healthy and 60 Intra Uterine Growth Restricted (IUGR) fetuses. IUGR condition is a fetal condition defined as the abnormal rate of fetal growth. In clinical practice, diagnosis is confirmed at birth and may only be suspected during pregnancy. The pathology is a documented cause of fetal and neonatal morbidity and mortality. The described database was employed in a set of machine learning approaches for the early detection of the IUGR condition: “Integrating machine learning techniques and physiology based heart rate features for antepartum fetal monitoring” [1]. The added value of the proposed indices is their interpretability and close connection to physiological and pathological aspect of FHR regulation. Additional information on data acquisition, feature extraction and potential relevance in clinical practice are discussed in [1].
Keywords: Fetal heart rate monitoring, Cardiotocography, Intra uterine growth restricted, Physiology-based features, Multivariate analysis, Predictive analytics
Specifications Table
| Subject | Medicine - Obstetrics, Gynecology and Women's Health |
| Specific subject area | Perinatal Medicine, Fetal Monitoring |
| Type of data | Table of Fetal Heart Rate (FHR) parameters |
| How data were acquired | A set of linear and nonlinear algorithms were computed by means of algorithms implemented in Matlab acquired using Hewlett Packard CTG fetal monitors (series 1351A), connected to a PC |
| Data format | Raw table of FHR indices |
| Parameters for data collection | 60 normals and 60 IUGR CTG recordings, each lasting > 30 minutes were collected on pregnant women, at different weeks of gestation. Parameters were extracted by means of algorithms reported in the scientific literature, considering non overlapping time windows of different duration, depending on the parameter. Each index reported in the table represents the average of the relevant parameter computed over the whole recording. |
| Description of data collection | CTG traces were recorded using Hewlett Packard CTG fetal monitors (series 1351A), connected to a PC during standard non-stress tests on pregnant women in a clinical environment under supervision of trained personnel. FHR was sampled at 2 Hz (one value each 500 ms) |
| Data source location | Department of Obstetrical-Gynaecological and Urological Science and Reproductive Medicine, Federico II University, Naples, Italy |
| Data accessibility | Repository name: Mendeley Data Data identification number: 10.17632/2953f8fgcy.1 Direct URL to data: https://doi.org/10.17632/2953f8fgcy.1#file-693e38db-a819-4b36-960e-446634ce45e5 |
| Related research article | Signorini, M. G., Pini, N., Malovini, A., Bellazzi, R., & Magenes, G. (2020). Integrating machine learning techniques and physiology based heart rate features for antepartum fetal monitoring. Computer methods and programs in biomedicine, 185, 105015. https://doi.org/10.1016/j.cmpb.2019.105015 |
|
|
|
|
1. Data description
The data enclosed with this Data in Brief article contain a table of 12 Fetal Heart Rate indices extracted from CTG tracings collected from two equally numerous populations of fetuses during the antepartum period: 60 healthy fetuses and 60 Intra Uterine Growth Restricted (IUGR) fetuses [1].
The 120 CTG tracings were recorded at Department of Obstetrical-Gynaecological and Urological Science and Reproductive Medicine, Federico II University, Naples, Italy. These tracings cannot be made publicly available due to privacy and ethical reasons. The indices contained in the data table, were computed on CTG tracings as described in Experimental Design, Materials, and Methods section below. Each datum reported in the table represents the average of the relevant index for the whole duration of the recording. This condensed information represents the raw data that we employed in the related research article for testing several machine learning methods with the purpose of discriminating healthy versus IUGR fetuses. This dataset is fully available as reported in Data accessibility section.
Coming to the description of the available dataset, it comprises of two different excel sheets which contain the parameters of the two populations.
The data are organized as follows:
-
1.
Each row corresponds to a single participant (fetus) and each column corresponds to a given index.
-
2.
The first column after the numerical id of the patient (second column in the sheet) contains the week of gestation at which the CTG was recorded.
-
3.
Each cell contains the average value of the index (column) for each patient (row) computed on the whole FHR tracing. The averaging procedure was performed since the total length of each recording contains both activity and quiet periods of the relevant fetus.
Missing values are coded as NaN.
2. Experimental Design, materials, and methods
The presented Supplementary Material provides a more in-depth description of the steps towards the estimation of parameters employed in this study starting from the FHR signal. The original raw FHR recordings were sampled at 2 Hz and they were measured in beats per minutes (bmp).
1.1. Time domain parameters
Each FHR value coming from Hewlett Packard CTG fetal monitors (series 1351A) was transformed in equivalent RR interval and expressed in milliseconds for the computation of time domain parameters.
Short Term Variability (STV) [ms]: it quantifies FHR variability on a short time scale. Considering an interbeat sequence of 1-minute duration, STV is defined as:
where is the average FHR computed by diving the FHR recording in nonoverlapping windows of 5 consecutive FHR values (2.5 s for a sampling frequency of 2 Hz). is computed in a window of 1-minute duration so that 24 values are obtained for each window. The corresponding STV estimate is obtained by averaging the differences between adjacent values, having accelerations and deceleration excluded [2].
Interval Index (II): it provides an estimated of short term variability scaled by STV, defined as:
where are the total number of FHR values recorded in 1 minute [2].
Delta [ms]: considering a window in time of 1-minute duration, Delta is defined as:
where which are the total number of FHR values recorded in 1 minute [2].
Long Term Irregularity (LTI) [ms]: is defined as the interquartile range of the distribution which is defined as:
where which are the total number of FHR values in a 3-minute window. LTI quantifies FHR variability on a longer time scale with respect to the previously reported time domain indices excluding accelerations and decelerations [2].
1.2. Frequency domain parameters
Each FHR value coming from Hewlett Packard CTG fetal monitors (series 1351A) was transformed in equivalent RR interval and expressed in milliseconds for the computation of frequency domain parameters.
PSD Power Spectral Density (PSD) is a widely employed tool for HRV frequency analysis as it can quantitatively measure the periodic oscillations related to neural control activity, namely autonomic nervous system (ANS) modulation over the cardiac system.
In the context of this analysis, the PSD estimation for FHR was performed based on autoregressive (AR) modelling (parametric spectral estimation). The AR model utilized to mimic FHR dynamic is defined as:
where (White Gaussian Noise), p is the model order (from 8 to 12), and are the model parameters. The modelled FHR windows () are of duration equal to 3 minutes (.
Model parameters are calculated recursively by means of the Levinson-Durbin algorithm. Once the proper model order is defined, so that the model parameters are determined, PSD is defined as:
where is the mean value of in seconds and is the z-transform of the transfer function of the AR process previously defined. Through this parametric approach, FHR signal undergoes an automatic decomposition into a sum of sinusoidal contributions, themselves identified by their corresponding central frequencies and the associated power [3].
In the context of frequency analysis applied to FHR it is possible to identify three specific frequency bands of interest, namely LF [0.03–0.15] Hz; MF [0.15–0.5] Hz; HF [0.5–1 Hz]. The corresponding associated power is computed by integrating the PSD power in these intervals, thus obtaining LF_pow [ms2/Hz]; MF_pow [ms2/Hz]; HF_pow [ms2/Hz]. Lastly, LF/(MF + HF) is obtained as: LF_pow/(MF_pow + HF_pow) [3].
1.3. Nonlinear domain parameters
Each FHR value coming from Hewlett Packard CTG fetal monitors (series 1351A) was transformed in equivalent RR interval and expressed in milliseconds for the computation of Approximate Entropy and Lempel and Ziv complexity.
Approximate Entropy (ApEn) [bits]: quantifies a signal regularity by assessing the occurrence rate of patterns by comparing the patters themselves to a reference one of length m. Pattern similarity is defined based on a threshold r [4].
Given a sequence of N data points , the algorithm creates sequences (based on window length ) and it computes for each the quantity expressed as:
Approximate Entropy (ApEn) is defined as:
where .
In the context of finite time series of length N as for FHR, ApEn can be written as:
In this work, ApEn was estimated by considering non-overlapping windows of duration equal to 3 minutes, with the following parameter setting: and named ApEn(1,0.1) [5,6].
Lempel and Ziv complexity (LZC) [bits]: the first step towards its formulation encompasses the definition of the quantity which measures the number of different sub strings and the rate of their recurrence in a given time series. According to the Information Theory, in turn it assesses the minimum quantity of information needed to define a binary string. LZC quantifies the rate of new patterns arising as signal evolves [7].
Suppose the number of symbols in the alphabet is and the length of sequence is equal . The upper bound for is given by:
where
and means the logarithm of to the base .
When is large enough , so as a result:
where is the asymptotic behavior of for a random string.
The normalized complexity is thus defined as:
In order to obtain LZC estimation for FHR time series, the latter requires to be transformed into a symbolic sequence according to a binary and/or a ternary coding procedure.
Binary coding: given a FHR series , the sequence is built by assigning 1 when the condition is verified for . On the opposite case of signal decrease, is assigned to 0 when the condition is met.
Ternary coding: given a FHR series , the sequence is built as in the binary coding case with the additional condition of signal invariance which is defined as and coded with the symbol 2 [7].
Additionally, in the context of recorded time series a factor is introduced to define the minimum quantization level for a symbol change in the coded string (e.g. if .
In this work, LZC was estimated by considering non-overlapping windows of duration equal to 3 minutes, with the following parameter setting: binary coding and and named LZC(2,0). The choice of reflects the current value for the quantization level, which is actually ± 0.5 bpm [8].
Each FHR value coming from Hewlett Packard CTG fetal monitors (series 1351A) was expressed in bpm for the computation of Acceleration Phase Rectified Slope and Deceleration Phase Rectified Slope in order to be concordant with the common definition of acceleration and deceleration in fetal heart rate monitoring.
Acceleration Phase Rectified Slope (APRS) [bpm]: the computation of Phase Rectified Signal Averaging (PRSA) curve (which APRS is extracted from) starts from considering a time series of length (as FHR in this work. The first step towards PRSA computation is the determination of the so-called anchor point (APs). In this context, APs are defined as the time series points fulfilling the following inequality:
where the parameter is employed as the upper frequency bound for the periodicities to be detected by PRSA methodology.
After APs being detected, windows of length are built symmetrically with respect to each AP. Given the fact that the majority of APs are temporally close one each other, the resulting windows are effectively overlapping. An additional specification for the parameter is that it should be larger than the period of slowest oscillation to be detected [9].
The PRSA curve is obtained by averaging the derived windows synchronized in their APs. After obtaining the PRSA curve, it is useful to summarize its characteristics by extracting different parameters. An example of such is APRS defined as:
The parameter APRS is a descriptor of the average increase in FHR amplitude and the time span of such increase event.
In this work, the considered signals is the whole available FHR recording, thus resulting in a single APRS value. The parameters and are equal 40 and 200 respectively [6,10].
Deceleration Phase Rectified Slope (DPRS) [bpm]: the computations are analogous of those previously reported for APRS apart for the definition of APs which are defined as the time series points fulfilling [9]:
Acknowledgments
The authors are grateful to Dr. M. Campanile and her team, Department of Obstetrical-Gynaecological and Urological Science and Reproductive Medicine, Federico II University, Naples, Italy for the collection of CTG tracings of healthy and IUGR fetuses. The research presented in this paper was supported by PRIN 2017 MIUR, ICT4MOMs [2017RR5EW3].
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.dib.2020.105164
Conflict of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Appendix A. Supplementary data
The following is the Supplementary data to this article:
References
- 1.Signorini M.G., Pini N., Malovini A., Bellazzi R., Magenes G. Integrating Machine Learning Techniques and Physiology Based Heart Rate Features for Antepartum Fetal Monitoring. Comput. Methods Progr. Biomed. 2019:105015. doi: 10.1016/j.cmpb.2019.105015. [DOI] [PubMed] [Google Scholar]
- 2.Arduini D., Rizzo G., Piana A., Bonalumi P., Brambilla P., Romanini C. Computerized analysis of fetal heart rate—Part I: description of the sys- tem (2CTG) J. Matern. Fetal Invest. 1993;3:159–164. [Google Scholar]
- 3.Signorini M.G., Magenes G., Cerutti S., Arduini D. Linear and nonlinear parameters for the analysis of fetal heart rate signal from cardiotocographic recordings. IEEE Trans. Biomed. Eng. 2003;50(3):365–374. doi: 10.1109/TBME.2003.808824. [DOI] [PubMed] [Google Scholar]
- 4.Pincus S. Approximate entropy (ApEn) as a complexity measure. Chaos. Mar. 1995;5(1):110–117. doi: 10.1063/1.166092. [DOI] [PubMed] [Google Scholar]
- 5.Pincus S.M., Viscarello R.R. Approximate entropy: a regularity measure for fetal heart rate analysis. Obstet. Gynecol. 1992;79(2):249–255. [PubMed] [Google Scholar]
- 6.Fanelli A., Magenes G., Campanile M., Signorini M.G. Quantitative assessment of fetal well-being through ctg recordings: a new parameter based on phase-rectified signal average. IEEE J. Biomed. Health Inf. 2013;17(5):959–966. doi: 10.1109/JBHI.2013.2268423. [DOI] [PubMed] [Google Scholar]
- 7.Lempel A., Ziv J. On the complexity of finite sequences. IEEE Trans. Inf. Theor. 1976;22(1):75–81. [Google Scholar]
- 8.Ferrario M., Signorini M.G., Magenes G. Complexity analysis of the fetal heart rate for the identification of pathology in fetuses. Comput. Cardiol. 2005;32:989–992. [Google Scholar]
- 9.Bauer A. Phase-rectified signal averaging detects quasi-periodicities in non-stationary data. Phys. Stat. Mech. Appl. 2006;364:423–434. [Google Scholar]
- 10.Signorini M.G., Fanelli A., Magenes G. Monitoring fetal heart rate during pregnancy: contributions from advanced signal processing and wearable technology. Comput. Math. Methods Med. 2014;2014 doi: 10.1155/2014/707581. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
