Skip to main content
NPJ Digital Medicine logoLink to NPJ Digital Medicine
. 2023 Oct 11;6:189. doi: 10.1038/s41746-023-00935-3

Prevalence and course of pregnancy symptoms using self-reported pregnancy app symptom tracker data

Michael Nissen 1,, Nuria Barrios Campo 1, Madeleine Flaucher 1, Katharina M Jaeger 1, Adriana Titzmann 2, Dominik Blunck 3, Peter A Fasching 2, Victoria Engelhardt 4, Bjoern M Eskofier 1,5, Heike Leutheuser 1
PMCID: PMC10567694  PMID: 37821584

Abstract

During pregnancy, almost all women experience pregnancy-related symptoms. The relationship between symptoms and their association with pregnancy outcomes is not well understood. Many pregnancy apps allow pregnant women to track their symptoms. To date, the resulting data are primarily used from a commercial rather than a scientific perspective. In this work, we aim to examine symptom occurrence, course, and their correlation throughout pregnancy. Self-reported app data of a pregnancy symptom tracker is used. In this context, we present methods to handle noisy real-world app data from commercial applications to understand the trajectory of user and patient-reported data. We report real-world evidence from patient-reported outcomes that exceeds previous works: 1,549,186 tracked symptoms from 183,732 users of a smartphone pregnancy app symptom tracker are analyzed. The majority of users track symptoms on a single day. These data are generalizable to those users who use the tracker for at least 5 months. Week-by-week symptom report data are presented for each symptom. There are few or conflicting reports in the literature on the course of diarrhea, fatigue, headache, heartburn, and sleep problems. A peak in fatigue in the first trimester, a peak in headache reports around gestation week 15, and a steady increase in the reports of sleeping difficulty throughout pregnancy are found. Our work highlights the potential of secondary use of industry data. It reveals and clarifies several previously unknown or disputed symptom trajectories and relationships. Collaboration between academia and industry can help generate new scientific knowledge.

Subject terms: Reproductive signs and symptoms, Pathogenesis, Medical research, Computational biology and bioinformatics

Introduction

Throughout pregnancy, almost all women experience pregnancy-related symptoms1,2. Prior work identified up to 41 different symptoms3,4. Reported incidences in literature highly differ. Common symptoms include fatigue (up to 98%)36, nausea (up to 88%)14,69, poor sleep (up to 74%)3,4,6,8, back pain (up to 60%)1,3,4,7, and vomiting (up to 57%)1,2,4,69.

Pregnancy symptoms can cause discomfort for women, decrease the quality of life1012, and lead to higher socioeconomic costs due to prolonged sick leave13,14. Furthermore, several pregnancy symptoms are associated with adverse outcomes for mothers and children: The risk of preterm delivery, lower birth weight, consequences for infant development, neonatal wellbeing, and obstetric complications are significantly linked to depression, anxiety, and maternal stress1518. Furthermore, significant associations exist between individual symptoms, such as depression, anxiety, increased nausea, and vomiting13,19. Vaginal bleeding between 6 and 8 weeks’ gestation can result in an increased risk of pregnancy loss20. Severe nausea and vomiting in pregnancy are associated with higher hospitalizations and negative outcomes21,22. Insomnia and sleep disturbances are linked to increased rates of cesarean delivery, preterm birth, and postpartum depression2328.

The overall relationship between pregnancy symptoms as well as their relation to clinical outcomes is not well understood19,23,29,30. Previous work explicitly calls for big data sources with detailed collected information on multiple signs and symptoms, data from pregnancy mobile health apps, as well as more accurate reporting29.

The integration of digital health, particularly mobile health apps, into clinical trials and research has several benefits: It can facilitate recruitment, reach broader populations, and provide real-world data31. Previous studies on pregnancy symptoms have been mostly collected retrospectively using questionnaires during hospital visits29. Retrospective symptom assessment is often biased compared to momentary assessment32.

The number of pregnancy apps exceeds the market for all other medical topics33. In previous pregnancy app reviews, all of the assessed applications were of commercial nature34, and the minority of apps mentions an involvement of medical experts35. Many applications feature a large user base and offer tools such as symptom trackers. These may still be helpful to generate new scientific insights, such as in the context of pregnancy symptoms and outcomes. Meanwhile, real-world data is often messy, heterogeneous, and requires new analysis techniques36.

This work analyzes a data set originating from the symptom tracker of the Keleya pregnancy app (Keleya, Keleya Digital-Health Solutions GmbH, Berlin, Germany). This app is designed to meet the needs of pregnant women in German-speaking countries. It aims to support women on their pregnancy and motherhood journey. Keleya promotes a collaboration with medical experts and cooperates with several health insurances. Features included are information about the current week of pregnancy, recipes, workouts, meditation, breastfeeding advice, and a symptom tracker.

We report the results of a retrospective analysis of 1,549,186 symptom reports, from 183,732 users and subsets thereof. To our knowledge, this is the most comprehensive work on pregnancy symptoms to date and exceeds previous works.

This work has two main goals: First, to present methods to handle noisy, real-world app data from commercial health applications to understand the prevalence and trajectory of user and patient-reported data. Second, to examine symptom report percentages, their changes over time, and their correlation at a detailed level. We then discuss collaboration between industry and academia as a tool to generate real-world evidence, as well as the resulting implications.

Results

User definition and subpopulation selection

In traditional clinical studies, participants are often actively approached and recruited by physicians. In contrast, pregnancy app users often choose to use pregnancy apps at their own choice, often without intervention or impulse by their care practitioners. Similarly, there is no structured follow-up. Use, and consecutively generation of research data, is entirely dependent on users’ intrinsic motivation to use apps and can vary widely.

Therefore, defining an appropriate subpopulation for consecutive analysis is a major challenge. In the analyzed data set, many users only used the symptom tracker feature once, and it is unclear whether the reported data are accurate in this case.

The application of a fixed threshold for the definition of “active users” was not useful. We found that the selection of subpopulations should be guided by three pillars: The analysis goals, time frame of interest, and data distribution/properties. In this work, the analysis goals are an estimation of symptom prevalence, symptom changes over time, and symptom correlation. The time frame of interest is the whole pregnancy duration and the postpartum period. The data characteristics are described in the next subchapter. The choice of the most appropriate user definition for analysis can only be made after an assessment of the characteristics of the subpopulation. For this work, we ideated 14 different definitions for subgroups criteria for data selection. These are shown in Table 1.

Table 1.

Symptom tracker data set characteristics (unique users, total tracked symptoms, mean and standard deviation of reported symptoms per user, mean and standard definition of usage time in days) for different data selection (inclusion/exclusion) criteria.

Data Set Description Users Symptoms Symptoms/User Usage time
A: All data 183,732 1,549,186 8.44 ± 16.68 17.18 ± 68.26
B: Only symptoms from first tracker use 183,732 606,705 3.30 ± 2.61 0.00 ± 0.00
C: Only symptoms from first day of tracker use 183,732 960,653 5.23 ± 3.89 0.00 ± 0.03
D: Only users that tracked minimum 2 symptoms 168,225 1,533,679 9.12 ± 17.27 18.76 ± 71.13
E: Only users tracking on a single day 145,336 763,469 5.25 ± 3.83 0.00 ± 0.02
F: Excl. symptoms from first tracker use 99,838 942,481 9.45 ± 21.61 23.64 ± 78.75
G: Only users tracking symptoms on min. 2 diff. days 38,396 785,717 20.48 ± 33.06 82.18 ± 130.21
H: Excl. symptoms from first day of tracker use 38,396 588,533 15.34 ± 32.32 48.06 ± 110.76
I: Excl. symptoms tracked in first 3 days after registration 38,375 564,613 14.72 ± 31.90 45.58 ± 109.57
J: Excl. symptoms tracked in first week after registration 35,119 525,522 14.97 ± 32.42 46.89 ± 111.96
K: Only users tracking symptoms in min. 2 diff. trimesters 18,596 527,168 28.37 ± 44.09 138.27 ± 150.45
L: Only users tracking symptoms in min. 4 diff. weeks 10,182 457,284 44.95 ± 55.99 157.53 ± 171.07
M: Only users tracking min. 1 symptom in all trimesters 3735 210,729 56.48 ± 74.66 241.79 ± 198.81
N: Only users tracking symptoms in min. 5 diff. months 2919 226,351 77.62 ± 85.14 246.16 ± 212.99

The respective data selection criteria and consecutive data sets are denoted by the corresponding letters in the remainder of this work. Symptoms/user is reported as mean ± standard deviation. Usage time is defined as difference between first and last symptom tracker use. Usage time is reported per user in days, mean ± standard deviation.

Usage and tracked symptoms

The number of unique users, total number of reported symptoms, the mean and standard deviation of reported symptoms per user, and the mean and standard definition of usage time in days are also reported in Table 1.

After preprocessing, the data set contained symptom reports from 183,732 users with a total of 1,549,186 tracked symptoms, tracked from May 31, 2018 to December 19, 2022. The mean duration between the first and last symptom report (usage time) was 17.18 days. When only users that tracked more than one symptom are used as criteria, the data set characteristics are similar.

The use of stricter criteria for selecting the data set results in a lower number of included users and total tracked symptoms, while the mean value of reported symptoms per user and usage time increases.

Figure 1 shows the weekly total number of reported symptoms based on different data set definitions. Reported symptoms increase sharply around 3–4 weeks of gestation. Peaks in weekly reported symptoms are observed around gestation weeks 4–5, 11, and 28 when weaker data selection criteria are considered.

Fig. 1. Total number of symptoms recorded each week, for selected data set selection criteria.

Fig. 1

Some data set selection criteria have been omitted for clarity. See Table 1 for details on the respective data selection criteria. The highest number of symptoms is recorded around gestation weeks 4–5.

User demographics

Due to the anonymized nature of the data set, only limited demographic information is available. The app’s developer, Keleya, applies the principle of data minimization, which further reduces the available information. 83% of the users use the German version of the application, 17% of users use the English version. The users are equally divided (50%) between iOS and Android. Based on a previous user survey (n = 157) by Keleya, 70% (n = 110) of their users are in their first pregnancy. Most users are 31–39 years old (64.3%, n = 101), 23.6% are 26–30 years old (n = 37), 7% are 40 or older (n = 11), and 5.1 % are 18–25 years old (n = 8).

Symptom occurrence and frequency

Individual symptom occurrences for each data selection criteria are listed in Table 2. Symptom percentages differ greatly for different data selection criteria. Stricter data selection criteria result in higher percentages overall. The data selection criterion “all users” (selection criterion A) shows fatigue (48.7%), back pain (44.2%), constipation (32.2%), sleeping difficulty (30.1%), and breathlessness (29.0%) as most prevalent symptoms. This order is also largely consistent across other data selection criteria, although breathlessness (81.0%) is more common than sleeping difficulty (79.4%) among “users that tracked symptoms in at least 5 different months” (data selection criterion N). Detailed statistics such as total symptom counts, their occurrence, and frequency for this criterion (selection criterion N) are included in the Supplementary Table 1.

Table 2.

Symptom occurrence by data selection criteria, across the whole pregnancy.

Symptom A B C D E F G J K L M N
Back Pain 44.2 23.4 38.6 48.2 39.1 44.5 63.3 53.3 73.3 82.8 86.4 92.6
Bladder Weakness 13.0 5.7 10.2 14.1 10.3 14.7 22.9 20.7 30.6 38.9 47.1 53.1
Breathlessness 29.0 13.3 24.1 31.6 23.8 32.2 48.5 40.0 58.0 68.9 74.8 81.0
Constipation 32.2 18.6 27.8 35.2 28.0 29.4 48.2 36.0 55.8 65.6 69.8 76.2
Diarrhea 12.8 6.2 9.8 13.9 9.9 13.1 23.5 17.0 27.2 37.5 41.2 49.0
Fatigue 48.7 36.1 43.0 51.5 42.5 33.0 72.0 52.7 75.8 86.0 90.0 92.9
Flatulence 22.6 9.5 19.4 24.6 19.2 26.6 35.1 22.8 37.5 45.0 48.5 49.7
Foot Pain 6.6 2.0 5.1 7.3 5.2 8.9 11.9 9.5 15.4 19.9 24.3 26.4
Headache 25.1 11.9 20.6 27.4 20.8 26.6 41.5 30.3 48.4 61.1 64.2 72.7
Heartburn 23.4 12.4 19.4 25.5 19.7 23.1 37.3 31.4 46.5 55.3 62.8 69.0
Incontinence 16.2 5.8 13.5 17.6 13.9 20.1 24.5 18.0 29.3 36.1 39.7 45.6
Mood: Happy 26.9 17.1 22.3 27.6 22.3 21.9 44.4 31.8 50.0 63.2 65.4 74.0
Mood: Normal 45.2 34.3 39.2 44.8 40.4 27.1 63.3 45.2 69.4 81.4 81.3 89.0
Mood: Scared 13.3 7.2 10.6 14.4 10.6 12.5 23.4 15.7 26.3 36.1 38.6 46.1
Mood: Stressed 15.2 8.1 11.9 16.5 12.1 14.4 26.7 19.2 32.1 42.9 47.1 57.3
Mood: Swings 25.2 15.7 20.7 27.1 21.1 20.4 40.5 29.4 46.4 58.7 61.3 70.3
Nausea 26.1 13.2 21.9 28.4 21.8 26.9 42.1 29.1 45.5 55.2 63.6 65.9
Neck Pain 25.6 12.5 21.5 27.9 21.4 27.2 41.5 31.1 49.0 60.6 64.7 72.9
Nutrition Deficiencies 44.9 28.6 40.5 48.9 41.7 41.1 57.0 43.8 64.9 71.8 76.4 82.0
Pelvic Pain 20.3 7.9 16.1 22.1 16.1 24.6 35.9 29.3 43.9 54.9 61.2 69.0
Sleeping Difficulty 30.1 13.4 25.1 32.9 25.1 34.0 49.1 39.8 56.5 68.5 72.0 79.4

A–N represent different data selection criteria, see Table 1. A represents the data selection criterion with the highest user count, but lowest mean reported symptoms per user. N is the data selection criteria with the lowest user count, but most active users. Symptom occurrences for data selection criterion N (in bold) are most meaningful and should be used for comparison with other works. Occurrences show high differences based on the data selection criteria applied. Flatulence and foot pain symptoms were added during the investigated period. Occurrences for flatulence and foot pain are unreliable and only indicate a lower bound.

Symptom changes over time

Figure 2 shows the week-by-week occurrence for each symptom over the duration of pregnancy. Each symptom shows a unique time pattern.

Fig. 2. Symptoms progress by gestation week for 15 different pregnancy symptoms.

Fig. 2

A-O) Week-by-week symptom progress. Each symptom shows clear longitudinal characteristics. Colors depict different data selection criteria, of which some have been omitted for clarity (see Table 1). The offset areas behind each curve show the 95% confidence intervals based on the Agresti-Coull interval (p = 0.05). Asterisks indicate symptoms that were added during the analysis period and for which symptom progression is reliable, but occurrence is not. Two asterisks indicate the data selection criteria used to analyze difference significance and effect size.

Constipation, fatigue, headache, nausea, and neck pain show clear and distinct peaks around gestation week 7–16. Constipation and neck pain decrease steadily thereafter, while the occurrence of fatigue, headache and nausea increases slightly again toward the end of pregnancy, although much less drastically than before. Back pain has a flat maximum toward the end of pregnancy (around gestation week 24–37). Flatulence is most common at the beginning of pregnancy and decreases steadily toward the end, with a slight spike around gestation weeks 26–28. Diarrhea tends to occur at the beginning and end of pregnancy and has a clear minimum around gestation week 20. Breathlessness, foot pain, heartburn, incontinence, sleeping difficulty, pelvic pain, and water retention increase steadily toward the end of pregnancy, reaching a peak around gestation week 37 and 38. Back pain, breathlessness, constipation, heartburn, pelvic pain show a small local minimum in weekly occurrence around gestation week 27 and 28. Flatulence, headache, and nausea show a small local maximum during these weeks.

The weekly occurrences differ depending on the chosen data selection criterion. The occurrences of “only data from the first tracker usage” (selection criterion B) are always lower than those of the other criteria. The occurrences derived from the other data selection criteria are similar in many cases.

The offset areas behind each curve in Fig. 2 show the 95% confidence intervals based on the Agresti-Coull interval. Furthermore, a χ2 test was used to test for significant differences between weekly occurrences for all 15 symptoms and 41 weeks. Examined data sets were those of users that used the tracker on one day only (selection criterion E) and those that used the tracker for at least 2 days (selection criterion G). These two data selection criteria are mutually exclusive. No significant difference between the groups was found in 337 (55%) of symptom-week-pairs. An effect size analysis was conducted for all symptom-week-pairs, using the ϕ coefficient, odds ratios, and risk ratios. The mean symptom-week effect sizes between the two groups were 0.03 ± 0.02, 1.07 ± 0.2, and 1.05 ± 0.16 (mean ± standard deviation), respectively.

Symptom correlation

A correlation analysis was conducted using the Pearson correlation coefficient (PCC). The data set derived from the most active users (selection criterion N) were used. Correlations between each symptom for four time periods (each trimester and the Puerperium) were analyzed. The 10 largest absolute correlation coefficients are shown in Table 3. No negative values are included, as they were not among the largest absolute values. Spearman correlation coefficients (SCC) are reported in Supplementary Table 2. The full correlation matrices depicted as heat maps are available in Supplementary Fig. 1 (PCC) and Supplementary Fig. 2 (SCC).

Table 3.

Ten largest Pearson correlation coefficients (PCC).

Time Frame: Symptom Time Frame: Symptom PCC
First: Fatigue First: Nausea 0.54
Postpartum: Back Pain Postpartum: Neck Pain 0.53
Postpartum: Constipation Postpartum: Nutrition Deficiencies 0.53
Postpartum: Bladder Weakness Postpartum: Mood: Stressed 0.50
Postpartum: Incontinence Postpartum: Neck Pain 0.46
Postpartum: Bladder Weakness Postpartum: Breathlessnes 0.45
Postpartum: Bladder Weakness Postpartum: Sleeping Difficulty 0.43
Postpartum: Fatigue Postpartum: Pelvic Pain 0.42
Second: Mood: Happy Third: Mood: Happy 0.41
Postpartum: Diarrhea Postpartum: Flatulence 0.41

Each symptom in each time period (trimesters, Puerperium) was correlated, using data from the most active users (selection criterion N). No negative values are included, as they were not among the largest absolute values.

Discussion

We reported findings on the use of a pregnancy app symptom tracker, the symptom occurrences, symptom progress, as well as methods to analyze pregnancy symptom tracker reports. This section is structured as follows: First, the progress of symptoms is discussed in relation to previous work. Then, we discuss symptom tracker engagement, activity, and use. We discuss the methodology and its suitability for this work. We suggest additional reasons for the wide variation in reporting symptom prevalence in the literature. The validity of the symptom tracker used and additional limitations are discussed. Finally, we provide general implications and recommendations for industry and academia.

Symptom tracker reports are unevenly spread across pregnancy

The total count of reported symptoms for each week (Fig. 1) for the full data set (i.e., without data selection criteria applied) shows three distinct spikes around gestational weeks 6, 10, and 28. Thus, symptoms are not reported uniformly, and the chosen methodology must adjust for this. The first spike coincides with the mean gestational age of pregnancy awareness (5.5 weeks) of US women37. This awareness of pregnancy is likely the trigger for downloading and trying pregnancy apps. The spike in week 10 could be related to pregnancy confirmation by a medical provider. Week-by-week miscarriage rates see a sharp reduction after 12 weeks of gestation38. Women and parents often decide to tell friends following week 1239, which may result in recommendations to use apps by friends. The third peak may relate to the beginning of the third trimester, specific pregnancy examinations, increased preparation for birth and the puerperium, or the passing of the periviable period40. No push notifications were sent by the application to promote the use of the pregnancy tracker or the application.

Most users use the symptom tracker more than once, but only on one day

The characteristics of the data sets in Table 1 show that most users report more than one symptom, as the characteristics of the data sets “all users” (selection criterion A, 183,732 users, 1,549,186 symptoms) and “tracked at least two symptoms” (selection criterion D, 168,225 users, 1,533,679 symptoms) are similar. User and symptom counts decrease dramatically when only data from those users who used the tracker on at least two different days are considered (selection criterion G, 38,396 users, 785,717 symptoms). Consequently, only 20.9% of users open the symptom tracker on two different days. If the first day of use is excluded for all users (selection criterion H), the size of the data set decreases to 38.0% of the original data set size (588,533 of 1,549,186 symptoms). If symptoms from the first tracker use are excluded (selection criterion F), 99,838 users with 942,481 tracked symptoms remain. Thus, only 54.3% of users are using the symptom tracker at least twice.

Possible explanations are that most users try the symptom tracker but are not interested in using it for a longer period of time or that the tracker does not appeal to them.

Single tracker use could lead to invalidate data

When the tracker is used immediately after registration, it is also unclear whether users are actually suffering from a symptom or arbitrarily record symptoms to simply test the system. This raises questions about the validity of the overall data. This was the reason for evaluating different data set selection criteria. It is also important to note that this phenomenon contributes to data sparsity, which must be taken into account in the subsequent evaluation.

The differences between single-day and very active users are negligible

To investigate the aforementioned validity, we plotted the weekly symptom occurrence for different data selection criteria in Fig. 2. Data from the first tracker use (selection criterion B) consistently show a lower occurrence than all other data selection criteria. The occurrences derived from the remaining data selection criteria have more similar characteristics. The overall trend in the data from the first tracker use by the user (selection criterion B) is in line with the data from other data selection criteria.

Statistical differences were evaluated between users that tracked on one day only (selection criterion E) and users that tracked on two or more days (selection criterion G). These groups are mutually exclusive. 45% of the symptom-week-pairs showed a statistical difference. This can be explained by the large sample sizes. Literature interprets the observed effect sizes (ϕ: 0.03 ± 0.02, odds ratio: 1.07 ± 0.2, risk ratio: 1.05 ± 0.16, mean ± standard deviation) as small41,42. Thus, although statistical differences exist, we see the differences between the two groups as negligible.

Users initially test the system with a few correct symptoms, but report additional symptoms in direct succession

The reported occurrence of symptoms is too low when only data from the first tracker use are considered (selection criterion B). A low number of reported symptoms on the first tracker use could explain this. However, a user’s first symptom report is the one with the highest mean count of reported symptoms. Instead, we suspect that many users make another symptom report in direct succession to their first. This is based on the observation of 606,705 reported symptoms on first tracker use (selection criterion B), but 960,653 reported symptoms on the first day of usage (selection criterion C). Similarly, the usage time, i.e., the difference between the first and last use of the symptom tracker in the respective data set, is short (0.00 ± 0.03 days, mean ± standard deviation, Table 1, selection criterion C).

Sparse data makes occurrence estimation difficult and unreliable for large time periods

Table 2 shows large differences between the symptom occurrences of the different data selection criteria. Occurrences increase with the strictness of the selection criteria. They are high for the data set with the most active users and low when all users are included.

This is due to sparse data and successive underreporting of symptoms among users with a small number of reports. For example, suppose a user reports symptoms only at week 8 of pregnancy. If they did not experience back pain in week 8, but instead in week 26, this symptom is not captured. The mean occurrence across the whole pregnancy duration is too low.

In summary, with increasing sparsity of data and longer evaluated time periods, the occurrence of symptoms becomes increasingly underreported. This implies an analysis at smaller time intervals.

Weekly symptom occurrence estimation is an adequate method for symptom progress analysis, and the only means available for analysis of sparse data

Note that the expressiveness differs, i.e., a weekly occurrence of water retention in pregnancy week 38 of ~37% is only a lower bound for overall occurrence across pregnancy: Those patients that did not experience this symptom in week 38 may have experienced it in week 39, so the overall occurrence can be higher.

The appropriateness of the approach we chose (time-period-based occurrence estimation at the gestation week level) is emphasized by the stability and similarity of the symptom occurrence between gestation weeks. If the approach were not adequate, the curves in Fig. 2 would show more noise and differences between successive gestation weeks. Instead, overall trends are clearly visible. Larger differences between consecutive weeks occur in the first 4 weeks of pregnancy, especially after applying data selection criteria with a small number of users (e.g., selection criterion N). This is due to the low number of reporting users in the first weeks of pregnancy and lower explanatory power.

Time frame and time of reporting lead to incomparability of symptom prevalences

Table 2 shows symptom prevalences for different data selection criteria. As discussed before, sparse data are inappropriate for prevalence estimation over the entire gestational period. Data from users active over at least 5 months (selection criterion N) is denser and the best available choice for comparison with the literature.

Previous works found vast differences in symptom prevalences across different literature sources4347. Several reasons have been suggested as causes: The use of different study settings (e.g., retrospective, prospective studies), symptom criteria (e.g., self-reported, based on clinical history, based on clinical examination), insufficient differentiation between diseases (e.g., between lower back pain and pelvic girdle pain), and choice of measurement tool (e.g., validity of questionnaires)45,48.

Absolute symptom prevalences are not comparable across different time frames or time points: In our work, the highest reported weekly occurrence of back pain is 54% at 28 weeks’ gestation. At the same time, the occurrence throughout pregnancy is 93%. One study asked for symptoms in the past 3 months at three different times in pregnancy, resulting in a prevalence of 69% at 28 weeks and 68% at 36 weeks7. In another prospective study, women were interviewed at the end of each trimester of pregnancy and 6 weeks postpartum whether they could recall any symptoms in the preceding trimester. This resulted in an overall prevalence of 48%, with 63% in the second trimester, 76% in the third, and 39% in the puerperium3. Other work asked about the frequency of symptoms “in the last month”, resulting in a total prevalence of 60%. However, the questionnaire was not issued at a set point in time; rather, some participants were in their second, others in their third trimester4. This is aside from any additional effects that may be caused by retrospective analysis of questionnaires.

Furthermore, symptom prevalence is not necessarily of interest, but may be rather misleading. The high occurrence of back pain in our data only indicates that almost all women suffer from back pain, but does not provide information about severity or frequency. Larger time spans increase the likelihood that prevalences are elevated by chance. Instead, symptom frequency, time course, and severity may be of greater interest.

More research is needed to establish symptom occurrence as a surrogate parameter for symptom prevalence

As presented before, the similarity of neighboring weekly occurrence in Fig. 2 can be used as an argument for data validity. However, as outlined in the previous paragraph, our occurrences cannot be compared with findings from literature due to different time frames and reporting times. However, such a comparison is necessary to evaluate the validity of the symptom occurrence as a surrogate parameter for symptom prevalence. In order to perform such a validation, close attention should be paid to the data density in the respective time frames. For example, if data are compared on a week-by-week basis (as in Fig. 2), sparse data sets can be used, as long as the subsets used for weekly analysis are sufficiently dense. If validity is evaluated at larger time frames (e.g., throughout the whole pregnancy, as in Table 2), data must be sufficiently dense throughout the whole duration of pregnancy.

Symptom course in literature

Our results in Fig. 2 suggest that the occurrence of back pain is more continuous, with week-by-week occurrence peaking at about 24–26 weeks’ gestation. Pelvic pain peaks at gestational week 38. A review by Casgrande et al. reports that low back pain begins in the second trimester, averaging around 22 weeks’ gestation, and affects between 20% and 90% of women, whereas most studies report a prevalence above 50%44. The work by Vermani et al. described that low back and pelvic girdle pain begins around 18 weeks of gestation and peaks between 24 and 36 weeks of gestation43. Both papers described that pelvic girdle pain peaks between 24 and 36 weeks of gestation.

Incontinence was found less prevalent in early than in late pregnancy, which is in line with our results47.

Our data suggests a rapid increase of breathlessness in the first trimester to about 30% weekly occurrence. The highest weekly occurrence was found around 32 weeks of gestation, with 40% weekly occurrence. Other works have distinguished between breathlessness at rest and during exercise3,5. One review found activity-related breathlessness to be prevalent in 75% of pregnant women by ~30 weeks gestation, and stated that severity widely varies49. Severity or quality of breathlessness were not captured by the symptom tracker.

We could not find comprehensive reviews of the prevalence and progress of constipation or diarrhea. Our data show that occurrence is highest around 9–12 weeks of gestation and declines steadily thereafter. A recent work describes constipation to be higher in the first and second trimester, than in the third5052. Sex hormones have been suggested as a major influence53.

The longitudinal changes of diarrhea presented in our data shows an increase in occurrence around gestation week 6 and toward the end of pregnancy. Diarrhea has been described as a precursor to labor, but also as a potential symptom of more serious causes51. Earlier work suggested that there is no convincing evidence that the pathogenesis of diarrhea in pregnant women differs from that of the non-pregnant population and that pregnancy promotes constipation rather than diarrhea54. More recent work suggests pregnancy-related factors, such as hormonal changes, dietary changes, or preexisting conditions55. Our work supports the latter.

Our results clearly show three phases of fatigue: An early peak of high weekly occurrence around gestation weeks 7 and 8, a trough around gestation week 21, and slightly elevated levels toward the end of the pregnancy. No clear picture of the progress of fatigue across pregnancy exists: Previous work found no significant differences in fatigue severity between pregnancy trimesters56. Other work found statistically significant higher fatigue levels in the first and third trimesters, compared to the second trimester57. This also applies to prevalence3,58. Other work found an increase from early to late pregnancy59,60, while others found the opposite61. Prior works may not have used sufficiently fine-grained time periods for analysis and thus were unable to capture these longitudinal changes.

We found a steadily increasing weekly occurrence of foot pain, peaking just before delivery. This is consistent with reports from the literature and can be explained by weight gain, shift of center of mass, and changes in biomechanics62,63.

The course of headache shows a distinct spike in headache occurrence at gestation week 16, and a small increase in headache occurrence shortly before birth. Migraine and tension-type headaches are the most common primary headache types during pregnancy. A systematic review reports that most women who suffer from migraine experience marked improvement during pregnancy, with a significant reduction in frequency and intensity, if not complete resolution6467. One study only found a reduction in the third semester68. Another study described an increase in headache burden four weeks before delivery in multiparous women69. The onset of migraine during pregnancy is relatively uncommon and poorly understood, with incidences between 1.8% and 18%65. Overall, we did not find data on the longitudinal progress of weekly headache prevalence across pregnancy previously reported in literature. Our data set does not distinguish between different headache types and does not include information on migraine history. The small increase in headache occurrence shortly before birth may support prior work69.

For heartburn, our data indicate a continuous increase in weekly occurrence, peaking between gestational weeks 32–36. Some studies have found that heartburn increases throughout pregnancy, whereas other studies suggest the opposite70.

Nausea peaks with around 60% weekly occurrence at approximately gestation week 8 in our data. Previous work has found that nausea is prevalent in more than 80% of women, peaks around gestational weeks 4–12, and thereafter 2.5–10% of women experience it7174. This is clearly in line with our results.

Our work clearly outlines increased sleeping difficulties in the third trimester. A previous scoping review on sleep health in pregnancy found that sleep in each trimester is highly variable between women, but did not support the notion of large changes in sleep during pregnancy in healthy women75. In addition, it was found that there is a lack of data on perceived sleep satisfaction/quality.

Symptom Correlation

We analyzed correlations between symptoms in the different trimesters as well as the Puerperium. The largest correlation (PCC: 0.54) was found between Fatigue and Nausea in the first trimester. This correlation has been reported by several prior works76,77.

Several strong correlations exist in the Puerperium. The involved symptoms (back pain, neck pain, incontinence, breathlessness, sleeping difficulty) may be linked to the physiological changes after birth. The correlation between headache and neck pain also exist in other trimesters (PCC: 0.32–0.43). Fatigue and sleeping difficulties are moderately associated in all trimesters (PCC: 0.38–0.41). Furthermore, headache and nausea were correlated in the first trimester (PCC: 0.4).

We can confirm previous reports on an association between back pain and girdle pain using the PCC in the first trimester (Theirs: 0.33, ours: 0.25)1.

Symptom Tracker Validity

This study reports real-world data, generated through a symptom tracker used as patient-reported outcome measure. We found no previous works that investigated the accuracy and validity of symptom trackers such as the one used to collect the data analyzed in this work. Previous work has mostly focused on multi-page symptom checkers or questionnaires7880. Pregnancy apps often use more simplified, iconified, single-page symptom trackers. Figure 3 shows an image of the symptom tracker used for this work.

Fig. 3. Screenshot of the symptom tracker in the Keleya pregnancy app.

Fig. 3

Some symptoms require a “left swipe” to be accessible.

For symptoms for which there is already extensive work (e.g., nausea), our results are similar to findings in the literature, which gives hope that these trackers can provide some level of validity. Due to a lack of ground truth, more detailed conclusions cannot be made.

Some users started tracking symptoms early. Figure 1 shows ~1000–3000 symptoms reported weekly at gestation weeks 1–3. These data may be incorrect. One possible explanation is incorrect data entry (e.g., entering incorrect ovulation data used to calculate the week of pregnancy). However, the data may also be of natural and correct origin. For example, these data may originate from women using reproductive technologies or those tracking pregnancy signs using ovulation predictor kits or home pregnancy tests81. In 2020, 12,503 children were born in Germany (total births: 773,144, 1.6%) using assisted reproductive technologies82,83. We, therefore, decided to include these data.

Visible spikes in Fig. 2, especially in the “users that tracked symptoms in at least 5 different months” group (selection criterion N), can be attributed to a small number of users. Details on the time-period-based occurrence estimation are provided in the Methods section.

Limitations

Our work and its data set are subject to several limitations. Most arise from the data set, which was not primarily intended to generate new scientific insights. The main limitation is that the data set does not contain information about clinical pregnancy outcomes. This information is not queried from users. The symptom tracker only provides data on occurrence, but not on the severity or quality of the reported symptoms. In addition, symptoms can have different dimensions, such as fatigue, which can refer to different body parts84. Furthermore, the symptom tracker does not cover all symptoms suggested by previous work4.

To take full advantage of the data set size, most of our analysis was conducted cross-sectionally for all users (selection criterion A). A first longitudinal analysis was performed in the form of a correlation analysis on the data of those users who were active for at least 5 months (selection criterion N). An initial application of unsupervised machine learning techniques for clustering did not yield meaningful results. We will use more advanced methods in the future.

The order of symptoms on the symptom tracker screen is not randomized, which may cause unwanted side effects. Furthermore, some symptoms are only accessed after a “right swipe” on the symptom tracker screen. It is unclear whether this affects the reporting frequency of those symptoms. No information is available on the impact of usability on adequate symptom tracking, as well as symptom tracker validity.

Foot pain and flatulence were added to the symptom tracker during the investigated period and were not available prior to these dates. While the overall symptom progress can still be analyzed, the occurrence estimation is incorrect, and we have marked these results as unreliable. Similarly, the application underwent continuous development during the investigated period. This may have had an impact on the usage of the pregnancy tracker.

Lastly, the data collected in this work were generated by a German pregnancy app, and thus the data were generated mainly by women in Germany. Keleya operates on a freemium model. The app is reimbursed by some health insurances in German. Nonetheless, the need to pay for some features may impact the user base. The demographics of app users may therefore not be generalizable to society as a whole. This may promote health inequity if derived research findings are used without considering this aspect.

Implications

In summary, this work is subject to limitations, which originate in the secondary use of tracker data from a commercial pregnancy app. On the other hand, these real-life data offer an unprecedented data quantity, that exceeds previous work by orders of magnitude. It is the first study to extensively evaluate symptom progress in a longitudinal fashion. It is not subject to several reporting shortcomings of prior work: We are able to include early pregnancy weeks and no retrospective data collection bias is present29.

This allows our work to contribute in several ways: We presented new insights on symptom progress in several cases, inadequate reporting of time span and time of reporting as additional explanations for prevalence differences in literature, and lastly an overall approach for analysis of sparse symptom reporting data.

Close cooperation between academia and industry is desirable to maximize benefits for all parties. For best cooperation, it is important to understand the needs, requirements, and goals of all stakeholders. These include pregnant women (in some cases patients) as users, their relatives/friends, application manufacturers and their developers, as well as researchers.

Consistent with user-centered design, user aims, requirements, motivation, and benefit should always come first. However, the use of pregnancy apps has implications for users. It may carry risks and entails privacy considerations, such as the use and potential misuse of data and the corresponding privacy implications. Data minimization as principle may be able to reduce these risks. On the other hand, users and pregnant women in general can benefit from better care and new scientific insights. Mobile health has the potential to improve care. Mobile health has a particularly high potential to improve care in developing countries85. Users are able to learn from their own data, feel and are more empowered, increase their health literacy86. It is important to keep equity and diversity in mind, as outlined in the limitations of this work, in order to make these benefits available to everyone. The use of frameworks and guiding principles can help to increase health equity87,88.

Researchers need to understand that application developers are subject to commercial constraints. Likewise, they need to understand the development process of applications. Although the primary aim for both parties is to maximize the benefit for the user, this goal may be interpreted differently by each side. Developers may aim to maximize user needs, requirements, and demands. Scientific validity of the data generated may only be a secondary goal. Unlike in most medical studies, measurement devices, and thus the data collected, can change as a result of continuous development. In addition, researchers must accept that measurement instruments (such as the symptom tracker used in this work) may offer lower validity or detail compared to traditional instruments such as comprehensive, validated questionnaires. Users may not be motivated to complete extensive questionnaires.

Application developers can integrate basic features to support research. Sociodemographic data, even if only in a basic form, offer new insights into disease trajectories. Information on basic pregnancy outcomes would be helpful, such as date of birth, birth weight, and mode of delivery. A separate section for citizen science or data donation could be an adequate form for requesting these data.

Developers should ensure that they at least capture any symptoms known to be associated with pregnancy outcomes. This includes nausea, insomnia, vaginal bleeding, vomiting, and headache. On the other hand, they should avoid incorporating features for which no clear evidence exists, such as kick or fetal movement counting89. Overall, industry benefits from a large user base, and academia subsequently from good data availability.

This combination of factors (user demographics, reported symptoms, outcomes) may enable new research in the future and create new large data sets for prospective analysis by machine learning.

Some user-tailored recommendations based on symptom tracker use can already be implemented: Reported vaginal bleeding between weeks 6 and 8 could prompt a suggested physician visit20. Reporting headache without prior history could lead to additional prompts, for example, based on “red flags” for headache in pregnancy64,90. Severe outcomes aside, symptom-based responses can also be used to inform pregnant women about treatment options for conditions for which women are reluctant to seek help, such as stress incontinence47,91.

It is important to note that functionalities must be integrated with care and close assessment and evaluation (as in initial clinical trials) is necessary to avoid adverse effects, unnecessary panic, or anxiety92.

Overall, real-life symptom tracker data provide a large data quantity, while enabling longitudinal analysis. Future work should aim to investigate the quality, severity, genesis, and phenotypes of symptoms and their relationship to outcomes.

Methods

Keleya pregnancy app

The presented data originates from the symptom tracker of Keleya, a pregnancy app. The app is available in Google Play (Android) and Apple App Store (iOS). A screenshot of the symptom tracker is shown in Fig. 3. The tracker is divided into several categories. The deficiency category is not analyzed in this work. Only fatigue (tired) was analyzed in the mood category. Nausea, flatulence (bloating), constipation, diarrhea, and heartburn were shown as part of the digestion group. The pain group included back pain, foot pain, headache, neck tension, and pelvic area pain. Breathlessness, incontinence (bladder weakness), sleeping difficulty, and water retention were included in the complaints group. The app manufacturer did not include symptoms that are highly likely to indicate adverse events to avoid users simply reporting the symptom in the app but not consulting their caring medical professional. No free text symptom input is possible. Tracking symptoms has an impact on the workouts and recipes suggested by the application, for example if nausea or heartburn are tracked. Users reporting headaches will not be shown exercises involving a “head down” position.

Data extraction and processing

A written data sharing agreement between the parties was in place for data use. Each reported symptom is stored in a relational database, with one entry per symptom, including a time stamp. If several symptoms are reported at the same time (in “one report”), each symptom is stored in its own row, featuring the same time stamp. Consequently, if the symptoms are evaluated in the smallest possible time span, symptoms are considered co-occuring if their time stamps match.

The production data is stored in a relational database. Data were anonymized and exported as *.csv-file for consecutive processing and analysis. All processing and analysis steps were performed using Python 3.9.1, Pandas 1.2.0 was used for data management, Matplotlib 3.6.2 to generate graphs, and Scipy 1.10.1 as well as Statsmodels 0.13.5 for statistical analysis.

As preprocessing step, data recorded after pregnancy week 50 were excluded. Data after gestation week 41 are not included in Fig. 2, as weekly symptom counts were too low (<10 in some cases) to be expressive. Trimester 1 was defined as weeks 0–13, trimester 2 as weeks 14–27, and trimester 3 from weeks 28–40. As the exact birth date of pregnancies is not known, data was assigned to the Puerperium if symptoms were reported from weeks 41–50.

We excluded all “deficiency” categories, as they are not present in similar works14,69, and it is unclear how pregnant women decide that these symptoms affect them. These symptoms are denoted and grouped as “Nutrition Deficiencies” in selected tables and analysis.

The data selection criteria presented in Table 1 were collectively brainstormed and discussed by the authors. As little previous work in literature exists on this topic, ideas from commercial web usage analytics tools were adapted, such as tracking frequency or a minimum usage time. We furthermore differentiated the data selection criteria by new and returning users.

The registration date is the date at which the user registered an account for the underlying Keleya pregnancy app. The usage time is defined as the difference between the first and last day of usage of the symptom tracker. Note that the registration time can differ from the first day of usage.

Two symptoms were added during the investigated time frame: Foot pain (March 6, 2020) and flatulence (June 20, 2020). The resulting occurrence calculations are thus unreliable.

Analysis and statistical evaluation

Symptom occurrence was calculated as the number of users affected by a symptom divided by all users that reported a symptom in the respective time period. In fact, if five users reported back pain in gestation week 10, and 20 users reported symptoms in total in this period, the occurrence is 25%.

The week-by-week symptom occurrence can be seen as a Bernoulli experiment. Users experience or do not experience a symptom. We thus used the Agresti-Coull interval to estimate the 95% confidence intervals, using the proportion_confint function of the Statsmodels package. The Agresti-Coull interval was suggested by prior work for large samples sizes93.

Chi-squared tests were performed using the chi2_contingency function of the Scipy package. A p value of 0.05 was used as threshold for significance analysis. For calculating effect sizes, we defined ϕ=(χ2)/N, where χ2 is the respective statistics, and N the total number of samples. Odds ratios were calculated using Scipy’s odds_ratio function, and the risk ratio was derived by dividing the two occurrences.

To assess correlation coefficients, the data set with the most active users (selection criterion N) was used. For each user, the respective symptom counts in each time frame (Trimesters 1–3, Puerperium) were calculated. This resulted in an array of size 84 for each user (21 symptoms for 4 time frames). Data was normalized user-wise across all time frames to accommodate for differences in reporting, e.g., where one user reported 50 symptoms in total, and another 500 in total. Thus, each symptom-period count was normalized by the total sum of reported symptoms by this user. From this, the total sum of all reported symptoms for each individual user is 1 after normalization.

Supplementary information

Supplementary Material (503.9KB, pdf)

Acknowledgements

This work was supported by the Federal Ministry of Health on the basis of a decision by the German Bundestag (grant number 2519DAT400). Bjoern M. Eskofier gratefully acknowledges the support of the German Research Foundation (DFG) within the framework of the Heisenberg professorship programme (grant number ES 434/8-1). The authors would like to acknowledge the financial support from Deutsche Forschungsgemeinschaft and Friedrich-Alexander Universität Erlangen-Nürnberg within the funding program Open Access Publication Funding.

Author contributions

M.N., V.E., B.M.E., H.L. conceived the study. M.N., N.B.C. analyzed the data and created all figures. M.N. drafted the paper. N.B.C., M.F., K.M.J., D.B., V.E., H.L. assisted in writing the paper. D.B., P.A.F., B.M.E., H.L. assisted with statistical analysis. A.T., P.A.F. provided clinical expertize and guidance. P.A.F., V.E., B.M.E., H.L. supervised the project. All authors commented, critically reviewed, and approved the final paper.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Data availability

The data sets used and/or analyzed during the current study available from the corresponding author on reasonable request and approval from Keleya Digital-Health Solutions GmbH.

Code availability

The underlying code for this study and data sets is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author.

Competing interests

V.E. is an executive director of Keleya Digital-Health Solutions GmbH, the manufacturer of the application from which the analyzed data set was derived. The remaining authors declare no competing interests.

Ethical approval

This study was retrospectively conducted on an anonymized data set. The ethics committee of the Friedrich-Alexander-Universität Erlangen-Nürnberg confirmed that no ethical approval was required, and provided a written waiver with the respective statement (23-54-ANF).

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s41746-023-00935-3.

References

  • 1.Lutterodt, M. C. et al. Examining to what extent pregnancy-related physical symptoms worry women in the first trimester of pregnancy: a cross-sectional study in general practice. BJGP Open3 (4) https://bjgpopen.org/content/3/4/bjgpopen19X101674. 10.3399/bjgpopen19X101674 (2019). [DOI] [PMC free article] [PubMed]
  • 2.Skarica B. Effectiveness of Manual Treatment on Pregnancy Symptoms: Usefulness of Manual Treatment in Treating Pregnancy Symptoms. Med. Arch. 2018;72:131–135. doi: 10.5455/medarh.2018.72.131-135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Zib M, Lim L, Walters WAW. Symptoms During Normal Pregnancy: A Prospective Controlled Study. Australian New Zealand J. Obstetr. Gynaecol. 1999;39:401–410. doi: 10.1111/j.1479-828x.1999.tb03122.x. [DOI] [PubMed] [Google Scholar]
  • 4.Foxcroft KF, Callaway LK, Byrne NM, Webster J. Development and validation of a pregnancy symptoms inventory. BMC Preg. Childbirth. 2013;13:3. doi: 10.1186/1471-2393-13-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Rodriguez A, Bohlin G, Lindmark G. Symptoms across pregnancy in relation to psychosocial and biomedical factors: Symptoms across pregnancy. Acta Obstetr. et Gynecol. Scandinavica. 2001;80:213–223. doi: 10.1034/j.1600-0412.2001.080003213.x. [DOI] [PubMed] [Google Scholar]
  • 6.Bai G, et al. Associations between Nausea, Vomiting, Fatigue and Health-Related Quality of Life of Women in Early Pregnancy: The Generation R Study. PloS ONE. 2016;11:e0166133. doi: 10.1371/journal.pone.0166133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Meyer LC, Peacock JL, Bland JM, Anderson HR. Symptoms and health problems in pregnancy: their association with social factors, smoking, alcohol, caffeine and attitude to pregnancy. Paediatr. Perinatal Epidemiol. 1994;8:145–155. doi: 10.1111/j.1365-3016.1994.tb00445.x. [DOI] [PubMed] [Google Scholar]
  • 8.Beebe KR, Gay CL, Richoux SE, Lee KA. Symptom Experience in Late Pregnancy. J. Obstetr. Gynecol. Neonatal Nursing. 2017;46:508–520. doi: 10.1016/j.jogn.2017.03.007. [DOI] [PubMed] [Google Scholar]
  • 9.Järnfelt-Samsioe A, Samsioe G, Velinder G-M. Nausea and Vomiting in Pregnancy - A Contribution to Its Epidemiology. Gynecol. Obstetr. Investig. 1983;16:221–229. doi: 10.1159/000299262. [DOI] [PubMed] [Google Scholar]
  • 10.Brynhildsen J, Hansson A, Persson A, Hammar M. Follow-up of patients with low back pain during pregnancy. Obstetr. Gynecol. 1998;91:182–186. doi: 10.1016/s0029-7844(97)00630-3. [DOI] [PubMed] [Google Scholar]
  • 11.Lagadec N, et al. Factors influencing the quality of life of pregnant women: a systematic review. BMC Preg. Childbirth. 2018;18:455. doi: 10.1186/s12884-018-2087-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Boutib A, Chergaoui S, Marfak A, Hilali A, Youlyouz-Marfak I. Quality of Life During Pregnancy from 2011 to 2021: Systematic Review. Int. J. Women’s Health. 2022;14:975–1005. doi: 10.2147/IJWH.S361643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Andersson L, Sundström-Poromaa I, Wulff M, Aström M, Bixo M. Implications of antenatal depression and anxiety for obstetric outcome. Obstetr. Gynecol. 2004;104:467–476. doi: 10.1097/01.AOG.0000135277.04565.e9. [DOI] [PubMed] [Google Scholar]
  • 14.Dørheim S, Bjorvatn B, Eberhard-Gran M. Sick leave during pregnancy: a longitudinal study of rates and risk factors in a Norwegian population. BJOG: Int. J. Obstetr. Gynaecol. 2013;120:521–530. doi: 10.1111/1471-0528.12035. [DOI] [PubMed] [Google Scholar]
  • 15.Maisonneuve E. Mode de vie et règles hygiénodiététiques pour la prévention de la prématurité spontanée chez la femme enceinte asymptomatique. J. de Gynécologie Obstétrique et Biologie de la Reproduct. 2016;45:1231–1246. doi: 10.1016/j.jgyn.2016.09.019. [DOI] [PubMed] [Google Scholar]
  • 16.Vollrath ME, Sengpiel V, Landolt MA, Jacobsson B, Latal B. Is maternal trait anxiety a risk factor for late preterm and early term deliveries? BMC Pregnancy Childbirth. 2016;16:286. doi: 10.1186/s12884-016-1070-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Dunkel Schetter C, Tanner L. Anxiety, depression and stress in pregnancy: implications for mothers, children, research, and practice. Curr. Opin. Psychiatr. 2012;25:141–148. doi: 10.1097/YCO.0b013e3283503680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Alder J, Fink N, Bitzer J, Hösli I, Holzgreve W. Depression and anxiety during pregnancy: A risk factor for obstetric, fetal and neonatal outcome? A critical review of the literature. J. Maternal-Fetal Neonatal Med. 2007;20:189–209. doi: 10.1080/14767050701209560. [DOI] [PubMed] [Google Scholar]
  • 19.Swallow B, Lindow S, Masson E, Hay D. Psychological health in early pregnancy: relationship with nausea and vomiting. J. Obstetr. Gynaecol. 2004;24:28–32. doi: 10.1080/01443610310001620251. [DOI] [PubMed] [Google Scholar]
  • 20.DeVilbiss EA, et al. Vaginal bleeding and nausea in early pregnancy as predictors of clinical pregnancy loss. Am. J. Obstetr. Gynecol. 2020;223:570.e1–570.e14. doi: 10.1016/j.ajog.2020.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Gazmararian JA, et al. Hospitalizations during pregnancy among managed care enrollees. Obstetr. Gynecol. 2002;100:94–100. doi: 10.1016/s0029-7844(02)02024-0. [DOI] [PubMed] [Google Scholar]
  • 22.MacGibbon, K., Fejzo, M. & Mullin, P. Mortality Secondary to Hyperemesis Gravidarum: A Case Report. Womens’ Health % Gynecol.1 (2) (2015).
  • 23.Sedov ID, Anderson NJ, Dhillon AK, Tomfohr-Madsen LM. Insomnia symptoms during pregnancy: A meta-analysis. J. Sleep Res. 2021;30:e13207. doi: 10.1111/jsr.13207. [DOI] [PubMed] [Google Scholar]
  • 24.Felder JN, Baer RJ, Rand L, Jelliffe-Pawlowski LL, Prather AA. Sleep Disorder Diagnosis During Pregnancy and Risk of Preterm Birth. Obstetr. Gynecol. 2017;130:573. doi: 10.1097/AOG.0000000000002132. [DOI] [PubMed] [Google Scholar]
  • 25.Haney A, Buysse DJ, Rosario BL, Chen Y-F, Okun ML. Sleep disturbance and cardiometabolic risk factors in early pregnancy: a preliminary study. Sleep Med. 2014;15:444–450. doi: 10.1016/j.sleep.2014.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Okun ML, Hanusa BH, Hall M, Wisner KL. Sleep Complaints in Late Pregnancy and the Recurrence of Postpartum Depression. Behav. Sleep Med. 2009;7:106–117. doi: 10.1080/15402000902762394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Li R, et al. Sleep disturbances during pregnancy are associated with cesarean delivery and preterm birth. J. Maternal-Fetal Neonatal Med. 2017;30:733–738. doi: 10.1080/14767058.2016.1183637. [DOI] [PubMed] [Google Scholar]
  • 28.Micheli K, et al. Sleep patterns in late pregnancy and risk of preterm birth and fetal growth restriction. Epidemiology (Cambridge, Mass.) 2011;22:738–744. doi: 10.1097/EDE.0b013e31822546fd. [DOI] [PubMed] [Google Scholar]
  • 29.Sapra KJ, et al. Signs and Symptoms of Early Pregnancy Loss: A Systematic Review. Reprod. Sci. 2017;24:502–513. doi: 10.1177/1933719116654994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Cheng CY, Pickler RH. Perinatal Stress, Fatigue, Depressive Symptoms, and Immune Modulation in Late Pregnancy and One Month Postpartum. Sci. World J. 2014;2014:e652630. doi: 10.1155/2014/652630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Inan OT, et al. Digitizing clinical trials. npj Digit. Med. 2020;3:1–7. doi: 10.1038/s41746-020-0302-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Van den Bergh O, Walentynowicz M. Accuracy and bias in retrospective symptom reporting. Curr. Opin. Psychiatr. 2016;29:302. doi: 10.1097/YCO.0000000000000267. [DOI] [PubMed] [Google Scholar]
  • 33.Tripp N, et al. An emerging model of maternity care: Smartphone, midwife, doctor? WOMEN BIRTH. 2014;27:64–67. doi: 10.1016/j.wombi.2013.11.001. [DOI] [PubMed] [Google Scholar]
  • 34.Musgrave LM, Kizirian NV, Homer CSE, Gordon A. Mobile Phone Apps in Australia for Improving Pregnancy Outcomes: Systematic Search on App Stores. JMIR mHealth and uHealth. 2020;8:e22340. doi: 10.2196/22340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Scott KM, Gome GA, Richards D, Caldwell PHY. How trustworthy are apps for maternal and child health? Health Technol. 2015;4:329–336. [Google Scholar]
  • 36.Liu F, Panagiotakos D. Real-world data: a brief review of the methods, applications, challenges and opportunities. BMC Med. Res. Methodol. 2022;22:287. doi: 10.1186/s12874-022-01768-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Branum AM, Ahrens KA. Trends in Timing of Pregnancy Awareness Among US Women. Mater. Child Health J. 2017;21:715–726. doi: 10.1007/s10995-016-2155-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Ammon Avalos L, Galindo C, Li D-K. A systematic review to calculate background miscarriage rates using life table analysis. Birth Defects Res. Part A: Clin. Mol. Teratol. 2012;94:417–423. doi: 10.1002/bdra.23014. [DOI] [PubMed] [Google Scholar]
  • 39.Robinson GE. Pregnancy loss. Best Pract. Res. Clin. Obstetr. Gynaecol. 2014;28:169–178. doi: 10.1016/j.bpobgyn.2013.08.012. [DOI] [PubMed] [Google Scholar]
  • 40.Ecker JL, et al. #3: Periviable birth. Am. J. Obstetr. Gynecol. 2015;213:604–614. doi: 10.1016/j.ajog.2015.08.035. [DOI] [PubMed] [Google Scholar]
  • 41.Dey, R. & Mulekar, M. S. in Effect Size as a Measure of Difference Between Two Populations (eds Alhajj, R. & Rokne, J.) Encyclopedia of Social Network Analysis and Mining 715–726 (Springer, New York, NY, 2018). 10.1007/978-1-4939-7131-2_110195.
  • 42.Chen H, Cohen P, Chen S. How Big is a Big Odds Ratio? Interpreting the Magnitudes of Odds Ratios in Epidemiological Studies. Commun. Statistics - Simul. Comput. 2010;39:860–864. [Google Scholar]
  • 43.Vermani E, Mittal R, Weeks A. Pelvic Girdle Pain and Low Back Pain in Pregnancy: A Review. Pain Pract. 2010;10:60–71. doi: 10.1111/j.1533-2500.2009.00327.x. [DOI] [PubMed] [Google Scholar]
  • 44.Casagrande D, Gugala Z, Clark SM, Lindsey RW. Low Back Pain and Pelvic Girdle Pain in Pregnancy. J. Am. Acad. Orthopaedic Surg. 2015;23:539. doi: 10.5435/JAAOS-D-14-00248. [DOI] [PubMed] [Google Scholar]
  • 45.Wu WH, et al. Pregnancy-related pelvic girdle pain (PPP), I: Terminology, clinical presentation, and prevalence. Eur. Spine J. 2004;13:575–589. doi: 10.1007/s00586-003-0615-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Wesnes SL, Lose G. Preventing urinary incontinence during pregnancy and postpartum: a review. Int. Urogynecol. J. 2013;24:889–899. doi: 10.1007/s00192-012-2017-3. [DOI] [PubMed] [Google Scholar]
  • 47.Moossdorff-Steinhauser HFA, Berghmans BCM, Spaanderman MEA, Bols EMJ. Prevalence, incidence and bothersomeness of urinary incontinence in pregnancy: a systematic review and meta-analysis. Int. Urogynecol. J. 2021;32:1633–1652. doi: 10.1007/s00192-020-04636-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Vleeming A, Albert HB, Östgaard HC, Sturesson B, Stuge B. European guidelines for the diagnosis and treatment of pelvic girdle pain. Eur. Spine J. 2008;17:794–819. doi: 10.1007/s00586-008-0602-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Jensen D, Ofir D, O’Donnell DE. Effects of pregnancy, obesity and aging on the intensity of perceived breathlessness during exercise in healthy humans. Respir. Physiol. Neurobiol. 2009;167:87–100. doi: 10.1016/j.resp.2009.01.011. [DOI] [PubMed] [Google Scholar]
  • 50.Derbyshire E, Davies J, Costarelli V, Dettmar P. Diet, physical inactivity and the prevalence of constipation throughout and after pregnancy. Matern. Child Nutr. 2006;2:127–134. doi: 10.1111/j.1740-8709.2006.00061.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Zielinski R, Searing K, Deibel M. Gastrointestinal Distress in Pregnancy: Prevalence, Assessment, and Treatment of 5 Common Minor Discomforts. J. Perinatal Neonatal Nursing. 2015;29:23–31. doi: 10.1097/JPN.0000000000000078. [DOI] [PubMed] [Google Scholar]
  • 52.Bradley CS, Kennedy CM, Turcea AM, Rao SSC, Nygaard IE. Constipation in pregnancy: prevalence, symptoms, and risk factors. Obstetr. Gynecol. 2007;110:1351–1357. doi: 10.1097/01.AOG.0000295723.94624.b1. [DOI] [PubMed] [Google Scholar]
  • 53.Gomes CF, Sousa M, Lourenço I, Martins D, Torres J. Gastrointestinal diseases during pregnancy: what does the gastroenterologist need to know? Ann. Gastroenterol. 2018;31:385–394. doi: 10.20524/aog.2018.0264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wald A. Constipation, diarrhea, and symptomatic hemorrhoids during pregnancy. Gastroenterol. Clin. North America. 2003;32:309–322. doi: 10.1016/s0889-8553(02)00069-9. [DOI] [PubMed] [Google Scholar]
  • 55.Menees SB, Lembo A, Charabaty A. Fecal Incontinence and Diarrhea During Pregnancy. Off. J. Am. Coll. Gastroenterol ACG. 2022;117:26. doi: 10.14309/ajg.0000000000001964. [DOI] [PubMed] [Google Scholar]
  • 56.Lee KA, Zaffke M. Longitudinal Changes in Fatigue and Energy During Pregnancy and the Postpartum Period. J. Obstetr. Gynecol. Neonatal Nursing. 1999;28:183–191. doi: 10.1111/j.1552-6909.1999.tb01983.x. [DOI] [PubMed] [Google Scholar]
  • 57.Chou F-H, Kuo S-H, Wang R-H. A Longitudinal Study of Nausea and Vomiting, Fatigue and Perceived Stress in, and Social Support for, Pregnant Women Through the Three Trimesters. Kaohsiung J. Med. Sci. 2008;24:306–314. doi: 10.1016/S1607-551X(08)70157-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Paarlberg KM, et al. Psychosocial factors as predictors of maternal well-being and pregnancy-related complaints. J. Psychosomatic Obstetr. Gynecol. 1996;17:93–102. doi: 10.3109/01674829609025669. [DOI] [PubMed] [Google Scholar]
  • 59.Fawcett J, York R. Spouses’ Physical and Psychological Symptoms During Pregnancy and the Postpartum. Nursing Res. 1986;35:144. [PubMed] [Google Scholar]
  • 60.Elek SM, Hudson DB, Fleck MO. Expectant Parents’ Experience with Fatigue and Sleep During Pregnancy. Birth. 1997;24:49–54. doi: 10.1111/j.1523-536x.1997.tb00336.x. [DOI] [PubMed] [Google Scholar]
  • 61.Affonso DD, Lovett S, Paul SM, Sheptak S. A Standardized Interview That Differentiates Pregnancy and Postpartum Symptoms from Perinatal Clinical Depression. Birth. 1990;17:121–130. doi: 10.1111/j.1523-536x.1990.tb00716.x. [DOI] [PubMed] [Google Scholar]
  • 62.Varol T, Göker A, CezayiRli E, Özgür S, Tuç Yücel A. Relation between foot pain and plantar pressure in pregnancy. Turkish J Med. Sci. 2017;47:1104–1108. doi: 10.3906/sag-1601-185. [DOI] [PubMed] [Google Scholar]
  • 63.Foti T, Davids JR, Bagley A. A Biomechanical Analysis of Gait During Pregnancy*. JBJS. 2000;82:625. [PubMed] [Google Scholar]
  • 64.Negro A, et al. Headache and pregnancy: a systematic review. J. Headache Pain. 2017;18:106. doi: 10.1186/s10194-017-0816-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Menon R, Bushnell CD. Headache and Pregnancy. Neurologist. 2008;14:108. doi: 10.1097/NRL.0b013e3181663555. [DOI] [PubMed] [Google Scholar]
  • 66.MacGregor EA. Headache in Pregnancy. Neurol. Clin. 2012;30:835–866. doi: 10.1016/j.ncl.2012.04.001. [DOI] [PubMed] [Google Scholar]
  • 67.Burch R. Headache in Pregnancy and the Puerperium. Neurol. Clin. 2019;37:31–51. doi: 10.1016/j.ncl.2018.09.004. [DOI] [PubMed] [Google Scholar]
  • 68.Aegidius K, Zwart J-A, Hagen K, Stovner L. The Effect of Pregnancy and Parity on Headache Prevalence: The Head-HUNT Study. Headache: J. Head Face Pain. 2009;49:851–859. doi: 10.1111/j.1526-4610.2009.01438.x. [DOI] [PubMed] [Google Scholar]
  • 69.Scharff L, Marcus DA, Turk DC. Headache During Pregnancy in the Postpartum: A Prospective Study. Headache: J. Head Face Pain. 1997;37:203–210. doi: 10.1046/j.1526-4610.1997.3704203.x. [DOI] [PubMed] [Google Scholar]
  • 70.Vazquez JC. Constipation, haemorrhoids, and heartburn in pregnancy. BMJ Clin. Evid. 2010;2010:1411. [PMC free article] [PubMed] [Google Scholar]
  • 71.Gadsby R, Barnie-Adshead AM, Jagger C. A prospective study of nausea and vomiting during pregnancy. Br. J. General Pract.: J. Royal Coll. General Pract. 1993;43:245–248. [PMC free article] [PubMed] [Google Scholar]
  • 72.Smith C, Crowther C, Beilby J, Dandeaux J. The impact of nausea and vomiting on women: a burden of early pregnancy. Aust. New Zealand J. Obstetr. Gynaecol. 2000;40:397–401. doi: 10.1111/j.1479-828x.2000.tb01167.x. [DOI] [PubMed] [Google Scholar]
  • 73.Czeizel AE, Puhó E. Association between severe nausea and vomiting in pregnancy and lower rate of preterm births. Paediatr. Perinatal Epidemiol. 2004;18:253–259. doi: 10.1111/j.1365-3016.2004.00568.x. [DOI] [PubMed] [Google Scholar]
  • 74.Wood H, McKellar LV, Lightbody M. Nausea and vomiting in pregnancy: Blooming or bloomin’ awful? A review of the literature. Women Birth. 2013;26:100–104. doi: 10.1016/j.wombi.2012.10.001. [DOI] [PubMed] [Google Scholar]
  • 75.Ladyman C, Signal TL. Sleep Health in Pregnancy. Sleep Med. Clin. 2018;13:307–333. doi: 10.1016/j.jsmc.2018.04.004. [DOI] [PubMed] [Google Scholar]
  • 76.van Lier D, Manteuffel B, Dilorio C, Stalcup M. Nausea and Fatigue During Early Pregnancy. Birth. 1993;20:193–197. doi: 10.1111/j.1523-536x.1993.tb00226.x. [DOI] [PubMed] [Google Scholar]
  • 77.Reeves N, Potempa K, Gallo A. Fatigue in early pregnancy An exploratory study. J. Nurse-Midwifery. 1991;36:303–309. doi: 10.1016/0091-2182(91)90045-q. [DOI] [PubMed] [Google Scholar]
  • 78.Lewis EF, et al. Development, psychometric evaluation, and initial feasibility assessment of a symptom tracker for use by patients with heart failure (HFaST) J. Patient-Reported Outcomes. 2019;3:26. doi: 10.1186/s41687-019-0113-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Nahum M, et al. Immediate Mood Scaler: Tracking Symptoms of Depression and Anxiety Using a Novel Mobile Mood Scale. JMIR mHealth and uHealth. 2017;5:e6544. doi: 10.2196/mhealth.6544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Ceney A, et al. Accuracy of online symptom checkers and the potential impact on service utilisation. PLOS ONE. 2021;16:e0254088. doi: 10.1371/journal.pone.0254088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Watson K, Angelotta C. The frequency of pregnancy recognition across the gestational spectrum and its consequences in the United States. Perspect. Sexual Reproduct. Health. 2022;54:32–37. doi: 10.1363/psrh.12192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Czeromin, U. Auszug aus dem DIR Jahrbuch 2021 https://www.deutsches-ivf-register.de/perch/resources/dir-jahrbuch-2021-sonderausgabe-fuer-paare.pdf (2022).
  • 83.Geburten. https://www.destatis.de/DE/Themen/Gesellschaft-Umwelt/Bevoelkerung/Geburten/_inhalt.html.
  • 84.Chien L-Y, Ko Y-L. Fatigue during pregnancy predicts caesarean deliveries. J. Adv. Nursing. 2004;45:487–494. doi: 10.1046/j.1365-2648.2003.02931.x. [DOI] [PubMed] [Google Scholar]
  • 85.Kahn JG, Yang JS, Kahn JS. ‘Mobile’ Health Needs And Opportunities In Developing Countries. Health Affairs. 2010;29:252–258. doi: 10.1377/hlthaff.2009.0965. [DOI] [PubMed] [Google Scholar]
  • 86.Anderson K, Burford O, Emmerton L. Mobile Health Apps to Facilitate Self-Care: A Qualitative Study of User Experiences. PLOS ONE. 2016;11:e0156164. doi: 10.1371/journal.pone.0156164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Richardson S, Lawrence K, Schoenthaler AM, Mann D. A framework for digital health equity. npj Digit. Med. 2022;5:1–6. doi: 10.1038/s41746-022-00663-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Krishnamurti T, Birru Talabi M, Callegari LS, Kazmerski TM, Borrero S. A Framework for Femtech: Guiding Principles for Developing Digital Reproductive Health Tools in the United States. J. Med. Internet Res. 2022;24:e36338. doi: 10.2196/36338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Mangesi, L., Hofmeyr, G. J., Smith, V. & Smyth, R. M. D. Fetal movement counting for assessment of fetal wellbeing. Cochrane Database Syst. Rev. (10), CD004909 10.1002/14651858.CD004909.pub3 (2015). [DOI] [PMC free article] [PubMed]
  • 90.Robbins MS, Farmakidis C, Dayal AK, Lipton RB. Acute headache diagnosis in pregnant women: a hospital-based study. Neurology. 2015;85:1024–1030. doi: 10.1212/WNL.0000000000001954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Mason L, Glenn S, Walton I, Hughes C. Women’s reluctance to seek help for stress incontinence during pregnancy and following childbirth. Midwifery. 2001;17:212–221. doi: 10.1054/midw.2001.0259. [DOI] [PubMed] [Google Scholar]
  • 92.Prescott J, Mackie L, Rathbone AL. Predictors of health anxiety during pregnancy. mHealth. 2018;4:16. doi: 10.21037/mhealth.2018.04.04. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Brown LD, Cai TT, DasGupta A. Interval Estimation for a Binomial Proportion. Statis. Sci. 2001;16:101–133. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material (503.9KB, pdf)

Data Availability Statement

The data sets used and/or analyzed during the current study available from the corresponding author on reasonable request and approval from Keleya Digital-Health Solutions GmbH.

The underlying code for this study and data sets is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author.


Articles from NPJ Digital Medicine are provided here courtesy of Nature Publishing Group

RESOURCES