Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Mar 15;14:6292. doi: 10.1038/s41598-024-55914-9

Development of phenotyping algorithms for hypertensive disorders of pregnancy (HDP) and their application in more than 22,000 pregnant women

Satoshi Mizuno 1, Maiko Wagata 2, Satoshi Nagaie 1, Mami Ishikuro 3, Taku Obara 3, Gen Tamiya 4, Shinichi Kuriyama 3, Hiroshi Tanaka 5, Nobuo Yaegashi 6, Masayuki Yamamoto 7, Junichi Sugawara 6,8, Soichi Ogishima 1,9,
PMCID: PMC10943000  PMID: 38491024

Abstract

Recently, many phenotyping algorithms for high-throughput cohort identification have been developed. Prospective genome cohort studies are critical resources for precision medicine, but there are many hurdles in the precise cohort identification. Consequently, it is important to develop phenotyping algorithms for cohort data collection. Hypertensive disorders of pregnancy (HDP) is a leading cause of maternal morbidity and mortality. In this study, we developed, applied, and validated rule-based phenotyping algorithms of HDP. Two phenotyping algorithms, algorithms 1 and 2, were developed according to American and Japanese guidelines, and applied into 22,452 pregnant women in the Birth and Three-Generation Cohort Study of the Tohoku Medical Megabank project. To precise cohort identification, we analyzed both structured data (e.g., laboratory and physiological tests) and unstructured clinical notes. The identified subtypes of HDP were validated against reference standards. Algorithms 1 and 2 identified 7.93% and 8.08% of the subjects as having HDP, respectively, along with their HDP subtypes. Our algorithms were high performing with high positive predictive values (0.96 and 0.90 for algorithms 1 and 2, respectively). Overcoming the hurdle of precise cohort identification from large-scale cohort data collection, we achieved both developed and implemented phenotyping algorithms, and precisely identified HDP patients and their subtypes from large-scale cohort data collection.

Subject terms: Diseases, Medical research, Data mining, Computational biology and bioinformatics, High-throughput screening

Introduction

In recent years, many phenotyping algorithms have been developed for scalable and high-throughput cohort identification from electronic health records (EHRs)15 for genome- and phenome-wide association studies6,7 and further biomedical analyses8. In past studies, many phenotyping algorithms have been developed using EHRs913. Phenotyping is also expected to stratify patients into subgroups that reflect the molecular and clinical diversity of diseases14, which is expected to have substantial potential to contribute to risk estimation15, diagnostic support and therapeutic development16 and the determination of the best available care for each patient based on their stratified subgroups.

Prospective genome cohort studies are known as one of the most important resources for developing precision medicine17,18. In these cohort studies, a wide range of data, including lifestyle, genomic, molecular, medical record and phenotypic data1921, are collected and analyzed to elucidate clinical problems. The Birth and Three-Generation Cohort Study (BirThree Cohort Study)19 of the Tohoku Medical Megabank (TMM) project is a large-scale prospective genome cohort study that recruited more than 70,000 subjects, including more than 20,000 pregnant women and their family members in 48 hospitals in Miyagi Prefecture from regional populations.

Hypertensive disorders of pregnancy (HDP) is one of the leading cause of maternal morbidity and mortality, affecting approximately 2–8%22 of all pregnancies, and is a clinical syndrome that mainly includes three subgroups: gestational hypertension (GH), preeclampsia (PE), and superimposed preeclampsia (SPE)23. It is known that there are major differences between the subgroups in the likelihood to develop severe complications and in pathogenesis2429. However, accurately diagnosing these subgroups is quite difficult in clinical practice, and the same treatment has been applied across the several subgroups. In addition to the subgrouping, further classification into early onset (EO) and late onset (LO) based on the timing of onset has gained attention because of the different clinical forms with different pathophysiological features30,31. Because of these major differences in the subgroups of HDP, the precise classification of patient diseases into subgroups by phenotyping algorithms is necessary for genomic, epidemiologic and other biomedical analyses.

In hospital-based clinical research, clinical research coordinators (CRC) are able to identify patients precisely by creating a case report form (CRF) using clinical data, including clinical notes. However, there are severe limitations in the phenotyping of patients in large-scale cohort studies that lack of complete unstructured clinical notes and have limited accessible EHR data for partial subjects due to the lack of linkages in cohort data collection because the subjects of the cohort study are not patients of a hospital but are part of the regional population. Recently, the UK Biobank32 and All of Us33, which are large-scale prospective genome cohort studies, collected structured data on EHRs, such as diagnoses, laboratory results, medications, and billing codes, using record linkages; however, collection of clinical notes were still not achieved. Because of this limitation, it is not realistic to perform phenotyping of hundreds of thousands of subjects in a large-scale cohort using the conventional approach in clinical research. Consequently, it is necessary to develop new high-throughput and objective phenotyping algorithms that exploit clinical notes.

In this study, we developed two rule-based HDP phenotyping algorithms and programmatically implemented them that follow American and Japanese clinical guidelines because there are no internationally standardized guidelines for HDP, but domestic guidelines have been developed3437. The developed algorithms were applied to phenotype approximately 23,000 pregnant women in the BirThree cohort. We also compared the phenotyped subgroups with a clinician chart review to evaluate the consistency of the clinical concepts between the developed algorithms and the clinical guidelines.

Methods

Development of hypertensive disorders of pregnancy (HDP) phenotyping algorithm 1

We developed HDP phenotyping algorithm 1 according to the American College of Obstetricians and Gynecologists (ACOG) guidelines23. Algorithm 1 detects HDP patients based on hypertensive disease history, blood pressure (BP), proteinuria (PU), timing of onset and PE-related clinical conditions (see “Details of the identification rules for the HDP subgroups in algorithm 1” section in the Supplemental Methods).

PU in early pregnancy is caused by several possibilities both related and unrelated to HDP, such as some degree of renal insufficiency and chronic kidney disease (CKD). To avoid overestimating the HDP group, we implemented rules to exclude subjects with PU in early pregnancy from the HDP group.

Development of hypertensive disorders of pregnancy (HDP) phenotyping algorithm 2

We developed HDP phenotyping algorithm 2 according to the guidelines of the Japan Society of Obstetrics and Gynecology (JSOG) (see “Details of the identification rules for the HDP subgroups in algorithm 2” section in the Supplemental Methods). In algorithm 2, hypertensive subjects without PU were assigned to SPE or PE when the following conditions were met: maternal organ dysfunction (see Supplementary Table 1) or light-for-date. (see “Selection of subjects with light-for-date” section in the Supplemental Methods). The patients having maternal organ dysfunction were determined by analyzing clinical notes (see “Selection of subjects with maternal organ dysfunction” section in the Supplemental Methods). More details about the process used to reach the target conditions are provided in the “Details of the process used to analyze unstructured clinical notes” section in the Supplemental Methods.

Implementation of the developed phenotyping algorithms

The developed phenotyping algorithms according to American and Japanese clinical guidelines were implemented by Python 3.8.10 and Perl 5.16.3 (https://github.com/ogishimalab/HDP_phenotyping_algorithms). The implementations take no arguments and take both data for phenotyping and the lists of subjects who have specific phenotype about exclusion and inclusion criteria of HDP and their subtypes. The lists were required to prepare before phenotyping by analysis of clinical notes performed by such as pattern matching based on rules (see “Details of the processes used to analyze unstructured clinical notes” and “Selection of subjects with light-for-date” in Supplemental Methods).

Data sources used to apply the developed algorithms

We selected research subjects from approximately 23,000 pregnant women who were included in the BirThree Cohort study, which is a population-based prospective cohort study. We excluded subjects who did not have medical records at delivery, who did not have BP and PU data, who did not have a live birth and who withdrew from the study. Using these criteria, we included 22,452 research subjects in the dataset. The baseline characteristics of all pregnant women in the BirThree Cohort study are provided in a previous report38.

All data in our dataset were transcribed from the clinical records of usual care by trained genome medical research coordinators (GMRCs) from the interview conducted at the first visit, prenatal checkups and records at delivery (left side of Fig. 1). The average number of visits for prenatal checkups and measurements of BP and PU for pregnant women are shown in Supplementary Table 2. All items used in our phenotyping algorithms are listed in Supplementary Table 3.

Figure 1.

Figure 1

Overview of the sources of the cohort data collection and diagnosis dataset. The left side represents data from the cohort that consist of an interview conducted at the first visit, prenatal checkups and records at delivery. The right side represents the diagnosis dataset from the medical record used in validation. The diagnosis dataset consisted of both intrapartum and postpartum data and ultrasound measurements, in addition to cohort collection.

Comparison of phenotyped subgroups and diagnoses

We conducted a clinician chart review to compare the subgroups phenotyped by the algorithms with the diagnoses. The comparison was performed to evaluate the consistency of clinical concepts between the developed algorithms and the clinical guidelines. A clinician chart review was performed based on the American guidelines23 for 252 subjects who met the following criteria: (1) subjects recruited at Tohoku University Hospital between September 2015 and November 2016 and (2) subjects who were able to access their medical records. The clinician chart review was conducted by two obstetricians, a mid-career and a senior obstetrician. If the subgroups assigned by the two doctors differed, the subgroups were determined by discussion. The chart review was performed blindly and annotated in a binary manner for each subgroup of HDP, CH and normotensive. The baseline characteristics of the 252 subjects are shown in Supplementary Table 4.

We calculated positive predictive values (PPVs), negative predictive values (NPVs), accuracies, sensitivities and specificities to compare our algorithms against the diagnoses. To calculate sensitivity and specificity, we identified true positive (TP), true negative (TN), false positive (FP), false negative (FN) by comparison of identified subgroups between phenotyping algorithms and diagnosis of 252 subjects. Based on the four indices, we identified sensitivity and specificity as TP/(TP + FN) and TN/(TN + FP). We identified accuracy using scikit-learn in python3. The diagnosed subgroups of the 252 subjects are listed in Supplementary Table 5. In this study, there were differences in data coverage between the cohort data for phenotyping and the diagnosis dataset that only diagnosis dataset included both intrapartum and postpartum data and ultrasound measurements, in addition to cohort data (right side of Fig. 1). The all items used for the diagnoses are listed in Supplementary Table 6.

Ethics declarations

This study was approved by the ethical committee of Tohoku Medical Megabank Organization, Tohoku University. We confirm that all methods took place in accordance with relevant guidelines and regulations, under informed consent from all participants.

Results

The developed HDP phenotyping algorithms

The rules of the developed HDP phenotyping algorithms 1 and 2 are shown in Fig. 2a and b, respectively. The detailed results of the identification of subjects with a hypertensive disease history and maternal organ dysfunction by analyzing unstructured clinical notes are described in the “Details of the identified subjects with a hypertensive disease history and maternal organ dysfunction” section in the Supplemental Results. The detailed numbers of HDP subgroups phenotyped by the developed phenotyping algorithms are described in “The number of phenotyped subgroups of HDP” section.

Figure 2.

Figure 2

The developed rule-based phenotyping algorithms. (a) The rules of the developed HDP phenotyping algorithm 1. Phenotyping by algorithm 1 was conducted based on hypertensive disease history, BP, PU, timing of onset and PE-related conditions. The black boxes represent the conditions of rules, the arrow diagrams represent IF/THEN rules, and the orange boxes represent phenotyped subgroups. (b) The rules of the developed HDP phenotyping algorithm 2. The rules of the developed HDP phenotyping algorithm 2. Phenotyping by algorithm 2 was conducted with maternal organ dysfunction and light-for-date in addition to the factors included in algorithm 1.

The number of phenotyped subgroups of HDP

Of the 22,452 research subjects, 1781 (7.93%) were phenotyped as having HDP by phenotyping algorithm 1. Among the subgroups of HDP, 890 subjects (3.96%) had GH, 304 (1.35%) had SPE, and 587 (2.61%) had PE. According to phenotyping with algorithm 2, 1813 (8.08%) subjects had HDP. Among the subgroups of HDP, 825 subjects (3.67%) had GH, 336 (1.50%) had SPE, and 652 (2.90%) had PE (Table 1). Compared with algorithm 1, fewer subjects had GH or CH, while more had PE or SPE according to algorithm 2 because the algorithm 2 classifies hypertensive subjects without PU into PE or SPE based on the criteria such as subjects having epigastralgia, HELLP syndrome, eclampsia and FGR, in addition to the criteria of algorithm 1.

Table 1.

Number of subgroups phenotyped by the phenotyping algorithms.

Algorithm 1 Algorithm 2
EO LO EO LO
GH 356 (1.59%) 534 (2.38%) 325 (1.45%) 500 (2.23%)
SPE 207 (0.92%) 97 (0.43%) 219 (0.98%) 117 (0.52%)
PE 140 (0.62%) 447 (1.99%) 149 (0.66%) 503 (2.24%)
CH 542 (2.41%) 510 (2.27%)
Normotensive 20,129 (89.65%)
Total 22,452

Among the subgroups phenotyped by algorithm 1 and 2, 97 (0.44%) subjects were classified into different subgroups of HDP (Supplementary Table 7).

The causes of the differences in the phenotyped HDP subgroups between algorithm 1 and 2 are shown in Supplementary Table 8. The sum of the subjects was not equal to 97 because one person may have had multiple symptoms and differences in the timing of the symptoms.

Comparison of the subgroups identified by HDP phenotyping algorithms with diagnoses

Of the 252 subjects included in the diagnosis dataset, 36 (14.2%) were diagnosed as HDP. The subgroups of the 252 subjects are shown in Supplementary Table 5. The performance of phenotyping algorithm 1 and 2 compared to the diagnoses is shown in Tables 2 and 3, respectively. Comparisons for GH EO were not applicable because none of the 252 subjects was diagnosed with GH EO.

Table 2.

The performance of phenotyping algorithm 1.

Algorithm 1
PPV NPV Accuracy Sensitivity Specificity
HDP 0.96 0.96 0.96 0.72 1.00
GH 0.75 0.98 0.97 0.33 1.00
GH EO
GH LO 0.67 0.97 0.97 0.22 1.00
SPE 0.92 1.00 0.99 0.92 1.00
SPE EO 0.89 1.00 0.99 0.89 1.00
SPE LO 1.00 1.00 1.00 1.00 1.00
PE 0.91 0.98 0.98 0.67 1.00
PE EO 1.00 1.00 1.00 0.50 1.00
PE LO 0.90 0.98 1.00 0.50 1.00

Table 3.

The performance of phenotyping algorithm 2.

Algorithm 2
PPV NPV Accuracy Sensitivity Specificity
HDP 0.90 0.96 0.95 0.75 0.99
GH 0.67 0.97 0.97 0.22 1.00
GH EO
GH LO 0.50 0.97 0.96 0.11 1.00
SPE 0.80 0.99 0.99 1.00 0.99
SPE EO 0.82 1.00 0.99 1.00 0.99
SPE LO 0.75 1.00 1.00 1.00 1.00
PE 0.83 0.97 0.97 0.67 0.99
PE EO 1.00 1.00 1.00 0.50 1.00
PE LO 0.82 1.00 0.98 0.50 1.00

For phenotyping of HDP overall, the PPVs, NPVs, accuracies, and specificities of both algorithms 1 and 2 were high (0.96, 0.96, 0.96, and 1.00 for the PPVs, NPVs, accuracy and specificity of algorithm 1 and 0.90, 0.96, 0.95, and 0.99 for the PPV, NPV, accuracy and specificity of algorithm 2, respectively), and the sensitivities were modest (0.72 and 0.75 for algorithm 1 and 2, respectively). Of the 252 subjects in the diagnosis datasets, 15 and 16 subjects were classified into different subgroups by algorithm 1 and algorithm 2, respectively. Of the 15 subjects with different subgroups by algorithm 1, 10 of the differences were caused by missing intrapartum and postpartum data, 1 was caused by a missing creatinine level, 1 was caused by transcription error and 3 were caused by headache that was not accepted as an unexplained new-onset headache that was unresponsive in the chart review (see Supplementary Table 9).

Of the 16 subjects who showed differences by phenotyping algorithm 2, three were new because the subgroups phenotyped by algorithm 1 matched the diagnosed subgroups (see Supplementary Table 10).

Discussion

In this study, we developed two HDP phenotyping algorithms, algorithm 1 and 2. Algorithm 1 identifies HDP patients based on the hypertensive disease history, BP, PU, timing of onset and PE-related clinical conditions. In addition to the factors included in algorithm 1, algorithm 2 conducts phenotyping by considering maternal organ dysfunction and light-for-date, which originate from placental dysfunction in early pregnancy39. In these phenotyping results, 97 (0.44%) subjects were phenotyped into different subgroups by the two algorithms, and they actually had clinical conditions related to maternal organ dysfunction, such as epigastralgia and light-for-date (Supplementary Table 8). The choice of which algorithm to use depends on whether the phenotyping will be performed with the clinical concepts of the American or Japanese clinical guidelines used in algorithm 1 and 2, respectively. The Japanese clinical guidelines of HDP have been repeatedly updated in the past35,36,40,41 and will be continually updated according to the progress of research to understand the pathogenesis of HDP. Therefore, algorithm 2 will be updated continuously according to updates to the Japanese guidelines.

In this study, the number of identified subjects with a hypertensive disease history (115; 0.51%) and underlying hepatic or renal disorders (323; 1.44%) by analysis of unstructured clinical notes was slightly smaller than that in previous reports42,43. This result suggests the possibility of the underestimation of subjects with these conditions because of differences between the diagnoses and the self-reported interviews, which were the sources of the transcribed data regarding disease history.

Of the 22,452 subjects in the BirThree cohort study, the proportions of the phenotyped HDP patients by algorithms 1 and 2 were almost identical to the previously reported prevalence rate of HDP38,44,45.

To evaluate the performances of our developed phenotyping algorithms, we performed chart review by two obstetricians for 252 among 22,452 research subjects (see “Comparison of phenotyped subgroups and diagnoses” section). Among 252 subjects, 203 (80.56%) were identified as normotensive, 13 (5.16%) were as CH, and 36 (14.29%) were as HDP patients. For HDP subgroups, we identified 9 subjects (3.57%) as GH, 15 (5.95%) as PE, and 12 (4.76%) as SPE (Supplementary Table 5).

The prevalence of HDP in the diagnosis dataset was 14.29%, which was higher than that in the full cohort (Supplementary Table 5). The baseline characteristics of the diagnosis dataset were higher risk of HDP in terms such as BMI and age than the full cohort (Supplementary Table 4). These differences may have occurred because the subjects in the diagnosis dataset were recruited at Tohoku University Hospital, which is a leading hospital that actively accepts high-risk cases, including patients with HDP.

In the evaluation of the performances of our developed algorithms, our phenotyping algorithms showed high PPVs, NPVs, accuracies and specificities and modest sensitivities, despite the limitation of data availability of intrapartum and postpartum data (Table 2, 3). In the results comparison between algorithm 1 and diagnosis, 15 of the 252 subjects in the diagnosis dataset were phenotyped into different subgroups of HDP by algorithm 1 (Supplementary Table 9). All of these 15 differences will be resolved by linking EHRs and cohort data19 because all differences were caused by lacking intrapartum and postpartum data and detailed description about headaches in cohort data. In the comparison for algorithm 2, 16 of 252 subjects were phenotyped into different subgroups of HDP (Supplementary Table 10). In particular, three of the sixteen differences are caused by fit to criteria of algorithm 2 added to algorithm 1, and these are not included in the diagnostic criteria. These results show that the major cause of miss-phenotyping was not inadequacy of phenotyping algorithms but differences of data sources between phenotyping and diagnosis. Based on these results, we concluded that there was consistency in clinical concepts between phenotyping algorithms and clinical guidelines. Furthermore, these results showed that our phenotyping algorithms are ready to be applied in HDP phenotyping tasks.

We used the transcribed medical records of the BirThree Cohort study as data sources for the phenotyping algorithms. The advantage of using the BirThree Cohort study data is the high representativeness of over 70% of the pregnant women in the regional population during the study period. This study is premised on the representativeness of the data sources because we evaluated the performances of our algorithms by comparing the number of subjects having HDP between the study population and the prevalence of HDP in previous reports, and this was large enough to compare the proportion of phenotyped subgroups of HDP with the prevalence rates.

Conclusions

We developed two phenotyping algorithms to identify subjects with HDP and their subtypes from large-scale cohort data collection to perform further biomedical analyses. Our algorithms were developed according to American and Japanese guidelines that differ in the criteria for HDP and its subtypes. Compared to the diagnosis, our phenotyping algorithms showed high PPVs (0.96 and 0.90 for algorithm 1 and 2, respectively), NPVs (0.96 for algorithms 1 and 2), accuracies (0.96 and 0.95 for algorithm 1 and 2, respectively) and specificities (1.00 and 0.99 for algorithm 1 and 2, respectively), and modest sensitivities (0.72 and 0.75 for algorithm 1 and 2, respectively), despite the limitation of data availability for intrapartum and postpartum data in the cohort collection. The application of our phenotyping algorithms to large-scale cohort data will substantially accelerate HDP research by facilitating the high-throughput and precise identification of patients with HDP.

Supplementary Information

Supplementary Tables. (173.7KB, pdf)

Acknowledgements

The authors would like to thank all the participants and staff of the TMM Organization, Tohoku University (a full list of members is available at https://www.megabank.tohoku.ac.jp/english/a230901/) and the Iwate TMM Organization. This work was supported by the Ministry of Education, Culture, Sports, Science and Technology (MEXT) KAKENHI (Grant Number 16K20899), the Japan Agency for Medical Research and Development (AMED) (Grant Numbers JP21tm0124005 and JP21tm0424601).

Author contributions

S.M.: participated in the methodology, formal Analysis and writing original draft. M.W. : participated in the methodology, investigation and data curation. S.N. : participated in the methodology and review & editing. M.I. and T.O : participated in the investigation and data curation. G.T., H.T. and N.Y. : participated in the methodology, investigation and supervision. S.K. : participated in the investigation and supervision. M.Y. : participated in the supervision and writing—review & editing. J.S. : participated in the methodology, investigation, data curation and supervision. S.O. : participated in the project administration and writing—review & editing. All authors have seen and approved the final version.

Data availability

The data underlying this article were provided by the Tohoku Medical Megabank project with permission. Data will be shared on request to the Tohoku Medical Megabank project with permission.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-55914-9.

References

  • 1.Wilke RA, Berg RL, Peissig P, Kitchner T, Sijercic B, McCarty CA, McCarty DJ. Use of an electronic medical record for the identification of research subjects with diabetes mellitus. Clin. Med. Res. 2007;5:1–7. doi: 10.3121/cmr.2007.726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.McCarty CA, Chisholm RL, Chute CG, Kullo IJ, Jarvik GP, Larson EB, Li R, Masys DR, Ritchie MD, Roden DM, Struewing JP, Wolf WA. The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Med. Genomics. 2011;4:13. doi: 10.1186/1755-8794-4-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Hripcsak G, Albers DJ. Next-generation phenotyping of electronic health records. J. Am. Med. Inform. Assoc. 2013;20:117–121. doi: 10.1136/amiajnl-2012-001145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gottesman O, Kuivaniemi H, Tromp G, Faucett WA, Li R, Manolio TA, Sanderson SC, Kannry J, Zinberg R, Basford MA, Brilliant M, Carey DJ, Chisholm RL, Chute CG, Connolly JJ, Crosslin D, Denny JC, Gallego CJ, Haines JL, Hakonarson H, Harley J, Jarvik GP, Kohane I, Kullo IJ, Larson EB, McCarty C, Ritchie MD, Roden DM, Smith ME, Böttinger EP, Williams MS. The electronic medical records and genomics (eMERGE) network: Past, present, and future. Genet. Med. 2013;15:761–771. doi: 10.1038/gim.2013.72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Esteban S, Tablado MR, Peper FE, Mahumud YS, Ricci RI, Kopitowski KS, Terrasa SA. Development and validation of various phenotyping algorithms for Diabetes Mellitus using data from electronic health records. Comput. Methods Programs Biomed. 2017;152:53–70. doi: 10.1016/j.cmpb.2017.09.009. [DOI] [PubMed] [Google Scholar]
  • 6.Ritchie MD, Denny JC, Zuvich RL, Crawford DC, Schildcrout JS, Bastarache L, Ramirez AH, Mosley JD, Pulley JM, Basford MA, Bradford Y, Rasmussen LV, Pathak J, Chute CG, Kullo IJ, McCarty CA, Chisholm RL, Kho AN, Carlson CS, Larson EB, Jarvik GP, Sotoodehnia N, Manolio TA, Li R, Masys DR, Haines JL, Roden DM. Genome- and phenome-wide analyses of cardiac conduction identifies markers of arrhythmia risk. Circulation. 2013;127:1377–1385. doi: 10.1161/circulationaha.112.000604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Shameer K, Denny JC, Ding K, Jouni H, Crosslin DR, de Andrade M, Chute CG, Peissig P, Pacheco JA, Li R, Bastarache L, Kho AN, Ritchie MD, Masys DR, Chisholm RL, Larson EB, McCarty CA, Roden DM, Jarvik GP, Kullo IJ. A genome- and phenome-wide association study to identify genetic variants influencing platelet count and volume and their pleiotropic effects. Hum. Genet. 2014;133:95–109. doi: 10.1007/s00439-013-1355-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Li L, Cheng WY, Glicksberg BS, Gottesman O, Tamler R, Chen R, Bottinger EP, Dudley JT. Identification of type 2 diabetes subgroups through topological analysis of patient similarity. Sci. Transl. Med. 2015;7:314. doi: 10.1126/scitranslmed.aaa9364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Shivade C, Raghavan P, Fosler-Lussier E, Embi PJ, Elhadad N, Johnson SB, Lai AM. A review of approaches to identifying patient phenotype cohorts using electronic health records. J. Am. Med. Inform. Assoc. 2014;21:221–230. doi: 10.1136/amiajnl-2013-001935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wei WQ, Leibson CL, Ransom JE, Kho AN, Caraballo PJ, Chai HS, Yawn BP, Pacheco JA, Chute CG. Impact of data fragmentation across healthcare centers on the accuracy of a high-throughput clinical phenotyping algorithm for specifying subjects with type 2 diabetes mellitus. J. Am. Med. Inform. Assoc. 2012;19:219–224. doi: 10.1136/amiajnl-2011-000597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kagawa R, Kawazoe Y, Ida Y, Shinohara E, Tanaka K, Imai T, Ohe K. Development of type 2 diabetes mellitus phenotyping framework using expert knowledge and machine learning approach. J. Diabetes Sci. Technol. 2017;11:791–799. doi: 10.1177/1932296816681584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Teixeira PL, Wei WQ, Cronin RM, Mo H, VanHouten JP, Carroll RJ, LaRose E, Bastarache LA, Rosenbloom ST, Edwards TL, Roden DM, Lasko TA, Dart RA, Nikolai AM, Peissig PL, Denny JC. Evaluating electronic health record data sources and algorithmic approaches to identify hypertensive individuals. J. Am. Med. Inform. Assoc. 2017;24:162–171. doi: 10.1093/jamia/ocw071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Connolly, J. CHOP. ADHD phenotype algorithm. PheKB. https://phekb.org/phenotype/179, 2013 (accessed 28 July 2021).
  • 14.Robinson PN. Deep phenotyping for precision medicine. Hum. Mutat. 2012;33:777–780. doi: 10.1002/humu.22080. [DOI] [PubMed] [Google Scholar]
  • 15.Deo RC. Machine learning in medicine. Circulation. 2015;132:1920–1930. doi: 10.1161/circulationaha.115.001593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wishart DS. Emerging applications of metabolomics in drug discovery and precision medicine. Nat. Rev. Drug Discov. 2016;15:473–484. doi: 10.1038/nrd.2016.32. [DOI] [PubMed] [Google Scholar]
  • 17.Collins FS, Varmus H. A new initiative on precision medicine. N. Engl. J. Med. 2015;372:793–795. doi: 10.1056/NEJMp1500523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ashley EA. The precision medicine initiative: A new national effort. JAMA. 2015;313:2119–2120. doi: 10.1001/jama.2015.3595. [DOI] [PubMed] [Google Scholar]
  • 19.S. Kuriyama, N. Yaegashi, F. Nagami, T. Arai, Y. Kawaguchi, N. Osumi, M. Sakaida, Y. Suzuki, K. Nakayama, H. Hashizume, G. Tamiya, H. Kawame, K. Suzuki, A. Hozawa, N. Nakaya, M. Kikuya, H. Metoki, I. Tsuji, N. Fuse, H. Kiyomoto, J. Sugawara, A. Tsuboi, S. Egawa, K. Ito, K. Chida, T. Ishii, H. Tomita, Y. Taki, N. Minegishi, N. Ishii, J. Yasuda, K. Igarashi, R. Shimizu, M. Nagasaki, S. Koshiba, K. Kinoshita, S. Ogishima, T. Takai-Igarashi, T. Tominaga, O. Tanabe, N. Ohuchi, T. Shimosegawa, S. Kure, H. Tanaka, S. Ito, J. Hitomi, K. Tanno, M. Nakamura, K. Ogasawara, S. Kobayashi, K. Sakata, M. Satoh, A. Shimizu, M. Sasaki, R. Endo, K. Sobue, The Tohoku Medical Megabank Project Study Group, M. Yamamoto The Tohoku medical megabank project: design and mission. J. Epidemiol. 2016;26:493–511. doi: 10.2188/jea.JE20150268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pembrey M. The avon longitudinal study of parents and children (ALSPAC): A resource for genetic epidemiology. Eur. J. Endocrinol. 2004;151:U125–U129. doi: 10.1530/eje.0.151u125. [DOI] [PubMed] [Google Scholar]
  • 21.Fraser A, Macdonald-Wallis C, Tilling K, Boyd A, Golding J, Davey Smith G, Henderson J, Macleod J, Molloy L, Ness A, Ring S, Nelson SM, Lawlor DA. Cohort profile: the avon longitudinal study of parents and children: ALSPAC mothers cohort. Int. J. Epidemiol. 2013;42:97–110. doi: 10.1093/ije/dys066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.World Health Organization . The World Health Report 2005: Make Every Mother and Child Count. Geneva, Switzerland: World Health Organization; 2005. [Google Scholar]
  • 23.American College of Obstetricians Gynecologists Hypertension in pregnancy. Report of the American college of obstetricians and gynecologists’ task force on hypertension in pregnancy. Obstet. Gynecol. 2013;122:1122–1131. doi: 10.1097/01.aog.0000437382.03963.88. [DOI] [PubMed] [Google Scholar]
  • 24.Sibai B, Dekker G, Kupferminc M. Pre-eclampsia. Lancet. 2005;365:785–799. doi: 10.1016/s0140-6736(05)17987-2. [DOI] [PubMed] [Google Scholar]
  • 25.Uzan J, Carbonnel M, Piconne O, Asmar R, Ayoubi JM. Pre-eclampsia: Pathophysiology, diagnosis, and management. Vasc. Health Risk Manag. 2011;7:467–474. doi: 10.2147/vhrm.s20181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Al-Jameil N, Aziz Khan F, Fareed Khan M, Tabassum H. A brief overview of preeclampsia. J. Clin. Med. Res. 2014;6:1–7. doi: 10.4021/jocmr1682w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chaiworapongsa T, Chaemsaithong P, Yeo L, Romero R. Pre-eclampsia part 1: Current understanding of its pathophysiology. Nat. Rev. Nephrol. 2014;10:466–480. doi: 10.1038/nrneph.2014.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Vanek M, Sheiner E, Levy A, Mazor M. Chronic hypertension and the risk for adverse pregnancy outcome after superimposed pre-eclampsia. Int. J. Gynaecol. Obstet. 2004;86:7–11. doi: 10.1016/j.ijgo.2004.03.006. [DOI] [PubMed] [Google Scholar]
  • 29.Magee LA, Pels A, Helewa M, Rey E, von Dadelszen P. Diagnosis, evaluation, and management of the hypertensive disorders of pregnancy. Pregnancy Hypertens. 2014;4:105–145. doi: 10.1016/j.preghy.2014.01.003. [DOI] [PubMed] [Google Scholar]
  • 30.Madazli R, Yuksel MA, Imamoglu M, Tuten A, Oncul M, Aydin B, Demirayak G. Comparison of clinical and perinatal outcomes in early- and late-onset preeclampsia. Arch. Gynecol. Obstet. 2014;290:53–57. doi: 10.1007/s00404-014-3176-x. [DOI] [PubMed] [Google Scholar]
  • 31.Steegers EA, von Dadelszen P, Duvekot JJ, Pijnenborg R. Pre-eclampsia. Lancet. 2010;376:631–644. doi: 10.1016/s0140-6736(10)60279-6. [DOI] [PubMed] [Google Scholar]
  • 32.Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, Motyer A, Vukcevic D, Delaneau O, O’Connell J, Cortes A, Welsh S, Young A, Effingham M, McVean G, Leslie S, Allen N, Donnelly P, Marchini J. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–209. doi: 10.1038/s41586-018-0579-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.The All of Us Research Program Investigators The "all of us" research program. N. Engl. J. Med. 2019;381:668–676. doi: 10.1056/NEJMsr1809937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.National high blood pressure education program working group report on high blood pressure in pregnancy, Am. J. Obstet. Gynecol. 163: 1691–1712. 10.1016/0002-9378(90)90653-o (1990). [DOI] [PubMed]
  • 35.Brown MA, Lindheimer MD, de Swiet M, Van Assche A, Moutquin JM. The classification and diagnosis of the hypertensive disorders of pregnancy: Statement from the international society for the study of hypertension in pregnancy (ISSHP), Hypertens. Pregnancy. 2001;20:IX–XIV. doi: 10.1081/prg-100104165. [DOI] [PubMed] [Google Scholar]
  • 36.Watanabe K, Naruse K, Tanaka K, Metoki H, Suzuki Y. Outline of definition and classification of "pregnancy induced hypertension (PIH)". Hypertens. Res. Pregnancy. 2013;1:3–4. doi: 10.14390/jsshp.1.3. [DOI] [Google Scholar]
  • 37.Watanabe K, Matsubara K, Nakamoto O, Ushijima J, Ohkuchi A, Koide K, Makino S, Mimura K, Morikawa M, Naruse K, Tanaka K, Nohira T, Metoki H, Takeda S, Seki H, Takagi K, Yamasaki M, Ichihara A, Kimura T, Saito S. New definition and classification of “hypertensive disorders of pregnancy (HDP)". Hypertens. Res. Pregnancy. 2017;5:39–40. doi: 10.14390/jsshp.HRP2017-018. [DOI] [Google Scholar]
  • 38.Sugawara J, Ishikuro M, Obara T, Onuma T, Murakami K, Kikuya M, Ueno F, Noda A, Mizuno S, Kobayashi T, Hamanaka Y, Suzuki K, Kodama E, Tsuchiya N, Uruno A, Suzuki Y, Tanabe O, Kiyomoto H, Tsuboi A, Shimizu A, Koshiba S, Minegishi N, Ogishima S, Tamiya G, Metoki H, Hozawa A, Fuse N, Kinoshita K, Kure S, Yaegashi N, Kuriyama S, Yamamoto M. Maternal baseline characteristics and perinatal outcomes: the Tohoku Medical Megabank project birth and three-generation cohort study. J. Epidemiol. 2020 doi: 10.2188/jea.JE20200338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Gyselaers W. Preeclampsia is a syndrome with a cascade of pathophysiologic events. J. Clin. Med. 2020;9:2245. doi: 10.3390/jcm9072245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Japan Society for the Study of Toxemia of Pregnancy, Historical Perspective of Study of Pregnancy-Induced Hypertension in Japan, Medical View Co., Ltd., Tokyo, (2005).
  • 41.Japan Society for the Study of Hypertension in Pregnancy, Guideline 2009 for Care and Treatment of Hypertension in Pregnancy (PIH), Medical View Co., Ltd., Tokyo, (2009).
  • 42.Bateman BT, Bansil P, Hernandez-Diaz S, Mhyre JM, Callaghan WM, Kuklina EV. Prevalence, trends, and outcomes of chronic hypertension: A nationwide sample of delivery admissions. Am. J. Obstet. Gynecol. 2012;206(134):e1–8. doi: 10.1016/j.ajog.2011.10.878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Munkhaugen J, Lydersen S, Romundstad PR, Widerøe TE, Vikse BE, Hallan S. Kidney function and future risk for adverse pregnancy outcomes: A population-based study from HUNT II, Norway. Nephrol. Dial. Transplant. 2009;24:3744–3750. doi: 10.1093/ndt/gfp320. [DOI] [PubMed] [Google Scholar]
  • 44.Villar J, Carroli G, Wojdyla D, Abalos E, Giordano D, Ba'aqeel H, Farnot U, Bergsjø P, Bakketeig L, Lumbiganon P, Campodónico L, Al-Mazrou Y, Lindheimer M, Kramer M. Preeclampsia, gestational hypertension and intrauterine growth restriction, related or independent conditions? Am. J. Obstet. Gynecol. 2006;194:921–931. doi: 10.1016/j.ajog.2005.10.813. [DOI] [PubMed] [Google Scholar]
  • 45.Seely EW, Ecker J. Clinical practice. Chronic hypertension in pregnancy. N. Engl. J. Med. 2011;365:439–446. doi: 10.1056/NEJMcp0804872. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Tables. (173.7KB, pdf)

Data Availability Statement

The data underlying this article were provided by the Tohoku Medical Megabank project with permission. Data will be shared on request to the Tohoku Medical Megabank project with permission.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES