Skip to main content
Journal of the American Medical Informatics Association : JAMIA logoLink to Journal of the American Medical Informatics Association : JAMIA
. 2024 Aug 10;31(11):2440–2446. doi: 10.1093/jamia/ocae182

PheMIME: an interactive web app and knowledge base for phenome-wide, multi-institutional multimorbidity analysis

Siwei Zhang 1, Nick Strayer 2, Tess Vessels 3, Karmel Choi 4,5, Geoffrey W Wang 6, Yajing Li 7, Cosmin A Bejan 8, Ryan S Hsi 9, Alexander G Bick 10, Digna R Velez Edwards 11, Michael R Savona 12, Elizabeth J Phillips 13,14, Jill M Pulley 15, Wesley H Self 16, Wilkins Consuelo Hopkins 17, Dan M Roden 18, Jordan W Smoller 19,20,21, Douglas M Ruderfer 22,23,24, Yaomin Xu 25,26,
PMCID: PMC11491640  PMID: 39127052

Abstract

Objectives

To address the need for interactive visualization tools and databases in characterizing multimorbidity patterns across different populations, we developed the Phenome-wide Multi-Institutional Multimorbidity Explorer (PheMIME). This tool leverages three large-scale EHR systems to facilitate efficient analysis and visualization of disease multimorbidity, aiming to reveal both robust and novel disease associations that are consistent across different systems and to provide insight for enhancing personalized healthcare strategies.

Materials and Methods

PheMIME integrates summary statistics from phenome-wide analyses of disease multimorbidities, utilizing data from Vanderbilt University Medical Center, Mass General Brigham, and the UK Biobank. It offers interactive and multifaceted visualizations for exploring multimorbidity. Incorporating an enhanced version of associationSubgraphs, PheMIME also enables dynamic analysis and inference of disease clusters, promoting the discovery of complex multimorbidity patterns. A case study on schizophrenia demonstrates its capability for generating interactive visualizations of multimorbidity networks within and across multiple systems. Additionally, PheMIME supports diverse multimorbidity-based discoveries, detailed further in online case studies.

Results

The PheMIME is accessible at https://prod.tbilab.org/PheMIME/. A comprehensive tutorial and multiple case studies for demonstration are available at https://prod.tbilab.org/PheMIME_supplementary_materials/. The source code can be downloaded from https://github.com/tbilab/PheMIME.

Discussion

PheMIME represents a significant advancement in medical informatics, offering an efficient solution for accessing, analyzing, and interpreting the complex and noisy real-world patient data in electronic health records.

Conclusion

PheMIME provides an extensive multimorbidity knowledge base that consolidates data from three EHR systems, and it is a novel interactive tool designed to analyze and visualize multimorbidities across multiple EHR datasets. It stands out as the first of its kind to offer extensive multimorbidity knowledge integration with substantial support for efficient online analysis and interactive visualization.

Keywords: PheWAS, multimorbidity, interactive network analysis, UK Biobank, interoperability and reproducibility, electronic health records (EHRs)

Introduction

The global rise of multimorbidity, the simultaneous occurrence of several diseases within an individual, poses a significant challenge to public health and healthcare providers.1–4 In-depth understanding of intricate multimorbidity can uncover the complex interactions between diseases that affect their course, severity, and the patient's treatment response.5–8 Moreover, it may reveal shared molecular mechanisms among various diseases,9–11 offering opportunities for innovative prevention strategies, targeted interventions, and personalized treatments for patients with multimorbidities.12–15

Large-scale electronic health record (EHR) systems, linked to comprehensive health and clinical data, have significantly enhanced the statistical power to examine robust multimorbidity patterns that mirror real-world scenarios.7,11,16–19 Network analysis has been a powerful approach to decipher complex multimorbidity patterns.20–22 Our recent work has demonstrated the interoperability of EHR-based multimorbidities and multimorbidity networks when compared across distinct EHR systems using standardized diagnostic codes (eg, ICD9 or ICD10).23,24

Integrating multiple EHR systems can significantly improve the generalizability and robustness of multimorbidity characterization by identifying commonalities and differences across various populations. However, current standards for quantifying, representing, and analyzing multimorbidity patterns, especially for phenome-wide analyses leveraging large-scale EHRs, are still lacking.1,25

In this study, we introduce the Phenome-wide Multi-Institutional Multimorbidity Explorer (PheMIME), an interactive web application developed using the R programming language and the Shiny library.26 PheMIME enables researchers to interactively explore, compare, and discover multimorbidity patterns using a phenome-wide multimorbidity knowledge base compiled from three major EHR databases: Vanderbilt University Medical Center (VUMC), Mass General Brigham Hospital (MGB), and UK Biobank (UKB).24 With the selection of a specific disease phenotype of interest, users can directly compare multimorbidities between different institutions, conduct a network analysis based on individual or multi-institution combined multimorbidity network, and visualize dynamic clustering of significant multimorbidities by applying filters.

PheMIME provides a rich, explorable view of the multimorbidity patterns and relationships among diseases across multiple institutions. It can facilitate the validation of novel disease associations, the discovery of robust disease multimorbidity patterns, and potentially reveal new insights for future investigation. We demonstrate PheMIME’s analytical capabilities and customization options through a schizophrenia case study. Additional detailed case studies available online further showcase diverse multimorbidity-based discovery schemes.

Methods

Data integration and multimorbidity knowledge base

For our analysis, we accessed three EHR databases that included individual-level data for 250 000 random patients each from the VUMC and MGB EHR systems, as well as data from 431 105 subjects in the UKB. For UKB, in-hospital records field-41270, which records the distinct diagnosis codes (ICD 10) a participant has had recorded across all the hospital inpatient records in either the primary or secondary position, was used for the analysis. The ICD 10 codes were then mapped to phecodes27,28 where a phecode was denoted as an occurrence for an individual if it appeared at least once in a UKB subject’s collapsed record across all hospital inpatient records, or if occurred two or more times in a patient’s record for either VUMC or MGB. Specifically, a total of 1815 phecodes were identified across the dataset. Broken down by resource, VUMC, MGB, and UKB reported 1815, 1805 and 1484 phecodes, respectively. The most significant factor contributing to the variation in phecode counts is the difference in population characteristics between institutions. VUMC and MGB primarily serve as hospitals, where subjects are predominantly patients seeking medical care. In contrast, the UKB represents a general population cohort, comprising individuals from the community who may or may not have a history of medical conditions.29 Despite these variations, our comprehensive analysis revealed in total 1 485 133 pairs of phecodes that demonstrated overlap and relationships, highlighting the connections between different phecodes within our datasets. Logistic regressions adjusting for patient age at last recorded visit, sex, race, and the number of unique phecodes present in patients’ records were run for each pair of two phecodes with two conditions of either phecode A or phecode B regarded as the outcome and the other one treated as the independent variable.22,24 The averaged test statistic from the two regression analyses is then used as an estimate for the multimorbidity strength between the phecode pair A and B. Multimorbidity strengths of all pairs are subsequently calculated and used to construct a phenome-wide multimorbidity network, which represents an undirected weighted network with disease as nodes and disease-disease connections as edges, weighted by the multimorbidity strengths. In addition, Pearson correlation of the common multimorbidity patterns between each pair of phecodes is used as a similarity score, which was also used to generate another undirected weighted network using the similarity scores as weights with disease as nodes and connections as edges.24 We call this a multimorbidity similarity network. We observed that the similarities in multimorbidity patterns among diseases significantly correlate with their genetic correlations,24 suggesting that multimorbidity similarity can serve as a valuable indicator of potential shared etiology. We finally consolidate all the summarized data from three institutions into a database. This serves as a knowledge base to fuel PheMIME for phenome-wide multimorbidity analysis.

Design scheme

PheMIME is an interactive web-based tool designed to enable clinical researchers to explore complex multimorbidity patterns across large-scale EHR data, translating robust findings into actionable insights. Strayer et al demonstrated the interoperability of network analysis methods for phenome-wide multimorbidity data.24 Building upon this, PheMIME addresses the challenges of analyzing extensive multimorbidity datasets while enabling focused exploration relevant to specific clinical areas or context. It harnesses the advantages of a phenome-wide approach, using phecodes to map to real-world clinical vocabularies, which enhances interpretability. Users can select phenotypes of interest and investigate complex patterns in a comprehensive and customizable manner. Its dual approach, combining broad-scale visualization with deeper dives into specific multimorbidity patterns, facilitates the efficient extraction of meaningful insights. Additionally, PheMIME supports researchers to validate findings across different institutions, illustrating their reproducibility.

PheMIME incorporates five primary modules: (1) “Disease Selection” module facilitates an interactive phecode table where users can readily search, filter, and select a disease phecode of interest. (2) “Multimorbidity Consistency Inspection” module enables users to assess the overall consistency of multimorbidity strength measurements from the knowledge base and compare them across multiple institutions. Additionally, this module incorporates features that underscore significant multimorbidity strengths linked to the selected phecode of interest, aids in assessing their distribution amidst all multimorbidity measurements, and enables comparison across institutions. (3) “Multimorbidity Network Visualization” module presents interactive visual representations of the multimorbidity networks constructed based on the multimorbidity strength measurements and allows users to examine and compare these networks. This module integrates associationSubgraphs20 as an advanced tool for dynamic network clustering and visualization (detailed in the “Dynamic Network Clustering of Multimorbidities With AssociationSubgraphs” section). This module permits exploration of the network's subgraph structures and dynamic clustering for any multimorbidity network from a single institution or an amalgamation of multiple institutions. Moreover, this module enables users to apply filters and emphasize any significant multimorbidities and investigate their interconnections and enriched subgraphs. (4) “Reproducible Multimorbidities Exploration” module provides an interactive environment for examining a customizable subset of significant and/or reproducible multimorbidities across the institutions based on the multimorbidity strength measurements hosted in the knowledge base. This interface allows users to visualize the interconnections among chosen phecodes and the enriched multimorbidity subgraphs within the combined multimorbidity networks. Furthermore, this module accommodates pairwise comparisons between all institutions. (5) “Multimorbidity Similarities Exploration” module, much like the preceding one, uses multimorbidity similarity measurements as the strength measurement. It permits visualization of interconnected phecodes and the multimorbidity subgraphs enriched in the combined multimorbidity similarity networks and enables pairwise comparisons between all institutions.

PheMIME is built using the R programming language, the JavaScript D3.js library, and the Shiny library. To manage extensive computations, we employ parallel processing and cloud computing resources (AWS EC2 instance, 64 cores, 500 GB memory). Data storage is handled through AWS S3. The platform dynamically recalculates the network based on user input, allowing real-time customization of network structure and enabling flexible analyses tailored to specific multimorbidity discovery needs.

Dynamic network clustering of multimorbidities with associationSubgraphs

In PheMIME, we integrate a dynamic network clustering technique called the associationSubgraphs.20 This is an interactive clustering approach that identifies subgraphs, or disease clusters, in multimorbidity networks by leveraging the principles of threshold graph construction.24 The algorithm initiates by sorting the edges of the network, which represent disease-disease connections, in descending order of their comorbidity strength. The strongest link forms the cores of the initial cluster, with subsequent edges added iteratively. As each edge is incorporated, its nodes are evaluated: if one node is already part of a cluster, the other node joins the same cluster; if both nodes are unconnected, they start a new cluster; if each node belongs to a different cluster, those clusters merge. The algorithm meticulously records the state of clusters after each addition, enabling the complete tracking of how clusters evolve over time.

To establish the optimal point for halting the clustering process, we apply the “largest-smallest” rule, which identifies when well-defined clusters begin transitioning into a large, connected component, indicating the onset of structural randomness. For further details on this methodological approach, see Strayer et al.20,24

Our implementation of the associationSubgraphs algorithm demonstrates its effectiveness in dynamically clustering diseases based on varying comorbidity strengths. This method offers nuanced insights into the hierarchical structure of disease clusters and their patterns of amalgamation at different comorbidity thresholds. Importantly, this process identifies expansive clusters even in scenarios where diseases do not show densely connected or uniformly strong pairwise comorbidities. Such capabilities distinguish this technique from traditional community detection methods, which may falter under these sparse conditions.

Results

Analysis outline using schizophrenia as an example

This section outlines the workflow of PheMIME to explore phenome-wide multimorbidity patterns among various analysis modules provided. Starting with phecode 295.10 (schizophrenia), PheMIME generates an interactive Manhattan plot, enabling users to select phecodes based on their pairwise multimorbidity strength, also known as comorbidity strength (Figure 1A). A scatter plot for comparing between two systems is also generated, helping users to select robust multimorbidities that exhibit reproducible patterns across systems (Figure 1A). The interactive features of the Manhattan and Scatter plots enable the selection and highlighting of a consistent set of the same phecodes, based on the magnitude (Manhattan plot) and consistency (scatter plot) of disease multimorbidities. For instance, the Manhattan plot might reveal phecodes highly co-occurring with schizophrenia, whereas the scatter plot could highlight phecodes that are either highly consistent across institutions or remain exclusive to individual institutions. The data table in Figure 1B shows comorbid phecodes of schizophrenia, its description, disease categories and corresponding multimorbidity strengths among three institutions. This table is interactive with the Manhattan and Scatter plots, allowing users to add or remove phecodes by clicking on the rows in the table. If a disease multimorbidity exhibits both a large magnitude and high consistency across different systems, it strongly indicates a robust disease multimorbidity across the systems.

Figure 1.

Figure 1.

Reproducible multimorbidity exploration analysis outline for schizophrenia. (A) Interactive Manhattan and scatter plot enable users to select top-co-occurred and consistent disease phenotypes. (B) Interactive table allows users to add or remove disease phenotypes by clicking on the rows in the table and incorporate the updated phenotypes into the final selection by clicking the Update Manhattan/Scatter button, highlighting the user-selected phenotypes. (C) Enhanced associationSubgraphs provides dynamic network analysis to explore disease clusters, highlighting the user-selected phenotypes from part A and B, and their corresponding subgraphs.

For dynamic network analysis, the associationSubgraphs method20 has been enhanced to provide an interactive visualization to rapidly explore subgraph structures of schizophrenia multimorbidities in the selected phecodes that are both significant and reproducible across institutions (filled circles in Figure 1A and B). As shown in Figure 1C, network nodes are annotated into two groups, with the selected phecodes color-filled based on disease categories and the other unselected phecodes (nodes) color-filled in grey. As expected, a major subgraph enriching the selected schizophrenia multimorbidity phecodes is mental disorders, with other subgraphs showing enrichment in infectious diseases and neoplasms.

PheMIME also provides an expedient and accessible method for comparative analyses of multimorbidity patterns across different populations. Using schizophrenia again as an example, we discovered a strong comorbidity with viral hepatitis B and C (as detailed in the online materials), a finding in line with prior research.30,31 Yet, a detailed examination into multimorbidity patterns revealed more consistency of comorbidity within the patient cohorts of VUMC and MGB, with the comorbidity intensities being notably stronger in these cohorts as compared to the general UKB cohort, especially in the case of viral hepatitis B. This distinction could underscore heterogeneity due to population differences—consistent with the observation of higher prevalence of blood-borne viral infections for patients with serious mental illness compared to the general population.32 However, the exact mechanisms behind viral hepatitis and schizophrenia comorbidity remain elusive, the observed disparities among different cohorts could motivate additional investigation regarding the mechanisms and impact of population differences on disease multimorbidity patterns.

Exploring context-specific phenome-wide disease clusters

We implemented the associationSubgraphs algorithm in PheMIME to identify disease clusters in multimorbidity networks.20 This method enables systematic discovery of disease clusters by allowing users to dynamically adjust the threshold for each unique comorbidity strength value, providing a complete view of hierarchical clustering process across all levels of comorbidity strengths from low to high. The effectiveness of associationSubgraphs has been demonstrated by its ability to identify prominent disease condition clusters consistent with previously reported findings. For further details, refer to the online Supplementary material—“Case Study: Identification of Robust Disease Condition Clusters”, and Strayer et al.24

Here, we illustrate how PheMIME facilitates the exploration of robust disease clusters provided a specific disease phenotype of interest, using statistical significance and multimorbidity correlation as criteria. Using schizophrenia as an example, users can uncover the following insights:

  • 1) Identify multimorbidities associated with schizophrenia across different systems and explore disease clusters at specified statistical significance levels through the Multimorbidity Network Visualization module.

  • 2) Identify consistent disease phenotypes based on comorbidity strength as shown in the Manhattan and Scatter plot through the Reproducible Multimorbidities Exploration module and visualize the disease subgraphs using associationSubgraphs.

  • 3) Similarly, identify consistent disease phenotypes based on multimorbidity similarities in the Manhattan and Scatter plot through the Multimorbidity Similarities Exploration module and visualize the disease subgraphs using associationSubgraphs.

  • 4) Supplement these findings with arbitrary disease phenotypes of interest by selecting them from the customizable table through the Reproducible Multimorbidities Exploration or Multimorbidity Similarities Exploration modules. Incorporate them into the final selection by clicking the Update Manhattan/Scatter button.

associationSubgraphs analysis enables the identification of clinically relevant disease clusters within specific disease contexts. Prior research suggests that individuals with schizophrenia may have a slightly elevated risk for certain cancers, such as colon and breast cancer. Our findings confirm the enrichment of cancer-related subgraphs. Figure 2 displays the consistent association of these multimorbidities with schizophrenia across multiple systems, with significant comorbidity defined at a P value cutoff of .05. Network nodes, color-coded by disease categories, highlight phenotypes significantly associated with schizophrenia, including colon, breast, and prostate cancer, known to correlate with schizophrenia.33–36 Additionally, we observed an unusual pattern with skin cancer within this subgraph, rarely explored in previous studies, potentially offering novel insights.

Figure 2.

Figure 2.

Schizophrenia-associated disease cluster enriched in cancers across multiple institutions. (A) Consistent presence of disease cluster strongly associated with schizophrenia in VUMC. (B) Consistent presence of disease cluster enriched in colon cancer and breast cancer strongly associated with schizophrenia in MGB. (C) Consistent presence of disease cluster enriched in colon cancer and skin cancer strongly associated with schizophrenia in UKB.

Discussion

The development of PheMIME significantly enhances research in exploring and comparing multimorbidity patterns across multiple institutions. This tool meets the critical need for comprehensive analysis of multimorbidity patterns by integrating data from various institutions, thereby transforming the approach of researchers and clinicians to the study of multimorbidity diseases. A key strength of PheMIME lies in its ability to integrate and visualize large-scale EHR data from diverse populations, which is crucial for developing effective prevention strategies and targeted interventions. Moreover, it offers efficient, user-customizable online visualization, enabling researchers to perform dynamic analysis and inference of disease clusters.

In this study, we utilize phecodes27,28 to harness the benefits of a phenome-wide approach, aligning with our goal to enable straightforward interpretation by clinicians engaged in data-driven discoveries. Phecodes map ICD codes into clinically meaningful categories, facilitating the exploration of disease relationships across the entire phenome. While specialized solutions like OHDSI’s Phenotype Library37 and PheKB38 offer greater precision, our phecode-based approach prioritizes a user-friendly interface and direct links to real-world clinical vocabularies. This design choice improves PheMIME’s usability and potential for EHR data-driven clinical translational research and continuous learning in healthcare, especially for researchers who may find more complex solutions less accessible.

However, our study has limitations. While PheMIME encompasses phenome-wide analysis, exploring disease-disease networks interconnected through common genes, proteins, or metabolites is also vital. This exploration could uncover shared biomolecular mechanisms, potentially identifying targets for drug repurposing and contributing to precision medicine. Additionally, the current dataset, though extensive, is limited to three major institutions, potentially impacting the generalizability of our findings. Future enhancements of PheMIME could involve expanding the database to encompass more diverse populations and healthcare settings, thus augmenting the robustness of the analysis.

Conclusion

We have introduced PheMIME, an interactive visualization tool specifically designed for analyzing multimorbidities across multiple EHR datasets, which simultaneously presents an extensive multimorbidity knowledge base consolidating data from three major EHR systems. Our intention is for users to utilize PheMIME to detect and extract meaningful disease multimorbidities, and to compare and validate them across a variety of institutions. To our understanding, PheMIME is the first knowledge base of its kind, integrating and comparing data from multiple extensive EHR systems while providing substantial support for efficient online analysis and interactive visualization to facilitate the discovery of complex multimorbidity patterns.

Acknowledgments

We are deeply grateful for the insightful discussions with Kedir N. Turi, Sharon E. Phillips, Xiaopeng Sun, Lydia Yao, Eric Chen, Brian Sharber, and Matt Krantz, which significantly enriched the development and direction of this work. Their input and expertise were invaluable and greatly appreciated.

Contributor Information

Siwei Zhang, Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Nick Strayer, Posit PBC, Boston, MA 02210, United States.

Tess Vessels, Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

Karmel Choi, Psychiatric & Neuro Developmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, United States; Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA 02114, United States.

Geoffrey W Wang, Department of Statistics, North Carolina State University, Raleigh, NC 27695, United States.

Yajing Li, Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Cosmin A Bejan, Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Ryan S Hsi, Department of Urology, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

Alexander G Bick, Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

Digna R Velez Edwards, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

Michael R Savona, Division of Hematology and Oncology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

Elizabeth J Phillips, Center for Drug Safety and Immunology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States; Institute for Immunology and Infectious Diseases, Murdoch University, Murdoch, WA 6150, Australia.

Jill M Pulley, Vanderbilt Institute for Clinical and Translational Science, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Wesley H Self, Vanderbilt Institute for Clinical and Translational Science, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Wilkins Consuelo Hopkins, Vanderbilt Institute for Clinical and Translational Science, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Dan M Roden, Department of Pharmacology, Vanderbilt University Medical Center, Nashville, TN 37232, United States.

Jordan W Smoller, Psychiatric & Neuro Developmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, United States; Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA 02114, United States; Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA 02142, United States.

Douglas M Ruderfer, Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States; Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN 37212, United States.

Yaomin Xu, Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, United States; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States.

Author contributions

Yaomin Xu conceived the idea and designed the study, Siwei Zhang and Nick Strayer primarily conducted the major statistical analysis and played a key role in the design and development of the web application. Tess Vessels provided assistance in data analysis and contributed significantly to the interpretation of the genetic data analysis. Siwei Zhang prepared the initial draft of the manuscript, and Yaomin Xu was crucial in its revision. Yaomin Xu and Douglas M. Ruderfer offered significant guidance in both the design of the study and the data collection process. All authors reviewed, provided valuable feedback, and gave their final approval for the published version of the manuscript.

Funding

N.S. and Y.X. are supported by the Vanderbilt University Department of Biostatistics Development Award; Y.X., C.B., and R.H. are supported by R21DK127075; Y.X., D.E., E.P., and D.R. are supported by P50GM115305; J.W.S. is supported in part by R01 MH118233. The Vanderbilt University Medical Center dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center’s SD/BioVU, which is supported by numerous sources: institutional funding, private agencies, and federal grants. These include the NIH funded Shared Instrumentation Grant S10RR025141; and CTSA grants UL1TR002243, UL1TR000445, and UL1RR024975. Genomic data are also supported by investigator-led projects that include U01HG004798, R01NS032830, RC2GM092618, P50GM115305, U01HG006378, U19HL065962, R01HD074711; and additional funding sources listed at https://victr.vanderbilt.edu/pub/biovu/. This research has been conducted using the UK Biobank Resource under Application Number 43397.

Conflicts of interest

None declared.

Data availability

Individual-level data are not shared due to ethical and privacy constraints. However, all source code used in the analysis is publicly available on our GitHub repository at https://github.com/tbilab/PheMIME. Additionally, summary data from the multimorbidity analyses can be accessed through the PheMIME App, available at https://prod.tbilab.org/PheMIME/.

Author declarations

All relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. I have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE, or other checklist(s) as supplementary files, if applicable.

References

  • 1. Pearson-Stuttard J, Ezzati M, Gregg EW.. Multimorbidity—a defining challenge for health systems. Lancet Public Health. 2019;4(12):e599-e600. 10.1016/S2468-2667(19)30222-1 [DOI] [PubMed] [Google Scholar]
  • 2. Morris JE, Roderick PJ, Harris S, et al. Treatment burden for patients with multimorbidity: cross-sectional study with exploration of a single-item measure. Br J Gen Pract. 2021;71(706):e381-e390. 10.3399/BJGP.2020.0883 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Hounkpatin HO, Roderick P, Harris S, et al. Change in treatment burden among people with multimorbidity: a follow-up survey. Br J Gen Pract. 2022;72(724):e816-e824. 10.3399/BJGP.2022.0103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Skou ST, Mair FS, Fortin M, et al. Multimorbidity. Nat Rev Dis Primers. 2022;8(1):48. 10.1038/s41572-022-00376-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Rodrigues LP, Vissoci JRN, França DG, et al. Multimorbidity patterns and hospitalisation occurrence in adults and older adults aged 50 years or over. Sci Rep. 2022;12(1):11643. 10.1038/s41598-022-15723-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Pati S, Swain S, Metsemakers J, Knottnerus JA, van den Akker M.. Pattern and severity of multimorbidity among patients attending primary care settings in Odisha, India. PLoS One. 2017;12(9):e0183966. 10.1371/journal.pone.0183966 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. McQueenie R, Foster HME, Jani BD, et al. Multimorbidity, polypharmacy, and COVID-19 infection within the UK Biobank cohort. PLoS One. 2020;15(8):e0238091. 10.1371/journal.pone.0238091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Chong LW, Hsu CC, Lee CY, et al. Association of viral hepatitis and bipolar disorder: a nationwide population-based study. J Transl Med. 2018;16(1):173. 10.1186/s12967-018-1542-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Fabbri E, Zoli M, Gonzalez-Freire M, Salive ME, Studenski SA, Ferrucci L.. Aging and multimorbidity: new tasks, priorities, and frontiers for integrated gerontological and clinical research. J Am Med Dir Assoc. 2015;16(8):640-647. 10.1016/j.jamda.2015.03.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Koch L. Shared genetic components of multimorbidity. Nat Rev Genet. 2021;22(10):624. 10.1038/s41576-021-00402-3 [DOI] [PubMed] [Google Scholar]
  • 11. Dong G, Feng J, Sun F, Chen J, Zhao XM.. A global overview of genetically interpretable multimorbidities among common diseases in the UK Biobank. Genome Med. 2021;13(1):110. 10.1186/s13073-021-00927-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Smith SM, Wallace E, Clyne B, Boland F, Fortin M.. Interventions for improving outcomes in patients with multimorbidity in primary care and community setting: a systematic review. Syst Rev. 2021;10(1):271. 10.1186/s13643-021-01817-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Nicholson K, Makovski TT, Griffith LE, Raina P, Stranges S, van den Akker M.. Multimorbidity and comorbidity revisited: refining the concepts for international health research. J Clin Epidemiol. 2019;105:142-146. 10.1016/j.jclinepi.2018.09.008 [DOI] [PubMed] [Google Scholar]
  • 14. Jakovljević M, Ostojić L.. Comorbidity and multimorbidity in medicine today: challenges and opportunities for bringing separated branches of medicine closer to each other. Psychiatr Danub. 2013;25(Suppl 1):18-28. [PubMed] [Google Scholar]
  • 15. Kuan V, Denaxas S, Patalay P, Multimorbidity Mechanism and Therapeutic Research Collaborative (MMTRC), et al. Identifying and visualising multimorbidity and comorbidity patterns in patients in the English National Health Service: a population-based study. Lancet Digit Health. 2023;5(1):e16-e27. 10.1016/S2589-7500(22)00187-X [DOI] [PubMed] [Google Scholar]
  • 16. van Driel MA, Bruggeman J, Vriend G, Brunner HG, Leunissen JAM.. A text-mining analysis of the human phenome. Eur J Hum Genet. 2006;14(5):535-542. 10.1038/sj.ejhg.5201585 [DOI] [PubMed] [Google Scholar]
  • 17. Calvin CM, Conroy MC, Moore SF, Kuźma E, Littlejohns TJ.. Association of multimorbidity, disease clusters, and modification by genetic factors with risk of dementia. JAMA Netw Open. 2022;5(9):e2232124. 10.1001/jamanetworkopen.2022.32124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Hanlon P, Jani BD, Nicholl B, Lewsey J, McAllister DA, Mair FS.. Associations between multimorbidity and adverse health outcomes in UK Biobank and the SAIL Databank: a comparison of longitudinal cohort studies. PLoS Med. 2022;19(3):e1003931. 10.1371/journal.pmed.1003931 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Hassaine A, Canoy D, Solares JRA, et al. Learning multimorbidity patterns from electronic health records using non-negative matrix factorisation. J Biomed Inform. 2020;112:103606. 10.1016/j.jbi.2020.103606 [DOI] [PubMed] [Google Scholar]
  • 20. Strayer N, Zhang S, Yao L, et al. Interactive network-based clustering and investigation of multimorbidity association matrices with associationSubgraphs. Bioinformatics. 2023;39(1):btac780. 10.1093/bioinformatics/btac780 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Fotouhi B, Momeni N, Riolo MA, Buckeridge DL.. Statistical methods for constructing disease comorbidity networks from longitudinal inpatient data. Appl Netw Sci. 2018;3(1):46. 10.1007/s41109-018-0101-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Aguado A, Moratalla-Navarro F, López-Simarro F, Moreno V.. MorbiNet: multimorbidity networks in adult general population. Analysis of type 2 diabetes mellitus comorbidity. Sci Rep. 2020;10(1):2416. 10.1038/s41598-020-59336-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Vessels TJ, Strayer NJ, Choi KW, et al. Integrating electronic health records and polygenic risk to identify genetically unrelated comorbidities of schizophrenia that may be modifiable. Biol Psychiatry Glob Open Sci. 2024;4(3):100297. 10.1016/j.bpsgos.2024.100297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Strayer N, Vessels TJ, Choi KW, et al. Interoperability of phenome-wide multimorbidity patterns: a comparative study of two large-scale EHR systems. medRxiv. Published online January 1, 2024;2024.03.28.24305045. 10.1101/2024.03.28.24305045 [DOI] [Google Scholar]
  • 25. Ho ISS, Azcoaga-Lorenzo A, Akbari A, et al. Variation in the estimated prevalence of multimorbidity: systematic review and meta-analysis of 193 international studies. BMJ Open. 2022;12(4):e057017. 10.1136/bmjopen-2021-057017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Chang W, Cheng J, Allaire J, et al. shiny: Web Application Framework for R. https://shiny.rstudio.com/.
  • 27. Denny JC, Ritchie MD, Basford MA, et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics. 2010;26(9):1205-1210. 10.1093/bioinformatics/btq126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Carroll RJ, Bastarache L, Denny JC.. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics. 2014;30(16):2375-2376. 10.1093/bioinformatics/btu197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Fry A, Littlejohns TJ, Sudlow C, et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am J Epidemiol. 2017;186(9):1026-1034. 10.1093/aje/kwx246 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Leucht S, Burkard T, Henderson J, Maj M, Sartorius N.. Physical illness and schizophrenia: a review of the literature. Acta Psychiatr Scand. 2007;116(5):317-333. 10.1111/j.1600-0447.2007.01095.x [DOI] [PubMed] [Google Scholar]
  • 31. Lluch E, Miller BJ.. Rates of hepatitis B and C in patients with schizophrenia: a meta-analysis. Gen Hosp Psychiatry. 2019;61:41-46. 10.1016/j.genhosppsych.2019.10.007 [DOI] [PubMed] [Google Scholar]
  • 32. Hughes E, Bassi S, Gilbody S, Bland M, Martin F.. Prevalence of HIV, hepatitis B, and hepatitis C in people with severe mental illness: a systematic review and meta-analysis. Lancet Psychiatry. 2016;3(1):40-48. 10.1016/S2215-0366(15)00357-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Kim S, Nam S.. The causal relationship of colorectal cancer on schizophrenia: a Mendelian randomization study. Medicine (Baltimore) 2023;102(40):E35517. 10.1097/MD.0000000000035517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Lu D, Song J, Lu Y, et al. A shared genetic contribution to breast cancer and schizophrenia. Nat Commun. 2020;11(1):4637. 10.1038/s41467-020-18492-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Hippisley-Cox J, Vinogradova Y, Coupland C, Parker C.. Risk of malignancy in patients with schizophrenia or bipolar disorder: nested case-control study. Arch Gen Psychiatry. 2007;64(12):1368-1376. 10.1001/archpsyc.64.12.1368 [DOI] [PubMed] [Google Scholar]
  • 36. Wang Y, Cao Y, Huang X, et al. Allele-specific expression of mutated in colorectal cancer (MCC) gene and alternative susceptibility to colorectal cancer in schizophrenia. Sci Rep. 2016;6(1):26688. 10.1038/srep26688 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Hripcsak G, Ryan PB, Duke JD, et al. Characterizing treatment pathways at scale using the OHDSI network. Proc Natl Acad Sci U S A. 2016;113(27):7329-7336. 10.1073/pnas.1510502113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Denny JC, Bastarache L, Ritchie MD, et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat Biotechnol. 2013;31(12):1102-1110. 10.1038/nbt.2749 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Individual-level data are not shared due to ethical and privacy constraints. However, all source code used in the analysis is publicly available on our GitHub repository at https://github.com/tbilab/PheMIME. Additionally, summary data from the multimorbidity analyses can be accessed through the PheMIME App, available at https://prod.tbilab.org/PheMIME/.


Articles from Journal of the American Medical Informatics Association : JAMIA are provided here courtesy of Oxford University Press

RESOURCES