Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Mar 5.
Published in final edited form as: Circulation. 2023 Aug 30;148(13):1061–1069. doi: 10.1161/CIR.0000000000001173

Principles for Health Information Collection, Sharing, and Use: A Policy Statement From the American Heart Association

Kayte Spector-Bagdady, Antonis A Armoundas, Rima Arnaout, Jennifer L Hall, Brooke Yeager McSwain, Joshua W Knowles, W Nicholson Price II, Danda B Rawat, Barbara Riegel, Tracy Y Wang, Kevin Wiley Jr, Mina K Chung, on behalf of the American Heart Association Advocacy Coordinating Committee
PMCID: PMC10912036  NIHMSID: NIHMS1969112  PMID: 37646159

Abstract

The evolution of the electronic health record, combined with advances in data curation and analytic technologies, increasingly enables data sharing and harmonization. Advances in the analysis of health-related and health-proxy information have already accelerated research discoveries and improved patient care. This American Heart Association policy statement discusses how broad data sharing can be an enabling driver of progress by providing data to develop, test, and benchmark innovative methods, scalable insights, and potential new paradigms for data storage and workflow. Along with these advances come concerns about the sensitive nature of some health data, equity considerations about the involvement of historically excluded communities, and the complex intersection of laws attempting to govern behavior. Data-sharing principles are therefore necessary across a wide swath of entities, including parties who collect health information, funders, researchers, patients, legislatures, commercial companies, and regulatory departments and agencies. This policy statement outlines some of the key equity and legal background relevant to health data sharing and responsible management. It then articulates principles that will guide the American Heart Association’s engagement in public policy related to data collection, sharing, and use to continue to inform its work across the research enterprise, as well as specific examples of how these principles might be applied in the policy landscape. The goal of these principles is to improve policy to support the use or reuse of health information in ways that are respectful of patients and research participants, equitable in impact in terms of both risks and potential benefits, and beneficial across broad and demographically diverse communities in the United States.

Keywords: AHA Scientific Statements; consumer health information; ethics, research; policy; social justice; technology


The American Heart Association (AHA) advocates for policies, funds research, and creates and disseminates clinical guidelines to encourage longer, healthier lives. As part of its mission, the AHA funds, supports, and encourages scientifically meritorious research and advocates for a broad strategic policy agenda at the local, state, and federal levels. This strategic policy agenda includes promoting access to quality, affordable health care; addressing primary and primordial prevention at a population level; and advancing heart disease and stroke research.1 The goal of this document is to transparently establish the principles that the AHA will use when engaging in public policy related to data sharing and data privacy.

As electronic health record information becomes increasingly harmonized (ie, curating or normalizing data such that they can be compared and validated against each other)—along with advances in data curation, sharing, and analytic technologies—uses of health-related and health-proxy information (health information) are already accelerating research discoveries and patient care.2 Broad data sharing can be an enabling driver of progress by, for example, providing data to develop, test, and benchmark innovative methods, scalable insights, and potential new paradigms for data storage and workflow.35 Data sharing can also improve collaboration between researchers across multiple disciplines, resulting in significant new discoveries. Moreover, increased data circulation can create a valuable common resource that facilitates reproducibility of results, leading to a more robust evidence base and informed health decision-making.6,7

Along with this promise come concerns about the sensitive nature of some health data, equity considerations about the involvement of historically excluded communities,8 and the complex intersection of laws attempting to govern behavior in this space. Even when the health information that is shared is deidentified, reidentification is increasingly common, especially given the prevalence of other kinds of information freely available on the internet.9 Data that include genomic or particular kinds of geolocation data are uniquely identifiable, meaning that even if they are stripped of the 18 identifiers (eg, name or date of service) included in the privacy rule of the Health Insurance Portability and Accountability Act of 1996 (HIPAA),10 they may still be unique to only 1 individual.11 Data are also revealing associated phenotypic information never thought possible, such as when researchers recently reported that certain artificial intelligence models were able to predict self-reported race from medical image pixel data12 or the identification of anonymous patients undergoing magnetic resonance imaging with facial recognition software.13 In addition, deidentified, or even identified, health information is being increasingly shared with commercial entities.14 This practice raises questions about contributor (eg, patient, research participant, stakeholder, or commercial customer; Table) preferences, as well as privacy concerns. In addition, even if health information is fully deidentified, risks related to individual privacy or “nonwelfare interests” remain (ie, “moral, religious, or cultural concerns about the future uses of their donations”).15

Table.

Definitions

Term Definition
Contributor A patient, research participant, or consumer of a product whose data are included in a research study or database
Health information analytics The use of data analytics, including artificial intelligence, to examine health information
HIPAA business associate A person or entity performing certain functions or activities that involve the use or disclosure of protected health information on behalf of, or which provides services to, a HIPAA-covered entity
Common rule Subpart A of the human subjects research regulations that describes the basic Department of Health and Human Services policy for the protection of human research subjects

HIPAA indicates Health Insurance Portability and Accountability Act of 1996.

Data-sharing principles are therefore necessary across a wide swath of entities, including parties who collect health information, nongovernmental agencies (NGOs), funders, researchers, patients, legislatures, commercial companies, and regulatory departments and agencies. Education related to these core principles, including both their benefits and burdens for contributors or entities, is critical to ensure scientific progress while respecting those who contribute data. As the largest nonprofit funder of cardiovascular research in the United States (second only to the US government), the AHA is committed to tailoring guidance that spans its policy research and advocacy work, investigators, and funding.

The present policy statement first outlines some of the key equity and legal background relevant to health data sharing and responsible management. It then articulates the principles that will help guide the AHA’s engagement in public policy related to data collection, sharing, and use and inform its work across the biomedical research enterprise.

NEED TO INCREASE THE INCLUSION OF UNDERREPRESENTED COMMUNITIES IN RESEARCH COHORTS

Within the promising exponential expansion of the use and applicability of health information analytics lie concerns about misuse, misapplication, and biases.16,17 Research with health information carries the extraordinary potential of improving health equity, but to leverage this opportunity fully, stakeholders must also consciously contribute to the development of a governance architecture founded on inclusion and equity. To do so, stakeholders must rely on improved methods of oversight, stewardship, and accountability. For example, medical algorithms are typically developed in high-resource medical settings such as academic medical centers because they are more likely to have been early adapters of electronic medical records and offer longitudinal data and reputational benefits.18 A recent study found that 71% of surveyed algorithms were trained on data sets from 3 coastal states (California, Massachusetts, and New York) and were not representative of the diversity of the nation.19

Research data sets have also historically been populated largely with participants who self-identify as non-Hispanic White and are of European ancestry.2026 Largely demographically or geographically homogeneous data sets can limit generalizability across diverse patient populations, resulting in potentially inaccurate recommendations for the populations not included in the data, compounding health care disparities.2732 Both historical and current reasons for the exclusion of certain populations from primary and secondary health research protocols can be closely tied to individual, institutional, and structural biases such as racism.30,31 Accessibility is limited by access to the health care and research that generate health information in the first place, recruitment design, and different consent rates across communities.32 Trust can be further compromised by personal, technological, and institutional factors that include fear of data exploitation, lack of digital access, and historical abuses.33 Increasing the trustworthiness of researchers and funders, as well as accessibility and comfort with database inclusion across historically excluded communities, is of utmost importance.34

LAWS AND REGULATIONS

One challenge with ascertaining best practices to achieve these goals for health information collection, sharing, and use is the complex intersection of current laws governing this space. In the United States, 2 major federal regulatory regimes oversee health information collection: HIPAA10 and the Human Subjects Research Regulations (HSRR).35,36 Although “covered entities” and human subjects researchers must comply with these regulations, they are complex and have many gaps in privacy protections.37 The 21st Century Cares Act (2016) also required the development of a “trusted exchange framework” with an infrastructure and governance model for health information networks to share health information with each other. The resultant Trusted Exchange Framework and the Common Agreement was published in 2022.38 That said, the health data collection and sharing space warrants additional policy and direction.

HIPAA sets out the rules for the disclosure and use of protected health information, defined as data that include “individually identifiable health information”39 such as name, date of birth, or medical record number,40 collected by covered entities. Covered entities, in turn, include hospitals, clinicians, or insurers,39 which means that HIPAA protects only this limited area of health information. Entities outside of this definition (eg, NGOs, direct-to-consumer genetic testing companies) typically do not have to follow the HIPAA rules. Health data that are not individually identifiable or are deidentified are not covered by HIPAA, even for covered entities, and can be shared without consent from contributors.

There are several exceptions to the HIPAA protections even for covered entities. First, covered entities may share protected health information with public health authorities without contributor consent for situations such as controlling disease in a population (like the spread of coronavirus disease 2019 [COVID-19]), as legally mandated by state law enforcement (like in the case of suspected child abuse or neglect), or births or deaths.41 In addition, HIPAA allows covered entities to share identified data with business associates if they have a data use agreement in place. This “business associate agreement” exception has recently been used broadly.42 HIPAA permits the use of identifiable data for operations and quality improvement and allows covered entities to share and use health information for research without contributor consent if it is used in minimal-risk research (ie, not greater than that encountered in daily life or routine medicine) and granted an Institutional Review Board waiver or alteration.

The HSRR lay out the appropriate collection and sharing of individually identifiable health information collected from some research participants. Like HIPAA, health data that are not individually identifiable or are deidentified are not covered and can be shared without consent from the research participants. In addition, HSRR do not cover all identifiable health information collected for research purposes, only identifiable health information collected by researchers with federal funding or data collected or related to a US Food and Drug Administration–authorized product. Many academic medical centers also require all their researchers to follow HSRR under all circumstances to streamline compliance structures.43

In 2018, regulators updated the Common Rule component of the HSRR (Subpart A) to address secondary research, among other things. Research informed consent forms must now include a disclosure, if relevant, that identified health information may be deidentified and used or shared for future studies without additional informed consent.44 There are also disclosure requirements if a contributor’s specimens may be used for commercial profit (and whether the contributor will share in that profit) and whether the research might include whole-genome sequencing. There are, however, several major limitations to these new disclosure requirements: some are limited to biospecimens and do not cover health information45; there are no equivalent mandated disclosures on the clinical side (above and beyond offering patients standard HIPAA disclosures)46; and most patients neither read nor understand these disclosures.47,48

To summarize, although both HIPAA and HSRR generally require explicit informed consent from a patient or research participant for the collection, sharing, and use of health information, they are limited to only individually identifiable information and cover only certain kinds of interactions, and there are many exceptions for minimal-risk research that may be done in many cases without the contributors’ consent or knowledge. Many contributors to secondary research are entirely unaware that their health information is being used in this manner. In addition, virtually no regulations comprehensively cover the collection, sharing, and use of health and health-proxy information by commercial companies, a lucrative industry that is expanding rapidly outside the auspices of general health privacy regulation.

Notably, although HIPAA and HSRR try to protect some identified health information from being shared in the first place, there are limited protections in place for the use of those data once they have been shared. An important example is the Genetic Information Nondiscrimination Act of 2008, which prohibits employers and some kinds of health insurance (but excludes long-term care or disability insurance) from discriminating against employees or insured individuals on the basis of genetic information.49 In conjunction with the Office of Science Policy and Technology’s new guidance to ensure that federally funded research data equitably benefit all researchers,50 the National Institutes of Health has put into place (effective in January 2023) stronger data privacy protections for health information generated during sponsored research.14 In addition, the Federal Trade Commission has increasingly regulated some components of data privacy as part of consumer protection. Last, several states have passed or are considering state-level data protection legislation.

As an example, California’s Consumer Privacy Act of 2020 has broader reach than HIPAA or HSRR. It creates both notice and access requirements for businesses that collect, sell, or disclose information. Consumers also can request that some kinds of information be deleted or not shared. The California Consumer Privacy Act has exceptions for research, but that research must be in the public interest. It can be challenging for entities such as commercial companies or NGOs to comply with state-level requirements that are more protective than federal law or are substantively different from each other. By default, many entities end up complying with the most restrictive state policy to ensure compliance with all state policies. For example, although the California Consumer Privacy Act technically applies only to personal information about California residents, many companies not based in California process data on California residents and may end up applying the more restrictive California standards to all data collection and use.51

In contrast to governing by the type of data being shared and used (eg, health information), the US federal approach to health data privacy protection is based on the type of entity that is controlling the data in the first place. Although this approach made sense when the regulations were conceptualized in the 1980s and 1990s, because health information was largely written or at least stored in paper copy by the entity that had originally collected it,48 it is less comprehensive today now that health data are generated and increasingly shared electronically.

An alternative approach appears in the European Union’s General Data Protection Regulation of 2016, which governs personal data (including health information) that are processed by many kinds of both public and private entities. Processing data concerning health without consent is generally prohibited unless it falls under an exception such as for public health and scientific research. The General Data Protection Regulation includes requirements that entities must only collect and process as much data as necessary and notify individuals whose data they receive.52 The feasibility of an approach that attempts to identify data concerning health will face growing enforcement challenges as big data and artificial intelligence allow health-related inferences from innocuous data that seem unrelated to health.53 Governance based on who originally collected the information no longer offers the American public comprehensive health data privacy protection.

AHA PRINCIPLES FOR HEALTH INFORMATION COLLECTION, SHARING, AND USE

Given the evolution in the health information landscape, the complex and limited legal protections in place for contributors, and the increasing use and value of health information to the AHA mission and priorities, we have assembled a set of key principles to help inform the AHA’s engagement in public policy and the research enterprise. The goal of these principles is to improve policy to support the use and reuse of health information in ways that are respectful toward contributors, equitable in impact in terms of both risks and potential benefits, and beneficial across broad and demographically diverse communities in the United States.

Principle 1

Funders of medical and scientific research should prioritize and support research generalizable to, or specifically designed for, historically underrepresented communities.

Given the vast disparities in access to health care and their impact on health, it is critical that health data practices are scrutinized through the lens of health equity and increasing access to scientific advances broadly generalizable across, or specifically designed for, demographically diverse patient populations. Data practices must be thoughtfully crafted and implemented with a focus on collective values such as privacy, equity, fairness, and public beneficence54,55 that neither exacerbate existing health inequities nor create new ones.5658

Identifying and minimizing potential biases requires greater involvement of experts who are familiar with the nature of the biases in the data space, engagement of diverse communities of contributors, and mitigation strategies to reduce both intentional and unintentional biases in clinical implementation and deployment.5965 Research with health information derived from and generalizable to historically excluded communities is critical. Achieving this goal might require the further inclusion of self-identified race and ethnicity in primary research protocols. Enabling the support and hiring of trainees, employees, and leadership from communities traditionally underrepresented in technology or health sciences is also a critical step.66 The AHA has focused several recent efforts such as investing more than $100 million in new focused research funding to help address these issues.

Principle 2

Entities that collect health information and researchers who use it should be held to the highest standards of behavior, including being respectful of the people from whom the data are derived and being responsible stewards of this valuable common resource.

Entities and researchers using health information should be held to high standards of privacy and confidentiality (also discussed in Principle 6). When applicable, health information policies should be reflective of the fact that diverse entities are increasingly collecting, processing, using, transferring, and selling data, sometimes without the awareness of the contributor. Non–HIPAA-covered third parties, including wearable device manufacturers, NGOs, social media sites, direct-to-consumer genetic testing companies, and smart devices, frequently accrue health or health-proxy information (eg, geolocation data from a reproductive health clinic). Although such use might be critical for improving support for stakeholders or important scientific or medical research, use guidance and potential restrictions should be clear, for example, not only requiring the deidentification of data but also restricting access to authenticated users and sharing with third parties only in a way that either contractually limits their abilities to misuse data (eg, a data use agreement) or technologically disallows external misuse (eg, data enclaves that provide access to restricted-use data on an external server), in addition to the enforcement of penalties associated with noncompliance.

Principle 3

Entities that collect health information and researchers who use it should be transparent about potential and actual future uses with patients and other data contributors.

Potential use (for purposes other than originally collected) of health information, whether identified or deidentified, should be disclosed at the time of collection to patients or other data contributors (eg, research participants or commercial consumers). This transparency may include disclosures about potential research or operational or commercial use of such information.

In addition to those mandated by law, disclosures may include the following: (1) that health information may be used and shared for research both within and externally (including internationally) from the entity that collected it; (2) what kinds of health information might be shared with what kinds of external entities; (3) what kinds of research the health information might support; (4) under what circumstances (if any) individualized research results will be returned to contributors; (5) whether health information may be commercialized and examples of how any proceeds might be used; (6) any protections against the reidentification of health information (eg, data use agreements); and (7) whether there is an opt-out process. Such disclosures may be included in general institutional consent forms for treatment; explanatory disclosure could also be included in office signage or informational pamphlets routinely distributed to patients. Ideally, methods sections of articles using such collected data should also disclose whether the health information used for the research was identified or deidentified and what kind of disclosure or consent the contributor received.67 Furthermore, all privacy policies should be easily accessible and clear, available in multiple languages reflective of the population, and at an appropriate reading level for the target community. However, considering unpredictable future developments, the failure of best efforts to disclose unanticipated future research should not unreasonably prevent it. Future legislation should encourage the reasonable and consistent establishment, implementation, and maintenance of organizational policies and procedures on the collection, processing, storage, and transfer of data, as well as internal compliance and enforcement mechanisms.

Principle 4

Awareness, education, and involvement of patients in minimal-risk research are encouraged to enable important future discoveries.

Medical and scientific research that benefits diverse communities can and should be supported and encouraged. The concept of the common good calls on patients to support fellow and future patients like them by contributing to medical and scientific research when the cost to themselves is minimal but the benefits for society are high.68,69 The goal of a learning health system is one in which “evidence is both applied and developed as a natural product of the care process”70 and relies on the contributions of both patients and clinicians. In addition, reciprocity encourages those who have benefited from health care advances to contribute to such minimal-risk learning activities themselves. This said, notification and transparency (Principle 3) are critical components of this expectation.

Principle 5

Medical and scientific researchers, research institutions, and publishers should commit to making health information–derived research findings and innovations widely accessible, providing access to supporting data of sufficient quality to validate and replicate research findings, and data documentation that permits reuse and interoperability of the data.

When an individual contributes identified health information that is used for future secondary research purposes, they should have access to research findings and publications as broadly as possible. For example, the Office of Science and Technology Policy of the Biden White House recently directed federal agencies to ensure that federally funded peer-reviewed scholarly publications are “freely available and publicly accessible” without embargo soon.71

Broad access to funded research could also include researchers sharing final publications with their participant cohorts. In addition, recognizing that the public varies in scientific background and the ability to understand scientific literature, researchers, funders, or institutions should consider sharing annotated major research findings in the form of press releases and blog posts, with links to publicly available scientific publications once posted.72 Once shared, annotations should be designed to engage the public using plain language, graphics, and clear explanations.

In addition, health information is a valuable common resource. Unlike self-depleting resources, knowledge generally only increases in value the more it is shared and used.7,9,73 As part of a commitment to acting as responsible stewards of health information (Principle 2), researchers should share the supporting data and data documentation of their research findings with other authorized researchers to the extent permissible and possible. This will likely require additional institutional and funder support because the resources and computational infrastructure necessary are likely beyond the scope of an individual investigator. These data should be of sufficient quality to validate and replicate research findings, as called for by the recent National Institute of Health Policy for Data Management and Sharing,74 and should include necessary clinical information, data dictionaries, code, and methods. The demographics of contributors should be explained in detail, particularly if there is impact on data interpretation or analytic generalizability.

The AHA Research Division has an Open Science Policy that includes requirements for both public access to publications resulting from AHA-funded research and free and public sharing of data needed for verification of research results; the policy also provides guidance on acceptable repositories. An opt-out provision for data sharing is available for some situations.

Such data sharing enables the validation of research results, supports derivative work and discoveries, and should be strongly encouraged and supported by funders, journals, institutions, and researchers alike. Institutions receiving government research funding should commit to enabling and facilitating their researchers to meet these goals. Funding for data cleaning and sharing is also necessary, as well as, potentially, expert support for accomplishing these tasks. Although investigations of compliance rates of health information–sharing requirements by funders have found that investigator-initiated compliance is often low,75,76 the responsibility for this scientific integrity lies with both investigators and custodians of the data.

Principle 6

Federal law should provide a consistent baseline of protection and enforcement for individuals whose health information is collected and used.

Although several states have implemented or begun to craft additional data privacy legislation for their citizens, federal privacy legislation can ensure that citizens have a robust baseline of protections across state lines. These protections could include additional standards for disclosure and transparency (including potential use, duration of use, and other types of data being combined, as well as third parties with whom data may be shared), in addition to options for limiting the amount or kind of data that are shared. A lack of a consistent federal baseline of protection can create challenges for entities, including NGOs, managing health data across their operations, companies working across state lines or globally, and patients accessing and using health services in states other than their primary residence. A federal standard would provide such entities with a consistent and clear understanding of their responsibilities and data contributors with consistent protections.

CONCLUSIONS

The AHA is the nation’s oldest and largest voluntary health care organization dedicated to reducing cardiovascular disease and supports innovative and cutting-edge research applicable across diverse communities. Given the promise of big data research and potential artificial intelligence applications, supporting the broad use of health information for research can make great strides in achieving these goals. However, there are also potential concerns for this expanding use, including for individual privacy and equity considerations across diverse communities. Data sharing and use must be done with a recognition of the obligations of contributors, journals, researchers, and funders within the context of the larger policy ecosystem. These principles will guide the AHA’s work across the changing landscape of policy and systems in terms of health information collection, sharing, and use to ensure respect for contributors, equity in impact of risk, and benefit across broad and demographically diverse communities in the United States.

Footnotes

The American Heart Association makes every effort to avoid any actual or potential conflicts of interest that may arise as a result of an outside relationship or a personal, professional, or business interest of a member of the writing panel. Specifically, all members of the writing group are required to complete and submit a Disclosure Questionnaire showing all such relationships that might be perceived as real or potential conflicts of interest.

This policy statement was approved by the American Heart Association Advocacy Coordinating Committee on May 3, 2023, and the American Heart Association Executive Committee on May 22, 2023. A copy of the document is available at https://professional.heart.org/statements by using either “Search for Guidelines & Statements” or the “Browse by Topic” area. To purchase additional reprints, call 215–356-2721 or email Meredith.Edelman@wolterskluwer.com

Disclosures
Writing Group Disclosures
Writing group member Employment Research grant Other research support Speakers’ bureau/honoraria Expert witness Ownership interest Consultant/advisory board Other
Kayte Spector-Bagdady University of Michigan Medical School NIH None None None None None None
Mina K. Chung Cleveland Clinic AHA; NIH None None None None None None
Antonis A. Armoundas Massachusetts General Hospital None None None None None None None
Rima Arnaout University of California San Francisco None None None None None None None
Jennifer L. Hall American Heart Association None None None None None None None
Joshua W. Knowles Stanford University School of Medicine None None None None None None None
Brooke Yeager McSwain American Heart Association None None None None None None None
W. Nicholson Price II University of Michigan None None None None None None None
Danda B. Rawat Howard University NIH* None None None None None None
Barbara Riegel University of Pennsylvania NIH None None None None None None
Tracy Y. Wang Duke Clinical Research Institute AstraZeneca; Abbott*; Boston Scientific*; Chiesi*; Cryolife; BMS* None None None None AstraZeneca*; Cryolife*; Novartis* None
Kevin Wiley Jr Medical University of South Carolina None None None None None None National Committee for Quality Assurance; Medical University of South Carolina
This table represents the relationships of reviewers that may be perceived as actual or reasonably perceived conflicts of interest as reported on the Disclosure Questionnaire, which all reviewers are required to complete and submit. A relationship is considered to be “significant” if (a) the person receives $5000 or more during any 12-month period, or 5% or more of the person’s gross income; or (b) the person owns 5% or more of the voting stock or share of the entity, or owns $5000 or more of the fair market value of the entity. A relationship is considered to be “modest” if it is less than “significant” under the preceding definition.
*
Modest.
Significant.
Reviewer Disclosures
Reviewer Employment Research grant Other research support Speakers’ bureau/honoraria Expert witness Ownership interest Consultant/advisory board Other
Kevin Larsen Optum None None None None None None None
Leslie A. Lenert Biomedical Informatics Center, Medical University of South Carolina None None None None None None None
Aaron Miri Baptist Health None None None None None None None
Brett Oliver Baptist Healthcare System None None None None None None None
This table represents the relationships of reviewers that may be perceived as actual or reasonably perceived conflicts of interest as reported on the Disclosure Questionnaire, which all reviewers are required to complete and submit. A relationship is considered to be “significant” if (a) the person receives $5000 or more during any 12-month period, or 5% or more of the person’s gross income; or (b) the person owns 5% or more of the voting stock or share of the entity, or owns $5000 or more of the fair market value of the entity. A relationship is considered to be “modest” if it is less than “significant” under the preceding definition.

REFERENCES

  • 1.American Heart Association. Intellectual property policy for research funding. 2022. Accessed October 4, 2022. https://professional.heart.org/en/research-programs/aha-research-policies-and-awardee-hub/policies-governing-all-research-awards
  • 2.American Heart Association. Open science policy statements for AHA funded research. 2022. Accessed October 4, 2022. https://professional.heart.org/en/research-programs/aha-research-policies-and-awardee-hub/policies-governing-all-research-awards
  • 3.Jones RD, Krenz C, Griffith KA, Spence R, Bradbury AR, De Vries R, Hawley ST, Zon R, Bolte S, Sadeghi N, et al. Governance of a learning health care system for oncology: patient recommendations. JCO Oncol Pract. 2021;17:e479–e489. doi: 10.1200/OP.20.00454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.National Institutes of Health. Final NIH policy for data management and sharing. 2020. Accessed January 25, 2022. https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-013.html
  • 5.Rai AK. The case of rights-encumbered biomedical data silos. Notre Dame Law Rev. 2017;92:1–29. [Google Scholar]
  • 6.Eisenberg R Proprietary rights and the norms of science in biotechnology research. Yale Law J. 1987;97:177–231. [PubMed] [Google Scholar]
  • 7.McGuire AL, Majumder MA, Villanueva AG, Bardill J, Bollinger JM, Boerwinkle E, Bubela T, Deverka PA, Evans BJ, Garrison NA, et al. Importance of participant-centricity and trust for a sustainable medical information commons. J Law Med Ethics. 2019;47:12–20. doi: 10.1177/1073110519840480 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.ONC, HealthIT Advisory Committee. Final report of the Health Information Technology Advisory Committee on Public Health Data Systems. Accessed December 19, 2022. https://healthit.gov/sites/default/files/page/2022-11/2022-11-10_PHDS_TF_Recommendations_Report_Transmittal_Letter_508.pdf
  • 9.Price WN 2nd. Risk and resilience in health data infrastructure. Colo Tech Law J 2017;16:65–85. [Google Scholar]
  • 10.45 CFR §§160 and 164 (Subparts A and E).
  • 11.Presidential Commission for the Study of Bioethical Issues. Privacy and Progress in Whole Genome Sequencing. Government Printing Office: 2012. Accessed July 14, 2023. https://bioethicsarchive.georgetown.edu/pcsbi/node/764.html [Google Scholar]
  • 12.Simonite T These algorithms look at x-rays—and somehow detect your race. Wired. August 5, 2021. Accessed December 19, 2022. https://wired.com/story/these-algorithms-look-x-rays-detect-your-race/
  • 13.Schwarz CG, Kremers WK, Therneau TM, Sharp RR, Gunter JL, Vemuri P, Arani A, Spychalla AJ, Kantarci K, Knopman DS, et al. Identification of anonymous MRI research participants with face-recognition software. N Engl J Med. 2019;381:1684–1686. doi: 10.1056/NEJMc1908881 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Price WN 2nd, Kaminski ME, Minssen T, Spector-Bagdady K. Shadow health records meet new data privacy laws. Science. 2019;363:448–450. doi: 10.1126/science.aav5133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.De Vries RG, Tomlinson T, Kim HM, Krenz CD, Ryan KA, Lehpamer N, Kim SY. The moral concerns of biobank donors: the effect of non-welfare interests on willingness to donate. Life Sci Soc Policy. 2016;12:3. doi: 10.1186/s40504-016-0036-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bazoukis G, Hall J, Loscalzo J, Antman EM, Fuster V, Armoundas AA. The inclusion of augmented intelligence in medicine: a framework for successful implementation. Cell Rep Med. 2022;3:100485. doi: 10.1016/j.xcrm.2021.100485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sevakula RK, Au-Yeung WM, Singh JP, Heist EK, Isselbacher EM, Armoundas AA. State-of-the-art machine learning techniques aiming to improve patient outcomes pertaining to the cardiovascular system. J Am Heart Assoc. 2020;9:e013924. doi: 10.1161/JAHA.119.013924 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Price WN 2nd. Medical AI and contextual bias. Harvard J Law Technol. 2019;33:65–116. [Google Scholar]
  • 19.Kaushal A, Altman A, Langlotz C. Geographic distribution of US cohorts used to train deep learning algorithms. JAMA. 2020;324:1212–1213. doi: 10.1001/jama.2020.12067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Barocas SS, Selbst AD. Big data’s disparate impact. Calif Law Rev. 2016;671–732. [Google Scholar]
  • 21.Malanga SE, Loe JD, Robertson CT, Ramos KS. Who’s left out of big data?” In: Cohen IG, Lynch HF, Veyena E, Gasser U, eds. Big Data, Health Law, and Bioethics. Cambridge University Press: 2018. [Google Scholar]
  • 22.Obermeyer Z, Powers B, Vogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366:447–453. doi: 10.1126/science.aax2342 [DOI] [PubMed] [Google Scholar]
  • 23.Popejoy AB, Fullerton SM. Genomics is failing on diversity. Nature. 2016;538:161–164. doi: 10.1038/538161a [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Bonham VL, Callier SL, Royal CD. Will precision medicine move us beyond race? N Engl J Med. 2016;374:2003–2005. doi: 10.1056/NEJMp1511294 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Caswell-Jin JL, Gupta T, Hall E, Petrovchich IM, Mills MA, Kingham KE, Koff R, Chun NM, Levonian P, Lebensohn AP, et al. Racial/ethnic differences in multiple-gene sequencing results for hereditary cancer risk. Genet Med. 2018;20:234–239. doi: 10.1038/gim.2017.96 [DOI] [PubMed] [Google Scholar]
  • 26.Manrai AK, Funke BH, Rehm HL, Olesen MS, Maron BA, Szolovits P, Margulies DM, Loscalzo J, Kohane IS. Genetic misdiagnoses and the potential for health disparities. N Engl J Med. 2016;375:655–665. doi: 10.1056/NEJMsa1507092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Whittaker M, Alper M, Bennett CL, Hendren S, Kaziunas L, Mills M, Ringel Morris M, Rogers RJ, Salas E, Myers West M; S. Disability, Bias, and AI. AINOW. Accessed July 14, 2023. https://ainowinstitute.org/publication/disabilitybiasai-2019
  • 28.Michmerhuizen NL, Birkeland AC, Bradford CR, Brenner JC. Genetic determinants in head and neck squamous cell carcinoma and their influence on global personalized medicine. Genes Cancer. 2016;7:182–200. doi: 10.18632/genesandcancer.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Taylor P Personal genomes: when consent gets in the way. Nature. 2008;456:32–33. doi: 10.1038/456032a [DOI] [PubMed] [Google Scholar]
  • 30.Benjamin R Assessing risk, automating racism. Science. 2019;366:421–422. doi: 10.1126/science.aaz3873 [DOI] [PubMed] [Google Scholar]
  • 31.Boyd RW, Lindo EG, Weeks LD, McLemore MR. On racism: a new standard for publishing on racial health inequities. Health Affairs Forefront. 2020. Accessed December 19, 2022. https://healthaffairs.org/do/10.1377/fore-front.20200630.939347/ [Google Scholar]
  • 32.Spector-Bagdady K, Tang S, Jabbour S, Price WN 2nd, Bracic A, Creary MS, Kheterpal S, Brummett CM, Wiens J. Respecting autonomy and enabling diversity: the effect of eligibility and enrollment on research data demographics. Health Aff (Millwood). 2021;40:1892–1899. doi: 10.1377/hlthaff.2021.01197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Adjekum A, Blasimme A, Vayena E. Elements of trust in digital health systems: scoping review. J Med Internet Res. 2018;20:e11254. doi: 10.2196/11254 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.American Heart Association. Live fierce. stand for all. 2022. Accessed October 1, 2022. https://www.heart.org/en/healthy-living/live-fierce/live-fierce-stand-for-all
  • 35.45 CFR §46.
  • 36.21 CFR §§50, 56.
  • 37.Golbus JR, Price WN 2nd, Nallamothu BK. Privacy gaps for digital cardiology data: big problems with big data. Circulation. 2020;141:613–615. doi: 10.1161/CIRCULATIONAHA [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Department of Health and Human Services. ONC completes critical 21st Century Cures Act requirement, publishes the trusted exchange framework and the common agreement for Health Information Networks. January 18, 2022. Accessed December 20, 2022. https://hhs.gov/about/news/2022/01/18/onc-completes-critical-21st-century-cures-act-requirement-publishes-trusted-exchange-framework-common-agreement-health-information-networks.html
  • 39.45 CFR §160.103.
  • 40.45 CFR §164.514.
  • 41.45 CFR 164.512(b).
  • 42.Cohen IG, Mello MM. Big data, big tech, and protecting patient privacy. JAMA. 2019;322:1141–1142. doi: 10.1001/jama.2019.11365 [DOI] [PubMed] [Google Scholar]
  • 43.Federal policy for the protection of human subjects. Fed Regist. 82 FR 7149. January 19, 2017. [PubMed] [Google Scholar]
  • 44.45 CFR §146.116(b)(9)(i).
  • 45.45 CFR §146.116(c)(7).
  • 46.Spector-Bagdady K Hospitals should act now to notify patients about research use of their data and biospecimens. Nat Med. 2020;26:306–308. doi: 10.1038/s41591-020-0795-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Beskow LM. Lessons from HeLa Cells: the ethics and policy of biospecimens. Annu Rev Genomics Hum Genet. 2016;17:395–417. doi: 10.1146/annurev-genom-083115-022536 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Emanuel EJ, Grady C. Case study: is longer always better? 38. Hastings Cent Rep. 2008;38:10–11. [PubMed] [Google Scholar]
  • 49.Lenartz A, Scherer AM, Uhlmann WR, Suter SM, Anderson Hartley C, Prince AER. The persistent lack of knowledge and misunderstanding of the Genetic Information Nondiscrimination Act (GINA) more than a decade after passage. Genet Med. 2021;23:2324–2334. doi: 10.1038/s41436-021-01268-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Tabak LA. Statement on NIH plans to speed access to federally funded research results. August 25, 2022. Accessed December 20, 2022. https://www.nih.gov/about-nih/who-we-are/nih-director/statements/statement-nih-plans-speed-access-federally-funded-research-results
  • 51.California Consumer Privacy Act of 2018. Accessed June 6, 2023. https://www.caprivacy.org
  • 52.Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation).
  • 53.Grote T, Berens P. On the ethics of algorithmic decision-making in healthcare. J Med Ethics. 2020;46:205–211. doi: 10.1136/medethics-2019-105586 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Habli I, Lawton T, Porter Z. Artificial intelligence in health care: accountability and safety. Bull World Health Organ. 2020;98:251–256. doi: 10.2471/BLT.19.237487. Accessed July 14, 2023. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7133468/ [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.UN General Assembly. Report of the Special Rapporteur on Extreme Poverty and Human Rights. United Nations; 2019. [Google Scholar]
  • 56.Zuboff S The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power. Profile Books, 2019. [Google Scholar]
  • 57.Couldry N, Mejias U. The Costs of Connection: How Data Is Colonizing Human Life and Appropriating it for Capitalism. Stanford University Press; 2019. [Google Scholar]
  • 58.Warner JJ, Benjamin IJ, Churchwell K, Firestone G, Gardner TJ, Johnson JC, Ng-Osorio J, Rodriguez CJ, Todman L, Yaffe K, et al. ; on behalf of the American Heart Association Advocacy Coordinating Committee. Advancing healthcare reform: the American Heart Association’s 2020 state ment of principles for adequate, accessible, and affordable health care: a presidential advisory from the American Heart Association. Circulation. 2020;141:e601–e614. doi: 10.1161/CIR.0000000000000759 [DOI] [PubMed] [Google Scholar]
  • 59.Benjamin R Informed refusal. Sci Technol Human Values. 2016;41:967–990. [Google Scholar]
  • 60.Creary MS. Bounded justice and the limits of health equity. J Law Med Ethics. 2021;49:241–256. doi: 10.1017/jme.2021.34 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Dang JH, Rodriguez EM, Luque JS, Erwin DO, Meade CD, Chen MS Jr. Engaging diverse populations about biospecimen donation for cancer research. J Community Genet. 2014;5:313–327. doi: 10.1007/s12687-014-0186-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Millon Underwood S, Buseh AG, Kelber ST, Stevens PE, Townsend L. Enhancing the participation of African Americans in health-related genetic research: findings of a collaborative academic and community-based research study. Nurs Res Pract. 2013;2013:749563. doi: 10.1155/2013/749563 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Ochs-Balcom HM, Jandorf L, Wang Y, Johnson D, Meadows Ray V, Willis MJ, Erwin DO. “It takes a village”: multilevel approaches to recruit African Americans and their families for genetic research. J Community Genet. 2015;6:39–45. doi: 10.1007/s12687-014-0199-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Yancey AK, Ortega AN, Kumanyika SK. Effective recruitment and retention of minority research participants. Annu Rev Public Health. 2006;27:1–28. doi: 10.1146/annurev.publhealth.27.021405.102113 [DOI] [PubMed] [Google Scholar]
  • 65.Cho MK. Rising to the challenge of bias in health care AI. Nat Med. 2021;27:2079–2081. doi: 10.1038/s41591-021-01577-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Bonham VL, Green ED. The genomics workforce must become more diverse: a strategic imperative. Am J Hum Genet. 2021;108:3–7. doi: 10.1016/j.ajhg.2020.12.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Sauer CM, Chen LC, Hyland SL, Girbes A, Elbers P, Celi LA. Leveraging electronic health records for data science: common pitfalls and how to avoid them. Lancet Digit Health. 2022;4:e893–e898 doi: 10.1016/S2589-7500(22)00154-6 [DOI] [PubMed] [Google Scholar]
  • 68.Faden RR, Kass NE, Goodman SN, Pronovost P, Tunis S, Beauchamp TL. An ethics framework for a learning health care system: a departure from traditional research ethics and clinical ethics. Hastings Cent Rep. 2013;Spec No:S16–S27. doi: 10.1002/hast.134 [DOI] [PubMed] [Google Scholar]
  • 69.Rawls J A Theory of Justice. Belknap Press of Harvard University Press; 1971. [Google Scholar]
  • 70.National Academies Press. IOM Roundtable on Evidence-Based Medicine. The Learning Health System; 2007. [Google Scholar]
  • 71.Executive Office of the President, Office of Science and Technology Policy. Memorandum Re: ensuring free, immediate, and equitable access to federally funded research. August 25, 2022. Accessed December 12, 2022. https://www.whitehouse.gov/wp-content/uploads/2022/08/08-2022-OSTP-Public-Access-Memo.pdf
  • 72.American Heart Association. AHA approved data repositories. Accessed December 20, 2022. https://professional.heart.org/en/research-programs/aha-research-policies-and-awardee-hub/aha-approved-data-repositories
  • 73.Hess C, Ostrom E. Introduction: an overview of the knowledge commons. In: Hess C, Ostrom E, eds. Understanding Knowledge as a Commons: From Theory to Practice. MIT Press; 2011. [Google Scholar]
  • 74.Final NIH policy for data management and sharing. October 29, 2020. Accessed December 20, 2022. https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-013.html
  • 75.Piller C FDA and NIH let clinical trial sponsors keep results secret and break the law Science Insider. 2021. Accessed June 6, 2023. https://science.org/content/article/fda-and-nih-let-clinical-trial-sponsors-keep-results-secret-and-break-law
  • 76.Silverman E “A blind eye”: NIH fails to ensure clinical trial results are reported, and still funds researchers who don’t file results. Stat+. 2022. Accessed June 6, 2023. https://statnews.com/pharmalot/2022/08/17/nih-clinical-trials-transparency-fda/

RESOURCES