Abstract
Lipoprotein(a) [Lp(a)] is independently associated with atherosclerotic cardiovascular disease and calcific aortic valve stenosis. Elevated Lp(a) affects approximately one in five individuals and meaningfully contributes to the residual cardiovascular risk in individuals with otherwise well‐controlled risk factors. With targeted therapies in the therapeutic pipeline, there is a need to further characterize the clinical phenotypes and outcomes of individuals with elevated levels of this unique biomarker. The Mass General Brigham Lp(a) Registry will be built from the longitudinal electronic health record of two large academic medical centers in Boston, Massachusetts, to develop a detailed cohort of patients who have had their Lp(a) measured. In combination with structured data sources, clinical documentation will be analyzed using natural language processing techniques to accurately characterize baseline characteristics. Important outcome measures including all‐cause mortality, cardiovascular mortality, and cardiovascular events will be available for analysis. Approximately 30 000 patients who have had their Lp(a) tested within the Mass General Brigham system from January 2000 to July 2019 will be included in the registry. This large Lp(a) cohort will provide meaningful observational data regarding the differential risk associated with Lp(a) values and cardiovascular disease. With a new frontier of targeted Lp(a) therapies on the horizon, the Mass General Brigham Lp(a) Registry will help provide a deeper understanding of Lp(a)'s role in long term cardiovascular outcomes.
Keywords: ASCVD, lipoprotein(a), natural language processing, prevention
1. INTRODUCTION
Over the last four decades, numerous studies have demonstrated that elevated lipoprotein(a) (Lp(a)) is an independent risk factor for coronary heart disease, ischemic stroke, and calcific aortic valve stenosis. 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 With a unique molecular structure hypothesized to be both pro‐atherogenic and pro‐thrombotic, 13 Lp(a) is composed of a low‐density lipoprotein‐like moiety covalently bound to a genetically‐mediated apolipoprotein(a) molecule. 14 Synthesized in the liver and mediated almost entirely by the LPA gene, elevated Lp(a) is the most common inherited dyslipidemia, affecting approximately one in five individuals. 15 Lp(a) levels above 50 mg/dL (≈ 125 nmol/L) correspond to the 80th percentile in population‐based studies and is the threshold at which its impact on atherosclerotic cardiovascular disease (ASCVD) has been shown to become clinically meaningful. 10 , 16 , 17 Even for individuals with otherwise well‐controlled risk factors, elevated levels of Lp(a) have a significant and independent effect on residual atherosclerotic cardiovascular risk. 18
Despite the robust evidence linking elevated levels of Lp(a) to cardiovascular disease, there are currently no approved pharmacologic therapies directly targeting Lp(a). Presently, those with significantly elevated levels are counseled on overall risk reduction and—in certain scenarios—recommended to initiate statin therapy. 19 , 20 , 21 While statin therapy has no direct effect on Lp(a) levels, 22 its use in some individuals with elevated Lp(a) centers around overall ASCVD risk reduction. However, there are now a variety of promising therapeutics in development which may transform the management of this dyslipidemia and help mitigate this important cardiovascular risk factor. 12 , 23 , 24 , 25 , 26
With the landscape of Lp(a) management potentially shifting in the coming decade, there is a need to develop a deeper understanding as to which subsets of patients may derive the most benefit from future targeted therapies. Accordingly, we set out to assess the demographics, clinical characteristics, and outcomes of a large cohort of patients (n ≈ 30 000) who have had their Lp(a) measured over the last 20 years to further understand the association between Lp(a) levels and cardiovascular disease.
2. STUDY OBJECTIVES
The objectives of this study are to: (a) describe the distribution of Lp(a) across a large cohort of patients with and without baseline cardiovascular disease stratified by sex and ethnic background; (b) determine the association between Lp(a) level and cause‐specific mortality; (c) determine the association between Lp(a) level and incidence of myocardial infarction and non‐hemorrhagic stroke; (d) determine the association of Lp(a) level and the age of ASCVD events; (e) determine the association between Lp(a) level and venous thromboembolism; 13 (f) evaluate the association between elevated Lp(a) levels and imaging correlates for coronary artery disease; (g) evaluate the association between Lp(a) level and peripheral revascularization; (h) determine the association between Lp(a) level and progression of aortic stenosis and need for aortic valve replacement; (i) assess the impact of Lp(a) and recurrent ASCVD events; and (j) assess the interaction of sex and ethnic background on the above investigations.
3. METHODS
The Mass General Brigham Lp(a) Registry will be a retrospective cohort of approximately 30 000 patients who had their Lp(a) measured from 01 January 2000 to 07 January 2019 as part of routine clinical care. The Institutional Review Board at Mass General Brigham approved this study.
Figure 1 provides an outline of the study design.
3.1. Data sources
There are three primary data sources for our work:
The Research Patient Data Registry 27 (RPDR) at Mass General Brigham will serve as the primary source of data for this registry. The RPDR is a centralized clinical data registry that consolidates clinical information from all hospitals within the Mass General Brigham network, including Brigham and Women's Hospital and Massachusetts General Hospital. This system provides demographic data, laboratory and imaging data, diagnostic and procedural codes, medication data, vital status, and clinical documentation for individuals meeting specified search criteria. RPDR obtains vital status from the Social Security Administration Death Master File.
Data from Medicare and Medicaid, when available, will be linked to individuals in our cohort to assess outcomes that may have occurred outside of the Mass General Brigham healthcare system.
International Classification of Diseases (ICD) coded death information from the National Death Index (NDI) or the Massachusetts Office of Vital Statistics will be used to determine the underlying and proximal causes of death for each patient who expired during the study period.
3.2. Entry criteria
All individuals ≥18 years of age with at least 1 Lp(a) result will be included in our cohort. There will be two exclusion criteria based on baseline covariates: (a) patients with a diagnosis of severe renal dysfunction defined as stage 5 chronic kidney disease (estimated glomerular filtration rate < 15 mL/min/m2), those who have had a renal transplant, or those on dialysis; and (b) individuals with a diagnostic ICD code for a malignant neoplasm, excluding nonmelanoma skin cancers.
3.3. Standardization of Lp(a) values
Over the study period, two distinct Lp(a) assay methodologies were used as part of routine medical care. The first methodology was the standard immunochemical‐based assay with reference ranges <30 mg/dL or < 75 nmol/L. 10 , 28 , 29 The second assay type was a clinically validated electrophoretic assay measuring the cholesterol content of Lp(a) particles with a reference range < 3 mg/dL. 30 , 31 Prior analytic work has demonstrated that this electrophoretic cholesterol assay has a strong correlation with the standard immunochemical‐based assay, with correlation coefficients ranging from 0.89 to 0.96. 30 , 31
In order to avoid potential biases due to differences in testing modalities over the study period, percentile groups for each assay will be defined separately and then combined. This methodology has been performed previously in other large Lp(a) studies. 2 , 5 , 9 , 32 For selected analyses, if appropriate, we will maintain Lp(a) levels in their original form for use as continuous variables within individual assay types or testing methodologies.
3.4. Assessment of baseline covariates
Given the presence of more than 8 million clinical notes and tens of millions of structured data points (eg, laboratory values, ICD codes, and so on) relating to our study population, accurately characterizing all individuals in the cohort will be essential for analytic work.
3.4.1. Methods for baseline covariate assessment
Baseline covariates will be determined through the following methods:
Demographic data, such as age, sex, and race, will be obtained from the Mass General Brigham RPDR system.
Natural language processing (NLP) algorithms using the open‐source Canary NLP platform will be created and applied to clinical documentation to determine the presence of a variety of key baseline risk factors. 33 , 34 Covariates assessed in this fashion will be the clinical diagnoses and documentation at any time of hypertension, dyslipidemia, diabetes, transient ischemic attack, nonhemorrhagic stroke, coronary artery disease, myocardial infarction, or coronary revascularization. Each of these algorithms will be individually validated for their accuracy prior to all analytic work.
Smoking status for each individual per calendar year in the cohort will be determined through the use of a previously validated NLP system. 35 This NLP system has high accuracy and is able to assign an annual smoking status to each patient with a sensitivity of 92.4%, specificity of 86.2%, and AUC of 0.89. Because this system was developed using the Mass General Brigham hospital network RPDR database, it will be able to accurately assess for patient‐level smoking status within our registry.
- Laboratory data will be used for further assessment of baseline covariates. Key laboratory data will include creatinine, glycated hemoglobin (HgbA1c), and the individual components of the complete cholesterol panel (eg, total cholesterol, triglycerides, high‐density lipoprotein cholesterol, and low‐density lipoprotein cholesterol).
- Diabetes will be defined as a HgbA1c level ≥ 6.5%. For women under the age of 45, diabetes will be defined by the presence of two HgbA1c levels ≥6.5% separated by at least 9 months to avoid categorizing those with gestational diabetes as having a diagnosis of diabetes.
- Dyslipidemia will be defined as total cholesterol ≥240 mg/dL, serum triglycerides ≥150 mg/dL, high‐density lipoprotein cholesterol <40 mg/dL in men or <50 mg/dL in women.
- Chronic kidney disease will be defined as a median estimated glomerular filtration rate between 60 and 15 mL/min/m2.
Medication use, such as the use of cholesterol lowering therapies or the use of insulin therapy for those with diabetes, will be assessed using NLP as well as through analysis of electronic prescription data from the RPDR data repository.
Diagnostic and procedural ICD‐9, ICD‐10, and Current Procedural Terminology (CPT) codes will further augment baseline risk factor assessment. A full description of the covariates, which will be assessed using ICD and/or CPT codes is available in the Appendix S1.
Markers of socioeconomic status will be incorporated into our analyses using the Area Deprivation Index (ADI). The ADI is a validated marker of neighborhood level socioeconomic disadvantage, a surrogate for individual socioeconomic status. The ADI combines 17 measures of employment, income, housing, and education from the American Community Survey to create a score for each geographic unit in the United States. 36 , 37 , 38 , 39 , 40 , 41 , 42 Each patient's home address will be indexed to its associated neighborhood level socioeconomic disadvantage ADI score and incorporated into multivariate analyses.
3.4.2. Baseline covariate assessment period
The baseline covariate assessment period will be determined in one of two ways depending on the analysis:
Based on Lp(a) testing date: Baseline covariates will be assessed up to 12 months prior to and 12 months after the date of Lp(a) laboratory testing. For individuals with multiple Lp(a) tests over time, the baseline covariate assessment period will relate to the first chronologic Lp(a) measurement.
Based on first encounter: Baseline covariates will be assessed from the date of the first clinical encounter in the RPDR system and up to 24 months thereafter.
Additionally, the use of specific medications will be determined based on data 12 months prior to and up to 6 months after the date of Lp(a) laboratory testing or up to 12 months after the first encounter in the RPDR system. From information available during the covariate assessment period, we will calculate baseline 10‐year ASCVD risk for individuals without a history of myocardial infarction or stroke. 20 , 21 , 43
3.5. Outcomes and index date for follow‐up
The key outcome measures in our study will be all‐cause mortality, cardiovascular mortality, and the occurrence of cardiovascular events. Follow‐up for the study outcomes will commence after the baseline covariate assessment period.
3.5.1. All‐cause mortality
Vital status will be determined from the NDI and the Massachusetts Office of Vital Statistics in conjunction with data from the Social Security Administration Death Master File.
3.5.2. Cardiovascular mortality
Cardiovascular mortality will be determined based on the ICD‐coded underlying cause of death 44 , 45 , 46 , 47 as determined by the NDI or the Massachusetts Office of Vital Statistics. When applicable, cardiovascular disease codes listed as a contributing factor in noncardiovascular deaths will be used in sensitivity analyses to estimate the potential additional effect that overall cardiovascular disease has on all‐cause mortality.
3.5.3. Cardiovascular events
For cardiovascular events requiring hospitalization, such as acute myocardial infarction, unstable angina, venous thromboembolism, nonhemorrhagic stroke, or transient ischemic attack, the presence of a diagnostic ICD code in the primary hospital discharge position will be considered diagnostic. This methodology has been extensively validated with high specificity, high positive predictive value, and reasonable sensitivity. 48 , 49 , 50 , 51 , 52 , 53 , 54 , 55 , 56 , 57 When applicable, we will perform sensitivity analyses incorporating the same diagnostic codes as indicating the presence of a given outcome when they appear in nonprimary positions in hospital discharge claims data. 52 , 56
For outcomes, such as coronary revascularization, peripheral revascularization, transcatheter or surgical aortic valve replacement, and other relevant cardiovascular procedures, a single procedural code in the patient record (either ICD or CPT) will be considered diagnostic.
Linked Medicare and Medicaid claims data (where available) will be used to supplement outcomes data for individuals who received medical care for acute events and procedures outside of our hospital network. Sensitivity analyses will be performed to compare the incidence of cardiovascular events in our hospital system vs those that occurred in other systems.
3.6. Data management
Study‐related data for all patients who meet inclusion criteria will be stored on secure servers hosted by Mass General Brigham. Project managers (the primary investigator, analytic team, and data steward) will determine which members of the research team are authorized to access file system data. As part of research orientation at Mass General Brigham, all users are required to complete privacy, best practice, and ethics training provided by the Collaborative Institutional Training Initiative (www.citiprogram.org). Where appropriate, institutional data use agreements will be orchestrated to facilitate the research enterprise.
3.6.1. Statistical analysis
Clinical characteristics will be compared across different subgroups, including sex and ethnic background. Continuous variables will be reported as means or medians and compared with t tests, rank‐sum tests, or analysis of variance, as appropriate. Categorical variables will be reported as frequencies and proportions and will be compared with χ 2 or Fisher exact tests. Ordinal variables will be compared using an appropriate trend test. Analyses will be adjusted using regression techniques for available baseline covariates to adjust for confounding and to examine associations between Lp(a) level and the outcomes of interest. Cox proportional hazards modeling will be performed for time‐to‐event analyses. Incidence rate ratios will be calculated when comparing the rates of cardiovascular events between different levels of Lp(a). All analyses will be performed on deidentified data.
4. LIMITATIONS
As a retrospective cohort study spanning a 20‐year period, we are limited by differences in the techniques used for Lp(a) measurement. However, to address this, we will standardize each assay based on percentiles and subsequently perform analyses across predefined percentile groups. 2 , 5 , 9 , 32 Additionally, our cohort consists of patients who were tested as part of routine clinical care, which may introduce selection bias. However, at present, Lp(a) testing is typically performed on selected patients with cardiovascular disease. As such, our findings will be applicable to patients who are tested in the current era where Lp(a) screening is typically performed in those at elevated cardiovascular risk while also providing meaningful information for the vast majority of patients with elevated levels.
Given the retrospective nature of this work, unmeasured and residual confounding may still exist even after adjusting for known confounders. Because this study is limited to two large academic medical centers in one geographic region of the United States, the results may not be fully generalizable to other regions or practice environments. Finally, some nonfatal endpoints (eg, myocardial infarction, stroke, and so on) that occurred outside of our healthcare system or were not captured by Medicare and Medicaid claims data may be missed. However, we do not expect that these missed events will significantly bias our results as an individual's Lp(a) level should not impact whether they seek care for an acute event within a given medical system (eg, nondifferential data loss). Additionally, analyses based on all‐cause and cardiovascular mortality are expected to have complete information through national vital status and cause of death data repositories, thereby limiting the possibility of data loss affecting our findings.
Nevertheless, there are multiple important strengths to our study. Our registry will include a large number of patients, making it one of the largest Lp(a) registries with one of the longest follow‐up periods in the United States. Via linkage with Medicare and Medicaid claims data, we will be able to assess for events that occurred outside of our hospital network, thus leveraging the advantages of our detailed patient level information with the power of national claims data. Finally, our study data will be based on a combination of detailed clinical notes, laboratory values, and billing records, which will provide a high degree of accuracy and detail that would not be possible when relying on administrative data alone. Moreover, we will use innovative natural language processing algorithms to adjudicate baseline variables from clinical documentation, a technique which provides highly accurate patient‐level data. 33 , 58 , 59 , 60
5. DISCUSSION
Important work linking Lp(a) with cardiovascular disease combined with the ongoing development of targeted therapies have created a resurgence of interest in this unique lipoprotein. Current ongoing trials 23 , 24 of antisense oligonucleotides and the development of small interfering RNAs (siRNAs) 61 will elucidate the effect of such therapies and their impact on incident ASCVD.
Given the heterogenous level of risk associated with elevated levels of Lp(a), large observational studies are important for further informing the differential risk associated with Lp(a) values and cardiovascular disease. Additionally, developing a deeper understanding of how Lp(a) interacts with other established risk factors will be valuable in refining cardiovascular risk prediction. The Mass General Brigham Lp(a) Registry will provide real‐world data to help inform these important questions. Additionally, this registry may help identify subgroups of patients who may benefit the most from pharmacologic intervention as therapies emerge. Finally, given the large size of our cohort as well as the use of robust methods to determine baseline characteristics and outcomes, our study will provide patient level data that can be used to inform future clinical trials.
CONFLICT OF INTEREST
Dr D. L. B. discloses the following relationships‐Advisory Board: Cardax, Cereno Scientific, Elsevier Practice Update Cardiology, Medscape Cardiology, PhaseBio, PLx Pharma, Regado Biosciences; Board of Directors: Boston VA Research Institute, Society of Cardiovascular Patient Care, TobeSoft; Chair: American Heart Association Quality Oversight Committee; Data Monitoring Committees: Baim Institute for Clinical Research (formerly Harvard Clinical Research Institute, for the PORTICO trial, funded by St. Jude Medical, now Abbott), Cleveland Clinic (including for the ExCEED trial, funded by Edwards), Duke Clinical Research Institute, Mayo Clinic, Mount Sinai School of Medicine (for the ENVISAGE trial, funded by Daiichi Sankyo), Population Health Research Institute; Honoraria: American College of Cardiology (Senior Associate Editor, Clinical Trials and News, ACC.org; Vice‐Chair, ACC Accreditation Committee), Baim Institute for Clinical Research (formerly Harvard Clinical Research Institute; RE‐DUAL PCI clinical trial steering committee funded by Boehringer Ingelheim; AEGIS‐II executive committee funded by CSL Behring), Belvoir Publications (Editor in Chief, Harvard Heart Letter), Duke Clinical Research Institute (clinical trial steering committees, including for the PRONOUNCE trial, funded by Ferring Pharmaceuticals), HMP Global (Editor in Chief, Journal of Invasive Cardiology), Journal of the American College of Cardiology (Guest Editor; Associate Editor), Medtelligence/ReachMD (CME steering committees), Population Health Research Institute (for the COMPASS operations committee, publications committee, steering committee, and USA national co‐leader, funded by Bayer), Slack Publications (Chief Medical Editor, Cardiology Today's Intervention), Society of Cardiovascular Patient Care (Secretary/Treasurer), WebMD (CME steering committees); Other: Clinical Cardiology (Deputy Editor), NCDR‐ACTION Registry Steering Committee (Chair), VA CART Research and Publications Committee (Chair); Research Funding: Abbott, Afimmune, Amarin, Amgen, AstraZeneca, Bayer, Boehringer Ingelheim, Bristol‐Myers Squibb, Cardax, Chiesi, CSL Behring, Eisai, Ethicon, Ferring Pharmaceuticals, Forest Laboratories, Fractyl, Idorsia, Ironwood, Ischemix, Lexicon, Lilly, Medtronic, Pfizer, PhaseBio, PLx Pharma, Regeneron, Roche, Sanofi Aventis, Synaptic, The Medicines Company; Royalties: Elsevier (Editor, Cardiovascular Intervention: A Companion to Braunwald's Heart Disease); Site Co‐Investigator: Biotronik, Boston Scientific, CSI, St. Jude Medical (now Abbott), Svelte; Trustee: American College of Cardiology; Unfunded Research: FlowCo, Merck, Novo Nordisk, Takeda. Dr R. B. received research support from Amgen Inc and Astellas Inc. Dr C. P. C. repots research grants from Amgen, Boehringer‐Ingelheim (BI), Bristol‐Myers Squibb (BMS), Daiichi Sankyo, Janssen, Merck, Pfizer. Additionally, he reports consulting fees from Aegerion, Alnylam, Amarin, Amgen, Applied Therapeutics, Ascendia, BI, BMS, Corvidia, HLS Therapeutics, Innovent, Janssen, Kowa, Merck, Pfizer, Sanofi, Rhoshan. Dr P. N. reports grant support from Amgen, Apple, and Boston Scientific, and is a scientific advisor to Apple and Blackstone Life Sciences. Dr A. T. reports having equity in Brio Systems and research funding from Astra Zeneca, Eli Lilly, Edwards, Novo Nordisk and Sanofi.
Sources of Funding: Dr. Berman is supported by a T32 postdoctoral training grant from the National Heart, Lung, and Blood Institute (T32 HL094301). Dr. Divakaran is supported by a T32 postdoctoral training grant from the National Heart, Lung, and Blood Institute (T32 HL094301).
Supporting information
Berman AN, Biery DW, Ginder C, et al. Study of lipoprotein(a) and its impact on atherosclerotic cardiovascular disease: Design and rationale of the Mass General Brigham Lp(a) Registry. Clin Cardiol. 2020;43:1209–1215. 10.1002/clc.23456
Funding information National Heart, Lung, and Blood Institute, Grant/Award Number: T32 HL094301
DATA AVAILABILITY STATEMENT
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
REFERENCES
- 1. Emerging Risk Factors C , Erqou S, Kaptoge S, et al. Lipoprotein(a) concentration and the risk of coronary heart disease, stroke, and nonvascular mortality. JAMA. 2009;302(4):412‐423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Kamstrup PR, Benn M, Tybjaerg‐Hansen A, Nordestgaard BG. Extreme lipoprotein(a) levels and risk of myocardial infarction in the general population: the Copenhagen City heart study. Circulation. 2008;117(2):176‐184. [DOI] [PubMed] [Google Scholar]
- 3. Kamstrup PR, Tybjaerg‐Hansen A, Steffensen R, Nordestgaard BG. Genetically elevated lipoprotein(a) and increased risk of myocardial infarction. JAMA. 2009;301(22):2331‐2339. [DOI] [PubMed] [Google Scholar]
- 4. Thanassoulis G, Campbell CY, Owens DS, et al. Genetic associations with valvular calcification and aortic stenosis. N Engl J Med. 2013;368(6):503‐512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Kamstrup PR, Tybjaerg‐Hansen A, Nordestgaard BG. Elevated lipoprotein(a) and risk of aortic valve stenosis in the general population. J Am Coll Cardiol. 2014;63(5):470‐477. [DOI] [PubMed] [Google Scholar]
- 6. Pare G, Caku A, McQueen M, et al. Lipoprotein(a) levels and the Risk of myocardial infarction among seven ethnic groups. Circulation. 2019;139(12):1472‐1482. [DOI] [PubMed] [Google Scholar]
- 7. Clarke R, Peden JF, Hopewell JC, et al. Genetic variants associated with Lp(a) lipoprotein level and coronary disease. N Engl J Med. 2009;361(26):2518‐2528. [DOI] [PubMed] [Google Scholar]
- 8. Nave AH, Lange KS, Leonards CO, et al. Lipoprotein (a) as a risk factor for ischemic stroke: a meta‐analysis. Atherosclerosis. 2015;242(2):496‐503. [DOI] [PubMed] [Google Scholar]
- 9. Langsted A, Nordestgaard BG, Kamstrup PR. Elevated lipoprotein(a) and Risk of ischemic stroke. J Am Coll Cardiol. 2019;74(1):54‐66. [DOI] [PubMed] [Google Scholar]
- 10. Tsimikas S. A test in context: lipoprotein(a): diagnosis, prognosis, controversies, and emerging therapies. J Am Coll Cardiol. 2017;69(6):692‐711. [DOI] [PubMed] [Google Scholar]
- 11. Berman AN, Blankstein R. Current and future role of lipoprotein(a) in preventive cardiology. Curr Opin Cardiol. 2019;34(5):514‐518. [DOI] [PubMed] [Google Scholar]
- 12. Berman AN, Blankstein R. Optimizing dyslipidemia management for the prevention of cardiovascular disease: a focus on risk assessment and therapeutic options. Curr Cardiol Rep. 2019;21(9):110. [DOI] [PubMed] [Google Scholar]
- 13. Marston NA, Gurmu Y, Melloni GEM, et al. The effect of PCSK9 (proprotein convertase Subtilisin/Kexin type 9) inhibition on the risk of venous thromboembolism. Circulation. 2020;141(20):1600‐1607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Koschinsky ML, Marcovina SM. Structure‐function relationships in apolipoprotein(a): insights into lipoprotein(a) assembly and pathogenicity. Curr Opin Lipidol. 2004;15(2):167‐174. [DOI] [PubMed] [Google Scholar]
- 15. Thanassoulis G. Screening for high lipoprotein(a). Circulation. 2019;139(12):1493‐1496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Nordestgaard BG, Chapman MJ, Ray K, et al. Lipoprotein(a) as a cardiovascular risk factor: current status. Eur Heart J. 2010;31(23):2844‐2853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Varvel S, McConnell JP, Tsimikas S. Prevalence of elevated Lp(a) mass levels and patient thresholds in 532 359 patients in the United States. Arterioscler Thromb Vasc Biol. 2016;36(11):2239‐2245. [DOI] [PubMed] [Google Scholar]
- 18. Willeit P, Ridker PM, Nestel PJ, et al. Baseline and on‐statin treatment lipoprotein(a) levels for prediction of cardiovascular events: individual patient‐data meta‐analysis of statin outcome trials. Lancet. 2018;392(10155):1311‐1320. [DOI] [PubMed] [Google Scholar]
- 19. Mach F, Baigent C, Catapano AL, et al. 2019 ESC/EAS guidelines for the management of dyslipidaemias: lipid modification to reduce cardiovascular risk. Eur Heart J. 2020;41(1):111‐188. [DOI] [PubMed] [Google Scholar]
- 20. Grundy SM, Stone NJ, Bailey AL, et al. 2018 AHA/ACC/AACVPR/AAPA/ABC/ACPM/ADA/AGS/APhA/ASPC/NLA/PCNA guideline on the management of blood cholesterol: a report of the American College of Cardiology/American Heart Association task force on clinical practice guidelines. J Am Coll Cardiol. 2019;73(24):e285‐e350. [DOI] [PubMed] [Google Scholar]
- 21. Arnett DK, Blumenthal RS, Albert MA, et al. 2019 ACC/AHA guideline on the primary prevention of cardiovascular disease: A report of the American College of Cardiology/American Heart Association Task Force on clinical practice g. Circulation. 2019;140(11):e596‐e646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Khera AV, Everett BM, Caulfield MP, et al. Lipoprotein(a) concentrations, rosuvastatin therapy, and residual vascular risk: an analysis from the JUPITER trial (justification for the use of statins in prevention: an intervention trial evaluating Rosuvastatin). Circulation. 2014;129(6):635‐642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Tsimikas S, Karwatowska‐Prokopczuk E, Gouni‐Berthold I, et al. Lipoprotein(a) reduction in persons with cardiovascular disease. N Engl J Med. 2020;382(3):244‐255. [DOI] [PubMed] [Google Scholar]
- 24. Tsimikas S. RNA‐targeted therapeutics for lipid disorders. Curr Opin Lipidol. 2018;29(6):459‐466. [DOI] [PubMed] [Google Scholar]
- 25. O'Donoghue ML, Fazio S, Giugliano RP, et al. Lipoprotein(a), PCSK9 inhibition, and cardiovascular risk. Circulation. 2019;139(12):1483‐1492. [DOI] [PubMed] [Google Scholar]
- 26. Viney NJ, van Capelleveen JC, Geary RS, et al. Antisense oligonucleotides targeting apolipoprotein(a) in people with raised lipoprotein(a): two randomised, double‐blind, placebo‐controlled, dose‐ranging trials. Lancet. 2016;388(10057):2239‐2253. [DOI] [PubMed] [Google Scholar]
- 27. Identify Subjects/Request Data. https://rc.partners.org/research-apps-and-services/identify-subjects-request-data. Accessed 05/11/2020.
- 28. Marlys L, Koschinsky MBB, Marcovina SM. Lipoprotein(a) In: Ballantyne CM, ed. Clinical Lipidology: a Companion to Braunwald's Heart Disease. 2nd ed. Philadelphia, PA: Saunders; 2015. [Google Scholar]
- 29. Marcovina SM, Albers JJ. Lipoprotein (a) measurements for clinical application. J Lipid Res. 2016;57(4):526‐537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Cao J, Steffen BT, Guan W, et al. Evaluation of lipoprotein(a) electrophoretic and immunoassay methods in discriminating Risk of calcific aortic valve disease and incident coronary heart disease: the multi‐ethnic study of atherosclerosis. Clin Chem. 2017;63(11):1705‐1713. [DOI] [PubMed] [Google Scholar]
- 31. Baudhuin LM, Hartman SJ, O'Brien JF, et al. Electrophoretic measurement of lipoprotein(a) cholesterol in plasma with and without ultracentrifugation: comparison with an immunoturbidimetric lipoprotein(a) method. Clin Biochem. 2004;37(6):481‐488. [DOI] [PubMed] [Google Scholar]
- 32. Langsted A, Kamstrup PR, Nordestgaard BG. High lipoprotein(a) and high risk of mortality. Eur Heart J. 2019;40(33):2760‐2770. [DOI] [PubMed] [Google Scholar]
- 33. Malmasi S, Sandor NL, Hosomura N, Goldberg M, Skentzos S, Turchin A. Canary: an NLP platform for clinicians and researchers. Appl Clin Inform. 2017;8(2):447‐453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Turchin A. Canary NLP Tool Web site. http://canary.bwh.harvard.edu. Accessed 05/11/2020.
- 35. Regan S, Meigs JB, Grinspoon SK, Triant VA. Determinants of smoking and quitting in HIV‐infected individuals. PLoS One. 2016;11(4):e0153103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Singh GK. Area deprivation and widening inequalities in US mortality, 1969‐1998. Am J Public Health. 2003;93(7):1137‐1143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Singh GK, Lin CC. Area deprivation and inequalities in health and health care outcomes. Ann Intern Med. 2019;171(2):131‐132. [DOI] [PubMed] [Google Scholar]
- 38. Singh GK, Siahpush M. Increasing inequalities in all‐cause and cardiovascular mortality among US adults aged 25‐64 years by area socioeconomic status, 1969‐1998. Int J Epidemiol. 2002;31(3):600‐613. [DOI] [PubMed] [Google Scholar]
- 39. Kind AJ, Jencks S, Brock J, et al. Neighborhood socioeconomic disadvantage and 30‐day rehospitalization: a retrospective cohort study. Ann Intern Med. 2014;161(11):765‐774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Kind AJH, Buckingham WR. Making neighborhood‐disadvantage metrics accessible ‐ the neighborhood atlas. N Engl J Med. 2018;378(26):2456‐2458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Jencks SF, Schuster A, Dougherty GB, Gerovich S, Brock JE, Kind AJH. Safety‐net hospitals, neighborhood disadvantage, and readmissions under Maryland's all‐payer program: an observational study. Ann Intern Med. 2019;171(2):91‐98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Wadhera RK, Bhatt DL, Kind AJH, et al. Association of outpatient practice‐level socioeconomic disadvantage with Quality of care and outcomes among older adults with coronary artery disease: implications for value‐based payment. Circ Cardiovasc Qual Outcomes. 2020;13(4):e005977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Goff DC Jr, Lloyd‐Jones DM, Bennett G, et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association task force on practice guidelines. J am Coll Cardiol. 2014;63(25 Pt B):2935‐2959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Coady SA, Sorlie PD, Cooper LS, Folsom AR, Rosamond WD, Conwill DE. Validation of death certificate diagnosis for coronary heart disease: the atherosclerosis risk in communities (ARIC) study. J Clin Epidemiol. 2001;54(1):40‐50. [DOI] [PubMed] [Google Scholar]
- 45. Ives DG, Samuel P, Psaty BM, Kuller LH. Agreement between nosologist and cardiovascular health study review of deaths: implications of coding differences. J Am Geriatr Soc. 2009;57(1):133‐139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Chen Y, Freedman ND, Albert PS, et al. Association of cardiovascular disease with premature mortality in the United States. JAMA Cardiol. 2019;4(12):1230‐1238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Harriss LR, Ajani AE, Hunt D, et al. Accuracy of national mortality codes in identifying adjudicated cardiovascular deaths. Aust N Z J Public Health. 2011;35(5):466‐476. [DOI] [PubMed] [Google Scholar]
- 48. Kiyota Y, Schneeweiss S, Glynn RJ, Cannuscio CC, Avorn J, Solomon DH. Accuracy of Medicare claims‐based diagnosis of acute myocardial infarction: estimating positive predictive value on the basis of review of hospital records. Am Heart J. 2004;148(1):99‐104. [DOI] [PubMed] [Google Scholar]
- 49. Initiative S. Mini‐sentinel medical product assessment: a protocol for active surveillance of acute myocardial infarction in association with use of anti‐diabetic agents. Anti‐Diabetes Drugs and Acute Myocardial Infarction, Hospitalized Heart Failure Web Site. https://www.sentinelinitiative.org/sites/default/files/Drugs/Assessments/Mini-Sentinel_AMI-and-Anti-Diabetic-Agents_Protocol_0.pdf. Published 2016. Updated January 27, 2016. Accessed October 16, 2019.
- 50. Yeh RW, Sidney S, Chandra M, Sorel M, Selby JV, Go AS. Population trends in the incidence and outcomes of acute myocardial infarction. N Engl J Med. 2010;362(23):2155‐2165. [DOI] [PubMed] [Google Scholar]
- 51. Andrade SE, Harrold LR, Tjia J, et al. A systematic review of validated methods for identifying cerebrovascular accident or transient ischemic attack using administrative data. Pharmacoepidemiol Drug Saf. 2012;21(suppl 1):100‐128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Birman‐Deych E, Waterman AD, Yan Y, Nilasena DS, Radford MJ, Gage BF. Accuracy of ICD‐9‐CM codes for identifying cardiovascular and stroke risk factors. Med Care. 2005;43(5):480‐485. [DOI] [PubMed] [Google Scholar]
- 53. Wang SV, Franklin JM, Glynn RJ, Schneeweiss S, Eddings W, Gagne JJ. Prediction of rates of thromboembolic and major bleeding outcomes with dabigatran or warfarin among patients with atrial fibrillation: new initiator cohort study. BMJ. 2016;353:i2607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Patorno E, Goldfine AB, Schneeweiss S, et al. Cardiovascular outcomes associated with canagliflozin versus other non‐gliflozin antidiabetic drugs: population based cohort study. BMJ. 2018;360:k119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Cozzolino F, Montedori A, Abraha I, et al. A diagnostic accuracy study validating cardiovascular ICD‐9‐CM codes in healthcare administrative databases. The Umbria data‐value project. PLoS One. 2019;14(7):e0218919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Tamariz L, Harkins T, Nair V. A systematic review of validated methods for identifying venous thromboembolism using administrative and claims data. Pharmacoepidemiol Drug Saf. 2012;21(Suppl 1):154‐162. [DOI] [PubMed] [Google Scholar]
- 57. White RH, Garcia M, Sadeghi B, et al. Evaluation of the predictive value of ICD‐9‐CM coded administrative data for venous thromboembolism in the United States. Thromb Res. 2010;126(1):61‐67. [DOI] [PubMed] [Google Scholar]
- 58. Nadkarni PM, Ohno‐Machado L, Chapman WW. Natural language processing: an introduction. J Am Med Inform Assoc. 2011;18(5):544‐551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Friedman C, Hripcsak G. Natural language processing and its future in medicine. Acad Med. 1999;74(8):890‐895. [DOI] [PubMed] [Google Scholar]
- 60. Yim WW, Yetisgen M, Harris WP, Kwan SW. Natural language processing in oncology: a review. JAMA Oncol. 2016;2(6):797‐804. [DOI] [PubMed] [Google Scholar]
- 61. Randomized Study to Evaluate Efficacy, Safety, and Tolerability of AMG 890 in Subjects With Elevated Lipoprotein(a) https://clinicaltrials.gov/ct2/show/NCT04270760. Accessed May 14, 2020.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.