PharmGKB, an Integrated Resource of Pharmacogenomic Knowledge

Li Gong; Michelle Whirl-Carrillo; Teri E Klein

doi:10.1002/cpz1.226

Author manuscript; available in PMC: 2021 Dec 7.

Published in final edited form as: Curr Protoc. 2021 Aug;1(8):e226. doi: 10.1002/cpz1.226

PMCID: PMC8650697 NIHMSID: NIHMS1759296 PMID: 34387941

Abstract

The Pharmacogenomics Knowledgebase (PharmGKB) is an integrated online knowledge resource for the understanding of how genetic variation contributes to variation in drug response. Our focus includes not only pharmacogenomic information useful for clinical implementation (e.g., drug dosing guidelines and annotated drug labels), but also information to catalyze scientific research and drug discovery (e.g., variant-drug annotations and drug-centered pathways). As of April 2021, the annotated content of PharmGKB spans 715 drugs, 1761 genes, 227 diseases, 165 clinical guidelines, and 784 drug labels. We have manually curated data from more than 9000 published papers to generate the content of PharmGKB. Recently, we have also implemented an automated natural language processing (NLP) tool to broaden our coverage of the pharmacogenomic literature. This article contains a basic protocol describing how to navigate the PharmGKB website to retrieve information on how genes and genetic variations affect drug efficacy and toxicity. It also includes a protocol on how to use PharmGKB to facilitate interpretation of findings for a pharmacogenomic variant genotype or metabolizer phenotype. PharmGKB is freely available at http://www.pharmgkb.org.

Keywords: drug response, genetic variation, knowledge base, pharmacogenetics, pharmacogenomics

INTRODUCTION

Pharmacogenomics (PGx) is the study of how genetic variation contributes to variation in drug response. Driven by technology advancements in the post-genomic era, pharmacogenomics research has the potential to optimize drug efficacy and minimize toxicity. It bridges the gap between the scientific discoveries and clinical applications, and offers the exciting promise of precision medicine. The Pharmacogenomics Knowledgebase (PharmGKB; http://www.pharmgkb.org) was established in 2000 to collect, curate, and disseminate knowledge about pharmacogenomics from a number of sources, including the scientific literature, drug labels, and clinical guidelines (Klein & Altman, 2004; Whirl-Carrillo et al., 2012). It is the central knowledge repository for pharmacogenomics information including drug dosing guidelines, drug label annotations, clinical and variant annotations, drug-centered pathways, pharmacogene summaries, and relationships among genes, drugs, and diseases.

PharmGKB serves diverse user groups from both the clinical and scientific communities. It provides comprehensive and integrated drug, gene, variant, and disease information to pharmacologists, clinical investigators, biologists, and informaticians, as well as the general public. The PharmGKB homepage is designed to highlight the primary interests of most users. All PharmGKB data are now freely available under a Creative Commons license, with no user registration required to view our content. PharmGKB is also an excellent educational portal for anyone new to pharmacogenomics. We provide a short “guided page tour” and training exercises (https://api.pharmgkb.org/v1/download/file/attachment/PharmGKB_Training_Exercises.zip) on how to use the website. We also provide lecture materials, tutorials, and useful links intended to help users familiarize themselves with the fundamental concepts of pharmacogenomics research and how the information is used clinically.

The protocols in this article describe how to use PharmGKB to browse and understand pharmacogenomic knowledge. Basic Protocol 1 walks a user through the PharmGKB homepage, helping users familiarize themselves with the content of PharmGKB and how to conduct basic searches. Basic Protocol 2 describes how to use PharmGKB to facilitate interpretation of pharmacogenomic variant genotypes or metabolizer phenotypes.

BASIC PROTOCOL 1

NAVIGATING THE HOMEPAGE OF PharmGKB AND SEARCHING BY DRUG

This protocol introduces the basic techniques for searching and browsing the content of PharmGKB. It starts with a quick tour of the PharmGKB home page. It then walks through an example: searching for the drug “clopidogrel” to find the associated genes, variants, prescribing information, pathways, and annotations.

Necessary Resources

Hardware

Computer with an Internet connection

Software

An up-to-date Web browser (such as Google Chrome, Firefox, or Safari)

Files

No input files required

Open the PharmGKB homepage at http://www.pharmgkb.org in a web browser.

The PharmGKB homepage is the common entry point for all users. It has been designed to highlight the types of information that are most sought after by diverse groups of users, providing shortcuts to drugs, pathways, dosing guideline annotations, and drug label annotations (Fig. 1). The principal content of PharmGKB is listed in Table 1. The main search bar sits at the top of the page, where a user can enter drugs, genes, variants, or combinations of these terms for a Google-type query. Prominently displayed in the center of the homepage are clickable icons that provide shortcuts to drugs, pathways, dosing guidelines, and drug label annotations, along with summary statistics for each within the PharmGKB knowledge base. Right below the icons is a diagram where someone new to pharmacogenomics can click on “What is Pharmacogenomics?” (https://www.pharmgkb.org/whatIsPharmacogenomics) to learn about basic genetics and pharmacogenomics concepts, or “Learn more about PharmGKB” (https://www.pharmgkb.org/whatIsPharmgkb) to learn about various features of PharmGKB. There is also a link to an interactive “Page tour” (https://www.pharmgkb.org/?tour=true) at the top of the homepage under “Help.”
Click on the “Drug Label Annotations” icon on the left-hand side of the homepage to browse the list of labels annotated with a pharmacogenomics summary (Fig. 2).

As drug labels began to include information about how to adjust drug prescribing based on a person’s genotype or metabolizer phenotype, PharmGKB correspondingly annotated international labels from the U.S. Food and Drug Administration (FDA), European Medicines Agency (EMA), Swiss Agency of Therapeutic Products (Swissmedic), Pharmaceuticals and Medical Devices Agency, Japan (PMDA), and Health Canada (Santé Canada) (HCSC). Figure 2 provides an overview of PharmGKB’s annotation of drug labels. Showing these international drug labels side by side gives a comparative summary of how different agencies provide PGx information within their labels. If the label provides a dose adjustment based on gene/protein/chromosomal variants or phenotypes, the label annotation is tagged with a “Dosing info” tag. For example, the FDA label for aripiprazole (https://www.pharmgkb.org/labelAnnotation/PA166104839) is tagged with “Dosing info” because the label states that CYP2D6 poor metabolizers taking aripiprazole should have their dose reduced by 50%. If the label states that a drug is either indicated or contraindicated for a particular set of patients based on gene/protein/chromosomal variants or phenotypes, the “Alternative Drug” tag is applied to the label annotation. For example, the FDA label for abacavir (https://www.pharmgkb.org/labelAnnotation/PA166104833) is tagged this way because the label states that abacavir is contraindicated in patients with the HLA-B*5701 allele due to the risk of hypersensitivity reactions.
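
The two tagging rules described above can be sketched in a few lines of Python. This is only an illustration of the logic as stated in the text: the `LabelFinding` class, its field names, and the `prescribing_tags` function are hypothetical and are not part of any PharmGKB software or API.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LabelFinding:
    """Hypothetical summary of what a curator found in one drug label."""
    gives_dose_adjustment: bool          # dose adjusted by genotype/phenotype
    indicates_or_contraindicates: bool   # drug (contra)indicated for a genetic subgroup


def prescribing_tags(finding: LabelFinding) -> List[str]:
    """Apply the two tagging rules described in the text."""
    tags = []
    if finding.gives_dose_adjustment:
        tags.append("Dosing info")
    if finding.indicates_or_contraindicates:
        tags.append("Alternative Drug")
    return tags


# The two examples from the text: aripiprazole (dose reduction for CYP2D6
# poor metabolizers) and abacavir (contraindicated with HLA-B*5701).
aripiprazole_fda = LabelFinding(gives_dose_adjustment=True,
                                indicates_or_contraindicates=False)
abacavir_fda = LabelFinding(gives_dose_adjustment=False,
                            indicates_or_contraindicates=True)

print(prescribing_tags(aripiprazole_fda))  # ['Dosing info']
print(prescribing_tags(abacavir_fda))      # ['Alternative Drug']
```

A label that both adjusts the dose and restricts use in a genetic subgroup would receive both tags under this sketch.
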
Type the drug name clopidogrel in the search box, then press Enter.

Drugs can be searched for either by typing the drug’s generic or trade name in the search box or by clicking on the “annotated drugs” icon on the home page and then browsing the alphabetically sorted drug list. The search function of PharmGKB uses an autocomplete service to make suggestions as the user types. If no result is returned, try a synonym or partial name. PharmGKB includes alternative names and symbols that may have been used in the literature for drugs or genes.
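
To illustrate why a trade name or a partial name can still find a drug, the following minimal Python sketch matches a query against generic and alternative names. It is a toy model under stated assumptions: the `DRUG_NAMES` index and the `suggest` function are hypothetical, the index holds only two entries, and the code does not reflect how the PharmGKB autocomplete service is implemented.

```python
# Hypothetical in-memory index: generic name -> alternative (trade) names.
# Only two entries are included, for illustration.
DRUG_NAMES = {
    "clopidogrel": ["Plavix"],
    "warfarin": ["Coumadin"],
}


def suggest(query, index=DRUG_NAMES):
    """Return generic names whose generic or alternative name contains the query."""
    q = query.strip().lower()
    if not q:
        return []
    hits = []
    for generic, alternatives in index.items():
        names = [generic] + alternatives
        # Case-insensitive substring match covers both partial names and synonyms.
        if any(q in name.lower() for name in names):
            hits.append(generic)
    return sorted(hits)


print(suggest("clopi"))   # ['clopidogrel']  (partial generic name)
print(suggest("plavix"))  # ['clopidogrel']  (trade name)
```
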
Open the clopidogrel drug page (Fig. 3).

PharmGKB drug, gene, disease and variant pages are structured similarly, with the navigation pane on the left. The “overview” tab includes information on chemical structure of the drug, alternative names used, and metabolites, as well as details such as molecular properties and short summaries of pharmacokinetics (PK), pharmacodynamics (PD), and pharmacogenomics (PGx) for drugs that have been well-studied in pharmacogenomics. Additional tabs include prescribing info, drug label annotations, clinical and variant annotations, literature, pathways, automated annotations, links, and downloads. The user can click on each section on the left to retrieve specific information for the drug of interest.
Click on the “Prescribing Info” tab from the navigation pane on the left to access information regarding drug dose or therapy adjustments based on genetic information for clopidogrel (Fig. 4).

The prescribing information may come from clinical guidelines for how to adjust treatment of certain medications based on a person’s genetic information, drug labels that have prescribing recommendations based on a particular genotype/metabolizer phenotype, and Rx annotations from publications.
Under Source “Clinical Pharmacogenetics Implementation Consortium” (CPIC), click the “Read Now” button to access dosing guidelines for clopidogrel published by CPIC (Fig. 5).

PharmGKB annotates clinical dosing guidelines published by professional societies such as the Clinical Pharmacogenetics Implementation Consortium (CPIC), the Royal Dutch Association for the Advancement of Pharmacy–Pharmacogenetics Working Group (DPWG), the Canadian Pharmacogenomic Network for Drug Safety (CPNDS), and other professional societies. The dosing guidelines are curated manually by PharmGKB curators to highlight the prescribing recommendations based on a patient’s genotype or metabolizer phenotype, with excerpts from the guidelines and links to the underlying evidence and the associated genes and drugs. In addition to the information directly provided by the guidelines, PharmGKB provides an interactive allele picker that allows specific alleles/genotypes to be selected from pull-down menus, and then provides the resulting inferred phenotype (e.g., ultra-metabolizer), associated dosing advice, and strength of the recommendation. PharmGKB has also created video explanations of a number of CPIC guidelines, embedded from a dedicated YouTube channel.
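
The genotype-to-phenotype step performed by the allele picker can be sketched as a simple table lookup. The Python example below is a simplified illustration for CYP2C19 covering only four alleles. The names `ALLELE_FUNCTION`, `PHENOTYPE`, and `infer_phenotype` are hypothetical, and the function assignments are our reading of the standard CPIC terms. The authoritative allele function and diplotype-phenotype tables are the PGx gene information tables listed in Table 1. The sketch leaves out dosing advice and recommendation strength.

```python
# Simplified CYP2C19 allele functions (a small illustrative subset only).
ALLELE_FUNCTION = {"*1": "normal", "*2": "none", "*3": "none", "*17": "increased"}

# Phenotype for each unordered pair of allele functions.
# Keys are tuples in alphabetical order so lookups are order-independent.
PHENOTYPE = {
    ("normal", "normal"): "normal metabolizer",
    ("increased", "normal"): "rapid metabolizer",
    ("increased", "increased"): "ultrarapid metabolizer",
    ("none", "normal"): "intermediate metabolizer",
    ("increased", "none"): "intermediate metabolizer",
    ("none", "none"): "poor metabolizer",
}


def infer_phenotype(diplotype):
    """Translate a diplotype string such as '*1/*2' into a metabolizer phenotype."""
    alleles = diplotype.split("/")
    if len(alleles) != 2:
        raise ValueError(f"expected two alleles, got: {diplotype!r}")
    try:
        functions = tuple(sorted(ALLELE_FUNCTION[a.strip()] for a in alleles))
    except KeyError as missing:
        raise ValueError(f"allele not covered by this sketch: {missing}") from None
    return PHENOTYPE[functions]


print(infer_phenotype("*1/*2"))    # intermediate metabolizer
print(infer_phenotype("*2/*2"))    # poor metabolizer
print(infer_phenotype("*17/*17"))  # ultrarapid metabolizer
```

Sorting the two function labels before the lookup means that “*1/*2” and “*2/*1” resolve to the same phenotype, which mirrors the fact that the order of alleles in a diplotype carries no meaning.
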
Click on the “Drug Label Annotations” tab from the navigation pane on the left to access label annotations from regulatory agencies around the world (Fig. 6). Click on the “View Drug Label Annotation Legend” to see a detailed description of the sources, and definition of PGx levels and tags. Click on “Read now” next to “U.S. Food and Drug Administration” to access the individual label annotations from the FDA.

The “Drug Label Annotations” tab lists annotations on medication labels that contain PGx information. The label annotations include a brief summary of the PGx in the label, an excerpt from the label, and a downloadable highlighted label PDF file. The annotated labels are also tagged with “PGx Level” (“Testing required,” “Testing Recommended,” “Actionable PGx,” and “Informative PGx”; https://www.pharmgkb.org/page/drugLabelLegend#pgx-level) to indicate the PharmGKB interpretation of the level of action implied in each label. The “Prescribing” section of the annotation captures guidance from the label for patients with a particular genotype or metabolizer phenotype. For U.S. FDA labels, the annotation also includes information on whether the label is on the FDA’s Table of Pharmacogenomic Biomarkers in Drug Labeling (https://www.fda.gov/drugs/science-and-research-drugs/table-pharmacogenomic-biomarkers-drug-labeling). We also recently added a section on information from the Table of Pharmacogenetic Associations (https://www.fda.gov/medical-devices/precision-medicine/table-pharmacogenetic-associations) curated by the FDA. Furthermore, we have developed a new landing page (https://www.pharmgkb.org/fdaLabelAnnotations) specifically for FDA-approved drug label annotations that can be sorted and filtered by different criteria in the column headings, e.g., by genes, drugs, PGx levels, or the prescribing tags.
Click on the “Clinical Annotations” tab from the navigation pane on the left to access clinical annotations for clopidogrel. The clinical annotation table can be sorted and/or filtered by evidence level, variant, gene, type, or phenotype. Click on “Read Now” to access the clinical annotations for CYP2C19 *1, *2, *3, *4, *5, *6, *8 and clopidogrel response (Fig. 7).

For more details on clinical annotation, see Basic Protocol 2.

Clinical annotations summarize all of PharmGKB’s annotations of published literature for the relationship between a particular genetic variant and a medication. They are given a rating by PharmGKB depending on how much published evidence there is for a relationship found in PharmGKB, and the consistency and quality of that evidence. Clinical annotations are based on variant annotations in PharmGKB, as well as prescribing information from dosing guidelines and FDA labels. Clinical annotations are also tagged with the relevant drug, phenotype, and broad phenotypic categories of toxicity, dosage, efficacy, PD, or metabolism/PK.
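
Sorting a table of clinical annotations by evidence rating amounts to ordering rows on a fixed scale. The minimal Python sketch below assumes the level labels 1A, 1B, 2A, 2B, 3, and 4 (strongest to weakest) shown on the PharmGKB site. The rows are placeholders, not real annotations, and the names `LEVEL_ORDER`, `sort_by_evidence`, and `at_least` are hypothetical.

```python
# Assumed evidence scale, strongest first.
LEVEL_ORDER = ["1A", "1B", "2A", "2B", "3", "4"]
RANK = {level: i for i, level in enumerate(LEVEL_ORDER)}


def sort_by_evidence(annotations):
    """Return annotations ordered from strongest to weakest evidence level."""
    return sorted(annotations, key=lambda a: RANK[a["level"]])


def at_least(annotations, weakest_level):
    """Keep annotations whose level is at least as strong as weakest_level."""
    cutoff = RANK[weakest_level]
    return [a for a in annotations if RANK[a["level"]] <= cutoff]


# Placeholder rows (NOT real PharmGKB data).
rows = [
    {"variant": "variant_X", "level": "3"},
    {"variant": "variant_Y", "level": "1A"},
    {"variant": "variant_Z", "level": "2A"},
]

print([r["level"] for r in sort_by_evidence(rows)])  # ['1A', '2A', '3']
print([r["variant"] for r in at_least(rows, "2B")])  # ['variant_Y', 'variant_Z']
```

An explicit rank table is used rather than plain string sorting so that the ordering stays correct if the scale is ever extended with labels that do not sort alphabetically.
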
Click on the “Variant Annotations” tab from the navigation pane on the left to access variant annotations for clopidogrel. The variant annotation table can be sorted and/or filtered by variant, molecules, p-value, study size, ethnicities, etc. Click on “24762860” under column “PMID” to see the list of variant annotations for that specific article.

Variant annotations are summaries of a genetic association between a single variant and a specific drug response (metabolism, efficacy, dose or toxicity) from a single publication. PharmGKB curators routinely scan pharmacogenomic literature, curating these associations and adding the information to PharmGKB. A single paper can be annotated with multiple variant annotations if multiple variants and/or phenotypes are investigated in the study. Sometimes variant annotations also document cases where no association is found between a variant and drug from the literature. It’s important to capture negative findings, as they also contribute to the overall evaluation of the evidence base between a variant and drug response phenotype. Clinical annotations are built upon summaries of variant annotations along with other variant-specific prescribing information. The associations that are less consistent across different studies are likely to be assigned a lower level of evidence. All variant annotations include a summary sentence constructed from standardized vocabularies. Curators can add additional notes or description of the findings as free text to appear below the summary sentence. Variant annotations are also tagged with a broad phenotypic category of toxicity, dosage, efficacy, PD, or metabolism/PK, and often include additional parameters about study size, cohort description, statistical significance, and ethnicity information. At the bottom of the variant annotation page, the history section describes when the annotation was created and if or when the annotation was changed or edited.
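
The relationship between papers, variant annotations, and positive or negative findings can be made concrete with a small tally. The Python sketch below uses placeholder records (not real PharmGKB data) with hypothetical field names. It shows one paper carrying several annotations, and negative findings being counted alongside positive ones when the evidence for a variant is summed up.

```python
from collections import Counter, defaultdict

# Placeholder variant annotations (NOT real PharmGKB data).
# "associated" is False when the paper reported no association.
records = [
    {"pmid": "PMID_A", "variant": "variant_1", "associated": True},
    {"pmid": "PMID_A", "variant": "variant_2", "associated": False},
    {"pmid": "PMID_B", "variant": "variant_1", "associated": True},
]


def annotations_per_paper(records):
    """Count how many variant annotations each paper contributes."""
    return Counter(r["pmid"] for r in records)


def findings_per_variant(records):
    """Tally positive and negative findings for each variant."""
    summary = defaultdict(lambda: {"positive": 0, "negative": 0})
    for r in records:
        key = "positive" if r["associated"] else "negative"
        summary[r["variant"]][key] += 1
    return dict(summary)


print(annotations_per_paper(records)["PMID_A"])  # 2
print(findings_per_variant(records)["variant_2"])
# {'positive': 0, 'negative': 1}
```
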
Click on the “Pathways” tab to view all pathways associated with clopidogrel. Click on “Clopidogrel Pathway, pharmacokinetics (PK)” to view the simplified diagram of genes/proteins involved in the metabolism of clopidogrel (Fig. 8).

The interactive drug-centered pathways displayed on PharmGKB provide an overview of how genes are involved in the pharmacokinetics (PK) and pharmacodynamics (PD) of drugs. Our PK pathways describe candidate genes involved in the absorption, distribution, metabolism, and excretion of a given drug, while the PD pathways illustrate the physiological effects of the drug, its mechanism of action, and possible side effects. The pathway diagrams use standard shapes and colors to represent genes, metabolites, drugs, and interactions. All genes and drugs on the pathway diagram are clickable. If the user clicks on these objects, the PharmGKB gene or drug page opens in a new browser window. Below the pathway picture is a “Description” of the pathway that describes the complex gene-drug relationships depicted in the pathway diagram. The pathway authors and the date of the most recent update are listed below the text of the description. The navigation pane on the left has sections for components, related pathways, and downloads. If PharmGKB has both PK and PD pathways for a given drug, the user will find them under “Related pathways.” Under “Downloads,” the user can download pathway diagrams in both Adobe Illustrator and PDF formats. Pathway data can also be downloaded in TSV, BioPAX, or GPML formats. In addition to being freely available on the website, PharmGKB VIPs and pathways are also typically featured in the journal “Pharmacogenetics and Genomics.”
Click on “Automated annotations” to see a list of annotations derived from a text-mining system that has scanned sentences in the literature (Fig. 9). The annotation highlights a chemical and a variation in a sentence that likely involves pharmacogenomic information.

In order to broaden our coverage of the ever-expanding pharmacogenomic literature, we have developed an automated NLP/text mining tool (PGxMine) to computationally extract possible variant-drug relationships from abstracts in PubMed or full-text articles in PubMed Central (Lever et al., 2020). These annotations use completely computational methods to find variants, drugs, and their associations, and have not been reviewed by a curator to check their accuracy. The sentences are extracted from abstracts in PubMed and full-text articles from the PubMed Central Open Access subset and Author Manuscript Collection.
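
To give a feel for what sentence-level extraction of variant-drug candidates involves, the Python sketch below flags sentences in which a drug name and an rsID co-occur. It is a deliberately naive stand-in and is not the method used by PGxMine: the drug lexicon, the regular expressions, the function name `candidate_pairs`, and the example text are all invented for illustration.

```python
import re

RSID = re.compile(r"\brs\d+\b")                 # dbSNP-style identifiers
SENTENCE_BREAK = re.compile(r"(?<=[.!?])\s+")   # naive sentence splitter
DRUGS = {"clopidogrel", "simvastatin"}          # hypothetical mini-lexicon


def candidate_pairs(text):
    """Return (drug, rsid, sentence) for each co-occurrence within a sentence."""
    pairs = []
    for sentence in SENTENCE_BREAK.split(text.strip()):
        lowered = sentence.lower()
        drugs = sorted(d for d in DRUGS if d in lowered)
        variants = RSID.findall(sentence)
        for drug in drugs:
            for variant in variants:
                pairs.append((drug, variant, sentence))
    return pairs


# Invented example text, not taken from any publication.
text = ("We tested whether rs4149056 was associated with response to simvastatin. "
        "Clopidogrel was not studied here.")

for drug, variant, sentence in candidate_pairs(text):
    print(drug, variant)  # simvastatin rs4149056
```

Co-occurrence alone cannot tell whether a sentence reports an association, no association, or something unrelated, which is one reason automated annotations of this kind are presented separately from curated ones.
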
Under the “Links & Downloads” tab, click on links to go to external databases where additional information on clopidogrel may be found.

PharmGKB has established bidirectional links with many leading gene, protein, and drug resources, such as NCBI PubChem, ChEBI, RxNorm, Entrez Gene, GeneCards, UniProtKB, and DrugBank.

Figure 1. The PharmGKB home page (http://www.pharmgkb.org).

Table 1.

Summary of Content at PharmGKB

*Clinical pharmacogenomics*
Clinical guideline annotations	Clinical guidelines on how to adjust treatment of certain medications based on genetic information
Drug label annotations	Annotated drug labels containing PGx information from regulatory agencies around the world
Clinical annotations	Evidence-rated summaries of all the literature evidence for a particular genetic variant–drug association
Rx annotations	Summaries of individual publications that provide medication dosing or prescribing information based on genetic information
*Research pharmacogenomics*
Variant annotations	Summaries of genetic variant–drug associations as reported in a single publication
Drug pathways	Illustrated diagrams of the pharmacokinetics or pharmacodynamics of a PGx-relevant drug and accompanying text describing the drug-gene interactions and pharmacogenomic findings
VIP PGx gene summaries	Written reviews of PGx-relevant genes, highlighting genomic organization, functional and clinical impact of genetic variations
PGx gene information tables	Gene resource tables created by CPIC and PharmGKB to summarize allele function, definition, frequency, and diplotype-phenotype translation

Figure 4. PharmGKB Prescribing info tab under the drug page for clopidogrel.

Figure 5. Example of PharmGKB clinical dosing guideline: The Clinical Pharmacogenetics Implementation Consortium (CPIC) guideline for clopidogrel and CYP2C19.

Figure 6. An overview of the annotated drug labels for clopidogrel on PharmGKB.

Figure 7. Example of PharmGKB clinical annotation for clopidogrel.

Figure 8. Example of PharmGKB drug-centered pathway for clopidogrel, pharmacokinetics.

Figure 9. Automated annotations for clopidogrel using PGxMine.

BASIC PROTOCOL 2

SEARCHING BY VARIANT/HAPLOTYPE TO FACILITATE UNDERSTANDING OF THE IMPACT OF A GENETIC VARIATION ON DRUG RESPONSE

Direct-to-consumer genetic testing has become more widely available, and many people may have their genetic testing results in hand or have previously had their genome sequenced. The interpretation of genotyping results remains a challenge as more and more new evidence emerges. Many variants included in the tests on the market have varying levels of evidence to support their clinical relevance. Some may have an abundance of consistent clinical findings to support the claim, while others may either lack clinical data backing the claim or have disagreements about the role of the genetic variants contributing to a drug response phenotype. This protocol introduces the basic techniques for searching PharmGKB with a specific variant/haplotype to find out how it may impact the function of the gene and which drug therapies might be affected, as well as the level of confidence for a specific drug-variant association. We will use genotype calls on a specific SNP (rs4149056) and a diplotype call on a star allele (e.g., CYP2C9*3/*3) as examples. We will also demonstrate how to perform searches using a combination of search terms.
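
Before searching, it can help to recognize which kind of term is in hand. The Python sketch below distinguishes an rsID from a star-allele diplotype using the two examples named above. The patterns and the function name `classify_query` are hypothetical simplifications and do not describe how the PharmGKB search bar parses its input.

```python
import re

RSID_RE = re.compile(r"^rs\d+$")
# Gene symbol followed by two star alleles, e.g. CYP2C9*3/*3.
DIPLOTYPE_RE = re.compile(r"^([A-Z0-9]+)(\*\w+)/(\*\w+)$")


def classify_query(term):
    """Classify a search term as an rsID, a star-allele diplotype, or free text."""
    term = term.strip()
    if RSID_RE.match(term):
        return {"kind": "rsid", "rsid": term}
    match = DIPLOTYPE_RE.match(term)
    if match:
        gene, allele_1, allele_2 = match.groups()
        return {"kind": "diplotype", "gene": gene, "alleles": [allele_1, allele_2]}
    return {"kind": "text", "text": term}


print(classify_query("rs4149056"))
# {'kind': 'rsid', 'rsid': 'rs4149056'}
print(classify_query("CYP2C9*3/*3"))
# {'kind': 'diplotype', 'gene': 'CYP2C9', 'alleles': ['*3', '*3']}
```
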