Skip to main content
PeerJ logoLink to PeerJ
. 2023 Dec 5;11:e16087. doi: 10.7717/peerj.16087

Dark kinase annotation, mining, and visualization using the Protein Kinase Ontology

Saber Soleymani 1, Nathan Gravel 2, Liang-Chin Huang 2, Wayland Yeung 2, Elika Bozorgi 1, Nathaniel G Bendzunas 3, Krzysztof J Kochut 1,, Natarajan Kannan 2,3,
Editor: Aristóteles Góes-Neto
PMCID: PMC10704995  PMID: 38077442

Abstract

The Protein Kinase Ontology (ProKinO) is an integrated knowledge graph that conceptualizes the complex relationships among protein kinase sequence, structure, function, and disease in a human and machine-readable format. In this study, we have significantly expanded ProKinO by incorporating additional data on expression patterns and drug interactions. Furthermore, we have developed a completely new browser from the ground up to render the knowledge graph visible and interactive on the web. We have enriched ProKinO with new classes and relationships that capture information on kinase ligand binding sites, expression patterns, and functional features. These additions extend ProKinO’s capabilities as a discovery tool, enabling it to uncover novel insights about understudied members of the protein kinase family. We next demonstrate the application of ProKinO. Specifically, through graph mining and aggregate SPARQL queries, we identify the p21-activated protein kinase 5 (PAK5) as one of the most frequently mutated dark kinases in human cancers with abnormal expression in multiple cancers, including a previously unappreciated role in acute myeloid leukemia. We have identified recurrent oncogenic mutations in the PAK5 activation loop predicted to alter substrate binding and phosphorylation. Additionally, we have identified common ligand/drug binding residues in PAK family kinases, underscoring ProKinO’s potential application in drug discovery. The updated ontology browser and the addition of a web component, ProtVista, which enables interactive mining of kinase sequence annotations in 3D structures and Alphafold models, provide a valuable resource for the signaling community. The updated ProKinO database is accessible at https://prokino.uga.edu.

Keywords: Curation, User interface, Drug discovery, Cell signaling, Protein structure, Protein function, Network biology, Cancer genomes

Introduction

The protein kinase gene family with nearly 535 human members (collectively called the human kinome) is a biomedically important gene family associated with many human diseases such as cancer, diabetes, Alzheimer’s disease, Parkinson’s disease, and inflammatory disorders. They make up one-third of all drug-related protein target discoveries in the pharmaceutical industry, with over 50 FDA-approved drugs developed since 2001 (Ferguson & Gray, 2018; Zhang, Yang & Gray, 2009). However, despite decades of research on the protein kinase family, our current knowledge of the kinome is skewed towards a subset of well-studied kinases with nearly one-third of the kinome largely understudied. These understudied kinases, collectively referred to as the “dark” kinome by the Knowledge Management Center (KMC) (Nguyen et al., 2017) within the Illuminating the Druggable Genome (IDG) consortium, constitute both active kinases and inactive pseudokinases, which lack one or more of the active site residues, but perform important scaffolding and regulatory roles in signaling pathways (Byrne, Foulkes & Eyers, 2017; Eyers, Keeshan & Kannan, 2017; Eyers & Murphy, 2013; Murphy, Mace & Eyers, 2017) and are druggable (Foulkes et al., 2018). Incomplete knowledge of the structure, function, and regulation of these understudied kinases and pseudokinases presents a major bottleneck for drug discovery efforts. While multiple initiatives are beginning to generate essential tools and resources to characterize dark kinases, integrative mining of these datasets is necessary to develop new testable hypotheses on dark kinase functions. However, integrative mining of protein kinase data is a challenge because of the diverse and disparate nature of protein kinase data sources and formats. Information on the structural and functional aspects of dark kinases, for example, is scattered in the literature posing unique challenges for researchers interested in formulating routine queries such as “disease mutations mapping to conserved structural and functional regions of the kinome” or “post-translational modifications (PTMs) in the activation loop of dark kinases.” Formulating such aggregate queries requires researchers to go through the often time-consuming and error-prone process of collating information from various data sources through customized computer programs, which results in duplication of efforts across laboratories, and does not scale well with the growing complexity and diversity of protein kinase data. For these reasons, the IDG consortium has developed a unified resource, Pharos, for collating diverse forms of information on druggable proteins, including protein kinases (Nguyen et al., 2017; Sheils et al., 2020, 2021). A focused Dark Kinase Knowledgebase has also been developed to make experimental data available on dark kinases to the broader research community (Berginski et al., 2021; Moret et al., 2021). While these unified resources provide a wide range of valuable information on druggable proteins, they offer limited data analytics capabilities in mining sequence and structural data. They do not conceptualize protein kinases’ detailed structural and functional knowledge in a practical and understandable way for protein kinase researchers. Thus, to accelerate the biochemical characterization of understudied dark kinases, a semantically meaningful and mineable representation of the kinase knowledge base is needed (Fig. 1).

Figure 1. The ProKinO architecture and work-flow.

Figure 1

(A) A subset of curated data sources used in ontology population. (B) A schematic of the ontology schema with classes (boxes) and relationships (lines) connecting the classes. (C) Applications for ontology browsing and navigation.

To semantically represent protein kinase data in ways protein kinase researchers use and understand, we previously reported the development of a focused protein kinase ontology, ProKinO (Gosal, Kannan & Kochut, 2011; Gosal, Kochut & Kannan, 2011; McSkimming et al., 2015), which integrates and conceptualizes diverse forms of protein kinase data in computer- and human-readable format (Fig. 2). The ontology is instantiated with curated data from internal and external sources and enables aggregate queries linking diverse forms of data in one place. ProKinO enables the generation of new knowledge regarding kinases and pathways altered in various cancer types, and new testable hypotheses regarding the structural and functional impact of disease mutations (Bailey et al., 2015; Cicenas & Cicenas, 2016; Goldberg et al., 2013; Gosal, Kannan & Kochut, 2011; Hu et al., 2015; Liu, Liang & Zhang, 2016; McClendon et al., 2014; McSkimming et al., 2016, 2014, 2015; Meharena et al., 2013; Mohanty et al., 2016; Nguyen et al., 2015; Oruganty & Kannan, 2013; Ruan & Kannan, 2015; Simonetti et al., 2014; Taylor et al., 2015; ManChon et al., 2014; Vazquez et al., 2016). For example, through iterative ProKinO queries and follow-up experimental studies, we identified oncogenic mutations associated with abnormal protein kinase activation and drug sensitivity (Lubner et al., 2017; McSkimming et al., 2016, 2015; Mohanty et al., 2016; Patani et al., 2016; Ruan & Kannan, 2015; Ruan, Katiyar & Kannan, 2017). We have also employed federated queries linking ProKinO with other widely used ontologies and resources such as the Protein Ontology (PRO), neXtProt, Reactome, and the Mouse Genome Informatics (MGI) to prioritize understudied dark kinases for functional studies and generate testable hypotheses regarding post-translational modification and cancer mutations (Huang et al., 2018).

Figure 2. Subset of the updated ProKinO schema with new classes and relationships.

Figure 2

The full schema can be accessed at http://prokino.uga.edu/. New classes are colored in cyan and pre-existing classes are colored in pink. Black arrows indicate new relationships introduced to connect the new classes.

While our preliminary studies have demonstrated the utility of ProKinO in hypothesis generation and knowledge discovery, to fully realize the impact of ProKinO in drug discovery and dark kinome mining, the ontology and the associated analytics tools need to be further developed to expand its scope and usability. For example, mutations at specific functional regions of the protein kinase domain, such as the gatekeeper and activation segments, are known to impact drug binding efficacies (Gajiwala et al., 2009; Yun et al., 2008). Likewise, kinase mRNA expression profiles strongly correlate with drug response (Benhar, Engelberg & Levitzki, 2002; Duncan et al., 2012; Kim, Song & Haura, 2009; Niepel et al., 2017). Thus, integrative mining of disease mutations with drug sensitivity profiles and expression patterns can provide new hypotheses/data for the development and administration of combinatorial drugs where multiple mutated kinases in distinct pathways can be targeted for drug repurposing (Erika et al., 2016; Li & Jones, 2012), as demonstrated by the repurposing of Gleevec for targeting c-kit kinase in Gastrointestinal tumors (Joensuu et al., 2001). Furthermore, the recent generation of structural models of various dark kinases using AlphaFold (Jumper et al., 2021) provides a new framework for generating new hypotheses by interactive mining and visualization of sequence annotations in the context of 3D models. However, the lack of interactive visualization tools to overlay sequence and functional annotations in 3D structural models presents a bottleneck in the effective use of AlphaFold models for function prediction. To address this and other challenges described above related to dark kinase mining and annotation, we have expanded ProKinO by including kinase expression data, as well as a variety of data related to ligand-motif interaction, and ligand response prediction (Huang et al., 2021). We have also significantly revamped the ProKinO browser through incorporation of new visualization tools for the interactive mining of sequence annotations in the context of experimentally determined 3D structures and AlphaFold models. We demonstrate the application of these new tools in dark kinase annotation and mining using the understudied p21-activated protein kinase 5, as an example. The updated ontology and browser provide a valuable resource for mining, visualizing, and annotating the dark kinome and pseudokinome.

Materials and Methods

Data sources

The ProKinO ontology includes data obtained from curated internal sources as well as external sources. Information from internal sources include annotations of kinase sequence and structural motifs retrieved from curated multiple sequence alignments. External sources are used for information related to kinase sequence and classification (KinBase & UniProt) (Bairoch et al., 2005; Manning et al., 2002) cancer mutations (COSMIC) (Tate et al., 2018), pathways (Reactome) (Croft et al., 2011) and three dimernsioanl structure (PDB) (Berman et al., 2000). The ontology is populated and updated on a regular basis using protocols described in previous studies (Gosal, Kochut & Kannan, 2011; McSkimming et al., 2016). Here, we describe further enhancements and additions to ProKinO through integration of data on kinase expression patterns and drug interactions, as described below. In a seperate significant project, we have identified and classified nearly 30,000 pseudokinases spanning over 1,300 organisms (Kwon et al., 2019). The schematic representation of the classification of kinases into groups, families, and subfamilies was already in place (Hanks & Hunter, 1995; Manning et al., 2002). Consequently, the addition of the pseudokinases and their classification was relatively simple. However, it significantly enhanced ProKinO as a comprehensive knowledge graph representing kinase-related data. The definition and nomenclature of several kinome-wide conserved motifs were standardized based on several previously published studies which describe the kinase structural features such as subdomains (Hanks & Hunter, 1995), regulatory spine/shell (Meharena et al., 2013), and catalytic spine (Hu et al., 2015). A subset of redundant or family-specific motifs were removed in the updated ontology and motif information on some of the atypical kinases such as ALPHAK2 is not included as they cannot be reliably aligned with canonical protein kinases. Portions of the text were previously published as a part of a preprint (Soleymani et al., 2022).

Ligand interactions

Information on kinase-ligand interactions were retrieved from the Kinase-Ligand Interaction Fingerprints and Structures (KLIFS) database (Kanev et al., 2021). The KLIFS database stores detailed drug-protein kinase interaction information derived from diverse (>2,900) structures of catalytic domains of human and mouse protein kinases deposited in the Protein Data Bank. In addition, KLIFS provides an application programming interface (API) for programmatic access to data related to chemicals and structural chemogenomics (Kanev et al., 2021). However, it lacks information regarding kinase pathways or diseases which prevents the user from investigating the effect of drug-mutant protein binding on downstream pathways or diseases. KLIFS annotations, which report PDB residue positions, were converted to UniProt residue numbering using PDBrenum (Faezov & Dunbrack, 2021), then converted to prototypic Protein Kinase A (PKA) numbering using Multiply Aligned Profiles for Global Alignment of Protein Sequences (MAPGAPS) (Neuwald, 2009). Entries that could not be mapped or did not map to the kinase domain were filtered out.

Ligand responses

We have also incorporated information on kinase drug sensitivity profile in the updated ProKinO. In particular, we retrieved drug dose response data for kinase-relevant ligands/drugs from the Genomics of Drug Sensitivity in Cancer (GDSC) (Yang et al., 2013). Kinase-relevant ligands are defined based on our previous study (Huang et al., 2020), which collected 143 small-molecule protein kinase inhibitors from GDSC based on four drug-target databases: DrugBank (Wishart et al., 2018), Therapeutic Target Database (Li et al., 2018), Pharos (Nguyen et al., 2017), and the Library of Integrated Network-Based Cellular Signatures (LINCS) Data Portal (Koleti et al., 2018). GDSC provides the half-maximal inhibitory concentration values (IC50) of these 143 ligands in 988 cancer cell lines.

Ligand activities

Ligand activities were retrieved from Pharos, a flagship resource (Nguyen et al., 2017) of the National Institutes of Health (NIH) Illuminating the Druggable Genome (IDG) program that includes data on small molecules, including approved drug data and bioassay data. Based on the protein classification (Lin et al., 2017), the drug targets in Pharos include kinases, ion channels, G-protein coupled receptors (GPCRs), and others. In this phase of the project, we decided to include the data relevant to ligand binding in kinases. Pharos integrates drug-target relationships from several resources, such as ChEMBL (Bühlmann & Reymond, 2020) and DrugCentral (Avram et al., 2021).

Expression data

An important part of our recent additions was kinase expression data. Genomic expression data (protein, RNA), as well as transcription factors and epigenomic associations, are among many facets of the data included in Pharos. Furthermore, the GDSC repository contains gene expression data (Affymetrix Human Genome U219 Array), as well. Additionally, COSMIC’s Cell Lines Project includes a significant amount of gene expression data, including kinase expression.

Dark kinases

Dark kinases were labeled based on the information from Dark Kinase Knowledgebase (Berginski et al., 2021).

Protein kinase knowledge graph: schema and data organization

The ProKinO ontology consists of classes, sub-classes, class types, relationships, relationship types, and constraints of protein kinase and related data (Fig. 2). The hierarchy connects all classes to the root, which is ProKinOEntity. Moreover, the schema defines types and constraints for the relationships. With such explicit and constrained schema, composing queries is more intuitive than conventional relational databases. In particular, to enable integrative mining of dark kinase expression data in the context of kinase sequence and structural features, we have introduced three new classes in ProKinO, the Ligand class (including its name, source, and chemical structure) and the following three related classes: (1) LigandInteraction, placed between the Ligand and (already existing) Motif classes to capture kinase-ligand binding and selectivity at the motif and residue level; (2) LigandActivity, placed between the Ligand and (already existing) Protein classes to represent kinases targeted by ligands (and drugs); and (3) LigandResponse, located between the Ligand and (already existing) Sample classes and representing ligand (and drug) sensitivity in kinases. To capture kinase expression, we added the GeneExpression relationship linking the Protein and Sample classes. The outline of the recently added classes and their relationships in ProKinO is illustrated as a UML class diagram, shown in Fig. 2.

ProKinO population

The ProKinO knowledge graph is automatically populated from several external and local data sources at regular intervals, as originally described (Gosal, Kochut & Kannan, 2011), ProKinO schema and the associated knowledge graph population software are routinely updated to incorporate additional sources of data such as pseudokinase and “dark” kinase classification and incorporating information on ligand interactions, ligand responses, ligand activities, kinase expression and associated object and datatype properties. We have been using the Protégé ontology editor for the schema creation and its subsequent modifications. The organization of the schema after these modifications is available at https://prokino.uga.edu/about.

The population software has been coded in Java and uses the Jena Framework. The population process is performed in several steps to add instances, their properties, and a combination of reading the prepared data from CSV, RDF, XML, and other file formats and accessing many remote data sources using their provided API (for example, Reactome’s REST API). Entity interconnections across data retrieved from different data sources are accomplished using UniProt identifiers, kinase names, and other accession identifiers. We modified the population software to create instances and properties for the newly added classes and relationships.

More specifically, using the KLIFS API, we retrieved the relevant kinases, ligands, and residue-level interaction data. The data was retrieved and then processed by custom Perl scripts. ProKinO ontology schema was modified, and ligands were included as new data, while interaction data (motifs) were either reconciled with the motifs already present in ProKinO or added as new, if not already there.

Similarly, the ligand response data was retrieved from GDSC and then processed by custom Perl scripts to create suitable CSV files. Additional ligands were included as new data, while the response data and the relevant samples were either reconciled with the samples already present in ProKinO or added as new, if not already there.

In order to populate the data on ligand activities, we retrieved from Pharos kinase-relevant ligands, as well as their binding data on targeted kinases, for example, IC50 values. This data was retrieved and then processed by custom Perl scripts to produce the necessary CSV files. Additional ligands, not included in the KLIFS dataset, were included as new data. All kinases targeted by ligands were already present in ProKinO, so they were reused in this step.

Data on kinase expression was first retrieved from Pharos, COSMIC, and GDSC. As before, the relevant kinases were already present in the ProKinO knowledge graph. The expression data was stored as individuals in the Expression class. Some of the relevant data about samples were already present in ProKinO, as we already had a significant amount of sample data from COSMIC. Additional samples were included as new data.

We reviewed and updated all the motifs already present in ProKinO. Furthermore, we updated the motif naming in cases where there were differences with the standard motif names.

Finally, we assembled an up-to-date list of dark kinases (Berginski et al., 2021) and added a Boolean datatype property, isDarkKinase, to identify them among all other kinases in the ProKinO knowledge graph.

Results/discussion

The expanded ontology and its knowledge graph provide a wealth of data unifying the information available on both well-studied (light) kinases and understudied (dark) kinases that serve as a unified resource for mining the kinome. The current version of ProKinO (version 65), includes 842 classes, 31 objects and 67 data properties, and over seven million individuals (knowledge graph nodes). ProKinO contains information on 153 dark kinases. A total of 137 dark kinases have information on structural motifs, 148 have disease mutations mapped to the kinase domain, 45 dark kinases have pathway information, and 26 are associated with specific reactions, as defined in Reactome.

Users can navigate the ontology using the ontology browser by searching for a specific kinase of interest or by performing aggregate SPARQL queries linking multiple forms of data. Currently 35 pre-written queries linking different data types can be executed using the ProKinO browser (http://prokino.uga.edu/queries). A user can also download the ontology or browse data based on organisms, functional domains, diseases, or kinase domain evolutionary hierarchy. Below, we focus on the application of complex SPARQL queries and the ProtVista visualization tools for the illumination of understudied dark kinases.

Mutation and expression of understudied PAK5 in human cancers

One possible way to prioritize dark kinases for functional studies is to ask the question, “which dark kinases are most mutated in human diseases, such as cancers?”. Typically answering this question would require collating and post-processing data from multiple resources such as COSMIC, Pharos, and the Dark Kinase Knowledgebase. However, with the updated Protein Kinase Ontology, these questions can be quickly answered using SPARQL. Having the “isDarkKinase” property within the Protein class and the RDF triples connecting the “Mutation”, “Sample” and “Sequence” classes, one can formulate aggregate queries requesting all dark kinases mutated in cancer samples. To avoid biases introduced by the length of protein/gene sequences (longer proteins tend to have more mutations), the query can be modified to normalize mutation counts by sequence length. Executing this modified query (Query 27, available at http://prokino.uga.edu/queries) displays the rank-ordered list of dark kinases based on mutational density. The top ten dark kinases with the highest mutational density are shown in Fig. 3A. Notably, the p21 activated kinase 5 (PAK5) is at the top of the list with a mutational density of 1.917, followed by CRK7 (1.054), PKACG (1.011), PSKH2 (1.01) TSSK1 (1.008), CK1A2 (0.991), ERK4 (0.966), DCLK3 (0.912), PKCT (0.894) and PAK3 (0.859). Having identified PAK5 as the most frequently mutated dark kinase in cancers, one can further query the ontology to explore the role of this kinase in various cancers. With the addition of the new “GeneExpression” class in ProKinO and the RDF triples connecting gene expression to the “Sample” and “Protein” classes (GeneExpression:InSample: Sample; GeneExpression:hasProtein: Protein), one can formulate queries for PAK5 expression in different samples (Fig. 3B). Rank ordering the samples based on PAK5 expression (Query 33) reveals cancer types such as adenocarcinoma (Zscore: 4,701.5) and hepatocellular carcinoma (Zscore: 2,038.3) that have previously been associated with abnormal PAK5 expression (Fang et al., 2014; Han et al., 2018; Huo et al., 2019; Zhang et al., 2017). However, the role of PAK5 in other cancer types such as acute myeloid leukemia (Zscore: 136.5) is relatively underappreciated (Quan et al., 2020). The identification of new cancer sub-types with dark kinase expression and regulation further exemplifies the use of ProKinO in knowledge discovery.

Figure 3. SPARQL query results for Query 27 and 33.

Figure 3

(A) Output of Query 27 requesting top 10 dark kinases with most mutations in different cancer types. The mutation counts are normalized by sequence length. (B) Output of Query 33 listing samples with abnormal PAK5 expression. The query also lists histology, cancer subtypes, regulation, and Z-score. Only a subset of the query results is shown because of space constraints.

Mutational hotspots in the activation loop of PAK5

Because ProKinO encodes a wealth of information on the structural and regulatory properties of multiple kinases, it can be used to generate mechanistic predictions on cancer mutation impact. We demonstrate this for the PAK kinases by asking the question “where are PAK5 mutations located in the protein kinase domain?” Using the RDF triples connecting the “Mutation”, “Motif” and “Sequence” classes (“Mutation: LocatedIn: Motif”; “Mutation:InSequence: Sequence”), one can formulate a query (Query 28) listing mutations in different structural regions/motifs of the PAK5 kinase domain. Examination of the query results reveals that the C-terminal substrate binding lobe (C-lobe) is more frequently mutated (320 mutations) relative to the N-terminal ATP binding lobe (N-lobe: 173 mutations) (Fig. 4A). Within the C-lobe, nearly 78 mutations map to the activation loop, which is known to play a critical role in substrate recognition and activation in a diverse array of kinases (Huse & Kuriyan, 2002; Kornev & Taylor, 2015; Oruganty & Kannan, 2012). Despite the prevalence of activation loop mutations in PAK5, there is currently no information on how these mutations impact PAK5 kinase structure and function. Nonetheless, based on the evolutionary relationships captured in ProKinO (based on the alignment of human kinases to the prototypic protein kinase A), one can formulate queries mapping mutations to specific aligned positions in the shared protein kinase domain. A query listing (wild type) WT type and mutant type residues in the activation loop of PAK5 and the equivalent aligned residue positions in PKA (Query 29) provides additional context for these mutations. For example, two distinct mutations map to residue P602PAK5 in the activation loop of PAK5 that structurally corresponds to a phosphorylatable residue, T197PKA, in PKA (Yonemoto et al., 1993). Having this context provides a testable hypothesis that S602 mutations in PAK5 impact kinase phosphorylation and regulation. Likewise, WT residue P607PAK5 is mutated in four distinct cancer samples and this position is equivalent to PKA residue P202PKA, which configures the activation loop for substrate recognition (Knighton et al., 1991). Thus, mutation of this critical residue is expected to impact substrate binding and activation loop phosphorylation in PAK5. Additional insights into these mutations can also be obtained by visualizing these residues in the context of the PAK5 AlphaFold models using the ProtVista viewer described below.

Figure 4. SPARQL query results for Query 28 and 29.

Figure 4

(A) Output of Query 28 listing the number of unique cancer-linked mutations at various structural locations of PAK5 kinase. (B) Output of Query 29 listing unique point mutations in the activation loop of PAK5 kinase. The query also lists the equivalent PKA position, disease type, primary site of the tissue sample, equivalent residue for the PKA positioning of PKA, and subtype of the tissue sample. Entries containing only one mutation per position were filtered from the original query. Only a subset of the query results is shown.

Insights into PAK5 ligand binding sites

With the conceptualization of new information related to kinase ligands, their mode of action and interaction with specific motifs in the kinase domain, new aggregate queries linking mutated kinases to drug sensitivity profiles, mode of action, and ligand binding sites can be performed using the updated ProKinO. For example, queries such as “list proteins and drugs or ligands interacting with the protein’s gatekeeper residue (GK.45)” (Query 31) and “list ligands targeting the Epidermal Growth Factor Receptor (EGFR) kinase and their mode of action” (Query 34) can be rapidly performed using the updated ProKinO ontology. We demonstrate the application of these new additions in the context of PAK5 by asking the question “what are the drugs targeting PAK family (PAK1-6) kinases?” Query 30 answers this question using the RDF triples connecting the “Ligand”, “Motif” and “Protein” classes (list triples) (Fig. 5). Examination of the query results indicates multiple drugs targeting PAK family kinases, including STAUROSPORINE and N2-((1R-2S)-2-AMINOCYCLOHEXYL) that bind to structurally equivalent residues/motifs in the ligand binding pocket of PAK4 and PAK5, respectively. The ligand binding sites, and associated interactions can also be visualized using the ProtVista viewer described below. Additional queries linking dark kinases to drug sensitivities, structural motifs, and pathways are listed on the ProKinO website at https://prokino.uga.edu/queries.

Figure 5. SPARQL query results for Query 30.

Figure 5

Output of Query 30 listing ligands interactions with each PAK family member (PAK1-6). It also includes motif names and positions of full sequence and PKA positioning. The output of Query 30 was rearranged to highlight the homology of PAK4 and PAK5 motif/ligand interactions and the figure highlights only a subset of the query results. Run SPARQL query for full results.

Visualization tools for dark kinase annotation and mining

To provide structural context for cancer mutations and to enable interactive mining of dark kinase sequence annotations in the context of 3D structures and predicted models from AlphaFold (Jumper et al., 2021; Tunyasuvunakool et al., 2021), we developed and incorporated a modified version of the ProtVista viewer in ProKinO. The viewer can be deployed for any protein kinase of interest by navigating to the Structure tab in the protein summary page and selecting either a PDB structure or AlphaFold model of interest. A snapshot of the ProtVista viewer displaying the AlphaFold model of PAK5 kinase is shown in Fig. 6. The ProtVista viewer uses an enhanced version of the Mol* viewer and the PDB web component (Watkins et al., 2017) to provide two-way interactive navigation between the 3D structure (Fig. 6A, top panel) and annotation viewer (Fig. 6A, bottom panel).

Figure 6. ProtVista viewer.

Figure 6

(A) AlphaFold2 model of PAK5 kinase is shown in the structure viewer (top panel). Sequence viewer with annotations are shown in the bottom panel. (B and C) Zoomed in view of structural interactions associated with S602 and P607 in PAK5 activation loop.

The annotation viewer consists of multiple tracks populated dynamically based on data from ProKinO and external sources such as UniProt. In addition, prediction confidence scores for AlphaFold models are displayed in the annotation viewer along with additional annotations such as conserved sequence motifs, subdomains, and structural motifs involved in kinase regulation. The annotation viewer also shows other annotations from external sources such as ligand binding sites and predicted functional sites. Users can hover over the residues on the 3D structure viewer to view the equivalent information on the annotation viewer and vice versa. For example, selecting the “activation loop” in the annotation viewer highlights the corresponding structural region in the AlphaFold model of PAK5 (Fig. 6A). Likewise, the selection of residues in the activation loop (S602 and P607) in the structure viewer highlights the annotations associated with these and interacting residues in the sequence viewer. Such interactive mining is expected to accelerate the functional characterization of dark kinases and provide new insights into disease mutations. For example, visualizing the interactions associated with S602 in the activation loop of PAK5 (Fig. 6B) indicates a hydrogen bonding interaction with R567, which is part of the conserved HRD motif (sequence annotation). Because the HRD-Arg is known to play a role in kinase regulation by stabilizing activation loop conformation (Huse & Kuriyan, 2002), it provides additional context for predicting the impact of S602 altering mutations. Likewise, examining the structural and sequence context of P607 interacting residues provides new insights into how the alteration of this residue might impact substrate binding and kinase regulation. Together, these examples highlight the value added by the ProtVista viewer in the visualization and annotation of mutations in dark kinases.

Conclusions

This work presents an updated version of the Protein Kinase Otology (ProKinO) for mining and annotating dark kinases. ProKinO was developed following FAIR (Findable, Accessible, Interoperable, and Reusable) principles (Wilkinson et al., 2016) and serves as an integrated knowledge graph for relating and conceptualizing diverse forms of disparate data related to protein kinase sequence, structure, function, regulation, and disease (cancer). We present a new ontology browser for navigating these data and demonstrate the application of aggregate SPARQL queries in uncovering new testable hypotheses regarding understudied kinase members. We also provide several pre-written SPARQL queries that can rapidly retrieve information related to protein kinase mutations, pathways, expression, and ligand binding sites. However, writing new queries requires prior knowledge of the ontology schema and the SPARQL query language, which most bench biologists may not have. To alleviate this challenge, we are currently building a graphical SPARQL query interface, which will intuitively enable query formulation through the navigation of the knowledge graph schema. We are also exploring the application of ProKinO for machine learning-based knowledge discovery and hypotheses generation.

Acknowledgments

We acknowledge members of the Kannan lab for their valuable comments and suggestions. We additionally acknowledge the various contributions to the databases we utilized made through the efforts of the IDG consortium and numerous investigator-initiated efforts.

Funding Statement

This work was supported by the National Institutes of Health (U01CA239106 and R35 GM139656) to Natarajan Kannan. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Contributor Information

Krzysztof J. Kochut, Email: kkochut@uga.edu.

Natarajan Kannan, Email: nkannan@uga.edu.

Additional Information and Declarations

Competing Interests

Natarajan Kannan is an Academic Editor for PeerJ.

Author Contributions

Saber Soleymani conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, updated the browser and ontology, and approved the final draft.

Nathan Gravel conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Liang-Chin Huang performed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Wayland Yeung analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Elika Bozorgi analyzed the data, authored or reviewed drafts of the article, updated the browser and ontology, and approved the final draft.

Nathaniel G. Bendzunas analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Krzysztof J. Kochut conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, updated the browser and ontology, and approved the final draft.

Natarajan Kannan conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The protein kinase ontology (ProKinO)’s current OWL file and previous versions are available at: https://prokino.uga.edu/downloads.html.

Future versions of the ontology also will be placed at the same address.

The ontology browser is available at https://prokino.uga.edu/browser. Users can save the results of queries in diagrams or other formats such as CSV.

References

  • Avram et al. (2021).Avram S, Bologa CG, Holmes J, Bocci G, Wilson TB, Nguyen DT, Curpan R, Halip L, Bora A, Yang JJ, Knockel J, Sirimulla S, Ursu O, Oprea TI. DrugCentral 2021 supports drug discovery and repositioning. Nucleic Acids Research. 2021;49(D1):D1160–D1169. doi: 10.1093/nar/gkaa997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Bailey et al. (2015).Bailey FP, Byrne DP, McSkimming D, Kannan N, Eyers PA. Going for broke: targeting the human cancer pseudokinome. Biochemical Journal. 2015;465(2):195–211. doi: 10.1042/BJ20141060. [DOI] [PubMed] [Google Scholar]
  • Bairoch et al. (2005).Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M. The universal protein resource (UniProt) Nucleic Acids Research. 2005;33(Database issue):D154–D159. doi: 10.1093/nar/gki070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Benhar, Engelberg & Levitzki (2002).Benhar M, Engelberg D, Levitzki A. ROS, stress-activated kinases and stress signaling in cancer. EMBO Reports. 2002;3(5):420–425. doi: 10.1093/embo-reports/kvf094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Berginski et al. (2021).Berginski ME, Moret N, Liu C, Goldfarb D, Sorger PK, Gomez SM. The dark kinase knowledgebase: an online compendium of knowledge and experimental results of understudied kinases. Nucleic Acids Research. 2021;49(D1):D529–D535. doi: 10.1093/nar/gkaa853. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Berman et al. (2000).Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The protein data bank. Nucleic Acids Research. 2000;28(1):235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Bühlmann & Reymond (2020).Bühlmann S, Reymond JL. ChEMBL-likeness score and database GDBChEMBL. Frontiers in Chemistry. 2020;8:46. doi: 10.3389/fchem.2020.00046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Byrne, Foulkes & Eyers (2017).Byrne DP, Foulkes DM, Eyers PA. Pseudokinases: update on their functions and evaluation as new drug targets. Future Medicinal Chemistry. 2017;9(2):245–265. doi: 10.4155/fmc-2016-0207. [DOI] [PubMed] [Google Scholar]
  • Cicenas & Cicenas (2016).Cicenas J, Cicenas E. Multi-kinase inhibitors, AURKs and cancer. Medical Oncology. 2016;33(5):43. doi: 10.1007/s12032-016-0758-4. [DOI] [PubMed] [Google Scholar]
  • Croft et al. (2011).Croft D, O’Kelly G, Wu G, Haw R, Gillespie M, Matthews L, Caudy M, Garapati P, Gopinath G, Jassal B. Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Research. 2011;39(Database):D691–D697. doi: 10.1093/nar/gkq1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Duncan et al. (2012).Duncan JS, Whittle MC, Nakamura K, Abell AN, Midland AA, Zawistowski JS, Johnson NL, Granger DA, Jordan NV, Darr DB, Usary J, Kuan PF, Smalley DM, Major B, He X, Hoadley KA, Zhou B, Sharpless NE, Perou CM, Kim WY, Gomez SM, Chen X, Jin J, Frye SV, Earp HS, Graves LM, Johnson GL. Dynamic reprogramming of the kinome in response to targeted MEK inhibition in triple-negative breast cancer. Cell. 2012;149(2):307–321. doi: 10.1016/j.cell.2012.02.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Erika et al. (2016).Erika G, Federica Z, Martina S, Anselmo P, Luigi R, Marina M, Davide C, Eleonora Z, Monica V, Silverio T. Old tyrosine kinase inhibitors and newcomers in gastrointestinal cancer treatment. Current Cancer Drug Targets. 2016;16(2):175–185. doi: 10.2174/1568009615666150817120712. [DOI] [PubMed] [Google Scholar]
  • Eyers, Keeshan & Kannan (2017).Eyers PA, Keeshan K, Kannan N. Tribbles in the 21st century: the evolving roles of tribbles pseudokinases in biology and disease. Trends in Cell Biology. 2017;27(4):284–298. doi: 10.1016/j.tcb.2016.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Eyers & Murphy (2013).Eyers PA, Murphy JM. Dawn of the dead: protein pseudokinases signal new adventures in cell biology. Biochemical Society Transactions. 2013;41(4):969–974. doi: 10.1042/BST20130115. [DOI] [PubMed] [Google Scholar]
  • Faezov & Dunbrack (2021).Faezov B, Dunbrack RL., Jr PDBrenum: a webserver and program providing protein data bank files renumbered according to their UniProt sequences. PLOS ONE. 2021;16(7):e0253411. doi: 10.1371/journal.pone.0253411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Fang et al. (2014).Fang ZP, Jiang BG, Gu XF, Zhao B, Ge RL, Zhang FB. P21-activated kinase 5 plays essential roles in the proliferation and tumorigenicity of human hepatocellular carcinoma. Acta Pharmacologica Sinica. 2014;35(1):82–88. doi: 10.1038/aps.2013.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ferguson & Gray (2018).Ferguson FM, Gray NS. Kinase inhibitors: the road ahead. Nature Reviews Drug Discovery. 2018;17(5):353–377. doi: 10.1038/nrd.2018.21. [DOI] [PubMed] [Google Scholar]
  • Foulkes et al. (2018).Foulkes DM, Byrne DP, Yeung W, Shrestha S, Bailey FP, Ferries S, Eyers CE, Keeshan K, Wells C, Drewry DH. Covalent inhibitors of EGFR family protein kinases induce degradation of human Tribbles 2 (TRIB2) pseudokinase in cancer cells. Science Signaling. 2018;11(549):eaat7951. doi: 10.1126/scisignal.aat7951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gajiwala et al. (2009).Gajiwala KS, Wu JC, Christensen J, Deshmukh GD, Diehl W, DiNitto JP, English JM, Greig MJ, He Y-A, Jacques SL. KIT kinase mutants show unique mechanisms of drug resistance to imatinib and sunitinib in gastrointestinal stromal tumor patients. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(5):1542–1547. doi: 10.1073/pnas.0812413106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Goldberg et al. (2013).Goldberg JM, Griggs AD, Smith JL, Haas BJ, Wortman JR, Zeng Q. Kinannote, a computer program to identify and classify members of the eukaryotic protein kinase superfamily. Bioinformatics. 2013;29(19):2387–2394. doi: 10.1093/bioinformatics/btt419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gosal, Kannan & Kochut (2011).Gosal G, Kannan N, Kochut K. ProKinO: a framework for protein kinase ontology. Proceedings of the IEEE International Conference on Bioinformatics & Biomedicine, Atlanta, Georgia; Piscataway: IEEE; 2011. pp. 550–555. [Google Scholar]
  • Gosal, Kochut & Kannan (2011).Gosal G, Kochut KJ, Kannan N. ProKinO: an ontology for integrative analysis of protein kinases in cancer. PLOS ONE. 2011;6(12):e28782. doi: 10.1371/journal.pone.0028782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Han et al. (2018).Han K, Zhou Y, Tseng KF, Hu H, Li K, Wang Y, Gan Z, Lin S, Sun Y, Min D. PAK5 overexpression is associated with lung metastasis in osteosarcoma. Oncology Letters. 2018;15:2202–2210. doi: 10.3892/ol.2017.7545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Hanks & Hunter (1995).Hanks SK, Hunter T. Protein kinases 6. The eukaryotic protein kinase superfamily: kinase (catalytic) domain structure and classification. FASEB Journal. 1995;9:576–596. doi: 10.1096/fasebj.9.8.7768349. [DOI] [PubMed] [Google Scholar]
  • Hu et al. (2015).Hu J, Ahuja LG, Meharena HS, Kannan N, Kornev AP, Taylor SS, Shaw AS. Kinase regulation by hydrophobic spine assembly in cancer. Molecular and Cellular Biology. 2015;35(1):264–276. doi: 10.1128/MCB.00943-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Huang et al. (2018).Huang LC, Ross KE, Baffi TR, Drabkin H, Kochut KJ, Ruan Z, D’Eustachio P, McSkimming D, Arighi C, Chen C, Natale DA, Smith C, Gaudet P, Newton AC, Wu C, Kannan N. Integrative annotation and knowledge discovery of kinase post-translational modifications and cancer-associated mutations through federated protein ontologies and resources. Scientific Reports. 2018;8(1):6518. doi: 10.1038/s41598-018-24457-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Huang et al. (2021).Huang LC, Taujale R, Gravel N, Venkat A, Yeung W, Byrne DP, Eyers PA, Kannan N. KinOrtho: a method for mapping human kinase orthologs across the tree of life and illuminating understudied kinases. BMC Bioinformatics. 2021;22(1):446. doi: 10.1186/s12859-021-04358-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Huang et al. (2020).Huang LC, Yeung W, Wang Y, Cheng H, Venkat A, Li S, Ma P, Rasheed K, Kannan N. Quantitative structure-mutation-activity relationship tests (QSMART) model for protein kinase inhibitor response prediction. BMC Bioinformatics. 2020;21(1):520. doi: 10.1186/s12859-020-03842-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Huo et al. (2019).Huo FC, Pan YJ, Li TT, Mou J, Pei DS. PAK5 promotes the migration and invasion of cervical cancer cells by phosphorylating SATB1. Cell Death and Differentiation. 2019;26(6):994–1006. doi: 10.1038/s41418-018-0178-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Huse & Kuriyan (2002).Huse M, Kuriyan J. The conformational plasticity of protein kinases. Cell. 2002;109(3):275–282. doi: 10.1016/S0092-8674(02)00741-9. [DOI] [PubMed] [Google Scholar]
  • Joensuu et al. (2001).Joensuu H, Roberts PJ, Sarlomo-Rikala M, Andersson LC, Tervahartiala P, Tuveson D, Silberman S, Capdeville R, Dimitrijevic S, Druker B, Demetri GD. Effect of the tyrosine kinase inhibitor STI571 in a patient with a metastatic gastrointestinal stromal tumor. New England Journal of Medicine. 2001;344(14):1052–1056. doi: 10.1056/NEJM200104053441404. [DOI] [PubMed] [Google Scholar]
  • Jumper et al. (2021).Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, Bridgland A, Meyer C, Kohl SAA, Ballard AJ, Cowie A, Romera-Paredes B, Nikolov S, Jain R, Adler J, Back T, Petersen S, Reiman D, Clancy E, Zielinski M, Steinegger M, Pacholska M, Berghammer T, Bodenstein S, Silver D, Vinyals O, Senior AW, Kavukcuoglu K, Kohli P, Hassabis D. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kanev et al. (2021).Kanev GK, de Graaf C, Westerman BA, de Esch IJP, Kooistra AJ. KLIFS: an overhaul after the first 5 years of supporting kinase research. Nucleic Acids Research. 2021;49(D1):D562–D569. doi: 10.1093/nar/gkaa895. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kim, Song & Haura (2009).Kim LC, Song L, Haura EB. Src kinases as therapeutic targets for cancer. Nature Reviews Clinical Oncology. 2009;6(10):587–595. doi: 10.1038/nrclinonc.2009.129. [DOI] [PubMed] [Google Scholar]
  • Knighton et al. (1991).Knighton DR, Zheng JH, Ten Eyck LF, Ashford VA, Xuong NH, Taylor SS, Sowadski JM. Crystal structure of the catalytic subunit of cyclic adenosine monophosphate-dependent protein kinase. Science. 1991;253(5018):407–414. doi: 10.1126/science.1862342. [DOI] [PubMed] [Google Scholar]
  • Koleti et al. (2018).Koleti A, Terryn R, Stathias V, Chung C, Cooper DJ, Turner JP, Vidovic D, Forlin M, Kelley TT, D’Urso A, Allen BK, Torre D, Jagodnik KM, Wang L, Jenkins SL, Mader C, Niu W, Fazel M, Mahi N, Pilarczyk M, Clark N, Shamsaei B, Meller J, Vasiliauskas J, Reichard J, Medvedovic M, Ma’ayan A, Pillai A, Schürer SC. Data portal for the library of integrated network-based cellular signatures (LINCS) program: integrated access to diverse large-scale cellular perturbation response data. Nucleic Acids Research. 2018;46(D1):D558–D566. doi: 10.1093/nar/gkx1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kornev & Taylor (2015).Kornev AP, Taylor SS. Dynamics-driven allostery in protein kinases. Trends in Biochemical Sciences. 2015;40(11):628–647. doi: 10.1016/j.tibs.2015.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kwon et al. (2019).Kwon A, Scott S, Taujale R, Yeung W, Kochut KJ, Eyers PA, Kannan N. Tracing the origin and evolution of pseudokinases across the tree of life. Science Signaling. 2019;12(578):289. doi: 10.1126/scisignal.aav3810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Li & Jones (2012).Li YY, Jones SJ. Drug repositioning for personalized medicine. Genome Medicine. 2012;4(3):27. doi: 10.1186/gm326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Li et al. (2018).Li YH, Yu CY, Li XX, Zhang P, Tang J, Yang Q, Fu T, Zhang X, Cui X, Tu G, Zhang Y, Li S, Yang F, Sun Q, Qin C, Zeng X, Chen Z, Chen YZ, Zhu F. Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics. Nucleic Acids Research. 2018;46(D1):D1121–D1127. doi: 10.1093/nar/gkx1076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lin et al. (2017).Lin Y, Mehta S, Küçük-McGinty H, Turner JP, Vidovic D, Forlin M, Koleti A, Nguyen DT, Jensen LJ, Guha R, Mathias SL, Ursu O, Stathias V, Duan J, Nabizadeh N, Chung C, Mader C, Visser U, Yang JJ, Bologa CG, Oprea TI, Schürer SC. Drug target ontology to classify and integrate drug discovery data. Journal of Biomedical Semantics. 2017;8(1):50. doi: 10.1186/s13326-017-0161-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Liu, Liang & Zhang (2016).Liu D, Liang XC, Zhang H. Culturing Schwann cells from neonatal rats by improved enzyme digestion combined with explants-culture method. Zhongguo Yi Xue Ke Xue Yuan Xue Bao. 2016;38:388–392. doi: 10.3881/j.issn.1000-503X.2016.04.004. [DOI] [PubMed] [Google Scholar]
  • Lubner et al. (2017).Lubner JM, Dodge-Kafka KL, Carlson CR, Church GM, Chou MF, Schwartz D. Cushing’s syndrome mutant PKA(L)(205R) exhibits altered substrate specificity. FEBS Letters. 2017;591(3):459–467. doi: 10.1002/1873-3468.12562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • ManChon et al. (2014).ManChon U, Talevich E, Katiyar S, Rasheed K, Kannan N. Prediction and prioritization of rare oncogenic mutations in the cancer kinome using novel features and multiple classifiers. PLOS Computational Biology. 2014;10(4):e1003545. doi: 10.1371/journal.pcbi.1003545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Manning et al. (2002).Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S. The protein kinase complement of the human genome. Science. 2002;298(5600):1912–1934. doi: 10.1126/science.1075762. [DOI] [PubMed] [Google Scholar]
  • McClendon et al. (2014).McClendon CL, Kornev AP, Gilson MK, Taylor SS. Dynamic architecture of a protein kinase. Proceedings of the National Academy of Sciences of the United States of America. 2014;111(43):E4623–E4631. doi: 10.1073/pnas.1418402111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • McSkimming et al. (2016).McSkimming DI, Dastgheib S, Baffi TR, Byrne DP, Ferries S, Scott ST, Newton AC, Eyers CE, Kochut KJ, Eyers PA, Kannan N. KinView: a visual comparative sequence analysis tool for integrated kinome research. Molecular Biosystems. 2016;12(12):3651–3665. doi: 10.1039/C6MB00466K. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • McSkimming et al. (2014).McSkimming DI, Dastgheib S, Talevich E, Narayanan A, Katiyar S, Taylor SS, Kochut K, Kannan N. ProKinO: a unified resource for mining the cancer kinome. Human Mutation. 2014;36(2):175–186. doi: 10.1002/humu.22726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • McSkimming et al. (2015).McSkimming DI, Dastgheib S, Talevich E, Narayanan A, Katiyar S, Taylor SS, Kochut K, Kannan N. ProKinO: a unified resource for mining the cancer kinome. Human Mutation. 2015;36(2):175–186. doi: 10.1002/humu.22726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Meharena et al. (2013).Meharena HS, Chang P, Keshwani MM, Oruganty K, Nene AK, Kannan N, Taylor SS, Kornev AP. Deciphering the structural basis of eukaryotic protein kinase regulation. PLOS Biology. 2013;11(10):e1001680. doi: 10.1371/journal.pbio.1001680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Mohanty et al. (2016).Mohanty S, Oruganty K, Kwon A, Byrne DP, Ferries S, Ruan Z, Hanold LE, Katiyar S, Kennedy EJ, Eyers PA, Kannan N. Hydrophobic core variations provide a structural framework for tyrosine kinase evolution and functional specialization. PLOS Genetics. 2016;12(2):e1005885. doi: 10.1371/journal.pgen.1005885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Moret et al. (2021).Moret N, Liu C, Gyori BM, Bachman JA, Steppi A, Hug C, Taujale R, Huang L-C, Berginski ME, Gomez SM. A resource for exploring the understudied human kinome for research and therapeutic opportunities. BioRxiv. 2021 doi: 10.1101/2020.04.02.022277. [DOI] [Google Scholar]
  • Murphy, Mace & Eyers (2017).Murphy JM, Mace PD, Eyers PA. Live and let die: insights into pseudoenzyme mechanisms from structure. Current Opinion in Structural Biology. 2017;47:95–104. doi: 10.1016/j.sbi.2017.07.004. [DOI] [PubMed] [Google Scholar]
  • Neuwald (2009).Neuwald AF. Rapid detection, classification and accurate alignment of up to a million or more related protein sequences. Bioinformatics. 2009;25:1869–1875. doi: 10.1093/bioinformatics/btp342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Nguyen et al. (2017).Nguyen DT, Mathias S, Bologa C, Brunak S, Fernandez N, Gaulton A, Hersey A, Holmes J, Jensen LJ, Karlsson A, Liu G, Ma’ayan A, Mandava G, Mani S, Mehta S, Overington J, Patel J, Rouillard AD, Schurer S, Sheils T, Simeonov A, Sklar LA, Southall N, Ursu O, Vidovic D, Waller A, Yang J, Jadhav A, Oprea TI, Guha R. Pharos: collating protein information to shed light on the druggable genome. Nucleic Acids Research. 2017;45:D995–D1002. doi: 10.1093/nar/gkw1072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Nguyen et al. (2015).Nguyen T, Ruan Z, Oruganty K, Kannan N. Co-conserved MAPK features couple D-domain docking groove to distal allosteric sites via the C-terminal flanking tail. PLOS ONE. 2015;10(3):e0119636. doi: 10.1371/journal.pone.0119636. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Niepel et al. (2017).Niepel M, Hafner M, Duan Q, Wang Z, Paull EO, Chung M, Lu X, Stuart JM, Golub TR, Subramanian A, Ma’ayan A, Sorger PK. Common and cell-type specific responses to anti-cancer drugs revealed by high throughput transcript profiling. Nature Communications. 2017;8(1):1186. doi: 10.1038/s41467-017-01383-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Oruganty & Kannan (2012).Oruganty K, Kannan N. Design principles underpinning the regulatory diversity of protein kinases. Philosophical Transactions of the Royal Society B: Biological Sciences. 2012;367(1602):2529–2539. doi: 10.1098/rstb.2012.0015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Oruganty & Kannan (2013).Oruganty K, Kannan N. Evolutionary variation and adaptation in a conserved protein kinase allosteric network: implications for inhibitor design. Biochimica et Biophysica Acta. 2013;1834(7):1322–1329. doi: 10.1016/j.bbapap.2013.02.040. [DOI] [PubMed] [Google Scholar]
  • Patani et al. (2016).Patani H, Bunney TD, Thiyagarajan N, Norman RA, Ogg D, Breed J, Ashford P, Potterton A, Edwards M, Williams SV, Thomson GS, Pang CS, Knowles MA, Breeze AL, Orengo C, Phillips C, Katan M. Landscape of activating cancer mutations in FGFR kinases and their differential responses to inhibitors in clinical use. Oncotarget. 2016;7(17):24252–24268. doi: 10.18632/oncotarget.8132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Quan et al. (2020).Quan L, Cheng Z, Dai Y, Jiao Y, Shi J, Fu L. Prognostic significance of PAK family kinases in acute myeloid leukemia. Cancer Gene Therapy. 2020;27(1–2):30–37. doi: 10.1038/s41417-019-0090-1. [DOI] [PubMed] [Google Scholar]
  • Ruan & Kannan (2015).Ruan Z, Kannan N. Mechanistic insights into R776H mediated activation of epidermal growth factor receptor kinase. Biochemistry. 2015;54(27):4216–4225. doi: 10.1021/acs.biochem.5b00444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ruan, Katiyar & Kannan (2017).Ruan Z, Katiyar S, Kannan N. Computational and experimental characterization of patient derived mutations reveal an unusual mode of regulatory spine assembly and drug sensitivity in EGFR kinase. Biochemistry. 2017;56(1):22–32. doi: 10.1021/acs.biochem.6b00572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Sheils et al. (2021).Sheils TK, Mathias SL, Kelleher KJ, Siramshetty VB, Nguyen DT, Bologa CG, Jensen LJ, Vidović D, Koleti A, Schürer SC, Waller A, Yang JJ, Holmes J, Bocci G, Southall N, Dharkar P, Mathé E, Simeonov A, Oprea TI. TCRD and Pharos 2021: mining the human proteome for disease biology. Nucleic Acids Research. 2021;49(D1):D1334–D1346. doi: 10.1093/nar/gkaa993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Sheils et al. (2020).Sheils T, Mathias SL, Siramshetty VB, Bocci G, Bologa CG, Yang JJ, Waller A, Southall N, Nguyen DT, Oprea TI. How to illuminate the druggable genome using Pharos. Current Protocols in Bioinformatics. 2020;69(1):e92. doi: 10.1002/cpbi.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Simonetti et al. (2014).Simonetti FL, Tornador C, Nabau-Moreto N, Molina-Vila MA, Marino-Buslje C. Kin-Driver: a database of driver mutations in protein kinases. Database. 2014;2014:bau104. doi: 10.1093/database/bau104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Soleymani et al. (2022).Soleymani S, Gravel N, Huang L-C, Yeung W, Bozorgi E, Bendzunas NG, Kochut KJ, Kannan N. Dark kinase annotation, mining and visualization using the protein kinase ontology. BioRxiv. 2022 doi: 10.1101/2022.02.25.482021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Tate et al. (2018).Tate JG, Bamford S, Jubb HC, Sondka Z, Beare DM, Bindal N, Boutselakis H, Cole CG, Creatore C, Dawson E, Fish P, Harsha B, Hathaway C, Jupe SC, Kok CY, Noble K, Ponting L, Ramshaw CC, Rye CE, Speedy HE, Stefancsik R, Thompson SL, Wang S, Ward S, Campbell PJ, Forbes SA. COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Research. 2018;47(D1):D941–D947. doi: 10.1093/nar/gky1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Taylor et al. (2015).Taylor SS, Shaw AS, Kannan N, Kornev AP. Integration of signaling in the kinome: architecture and regulation of the alphaC Helix. Biochimica et Biophysica Acta. 2015;1854(10):1567–1574. doi: 10.1016/j.bbapap.2015.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Tunyasuvunakool et al. (2021).Tunyasuvunakool K, Adler J, Wu Z, Green T, Zielinski M, Žídek A, Bridgland A, Cowie A, Meyer C, Laydon A, Velankar S, Kleywegt GJ, Bateman A, Evans R, Pritzel A, Figurnov M, Ronneberger O, Bates R, Kohl SAA, Potapenko A, Ballard AJ, Romera-Paredes B, Nikolov S, Jain R, Clancy E, Reiman D, Petersen S, Senior AW, Kavukcuoglu K, Birney E, Kohli P, Jumper J, Hassabis D. Highly accurate protein structure prediction for the human proteome. Nature. 2021;596(7873):590–596. doi: 10.1038/s41586-021-03828-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Vazquez et al. (2016).Vazquez M, Pons T, Brunak S, Valencia A, Izarzugaza JM. wKinMut-2: identification and interpretation of pathogenic variants in human protein kinases. Human Mutation. 2016;37(1):36–42. doi: 10.1002/humu.22914. [DOI] [PubMed] [Google Scholar]
  • Watkins et al. (2017).Watkins X, Garcia LJ, Pundir S, Martin MJ, UniProt Consortium ProtVista: visualization of protein sequence annotations. Bioinformatics. 2017;33:2040–2041. doi: 10.1093/bioinformatics/btx120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wilkinson et al. (2016).Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, Blomberg N, Boiten JW, da Silva Santos LB, Bourne PE, Bouwman J, Brookes AJ, Clark T, Crosas M, Dillo I, Dumon O, Edmunds S, Evelo CT, Finkers R, Gonzalez-Beltran A, Gray AJ, Groth P, Goble C, Grethe JS, Heringa J, t Hoen PA, Hooft R, Kuhn T, Kok R, Kok J, Lusher SJ, Martone ME, Mons A, Packer AL, Persson B, Rocca-Serra P, Roos M, van Schaik R, Sansone SA, Schultes E, Sengstag T, Slater T, Strawn G, Swertz MA, Thompson M, van der Lei J, van Mulligen E, Velterop J, Waagmeester A, Wittenburg P, Wolstencroft K, Zhao J, Mons B. The FAIR guiding principles for scientific data management and stewardship. Scientific Data. 2016;3(1):160018. doi: 10.1038/sdata.2016.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wishart et al. (2018).Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, Sajed T, Johnson D, Li C, Sayeeda Z, Assempour N, Iynkkaran I, Liu Y, Maciejewski A, Gale N, Wilson A, Chin L, Cummings R, Le D, Pon A, Knox C, Wilson M. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Research. 2018;46(D1):D1074–D1082. doi: 10.1093/nar/gkx1037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yang et al. (2013).Yang W, Soares J, Greninger P, Edelman EJ, Lightfoot H, Forbes S, Bindal N, Beare D, Smith JA, Thompson IR, Ramaswamy S, Futreal PA, Haber DA, Stratton MR, Benes C, McDermott U, Garnett MJ. Genomics of drug sensitivity in cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Research. 2013;41(D1):D955–D961. doi: 10.1093/nar/gks1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yonemoto et al. (1993).Yonemoto W, Garrod SM, Bell SM, Taylor SS. Identification of phosphorylation sites in the recombinant catalytic subunit of cAMP-dependent protein kinase. Journal of Biological Chemistry. 1993;268(25):18626–18632. doi: 10.1016/S0021-9258(17)46675-0. [DOI] [PubMed] [Google Scholar]
  • Yun et al. (2008).Yun C-H, Mengwasser KE, Toms AV, Woo MS, Greulich H, Wong K-K, Meyerson M, Eck MJ. The T790M mutation in EGFR kinase causes drug resistance by increasing the affinity for ATP. Proceedings of the National Academy of Sciences of the United States of America. 2008;105(6):2070–2075. doi: 10.1073/pnas.0709662105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhang et al. (2017).Zhang YC, Huo FC, Wei LL, Gong CC, Pan YJ, Mou J, Pei DS. PAK5-mediated phosphorylation and nuclear translocation of NF-κB-p65 promotes breast cancer cell proliferation in vitro and in vivo. Journal of Experimental & Clinical Cancer Research. 2017;36(1):146. doi: 10.1186/s13046-017-0610-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhang, Yang & Gray (2009).Zhang J, Yang PL, Gray NS. Targeting cancer with small molecule kinase inhibitors. Nature Reviews Cancer. 2009;9(1):28–39. doi: 10.1038/nrc2559. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The following information was supplied regarding data availability:

The protein kinase ontology (ProKinO)’s current OWL file and previous versions are available at: https://prokino.uga.edu/downloads.html.

Future versions of the ontology also will be placed at the same address.

The ontology browser is available at https://prokino.uga.edu/browser. Users can save the results of queries in diagrams or other formats such as CSV.


Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES