Abstract
Influenza and COVID-19 are infectious respiratory diseases that represent a major concern to public health with social and economic impact worldwide, for which the available therapeutic options are not satisfactory. The RdRp has a central role in viral replication and thus represents a major target for the development of antiviral approaches. In this study, we focused on Influenza A virus PB1 polymerase protein and the betacoronaviruses nsp12 polymerase protein, considering their functional and structural similarities. We have performed conservation and druggability analysis to map conserved druggable regions, that may have functional or structural importance in these proteins. We disclosed the most promising and new targeting regions for the discovery of new potential polymerase inhibitors. Conserved druggable regions of putative interaction with favipiravir and molnupiravir were also mapped. We have also compared and integrated the current findings with previous research.
Keywords: Influenza A virus, Betacoronaviruses, SARS-CoV-2, PB1 protein, nsp12 protein, Conservation score, Druggability score, Consensus druggable pocket
Graphical abstract
Abbreviations
- aa
amino acid
- CDP
consensus druggable pockets
- CoV-RdRp
severe acute respiratory syndrome coronavirus 2 RNA-dependent RNA polymerase
- C-CDP
Coronavirus consensus druggable pocket
- COVID-19
coronavirus disease 2019
- cs
conservation score
- DGSS
DoGSiteScorer
- FDA
Food & Drug Administration
- F-RTP
ribofuranosyl-50-triphosphate
- IAV
influenza A virus
- IAV-RdRp
influenza A virus RNA-dependent RNA polymerase
- I-CDP
Influenza consensus druggable pockets
- MAFFT
Multiple alignment using fast fourier transform
- MERS
middle east respiratory syndrome
- MERS-CoV
middle east respiratory syndrome coronavirus
- MHRA
Medicines and Healthcare products Regulatory Agency
- MSA
Multiple Sequence Alignment
- NiRAN
N-terminal nidovirus RdRp associated nucleotidyltransferase
- NLS
Nuclear localization sequences
- NS1
non-structural protein 1
- nsp7
non-structural protein 7
- nsp8
non-structural protein 8
- nsp12
non-structural protein 12
- NTP
nucleoside triphosphate
- PA
polymerase acid
- PB1
polymerase basic 1
- PB2
polymerase basic 2
- PD
PockDrug
- PDB
Protein Data Bank
- RanBP5
Ran binding protein 5
- RdRp
RNA-dependent RNA polymerase
- RNA
Ribonucleic acid
- SARS
severe acute respiratory syndrome
- SARS-CoV
severe acute respiratory syndrome coronavirus
- SARS-CoV-2
severe acute respiratory syndrome coronavirus 2
- SF
MOE-SiteFinder
- UK
United Kingdom
- US
United States
- WHO
World Health Organization
1. Introduction
Influenza A virus (IAV) and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are the etiologic agents of influenza and Coronavirus Disease 2019 (COVID-19), respectively. IAV and SARS-CoV-2 represent a major concern to public health and economy worldwide, because of their epidemic and pandemic potential, associated to unsatisfactory pharmacological prophylaxis and treatment strategies (Prioritizing, 2022). Current vaccines for influenza and COVID-19 are safe, although several limitations are widely recognized (National Institutes of Health, 2019). The influenza vaccine needs to be annually reviewed and their effectiveness might be compromised due to an antigenic mismatch between the strains included in the vaccine and the circulating strains (Lewnard and Cobey, 2018). Regarding COVID-19, the vaccination program for SARS-CoV-2 remains challenging from a global perspective, and there is still lack of robust information concerning possible seasonality and the duration and effectiveness of vaccinal-induced immunity against newly emerging SARS-CoV-2 variants (Baker et al., 2021). Regarding clinically authorized antiviral strategies, only a few options are available for treatment (National Institutes of Health, 2019). Moreover, usually the effectiveness of influenza antivirals decreases after the initial 48h of the symptoms’ onset (Naesens et al., 2016). Also, available drugs can become less effective due to the emergence of viruses with reduced susceptibility to antivirals (López-Medrano et al., 2012). Thus, the search for robust targets and new antiviral strategies directed against such targets is of utmost relevance. Antivirals currently represent the most reliable therapeutic strategy in the advent of a new pandemic virus, considering the presumable incapability to develop new vaccines for global use to control the early fast-spreading phase of the pandemic disease (Longini et al., 2005). Moreover, the existence of several antiviral options for influenza and COVID-19, with different mechanisms of action, might allow the combination of therapeutic strategies which can be more effective and minimize the selection of resistant viruses (Perelson et al., 2012).
IAV and SARS-CoV-2 are ribonucleic acid (RNA) viruses that contain an RNA-dependent RNA polymerase (RdRp) complex to express their proteins and replicate their genome (Hillen et al., 2020; Fodor, 2013). Hence, the RNA polymerase has been highlighted as a major target for the development of anti-influenza and anti-coronaviruses approaches, considering its importance in the viral replication cycle (Buonaguro et al., 2020; Hayden and Shindo, 2019). Focusing on blocking the replication process via polymerase inhibition is a key strategy to reduce viral titer that plausibly minimize disease transmission and severity (Buonaguro et al., 2020; Wölfel et al., 2020).
Influenza A RNA polymerase (IAV-RdRp) is a protein complex formed by three subunits: polymerase basic 1 (PB1), polymerase acid (PA) and polymerase basic 2 (PB2) (Pflug et al., 2017). The heterotrimeric IAV-RdRp complex assembles in the nucleus of the infected cell, following PA-PB1 dimer and the PB2 import from the cytoplasm, separately (Hutchinson et al., 2011). Nuclear import of PA-PB1 dimers occurs due to the interaction between a nuclear import factor, the Ran binding protein 5 (RanBP5) and the nuclear localization sequences (NLS) of PB1 (Deng et al., 2006). The association between PA and PB1 is essential to form a suitable substrate for nuclear import, and consequently a functional trimeric RdRp (Fodor and Smith, 2004). While all the subunits are important for efficient transcription and replication, the 757 amino acids (aa) long PB1 protein plays a central role in the catalytic activities of the IAV-RdRp (Kobayashi et al., 1996; Zamarin et al., 2006). The overall organization of PB1 structure is depicted in Fig. 1 .
Fig. 1.
Schematic representation of the structure of IAV-RdRp PB1 protein. The PB1 protein has a characteristic right-hand fold (common to several RNA polymerases) and is composed by fingers, palm and thumb subdomains. Additionally, the protein is composed of a N-terminal extension (residues 1–34) and a C-terminal extension (residues 669–757), which interact with the PA and PB2 subunits, respectively. The interdomain limits are labeled with residues numbers. NTE: N-terminal extension; BR: Beta ribbon region; BH: Beta-hairpin region; CTE: C-terminal extension. Fingers, palm and thumb subdomains contain the polymerase motifs (not described in the picture)
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
The interactions between the PB1, PA and PB2 subunits are crucial to maintain the IAV-RdRp structure and assure the normal functioning of the transcription and replication processes (Walker and Fodor, 2019). For that reason, the PA, PB1 and PB2 subunits of the IAV-RdRp complex, as well as the regions of interface between these subunits may represent interesting targets for drug discovery or design. The active site of IAV-RdRp is composed by the motifs A to F placed in the fingers, palm and thumb subdomains of the PB1 subunit that, together with the N-terminal region of the PB2 subunit, form a large central cavity (Pflug et al., 2017). Favipiravir is a RdRp inhibitor that targets the IAV-RdRp and blocks IAV genome copies replication (Pandey et al., 2020). However, its use is only approved in Japan and France for limited clinical conditions (Furuta et al., 2013a). Additionally, resistance molecular markers - namely, mutations in a highly conserved region of the IAV-RdRp (motif F) - have been already selected in vitro, which pose as a potential threat for antiviral resistance development (Goldhill et al., 2018).
The RdRp complex of the betacoronaviruses (beta-CoVs) is composed of the catalytic subunit non-structural protein 12 (nsp12) and the nsp8 and nsp7 cofactors (CoV-RdRp) (Kirchdoerfer and Ward, 2019). It contains a right-hand-like structure with fingers, palm, and thumb subdomains, similarly to the structure described for the PB1 protein (Gao et al., 2020). Structure and domain organization of nsp12 are depicted in more detail in Fig. 2 .
Fig. 2.
Schematic representation of the structure of CoV-RdRp nsp12 protein. The nsp12 (residues S1-Q932) is the core component of the complex and is composed of an N-terminal nidovirus RdRp associated nucleotidyltransferase (NiRAN) domain (residues D60-R249), an interface domain (residues A250-R365), and a nsp12-RdRp domain (residues S367–F920). The interdomain limits are labeled with residues numbers. Highly conserved polymerase motifs A-F are coloured in grey. NiRAN: N-terminal nidovirus RdRp associated nucleotidyltransferase; RdRp: RNA-dependent RNA polymerase. The RdRp domain contains a right-hand fold with fingers, palm and thumb subdomains
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
Nsp12 subunit contains the CoV-RdRp active site, that is essential for both transcription and replication processes (Hillen et al., 2020). Similarly to the IAV-RdRp, the protein active site is formed by the conserved CoV-RdRp motifs A to G, in the fingers and palm subdomains of the nsp12 protein. These motifs contain essential residues for RNA synthesis, that interact and stabilize the RNA template. Specifically, motif B contains a loop region which, together with motif G residues, undergo conformational changes in response to bind the template-product (Yin et al., 2020). nsp8 and nsp7 are accessory subunits, that bind to nsp12 through the fingers (nsp8) and thumbs (nsp8 and nsp7) subdomains, respectively (Peng et al., 2020). The two copies of nsp8 and the nsp7 cofactors stabilize the nsp12 structure during the replication process and prevent the premature dissociation of the tetrameric CoV-RdRp complex (Yin et al., 2020). A new direct-acting antiviral, molnupiravir, targeting the CoV-RdRp complex was recently approved by the U.S Food & Drug Administration (FDA) and the Medicines and Healthcare products Regulatory Agency (MHRA) for the treatment of COVID-19 (U.S Food and Drug Administration, 2022). Molnupiravir is a broad-spectrum antiviral that induces RNA mutagenesis during viral replication (Gordon et al., 2021). However, recent studies showed that molnupiravir has a less-than-expected effectiveness in decreasing hospitalization due to COVID-19 (Kozlov, 2021). Furthermore, the use of molnupiravir is only approved in the United States (US) and United Kingdom (UK) and exclusively for cases of mild-to-moderate disease in adults with risk factors for developing severe illness (U.S Food and Drug Administration, 2022; Medicines and Healthcare products Regulatory Agency, 2022).
Despite the opposite polarity of the viral genome and the low homology between IAV-RdRp and CoV-RdRp complexes (Buonaguro et al., 2020), these share some common general features. For instance, both contain a right-hand-like structure, with fingers, palm and thumbs subdomains, that integrate the highly conserved motifs of the protein active site responsible for RNA interaction (Yin et al., 2020; Stubbs and Te Velthuis, 2014). Additionally, despite the conformational changes that occur in each IAV-RdRp and CoV-RdRp complexes, the PB1 protein and the nsp12 protein remain practically static, while the major rearrangements take place in the respective accessory subunits (te Velthuis and Fodor, 2016; Wang et al., 2020). Considering the analog function of these proteins and their structural similarities and, moreover, the existence of drug molecules as favipiravir and molnupiravir - that are known to interact with both RdRp complexes of these two viruses (Magro, 2020) –, it is important to map the pockets in these polymerases structures as the starting point to identify promising targets for each virus.
The development of a promising antiviral strategy primarily requires a drug target which is demanded to be (i) essential to the viral replication and (ii) druggable, i.e., it can interact and be modulated by potential inhibitors (Hajduk et al., 2005). Druggability is the protein's ability to bind drug-like molecules with high affinity and specificity (Hopkins and Groom, 2002). Druggable pockets are characterized by physicochemical parameters like high volume, large depth and enclosure and a higher percentage of charged residues and hydrophobicity. Also, the size of the pocket should be > 10 residues to enhance the interaction between the target and the ligand. “Decoy pockets” (pockets with less than 14 residues), may be incapable to properly bind a ligand (Hussein et al., 2015; Volkamer et al., 2012). Conservation studies allow to identify promising functionally/structurally important residues of the protein, that may represent therapeutically relevant disease-modifying targets (Blundell, 1996). Also, antiviral strategies targeting these residues are presumably less likely to become ineffective due to a mutation of the virus into a drug-resistant profile.
A robust structure-based analysis of the PB1 (influenza) and the nsp12 (beta-CoVs) polymerase proteins as drug targets has the potential to promote and accelerate the rational discovery of new antivirals, that target the most promising regions regarding the selected targets (Blundell, 1996; Kirchmair et al., 2011).
Here, we focused on the PB1 protein from IAV-RdRp and the nsp12 protein from CoV-RdRp as drug targets for a prospective antiviral strategy. We studied the conservation and druggability patterns of these proteins, individually, to identify and thoroughly explore conserved druggable hot spots. Similar structure-based antiviral research studies were successfully performed in the recent past concerning the non-structural protein 1 (NS1) from IAV and the Spike protein from a wide range of beta-CoVs (Trigueiro-Louro et al., 2019, 2020, 2022; Darapaneni et al., 2009; Sugiyama et al., 2009). Our findings highlight potential regions of functional or structural importance in these proteins, which may represent key targets for a potential pharmacological modulation of polymerase-mediated viral infection.
2. Results
For the conservation studies, each amino acid site within the PB1 subunit (IAV-RdRp) and the nsp12 protein (CoV-RdRp) sequences was scored between 0 (high variable) and 11 (highly conserved). As detailed in the Material and Methods, residues with conservation score of 11 corresponded to 100% amino acid identity; residues with conservation score (cs) ≥ 10 were considered highly conserved; residues with cs of 7–9 were considered conserved; and residues with cs ≤ 6 were considered variable. Residues with a cs of 11 were named top-conserved residues. Druggable residues/regions were predicted and scored from 0 to 2 (highly druggable). Residues with the maximum druggability score of 2 in all crystallographic structures were named top-druggable residues. Residues/regions both top-conserved and top-druggable were named top-ranked hot spots and represent the most promising sites to be considered for the development of a prospective antiviral strategy targeting either the IAV-RdRp or the CoV-RdRp.
2.1. Influenza PB1 protein
2.1.1. Conservation studies
Considering the influenza PB1 protein: 64.60% of the residues are top-conserved; 94.32% are conserved (cs between 7 and 9); and only 5.68% are considered variable (cs ≤ 6). Cs of each residue from the different PB1 domains are described in Table 1 .
Table 1.
Conservation grade classification for each amino acid position on the IAV PB1 protein. Residues with conservation score of 11 have the highest grade corresponding to absolute conserved sites (100% amino acid identity; top-conserved residues); residues with conservation score ≥10 were classified as highly conserved; residues with conservation score of 7–9 were classified as conserved; and residues with conservation score ≤6 were classified as variable. NTE: N-terminal extension; BR: Beta ribbon region; BH: Beta-hairpin region; CTE: C-terminal extension.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
Region | Conservation score ≤6 | Conservation score 7-9 | Conservation score ≥10 |
---|---|---|---|
NTE | A14 | V12 | M1, D2, V3, N4, P5, T6, L7, L8, F9, L10, K11, P13, Q15, N16, A17, I18, S19, T20, T21, F22, P23, Y24, T25, G26, D27, P28, P29, Y30, S31, H32, G33, T34 |
BR | E177, M179, M195, Q210 | I181, V200, R211 | E178, E180, T182, T183, H184, F185, Q186, R187, K188, R189, R190, V191, R192, D193, N194, T196, K197, K198, M199, T201, Q202, R203, T204, I205, G206, K207, K208, K209 |
Fingers | Q48, R52, T57, E75, V113, G154, T156, D175, L212, N213, G216, R327, I339, A401 | K54, L108, M111, V114, V149, M171, E172, M174, K176, R215, T257, S261, M317, I336 | G35, T36, G37, Y38, T39, M40, D41, T42, V43, N44, R45, T46, H47, Y49, S50, E51, G53, W55, T56, N58, T59, E60, T61, G62, A63, P64, Q65, L66, N67, P68, I69, D70, G71, P72, L73, P74, D76, N77, E78, P79, S80, G81, Y82, A83, Q84, T85, D86, C87, V88, L89, E90, A91, M92, A93, F94, L95, E96, E97, S98, H99, P100, G101, I102, F103, E104, N105, S106, C107, E109, T110, E112, Q115, Q116, T117, R118, V119, D120, K121, L122, T123, Q124, G125, R126, Q127, T128, Y129, D130, W131, T132, L133, N134, R135, N136, Q137, P138, A139, A140, T141, A142, L143, A144, N145, T146, I147, E148, F150, R151, S152, N153, L155, A157, N158, E159, S160, G161, R162, L163, I164, D165, F166, L167, K168, D169, V170, S173, K214, Y217, L218, I219, R220, A221, L222, T223, L224, N225, T226, M227, T228, K229, D230, A231, E232, R233, G234, K235, L236, K237, R238, R239, A240, I241, A242, T243, P244, G245, M246, Q247, I248, R249, G250, F251, V252, Y253, F254, V255, E256, L258, A259, R260, I262, C263, E264, N314, P315, R316, F318, L319, A320, M321, I322, T323, Y324, I325, T326, N328, Q329, P330, E331, W332, F333, R334, N335, L337, S338, A340, P341, I342, M343, F344, S345, N346, K347, M348, A349, R350, L351, D398, G399, T400, S402, L403, S404, P405, G406, M407, M408, M409, G410, M411 |
Palm | L298, E457, V473, K486 | K430, 433, H456, D464 | K265, L266, E267, Q268, S269, G270, L271, P272, V273, G274, G275, N276, E277, K278, K279, A280, K281, L282, A283, N284, V285, V286, R287, K288, M289, M290, T291, N292, S293, Q294, D295, T296, E297, S299, F300, T301, I302, T303, G304, D305, N306, T307, K308, W309, N310, E311, N312, Q313, F412, N413, M414, L415, S416, T417, V418, L419, G420, V421, S422, I423, L424, N425, L426, G427, Q428, K429, Y431, T432, T434, T435, Y436, W437, W438, D439, G440, L441, Q442, S443, S444, D445, D446, F447, A448, L449, I450, V451, N452, A453, P454, N455, G458, I459, Q460, A461, G462, V463, R465, F466, Y467, R468, T469, C470, K471, L472, G474, I475, N476, M477, S478, K479, K480, K481, S482, Y483, I484, N485, T487, G488, T489, F490, E491, F492, T493, S494, F495, F496, Y497 |
BH | R361, L364, S375, E383, K387, I397 | I368, M372, S384, R386 | G352, K353, G354, Y355, M356, F357, E358, S359, K360, M362, K363, R365, T366, Q367, P369, A370, E371, L373, A374, I376, D377, L378, K379, Y380, F381, N382, T385, K388, I389, E390, K391, I392, R393, P394, L395, L396 |
Thumb | I517, L576, Q584, A587, P596, D618, D619, R621, E638, I667 | I525, S573, D581, K586, L628 | R498, Y499, G500, F501, V502, A503, N504, F505, S506, M507, E508, L509, P510, S511, F512, G513, V514, S515, G516, N518, E519, S520, A521, D522, M523, S524, G526, V527, T528, V529, I530, K531, N532, N533, M534, I535, N536, N537, D538, L539, G540, P541, A542, T543, A544, Q545, M546, A547, L548, Q549, L550, F551, I552, K553, D554, Y555, R556, Y557, T558, Y559, R560, C561, H562, R563, G564, D565, T566, Q567, I568, Q569, T570, R571, R572, F574, E575, K577, K578, L579, W580, Q582, T583, S585, G588, L589, L590, V591, S592, D593, G594, G595, N597, L598, Y599, N600, I601, R602, N603, L604, H605, I606, P607, E608, V609, C610, L611, K612, W613, E614, L615, M616, D617, Y620, G622, R623, L624, C625, N626, P627, N629, P630, F631, V632, S633, H634, K635, E636, I637, S639, Y657, D658, A659, V660, A661, T662, T663, H664, S665, W666, P668 |
PL | V645, S654 | N642, M646 | V640, N641, A643, V644, P647, A648, H649, G650, P651, A652, K653, M655, E656 |
CTE | S741, I746 | K691, V709, I728, E739, S748, E752 | K669, R670, N671, R672, S673, I674, L675, N676, T677, S678, Q679, R680, G681, I682, L683, E684, D685, E686, M688, Y689, Q690, C692, C693, N694, L695, F696, E697, K698, F699, F700, P701, S702, S703, S704, Y705, R706, R707, P708, G710, I711, S712, S713, M714, V715, E716, A717, M718, V719, S720, R721, A722, R723, I724, D725, A726, R727, D729, F730, E731, S732, G733, R734, I735, K736, K737, E738, F740, E742, I743, M744, K745, C747, T749, I750, E751, L753, R754, R755, Q756, K757 |
Variable residues are mostly located at the fingers (namely, residues: T57, V113, G154, G216) and thumb subdomains (namely, residues: Q584, A587, R621, S741). Apart from variable residues, fingers subdomain contains 87.89% highly conserved residues (cs between 10 and 11); while thumb subdomain contains 88.81% (which represents 119 residues from a total of 134 that comprise this domain, as shown in Table 1). The palm subdomain showed the most promising results regarding the conservation analysis, with 94.12% highly conserved residues, 85.00% of which are top-conserved. Altogether, this resulted in three regions with the maximum cs: E78-A93 (from fingers subdomain), G274-M290 and V439-A453 (from palm subdomain). The N-terminal and C-terminal extensions – that interact with PA and PB2 subunits, correspondingly, – are vastly conserved, showing highly conserved/variable residues rates of 94.29%/5.71% and 90.91%/9.09%, respectively. The priming loop region of the PB1 structure, known as essential for the replication cycle of the IAV-RdRp, is composed of 18 amino acids, from which 14 are highly conserved and the remaining 4 have lower cs: N642 (cs = 9.48), V645 (cs = 5.50), M646 (cs = 8.38) and S654 (cs = 4.78).
2.1.2. Druggability studies
For the druggability analysis, we selected structures of the IAV-RdRp PA-PB1-PB2 trimer: 6QNW, 6QPF and 6TU5. We identified 209 druggable pockets with the DoGSiteScorer (DGSS) webserver along with 449 top-druggable residues in within the highly promising druggable pockets. These were mostly located in fingers subdomain (127 residues), followed by thumb subdomain (120 residues) and palm subdomain (92 residues). Hence, several regions with the maximum druggability score can be highlighted, namely: N283-D295, T489-A521 and M523-P541. The priming loop region was not crystallized in none of the analyzed structures, considering it is an highly flexible region. The N-terminal and C-terminal regions comprised 27 and 45 highly druggable residues, respectively, that correspond to 79.41% and 50.56% of the total residues from these regions. We additionally explored the most promising conserved druggable regions (three-dimensionally close), which formed CDPs, as described in the Materials and Methods section. We identified 10 CDP with ≥14 aa in the PB1 structure of IAV-RdRp (I-CDP): The described I-CDP are mostly placed in the fingers (I-CDP6, I-CDP8, I-CDP9, I-CDP10), palm (I-CDP3, I-CDP4, I-CDP6, I-CDP7), and thumb subdomains (I-CDP1, I-CDP2, I-CDP3, I-CDP4, I-CDP5) of the influenza PB1. All I-CDP are described in Table 2 and the top five pockets with higher sizes were mapped onto the PB1 tridimensional structure (PDB entry: 6QNW; Fig. 3 ).
Table 2.
Consensus Druggable Pockets (CDPs) identified for the PB1 protein tridimensional structures of influenza A virus. Pockets were identified according to their size, on a descending order from 1 (larger) to 10 (smaller). Only pockets with ≥10 residues were considered. Positions placed in the PB1 subunit of each consensus pocket are distributed according with the respective PB1 domains. I-CDP: influenza consensus druggable pockets.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
Consensus pocket |
Position (and corresponding PB1 regions) |
Size (aa) |
||||
---|---|---|---|---|---|---|
N-terminal |
Fingers |
Palm |
Thumb |
C-terminal |
||
I-CDP1 | 546 550 553 567 569 570 571 572 574 | – | 69 | |||
I-CDP2 | 517 518 519 520 521 523 524 527 528 531 555 590 591 592 599 604 605 606 607 608 609 610 613 624 625 626 627 629 630 631 632 633 634 635 635 636 637 638 657 658 659 660 661 662 663 |
– | 54 | |||
I-CDP3 | 5 6 8 9 19 20 21 22 23 24 25 26 | 496 | 504 505 506 507 508 509 538 539 540 543 | – | 50 | |
I-CDP4 | 22 | 303 304 305 446 447 479 480 481 482 483 484 491 494 496 497 | 498 499 | – | 30 | |
I-CDP5 | 557 560 561 563 566 568 570 572 573 576 577 | – | 24 | |||
I-CDP6 | 87 88 90 91 94 | 423 426 427 431 432 438 465 469 472 473 | – | 19 | ||
I-CDP7 | 300 456 459 460 484 463 485 | – | 14 | |||
I-CDP8 | 56 58 65 66 67 316 345 346 404 | – | 13 | |||
I-CDP9 | 117 121 122 124 163 249 250 252 253 | – | 11 | |||
I-CDP10 | 48 49 50 51 52 76 78 79 80 82 | – | 10 |
Fig. 3.
Mapping of the I-CDP1 to I-CDP5 onto the PB1 protein three-dimensional structure. Ribbon diagram of IAV-RdRp protein structure (PDB entry 6QNW) are showed. All the residues that comprise each I-CDP are represented, from all three subunits of the IAV-RdRp complex. The figure includes (i) two side views related by 180-degree rotation and (ii) a closer view from the five most promising I-CDP. The consensus druggable pockets are coloured as follows: I-CDP1, green; I-CDP2, blue; I-CDP3, orange; I-CDP4, purple; and I-CDP5, yellow. I-CDP1 represents the larger pocket (containing 68 aa) and the only which integrates all subunits of the IAV-RdRp. I-CDP2 is placed in both PB1 and PB2 subunits and I-CDP9 and I-CDP10 only in the PB1 subunit. The remaining 6 I-CDP are placed in the PA-PB1 interface.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
Some residues were spotted in more than one I-CDP, namely: F22, I484, F496, Q569, T570, R572. These represent both highly druggable and highly conserved residues (cs ≥ 10). The integration of conservation and druggability patterns for the several domains of the PB1 protein structure of IAV-RdRp is described in Table S1 (Supplemental data).
2.2. Beta-CoV nsp12 protein
2.2.1. Conservation studies
2.2.1.1. SARS-CoV-2
Overall, we found that the nsp12 protein of SARS-CoV-2 is highly conserved. When considering the SARS-CoV-2 sequences – that include SARS-CoV-2 ancestral lineage and all the SARS-CoV-2 emerging variants – 97.42% of the residues are highly conserved (cs ≥ 10), 2.25% are conserved (cs of 7–9) and 0.32% are variable (cs ≤ 6). The N-terminal extension of nsp12 (formed by NiRAN, β-hairpin and interface domains) contains only 0.55% variable residues, while the remain 94.79% (which represents 346 residues) are highly druggable residues. The nsp12-RdRp domain (composed by fingers, palm, and thumb subdomains) exhibits the most promising results with respect to conservation analysis, with 85.82%; 85.14%; and 82.05% top-conserved residues, respectively. From the nsp12-RdRp domain, there were only spotted 4 residues with cs ≤ 9: Y521 (cs = 9), M629 (cs = 9), Q822 (cs = 9) and L829 (cs = 7) and G671 (cs = 0).
2.2.1.2. SARSr-CoVs (SARS-CoV-2 and SARS-CoV)
When comparing nsp12 protein sequences from SARSr-CoVs (that includes both SARS-CoV-2 and SARS-CoV) we noticed a slightly lower level of conservation, mostly in residues from the NiRAN and interface domains. Considering the complete structure of the nsp12 protein, the percentage of highly conserved residues decreases from 94.79% to 96.35% and the number of variable residues (cs ≤ 6) increases from 3 to a total of 20 (2.14%). Residue G671 shows a cs of 0 in all groups referred so far. Palm subdomain is the most conserved domain, with 99.42% top-conserved residues. Fingers subdomain remains highly conserved (comparing with previous analysis of SARS-CoV-2), with 99.27% highly conserved residues. Despite the changes in conservation rates, there are also many large sequence regions of top-conserved residues, namely V398–S520 and E522-A580, L630-G670 (from fingers subdomain), T687-E811 (from palm subdomain) and P830-A923 (from thumb subdomain).
2.2.1.3. SARSr-CoVs and MERS-CoV
Sequences from the beta-CoVs included in the study (namely, SARSr-CoVs and MERS-CoV) were also comparatively analyzed. In this group, the conservation pattern changed considerably. Sequence conservation of each position of SARS-CoV-2, SARSr-CoVs and SARSr-/MERS-CoVs nsp12-RdRp domain is presented in detail in Table 3 .
Table 3.
Overall alignment of SARS-CoV-2, SARSr-CoV and MERS-CoV sequence conservation patterns of nsp12-RdRp domain. Analysis of nsp12 sequences from SARS-CoV-2, SARSr-CoV (SARS-CoV-2 and SARS-CoV) and SARSr-/MERS-CoV group at each amino acid position from the RdRp-domain according to its conservation. The druggable sites were predicted according to the results of two independent bioinformatic tools: DGSS and SF. Each site is coloured according to the conservation score scale (0-11) showed in this figure. Highly variable residues are coloured in red (scores 0–3), variable residues in pink (scores 4–6), conserved residues in light blue (scores 7–9) and highly conserved residues in dark blue (scores 10–11). Highly conserved, conserved, and variable residues represented 79.94%, 9.12% and 10.94%, respectively, considering all sequences in study. Distinct conservation patterns are mostly present in NiRAN domain, in which it was reported a decrease of the cs in 49.8% residues. Nevertheless, 68 were top-conserved in all grouped sequences, namely: S1, F7, L8, R19, V11, G13, R18, L19, G23, T28, C53, C54, R55, Q57, E58, E61, F70–N79, H82, I86, I171, L178, R181-Q184, L186, L187, V190, C193, A199, V202, G203, L205-Y217, F219, G220, D221, F222, V234, Y237, Y238, S239, L240, L241, M242, I244, L245, T246.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
Once again, the nsp12-RdRp domain stands as the most conserved region, particularly the thumb subdomain. SARS-CoV-2, SARSr-CoVs and SARSr-/MERS-CoVs sequences all show 86.05% top-conserved residues for this subdomain. Despite the overall slightly lower percentage of highly conserved residues, we have identified 619 top-conserved residues. This resulted in multiple regions comprised by top-conserved residues: S635–C646 (fingers subdomain) Q698-S709, K751–C765, K783-M794 (palm subdomain), D618-N628, S672-G683, (fingers and palm subdomains), A840-T853, A878-H898, E917-Q932 (thumb subdomain) and E802-K821 (palm and thumb subdomains).
2.2.1.4. Druggability studies
The available structures of SARS-CoV-2 and SARS-CoV RdRp complexes were collected and four of them were selected (PDB IDs: 6M71, 6NUR, 6NUS and 7BTF), according to the strategy described in Materials and Methods. These are structures of the nsp12 subunit monomers or in association with nsp8 and nsp7 accessory subunits (tetramers). NiRAN domain showed the lowest druggability score in both nsp12 monomers (19.38% druggable residues) and tetramers (25.55% druggable residues). β-hairpin and interface regions each showed approximately 50% residues with low druggability scores. Residues of β-hairpin region were only resolved for SARS-CoV-2 nsp12 structures. Besides its low druggability rate, the interface region is comprised by 2 top-druggable regions: V318–F324 and S346–H350. Globally, within these three domains, we only identified 40 top-druggable residues. The nsp12-RdRp domain presents promising druggability results. In this domain, we have identified 155 new druggable residues pertaining to druggable pockets with the maximum druggability in SARS-CoV-2 and SARS-CoV monomers. Similarly, 224 highly druggable aa were pointed-out for tetramers. In each conformation, the identified residues are mostly part of fingers subdomain. Furthermore, 140 top-druggable residues of these are shared by both conformations and are mainly from the palm subdomain. Some of these residues are spatially close and form CDPs. We have identified 17 CDP in both conformations of the nsp12 protein (C-CDP). All C-CDPs were labeled according to a descendent order of size and are described in Table 4 . The top 5 CDP with a larger size were mapped into the crystallographic structure of the tetrameric CoV-RdRp (PDB entry 6M71; Fig. 4 ). These C-CDPs are mostly placed in fingers subdomain. Results from both conservation and druggability studies of the nsp12 protein from CoV-RdRp are shown in Table S2 (Supplemental data).
Table 4.
Consensus Druggable Pockets (CDPs) identified for the nsp12-polymerase protein monomers and tetramers. Pockets were identified according to their size and druggability score, on a descending order from 1 (larger) to 17 (smaller). Only pockets with ≥10 residues were considered. Positions of each consensus pocket are distributed according with the respective nsp12 domains. C-CDP1, C-CDP7 and C-CDP9 are placed in fingers and palm subdomains; C-CDP4 and C-CDP6 in the interface region and fingers subdomain; C-CDP11 in NiRAN domain and β-hairpin region; and C-CDP12 and C-CDP14 in thumbs subdomain and fingers or palm subdomains, respectively. C-CDP3, C-CDP10 and C-CDP16 are exclusively placed in fingers subdomain; C-CDP13 and C-CDP17 in palm subdomain; and C-CDP15 in the interface region. On the other hand, C-CDP2, C-CDP5 and C-CDP8 are placed in NiRAN, interface and palm regions. Additionally, C-CDP2 and C-CDP5 are also located in the interface region. We also noticed that C-CDP1 and C-CDP13 overlap in sites placed in the palm subdomain. *Sites 33, 34 and 47 from CDP11 are placed in the β-hairpin region. C-CDP: coronavirus consensus druggable pocket.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
Consensus pocket |
Position (and corresponding nsp12 regions) |
Size (aa) |
||||
---|---|---|---|---|---|---|
nsp12-RdRp domain |
||||||
NiRAN | Interface | Fingers | Palm | Thumb | ||
C-CDP1 | 478 479 480 482 483 579 | 582 583 584 585 586 587 588 589 591 592 597 600 601 604 605 607 696 746 749 750 753 754 755 756 | 30 | |||
C-CDP2 | 131 132 165 172 175 176 243 244 245 246 247 248 249 | 250 251 252 316 319 320 | 460 461 462 463 630 | 701 787 788 789 791 | 29 | |
C-CDP3 | 388 390 392 400 402 404 405 406 407 408 446 447 448 449 450 452 453 541 542 543 544 556 668 669 670 671 672 674 | 28 | ||||
C-CDP4 | 298 299 300 301 302 303 304 307 | 568 571 572 631 632 635 636 639 640 648 649 651 652 654 655 658 659 662 663 | 27 | |||
C-CDP5 | 231 232 233 234 235 | 289 290 291 292 305 | 467 470 471 474 | 730 731 732 733 734 735 736 737 738 742 | 24 | |
C-CDP6 | 333 342 343 344 345 356 358 359 360 363 | 369 370 373 374 370 530 534 535 536 537 | 20 | |||
C-CDP7 | 452 453 456 540 542 553 554 556 557 558 560 621 623 624 665 667 676 | 680 681 682 | 20 | |||
C-CDP8 | 164 165 166 172 | 458 459 460 462 621 625 626 627 | 620 791 792 793 794 795 798 | 19 | ||
C-CDP9 | 495 497 498 500 501 502 512 516 560 561 562 565 569 573 576 577 580 | 685 689 | 19 | |||
C-CDP10 | 410 411 440 441 442 444 544 545 546 547 548 549 551 552 553 554 555 | 17 | ||||
C-CDP11 | 33* 34* 47* | 122 126 129 130 133 134 135 138 139 140 141 142 | 15 | |||
C-CDP12 | 416 441 548 | 840 841 843 844 845 847 848 854 855 858 859 862 | 15 | |||
C-CDP13 | 588 589 590 591 592 593 597 598 601 602 756 758 812 813 815 | 15 | ||||
C-CDP14 | 593 594 595 596 597 598 599 | 822 827 828 829 830 916 917 918 | 15 | |||
C-CDP15 | 254 255 256 257 259 266 268 278 281 282 285 320 321 | 13 | ||||
C-CDP16 | 490 491 493 494 495 498 516 521 524 566 569 570 | 12 | ||||
C-CDP17 | 616 617 618 619 761 762 763 798 800 811 812 | 11 |
Fig. 4.
Mapping results of the five highest-size and grade consensus druggable pockets onto the nsp12-polymerase protein three-dimensional structure. Ribbon diagram of nsp12-polymerase monomeric chain (PDB entry 6M71) are showed in (A) two side views related by 180-degree rotation; (B) top view and (C) bottom view related by 90-degree rotation. The consensus druggable pockets are coloured as follows: C-CDP1, green; C-CDP2, blue; C-CDP3, orange; C-CDP4, purple; and C-CDP5, yellow.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
3. Discussion
Structure-based approaches targeting highly conserved and highly druggable regions for the discovery and development of more robust antivirals are lacking (Trigueiro-Louro et al., 2020, 2022).
The RdRp has a central role in viral replication (Choi, 2012). Likewise, and considering its importance, it represents an antiviral target by excellence in many infectious diseases. The available anti-polymerase drugs approved for treatment of IAV and SARS-CoV-2 infections, such as favipiravir and molnupiravir, have shown limited effectiveness, so a thoroughly study of the IAV-RdRp and the CoV-RdRp potential for the development of alternative anti-polymerase strategies – or ways to optimize the existent ones – is of utmost importance. In the current work, we mainly explored the IAV-RdRp (with a major focus on the PB1 protein) and the CoV-RdRp (mainly regarding the nsp12).
3.1. Influenza PB1 protein
Overall, we identified 297 top-ranked hot spot residues (considering the entire structure of the protein), which were mostly placed in fingers (82 residues) and palm (76 residues) subdomains. It can be readily hypothesized that targeting top-ranked hot spot residues within these subdomains could impact the viral RNA synthesis.
Likewise, the N-terminus and C-terminus of the PB1 showed promising results in our analysis, with the identification of 19 and 24 top-ranked hot spot residues, respectively. These regions interact with the PA and PB2 subunits of IAV-RdRp and single amino acid substitutions in selected positions have already proved to impact the viral transcription process (Sugiyama et al., 2009). Specifically, Sugiyama et al. showed that V715/V750 and I746/V750 double mutants showed a significantly lower level of viral RNA (vRNA) replication in vitro. Moreover, the single V715S mutation also resulted in a replication with a reduced levels of viral messenger RNA (mRNA) detected (Sugiyama et al., 2009). Our studies showed that all these residues are highly conserved (except for the position I746) and V715 is also highly druggable. A summary table integrating the results from the conservation and druggability analysis performed in this study and the evidence collected from experimental studies performed by other authors and published in the literature is depicted in Table 5 .
Table 5.
Summary table of promising IAV-RdRp PB1 residues explored in this study and respective experimental findings described in literature regarding their potential role on protein function/structure. Conservation score scale: 0–11. Druggability score scale: 0–2.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
PB1 Residue | PB1 domain | Conservation/Druggability scores | I-CDP | Findings | Authors |
---|---|---|---|---|---|
K715 | C-terminal extension | 9.73/2 | – | Double mutations 715/750 and 746/750 were associated with lower levels of vRNA in vitro. | Sugiyama et al. (Sugiyama et al., 2009) |
I746 | 5.25/0 | ||||
V750 | 11/1 | ||||
K553 | Thumb | 11/2 | I-CDP1 | Residues reported to be involved in IAV-RdRp dimerization and interaction with 3′ cRNA. Binding site of 3′ cRNA identical in Influenza B virus. | Fan et al. (Fan et al., 2019) |
Y557 | 11/2 | I-CDP5 | |||
R571 | 9.98/2 | I-CDP1 | |||
K188 | Beta-ribbon | 11/0 | – | Residues from the bipartite NLS that interact with RanBDP5 (mediate the nuclear import of the PA-PB1 complex). | Hutchinson et al.; Deng et al., 2006 (Hutchinson et al., 2011; Deng et al., 2006) |
R189 | 11/9 | ||||
K208 | 9.78/0 | ||||
K209 | 9.97/1 | ||||
K121 | Fingers | 9.98/2 | I-CDP9 | Binding-site of the active form of favipiravir. | Sada et al. (Sada et al., 2020) |
Q124 | 11/2 | I-CDP9 | |||
M246 | 11/0 | – | |||
Y253 | 9.89/2 | I-CDP9 | |||
T257 | 7.08/2 | – | |||
T223 | Fingers | 11/2 | – | Residues from motif F of the PB1 protein. | |
L224 | 11/2 | Goldhill et al.; Abeldenabi et al. (Goldhill et al., 2018; Abdelnabi et al., 2017) | |||
N225 | 11/2 | ||||
T226 | 11/2 | ||||
T243 | 11/2 | ||||
P244 | 11/2 |
Regarding the PB1 thumb domain, this region was previously described by Fan et al. as crucial for protein dimerization; and consequently promotes (i) conformational changes that allow the opening of IAV-RdRp binding site and (ii) the correct positioning of complementary RNA (cRNA) for initiation of vRNA synthesis (Fan et al., 2019). Our studies showed that this domain bears one of the highest percentages of variable residues; notwithstanding, it also presented promising conserved druggable residues in specific regions, as previously highlighted. Targeting conserved druggable residues on this subdomain, such as the ones identified in Table S1, could result in an antiviral strategy that endangered the viral replication cycle by inhibiting protein dimerization. Particularly, residues K553, Y557 and R571, that were described to interact with the 5-terminal nucleotides of cRNA (Fan et al., 2019), have cs ≥ 10 and are also top-druggable residues. Also, we identified residues K553 and Y571 as part of the I-CDP1 (Table 5). Fan et al. also described that the 3′ cRNA interacts with the IAV-RdRp through the PB1 loop (residues F551-R571) and the PA residues E300, Y464, K488 and R496 (Fan et al., 2019). It is worth to mention that we also identify the PA residues Y464, K488 and R496 as part of the I-CDP1. The development of a potential inhibitor targeting this pocket might impact on the stability of the interaction between the RdRp and the cRNA. Also, the same binding site of 3’ cRNA was identified in the RdRp of influenza B virus (Fan et al., 2019). Future studies that include structures of polymerase from other influenza virus types should be considered to identify common consensus pockets that could accommodate, for instance, a pan-polymerase inhibitor that could be active against both influenza A and influenza B viruses.
As previously explained, it is important to identify target regions that have an important role in protein function to be able to discover a potential inhibitor that could, effectively and significantly impact the viral replication process (Mandal et al., 2009). For instance, the interface of interaction between RanBP5 and the PB1 subunit is essential for nuclear import of the PA-PB1 complex from the cytoplasm and consequently further assembly with PB2 to form a functional trimeric IAV-RdRp complex (Deng et al., 2005). Moreover, previous studies reported that the knock-down of the RanBP5-PB1 interaction resulted in lower levels of vRNA and nuclear PA-PB1 dimers detection, in vitro (Deng et al., 2006). RanBP5-binding site is exclusively placed in the first N-terminal 299 aa of the PB1 subunit, and it includes residues from its bipartite NLS such as K188, R189, K208 and K209 (Deng et al., 2006). These residues were already described as conserved and with a potential role in providing a viral fitness advantage (Hutchinson et al., 2011). Our studies are consistent with the high cs previously identified for this region, although a low druggability score has been attributed (Table 5). Notwithstanding, the overall PB1 N-terminal region (M1-S299) shows promising top-ranked hot spots and I-CDP3, I-CDP4, I-CDP6, I-CDP8 and I-CDP9 are placed in this region (Table 2). An antiviral strategy directed against this protein region could not only target an early step of the viral replication, by interfering in the nuclear import of the PA-PB1 complex but might also disturb the interaction between PA and PB1 subunits that occurs in both the cytoplasm and the nucleus compartments of the infected cells.
The binding site of favipiravir ribofuranosyl-50-triphosphate (F-RTP), the active form of favipiravir, with the H3N2 polymerase protein was also described by Sada et al. It includes the following PB1 residues: K121, Q124, M246, Y253 and T257 (Sada et al., 2020). Our studies show that these residues are highly druggable (except T257 which showed a cs of 7). These residues are also characterized by high druggability scores (except M246; druggability score = 0). Residues K121, Q124 and Y253 are also part of the I-CDP9 identified in the study, which is placed in the highly promising fingers subdomain of the PB1 protein (Table 5).
Goldhill et al. and Abeldnabi et al. highlighted the importance of PB1 motif F of the PB1 protein for the antiviral mechanism of action of favipiravir (Goldhill et al., 2018; Abdelnabi et al., 2017). Motif F is placed in fingers subdomain of the PB1 subunit and comprises residues T223-P244. According to our analysis, these are top-conserved residues, (excluding R233; cs = 9,96); and the residues T223, L224, N225, T226, T243 and P244 are also highly druggable (Table 5). Considering that the PB1 crystallographic structures available in the protein databases are not fully resolved for this region, it was not possible to properly study its druggability potential. Future analysis should include the motif F of PB1, considering its prospective importance as an antiviral target (Sheahan et al., 2020). Furthermore, considering the reported interaction of favipiravir with both IAV-RdRp and CoV-RdRp, a more comprehensive study of the motif F may potentially lead to the identification of additional pockets with common features in different RNA polymerases (e.g., PB1 and nsp12), and consequently provide insights for the development of an antiviral strategy directed against both SARS-CoV-2 and influenza virus.
3.2. Beta-CoV nsp12 protein
Our conservation studies showed that the nsp12 is highly conserved throughout its domains regarding different beta-CoVs (Table 3), even when considering the newly emergent Omicron variant. This suggests that this protein may display lower evolutionary rates compared with more-exposed surface proteins. In this vein, one can hypothesize that an antiviral or immunotherapeutic approach focusing on the CoV-RdRp may represent a promising strategy for the prevention and/or treatment of COVID-19, with lower chances of becoming ineffective due to the inherent viral diversity and evolution. Regarding the comparative study of the conservation and druggability analysis, we have identified 47 top-ranked hot spot residues in the N-terminal extension of nsp12 and 164 top-ranked hot spot residues in the nsp12-RdRp domain, mainly in the fingers subdomain. These represented top-druggable residues in both protein conformations and top-conserved residues in the distinct groups of viral sequences analyzed (Table 3) and so exhibit a higher antiviral target potential. Ultimately, C-CDP such as C-CDP1, C-CDP7, C-CDP8, C-CDP9 and C-CDP13 should be the focus for a prospective antiviral strategy targeting the nsp12-RdRp domain. These may represent the starting point for the structure-based discovery of new molecules with potential to inhibit the nsp12 functions or destabilize the protein conformational structure.
Our studies highlighted the fingers and palm subdomains as the most promising regions for a future development of antiviral molecules that target SARS-CoV-2, SARSr-CoVs and MERS-CoV nsp12. These subdomains contain the highly conserved motifs that are responsible to mediate template-directed RNA synthesis (Hillen et al., 2020). Interestingly, molnupiravir was described to bind to fingers subdomain, including residues K545, A547, I548, S549, R555 (Sheahan et al., 2020). In our studies, we have identified these residues as top-ranked hot spots and as part of the 17-residues-long C-CDP10, which is fully placed in the highly promising fingers subdomain and contains other catalytic residues from the nsp12 protein motif F. A summary table integrating results collected from our studies and available data collected from experimental studies performed by other authors and published in the literature regarding nsp12 residues is described in Table 6 , similarly to the one performed for the IAV-RdRp PB1 protein.
Table 6.
Summary table of promising CoV-RdRp nsp12 protein residues identified in this study and respective experimental findings described in literature regarding their potential role on protein function/structure. Conservation results concern the analysis of all SARSr-/MERS-CoV sequences. Conservation score scale: 0–11. Druggability score scale: 0–2.
Coloured web version. If accepted, a black-and-white version of the figure will be provided for print.
nsp12 Residue | nsp12 domain | Conservation/Druggability scores | C-CDP | Findings | Authors |
---|---|---|---|---|---|
K545 | Fingers | 11/2 | C-CDP10 | Interaction with molnupiravir. | Seahan et al. (Sheahan et al., 2020) |
A547 | 11/2 | ||||
I548 | 11/2 | ||||
S549 | 11/2 | ||||
R555 | 11/2 | ||||
K545 | Fingers | 11/2 | Residues from the NTP entry channel (motif F) | Gao et al. (Gao et al., 2020) | |
R553 | 11/2 | C-CDP10 | |||
R555 | 11/2 | ||||
D618 | Palm | C-CDP17 | Divalent cation-binding residue | Gao et al. (Gao et al., 2020) | |
S759 | Palm | 11/0 | – | SDD region (motif C) | Gao et al. (Gao et al., 2020) |
D760 | 11/1 | – | |||
D761 | 11/2 | C-CDP17 | |||
K500 | Fingers | 11/0 | C-CDP9 | Essential for nsp12 protein conformational rearrangements (motif G) | Schumer-Furmer et al. (Schueler-Furman and Baker, 2003) |
S501 | 11/2 | ||||
G502 | 11/2 | ||||
K545 | Fingers | 11/2 | C-CDP10 | Favipiravir binding site in the RdRp active site | Naydenova et al. (Naydenova et al., 2020) |
K551 | 10/2 | ||||
R553 | 11/2 | ||||
D618 | Palm | 11/2 | C-CDP17 | ||
K621 | Fingers | 11/2 | C-CDP8 | ||
D761 | Palm | 11/2 | C-CDP17 | ||
K798 | 11/2 | C-CDP17 |
Gao et al. previously explored the SARS-CoV-2 RdRp and identified residues in highly conserved motifs which were also highly conserved in RNA viruses as hepatitis C virus and poliovirus (Gao et al., 2020). The most promising residues reported by Gao et al. included: K545, R553 and R555, which are part of the nucleoside triphosphate (NTP) entry channel (motif F); the divalent-cation-binding residue D618 and the classic SDD region of motif C formed by residues S759, D760 and D761. Residues K545 and R555 are included in the molnupiravir binding site and the results from our study are consistent with both (i) their structural/functional relevance for the nsp12 protein and (ii) their suitable characteristics for a possible interaction with an antiviral molecule. The remaining residues described by Gao et al. (R553, D618, S759, D760 and D761) are top-conserved residues (in every group of sequences analyzed) but only the R553 and D761 are also top-druggable residues. It is also worth to highlight that the C-CDP10 identified in our study includes the residue R553 (Table 6) (relevant residue previously reported by Gao et al. (2020). Considering the mechanism of action of the molnupiravir, it is acknowledgeable that targeting some key residues belonging to these regions will impair SARS-CoV-2 replication (Kabinger et al., 2021). Furthermore, the design of a polymerase inhibitor (or the optimization of molnupiravir) focused on a larger number of these highly promising top-ranked hot spots may prompt a more effective inactivation of the CoV-RdRp active site and consequently the blockage of the RNA-template binding.
Wang et al. reported that the molecular rearrangements that occur in the nsp12 protein during the replication cycle are mainly determined by two of the highly conserved motifs: B (residues G682 to T686) and G (residues D499 to L514) (Schueler-Furman and Baker, 2003). The results from our study are consistent with the structural relevance of these residues, showing that they are mainly highly conserved across SARSr- and MERS-CoVs nsp12 sequences; motif B is also highly druggable in both nsp12-conformations. On the other hand, motif G has only three residues with high druggability scores (K500, S501, G502), from which K500, and S501 were described to interact with the RNA-template (Schueler-Furman and Baker, 2003). The residues K500, S501 and G502 are placed in CDP9 identified by our study, which also includes part of motif B (key residue: T685) (Table 6). The top-ranked hot spots found within these regions might allow the discovery of an anti-nsp12 inhibitor, which can interfere with the conformational determinants of the polymerase protein and consequently, affect the viral replication.
There are antiviral drugs which have been reported to inhibit the function of both IAV-RdRp and CoV-RdRp, such as favipiravir (Qi et al., 2021; Furuta et al., 2013b). This suggests that RdRp complexes have key common structural determinants that allow the interaction with the same ligands for the inhibition of their activity. Hence, it is reasonable to hypothesize that there can be other common/consensus pockets that allow the binding of a putative molecule that can pharmacologically modulate the polymerase function or interfere with its structure. For instance, favipiravir has been described to bind to residues from the nsp12 protein of the CoV-RdRp active site, as K545, K551, R553, R555, D618, K621, D761 and K798 (Naydenova et al., 2020). These residues are placed in the fingers and palm subdomains from the nsp12 protein and, which were previously described to have suitable conservation and druggability patterns – highly promising target regions (Table 6). Most of these residues comprise the C-CDP10 (K545, K551, R553 and R555), which is also composed by residues that were described to interact with molnupiravir. Considering the ability of favipiravir to bind to IAV and SARS-CoV-2, this consensus pockets could represent the starting point for the design of an antiviral molecule with the ability to inhibit the function of both IAV-RdRp and CoV-RdRp and for the development of a broad-spectrum (or even universal) antiviral strategy.
This study represents a computational approach to study both the PB1 protein from IAV-RdRp and the nsp12 protein from CoV-RdRp as promising antiviral targets. However, experimental approaches need to be performed to provide us additional insights on regions with crucial roles on IAV and SARS-CoV-2 polymerases structure and/or function. We suggest that further studies should be mainly focused on the top-ranked hot spots highlighted in the fingers and palm subdomains of PB1 and nsp12 proteins. Also, a structure-based rational strategy for drug discovery/design of new inhibitor compounds – or the optimization of the authorized/in approval drugs – requires the study of the effects of pharmacological blockage through the highly promising CDPs that we identified in our study. We suggest that further studies should be mainly focused on the top-ranked hot spots highlighted in the fingers and palm subdomains of PB1 and nsp12 proteins.
4. Conclusion
It is important to discover new and more effective antiviral strategies that could contribute to overcome the current widely recognized therapeutic limitations and to be readily available for prevention and treatment of influenza and COVID-19. This is even more relevant considering the diversity and unpredictable evolution patterns of IAV and beta-CoVs.
In this work, we performed conservation and druggability studies focusing on the IAV PB1 protein and the beta-CoVs nsp12 protein, to identify promising hot spots which can later provide the basis for the structure-based discovery/design of anti-polymerase compounds. The combination of the conservation and druggability analysis disclosed 10 CDPs and 158 top-ranked hot spots for the PB1 protein of IAV-RdRp; and 17 CDPs and 164 top-ranked hot spots for the nsp12 protein of CoV-RdRp – which were mostly placed in the fingers and palm subdomains of these proteins. Also, in this study we highlighted I-CDP9 and C-CDP10, which are not only composed by residues already described to bind favipiravir and/or molnupiravir, respectively, but also contain other residues described for its importance in polymerase structure and function. Furthermore, I-CDP1, I-CDP3, I-CDP4, I-CDP6, I-CDP8 and I-CDP10 (from PB1 structure) and C-CDP1, C-CDP7, C-CDP9 and C-CDP13 (from nsp12 structure) are also highly promising pockets composed by conserved druggable residues which potentially have a relevant role on proteins’ functions and structure during viral replication. Focusing on these CDPs can provide the structural basis for the discovery/design of new and potentially more effective polymerase inhibitors – via its pharmacological modulation -, or the optimization of the authorized/in approval antiviral drugs. Such optimization may enhance the binding efficiency and consequent overcome current antiviral limitations. In a more ambitious perspective, this study may also provide preliminary insights regarding the groundwork for the development of an antiviral rationale/strategy effective against both IAV and COVID-19 – that could, ultimately, represent a new class of pan-polymerase inhibitors.
5. Materials and Methods
5.1. Conservation studies
5.1.1. Dataset construction and sequence analysis
Regarding the Influenza polymerase protein conservation studies, 984 PB1 nucleotide sequences of worldwide circulating viruses of human-infecting Influenza A subtypes – H1N1, H2N2, H3N2, H5N1 and H7N9 – were obtained from GISAID's EpiFlu™ database at www.gisaid.org (Shu and McCauley, 2017; Elbe and Buckland-Merrett, 2017). For the conservation studies of the nsp12 protein of worldwide circulating human beta-CoVs, sequences from SARS-CoV-2 variants – Alpha, Beta, Delta, Gamma, Lambda, Mu and Omicron available until 6th December (regarding omicron) were collected from GISAID's EpiCov™ database (www.gisaid.org). Sequences of the nsp12 protein of SARS-CoV-2 ancestral lineage (until March 24, 2020), SARS-CoV and MERS-CoV were also collected from public databases, including: GISAID's EpiCov™ database (www.gisaid.org) (Shu and McCauley, 2017; Elbe and Buckland-Merrett, 2017), GenBank (https://www.ncbi.nlm.nih.gov/genbank/sars-cov-2-seqs/) (Sayers et al., 2018), 2019 Novel Coronavirus Resource (2019nCoVR) (https://bigd.big.ac.cn/ncov) (Zhao et al., 2020), NIAID Virus Pathogen Resource (ViPR) database (https://www.viprbrc.org) (Pickett et al., 2012) and NCBI Virus database (https://www.ncbi.nlm.nih.gov/labs/virus) (Hatcher et al., 2017). A total of 3019 nsp12 nucleotide sequences (nucleotide positions 13442–16236 of the Orf1ab gene) were collected. All available sequences with a complete coding region were selected for the study. Duplicated sequences and sequences containing degenerate nucleotides were removed. The final datasets, which were further analyzed, contained 926 PB1 nucleotide sequences (influenza A virus) and 2890 nsp12 nucleotide sequences (beta-CoVs). The sequence alignment was performed using the Multiple Alignment using Fast Fourier Transform (MAFFT) available at https://www.ebi.ac.uk/Tools/msa/mafft/(Katoh, 2002; Madeira et al., 2019). Aligned nucleotide sequences were then automatically translated into their respective amino acid sequences using MEGA X (https://www.megasoftware.net/). Using the translation command, the program automatically extracts all protein-coding domains for translation into their respective amino acid sequences using the genetic code – predefined settings - and displays the corresponding protein sequence (Kumar et al., 2018).
5.1.2. Conservation analysis
Position-specific conservation scores of the MSA were calculated using the Valdar scoring method from the Jalview AACons Web server (version 2.11.1.4) (Waterhouse et al., 2009; Valdar, 2002). This scoring method strives to normalize the data against redundancy without losing any evolutionary data (Valdar, 2002). Conservation studies of NS1 protein of influenza A virus and of Spike protein sequence of SARS-CoV-2, have been previously achieved using this algorithm (Trigueiro-Louro et al., 2019, 2022; Choi, 2012; Darapaneni et al., 2009). Conservation degree was calculated for each residue position in the MSA and a conservation threshold of 7 was established (Livingstone and Barton, 1993). Residues with conservation score of 11 corresponded to 100% amino acid identity; residues with conservation score ≥10 were considered highly conserved; residues with conservation score of 7–9 were considered conserved; and residues with conservation score ≤6 were considered variable.
5.2. Druggability studies
5.2.1. Crystallographic structures selection
Crystallographic structures of the PB1 subunit of Influenza A polymerase and the nsp12 of SARS-CoV-2 polymerase protein were obtained from the RCSB Protein Data Bank (PDB), a freely repository available through the website www.rcsb.org (Berman et al., 2000). Reasonable crystallographic systems for polymerase protein were explored and identified based on the (higher) number/percentage of crystalized residues and subunits or co-factors, to identify the presumed binding pockets. Also, structures with high resolution were prioritized. Regarding influenza A virus, the tridimensional structures of the a) dimer conformation of PA-PB1 protein complex (PDB IDs: 2ZNL and 3CM8) and PB1-PB2 protein complex (PDB IDs: 2ZTT and 3A1G) and the b) tetramer conformation of PA-PB1-PB2 (PDB IDs: 6QNW, 6QPF and 6TU5) were used for the druggability studies. Regarding beta-CoVs, crystallographic structures of the nsp12-nsp8-nsp7 polymerase protein complex from SARS-CoV-2 (6M71 and 6NUR) and SARS-CoV (6NUS and 7BTF) were selected for study. For all crystallographic 3D structures: i) all crystallized water molecules, bound ligands and other coordinated molecules were removed from the PDB files; ii) the protonation states of the amino acid residues were assigned, and energy minimization was performed using integrated tools from the MOE 2015.10001 program (Chemical Computing Group ULC. MOE (Molecular Operating Environment), 2013).
5.2.2. Druggability prediction
The druggable sites of the chosen crystallographic structures were predicted, – for either monomeric, dimeric and tetrameric forms – from a consensus of two distinct strategies: the webserver DoGSiteScorer (DGSS) and the commercial software MOE-SiteFinder (SF) (Volkamer et al., 2012; Chemical Computing Group ULC. MOE (Molecular Operating Environment), 2013) DoGSiteScorer server was able to predict potential druggable pockets on the protein surface, based only on the geometric (size, shape) and physicochemical descriptors (grid-based method). Pockets labeled as druggable had a druggability score ≥0.4 (on a scale between 0 and 1) (Peng et al., 2020). SiteFinder of MOE 2015.10001 software uses a geometry-based method based on parameters as the receptor size and the number of contacts with the receptor; and considers the chemical properties information to predict potential pockets for drug target. The top ranked sites with a positive score, were considered as “druggable” (Chemical Computing Group ULC. MOE (Molecular Operating Environment), 2013). The third approach initially used, using the webserver PockDrug (PD), was further withdraw, considering that the selected structures exceeded the size limit set by this specific server. Both programs required the upload of a structure in PDB format.
5.2.3. Druggability study design
The druggability prediction for each polymerase-conformation of influenza and SARS-CoV-2 viruses was independently studied, in a first stage. Tridimensional structures were divided in two conformational groups: (1.1) dimers (PDB IDs: 2ZNL, 2ZTT, 3A1G, 3CM8) and (1.2) tetramers (PDB IDs: 6QNW, 6QPF, 6TU5) – from influenza virus; (2.1) monomers (PDB IDs: 6NUR, 6NUS, 6M71, 7BTF); and (2.2) tetramers (PDB IDs: 6NUR, 6M71, 7BTF) – from SARS-CoV-2 virus. For each structure in the respective conformation set, the druggability analysis was conducted using the two bioinformatic tools - (a) SF and (b) DGSS-, individually. Subsequently, and respectively for the nsp12 protein of SARS-CoV-2 and the PB1 protein of influenza A, an extended analysis was performed to merge the druggability information from both bioinformatic tools for each one of the polymerase conformations groups. The last stage of the study included the analysis of the most prevailing predicted sites regarding the information from all bioinformatic tools together and shared by all the polymerase conformation groups, for a final druggability prediction. For regions which were not crystallized, we designed a proportional measure for the results obtained, to standardize the druggability scale for all sites in analysis. All positions were graded according to a coloured scale. Each scale is tailored to the conformation group in analysis, considering: i) the number of structures that comprise each group and ii) the number of bioinformatic tools used for the corresponding analysis. The druggability score scale predicted for the monomer conformations ranges from 0 (least druggable) to 2 (most druggable), considering that in this case 2 parameters were considered (number of structures analyzed and number of algorithms) for each conformation. Accordingly, sites or residues with a druggable score ≥1 (sites identified in more than half of the total of structures considered by both algorithms) were considered druggable for each conformation. The druggability consensus were integrated and analyzed in parallel with the conservation information to identify druggable binding pockets/sites and residues that overlap or are spatially close to the conserved polymerase regions/residues. Moreover, for each polymerase protein complex, we unified the pocket druggability information from DGSS server to identify regions located spatially closed, which could form larger pockets. These were named “consensus druggable pockets” (CDPs). Only CDPs present in over half of the 3D crystallographic structures in analysis, for each virus protein-conformation, and formed by sites with the same expression were considered, to guarantee meaningful results. Likewise, only pockets with ≥10 amino acid (aa) residues were selected.
Funding
This work was supported by the grant PD/BD/128402/2017 from Fundação para a Ciência e a Tecnologia (FCT) PhD Programme in Medicines and Pharmaceutical Innovation (i3DU).
Credit author statement
Inês Figueiredo-Nunes: Methodology, Formal analysis, Investigation, Writing (Original Draft), Writing (Review & Editing), Visualization, João M. Trigueiro-Louro: Conceptualization, Methodology, Investigation, Formal analysis, Writing (Original Draft), Writing (Review & Editing), Helena Rebelo-de-Andrade: Conceptualization, Resources, Writing (Review & Editing), Supervision, Project administration, Funding acquisition.
Declaration of competing interest
The authors declare no conflict of interest.
Acknowledgments
The authors would like to thank to all authors, originating and submitting laboratories that share their sequences through the GISAID EpiFlu™, Influenza Research Database, NCBI Influenza Virus, GISAID's EpiCov™, GenBank, 2019 Novel Coronavirus Resource (2019nCoVR), NIAID Virus Pathogen Resource (ViPR) and NCBI Virus databases.
The authors also thank Rita Guedes Lab Group (Medicinal Chemistry Unit, iMed.ULisboa, FFUL) not only for providing the required computational equipment but also for sharing their expertise regarding some computational methodologies applied to this study.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.virol.2022.11.001.
Appendix A. Supplementary data
The following is the Supplementary data to this article:
References
- Abdelnabi R., Morais A.T.S., Leyssen P., Imbert I., Beaucourt S., Blanc H., Froeyen M., Vignuzzi M., Canard B., Neyts J., Delang L. Understanding the mechanism of the broad-spectrum antiviral activity of favipiravir (T-705): key role of the F1 motif of the viral polymerase. J. Virol. 2017;91(12) doi: 10.1128/JVI.00487-17. May 26. e00487-17, PMID: 28381577; PMCID: PMC5446660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baker R.E., Park S.W., Wagner C.E., et al. The limits of SARS-CoV-2 predictability. Nat Ecol Evol. 2021;5:1052–1054. doi: 10.1038/s41559-021-01514-z. [DOI] [PubMed] [Google Scholar]
- Berman H.M., Battistuz T., Bhat T.N., Bluhm W.F., Bourne P.E., Burkhardt K., et al. The protein Data Bank. Nucleic Acids Res [Internet] 2000;28(1):235–242. doi: 10.1093/nar/28.1.235. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blundell T.L. Structure-based drug design. Nature. 1996;384(6604 Suppl. l):23–26. doi: 10.1038/384023a0. Nov 7. PMID: 8895597. [DOI] [PubMed] [Google Scholar]
- Buonaguro L., Tagliamonte M., Tornesello M.L., Buonaguro F.M. SARS-CoV-2 RNA polymerase as target for antiviral therapy. J. Transl. Med. 2020;18(1):185. doi: 10.1186/s12967-020-02355-3. https://doi: 10.1186/s12967-020-02355-3 May 5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chemical Computing Group ULC. MOE (Molecular Operating Environment) Canada, H3A 2R7; Montreal, QC: 2013. 08; 1010 Sherbooke St. West, Suite #910. 2011. [Google Scholar]
- Choi K.H. Viral polymerases. Adv. Exp. Med. Biol. 2012;726:267–304. doi: 10.1007/978-1-4614-0980-9_12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Darapaneni V., Prabhaker V.K., Kukol A. Large-scale analysis of influenza A virus sequences reveals potential drug target sites of non-structural proteins. J Gen Virol [Internet] 2009;90(9):2124–2133. doi: 10.1099/vir.0.011270-0. Available from: [DOI] [PubMed] [Google Scholar]
- Deng T., Sharps J., Fodor E., Brownlee G.G. In vitro assembly of PB2 with a PB1-PA dimer supports a new model of assembly of influenza A virus polymerase subunits into a functional trimeric complex. J. Virol. 2005;79(13):8669–8674. doi: 10.1128/JVI.79.13.8669-8674.2005. Jul. PMID: 15956611; PMCID: PMC1143706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deng T., Engelhardt O.G., Thomas B., Akoulitchev A.V., Brownlee G.G., Fodor E. Role of ran binding protein 5 in nuclear import and assembly of the influenza virus RNA polymerase complex. J. Virol. 2006;80(24):11911–11919. doi: 10.1128/JVI.01565-06. https://doi: 10.1128/JVI.01565-06 Dec. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elbe S., Buckland-Merrett G. Data, disease and diplomacy: GISAID's innovative contribution to global health. Glob Challenges. 2017;1(1):33–46. doi: 10.1002/gch2.1018. [Internet] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fan H., Walker A.P., Carrique L., Keown J.R., Serna Martin I., Karia D., Sharps J., Hengrung N., Pardon E., Steyaert J., Grimes J.M., Fodor E. Structures of influenza A virus RNA polymerase offer insight into viral genome replication. Nature. 2019;573(7773):287–290. doi: 10.1038/s41586-019-1530-7. Sep. Epub 2019 Sep 4. PMID: 31485076; PMCID: PMC6795553. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fodor E. The RNA polymerase of influenza a virus: mechanisms of viral transcription and replication. Acta Virol. 2013;57(2):113–122. doi: 10.4149/av_2013_02_113. https://doi: 10.4149/av_2013_02_113. PMID: 23600869 [DOI] [PubMed] [Google Scholar]
- Fodor E., Smith M. The PA subunit is required for efficient nuclear accumulation of the PB1 subunit of the influenza A virus RNA polymerase complex. J. Virol. 2004;78(17):9144–9153. doi: 10.1128/JVI.78.17.9144-9153.2004. https://doi: 10.1128/JVI.78.17.9144-9153.2004 Sep. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Furuta Y., Gowen B.B., Takahashi K., Shiraki K., Smee D.F., Barnard D.L. Favipiravir (T-705), a novel viral RNA polymerase inhibitor. Antivir. Res. 2013;100(2):446–454. doi: 10.1016/j.antiviral.2013.09.015. https://doi: 10.1016/j.antiviral.2013.09.015 Nov. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Furuta Y., Gowen B.B., Takahashi K., Shiraki K., Smee D.F., Barnard D.L. Favipiravir (T-705), a novel viral RNA polymerase inhibitor. Antivir. Res. 2013;100(2):446–454. doi: 10.1016/j.antiviral.2013.09.015. Nov. Epub 2013 Sep 29. PMID: 24084488; PMCID: PMC3880838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao Y., Yan L., Huang Y., Liu F., Zhao Y., Cao L., et al. Structure of the RNA-dependent RNA polymerase from COVID-19 virus. Science. 2020;368(6492):779–782. doi: 10.1126/science.abb7498. 80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldhill D.H., Te Velthuis A.J.W., Fletcher R.A., Langat P., Zambon M., Lackenby A., et al. The mechanism of resistance to favipiravir in influenza. Proc. Natl. Acad. Sci. U. S. A. 2018;115(45):11613–11618. doi: 10.1073/pnas.1811345115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gordon C.J., Tchesnokov E.P., Schinazi R.F., Götte M. Molnupiravir promotes SARS-CoV-2 mutagenesis via the RNA template. J. Biol. Chem. 2021;297(1) doi: 10.1016/j.jbc.2021.100770. Jul. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hajduk P.J., Huth J.R., Tse C. Predicting protein druggability. Drug Discov. Today. 2005;10(23–24):1675–1682. doi: 10.1016/S1359-6446(05)03624-X. Dec. [DOI] [PubMed] [Google Scholar]
- Hatcher E.L., Zhdanov S.A., Bao Y., Blinkova O., Nawrocki E.P., Ostapchuck Y., et al. Virus Variation Resource-improved response to emergent viral outbreaks. Nucleic Acids Res [Internet] 2017;45(D1):D482–D490. doi: 10.1093/nar/gkw1065. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hayden F.G., Shindo N. Influenza virus polymerase inhibitors in clinical development. Curr. Opin. Infect. Dis. 2019;32(2):176–186. doi: 10.1097/QCO.0000000000000532. https://doi: 10.1097/QCO.0000000000000532 Apr. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hillen H.S., Kokic G., Farnung L., Dienemann C., Tegunov D., Cramer P. Structure of replicating SARS-CoV-2 polymerase. Nature. 2020;584(7819):154–156. doi: 10.1101/2020.04.27.063180. [DOI] [PubMed] [Google Scholar]
- Hopkins A., Groom C. The druggable genome. Nat Rev Drug Discov [Internet] 2002;1(9):727–730. doi: 10.1038/nrd892. Available from: [DOI] [PubMed] [Google Scholar]
- Hussein H.A., Borrel A., Geneix C., Petitjean M., Regad L., Camproux A.C. PockDrug-Server: a new web server for predicting pocket druggability on holo and apo proteins. Nucleic Acids Res [Internet] 2015;43(W1):W436–W442. doi: 10.1093/nar/gkv462. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hutchinson E.C., Orr O.E., Man Liu S., Engelhardt O.G., Fodor E. Characterization of the interaction between the influenza A virus polymerase subunit PB1 and the host nuclear import factor Ran-binding protein 5. J. Gen. Virol. 2011;92(Pt 8):1859–1869. doi: 10.1099/vir.0.032813-0. https://doi: 10.1099/vir.0.032813-0 Aug. [DOI] [PubMed] [Google Scholar]
- Kabinger F., Stiller C., Schmitzová J., Dienemann C., Kokic G., Hillen H.S., Höbartner C., Cramer P. Mechanism of molnupiravir-induced SARS-CoV-2 mutagenesis. Nat. Struct. Mol. Biol. 2021;28(9):740–746. doi: 10.1038/s41594-021-00651-0. Sep. Epub 2021 Aug 11. PMID: 34381216; PMCID: PMC8437801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res [Internet] 2002;30(14):3059–3066. doi: 10.1093/nar/gkf436. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirchdoerfer R.N., Ward A.B. Structure of the SARS-CoV nsp12 polymerase bound to nsp7 and nsp8 co-factors. Nat. Commun. 2019;10(1):2342. doi: 10.1038/s41467-019-10280-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirchmair J., Distinto S., Liedl K.R., Markt P., Rollinger J.M., Schuster D., et al. Development of anti-viral agents using molecular modeling and virtual screening techniques. Infect. Disord.: Drug Targets. 2011;11:64–93. doi: 10.2174/187152611794407782. [DOI] [PubMed] [Google Scholar]
- Kobayashi M., Toyoda T., Ishihama A. Influenza virus PB1 protein is the minimal and essential subunit of RNA polymerase. Arch. Virol. 1996;141(3–4):525–539. doi: 10.1007/BF01718315. https://doi:10.1007/BF01718315 [DOI] [PubMed] [Google Scholar]
- Kozlov M. Merck's COVID pill loses its lustre: what that means for the pandemic. Nature. 2021 doi: 10.1038/d41586-021-03667-0. Dec 13. [DOI] [PubMed] [Google Scholar]
- Kumar S., Stecher G., Li M., Knyaz C., Tamura K. Mega X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018;35(6):1547–1549. doi: 10.1093/molbev/msy096. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewnard J.A., Cobey S. Immune history and influenza vaccine effectiveness. Vaccines. 2018;6(2):28. doi: 10.3390/vaccines6020028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Livingstone C.D., Barton G.J. Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation. Comput Appl Biosci. 1993;9(6):745–756. doi: 10.1093/bioinformatics/9.6.745. [Internet] Available from: [DOI] [PubMed] [Google Scholar]
- Longini I.M., Jr., Nizam A., Xu S., Ungchusak K., Hanshaoworakul W., Cummings D.A., Halloran M.E. Containing pandemic influenza at the source. Science. 2005;309(5737):1083–1087. doi: 10.1126/science.1115717. https://doi: 10.1126/science.1115717 Aug 12. [DOI] [PubMed] [Google Scholar]
- López-Medrano F., Fariñas M.C., Payeras A., Pachón J. Antiviral treatment and vaccination for influenza A(H1N1)pdm09 virus: lessons learned from the pandemic. Enferm. Infecc. Microbiol. Clín. 2012;30(Suppl. 4):49–53. doi: 10.1016/S0213-005X(12)70105-2. https://doi: 10.1016/S0213-005X(12)70105-2 Oct. [DOI] [PubMed] [Google Scholar]
- Madeira F., Park Y.M., Lee J., Buso N., Gur T., Madhusoodanan N., et al. The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res. 2019;47(W1):W636. doi: 10.1093/nar/gkz268. [Internet] –41. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Magro G. COVID-19: review on latest available drugs and therapies against SARS-CoV-2. Coagulation and inflammation cross-talking. Virus Res. 2020;286 doi: 10.1016/j.virusres.2020.198070. [Internet] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mandal S., Moudgil M., Mandal S.K. Rational drug design. Eur. J. Pharmacol. 2009;625(1–3):90–100. doi: 10.1016/j.ejphar.2009.06.065. Dec 25. Epub 2009 Oct 14. PMID: 19835861. [DOI] [PubMed] [Google Scholar]
- Medicines and Healthcare products Regulatory Agency Regulatory approval of Lagevrio (molnupiravir) https://www.gov.uk/government/publications/regulatory-approval-of-lagevrio-molnupiravir Available at.
- Naesens L., Stevaert A., Vanderlinden E. Antiviral therapies on the horizon for influenza. Curr. Opin. Pharmacol. 2016;30:106–115. doi: 10.1016/j.coph.2016.08.003. https://doi: 10.1016/j.coph.2016.08.003 Oct. [DOI] [PubMed] [Google Scholar]
- National Institutes of Health COVID-19 treatment guidelines panel. Coronavirus disease 2019 (COVID-19) treatment guidelines. https://www.covid19treatmentguidelines.nih.gov/ Available at. [PubMed]
- Naydenova Katerina, Muir Kyle W., Wu Long-Fei, Zhang Ziguo, Coscia Francesca, Peet Mathew J., Castro-Hartmann Pablo, Qian Pu, Sader Kasim, Dent Kyle, Kimanius Dari, Sutherland John D., Jan Löwe, Barford David, Christopher J. Russo bioRxiv. 2020;10(21) doi: 10.1101/2020.10.21.347690. [DOI] [Google Scholar]
- Pandey A., Nitin A., Basavaraj A., Mutalik S.P. Potential therapeutic targets for combating SARS-CoV-2: drug repurposing, clinical trials and recent advancements. Life Sci. 2020;256 doi: 10.1016/j.lfs.2020.117883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peng Q., Peng R., Yuan B., Zhao J., Wang M., Wang X., Wang Q., Sun Y., Fan Z., Qi J., Gao G.F., Shi Y. Structural and biochemical characterization of the nsp12-nsp7-nsp8 core polymerase complex from SARS-CoV-2. Cell Rep. 2020;31(11) doi: 10.1016/j.celrep.2020.107774. Jun 16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perelson A.S., Rong L., Hayden F.G. Combination antiviral therapy for influenza: predictions from modeling of human infections. J. Infect. Dis. 2012;205(11):1642–1645. doi: 10.1093/infdis/jis265. https://doi:10.1093/infdis/jis265 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pflug A., Lukarska M., Resa-Infante P., Reich S., Cusack S. Structural insights into RNA synthesis by the influenza virus transcription-replication machine. Virus Res. 2017;234:103–117. doi: 10.1016/j.virusres.2017.01.013. https://doi: 10.1016/j.virusres.2017.01.013 Apr 15. [DOI] [PubMed] [Google Scholar]
- Pickett B.E., Sadat E.L., Zhang Y., Noronha J.M., Squires R.B., Hunt V., et al. ViPR: an open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012;vol. 40 doi: 10.1093/nar/gkr859. [Internet] (Database issue):593–8. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prioritizing W.H.O. Diseases for research and development in emergency contexts. https://www.who.int/activities/prioritizing-diseases-for-research-and-development-in-emergency-contexts Available at:
- Qi Peng, Peng Ruchao, Yuan Bin, Wang Min, Zhao Jingru, Fu Lifeng, Qi Jianxun, Shi Yi. Structural basis of SARS-CoV-2 polymerase inhibition by favipiravir. Innovation. 2021;2(Issue 1) doi: 10.1016/j.xinn.2021.100080. ISSN 2666-6758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sada M., Saraya T., Ishii H., Okayama K., Hayashi Y., Tsugawa T., Nishina A., Murakami K., Kuroda M., Ryo A., Kimura H. Detailed molecular interactions of favipiravir with SARS-CoV-2, SARS-CoV, MERS-CoV, and influenza virus polymerases in silico. Microorganisms. 2020;8:1610. doi: 10.3390/microorganisms8101610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sayers E.W., Cavanaugh M., Clark K., Ostell J., Pruitt K.D., GenBank Karsch-Mizrachi I. Nucleic Acids Res. 2018;46(D1):D41–D47. doi: 10.1093/nar/gkx1094. [Internet] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schueler-Furman O., Baker D. Conserved residue clustering and protein structure prediction. Proteins. 2003;52:225–235. doi: 10.1002/prot.10365. [DOI] [PubMed] [Google Scholar]
- Sheahan T.P., Sims A.C., Zhou S., Graham R.L., Pruijssers A.J., Agostini M.L., Leist S.R., Schäfer A., Dinnon K.H., 3rd, Stevens L.J., Chappell J.D., Lu X., Hughes T.M., George A.S., Hill C.S., Montgomery S.A., Brown A.J., Bluemling G.R., Natchus M.G., Saindane M., Kolykhalov A.A., Painter G., Harcourt J., Tamin A., Thornburg N.J., Swanstrom R., Denison M.R., Baric R.S. An orally bioavailable broad-spectrum antiviral inhibits SARS-CoV-2 in human airway epithelial cell cultures and multiple coronaviruses in mice. Sci. Transl. Med. 2020;12(541) doi: 10.1126/scitranslmed.abb5883. Apr 29. Epub 2020 Apr 6. PMID: 32253226; PMCID: PMC7164393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shu Y., McCauley J. GISAID: global initiative on sharing all influenza data – from vision to reality. Eurosurveillance. 2017;22(13):2–4. doi: 10.2807/1560-7917.ES.2017.22.13.30494. [Internet] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stubbs T.M., Te Velthuis A.J. The RNA-dependent RNA polymerase of the influenza A virus. Future Virol. 2014;9(9):863–876. doi: 10.2217/fvl.14.66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sugiyama K., Obayashi E., Kawaguchi A., Suzuki Y., Tame J.R., Nagata K., Park S.Y. Structural insight into the essential PB1-PB2 subunit contact of the influenza virus RNA polymerase. EMBO J. 2009;28(12):1803–1811. doi: 10.1038/emboj.2009.138. Jun 17. Epub 2009 May 21. PMID: 19461581; PMCID: PMC2699363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- te Velthuis A., Fodor E. Influenza virus RNA polymerase: insights into the mechanisms of viral RNA synthesis. Nat. Rev. Microbiol. 2016;14:479–493. doi: 10.1038/nrmicro.2016.87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trigueiro-Louro J.M., Correia V., Santos L.A., Guedes R.C., Brito R.M.M., Rebelo-de-Andrade H. To hit or not to hit: large-scale sequence analysis and structure characterization of influenza A NS1 unlocks new antiviral target potential. Virology. 2019;535:297–307. doi: 10.1093/nar/gkv462. [Internet] Available from: [DOI] [PubMed] [Google Scholar]
- Trigueiro-Louro J., Correia V., Figueiredo-Nunes I., Gíria M., Rebelo-de-Andrade H. Unlocking COVID therapeutic targets: a structure-based rationale against SARS-CoV-2, SARS-CoV and MERS-CoV Spike. Comput. Struct. Biotechnol. J. 2020;18:2117–2131. doi: 10.1016/j.csbj.2020.07.017. [Internet] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trigueiro-Louro J., Santos L.A., Almeida F., Correia V., Brito R.M.M., Rebelo-de-Andrade H. NS1 protein as a novel anti-influenza target: map-and-mutate antiviral rationale reveals new putative druggable hot spots with an important role on viral replication. Virology. 2022;565:106–116. doi: 10.1016/j.virol.2021.11.001. Jan 2. [DOI] [PubMed] [Google Scholar]
- U.S Food & Drug Administration Coronavirus (COVID-19) update: FDA authorizes additional oral antiviral for treatment of COVID-19 in certain adults. https://www.fda.gov/news-events/press-announcements/coronavirus-covid-19-update-fda-authorizes-additional-oral-antiviral-treatment-covid-19-certain Available at.
- Valdar W.S.J. Scoring residue conservation. Proteins Struct. Funct. Genet. 2002;48(2):227–241. doi: 10.1002/prot.10146. [Internet] Available from: [DOI] [PubMed] [Google Scholar]
- Volkamer A., Kuhn D., Rippmann F., Dogsitescorer Rarey M. A web server for automatic binding site prediction, analysis and druggability assessment. Bioinformatics. 2012;28(15):2074. doi: 10.1093/bioinformatics/bts310. [Internet] –5. Available from: [DOI] [PubMed] [Google Scholar]
- Walker A.P., Fodor E. Interplay between influenza virus and the host RNA polymerase II transcriptional machinery. Trends Microbiol. 2019;27(5):398–407. doi: 10.1016/j.tim.2018.12.013. https://doi: 10.1016/j.tim.2018.12.013 May. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Q., Wu J., Wang H., Gao Y., Liu Q., Mu A., et al. Structural basis for RNA replication by the SARS-CoV-2 polymerase. Cell. 2020;182:417–428. doi: 10.1016/j.cell.2020.05.034. [Internet] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waterhouse A.M., Procter J.B., Martin D.M.A., Clamp M., Barton G.J. Jalview Version 2-A multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25(9):1189–1191. doi: 10.1093/bioinformatics/btp033. [Internet] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wölfel R., Corman V.M., Guggemos W., Seilmaier M., Zange S., Müller M.A., Niemeyer D., Jones T.C., Vollmar P., Rothe C., Hoelscher M., Bleicker T., Brünink S., Schneider J., Ehmann R., Zwirglmaier K., Drosten C., Wendtner C. Virological assessment of hospitalized patients with COVID-2019. Nature. 2020;581(7809):465–469. doi: 10.1038/s41586-020-2196-x. https://doi: 10.1038/s41586-020-2196-x May. [DOI] [PubMed] [Google Scholar]
- Yin W., Mao C., Luan X., Shen D.D., Shen Q., Su H., et al. Structural basis for inhibition of the RNA-dependent RNA polymerase from SARS-CoV-2 by remdesivir. Science. 2020;368(6498):1499–1504. doi: 10.1126/science.abc1560. 80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zamarin D., Ortigoza M.B., Palese P. Influenza A virus PB1-F2 protein contributes to viral pathogenesis in mice. J. Virol. 2006;80(16):7976–7983. doi: 10.1128/JVI.00415-06. https://doi: 10.1128/JVI.00415-06 Aug. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao W.-M., Song S.-H., Chen M.-L., Zou D., Ma L.-N., Ma Y.-K., et al. The 2019 novel coronavirus resource. Yi chuan = Hered. 2020;42(2):212–221. doi: 10.16288/j.yczz.20-030. [Internet] Available from: [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.