Skip to main content
Journal of Virology logoLink to Journal of Virology
. 1998 Mar;72(3):2544–2547. doi: 10.1128/jvi.72.3.2544-2547.1998

N-Terminal Protease of Pestiviruses: Identification of Putative Catalytic Residues by Site-Directed Mutagenesis

Tillmann Rümenapf 1,*, Robert Stark 1, Manuela Heimann 1, Heinz-Jürgen Thiel 1
PMCID: PMC109561  PMID: 9499122

Abstract

Pestiviruses are the only members of the Flaviviridae that encode a nonstructural protease at the N terminus of their polyproteins. This N-terminal protease (Npro) cleaves itself off of the nascent polyprotein autocatalytically and thereby generates the N terminus of the adjacent viral capsid protein C. In previous reports, sequence similarities between Npro and the catalytic residues of papain-like cysteine proteases were put forward. To test this hypothesis, substitutions of cysteine and histidine residues within Npro were carried out by site-directed mutagenesis. Translation of the mutagenized Npro-C proteins in cell-free lysates confirmed that only the predicted Cys69 was an essential amino acid for proteolysis, not His130. Further essential residues were identified with His49 and Glu22. While it remains speculative whether Glu22-His49-Cys69 actually build a catalytic triad, these results invalidate the assumption that Npro is a papain-like cysteine protease.


Pestiviruses are small enveloped RNA viruses which constitute one genus within the Flaviviridae (23). Currently three pestivirus species are recognized, namely bovine viral diarrhea virus (BVDV), classical swine fever virus (CSFV), and border disease virus of sheep. The genome of pestiviruses is a positive-stranded RNA, usually with a size of about 12,300 nucleotides, which encodes a single polyprotein of almost 4,000 amino acids. Currently, 12 mature pestivirus proteins (Npro, C, Erns, E1, E2, p7, NS2, NS3, NS4A, NS4B, NS5A, and NS5B) have been identified as products of polyprotein processing, which occurs co- and posttranslationally due to virus- and host-cell-encoded proteases (20). All members of the virus family, including the genera Flavivirus, Hepacivirus, and Pestivirus have similar genome organizations (16). A divergent evolution of flaviviruses on one side and hepaciviruses and pestiviruses on the other is indicated by a fundamental difference between the two sides concerning translation initiation. The former require a cap structure at the 5′ ends of their genomic RNAs (16), while the latter are able to undergo cap-independent translation initiation due to an internal ribosomal entry site in their genomes (12, 17, 22).

A puzzling pestiviral protein is the N-terminal protease Npro, which has no counterpart among the other members of the Flaviviridae. While it was initially suggested that Npro represents the structural capsid protein (4, 24), it has since been demonstrated that Npro is a nonstructural protein (18, 21). The only known function of Npro is its proteolytic activity which leads to cleavage between Cys168 and Ser169 in the polyprotein; Ser169 has been determined to be the N terminus of the capsid protein (18). Proteases at the N termini of polyproteins are common in a variety of positive-stranded RNA viruses (6). All known members of the so-called “viral accessory leader proteases” (6) are cysteine proteases which resemble the active-site organization of the type protease papain (4, 7). Npro was also suggested to belong to this protease family; the catalytic site was proposed to consist of Cys69 and His130 (4, 18).

The Npro-C cleavage occurs efficiently in procaryotic and eucaryotic cells and also during translation in cell-free lysates (18, 24). To study the autoproteolytic properties of Npro, wheat germ lysate (Promega), together with metabolic [35S]methionine labeling, was chosen for in vitro translation experiments. Rabbit reticulocyte lysate was not used, because endogenous proteins in the 14- to 20-kDa range (e.g., globin) interfered with the separation of Npro and C during sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE).

M13mp18 DNA served as a vector for a transcription cassette which consisted of an SP6 promoter sequence, the 5′ nontranslated region (NTR) (56 nucleotides) of Sindbis virus and the cDNA encoding amino acids 1 to 246 (Npro-C) of CSFV Alfort (Fig. 1). The 5′ NTR of Sindbis virus has been shown previously to efficiently promote translation initiation (3). Transcripts were either synthesized from replicative form M13 DNA linearized with XhoI as a template or from fragments generated by PCR with M13 universal and reverse primers which flank the polylinker sequence. SP6 transcription was performed with the MaxiScript kit (Ambion) without the addition of a cap analog. After in vitro translation, the transcripts gave rise to three protein species, Npro-C, Npro, and C. In neither in vitro system did cleavage of Npro-C proceed to completion; pulse-chase experiments indicated that the remaining Npro-C molecules were not subject to further cleavage (data not shown). Preliminary data suggested that cleavage occurs during synthesis, probably in a monomolecular fashion (cis cleavage), of the nascent polypeptide chain (not shown). In a pilot experiment, we wished to determine whether Npro was sensitive to protease inhibitors. Prior to the translation reaction a number of reagents which specifically inhibit a variety of serine, cysteine, and aspartic acid proteases (antipain dihydrochloride, 1 mg/ml; aprotinin, 0.5 mg/ml; 4-(2-aminoethyl)benzenesulfonyl fluoride, 2 mg/ml; TLCK (Nα-p-tosyl-l-lysine chloromethyl ketone), 9 mg/ml; TPCK (N-tosyl-l-phenylalanine chloromethyl ketone), 8.5 mg/ml; chymostatin, 1 mg/ml; E64, 1 mg/ml; leupeptin, 0.1 mg/ml; pepstatin, 0.5 mg/ml; phosphoramidon, 1 mg/ml; bestatin, 0.25 mg/ml) were added to the wheat germ lysate. Although they were used at high concentrations, none of the substances showed significant inhibition of Npro-C cleavage (not shown). Thus, the use of inhibitors did not offer any clue about the type of protease.

FIG. 1.

FIG. 1

(Upper line) Structure of the cDNA construct used for in vitro transcription and translation. The cassette including the SP6 promoter, the NTR of Sindbis virus, and the Npro-C gene was cloned into M13mp18, which served for site-directed mutagenesis. For transcription, the sequence was amplified with M13 reverse and forward primers, and the PCR product was directly transcribed with SP6 RNA polymerase. (Lower line) Cleavage products of the Npro-C polyprotein. As the precise C terminus of C has not been established, it was tentatively located at amino acid 246 of the CSFV polyprotein.

We began to introduce specific mutations into the Npro gene by using oligonucleotide-directed mutagenesis according to the modified Kunkel method (8). To test for the suggested catalytic dyad (Cys69, His130) the amino acid substitutions Cys69Ser, Cys69Ala, and His130Leu were generated, and the respective mutant Npro-C polyproteins were assayed for cleavage by in vitro translation and SDS-PAGE. While the His130Leu mutation displayed a normal rate of cleavage (Fig. 2b), the Cys69Ser and Cys69Ala substitutions had deleterious effects on the cleavage of Npro-C (Fig. 2a). To support that Cys69 was actually involved in catalysis and not merely a determinant of the secondary structure of Npro, the other four conserved cysteine residues (Cys112, Cys134, Cys138, and Cys161) were replaced by serine and then alanine residues (Fig. 2a). None of these substitutions affected Npro-C cleavage. The catalytic mechanism of cysteine proteases is often based on the formation of a thiolate-imidazolium ion pair with cysteine as the nucleophile and histidine as the general base (19). Because the His130Leu substitution had no effect, the remaining three conserved histidine residues (His40, His49, and His99) were also changed to leucine residues; the substitution His49Leu abrogated the cleavage, which suggested a participation of this residue in catalysis (Fig. 2b). The involvement of His49 in proteolysis was proposed earlier (24), but no additional catalytic residues were revealed in this study.

FIG. 2.

FIG. 2

Identification of putative catalytic residues Cys69 and His49 by site-directed mutagenesis of Npro. Wheat germ lysate was used for translation of uncapped transcripts; metabolic labeling was done with [35S]methionine. (a) The conserved cysteine residues of Npro were replaced with serine and then alanine residues, and the effect on cleavage of Npro-C was assayed in wheat germ lysate. Only replacement of Cys69 affected cleavage. wt, unsubstituted Npro-C. (b) The four conserved histidine residues of Npro were replaced with leucine residues. Only His49Leu resulted in inactivation of Npro. wt, unsubstituted Npro-C.

The identification of the putative catalytic amino acid Cys69 supported the working hypothesis that Npro very likely is a cysteine protease. However, the position of the catalytic His49 was incompatible with the catalytic domain of papain-like cysteine proteases. In papain-like cysteine proteases, the catalytic histidine is positioned C terminally of the catalytic cysteine (Table 1). The inverted orientation of these residues, as determined for Npro, is found in a group of cysteine proteases which, according to a proposed novel taxonomy for proteases, encompass clan CB (13). The type protease of one family (C3) of clan CB is the picornavirus 3C protease, also known as picornain (14). All cysteine proteases of this clan are encoded by viruses and are related to the chymotrypsin-like serine proteases (14). Comparison of the amino acid sequences of Npro and picornain resulted in weak yet significant homology of the amino acid sequence surrounding Cys69 of Npro and the active-site Cys159 of encephalomyocarditis virus 3C protease (Fig. 3). Close inspection of the residual sequences revealed no further homologies. Predictions for poliovirus 3C protease indicated the presence of a third catalytic residue, an acidic amino acid which is positioned between the catalytic histidine and cysteine residues (5) (Table 1). A conserved acidic amino acid at an equivalent position is not found in Npro. The only conserved acidic residue in the respective region of Npro is Asp68, which could be replaced by Asn68 without affecting cleavage (not shown). Interestingly, the acidic residue as part of the catalytic triad of picornain was recently put into question. Crystal structure analysis of 3C protease of hepatitis A virus revealed a catalytic dyad consisting of histidine and cysteine residues rather than a triad including an aspartic or glutamic acid residue (1).

TABLE 1.

Comparison of the array of catalytic amino acids of serine and cysteine proteases with the putative catalytic triad of pestivirus Npro

Protease (type; database code) Clana Catalytic residuesb
Papain-like cysteine (papain; PAPA_CARPA) CA C25H159N175
Chymotrypsin-like serine proteases (bovine chymotrypsin; CTRA_BOVIN) SA H57D102gdS195g
Chymotrypsin-like cysteine proteases (poliovirus 3C; POLG_POL1M) CB H40E71gdC147g
Subtilisin-like serine proteases (subtilisin; SUBE_BACSU) SB D32H62gtS215xa/s
Npro (CSFV; HCVCGSA) E22H49gdC69rs
a

Classification into clans is according to Rawlings and Barrett (14, 15). 

b

Numbers in subscript refer to the positions of the respective catalytic residues (in boldface type) in the type proteases. The numbering for papain accounts for the mature protease after removal of the activation peptide (133 amino acids). Note the relatively close spacing of the putative catalytic residues in Npro compared to the other proteases. 

FIG. 3.

FIG. 3

Local alignment of amino acid sequences surrounding the catalytic cysteine residue (arrow) of Npro and members of the chymotrypsin-like cysteine proteases (picornain). The bottom line shows the sequence of the serine protease bovine chymotrypsin. Abbreviations: EMCV, encephalomyocarditis virus (database code, POLG_EMCV); Polio, poliovirus (database code, POLH_POL1M); bov.chym., bovine chymotrypsin (database code, CTRA_BOVIN).

Npro invariably consists of 168 amino acids in all sequenced pestivirus isolates. There are a few cases in which Npro is encoded twice in the viral genome; due to a complex genetic duplication and recombination event, Npro is then fused to the N terminus of the NS3 protein of cytopathogenic BVDVs (i.e., BVDV strain PE515cp [9]), and there is evidence that Npro mediates the release of NS3 autocatalytically. In BVDV strain PE515cp, the Npro fused to NS3 lacks the N-terminal 15 amino acids (9). This finding indicates that the N-terminal portion of Npro is probably not involved in catalysis. To determine the exact number of dispensable N-terminal amino acids, Npro deletion mutants that lacked amino acids 2 to 10, 2 to 15, 2 to 19, 2 to 20, 2 to 21, and 2 to 22 were generated (Fig. 4 and data not shown). While the stepwise deletions of N-terminal amino acids including Npro 2 to 21 had no effect on proteolysis (Fig. 4b, lanes 2, 4, and 5), the deletion of amino acids 2 to 22 resulted in an inactive protease (Fig. 4b, lane 6). Replacement of Glu22, a conserved residue in pestiviruses, with valine heavily affected cleavage in the context of N-terminally truncated Npro as well as full-length Npro (Fig. 4b, lanes 3 and 8). The substitution Glu22Asp also rendered the protease inactive, yet some residual activity was observed (Fig. 4b, lane 7). While it is reasonable to assume that His49 and Cys69 are constituents of the active site, participation of Glu22 in catalysis is uncertain. A catalytic triad consisting of Glu22-His49-Cys69 would be unusual, and there is no reported precedence among the cysteine proteases. However, such an array is somewhat reminiscent of the catalytic triad of the subtilisin-like serine proteases (Table 1). Subtilisin-like serine proteases are widespread among procaryotes and eukaryotes and include several prominent members, e.g., proteinase K, kexin, and furin (15). Interestingly, virus-encoded subtilisin-like serine proteases have not been identified so far. On the basis of sequence homology, the ORF 47 gene product of channel catfish herpesvirus was suggested to be a subtilisin-like protease (2). This assumption is questionable, since there is no direct proof for catalysis and especially because the reported homologies do not include the catalytic residues of the subtilisin-like serine proteases. Subtilisin-like serine proteases are believed to be structurally unrelated to the chymotrypsin-like serine proteases, and there is evidence that both classes have evolved separately (15). The vast majority of subtilisin-like proteases contain an active-site serine, and there is only one precedent for a subtilisin-like protease, from Bacillus smithii, which has a cysteine residue at this position (11). Subtilisin-like proteases usually are characterized by patterns of conserved amino acids surrounding the catalytic residues. Npro, however, has no apparent sequence homology with the subtilisin proteases.

FIG. 4.

FIG. 4

(a) Structures of N-terminal deletion mutants of Npro and amino acid substitutions of Glu22. (b) SDS-PAGE analysis of in vitro-translated mutants of Npro-C, as shown in panel a. wt, unsubstituted Npro-C.

Due to the inherent limitations of the mutagenesis approach, it could not be proven that Glu22, His49, and Cys69 are involved in catalysis. Because of this caveat and only minute sequence homologies to other proteases, it is not possible to state whether Npro is a member of one of the established protease families. A solution to this problem has probably to await the determination of the three-dimensional structure of Npro.

Linked to the issue of protease classification is the question of the origin of Npro. One obvious explanation is that pestiviruses acquired Npro from a host cellular source. Pestiviruses have been shown to frequently integrate parts of host cellular mRNAs into their genomes (for a review, see reference 10). In this case, there should be a good chance to detect remnants of the ancestral sequence. So far, database searches using the nucleotide and amino acid sequences of Npro have not revealed significant homology to any deposited sequence. It is also possible that pestiviruses “invented” the protease de novo by adapting a protein of a different function to their own needs. This implies the possibility that the protease domain is only one facet of a multifunctional protein, where the other one(s) remains to be elucidated.

As a first step toward understanding the function of Npro, the application of reverse genetics, together with the infectious pestivirus clones, will provide evidence whether Npro is essential for virus replication.

Acknowledgments

This work was funded by Deutsche Forschungsgemeinschaft (SFB 535) and Intervet International B.V.

REFERENCES

  • 1.Allaire M, Chernaia M M, Malcolm B A, James M N G. Picornaviral 3C cysteine proteinases have a fold similar to chymotrypsin-like serine proteinases. Nature. 1994;369:72–76. doi: 10.1038/369072a0. [DOI] [PubMed] [Google Scholar]
  • 2.Davison A. Channel catfish virus: a new type of herpesvirus. Virology. 1991;186:9–14. doi: 10.1016/0042-6822(92)90056-u. [DOI] [PubMed] [Google Scholar]
  • 3.de Groot R J, Rümenapf T, Kuhn R J, Strauss E G, Strauss J H. Sindbis virus RNA polymerase is degraded by the N-end rule pathway. Proc Natl Acad Sci USA. 1991;88:8967–8971. doi: 10.1073/pnas.88.20.8967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Dougherty W G, Semler B L. Expression of virus-encoded proteinases: functional and structural similarities with cellular enzymes. Microbiol Rev. 1993;57:781–822. doi: 10.1128/mr.57.4.781-822.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gorbalenya A E, Donchenko A P, Blinov V M, Koonin E V. Cysteine proteases of positive strand RNA viruses and chymotrypsin-like serine proteases. FEBS Lett. 1989;243:103–114. doi: 10.1016/0014-5793(89)80109-7. [DOI] [PubMed] [Google Scholar]
  • 6.Gorbalenya A E, Koonin E V, Lai M M. Putative papain-related thiol proteases of positive-strand RNA viruses. Identification of rubi- and aphthovirus proteases and delineation of a novel conserved domain associated with proteases of rubi-, alpha- and coronaviruses. FEBS Lett. 1991;288:201–205. doi: 10.1016/0014-5793(91)81034-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kräusslich H-G, Wimmer E. Viral proteinases. Annu Rev Biochem. 1988;57:701–754. doi: 10.1146/annurev.bi.57.070188.003413. [DOI] [PubMed] [Google Scholar]
  • 8.Maniatis T, Fritsch E F, Sambrook J. Molecular cloning: a laboratory manual. Cold Spring Harbor, N.Y: Cold Spring Harbor Laboratory; 1982. [Google Scholar]
  • 9.Meyers G, Tautz N, Stark R, Brownlie J, Dubovi E J, Collett M S, Thiel H-J. Rearrangement of viral sequences in cytopathogenic pestiviruses. Virology. 1992;191:368–386. doi: 10.1016/0042-6822(92)90199-Y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Meyers G, Thiel H-J. Molecular characterization of pestiviruses. Adv Virus Res. 1996;47:53–118. doi: 10.1016/s0065-3527(08)60734-4. [DOI] [PubMed] [Google Scholar]
  • 11.Milano A, Manachini P L, Parini C, Riccardi G. Sequence of the gene encoding an alkaline serine protease of thermophilic Bacillus smithii. Gene. 1994;145:149–150. doi: 10.1016/0378-1119(94)90340-9. [DOI] [PubMed] [Google Scholar]
  • 12.Poole T L, Wang C, Popp R A, Potgieter L N D, Siddiqui A, Collett M S. Pestivirus translation initiation occurs by internal ribosome binding. Virology. 1995;206:750–754. doi: 10.1016/s0042-6822(95)80003-4. [DOI] [PubMed] [Google Scholar]
  • 13.Rawlings N D, Barrett A J. Evolutionary families of peptidases. Biochem J. 1993;290:205–218. doi: 10.1042/bj2900205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rawlings N D, Barrett A J. Families of cysteine peptidases. Methods Enzymol. 1994;244:461–486. doi: 10.1016/0076-6879(94)44034-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Rawlings N D, Barrett A J. Families of serine peptidases. Methods Enzymol. 1994;244:19–61. doi: 10.1016/0076-6879(94)44004-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rice C M. Flaviviridae: the viruses and their replication. In: Fields B N, Knipe D M, Howley P M, et al., editors. Fields virology. 3rd ed. Philadelphia, Pa: Lippincott-Raven Publishers; 1996. pp. 931–959. [Google Scholar]
  • 17.Rijnbrand R, van der Straaten P A, van Rijn P A, Spaan W J M, Bredenbeek P J. Internal entry of ribosomes is directed by the 5′ noncoding region of classical swine fever virus and is dependent on the presence of an RNA pseudoknot upstream of the initiation codon. J Virol. 1997;71:451–457. doi: 10.1128/jvi.71.1.451-457.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Stark R, Meyers G, Rümenapf T, Thiel H-J. Processing of pestivirus polyprotein: cleavage site between autoprotease and nucleocapsid protein of classical swine fever virus. J Virol. 1993;67:7088–7095. doi: 10.1128/jvi.67.12.7088-7095.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Storer A C, Ménard R. Catalytic mechanism in papain family of cysteine peptidases. Methods Enzymol. 1994;244:486–500. doi: 10.1016/0076-6879(94)44035-2. [DOI] [PubMed] [Google Scholar]
  • 20.Thiel H-J, Plagemann P G W, Moennig V. Pestiviruses. In: Fields B N, Knipe D M, Howley P M, et al., editors. Fields virology. 3rd ed. Philadelphia, Pa: Lippincott-Raven Publishers; 1996. pp. 1059–1073. [Google Scholar]
  • 21.Thiel H-J, Stark R, Weiland E, Rümenapf T, Meyers G. Hog cholera virus: molecular composition of virions from a pestivirus. J Virol. 1991;65:4705–4712. doi: 10.1128/jvi.65.9.4705-4712.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tsukiyama-Kohara K, Iizuka N, Kohara M, Nomoto A. Internal ribosomal entry site within hepatitis C virus RNA. J Virol. 1992;66:1476–1483. doi: 10.1128/jvi.66.3.1476-1483.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Wengler G, Bradley D W, Collett M S, Heinz F X, Schlesinger R W, Strauss J H. Flaviviridae. In: Murphy F A, Fauquet C M, Bishop D H L, Ghabrial S A, Jarvis A W, Martelli G P, Mayo M A, Summers M D, editors. Virus taxonomy. Sixth report of the International Committee on Taxonomy of Viruses. Vienna, Austria: Springer-Verlag; 1995. pp. 415–427. [Google Scholar]
  • 24.Wiskerchen M, Belzer S K, Collett M S. Pestivirus gene expression: the first protein product of the bovine viral diarrhea virus large open reading frame, p20, possesses proteolytic activity. J Virol. 1991;65:4508–4514. doi: 10.1128/jvi.65.8.4508-4514.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Virology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES