Abstract
When herpes simplex virus infects permissive cells, the viral regulatory protein VP16 forms a specific complex with HCF-1, a preexisting nuclear protein involved in cell proliferation. The majority of HCF-1 in the cell is a complex of associated amino (HCF-1N)- and carboxy (HCF-1C)-terminal subunits that result from an unusual proteolytic processing of a large precursor polypeptide. Here, we have characterized the structure and function of sequences required for HCF-1N and HCF-1C subunit association. HCF-1 contains two matched pairs of self-association sequences called SAS1 and SAS2. One of these matched association sequences, SAS1, consists of a short 43-amino-acid region of the HCF-1N subunit, which associates with a carboxy-terminal region of the HCF-1C subunit that is composed of a tandem pair of fibronectin type 3 repeats, a structural motif known to promote protein-protein interactions. Unexpectedly, the related protein HCF-2, which is not proteolyzed, also contains a functional SAS1 association element, suggesting that this element does not function solely to maintain HCF-1N and HCF-1C subunit association. HCF-1N subunits do not possess a nuclear localization signal. We show that, owing to a carboxy-terminal HCF-1 nuclear localization signal, HCF-1C subunits can recruit HCF-1N subunits to the nucleus.
Lytic infection of human cells by herpes simplex virus is characterized by a cascade of viral gene expression initiated by the virion protein VP16 (reviewed in reference 22). Following infection of the host cell, the viral transactivator VP16 (also known as Vmw65 and αTIF) is released from the virion, whereupon it associates with two preexisting nuclear factors, Oct-1 and HCF-1, to form a multiprotein-DNA complex on VP16-responsive cis-regulatory elements in the viral immediate-early promoters (8).
Oct-1 is a broadly expressed POU homeodomain-containing transcription factor (24). The function of HCF-1 (also known as HCF, C1, VCAF, and CFF) in the uninfected cell is less well understood. The principal HCF-1 translation product is a large protein of 2,035 amino acids (12, 27), but only an amino-terminal region of about 380 amino acids is essential for HCF-1 interaction with VP16 and stabilization of the VP16-induced complex with Oct-1 (10, 15, 26). Characterization of a temperature-sensitive hamster cell line called tsBN67 has indicated that HCF-1 is involved in cell proliferation (7): at nonpermissive temperature, tsBN67 cells stop proliferating owing to a missense mutation within the HCF-1 VP16 interaction region, which also inhibits HCF-1 association with VP16 (7, 26). The molecular mechanisms, however, by which HCF-1 regulates cell proliferation are not known.
HCF-1 is a member of a protein family that includes the recently described human protein HCF-2 (11), and both HCF-1 and HCF-2 are related to a protein in the worm Caenorhabditis elegans called CeHCF (16). These three proteins have conserved the VP16 interaction region and associate with VP16 to differing extents (11, 16).
In mammalian cells, HCF-1 undergoes an unusual maturation process. Its primary translation product is cleaved at a series of six conserved 26-amino-acid repeats called HCFPRO repeats located near the center of the primary translation product (12, 27, 29). The resulting fragments, called HCF-1N and HCF-1C, are stable, and they remain noncovalently associated, creating a mature HCF-1 complex (29). Although the majority of HCF-1N and HCF-1C subunits remain noncovalently associated after proteolysis, there is a variant HCF-1 translation product called HCF-1Δ382–450, which results from removal of an exon encoding amino-terminal HCF-1 residues 382 to 450 through alternative pre-mRNA processing; this smaller HCF-1 protein undergoes proteolytic processing, but the resulting HCF-1N and HCF-1C subunits do not remain associated (29). Unlike the HCF-1 protein, HCF-2 and CeHCF do not possess HCFPRO repeats, and HCF-2 is not proteolytically cleaved like HCF-1 (11).
HCF-1 processing and the subsequent association of the resulting HCF-1N and HCF-1C subunits are unusual, and little is known concerning the mechanisms by which they occur. Here, we describe aspects of human HCF-1N and HCF-1C subunit association. Our results show that the HCF-1N and HCF-1C subunits contain two matched pairs of subunit association elements and that subunit association can permit recruitment of HCF-1N subunits to the nucleus via a nuclear localization signal (NLS) located at the carboxyl terminus of the HCF-1C subunit. Unexpectedly, HCF-2, which is not proteolyzed, shares one of the HCF-1 subunit association elements, suggesting that the role of such an element is not solely to maintain HCF-1N and HCF-1C subunit association.
MATERIALS AND METHODS
Mammalian expression plasmids.
The hemagglutinin (HA)-epitope-tagged human HCF expression constructs pCGNHCFN1011Δ382–450, pCGNHCFN450–1011, pCGNHCFN450, and pCGNHCFN380 have been described previously (26). pCGTHCFC (C-terminal residues 1436 to 2035) is identical to pCGNHCFC (26) except that the influenza virus HA epitope is replaced by a bacteriophage T7 gene 10 (T7) epitope. To ensure detection and protein stability, HCF-1 residues 348 to 450 or smaller derivatives were expressed as fusions to the GAL4 DNA-binding domain (residues 1 to 94) by subcloning into pCGNGAL4(1–94). pCGTHCFSAS2C encodes HCF-1 residues 1436 to 1756, and pCGTHCFSAS1C encodes residues 1758 to 2035. Truncations were generated either by using suitable restriction sites or by PCR or oligonucleotide-mediated mutagenesis. A fragment spanning residues 341 to 394 of human HCF-2 (11) was amplified using PCR and subcloned into pCGNGAL4(1–94). The nucleotide sequences of PCR-generated fragments were verified by DNA sequencing.
Sequence analysis.
Searches of the protein databases were performed using BLASTP (1) and SMART (Simple Modular Architecture Research Tool) (23). Sequence alignments were compiled using ZEGA and CLUSTALW (provided on-line by the Molecular Modeling and Bioinformatics Group, Skirball Institute, New York University School of Medicine).
Transfections, immunoprecipitations, and immunoblotting.
Human 293 and 293T cells were transfected by electroporation or with Lipofectamine (Gibco BRL, Inc.), respectively (11, 26). Preparation of whole-cell nuclear extracts, immunoprecipitation, and immunoblotting were performed as described previously (26).
Immunofluorescence.
293 cells were transfected by electroporation and seeded onto sterile coverslips. After 36 h, the coverslips were washed in phosphate-buffered saline (PBS); the cells were fixed in 4% paraformaldehyde for 20 min and permeabilized for 5 min with PBS containing 0.1% Triton X-100. The samples were then washed three times with PBS and blocked for 30 min in PBS containing 2% dry milk. After washing in PBS, coverslips were incubated for 45 min with the primary antibody, washed, and incubated with the secondary antibody for an additional 45 min. Fluorescence was observed with a Nikon fluorescence microscope or a Zeiss confocal microscope.
RESULTS
Figure 1A shows a schematic of the structure of the primary human HCF-1 translation product called HCF-1300. HCF-1300 contains two previously described sets of repeat elements: centrally located HCF-1PRO repeats and six amino-terminal kelch-like repeats called HCF-1KEL, which likely fold into a six-bladed β-propeller structure (residues 17 to 360 [15, 26]). Here, we analyzed the sequence of the carboxy-terminal region of HCF-1, which was originally described as rich in charged residues and the large hydrophobic residues tryptophan, tyrosine, and phenylalanine (27). This region has been conserved in evolution: both HCF-2 (11) and CeHCF (16) share extensive sequence similarity to residues 1812 to 2001 of HCF-1 (Fig. 1B). This region also exhibits internal sequence similarity, which indicates a repeat of HCF-1 residues 1812 to 1875 and 1910 to 1992. These two sequences share a conserved tryptophan (W) residue (positions 1812 and 1910) and the interdigitated sequence FRXXXXNXXGXG, where “X” indicates any amino acid (residues 1864 to 1875 and 1981 to 1992 [Fig. 1B]).
A comparison of each carboxy-terminal repeat with other protein sequences using the BLASTP program (1) and also the SMART database (23) revealed significant similarity to fibronectin type 3 (Fn3) repeats. Fn3 repeats are typically 90 to 100 residues long and, although exhibiting relatively low primary amino acid sequence similarity among different proteins, display considerable structural similarity (reviewed in reference 2). Figure 1C shows a sequence alignment of four Fn3 repeats from the β4 subunit of α6β4 integrin. The first two (Fn3-1 and Fn3-2) and second two (Fn3-3 and Fn3-4) represent tandem repeats as with the two repeats in HCF-1. The structure of the first set of tandem repeats has been determined (3), and the positions of β strands in the structure are shown in Fig. 1C. Together, Fig. 1B and C show the similarity of the two carboxy-terminal HCF-1 repeats to Fn3 repeats. The predicted Fn3 repeats in HCF-1 have also been identified independently by the SMART database (http://smart.embl-heidelberg.de). These sequence analyses identify for the first time a predicted structural element within the carboxy-terminal region of HCF-1. We therefore refer to these carboxy-terminal HCF-1 repeats as HCF-1Fn31 and HCF-1Fn32.
The HCF-1N and HCF-1C subunits readily associate when synthesized separately.
In the mature endogenous HCF-1 complex, the noncovalently associated HCF-1N and HCF-1C subunits result from proteolytic processing of the HCF-1300 precursor (29); thus, the HCF-1N and HCF-1C subunits are in close proximity for coassociation prior to proteolytic cleavage. HCF-1N and HCF-1C subunit association might, however, be regulated through sequential dissociation and reassociation. To test the feasibility of such a hypothesis, we asked whether dissociated HCF-1N and HCF-1C subunits can associate without prior tethering as a single translation product.
We cotransfected human 293 cells with plasmids separately encoding engineered HCF-1N (HCF-1N1011) and HCF-1C (HCF-1C600) subunits. The HCF-1N subunit was tagged at its amino terminus with an influenza virus HA epitope, and the HCF-1C subunit was tagged at its amino terminus with a T7 epitope. As a positive control for protein-protein association, we coexpressed the HCF-1N subunit with a T7-tagged VP16 protein (26). Protein extracts were prepared from the transfected cells, and the HA-tagged amino-terminal polypeptide was recovered by immunoprecipitation with an anti-HA (αHA) monoclonal antibody. The resulting immune complexes were resolved on a sodium dodecyl sulfate (SDS)-polyacrylamide gel, and the coimmunoprecipitated HCF-1C and VP16 proteins were detected by immunoblotting with an αT7 monoclonal antibody.
Figure 2 shows the result of such an experiment. Neither VP16 nor the HCF-1C subunit was recovered in the absence of HA-HCF-1N subunit expression (lanes 2 and 3), but both were recovered effectively in the presence of the full-length HCF-1N subunit HA-HCF-1N1011 (compare lanes 4 and 5). This result demonstrates that HCF-1N and HCF-1C subunit association is not dependent on coexpression as the HCF-1300 precursor and indicates that the association of HCF-1N and HCF-1C subunits in endogenous HCF-1 can be dynamic.
The HCF-1N subunit contains two HCF-1C subunit association regions.
We have previously shown that the natural HCF-1 variant HCF-1Δ382–450 (Fig. 1A) is processed normally, but the resulting HCFNΔ382–450 and HCFC subunits do not remain associated (29). Consistent with this observation, when synthesized separately as shown in Fig. 2, an HCF-1N subunit lacking residues 382 to 450 but retaining the VP16 interaction region (HCF-1N1011Δ382–450) bound VP16 (lane 6) but not the engineered HCF-1C subunit (lane 7), suggesting that loss of residues 382 to 450 disrupts sequences involved in HCF-1N association with the HCF-1C subunit.
To map more precisely the HCF-1N region(s) involved in HCF-1 subunit association, we divided the HCF-1N subunit into two segments, with one fragment containing residues 1 to 450 (HCF-1N450)—thus including the 382–450 region involved in controlling HCF-1C subunit association—and the other spanning the remainder of the HCF-1N subunit (HCF-1N450–1011). Consistent with previous studies (26), the HCF-1N450 protein (Fig. 2, lane 8) but not the HCF-1N450–1011 protein (lane 10) associated with VP16. To our surprise, however, both nonoverlapping HCF-1N derivatives associated with the HCF-1C subunit (lanes 9 and 11), suggesting that the HCF-1N subunit contains two HCF-1C association sequences. We refer to the regions involved in HCF-1N and HCF-1C subunit association as self-association sequence (SAS) elements; the SAS elements in HCF-1N450 and HCF-1N450–1011 are referred to as HCF-1SAS1N and HCF-1SAS2N, respectively.
The presence of an HCF-1N SAS element in the HCF-1N450–1011 fragment was unexpected because the natural HCF-1NΔ382–450 subunit encompasses all but one residue of the HCF-1N450–1011 fragment and yet fails to associate with the HCF-1C subunit (Fig. 2, lane 7). Thus, loss of residues 382 to 450 in the HCF-1N subunit appears to have a dominant negative effect on the activity of HCF-1SAS2N element in HCF-1N450–1011. Given this unexpected result, we asked whether the HCF-1SAS1N element in the HCF-1N450 fragment (lane 9) might reside within the residues 1 to 380 (i.e., the VP16 interaction region), and its activity also be inhibited in the HCF-1NΔ382–450 deletion variant, by assaying the ability of an HCF-1N380 fragment to associate with the HCF-1C subunit. Consistent with previous results (26), the HCF-1N380 fragment associated with VP16 (lane 12); it failed, however, to associate with the HCF-1C subunit (lane 13). Thus, the HCF-1SAS1N element does not reside entirely within the VP16-interaction region, and VP16 and HCF-1C subunit associations are different HCF-1N activities.
The HCF-1C subunit contains two independent HCF-1N association elements.
Given the identification of two regions within the HCF-1N subunit that can associate independently with the HCF-1C subunit, HCF-1SAS1N and HCF-1SAS2N, we asked whether these regions might associate with the same region or different regions of the HCF-1C subunit. In addition to the Fn3 repeats, the HCF-1C subunit contains a region with more acidic than basic residues (reference 37 and Fig. 1A). To determine whether either the acidic or Fn3 repeat regions are involved in HCF-1N subunit association, we divided the HCF-1C subunit in two—residues 1436 to 1756 covering the acidic region (HCF-1C1436–1756) and residues 1758 to 2035 covering the HCF-1Fn3 repeats (HCF-1C1758–2035)—and tested each individually for association with the HCF-1N subunit as shown in Fig. 3. In addition to associating with the full-length HCF-1C subunit (lane 1) as expected, the full-length HCF-1N subunit associated with both the acidic HCF-1C1436–1756 (lane 2) and Fn3 repeat-containing HCF-1C1758–2035 (lane 3) fragments, although recovery of the HCF-1C1436–1756 fragment was considerably less efficient, which may reflect inhibition by the amino-terminal region of HCF-1 (see Discussion). In contrast, the alternative-splice variant HCF-1NΔ382–450 subunit associates neither with the full-length HCF-1C subunit (lane 4), as expected, nor with either HCF-1C subfragment (lanes 5 and 6) effectively. Thus, the full-length but not the splice-variant HCF-1 can associate with two independent regions within the carboxy-terminal HCF-1C subunit. We refer to the most carboxy-terminal HCF-1C self-association sequence in the HCF-1C1758–2035 segment as HCF-1SAS1C and the more internal HCF-1C self-association sequence in the HCF-1C1436–1756 segment as HCF-1SAS2C.
Having identified two SAS elements in both the HCFN and HCFC subunits, we asked whether there is any specificity in the interaction between the amino-terminal HCF-1SAS1N and HCF-1SAS2N elements and the carboxy-terminal HCF-1SAS1C and HCF-1SAS2C elements as shown in Fig. 3. Indeed, there is specificity: the HCF-1SAS1N-containing HCF-1N450 fragment associated only with the HCF-1SAS1C-containing HCF-1C1758–2035 fragment (compare lanes 7 to 9), and the HCF-1SAS2N-containing HCF-1N450–1011 fragment associated only with the HCF-1SAS2C-containing HCF-1C1436–1756 fragment (compare lanes 10 to 12). Thus, HCF-1 displays two matched pairs of SAS elements: the most amino- and carboxy-terminal SAS elements—HCF-1SAS1N and HCF-1SAS1C—form one association complex, and the more internal SAS elements—HCF-1SAS2N and HCF-1SAS2C—form a second association complex. We do not know, however, whether both SAS1 and SAS2 mediate HCF-1N and HCF-1C subunit association simultaneously in native HCF-1.
Mapping of the amino-terminal HCF-1SAS1N and HCF-1SAS2N elements.
Using the strategy shown in Fig. 3, in experiments not shown here we refined the location of the HCF-1SAS1N and HCF-1SAS2N elements. The results of these experiments are summarized in Fig. 4. As shown in Fig. 3, HCF-1N1011 associated with both HCF-1SAS1C and to a lesser extent with HCF-1SAS2C, whereas HCF-1N450 associated effectively only with the HCF-1SAS1C element. Consistent with the inability of the minimal VP16 interaction region in HCF-1N380 to associate with the entire HCF-1C subunit (Fig. 2), HCF-1N380 did not exhibit association with either the HCF-1SAS1C or HCF-1SAS2C element. In these experiments, we mapped the HCF-1SAS1N element to within HCF residues 348 to 450 (which we fused to the yeast GAL4 DNA-binding domain owing to our inability to detect this and smaller HCF fragments of this region on their own in our assay). Residues 382 to 450 alone, however, which represent the region removed in the alternative-splice variant of HCF, failed to associate with any of the HCF-1C fragments (Fig. 4). Consistent with this mapping of the HCF-1SAS1N element, a fragment covering residues 348 to 1011 (HCF-1N348–1011) associated with all three of the HCF-1C fragments used in the coimmunoprecipitation assay.
Analysis of a selection of fragments containing portions of a region enriched in basic residues (Fig. 1A)—HCF-1N491–1011, HCF-1N836–1011, HCF-1N491–836, HCF-1N491–755, and HCF-1N491–700—narrows the location of the HCF-1SAS2N element to within HCF residues 491 to 755 (Fig. 4). These results show that a subregion of the basic region contains the HCF-1SAS2N element. Residues 813 to 847 within the basic region have recently been shown to associate with the transcription factor GA-binding protein (GABP) (25). This association is unlikely to be related to the SAS2N function because these two elements of HCF-1 do not overlap each other (residues 491 to 755 [SAS2N] versus 813 to 847 [GABP association]).
The basic region has previously been implicated in cell proliferation, and HCF-1N sequences extending past residue 836 but not beyond residue 902 are required to rescue the temperature-sensitive defect in the hamster tsBN67 cell line (26). The ability of the HCF-1N491–836 fragment to associate effectively with the HCF-1SAS2C element suggests that the HCF-1SAS2N element does not precisely colocalize with the region that promotes cell proliferation in tsBN67 cells, and thus one or more additional functions of the basic region, such as possibly association with GABP (25), are required to rescue the tsBN67 defect.
Fine mapping of the paired HCF-1SAS1N and HCF-1SAS1C elements.
To map one pair of SAS elements in detail, we performed a deletion analysis of the HCF-1SAS1N and HCF-1SAS1C elements as shown in Fig. 5A. The results of the analysis shown in Fig. 4 located the HCF-1SAS1N element to within the 103-amino-acid HCF-1N348–450 fragment. This region contains the carboxy-terminal residues of the HCF-1KEL6 repeat (residues 348 to 360), which are conserved among the HCF-1, HCF-2, and CeHCF proteins (11, 16), and an additional region conserved among these three proteins (residues 361 to 396) as shown in Fig. 5B. To locate the HCF-1SAS1N element within residues 348 to 450, we performed a more extensive deletion analysis of this region fused to the GAL4 DNA-binding domain. As shown in Fig. 5A (lanes 1 to 3), deletion of residues 348 to 360, but not to 382, had no apparent effect on association with the HCF-1SAS1C-containing HCF-1C1758–2035 fragment. Also, deletion of residues 450 to 402, but not to 392, had no apparent effect on association with the HCF-1C1758–2035 fragment (lanes 4 to 6), suggesting that the 43-amino-acid segment from 360 to 402 is sufficient to associate with the HCF-1SAS1C element. Indeed, the 360–402 sequence fused to the GAL4 DNA-binding domain is sufficient to associate with the HCF-1C1758–2035 fragment (lane 7). Thus, the SAS1N element resides within residues 360 to 402. Interestingly, this region, which lies immediately adjacent to the HCF-1KEL6 region, coincides with the 361–401 segment that is conserved among HCF-1, HCF-2, and CeHCF (Fig. 5B).
We performed a similar analysis of the HCF-1SAS1C element (Fig. 5, lanes 8 to 13). Deletion of carboxy-terminal residues 2035 to 2002, but not to 1990, had no apparent effect on association with HCF-1N348–450 (lanes 8 to 10), and deletion of residues 1758 to 1812, but not to 1819, had only a small deleterious effect on association with HCF-1N348–450 (lanes 11 and 12). (Note that the HCF-1SAS1C1812–2035 construct [lane 11] consistently produced a faster-migrating fragment that did not associate with HCF-1N348–450; we do not know the origin of this fragment.) Consistent with these results, an HCF-1C1812–2002 fragment can associate with the HCF-1N348–450 fragment (lane 13). Indeed, the two smallest fragments capable of HCF self-association—the 43-amino-acid HCF-1N360–402 fragment and the 191-amino-acid HCF-1C1812–2002 fragment—associate with one another (lane 14). Interestingly, the HCF-1SAS1C element lies precisely within the predicted HCFFn3-repeat-containing carboxy-terminal region that is shared among the HCF-1, HCF-2, and CeHCF proteins (residues 1812 to 2002 [Fig. 1B]). The close match between the mapping results and the boundaries of the proposed Fn3 repeats strengthens the prediction that the HCF-1 carboxyl terminus is composed of a conserved pair of Fn3 repeats, which together form the HCF-1SAS1C element.
Hughes et al. (10) have reported a mapping analysis of the HCF-1SAS1 element using proteins synthesized in vitro and an assay involving retardation of the mobility of the VP16-induced complex during polyacrylamide gel electrophoresis. Their analysis mapped the HCF-1SAS1N element to within residues 1 to 450 and the HCF-1SAS1C to within residues 1786 to 2035, consistent with the in vivo association assay described here.
HCF-2 contains a conserved SAS1N element.
Unlike the HCF-1 protein, HCF-2 and CeHCF do not contain any HCF-1PRO processing repeats and there is no evidence that HCF-2 and CeHCF are proteolytically processed (11) (Y. Liu and W. Herr, unpublished results). Thus, there is no a priori reason to expect the SAS elements of HCF-1 to be conserved in HCF-2 and CeHCF. We were surprised, therefore, to find that the sequences of both the small SAS1N (residues 360 to 402) and larger SAS1C (residues 1812 to 2002) elements are conserved in the HCF-2 and CeHCF proteins (Fig. 1B and 5B). The sequence conservation led us to ask whether HCF-2 contains a functional SAS1 element. Unfortunately, the carboxy-terminal HCF-2 segment corresponding to HCF-1SAS1C could not be synthesized effectively in transfected cells. We therefore tested the ability of the putative amino-terminal SAS1 element (HCF-2 residues 341 to 394) to associate with the HCF-1SAS1C region as shown in Fig. 6. Consistent with an HCF-2SAS1N function, this HCF-2341–394 segment could associate effectively with HCF-1SAS1C (compare lanes 1 to 3). Thus, HCF-2 and perhaps CeHCF are likely to possess HCF self-association activity.
Association with the carboxy-terminal HCF-1C subunit results in nuclear localization of the HCF-1N subunit.
Immunofluorescence and subcellular fractionation studies have shown that HCF-1 is predominantly a nuclear protein (12–14, 29) and that nuclear localization is determined by a consensus bipartite NLS (5) at its carboxyl terminus (KRPMSSPEMKSAPKKSK; residues 2015 to 2031) (14). Thus, when expressed separately, the HCF-1C600 fragment accumulates in the nucleus (Fig. 7B) like the endogenous protein (Fig. 7F), and the HCF-1N1011 fragment accumulates largely in the cytoplasm (Fig. 7A). Deletion of the NLS sequence results in HCF-1C remaining largely in the cytoplasm (Fig. 7B and C).
The dependence of HCF-1 nuclear localization on a single sequence at its carboxyl terminus suggests that following proteolytic processing of the precursor, HCF-1 self-association maintains the HCF-1N subunit within the nucleus. To test this hypothesis, we asked whether, when synthesized separately in the same cell, the HCF-1C subunit can affect the subcellular localization of the HCF-1N subunit. Indeed, synthesis of the wild-type HCF-1C600 subunit (Fig. 7D), but not the HCF-1C600ΔNLS subunit (Fig. 7E), resulted in nuclear import of the HCF-1N subunit. These results show that the HCF-1C subunit can recruit the HCF-1N subunit to the nucleus. Such an activity of HCF-1 subunits could be important for the nuclear retention of the HCF-1N subunit after nuclear breakdown and resynthesis as a result of mitosis.
DISCUSSION
We have characterized the sequences required for HCF-1N and HCF-1C subunit association. To our surprise, this characterization revealed two matched pairs of HCF-1N and HCF-1C SAS elements: HCF-1SAS1, consisting of HCF-1SAS1N and HCF-1SAS1C partners, and HCF-1SAS2, consisting of HCF-1SAS2N and HCF-1SAS2C partners. The positions of the two sets of HCF-1 SAS elements are shown in Fig. 8A. The HCF-1SAS1 pair of elements has been mapped with considerable precision and represents a small 43-amino-acid amino-terminal HCF-1SAS1N sequence (residues 360 to 402) immediately adjacent to the β-propeller-containing VP16 interaction domain and a larger 191-amino-acid carboxy-terminal HCF-1SAS1C sequence (residues 1812 to 2002), which contains a pair of Fn3-type repeats. The HCF-1SAS2 element has been less rigorously mapped: the HCF-1SAS2N sequence lies within residues 491 to 755 in the so-called, basic region and the HCF-1SAS2C sequence lies within residues 1436 to 1756, which contains the so-called acidic region.
Regulation of HCF-1N and HCF-1C association.
The identification of two separate SAS elements raises a paradox because deletion of just one half of one of these sets, HCF-1SAS1N, inactivates HCF-1 subunit association. These results suggest that amino-terminal HCF-1 sequences can inhibit the activity of the HCF-1SAS2N element. First, the entire amino-terminal region of HCF-1 (residues 1 to 1011) associates less well with the HCF-1SAS2C region than residues 450 to 1011 alone (Fig. 3); second, deletion of residues 380 to 452 in the alternative-splice HCF-1 variant (Fig. 1), which impinges on the HCF-1SAS1N but not the HCF-1SAS2N sequence, prevents detectable HCF-1C association. Thus, inactivation of HCF-1SAS1N in this natural deletion variant also functionally inactivates HCF-1SAS2N. Perhaps, the region encompassing the HCF-1SAS1N sequence inhibits the activity of the HCF-1SAS2N element, particularly when residues 382 to 450 are removed, providing an efficient mechanism for regulation of HCF subunit association. Whatever the case, this result shows that HCF-1SAS2 element is not simply due to nonspecific association of the basic and acidic regions of HCF.
The HCF-1SAS1 association element is conserved in HCF family members.
The SAS elements in HCF-1 were identified because HCF-1 is processed to produce amino-terminal and carboxy-terminal HCF subunits that remain associated (27). We were very much surprised, therefore, to find that the sequence of one of the paired HCF-1 SAS elements, HCF-1SAS1, is conserved in human HCF-2 and C. elegans HCF (Fig. 1B and 5B), proteins that do not display any evidence of proteolytic cleavage. Furthermore, the sequence in HCF-2 corresponding to the HCF-1SAS1N sequence can associate with the HCF-1SAS1C sequence (Fig. 6). Thus, even though there is no evidence that HCF-1 and HCF-2 associate with one another in vivo (K. M. Johnson and A. C. Wilson, unpublished results), both proteins have conserved HCFSAS1N activity. As illustrated in Fig. 8B, we hypothesize that the HCFSAS1 element has been conserved in both proteins and retains self-association activity in HCF-2. Given the sequence conservation in C. elegans HCF, we also suggest that the C. elegans HCF protein possesses an active HCF-1SAS1 element, although this possibility has not been tested directly.
It is curious that HCF-2 (and perhaps C. elegans HCF) has conserved a SAS element when there is no evidence that it is proteolytically processed. It is also surprising that the HCF-2SAS1N element has retained the ability to associate with HCF-1 when there is no reason to believe that these proteins associate with one another in vivo. Perhaps, the function of the HCF-1SAS1 element is not only to associate with itself but also to associate with effector proteins in the cell (Fig. 8B). If, for example, the HCF-1SAS1N and HCF-2SAS1N elements share effector proteins, then the HCFSAS1N element may have been conserved in both HCF-1 and HCF-2 to maintain this interaction, a by-product being the conserved ability of the HCF-2SAS1N element to associate with the HCF-1SAS1C element.
We do not know why HCF-2 may have conserved an SAS element. Its conservation in HCF-2, however, helps explain why HCF-1 unexpectedly has two SAS elements: the first, SAS1, may play a role in both HCF-1 and HCF-2 in addition to self-association. Because, HCF-1, unlike HCF-2, is proteolytically cleaved, the SAS2 element may be essential in HCF-1 to maintain self-association, an activity not required in HCF-2 because it is not proteolytically cleaved.
Fn3 repeats form a protein-protein interaction module.
The sequence of the HCF-1SAS1C element exhibits similarity to a tandem pair of Fn3 repeats. Fn3 repeats from different proteins generally exhibit limited sequence similarity (2). Structurally, however, all Fn3 repeats fold into two β sheets packed against each other like a sandwich (3, 4, 9, 18). Thus, Fn3 repeats represent a conserved structure that can vary considerably in sequence to create many different types of functional surfaces. This flexibility may account for their widespread use (2).
Fn3 repeats mediate protein-protein interactions. In some instances, a single repeat is sufficient (20); in other instances, multiple repeats are required for specific association (21). In HCF-1SAS1C, both repeats are necessary for association with HCF-1SAS1N (Fig. 5). Recently, de Pereda et al. (3) described the structure of tandem Fn3 repeats from the cytoplasmic tail of the β4 subunit of integrin α6β4. This and other Fn3 repeat structures suggest two ways in which tandem Fn3 repeats can associate with other proteins. In one way, an Fn3 repeat can present short peptide sequences on its surface that can bind other proteins as in the case of the RGD tripeptide sequence of the 10th Fn3 repeat of fibronectin (3). In the second way, tandem Fn3 repeats can form an extended groove that is an attractive docking site for another protein surface. If the HCF-1SAS1C Fn3 repeats form such a surface, it could be a docking site for the HCF-1SAS1N element and perhaps an effector protein as illustrated in Fig. 8B.
Implications of the two-subunit composition of HCF-1 for its function.
Consistent with earlier results (14), we have identified a classical bipartite NLS at the extreme carboxyl terminus of HCF-1. We have also shown that the HCF-1N subunit does not contain an NLS and that HCF-1SAS1N subunits can migrate to the nucleus through association with the HCF-1SAS1C subunit. This arrangement resembles polymerase α primase, which is also composed of two stably associated subunits. In this case, a single NLS in the p54 subunit targets the smaller p46 subunit to the nucleus (19). Pulse-chase analysis of the HCF-1300 precursor has shown that it migrates to the nucleus prior to its initial proteolytic cleavage (29). Thus, HCF-1 subunit association is more likely to be required for the nuclear retention of the HCF-1N subunit rather than its initial nuclear import. Because HCF-1 is stable (29), it may persist through multiple rounds of cell division. Perhaps HCF-1 subunit association is required to maintain the nuclear location of the HCF-1N subunit following nuclear breakdown and resynthesis as a result of mitosis.
The HCF-1N and HCF-1C subunits perform different functions within the cell. The HCF-1N subunit associates with the viral and cellular HCF-binding proteins VP16 (15, 26), LZIP (6, 17), and GABP (25) and complements the temperature-sensitive cell proliferation defect in tsBN67 cells. As shown here, the HCF-1C subunit localizes the HCF-1 protein in the nucleus, and it can also stabilize VP16-induced complex formation with full-length VP16 (15). The regulated association of HCF-1 subunits, as evidenced by the alternative pre-mRNA splice variant, provides a unique opportunity to control the association of the different HCF-1N and HCF-1C subunit functions in the cell. It will be interesting to determine whether HCF-1N and HCF-1C subunit association is temporally regulated during phases of the cell cycle or in specific cell types.
ACKNOWLEDGMENTS
We thank David Spector and colleagues for advice on microscopy; Andrew Neuwald for aid in the analysis of the HCF-1 Fn3 repeat similarity; and J. Duffy, M. Ockler, and P. Renna for artwork.
This study was supported by PHS grant GM54598 to W.H. and by funds from the Kaplan Comprehensive Cancer Center and an institutional award from the American Cancer Society (IRG-14-39) to A.C.W.
REFERENCES
- 1.Altschul S F, Gish W, Miller W, Myers E W, Lipman D J. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 2.Campbell I D, Spitzfaden C. Building proteins with fibronectin type III modules. Structure. 1994;2:333–337. doi: 10.1016/s0969-2126(00)00034-4. [DOI] [PubMed] [Google Scholar]
- 3.de Pereda J M, Wiche G, Liddington R C. Crystal structure of a tandem pair of fibronectin type III domains from the cytoplasmic tail of integrin alpha6beta4. EMBO J. 1999;18:4087–4095. doi: 10.1093/emboj/18.15.4087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Dickinson C D, Veerapandian B, Dai X P, Hamlin R C, Xuong N H, Ruoslahti E, Ely K R. Crystal structure of the tenth type III cell adhesion module of human fibronectin. J Mol Biol. 1994;236:1079–1092. doi: 10.1016/0022-2836(94)90013-2. [DOI] [PubMed] [Google Scholar]
- 5.Dingwall C, Laskey R A. Nuclear targeting sequences—a consensus? Trends Biochem Sci. 1991;16:478–481. doi: 10.1016/0968-0004(91)90184-w. [DOI] [PubMed] [Google Scholar]
- 6.Freiman R N, Herr W. Viral mimicry: common mode of association with HCF by VP16 and the cellular protein LZIP. Genes Dev. 1997;11:3122–3127. doi: 10.1101/gad.11.23.3122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Goto H, Motomura S, Wilson A C, Freiman R N, Nakabeppu Y, Fukushima K, Fujishima M, Herr W, Nishimoto T. A single-point mutation in HCF causes temperature-sensitive cell-cycle arrest and disrupts VP16 function. Genes Dev. 1997;11:726–737. doi: 10.1101/gad.11.6.726. [DOI] [PubMed] [Google Scholar]
- 8.Herr W. The herpes simplex virus VP16-induced complex: mechanisms of combinatorial transcriptional regulation. Cold Spring Harbor Symp Quant Biol. 1998;63:599–607. doi: 10.1101/sqb.1998.63.599. [DOI] [PubMed] [Google Scholar]
- 9.Huber A H, Wang Y M, Bieber A J, Bjorkman P J. Crystal structure of tandem type III fibronectin domains from Drosophila neuroglian at 2.0 Å. Neuron. 1994;12:717–731. doi: 10.1016/0896-6273(94)90326-3. [DOI] [PubMed] [Google Scholar]
- 10.Hughes T A, LaBoissière S, O'Hare P. Analysis of functional domains of the host cell factor involved in VP16 complex formation. J Biol Chem. 1999;274:16437–16443. doi: 10.1074/jbc.274.23.16437. [DOI] [PubMed] [Google Scholar]
- 11.Johnson K M, Mahajan S S, Wilson A C. Herpes simplex virus transactivator VP16 discriminates between HCF-1 and a novel family member, HCF-2. J Virol. 1999;73:3930–3940. doi: 10.1128/jvi.73.5.3930-3940.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kristie T M, Pomerantz J L, Twomey T C, Parent S A, Sharp, A. P. The cellular C1 factor of the herpes simplex virus enhancer complex is a family of polypeptides. J Biol Chem. 1995;270:4387–4394. doi: 10.1074/jbc.270.9.4387. [DOI] [PubMed] [Google Scholar]
- 13.Kristie T M, Vogel J L, Sears A E. Nuclear localization of the C1 factor (host cell factor) in sensory neurons correlates with reactivation of herpes simplex virus from latency. Proc Natl Acad Sci USA. 1999;96:1229–1233. doi: 10.1073/pnas.96.4.1229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.LaBoissière S, Hughes T, O'Hare P. HCF-dependent nuclear import of VP16. EMBO J. 1999;18:480–489. doi: 10.1093/emboj/18.2.480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.LaBoissière S, Walker S, O'Hare P. Concerted activity of host cell factor subregions in promoting stable VP16 complex assembly and preventing interference by the acidic activation domain. Mol Cell Biol. 1997;17:7108–7118. doi: 10.1128/mcb.17.12.7108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Liu Y, Hengartner M O, Herr W. Selected elements of herpes simplex virus accessory factor HCF are highly conserved in Caenorhabditis elegans. Mol Cell Biol. 1999;19:909–915. doi: 10.1128/mcb.19.1.909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lu R, Yang P, O'Hare P, Misra V. Luman, a new member of the CREB/ATF family, binds to herpes simplex virus VP16-associated host cell factor. Mol Cell Biol. 1997;17:5117–5126. doi: 10.1128/mcb.17.9.5117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Main A L, Harvey T S, Baron M, Boyd J, Campbell I D. The three-dimensional structure of the tenth type III module of fibronectin: an insight into RGD-mediated interactions. Cell. 1992;71:671–678. doi: 10.1016/0092-8674(92)90600-h. [DOI] [PubMed] [Google Scholar]
- 19.Mizuno T, Okamoto T, Yokoi M, Izumi M, Kobayashi A, Hachiya T, Tamai K, Inoue T, Hanaoka F. Identification of the nuclear localization signal of mouse DNA primase: nuclear transport of p46 subunit is facilitated by interaction with p54 subunit. J Cell Sci. 1996;109:2627–2636. doi: 10.1242/jcs.109.11.2627. [DOI] [PubMed] [Google Scholar]
- 20.Moyano J V, Carnemolla B, Dominguez-Jimenez C, Garcia-Gila M, Albar J P, Sanchez-Aparicio P, Leprini A, Querze G, Zardi L, Garcia-Pardo A. Fibronectin type III5 repeat contains a novel cell adhesion sequence, KLDAPT, which binds activated alpha4beta1 and alpha4beta7 integrins. J Biol Chem. 1997;272:24832–24836. doi: 10.1074/jbc.272.40.24832. [DOI] [PubMed] [Google Scholar]
- 21.Nagai T, Yamakawa N, Aota S, Yamada S S, Akiyama S K, Olden K, Yamada K M. Monoclonal antibody characterization of two distant sites required for function of the central cell-binding domain of fibronectin in cell adhesion, cell migration, and matrix assembly. J Cell Biol. 1991;114:1295–1305. doi: 10.1083/jcb.114.6.1295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.O'Hare P. The virion transactivator of herpes simplex virus. Semin Virol. 1993;4:145–155. [Google Scholar]
- 23.Schultz J, Milpetz F, Bork P, Ponting C P. SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci USA. 1998;95:5857–5864. doi: 10.1073/pnas.95.11.5857. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sturm R A, Das G, Herr W. The ubiquitous octamer-binding protein Oct-1 contains a POU domain with a homeo box subdomain. Genes Dev. 1988;2:1582–1599. doi: 10.1101/gad.2.12a.1582. [DOI] [PubMed] [Google Scholar]
- 25.Vogel J L, Kristie T M. The novel coactivator C1 (HCF) coordinates multiprotein enhancer formation and mediates transcription activation by GABP. EMBO J. 2000;19:683–690. doi: 10.1093/emboj/19.4.683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wilson A C, Freiman R N, Goto H, Nishimoto T, Herr W. VP16 targets an amino-terminal domain of HCF involved in cell-cycle progression. Mol Cell Biol. 1997;17:6139–6146. doi: 10.1128/mcb.17.10.6139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wilson A C, LaMarco K, Peterson M G, Herr W. The VP16 accessory protein HCF is a family of polypeptides processed from a large precursor protein. Cell. 1993;74:115–125. doi: 10.1016/0092-8674(93)90299-6. [DOI] [PubMed] [Google Scholar]
- 28.Wilson A C, Parrish J E, Massa H F, Nelson D L, Trask B J, Herr W. The gene encoding the VP16-accessory protein HCF (HCFC1) resides in human Xq28 and is highly expressed in fetal tissues and the adult kidney. Genomics. 1995;25:462–468. doi: 10.1016/0888-7543(95)80046-o. [DOI] [PubMed] [Google Scholar]
- 29.Wilson A C, Peterson M G, Herr W. The HCF repeat is an unusual proteolytic cleavage signal. Genes Dev. 1995;9:2445–2458. doi: 10.1101/gad.9.20.2445. [DOI] [PubMed] [Google Scholar]