Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2017 Dec 26;115(2):313–318. doi: 10.1073/pnas.1706846115

High-resolution structure of podovirus tail adaptor suggests repositioning of an octad motif that mediates the sequential tail assembly

Lingfei Liang a, Haiyan Zhao a, Bowen An a, Liang Tang a,1
PMCID: PMC5777027  PMID: 29279385

Significance

Many dsDNA bacteriophages possess a tail that encodes functions essential for viral life cycles. Assembly of such a molecular machine has been shown to occur in a sequential manner. Here, we report the high-resolution crystal structure of the tail adaptor protein gp7 from Shigella phage Sf6. Comparative structural studies reveal that the N-terminal portion undergoes structural rearrangement by repositioning two consecutive repeats of a conserved octad sequence motif, turning the molecule from the preassembly state to the postassembly state, which creates the binding site for the next tail component to attach to. These results provide a structural basis for a mechanism of repositioning of sequence motifs by which the adaptor protein mediates the sequential assembly of the phage tail.

Keywords: virus assembly, sequential assembly, octad motif repositioning, bacteriophage, tail

Abstract

The sophisticated tail structures of DNA bacteriophages play essential roles in life cycles. Podoviruses P22 and Sf6 have short tails consisting of multiple proteins, among which is a tail adaptor protein that connects the portal protein to the other tail proteins. Assembly of the tail has been shown to occur in a sequential manner to ensure proper molecular interactions, but the underlying mechanism remains to be understood. Here, we report the high-resolution structure of the tail adaptor protein gp7 from phage Sf6. The structure exhibits distinct distribution of opposite charges on two sides of the molecule. A gp7 dodecameric ring model shows an entirely negatively charged surface, suggesting that the assembly of the dodecamer occurs through head-to-tail interactions of the bipolar monomers. The N-terminal helix-loop structure undergoes rearrangement compared with that of the P22 homolog complexed with the portal, which is achieved by repositioning of two consecutive repeats of a conserved octad sequence motif. We propose that the conformation of the N-terminal helix-loop observed in the Sf6-gp7 and P22 portal:gp4 complex represents the pre- and postassembly state, respectively. Such motif repositioning may serve as a conformational switch that creates the docking site for the tail nozzle only after the assembly of adaptor protein to the portal. In addition, the C-terminal portion of gp7 shows conformational flexibility, indicating an induced fit on binding to the portal. These results provide insight into the mechanistic role of the adaptor protein in mediating the sequential assembly of the phage tail.


Tailed dsDNA bacteriophages constitute the order of Caudovirales with three families, Myoviridae, Siphoviridae, and Podoviridae, defined by tail morphology. The tail attached to the capsid is a unique, sophisticated macromolecular machine responsible for a variety of roles essential in viral life cycle. Phage Sf6 is a podovirus infecting Shigella flexneri, and is a close relative to the prototype phage P22 (1). Like other P22-like phages, Sf6 possesses a short, noncontractile tail emanating from a unique pentameric vertex of the icosahedral capsid (2, 3). The tail has a mass of ∼1.8 MDa. It is composed of 39 copies of four gene products, including the dodecameric tail adaptor gp7, the hexameric tail nozzle gp8, the trimeric tail plug gp9, and six copies of the trimeric tailspike gp14 (1, 3). It is assumed that during infection this short tail is extended by three DNA-injection proteins to drill through the three-layer envelope of the host cell to inject phage DNA into the host cytoplasm, a mechanism that may be common in podoviruses (410).

The podovirus morphogenesis begins with the assembly of a procapsid, containing coat proteins, scaffolding proteins, and portal. Subsequently, the ∼40-kb viral genome is translocated into the procapsid through the central channel of the portal by DNA-packaging motor, resulting in densely packed DNA (11). The high pressure built by DNA packing in the capsid lattice triggers conformational change in portal protein, which subsequently releases the motor, terminating the DNA packaging (1214). A set of tail proteins attach to the portal vertex to prevent the leakage of DNA. The tail adaptor, gp7 in the case of phage Sf6, is the first tail protein to polymerize on the portal, followed by the sequential incorporation of the tail nozzle, tail plug, and tailspikes (15). Eventually, the portal, tail adaptor, and tail nozzle make up a conduit that is to be utilized for phage DNA injection into host cells during infection. The tail plug protein blocks the end of this conduit, preventing DNA from leaking out. This plug probably also serves to penetrate part of the host envelope and triggers the translocation of DNA-injection proteins as well as viral DNA (9). The last tail protein in the assembly is tailspike gp14, which has two domains. The tailspike head-binding domain (HBD) binds to the interface of the tail adaptor and the tail nozzle, and the tailspike receptor-binding domain (RBD) recognizes cell receptor and cleaves host O-antigen, pulling virions close to the host cell upon infection. Sequential assembly of tail components to the phage capsid has been recognized in several podoviruses such as T7 (16) and P22 (15, 1719). This assembly mechanism differs from that in the other two families, Myoviridae and Siphoviridae, of which the phage head, tail, and tail fibers are assembled independently and subsequently join to form the complete virion (20). The sequential mechanism ensures orderly assembly and avoids aberrant products from premature interaction between components, which may contribute to highly efficient, rapid phage assembly.

The podovirus sequential tail assembly has been long studied (15). However, the underlying mechanisms remain to be understood. The X-ray structure of phage P22 portal:adaptor complex showed the dodecameric ring-like architecture of the tail adaptor (P22-gp4) that binds to the portal dodecamer with elongated C-terminal portions (21). This and the subnanometer-resolution cryo-EM structures of the isolated P22 tail (22) and the P22 virion (23, 24) shed light on the assembly of the multiple components in the context of the tail and the virion. Here we report the high-resolution X-ray structure of phage Sf6 tail adaptor gp7 in its preassembly state. Comparison with the P22 tail adaptor structure reveals a conformational switch at the N-terminal segment upon tail assembly that ensures the subsequent attachment of the tail nozzle. Such conformational switch is enabled by repositioning of two consecutive repeats of a conserved sequence motif that are capable of binding to the same protein surface.

Results and Discussion

Overall Structure.

The Sf6-gp7 has 160 amino acid residues. We designed a truncated gp7 construct lacking 36 C-terminal residues which were predicted as disordered. A C-terminal His-tag was added to the construct to facilitate the protein purification. The structure was determined at 1.78 Å resolution (Table 1), with two molecules in the asymmetric unit related by a noncrystallographic twofold symmetry (Fig. 1A). One magnesium ion and four calcium ions were found in the structure. The structure is composed of four α-helices, namely α1–α4, connected by three loops. Despite low sequence homology, this protein fold is common in tail adaptor proteins of podoviruses, siphoviruses, and probably myoviruses as well. Structural superimposition shows a Cα rmsd of 1.289 Å between 53 atom pairs with podovirus P22 (gp4, PDB ID 3LJ4) (21), 0.884 Å between 18 atom pairs with siphovirus HK97 (gp6, PDB ID 3JVO) (25), 1.252 Å between 21 atom pairs with siphovirus SPP1 (gp15, PDB ID 2KBZ) (26), and 0.899 Å between 21 atom pairs with a putative neck protein of a Myoviridae prophage found in the Bacillus subtilis genome (YqbG, PDB ID 1ZTS) (27) (Fig. S1B). The highly conserved portion contains the two long helices α2 and α3 arranged in the antiparallel manner. The loop connecting helices α2 and α3, namely loop α2α3, spans >26 residues (residues 51–76) and is well defined in the electron density map. Multiple H-bonds with the water molecules which are coordinated to the calcium ion between the two gp7 molecules help stabilize the loop α2α3 (Fig. S3A). According to the dodecameric assembly of the gp7 ortholog P22-gp4 (21), the dimeric arrangement of gp7 is not likely to be biologically relevant, meaning that in the phage tail the loop α2α3 is not stabilized in the same way as in the crystal. As discussed later in this article, we fit the X-ray structure of gp7 to the cryo-EM map of the P22 tail (22). By fitting in the HBD of the neighboring tailspike at the meantime, we found that four loops of the HBD were located within 7 Å to the loop α2α3 of gp7 (Fig. S3B), suggesting the possible roles of these loops in stabilizing the loop α2α3. The C-terminal eight residues are highly extended and well defined in the electron density map for molecule A but are disordered in molecule B (Fig. 1B).

Table 1.

X-ray data collection and structure refinement statistics

Data collection Native Hg
Beamline APS 23ID-D APS 23ID-B
Wavelength, Å 1.03324 1.00699
Resolution, Å 29.6–1.78(1.81–1.78)* 50–2.66 (2.71–2.66)*
No. of measurements 235,103 154,347
No. of unique reflections 38,487 (1,936)* 11,922
Completeness, % 98.9 (89.2)* 99.8
I/σ 17.5 (1.7)* 53.7 (9.96)*
Rmerge, % 5.7 (67.9)* 9.6 (48)*
Space group P41212 P41212
Unit cell, Å a = 101.30, c = 76.46 a = 100.93, c = 77.73
Structure refinement
 Resolution, Å 29.55–1.78
 Rwork/Rfree 0.18/0.20
 No. of atoms
  Protein/water 1,838/230
 B-factors
  Protein/water 40.94/48.23
 rmsd
  Bond lengths, Å 0.007
  Bond angles, ° 0.938
 Ramachandran plot, %
  Most favored 98.28
  Allowed 1.72
  Disallowed 0.00

APS, Advanced Photon Source.

*

Values in parentheses are for the outermost resolution shells.

Rmerge = ΣhklΣi|Ii(hkl) − <I(hkl)>|/ΣhklΣiIi(hkl), where Ii(hkl) is the observed intensity of reflection hkl and <I(hkl)> is the averaged intensity of symmetry-equivalent measurements.

Rwork = Σhkl||Fobs| − |Fcalc||/Σhkl|Fobs|, where Fobs and Fcalc are structure factors of the observed reflections and those calculated from the refined model, respectively. Rfree has the same formula as Rwork, except that it was calculated against a test set of the data that were not included in the refinement.

Fig. 1.

Fig. 1.

The Sf6-gp7 is a globular protein composed of four α-helices. (A) The ribbon diagram of the gp7 molecules A and B (colored in pink and orange) in the asymmetric unit. One magnesium (small green sphere) and four calcium (large green sphere) ions are found. (B) Superimposition of the gp7 molecules A and B shows the only significant difference at the C-terminal fragment (T117–R124). C-ter, C-terminal; N-ter, N-terminal.

A Distinct Conformation of the N-Terminal Portion.

Sf6-gp7 and P22-gp4 share 37% identity and 56% similarity in amino acid sequences (Fig. 2A). We extracted a gp4 monomer from the P22 portal-gp4 complex (21) and compared its structure with Sf6-gp7. The protein folds were highly conserved, while the N-terminal and C-terminal portions showed significant differences (Fig. 2B). In the N-terminal segment, Sf6-gp7 has a long α1-helix and a nearly vertically oriented α1α2 loop, while P22-gp4 has the α1-helix half in length than that of Sf6-gp7 and an essentially horizontal α1α2 loop (Fig. 2B). The X-ray structure of Sf6-gp7 fits well into the isolated cryo-EM map of the P22 tail (Fig. S2). However, the α1-helix and the α1α2 loop of Sf6-gp7 penetrate into the portion of the cryo-EM map corresponding to the tail nozzle (P22-gp10 or Sf6-gp8), which locates right underneath the adaptor in the tail complex, creating a steric clash (Fig. 2C). This indicates that the Sf6-gp7 α1-helix and the α1α2 loop must undergo conformational changes upon assembly from the monomers free in solution into the tail.

Fig. 2.

Fig. 2.

The octad motif shifting in the α1-helix of tail adaptor. (A) The sequence alignment of Sf6-gp7 and P22-gp4 with 3D structure information of Sf6-gp7. The repeated octad motif in α1-helix is indicated with the black rectangles. The two images below are the projections of α1-helix in Sf6-gp7 (Left) and P22-gp4 (Right). The four conserved residues of the first gp7 octad motif and the second gp4 octad motif are colored in gold. (B) The superimposition of Sf6-gp7 (pink) and P22-gp4 (cyan) with N- and C-terminal segments enlarged. The sequences of α1-helix in Sf6-gp7 and P22-gp4 that are structurally superimposed are indicated by blue lines. (C) Sf6-gp7 (Left) and P22-gp4 (Right) are docked in the cryo-EM map of the P22 tail. The Sf6-gp7, but not the P22-gp4, clashes with the EM volume of tail nozzle (map colored in yellow). The contour level of the map is 4.46. Cter, C-terminal; Nter, N-terminal.

Interestingly, structural superposition shows that the first half (residue nos. 6–18) of the Sf6-gp7 α1-helix overlays the P22-gp4 α1-helix (residue nos. 12–23) structurally (Fig. 2B), while sequence alignment shows that the second half (residue nos. 14–24) of the Sf6-gp7 α1-helix aligns well with the P22-gp4 α1-helix (residue nos. 12–23) at the amino acid sequence level (Fig. 2A), indicating a correlation between the two structures. Notably, the amino acid sequences of tail adaptor loop α1α2 are highly conserved between phage Sf6 and P22. Moreover, the tail nozzle proteins in the two phages share as high as 93% sequence identity (3). Thus, it is reasonable to infer that the tail adaptor loop α1α2 may physically interact with the essentially identical tail nozzle in phages Sf6 and P22, and the modes of interactions between the tail adaptor loop α1α2 and the tail nozzle are largely identical, which indicates consistent conformations for the loop α1α2 of Sf6-gp7 and P22-gp4 in the context of the tail. Given that Sf6-gp7 is crystallized in solution while P22-gp4 is in complex with the portal protein, these results together suggest that the distinct conformation of the N-terminal portion of Sf6-gp7 represents the preassembly state, whereas that observed in the P22 portal:gp4 complex structure represents the postassembly state. While free in the infected cell cytoplasm after protein synthesis, the tail adaptor protein may adopt the conformation as observed in the present Sf6-gp7 X-ray structure—that is, its loop α1α2 and α1-helix adopt the vertical orientation. Upon assembly onto the portal ring, the tail adaptor undergoes conformational changes in its loop α1α2 and α1-helix, which move up so that the second half of the α1-helix takes the place of the first half and the loop α1α2 adopts a horizontal orientation. Such conformational changes would eliminate the steric clash with the tail nozzle and create a specific binding site for the tail nozzle.

When transitioning from the preassembly state to the postassembly state, the α1-helix repositions so that its second half takes the place of the first half in the preassembly state. Analysis of amino acid sequences reveals a sequence motif that supports such replacement. In Sf6-gp7, the amino acid region 7–24 (α1-helix) contains two repeats of an octad motif KXXΦΦXXA (Fig. 2A), where K (position 1) is lysine, X is any residue, Φ (positions 4 and 5) represents a hydrophobic residue, and A (position 8) is alanine. This motif is conserved in P22-gp4 (Fig. 2A) and many P22-like phages as well (Fig. S1A). In this motif, Lys, the two hydrophobic residues, and Ala are located on the same side of the α1-helix and thus define a common surface (Fig. 2A). In the X-ray structures of Sf6-gp7 and the P22 portal:gp4 complex, this surface is the one that forms the interface between α1-helix with the rest of the molecule (mainly α2- and α3-helices) (Fig. 2A). Thus, repositioning of the α1-helix during transition from the preassembly state to the postassembly state is enabled by swapping of the two repeats of such octad sequence motif, which utilizes the essentially identical interface and would maintain essentially the same interactions between this helix to the binding interface on the molecule.

The C-Terminal Segment Displays Conformational Flexibility.

Superposition of molecules A and B gives an rmsd of 0.468 Å. The significant structural difference is at the C-terminal segment. The eight C-terminal residues in molecule B (residues T117–R124) show no electron density in the X-ray map and thus are disordered, while this region in molecule A forms a well-defined extended loop that is immobilized by interactions with adjacent molecules in the crystal (Fig. 1B). These data indicate conformational flexibility of the Sf6-gp7 C-terminal segment. In addition, the conformation of the Sf6-gp7 C-terminal portion differs from that of P22-gp4, as seen in the P22 portal:gp4 complex structure. A part of this portion in P22-gp4 (residues K116–S121) is the C-terminal part of the α4-helix, while the remaining residues from R122 to the very C terminus form an elongated loop that extensively interacts with the portal (Fig. 2B). It is likely that the Sf6-gp7 C-terminal portion is flexible while free in solution before attachment to the portal. Upon binding to the portal, the C-terminal portion may adopt an extended conformation and make extensive contact with the portal akin to what was observed in the P22. The C-terminal portions of Sf6-gp7 and P22-gp4 both contain several glycine and proline residues that are conserved between the two phages and are well aligned (Fig. 2A), which may facilitate such conformational changes upon postassembly with the portal. In fact, folding of unstructured segments upon binding to viral partners appears as a general strategy for the control of sequential tail assembly (26, 28, 29).

The Bipolar Distribution of Electric Charges on the Surface of the Sf6-gp7 Monomer.

Sf6-gp7 contains 19 negatively and 10 positively charged residues. While negatively charged residues are spread over the molecule essentially uniformly, 7 out of 10 positively charged residues are concentrated on one face of the molecule. Calculation of the electrostatic potential using USCF Chimera (30) reveals a distinct bipolar feature—that is, a convex, positively charged face on one side and a negatively charged face covering the rest of the molecule (Fig. 3A). During assembly of the ring-like gp7 dodecamer (see below) in the tail, the positively charged ridge of each monomer packs into the negatively charged groove of the adjacent subunit (Fig. 3C). This indicates that assembly of gp7 monomers into the ring-like dodecamer in the tail is mediated by electrostatic interactions. Each monomer mimics a molecular “magnet” that joins the next one in a head-to-tail manner via interactions between its positively charged face and the negatively charged face of the next monomer. The similar bipolar feature is observed on gp7 homologs P22-gp4 and HK97-gp6 (Figs. S4 and S5) (26, 31), suggesting that such charge:charge interactions between bipolar molecules may be a common mechanism for assembly of the adaptor protein monomers into ring-like oligomers in those phage tails.

Fig. 3.

Fig. 3.

The Sf6-gp7 monomer shows bipolar distribution of electric charges on its surface, whereas the dodecameric gp7 ring has the remarkably negative-charged surface. (A) Three neighboring gp7 monomers from the pseudo-atomic dodecameric model (shown in B) are shown as molecular surface colored according to electrostatic potential (middle) or ribbon (sides). The molecule is positively charged (blue) on one side, and negatively charged (red) on the other side. The positively charged ridge packs into the adjacent, negatively charged groove during assembly. (B) Ribbon diagram of the pseudo-atomic model of gp7 dodecameric ring with chains in different colors. (C) Surface representation of the gp7 dodecamric ring colored according to the electrostatic potential calculated with UCSF Chimera. (D) A cut-away view shows the negatively charged inner surface of gp7 dodecameric ring. For the color scheme of molecular surfaces, the blue area corresponds to an electrostatic potential of +10 kcal/(mol*e), and the red area corresponds to an electrostatic potential of −10 kcal/(mol*e).

A Model for the Sf6-gp7 Dodecameric Ring.

The P22 portal:gp4 complex X-ray structure (21) and the subnanometer-resolution cryo-EM map of P22 isolated tail (22) reveal how the P22-gp4 assembles with other components of the tail. Given the apparent sequence identity and common folds between tail component proteins, the overall assembly of the tail may be conserved between the two phages, which is also supported by the similarities of the tail structures as shown in the cryo-EM asymmetric reconstructions of P22 (23) and Sf6 (3). The X-ray structure of the Sf6-gp7 fits well into the 9.4 Å-resolution cryo-EM map of the P22 tail, except for the N-terminal and C-terminal portions where conformational changes occur as discussed above (Fig. S2A). The biological assembly of P22-gp4 is a ring-like dodecamer. To model the Sf6-gp7 dodecameric ring, we first docked one Sf6-gp7 monomer and the P22-gp4 dodecamer on the cryo-EM map of the P22 tail. By applying the symmetry matrix of the P22-gp4 dodecamer to Sf6-gp7, we could generate a pseudo-atomic model for the Sf6-gp7 dodecameric ring without significant clashes between monomers (Fig. 3B and Fig. S2B). This assembly has a dimension of 125 Å by 58 Å, with an inner diameter of 43 Å.

The Sf6-gp7 monomer:monomer interface involves a complex network of salt bridges between the positively charged face of one monomer and the complementary negatively charged face of its adjacent monomer, as described above, suggesting that electrostatic interactions play a dominant role in stabilizing the Sf6-gp7 dodecamer. The Sf6-gp7 dodecameric model shows an overwhelmingly negatively charged surface, while the positively charged surfaces of monomers are buried. The ring assembly also shows a highly negatively charged internal surface of the channel. Such highly negatively charged internal surface of the channel has been described for several phage portals and tail adaptor proteins as a common characteristic and was thought to help to avoid nonspecific DNA binding during DNA passage through these molecular channels (21, 25, 32). Hence, such a feature in Sf6-gp7 may facilitate translocation of phage dsDNA from inside of the phage into host cell cytoplasm.

The peripheral surface of the Sf6-gp7 dodecamer must provide binding sites for the tailspike and the tail nozzle. Thus, electrostatic interactions may play a major role in mediating assembly of the tail adaptor with the tailspike and tail nozzle. This may, at least partially, contribute to the ultrastability of the tail, which is able to withstand heating to 60 °C in 2 M urea and in the presence of detergent in case of P22 (17). Additionally, the highly negatively charged surface of the tail adaptor ring assembly may be suitable for the need of switching from the terminase-bound state of the capsid at the end of DNA-packaging process to the state for terminase dissociation and tail assembly. Interestingly, the highly negatively charged surface of the tail adaptor ring is also observed in P22-gp4, HK97-gp6, and Spp1-gp15 (Figs. S4S6). SPP1-gp15 shows some variance in that it contains a positively charged rim at the bottom (Fig. S6C). Thus, such a feature may be a common characteristic among those phages.

Implications for the gp7-Mediated Sequential Assembly of the Tail.

The X-ray structure of Sf6 tail adaptor gp7 and the pseudo-atomic model of its dodecameric assembly provide insight into the structural basis of the sequential assembly of the tails in P22-like phages. We propose that the Sf6-gp7 reported here and the P22-gp4 structure in the context of the portal:gp4 complex represent the pre- and postassembly states of the tail adaptor, respectively. Our proposal is supported by the presence of the repeated octad motif KXXΦΦXXA in the α1-helix of Sf6-gp7 and P22-gp4. Comparison of the two X-ray structures shows that the first motif of the Sf6-gp7 α1-helix is superimposed on the second motif of the P22-gp4 α1-helix (Fig. 2B). Due to the distributions of the four conserved residues in the motif, such a positional replacement between the first and second motifs in α1-helix still retains interactions of this α1-helix with the rest of the molecule. That is to say that upon assembly of the tail adaptor to the portal, the second motif replaces the first motif, generating a different conformation for the highly conserved loop α1α2. Such a conformational change in the loop α1α2 creates the binding site for the next tail component (i.e., the tail nozzle) and eliminates the steric clash between the tail adaptor and the tail nozzle as observed in fitting the Sf6-gp7 structure in the P22 tail cryo-EM map (Fig. 2C).

Based on these results, we propose a model for the sequential assembly of the phage tail in Sf6 (Fig. 4). In the infected cell cytoplasm, the newly synthesized tail adaptor protein molecules exist as monomers (33) that adopt the preassembly conformation as observed in the Sf6-gp7 X-ray structure, with the first octad motif occupying the interface with the protein core and a largely vertically oriented loop α1α2. The C-terminal portion may be flexible and unstructured. Upon completion of the DNA-packaging process, the terminase dissociates from the capsid and a tail adaptor subunit then binds to the portal dodecamer via its elongated C-terminal portion, which undergoes an induced fit from the flexible conformation to an extended and immobilized conformation that makes extensive contact with the portal. The binding probably triggers the upward repositioning of the second octad sequence motif in the α1-helix by ∼10 Å, which takes the place of the first motif, and this likely favors the binding of the next tail adaptor subunit and is supported by that facts that P22-gp4 exists as monomer in solution (33) and binds to the portal ring in a highly cooperative manner (19). Accompanying the repositioning of the motif, the loop α1α2 moves away from protein core to a horizontal conformation, generating the binding site for the tail nozzle. This structural rearrangement allows assembly of Sf6-gp7 to the portal in the post–DNA-packaging capsid, followed by attachment of the tail nozzle. The tail adaptor protein free in solution, which is in the preassembly conformation, would not be able to bind to the tail nozzle until it binds to the portal and undergoes the structural rearrangement at the α1-helix and the loop α1α2. After that, tailspike and tail needle will bind sequentially.

Fig. 4.

Fig. 4.

The gp7-mediated phage Sf6 sequential tail assembly model. Once the viral DNA (green curves) is pumped into the procapsid to form the mature capsid (blue hexagons), the gp7 monomer (pink bullets) in solution that adopts the preassembly conformation begins to bind to the portal (red ovals). The binding to the portal and/or to adjacent gp7 monomers induces the N-terminal segment (pink ribbons) to switch to the postassembly conformation, allowing the attachment of the tail nozzle gp8 (light blue ovals). The tail plug gp9 (brown lines) and the tailspikes (gold) then attach to complete the tail assembly.

Our structural analysis suggests that the sequential tail assembly of P22-like podovirus is enabled by a conserved conformational switch in the tail adaptor via replacement of two consecutive repeats of a sequence motif, which creates the binding site for the tail nozzle only when it is time. Additional studies, such as a mutagenesis approach, can further demonstrate the proposed mechanism, and a high-resolution structure of the entire phage or the isolated tail assembly of Sf6 would unveil the conformation of the postassembly state and ultimately confirm our proposed mechanism. Although Siphoviridae and Myoviridae tail and head assemblies occur independently, assembly of tails in those phages may also follow a sequential order such as in Podoviridae (34). For example, the baseplate wedge of the myovirus T4 is assembled sequentially, which is mediated by the induced conformational change caused by addition of new components (35). SPP1-gp15 has been shown to undergo conformational change during assembly to allow appropriate interaction between its α0-helix and loop α2α3 with the next tail protein gp16 (26, 31). It shares certain similarity with Sf6-gp7, in which the structural change allows the loop α1α2 to interact with gp8. The unique feature in Sf6-gp7 is that the conformational change is achieved by repositioning of the two octad motifs. Such localized structural change may be more efficient and genetically stable compared with the large-scale conformational change as in SPP1-gp15 because only several residues (the conserved residues of the octad motif) are essential. The tandem octad motif is highly conserved in P22-like phages, suggesting a conserved mechanism adopted by this group of phages. It will be interesting to see if the sequence motif-repositioning mechanism is a recent evolutionary result adopted only among P22-like phages or if it also exists in other Caudoviruses.

Materials and Methods

Production of Sf6-gp7.

The DNA fragment encoding Sf6-gp7 (residues 1–124) was cloned into pET28b (Novagen) between Nco1/Xho1. The region after residue 124 was predicted as disordered by Phyre2 therefore removed. An His-tag followed the protein at the C-terminus. Protein overexpression was achieved by growth of Escherichia coli strain B834(DE3) at 37 °C in LB until OD600 nm = 0.6, followed by induction at 30 °C by adding isopropyl β-d-1-thiogalactopyranoside to a final concentration of 1 mM. Cells were harvested after 3 h of growth and lysed on a French press in the resuspension buffer (20 mM Tris⋅HCl, pH 8.5; 500 mM NaCl; 10 mM β-mercaptoethanol). The protein was purified with a Ni-NTA column (Qiagen) followed by gel filtration chromatography on a Hiload 16/60 Superdex 75 column (GE Healthcare) in the gel filtration buffer (20 mM Tris⋅HCl, pH 8.5; 150 mM NaCl; 1 mM DTT; 1 mM EDTA), which showed an elution volume of 63.82 mL, corresponding to a dimer. The eluted fractions were collected and concentrated to 13 mg/mL using a Millipore centricon (molecular weight cutoff 10 kDa) before crystallization.

Crystallization, X-Ray Data Collection, and Structure Determination.

The purified Sf6-gp7 was crystallized with the hanging drop-vapor diffusion method by mixing 1 μL of the protein solution with 1 μL of the well solution containing 15% (wt/vol) PEG8,000, 0.1 M Mes pH 6.0, and 0.28 M Ca(OAc)2. Crystals were dipped into well solution plus 15% and 30% ethylene glycol in succession before flash freezing in liquid nitrogen. The heavy atom derivative was obtained by soaking the native crystals in well solution with 5 mM HgCl2 for 24 h before flash freezing.

Sf6-gp7 native and heavy atom-derivative X-ray data were collected at the Advanced Photon Source. All data were indexed and integrated using XDS (36, 37), scaled using Aimless (38). The structure was determined by the single anomalous dispersion method using the mercury derivative data (Table 1). The experimental electron density map calculated with Autosol (39) in PHENIX (40) was of excellent quality, which allowed automated building of most of the model using PHENIX. Small loops of regions of residues 26–29 in chain A and 24–29 and 96–101 in chain B were manually built using COOT (41). Structure refinement coupled with manual model building was performed with the programs PHENIX and COOT, respectively.

Generation of gp7 Dodecamer.

Sf6 is a P22-like phage and therefore has conserved architectures with P22 (3). The 9.4 Å cryo-EM reconstruction of the P22 phage tail (EMD-5051) (22) and the 3.25 Å X-ray structure of P22-gp4 dodecamer bound to the portal core (PDB-3LJ4) (21) have been determined, which offers the possibility to establish a Sf6-gp7 dodecamer model by symmetry operations. Two approaches were employed to achieve the Sf6-gp7 model. First, the P22-gp4 dodecamer was separated from the portal core and fitted into the corresponding volume of P22 tail cryo-EM map. The first approach was to superimpose one Sf6-gp7 monomer (peptide chain A was chosen) onto the P22-gp4 dodecamer, giving an rmsd of 1.269 Å, while the second approach was to directly dock one Sf6-gp7 monomer onto the P22 tail map. This led to slight downward movement of Sf6-gp7 compared with the one superimposed on P22-gp4. For both approaches, the P22-gp4 dodecamer symmetry was applied to Sf6-gp7 by using the CCP4 (42) program lsqkab (43), giving gp4-based and map-based gp7 dodecamer models, respectively.

Supplementary Material

Supplementary File
pnas.201706846SI.pdf (1.8MB, pdf)

Acknowledgments

We thank the General Medical Sciences and Cancer Institutes Structural Biology Facility at the Advanced Photon Source (GM/CA @ APS) for beamline access for X-ray data collection. Molecular graphics and analyses were performed with the UCSF Chimera package. This work was partially supported by Grant R01GM090010 and a subproject of the Kansas COBRE Grant P30 GM103326, both from the National Institute of General Medical Sciences of the National Institutes of Health (to L.T.).

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

Data deposition: Atomic coordinates have been deposited with the RCSB PDB with the entry code 5VGT.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1706846115/-/DCSupplemental.

References

  • 1.Casjens S, et al. The chromosome of Shigella flexneri bacteriophage Sf6: Complete nucleotide sequence, genetic mosaicism, and DNA packaging. J Mol Biol. 2004;339:379–394. doi: 10.1016/j.jmb.2004.03.068. [DOI] [PubMed] [Google Scholar]
  • 2.Zhao H, Sequeira RD, Galeva NA, Tang L. The host outer membrane proteins OmpA and OmpC are associated with the Shigella phage Sf6 virion. Virology. 2011;409:319–327. doi: 10.1016/j.virol.2010.10.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Parent KN, Gilcrease EB, Casjens SR, Baker TS. Structural evolution of the P22-like phages: Comparison of Sf6 and P22 procapsid and virion architectures. Virology. 2012;427:177–188. doi: 10.1016/j.virol.2012.01.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Israel V. E proteins of bacteriophage P22. I. Identification and ejection from wild-type and defective particles. J Virol. 1977;23:91–97. doi: 10.1128/jvi.23.1.91-97.1977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Perez GL, Huynh B, Slater M, Maloy S. Transport of phage P22 DNA across the cytoplasmic membrane. J Bacteriol. 2009;191:135–140. doi: 10.1128/JB.00778-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Serwer P, Wright ET, Hakala KW, Weintraub ST. Evidence for bacteriophage T7 tail extension during DNA injection. BMC Res Notes. 2008;1:36. doi: 10.1186/1756-0500-1-36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hu B, Margolin W, Molineux IJ, Liu J. The bacteriophage t7 virion undergoes extensive structural remodeling during infection. Science. 2013;339:576–579. doi: 10.1126/science.1231887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhao H, et al. Structure of a bacterial virus DNA-injection protein complex reveals a decameric assembly with a constricted molecular channel. PLoS One. 2016;11:e0149337. doi: 10.1371/journal.pone.0149337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Casjens SR, Molineux IJ. Short noncontractile tail machines: Adsorption and DNA delivery by podoviruses. Adv Exp Med Biol. 2012;726:143–179. doi: 10.1007/978-1-4614-0980-9_7. [DOI] [PubMed] [Google Scholar]
  • 10.Molineux IJ, Panja D. Popping the cork: Mechanisms of phage genome ejection. Nat Rev Microbiol. 2013;11:194–204. doi: 10.1038/nrmicro2988. [DOI] [PubMed] [Google Scholar]
  • 11.Earnshaw WC, Casjens SR. DNA packaging by the double-stranded DNA bacteriophages. Cell. 1980;21:319–331. doi: 10.1016/0092-8674(80)90468-7. [DOI] [PubMed] [Google Scholar]
  • 12.Fokine A, Rossmann MG. Molecular architecture of tailed double-stranded DNA phages. Bacteriophage. 2014;4:e28281. doi: 10.4161/bact.28281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rao VB, Feiss M. Mechanisms of DNA packaging by large double-stranded DNA viruses. Annu Rev Virol. 2015;2:351–378. doi: 10.1146/annurev-virology-100114-055212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lokareddy RK, et al. Portal protein functions akin to a DNA-sensor that couples genome-packaging to icosahedral capsid maturation. Nat Commun. 2017;8:14310. doi: 10.1038/ncomms14310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Strauss H, King J. Steps in the stabilization of newly packaged DNA during phage P22 morphogenesis. J Mol Biol. 1984;172:523–543. doi: 10.1016/s0022-2836(84)80021-2. [DOI] [PubMed] [Google Scholar]
  • 16.Cuervo A, et al. Structural characterization of the bacteriophage T7 tail machinery. J Biol Chem. 2013;288:26290–26299. doi: 10.1074/jbc.M113.491209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Tang L, Marion WR, Cingolani G, Prevelige PE, Johnson JE. Three-dimensional structure of the bacteriophage P22 tail machine. EMBO J. 2005;24:2087–2095. doi: 10.1038/sj.emboj.7600695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Olia AS, Bhardwaj A, Joss L, Casjens S, Cingolani G. Role of gene 10 protein in the hierarchical assembly of the bacteriophage P22 portal vertex structure. Biochemistry. 2007;46:8776–8784. doi: 10.1021/bi700186e. [DOI] [PubMed] [Google Scholar]
  • 19.Lorenzen K, Olia AS, Uetrecht C, Cingolani G, Heck AJ. Determination of stoichiometry and conformational changes in the first step of the P22 tail assembly. J Mol Biol. 2008;379:385–396. doi: 10.1016/j.jmb.2008.02.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wood WB. Bacteriophage T4 morphogenesis as a model for assembly of subcellular structure. Q Rev Biol. 1980;55:353–367. doi: 10.1086/411980. [DOI] [PubMed] [Google Scholar]
  • 21.Olia AS, Prevelige PE, Jr, Johnson JE, Cingolani G. Three-dimensional structure of a viral genome-delivery portal vertex. Nat Struct Mol Biol. 2011;18:597–603. doi: 10.1038/nsmb.2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lander GC, et al. The P22 tail machine at subnanometer resolution reveals the architecture of an infection conduit. Structure. 2009;17:789–799. doi: 10.1016/j.str.2009.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Tang J, et al. Peering down the barrel of a bacteriophage portal: The genome packaging and release valve in p22. Structure. 2011;19:496–502, and erratum (2011) 19:748. doi: 10.1016/j.str.2011.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Pintilie G, Chen DH, Haase-Pettingell CA, King JA, Chiu W. Resolution and probabilistic models of components in CryoEM maps of mature P22 bacteriophage. Biophys J. 2016;110:827–839. doi: 10.1016/j.bpj.2015.11.3522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Cardarelli L, et al. The crystal structure of bacteriophage HK97 gp6: Defining a large family of head-tail connector proteins. J Mol Biol. 2010;395:754–768. doi: 10.1016/j.jmb.2009.10.067. [DOI] [PubMed] [Google Scholar]
  • 26.Lhuillier S, et al. Structure of bacteriophage SPP1 head-to-tail connection reveals mechanism for viral DNA gating. Proc Natl Acad Sci USA. 2009;106:8507–8512. doi: 10.1073/pnas.0812407106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Liu G, et al. NMR structure of protein yqbG from Bacillus subtilis reveals a novel alpha-helical protein fold. Proteins. 2006;62:288–291. doi: 10.1002/prot.20666. [DOI] [PubMed] [Google Scholar]
  • 28.Maxwell KL, Yee AA, Arrowsmith CH, Gold M, Davidson AR. The solution structure of the bacteriophage lambda head-tail joining protein, gpFII. J Mol Biol. 2002;318:1395–1404. doi: 10.1016/s0022-2836(02)00276-0. [DOI] [PubMed] [Google Scholar]
  • 29.Dyson HJ, Wright PE. Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol. 2005;6:197–208. doi: 10.1038/nrm1589. [DOI] [PubMed] [Google Scholar]
  • 30.Pettersen EF, et al. UCSF Chimera—A visualization system for exploratory research and analysis. J Comput Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
  • 31.Chaban Y, et al. Structural rearrangements in the phage head-to-tail interface during assembly and infection. Proc Natl Acad Sci USA. 2015;112:7009–7014. doi: 10.1073/pnas.1504039112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Cuervo A, Carrascosa JL. Viral connectors for DNA encapsulation. Curr Opin Biotechnol. 2012;23:529–536. doi: 10.1016/j.copbio.2011.11.029. [DOI] [PubMed] [Google Scholar]
  • 33.Olia AS, et al. Binding-induced stabilization and assembly of the phage P22 tail accessory factor gp4. J Mol Biol. 2006;363:558–576. doi: 10.1016/j.jmb.2006.08.014. [DOI] [PubMed] [Google Scholar]
  • 34.Aksyuk AA, Rossmann MG. Bacteriophage assembly. Viruses. 2011;3:172–203. doi: 10.3390/v3030172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Yap ML, Mio K, Leiman PG, Kanamaru S, Arisaka F. The baseplate wedges of bacteriophage T4 spontaneously assemble into hubless baseplate-like structure in vitro. J Mol Biol. 2010;395:349–360. doi: 10.1016/j.jmb.2009.10.071. [DOI] [PubMed] [Google Scholar]
  • 36.Kabsch W. Xds. Acta Crystallogr D Biol Crystallogr. 2010;66:125–132. doi: 10.1107/S0907444909047337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kabsch W. Integration, scaling, space-group assignment and post-refinement. Acta Crystallogr D Biol Crystallogr. 2010;66:133–144. doi: 10.1107/S0907444909047374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Evans PR, Murshudov GN. How good are my data and what is the resolution? Acta Crystallogr D Biol Crystallogr. 2013;69:1204–1214. doi: 10.1107/S0907444913000061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Terwilliger TC, et al. Decision-making in structure solution using Bayesian estimates of map quality: The PHENIX AutoSol wizard. Acta Crystallogr D Biol Crystallogr. 2009;65:582–601. doi: 10.1107/S0907444909012098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Adams PD, et al. PHENIX: A comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr D Biol Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Winn MD, et al. Overview of the CCP4 suite and current developments. Acta Crystallogr D Biol Crystallogr. 2011;67:235–242. doi: 10.1107/S0907444910045749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kabsch W. A solution for the best rotation to relate two sets of vectors. Acta Crystallogr Sect A. 1976;32:922–923. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File
pnas.201706846SI.pdf (1.8MB, pdf)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES