Abstract
Tubulin-binding cofactor (TBC)-B is implicated in the presentation of α-tubulin ready to polymerize, and at the correct levels to form microtubules. Bioinformatics analyses, including secondary structure prediction, CD, and crystallography, were combined to characterize the molecular architecture of Trypanosoma brucei TBC-B. An efficient recombinant expression system was prepared, material-purified, and characterized by CD. Extensive crystallization screening, allied with the use of limited proteolysis, led to structures of the N-terminal ubiquitin-like and C-terminal cytoskeleton-associated protein with glycine-rich segment domains at 2.35-Å and 1.6-Å resolution, respectively. These are compact globular domains that appear to be linked by a flexible segment. The ubiquitin-like domain contains two lysines that are spatially conserved with residues known to participate in ubiquitinylation, and so may represent a module that, through covalent attachment, regulates the signalling and/or protein degradation associated with the control of microtubule assembly, catastrophe, or function. The TBC-B C-terminal cytoskeleton-associated protein with glycine-rich segment domain, a known tubulin-binding structure, is the only such domain encoded by the T. brucei genome. Interestingly, in the crystal structure, the peptide-binding groove of this domain forms intermolecular contacts with the C-terminus of a symmetry-related molecule, an association that may mimic interactions with the C-terminus of α-tubulin or other physiologically relevant partners. The interaction of TBC-B with the α-tubulin C-terminus may, in particular, protect from post-translational modifications, or simply assist in the shepherding of the protein into polymerization.
Keywords: CAP-Gly domain, CD, crystallography, tubulin-binding, ubiquitin-like
Introduction
Key to the regulation of many biological processes is the tight control exercised over protein biosynthesis, folding, and degradation. With respect to tubulin, a central component of the cytoskeleton, the correct folding and polymerization involves distinct stages that are influenced by several chaperones or cofactors 1–2. Initially, after a tubulin polypeptide is produced, it is captured by prefoldin 3 and subsequently passed to chaperonin-containing T-complex polypeptide 1 (CCT) 4. When released from CCT, tubulin is essentially folded, but appears to be unable to polymerize and form microtubules (MTs) 1. Our understanding of what occurs between release from CCT and MT formation is limited. Five tubulin-binding cofactors (TBCs) are implicated in late-stage tubulin folding and heterodimer assembly 1,5–10. TBC-B and TBC-E are implicated in binding α-tubulin, whereas TBC-A and TBC-D interact with β-tubulin. TBC-C is involved in the final stages of dimer formation, stimulating GTP hydrolysis in β-tubulin and heterodimer release from a protein assembly 2. Little is known about the molecular basis for the roles of these cofactors, a point that we sought to address in relation to TBC-B.
TBC-B comprises an N-terminal ubiquitin-like (Ubl) domain and a C-terminal cytoskeleton-associated protein with glycine-rich segment (CAP-Gly) domain. There are NMR structures of the TBC-B Ubl domains from Caenorhabditis elegans 11, Drosophila melanogaster [Protein Data bank (PDB) code 2KJR], Arabidopsis thaliana (2KJ6), and Mus musculus (1V6E). The mouse TBC-B CAP-Gly domain NMR structure has also been determined (1WHG), but the only crystal structure known is that of the C. elegans TBC-B CAP-Gly domain 12. We targeted the protein from Trypanosoma brucei, an organism that is considered to be a useful model for the study of MT biology 13. Searching against the translated T. brucei genome (http://www.genedb.org/) with mouse, human and C. elegans TBC-B sequences identified a single protein, Tb10.61.2930, with ∼ 40% amino acid sequence identity. This protein, T. brucei TBC-B (TbTBC-B), consists of 232 amino acids organized into the N-terminal Ubl domain (∼ 90 residues) and a CAP-Gly domain (residues 157–222; Fig. 1).
We now report the construction of an efficient bacterial recombinant expression system, protein purification and the use of CD and bioinformatics approaches to investigate secondary structure content and predicted flexibility. Crystallization of the full-length protein was achieved, but the samples were poorly ordered. Consequently, structural analyses of the individual domains were carried out. The fortuitous observation in the crystal structure of the CAP-Gly domain of intermolecular contacts with the C-terminus of a symmetry-related molecule suggests how interactions with partners, including α-tubulin, might occur. Comparisons with ubiquitin suggest a functional link between the structure of TBC-B and the regulation of distinct populations of proteins associated with tubulin biology by proteasome-dependent degradation.
Results and Discussion
Recombinant full-length TbTBC-B was produced in Escherichia coli and purified. The final step in purification was size exclusion gel chromatography, and this indicated that the full-length protein is monomeric in solution. The protein crystallized readily; however, despite displaying a good appearance (data not shown), X-ray diffraction from the crystals did not extend beyond 7 Å of resolution. Limited proteolysis produced two polypeptides, one of which, the Ubl domain, gave highly ordered crystals, and the structure was solved with anomalous dispersion methods 14. Subsequently, crystals of the CAP-Gly domain were also obtained after testing of a series of truncated constructs and molecular replacement allowed for structure solution. We note, that following proteolysis, the two domains were readily separated and purified, indicating that there was no domain–domain association, and when studied individually they remained monomeric in solution.
The Ubl domain
The N-terminal Ubl domain of TbTBC-B crystallized in the tetragonal space group P41212 with a single polypeptide in the asymmetric unit. Size exclusion chromatography employed during purification indicated that the protein was monomeric in solution.
The Ubl domain is a small globular entity consisting of a mixed four-strand β-sheet that forms a concave groove in which a single α-helix is placed (Fig. 2). This is an example of a β-grasp fold, a common structure involved in protein–protein interactions 5–11. A pronounced hydrophobic core is present, formed mainly by aliphatic side chains. The residues involved, which have < 10% solvent-accessible surface area, include Val4, Val6, Leu8, Tyr22, Ile28, Ile31, Val35, Thr41, Met46, Leu48, Leu50, Met62, Leu68, Cys73, Ile79, and Val81 (not shown). These core residues are, in general, conserved among the TBC-B family of proteins. The surface residues are more variable, perhaps indicative of a module that does not interact directly with α-tubulin. Note that, as α-tubulin is a very highly conserved protein, any interacting components would be expected to display conservation on their surfaces if they interacted at similar positions.
The Ubl module, in terms of amino acid sequence, is the more variable domain of TBC-B (Fig. 3) as compared with homologues. For example, the Ubl and CAP-Gly domains of T. brucei and mouse TBC-Bs share ∼ 30% and 55% sequence identity, respectively. Although remarkably similar in structure to ubiquitin (rmsd of 0.83 Å over 41 Cα atoms with PDB code 1UBQ), the TbTBC-B Ubl domain shares only 10% sequence identity. Ubiquitin contains seven lysines (residues 6, 11, 27, 29, 33, 48, and 63) that are targets for covalent modification 15–16. Intriguingly, and despite only low sequence conservation, the Ubl domain of TbTBC-B contains two lysines (Lys34 and Lys62), which correspond to Lys27 and Lys48 of ubiquitin, and a structural overlay reveals strong structural similarity in terms of the positioning of these residues (Fig. 4). The Ubl domains of known structure overlap at a similar level (Fig. 5A).
The CAP-Gly domain
Crystals of the CAP-Gly domain of TbTBC-B were obtained from a construct producing the 81 C-terminal amino acids. The peptide eluted from the size exclusion column as a single species of ∼ 11 kDa, indicating a monomer in solution. The crystals are orthorhombic with space group P212121, with two polypeptides, labelled A and B, in the asymmetric unit. The rmsd observed after superposition of 79 Cα positions of molecule A on molecule B was 0.86 Å. No restraints were imposed on noncrystallographic symmetry during refinement, so this indicates that the molecules are highly similar. The numbering of residues and secondary structure elements in the C-terminal domain is carried on from the Ubl domain.
Although the sequence derived from the genome assigns residue 223 as aspartate, the cloned gene encodes a glutamate at this position, an observation confirmed by the structural analysis. This conservative difference could be an artefact of the cloning process, or, more likely, could result from a natural variation in the strain that from which genomic DNA was obtained. We note that the T. brucei gambiense gene sequence for TBC-B (http://www.genedb.org/) encodes a glutamate in this position.
The TbTBC-B CAP-Gly domain is a small globular structure with a similar fold to other CAP-Gly domains, and is an important module for the recognition and binding of the C-terminal tail of α-tubulin 17–18. Structural comparison reveals that TbTBC-B matches closely to other CAP-Gly domains (Fig. 5B). The C. elegans TBC-B CAP-Gly domain, which shares ∼ 50% sequence identity, is the closest structural relative, with an rmsd of 1.4 Å for 95 Cα atoms. The fold is dominated by five β-strands, β5–β9, forming a consecutive twisted antiparallel sheet with β9 returning to lie in an antiparallel fashion at the other side of β5. The side of strand β8 forms the floor of a solvent-exposed, primarily basic groove, which is flanked by the two extended loops, linking β6 and β7, and β7 and β8. NMR studies have revealed that the α-tubulin C-terminal tail peptide binds to this groove in the CAP-Gly domain of human CAP-Gly domain containing linker protein 170 (CLIP-170) 19. Several glycines (residues 170, 188, 195, 200, 215, and 225), which are responsible for the name of this domain, are highly conserved and contribute to the fold of the domain 17. Residues around the peptide-binding groove are also highly conserved, Phe216, Leu180, Trp185, Val201 and Phe207 forming a hydrophobic core that helps to stabilize the floor of the groove.
On the other side of the domain from the peptide-binding groove, there are two salt bridges formed by Arg159 with Asp191, and Arg172 with Glu189, which link β5 with β7, and β6 with β7, respectively. In addition, hydrogen bonds between Arg159 and the backbone oxygen of Phe227 link β5 to the C-terminal segment of β9 (Fig. 6).
CAP-Gly domains, including those of TBC-B, possess a highly conserved pentapeptide tubulin-binding motif with a sequence that is almost always GKNDG 19–20. This motif is placed on one of the loops that flank the peptide-binding groove. NMR studies of CLIP-170 indicate that the asparagine at position 3 can participate in hydrogen bond formation with the C-terminal tail of peptides ending with the sequence EE[Y/F]. This is the sequence found at the C-terminus of α-tubulin and several MT tip-binding proteins. Occasionally, the asparagine in the tubulin-binding motif is replaced by a histidine, a conservative change that does not affect peptide binding. However, mutating either the lysine or the asparagine to alanine has a deleterious effect on binding 20. The TbTBC-B CAP-Gly domain has a different sequence in this tubulin-binding motif at residues 195–199, GKGDG. Glycine rather than asparagine occupies the third position, and this nonconservative change precludes the formation of a side chain hydrogen bond with the terminal residue of an EE[Y/F] peptide.
No crystal structures have been solved with an α-tubulin tail-like ligand bound to TBC-B, and, despite our efforts, it was not possible to cocrystallize the TbTBC-B CAP-Gly domain with peptides either. However, and fortuitously, in our structure the C-terminus of chain A forms intermolecular contacts with the basic groove of a symmetry-related chain B (Fig. 7A). The symmetry operation is (− x + 1/2, − y, z + 1/2). The loss of these intermolecular contacts may help to explain why constructs in which the C-terminus had been truncated did not crystallize. The C-terminus of TbTBC-B has the sequence EVF, which is similar to the α-tubulin tail EE[Y/F] motif, and so these intermolecular interactions can be taken to mimic the association with a relevant binding partner. The terminal residue of chain A, Phe232, anchors the peptide in the groove with a combination of hydrophobic and hydrophilic interactions. The aromatic side chain interacts with a hydrophobic patch consisting of the chain B residues Leu180, Val201, and Phe216. The carboxylate of this terminal residue forms a hydrogen bond with the main chain amide of Phe216, and participates in a water-mediated hydrogen-bonding network with Asp198 and Thr200. The C-terminal peptide then arches away from β8, trailing back to Glu230, which forms a salt bridge with Arg167. Another residue, Gln221, interacts with Glu230, as well as contributing to a water-mediated interaction with the backbone of Pro228. Gln221 is conserved in the trypanosomatid TBC-B sequences, but is highly variable in other species. The final interaction that chain A undergoes as it exits the groove is a salt bridge between Asp226 and Arg218.
The NMR structure of the CAP-Gly domain of human CLIP-170 complexed with an α-tubulin tail peptide provides an example for comparative purposes 20. The domains share ∼ 50% identity and are structurally well conserved, with an rmsd of 1.06 Å over 50 Cα atoms. In the CLIP-170 α-tubulin tail peptide complex, the peptide is positioned closer to the β8 strand (Fig. 7B).
The Gly/Asp difference at position 4 in the tubulin-binding loop, discussed earlier, allows the C-terminus of chain A to bind further along the groove than the α-tubulin tail peptide in the CLIP-170 structure. This causes a relative shift in the position of the Val231 side chain, which is directed out of the groove, and undergoes no interactions with the CAP-Gly domain. The third residue from the end of chain A, Glu230 (A3 in Fig. 7), is in the same position as the second residue from the end in the α-tubulin tail peptide, Glu450 (P2 in Fig. 7). This may explain why the sequence EVF can bind, contrary to the conclusion that EE[Y/F] is the essential recognition sequence 18–19. In an attempt to further investigate CAP-Gly peptide interactions, we carried out isothermal titration calorimetry (ITC) with the hexapeptide EDVEEY, which represents the C-terminal residues of T. brucei α-tubulin. A range of concentrations of the protein domain and the peptide were tested, but no heat changes were observed (data not shown).
The peptide-binding groove on chain A is occupied by two formate molecules, derived from the crystallization mixture, which bind in similar fashion to the negatively charged moieties of the C-terminal peptide just discussed (data not shown). One formate interacts with Arg167 and the other with Gln221.
Unique features of Trypanosoma TBC-B
CAP-Gly domains recognize the highly conserved α-tubulin C-terminal tail with the sequence EE[Y/F] 18. In T. cruzi and Trypanosoma vivax α-tubulin, this sequence is matched exactly. In T. brucei gambiense and T. brucei 427, the sequence is EMF, and, as just described, we present structural data to show that the sequence EVF can also bind to a CAP-Gly domain. Our structure of TbTBC-B indicates that the penultimate residue, whether glutamate or methionine, is probably directed out of the peptide-binding groove, and, with no direct interactions involving the side chain, the identity would appear to be less important for binding.
Both TBC-B and TBC-E retain highly conserved CAP-Gly domains across numerous different species, and this probably reflects important roles in tubulin biology 20. The CAP-Gly domain of TBC-E also usually contains the GKNDG tubulin-binding motif. The closest homologue to human TBC-E in T. brucei is Tb927.3.2680, a 530-residue protein with ∼ 25% sequence identity. The protozoan protein contains a leucine-rich repeat segment and a Ubl domain but, surprisingly, lacks a CAP-Gly domain. In fission yeast, the TBC-E homologue Alp21 also lacks a CAP-Gly domain, but is indispensible for maintaining α-tubulin levels, MT integrity, and cell survival 21–22. In contrast, mammals possess a TBC-E paralogue, which lacks the CAP-Gly domain. This protein, known as E-like, with ∼ 30% sequence identity with TBC-E, cannot compensate for loss of TBC-E, and, instead of being involved in tubulin biogenesis, is implicated in degradation 23. This difference is perhaps a legacy of a more complex and highly regulated tubulin biology in higher eukaryotes, and suggests that a degree of care is required when considering different model systems.
We could not identify any other CAP-Gly domains encoded by the T. brucei genome, even in proteins that usually contain such modules, e.g. kinesins. Perhaps, in trypanosomatids, TBC-B is sufficient to provide this interaction with the α-tubulin tail, or other modules, yet to be discovered, may compensate for this function.
The missing structural information
As it was not possible to obtain crystallographic data on the linker region between the globular domains (residues 89–153), the CD spectrum of full-length TBC-B was analysed. The spectra indicated a low α-helical content of ∼ 10% and a β-strand content of ∼ 35%. The α-helical content of the Ubl domain itself is ∼ 8% of the overall structure, and, together with the prediction of a short helical segment between residues 116 and 123, ∼ 4% of the sequence, there is excellent agreement with the CD data. Residues with a β-strand conformation constitute nearly 30% of the full-length structure, which is also in good agreement with the spectroscopic data. Both the CD results and the structural analyses also agreed well with the predicted secondary structure, and overall indicate a two-domain structure with a flexible linker.
Functional implications and concluding remarks
Structures of the Ubl and CAP-Gly domains of TbTBC-B, the first such protist structures, were determined to 2.3-Å and 1.6-Å resolution, respectively. It was not possible to solve the full-length structure, as the crystals did not diffract sufficiently well, and attempts to extend structural information into the region linking the two domains were also unsuccessful. It was determined that the missing linker region is mostly unstructured, with only a short segment of α-helix being noted, and potentially flexible.
The fold of the Ubl domain is highly conserved, despite low sequence identity within TBC-B proteins and, indeed, ubiquitin itself. A striking similarity is observed in the spatial location of two functionally important lysines in ubiquitin, and strongly supports a biological function for the Ubl domain, and by implication for TBC-B. Through reversible post-translational modification, TBC-B can influence aspects of tubulin biology in a proteasome-dependent manner. Such a conclusion is consistent with other studies. Yeast two-hybrid studies suggest that proteasome-dependent degradation of human TBC-B is driven by gigaxonin binding to the Ubl domain 24. In addition, Saccharomyces cerevisiae TBC-E interacts with the ubiquitin receptor Rpn10 via the Ubl domain, and subsequently with a ubiquitin ligase complex, providing a route to protein degradation 10. We note also that Lys34, but not Lys62, of TbTBC-B is conserved in the Ubl domains of TBC-E (data not shown).
In the final stages of MT assembly, α-tubulin and β-tubulin form a heterodimer with the encouragement of the TBC proteins. A balance has to be struck between the availability of free tubulin, heterodimer assembly, and release of MT structures. This process involves a number of highly abundant proteins, and the presence of Ubl domains offers a means whereby protein folding and the population of complex assemblies can be regulated by protein degradation or recycling. The use of a Ubl protein rather than ubiquitin itself may provide specificity with regard to this aspect of tubulin biology. Cognate activating enzymes might then contribute to the regulation of levels of free tubulin heterodimers, tubulin polymerization, and MT catastrophe, or to avoid miscommunication during a stress response. It will therefore be of great interest to now identify, from a plethora of candidates, the specific ligases and proteases that might participate in the control of MT disassembly/assembly as opposed to other biological functions.
The molecular packing in the crystal structure of the CAP-Gly domain places the C-terminal tail of one CAP-Gly domain in the peptide-binding groove of a symmetry-related molecule, and allows examination of interactions in this binding site. The binding differs slightly from that observed in a CLIP-170 CAP-Gly domain bound to an α-tubulin tail peptide, owing to a difference in the β7–β8 tubulin-binding loop of TbTBC-B. A glycine replaces asparagine at position 3 of the normally conserved GKNDG sequence, and allows the C-terminus of chain A to bind further down in the peptide-binding groove. This appears to be a trypanosomatid-specific feature.
Tubulin and MT assembly are subject to C-terminal post-translational modifications, which provide important tracking positions for a range of MT-binding partners 25. For example, the C-terminal residue of α-tubulin is a tyrosine that can be removed and then replaced, and adjacent glutamates can be polyglutamylated or, in some species, polyglycylated. The consensus sequence for recognition by CAP-Gly domains has been deduced as EE[Y/F], and proteolytic removal of the tyrosine prevents such recognition of α-tubulin 25. This provides a feature that might be used to regulate subpopulations of MTs. A plausible function of the CAP-Gly domain may therefore be to bind then block post-translational modifications of the C-terminal tyrosine, as the complex with α-tubulin appears to form even while it is still bound to CCT 8, i.e. while the tyrosine is still present.
We were unable to observe any binding of the CAP-Gly domain with a hexapeptide representing the terminal residues of T. brucei α-tubulin by using ITC. According to previous work, the presence of TBC-E might be required for TBC-B to form a stable complex with α-tubulin 6. Further experiments would be required to inform hypotheses involving the need for partner proteins and/or post-translational modifications that allow TBC-B to contribute to MT assembly or disassembly.
Experimental procedures
Protein production and purification
The gene encoding full-length TbTBC-B was amplified from genomic DNA (strain 927) with 5′-CATATGTCCGTTGTTAAAGTATCGC-3′ and 5′-CTCGAGTTAAAACACCTCCGGGGGAAAGTC-3′ as forward and reverse primers, respectively (ThermoFisher Scientific, Waltham, MA, USA). The restriction enzyme sites for NdeI and XhoI are in bold. The PCR product was ligated into pCR2.1-TOPO with the TOPO TA Cloning Kit (Invitrogen, Carlsbad, CA, USA), and then cloned into a modified pET15b (Novagen, Madison, WI, USA) expression vector, which adds an N-terminal His6 tag and a tobacco etch virus (TEV) protease cleavage site to the product. Sequencing confirmed the identity of the construct, and the vector was heat-shock-transformed into E. coli Rosetta (DE3) pLysS cells for expression. DNA encoding the CAP-Gly domain, residues 157–222, was amplified from the vector containing the TbTBC-B gene with 5′-CATATGGCAGAGACAATACATGTGGGGG-3′ and 5′-CTCGAGTTAAAACACCTCCGGGGGAAAGTC-3′ as forward and reverse primers, respectively (ThermoFisher Scientific). The gene fragment was cloned into the modified pET15b vector as described above, and transformed into E. coli Rosetta (DE3) pLysS cells for protein production.
Similar protocols were applied to purify full-length TbTBC-B and the CAP-Gly domain. Typically, cells were grown at 37 °C in 1 L of LB medium supplemented with carbenicillin (50 μg·L−1) and chloramphenicol (25 μg·L−1). Gene expression was induced, at a D600 nm of 0.6, by addition of 1 mm isopropyl thio-β-d-galactoside. The culture was incubated for a further 16 h at 22 °C, and cells were then harvested by centrifugation at 3500 g for 30 min at 4 °C. The cells were resuspended in a lysis buffer (50 mm Tris/HCl, pH 7.5, 250 mm NaCl, 25 mm imidazole) containing DNase I and EDTA-free protease inhibitors (Roche, Basel, Switzerland), and then lysed with a French press at 16000 p.s.i. The resultant lysate was centrifuged at 35 000 g for 30 min at 4 °C, and the supernatant was loaded onto a pre-equilibrated HisTrap HP 5-mL column (GE Healthcare, Milwaukee, WI, USA) precharged with Ni2+. A linear gradient of 25 mm to 1 m imidazole was applied to elute the proteins, and the derived fractions were analysed by SDS/PAGE. Samples were pooled and incubated with His-tagged TEV protease at 30 °C for 3 h, and then dialyzed into 50 mm Tris/HCl (pH 7.5) and 250 mm NaCl. The sample was loaded onto a HisTrap HP 5-mL column to remove the His-tagged TEV protease, uncleaved product, and histidine-rich contaminants. A Superdex 20 026/60 size exclusion column (GE Healthcare) equilibrated with 50 mm Tris/HCl (pH 7.5) and 250 mm NaCl was used to further purify the protein. This column had been calibrated with BioRad Gel Filtration standards. Fractions containing the proteins were pooled and concentrated to 10 mg·L−1 with Amicon Ultra devices (Millipore) for subsequent use. The purity and identity of the proteins were confirmed by MALDI-TOF MS. Yields of ∼ 20 mg·L−1 cell culture for full-length TBC-B and ∼ 8 mg·L−1 cell culture for the CAP-Gly domain were obtained.
A series of constructs encoding five polypeptide fragments, covering residues 91–232, 101–232, 113–232, 128–226, and 143–232, were also produced in an attempt to extend structural knowledge between the domains. These polypeptides proved to be either insoluble or failed to crystallize, and so no further details are provided.
CD of full-length TbTBC-B
CD spectra were recorded on a Jasco J-810 spectropolarimeter. Far-UV CD spectra were obtained with 0.5 mg·mL−1 protein solutions and a 0.02-cm-pathlength quartz cuvette. Five scans were accumulated and averaged with the following parameters: scan rate, 50 nm·min−1; response, 0.5 s; and bandwidth, 1 nm. Protein CD spectra were corrected by subtracting the appropriate buffer spectrum and correcting for protein concentration. Protein secondary structure estimates were obtained with the contin procedure 26, available from the dichroweb server 27.
Analysis of crystals formed following proteolysis
Crystals of the full-length protein did not diffract beyond ∼ 7-Å resolution (data not shown). Limited proteolysis by addition of chymotrypsin to the crystallization drops 28 was therefore tested in the search for ordered crystals. Crystals were observed in a number of conditions, one of which was successfully optimized, as detailed below. These crystals were harvested, washed in the reservoir solution, dissolved in ddH2O, and then submitted for MALDI-TOF MS analysis. This gave a molecular mass of 12 069 Da for the polypeptide. The fragment was isolated by SDS/PAGE and then transferred onto a poly(vinylidene difluoride) membrane. The band was then submitted for N-terminal Edman sequencing, and the sequence GHMSVVKV was identified. This corresponds to the N-terminus of TbTBC-B, with the first two residues being remnants of the TEV cleavage site after treatment to remove the His6 tag. These data indicated that the Ubl domain had crystallized.
Isolation of products after chymotrypsin treatment
Full-length TbTBC-B (50 mg) was incubated with chymotrypsin (0.02 mg) overnight, and then dialyzed into 50 mm Tris/HCl (pH 7.5) and 50 mm NaCl. The mixture was loaded onto a HiTrap Q HP 5-mL column, and eluted with a linear gradient of 50 mm to 1 m NaCl. The products of the cleavage eluted as two peaks, and were analysed by SDS/PAGE. Sample 1 eluted at 50 mm NaCl, and sample 2 at 220 mm NaCl. MALDI-TOF MS of sample 1, the Ubl domain, gave a mass of 12 002 Da, and MALDI-TOF MS of sample 2 gave a mass of 12 116 Da. Both protein fragment samples were concentrated to 5 mg·mL−1.
Selenomethionine (SeMet) derivative of the Ubl domain
A SeMet-substituted Ubl domain was obtained from material generated as described above, and with incorporation achieved following metabolic inhibition 29. Purification of the protein yielded ∼ 2 mg·L−1 of cell culture. Analysis by MALDI-TOF MS indicated full incorporation of four SeMet residues.
Crystallization and data collection
Initial crystallization conditions were established by sitting drop vapour diffusion with a Phoenix Liquid Handling System (Rigaku; Art Robins Instruments, Mountain View, CA, USA) and commercially available screens. Several conditions were optimized for hanging drop vapour diffusion, with drops of 1 μL of protein solution and 1 μL of reservoir.
Rectangular crystals (dimensions 0.3 × 0.1 × 0.1 mm) of the Ubl domain, which had been isolated after proteolysis of the full-length protein, were obtained with a reservoir of 0.1 m Hepes (pH 7.5), 1% poly(ethylene glycol) 400, and 2 m ammonium sulfate, and a protein concentration of 2.5 mg·mL−1. Crystals of the CAP-Gly domain, derived from expression of a gene fragment, were rod-like, with dimensions of 1 × 0.05 × 0.05 mm, and were obtained with reservoir conditions of 0.2 m potassium formate, 30% poly(ethylene glycol) 3350 and a 7.5 mg·mL−1 protein solution in 50 mm Tris/HCl (pH 7.5) and 250 mm NaBr.
Crystals were transferred to a solution containing a mixture of their reservoir solution and 40% poly(ethylene glycol) 400 for ∼ 15 s before being cooled to −173 °C in a stream of nitrogen. The crystals were characterized in-house with a Micromax HF007 copper-rotating anode X-ray generator equipped with an R-axis IV++ dual image plate detector. Subsequently, data were collected with ADSC charge-coupled device detectors at the Diamond light source beamline I03 for the Ubl domain and at beamline I02 for the CAP-Gly domain.
Data from the Ubl domain crystal were processed with xds 30 and scala 31. Initial phases were obtained with SeMet single-wavelength anomalous dispersion methods 14, with data measured at the experimentally determined f′ maximum wavelength (λ = 0.98 Å) in the ccp4 pipeline 32 with crank 33. A figure-of-merit of 0.23 was obtained from bp3 34, and this increased to 0.67 after density modification and solvent flattening in solomon 35. This phase set was used for initial model building with buccaneer 36.
Data from the CAP-Gly domain crystal were scaled and processed with imosflm 37 and scala 31. The phyre server 38 identified the NMR structure of the M. musculus CAP-Gly domain of TBC-B (PDB code 1WHG), sharing a sequence identity of ∼ 55%, as a suitable model for molecular replacement. A poly-Ala model was prepared with chainsaw 39, and the positions of two molecules were then identified with phaser 40. Rigid body refinement gave an initial R-factor of 49% to 1.6-Å resolution.
Both structures were refined with rounds of map inspection and model manipulation with coot 41 and refinement calculations with refmac 42. When the protein models were nearly complete, waters and ligands (ethylene glycol in the Ubl domain and formate in the CAP-Gly domain) were added, together with multiple conformers for several side chains. The geometric quality of the models was assessed with molprobity 43. Figures were created with pymol 44, and solvent accessibility analyses were performed with areaimol 32. Crystallographic statistics are presented in Table 1. Coordinates and structure factors are deposited with the PDB under accession codes 4B6W for the Ubl domain and 4B6M for the Cap-Gly domain.
Table 1.
Ubl domain | CAP-Gly domain | |
---|---|---|
Space group | P41212 | P212121 |
Unit cell lengths (Å) | 50.9, 50.9, 77.2 | 32.7, 55.7, 80.4 |
Resolution range (Å) | 36.00–2.35 | 40.18–1.59 |
Completeness (%) | 100 (99.8)a | 100 (99.97) |
<I/σ(I)> | 15.4 (3.3) | 28.2 (9.6) |
Reflections measured/unique | 58 686/4631 | 282 648/20 384 |
Redundancy | 12.7 (9.4) | 13.9 (14.1) |
Anomalous redundancy: | 7.1 (5.0) | NA |
Rmergeb | 0.12 (0.64) | 0.07 (0.28) |
Rworkc/Rtreed | 0.16/0.21 | 0.20/0.24 |
Protein residues (chain A/B) | 89 | 79/78 |
Ligands (number) | Ethylene glycol (1) | Formate (6) |
Rmsd from ideal geometry | ||
Bond lengths (Å)/angles (o) | 0.015, 1.53 | 0.01, 1.44 |
Thermal parameters (B, Å2) | ||
Wilson B | 42.6 | 14.6 |
Mean B (all atoms) | 20.1 | 14.8 |
Protein atoms (chainA/B) | 18.0 | 12.3/12.5 |
Ligands | 58.9 | 21.0 |
Water molecules | 42.4 | 24.8 |
Ramachandran plot | ||
Favored/allowed (%) | 97.6/2.4 | 98/2 |
NA, not analysed.
Values in parentheses refer to the highest-resolution bin.
Rmerge = ∑h∑i||(h,i) − <I(h)> ∑h∑I I(h,i).
Rwork = ∑hkl||Fo|–|Fc||/∑|Fo|, where Fo is the observed structure factor amplitude and Fc is the structure-factor amplitude calculated from the model.
Rfree is the same as Rwork except that it was only calculated using a subset, 5%, of the data that are not included in any refinement calculations.
Bioinformatic analyses
Secondary structure and disorder predictions were obtained from phyre 38 and psipred 45. Structural comparisons and homologues were identified with dali 46. Conserved residues within the CAP-Gly domain were identified with consurf 47. The UniProt reference cluster 90 (UNIREF90) 48 database was searched, and sequences were aligned with mafft 49. Overall, 642 sequences were identified with psi blast 50, on the basis of sequence identity in the range 35–95%, 417 of which were unique. A subset of sequences, 150, the default value in consurf, with identity > 50% were used for comparisons. However, there were not enough homologues with identities of > 30% of the Ubl domain for consurf to be used, so 31 sequences with identities in the range 31–38%, were aligned by the use of muscle 51, and residues with > 75% conservation were annotated.
ITC details
ITC experiments were performed at 25 °C with a VP-ITC calorimeter (MicroCal, Northampton, MA, USA). The 1.4-mL sample cell and 300-μL syringe were filled with TBC-B CAP-Gly domain and peptide EDVEEY in a buffer of 50 mm Tris/HCl (pH 7.5) and 250 mm NaCl. Typically, 8 μL of peptide was injected from the stirred syringe (305 r.p.m.) 30 times into the sample cell. The peptides were dissolved in the minimum amount of the buffer, and then diluted to the appropriate concentration just before use. Four experiments were carried out with CAP-Gly/hexapeptide concentrations of 1 : 10, 10 : 100, 7 : 500 and 45 : 500 μL. No binding events were observed.
Acknowledgments
This work was supported by BBSRC (PhD studentship to J. Fleming) and The Wellcome Trust (grant numbers WT082596, WT083481, and WT094090). We thank Diamond for synchrotron beam time and staff support, and our colleague D. Norman for discussions and help.
Glossary
- CAP-Gly
cytoskeleton-associated protein with glycine-rich segment
- CCT
chaperonin containing T-complex polypeptide 1
- CLIP-170
cytoskeleton-associated protein with glycine-rich segment domain containing linker protein 170
- ITC
isothermal titration calorimetry
- MT
microtubule
- PDB
Protein Data Bank
- SeMet
selenomethionine
- TBC
tubulin-binding cofactor
- TbTBC-B
Trypanosoma brucei tubulin-binding cofactor B
- TEV
tobacco etch virus
- Ubl
ubiquitin-like
References
- 1.Szymanski D. Tubulin folding cofactors: half a dozen for a dimer. Curr Biol. 2002;12:R767–R769. doi: 10.1016/s0960-9822(02)01288-5. [DOI] [PubMed] [Google Scholar]
- 2.Lundin VF, Leroux MR, Stirling PC. Quality control of cytoskeletal proteins and human disease. Trends Biochem Sci. 2010;35:288–297. doi: 10.1016/j.tibs.2009.12.007. [DOI] [PubMed] [Google Scholar]
- 3.Vainberg IE, Lewis SA, Rommelaere H, Ampe C, Vandekerckhove J, Klein HL, Cowan NJ. Prefoldin, a chaperone that delivers unfolded proteins to cytosolic chaperonin. Cell. 1998;93:863–873. doi: 10.1016/s0092-8674(00)81446-4. [DOI] [PubMed] [Google Scholar]
- 4.Muñoz IG, Yébenes H, Zhou M, Mesa P, Serna M, Park AY, Bragado-Nilsson E, Beloso A, de Cárcer G, Malumbres M, et al. Crystal structure of the open conformation of the mammalian chaperonin CCT in complex with tubulin. Nat Struct Mol Biol. 2011;18:14–19. doi: 10.1038/nsmb.1971. [DOI] [PubMed] [Google Scholar]
- 5.Grynberg M, Jaroszewski L, Godzik A. Domain analysis of the tubulin cofactor system: a model for tubulin folding and dimerization. BMC Bioinformatics. 2003;10:46. doi: 10.1186/1471-2105-4-46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kortazar D, Fanarraga ML, Carranza G, Bellido J, Villegas JC, Avila J, Zabala JC. Role of cofactors B (TBCB) and E (TBCE) in tubulin heterodimer dissociation. Exp Cell Res. 2007;313:425–436. doi: 10.1016/j.yexcr.2006.09.002. [DOI] [PubMed] [Google Scholar]
- 7.Lopez-Fanarraga M, Carranza G, Bellido J, Kortazar D, Villegas JC, Zabala JC. Tubulin cofactor B plays a role in the neuronal growth cone. J Neurochem. 2007;100:1680–1687. doi: 10.1111/j.1471-4159.2006.04328.x. [DOI] [PubMed] [Google Scholar]
- 8.Carranza G, Castaño R, Fanarraga ML, Villegas JC, Gonçalves J, Soares H, Avila J, Marenchino M, Campos-Olivas R, Montoya G, et al. Autoinhibition of TBCB regulates EB1-mediated microtubule dynamics. Cell Mol Life Sci. 2013;70:357–371. doi: 10.1007/s00018-012-1114-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Baffet AD, Benoit B, Januschke J, Audo J, Gourhand V, Roth S, Guichet A. Drosophila tubulin-binding cofactor B is required for microtubule network formation and for cell polarity. Mol Biol Cell. 2012;23:3591–3601. doi: 10.1091/mbc.E11-07-0633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Voloshin O, Gocheva Y, Gutnick M, Movshovich N, Bakhrat A, Baranes-Bachar K, Bar-Zvi D, Parvari R, Gheber L, Raveh D. Tubulin chaperone E binds microtubules and proteasomes and protects against misfolded protein stress. Cell Mol Life Sci. 2010;67:2025–2038. doi: 10.1007/s00018-010-0308-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lytle BL. Solution structure of a ubiquitin-like domain from tubulin-binding cofactor B. J Biol Chem. 2004;279:46787–46793. doi: 10.1074/jbc.M409422200. [DOI] [PubMed] [Google Scholar]
- 12.Li S, Finley J, Liu ZJ, Qiu SH, Chen H, Luan CH, Carson M, Tsao J, Johnson D, Lin G, et al. Crystal structure of the cytoskeleton-associated protein glycine-rich (CAP-Gly) domain. J Biol Chem. 2002;277:48596–48601. doi: 10.1074/jbc.M208512200. [DOI] [PubMed] [Google Scholar]
- 13.Ralston KS, Kabututu ZP, Melehani JH, Oberholzer M, Hill KL. The Trypanosoma brucei flagellum: moving parasites in new directions. Annu Rev Microbiol. 2009;63:335–362. doi: 10.1146/annurev.micro.091208.073353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Micossi E, Hunter WN, Leonard GA. De novo phasing of two crystal forms of tryparedoxin II using the anomalous scattering from S atoms: a combination of small signal and medium resolution reveals this to be a general tool for solving protein crystal structures. Acta Crystallogr D Biol Crystallogr. 2002;58:21–28. doi: 10.1107/s0907444901016808. [DOI] [PubMed] [Google Scholar]
- 15.Ikeda F, Dikic I. Atypical ubiquitin chains: new molecular signals. ‘Protein modifications: beyond the usual suspects’ review series. EMBO Rep. 2008;9:536–542. doi: 10.1038/embor.2008.93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Pickart CM. Mechanisms underlying ubiquitination. Annu Rev Biochem. 2001;70:503–533. doi: 10.1146/annurev.biochem.70.1.503. [DOI] [PubMed] [Google Scholar]
- 17.Weisbrich A, Honnappa S, Jaussi R, Okhrimenko O, Frey D, Jelesarov I, Akhmanova A, Steinmetz MO. Structure–function relationship of CAP-Gly domains. Nat Struct Mol Biol. 2007;14:959–967. doi: 10.1038/nsmb1291. [DOI] [PubMed] [Google Scholar]
- 18.Steinmetz MO, Akhmanova A. Capturing protein tails by CAP-Gly domains. Trends Biochem Sci. 2008;33:535–545. doi: 10.1016/j.tibs.2008.08.006. [DOI] [PubMed] [Google Scholar]
- 19.Mishima M, Maesaki R, Kasa M, Watanabe T, Fukata M, Kaibuchi K, Hakoshima T. Structural basis for tubulin recognition by cytoplasmic linker protein 170 and its autoinhibition. Proc Natl Acad Sci USA. 2007;104:10346–10351. doi: 10.1073/pnas.0703876104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Honnappa S, Okhrimenko O, Jaussi R, Jawhari H, Jelesarov I, Winkler FK, Steinmetz MO. Key interaction modes of dynamic +TIP networks. Mol Cell. 2006;23:663–671. doi: 10.1016/j.molcel.2006.07.013. [DOI] [PubMed] [Google Scholar]
- 21.Radcliffe PA, Garcia MA, Toda T. The cofactor-dependent pathways for alpha- and beta-tubulins in microtubule biogenesis are functionally different in fission yeast. Genetics. 2000;156:93–103. doi: 10.1093/genetics/156.1.93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Peris L, Thery M, Fauré J, Saoudi Y, Lafanechère L, Chilton JK, Gordon-Weeks P, Galjart N, Bornens M, Wordeman L, et al. Tubulin tyrosination is a major factor affecting the recruitment of CAP-Gly proteins at microtubule plus ends. J Cell Biol. 2006;174:839–849. doi: 10.1083/jcb.200512058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bartolini F, Tian G, Piehl M, Cassimeris L, Lewis SA, Cowan NJ. Identification of a novel tubulin-destabilizing protein related to the chaperone cofactor E. J Cell Sci. 2005;118:1197–1207. doi: 10.1242/jcs.01719. [DOI] [PubMed] [Google Scholar]
- 24.Wang W, Ding J, Allen E, Zhu P, Zhang L, Vogel H, Yang Y. Gigaxonin interacts with tubulin folding cofactor B and controls its degradation through the ubiquitin–proteasome pathway. Curr Biol. 2005;15:2050–2055. doi: 10.1016/j.cub.2005.10.052. [DOI] [PubMed] [Google Scholar]
- 25.Janke C, Bulinski JC. Post-translational regulation of the microtubule cytoskeleton: mechanisms and functions. Nat Rev Mol Cell Biol. 2011;12:773–786. doi: 10.1038/nrm3227. [DOI] [PubMed] [Google Scholar]
- 26.Provencher S, Gloeckner J. Estimation of globular protein secondary structure from circular dichroism. Biochemistry. 1981;20:33–37. doi: 10.1021/bi00504a006. [DOI] [PubMed] [Google Scholar]
- 27.Whitmore L, Wallace BA. DICHROWEB, an online server for protein secondary structure analyses from circular dichroism spectroscopic data. Nucleic Acids Res. 2004;32:668–673. doi: 10.1093/nar/gkh371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Dong A, Xu X, Edwards AM. In situ proteolysis for protein crystallization and structure determination. Nat Methods. 2007;4:1019–1021. doi: 10.1038/nmeth1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Doublié S. Preparation of selenomethione proteins for phase determination. Methods Enzymol. 1997;276:523–530. [PubMed] [Google Scholar]
- 30.Kabsch W. XDS. Acta Crystallogr D Biol Crystallogr. 2010;66:125–132. doi: 10.1107/S0907444909047337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Evans P. Scaling and assessment of data quality. Acta Crystallogr D Biol Crystallogr. 2005;62:72–82. doi: 10.1107/S0907444905036693. [DOI] [PubMed] [Google Scholar]
- 32.Collaborative Computational Project Number 4. The CCP4 suite: programs for protein crystallography. Acta Crystallogr D Biol Crystallogr. 1994;50:760–763. doi: 10.1107/S0907444994003112. [DOI] [PubMed] [Google Scholar]
- 33.de Graaff R, Abrahams J, Pannu N. Crank: mew methods for automated macromolecular crystal structure solution. Structure. 2004;12:1753–1761. doi: 10.1016/j.str.2004.07.018. [DOI] [PubMed] [Google Scholar]
- 34.Storoni L, Read R. Simple algorithm for a maximum-likelihood SAD function. Acta Crystallogr D Biol Crystallogr. 2004;60:1220–1228. doi: 10.1107/S0907444904009990. [DOI] [PubMed] [Google Scholar]
- 35.Abrahams JP, Leslie AGW. Methods used in the structure determination of bovine mitochondrial F1 ATPase. Acta Crystallogr D Biol Crystallogr. 1996;52:30–42. doi: 10.1107/S0907444995008754. [DOI] [PubMed] [Google Scholar]
- 36.Cowtan K. The Buccaneer software for automated model building. 1. Tracing protein chains. Acta Crystallogr D Biol Crystallogr. 2006;62:1002–1011. doi: 10.1107/S0907444906022116. [DOI] [PubMed] [Google Scholar]
- 37.Battye TGG, Kontogiannis L, Johnson O, Powell HR, Leslie AGW. iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLM. Acta Crystallogr D Biol Crystallogr. 2011;67:271–281. doi: 10.1107/S0907444910048675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kelley LA, Sternberg MJE. Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc. 2009;4:363–371. doi: 10.1038/nprot.2009.2. [DOI] [PubMed] [Google Scholar]
- 39.Stein N. CHAINSAW: a program for mutating pdb files used as templates in molecular replacement. J Appl Crystallogr. 2008;41:641–643. [Google Scholar]
- 40.McCoy A, Grosse-Kunstleve R, Adams P, Winn M, Storoni L, Read R. Phaser crystallographic software. J Appl Crystallogr. 2007;40:658–674. doi: 10.1107/S0021889807021206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol Crystallogr. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
- 42.Murshudov GN, Vagin AA, Dodson EJ. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr D Biol Crystallogr. 1997;53:240–255. doi: 10.1107/S0907444996012255. [DOI] [PubMed] [Google Scholar]
- 43.Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW, Arendall WB, 3rd, Snoeyink J, Richardson JS, et al. MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 2007;35:375–383. doi: 10.1093/nar/gkm216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.DeLano W. 2002. Pymol: an open-source molecular graphics tool. CCP4 Newsletter On Protein Crystallography.
- 45.Buchan DWA, Ward SM, Lobley AE, Nugent TCO, Bryson K, Jones DT. Protein annotation and modelling servers at University College London. Nucleic Acids Res. 2010;38:563–568. doi: 10.1093/nar/gkq427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Holm L, Kääriäinen S, Wilton C, Plewczynski D. Using Dali for structural comparison of proteins. Curr Protoc Bioinformatics. 2006;14:5.5.1–5.5.24. doi: 10.1002/0471250953.bi0505s14. [DOI] [PubMed] [Google Scholar]
- 47.Ashkenazy H, Erez E, Martz E, Pupko T, Ben-Tal N. ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res. 2010;38:W529–W533. doi: 10.1093/nar/gkq399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Suzek BE, Huang H, McGarvey P, Mazumder R, Wu CH. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics. 2007;23:1282–1288. doi: 10.1093/bioinformatics/btm098. [DOI] [PubMed] [Google Scholar]
- 49.Katoh K, Misawa K, Kuma K-I, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30:3059–3066. doi: 10.1093/nar/gkf436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]