Abstract
The general transcription factor IID (TFIID) plays a central role in the initiation of RNA polymerase II-dependent transcription by nucleating pre-initiation complex (PIC) assembly at the core promoter. TFIID comprises the TATA-binding protein (TBP) and 13 TBP-associated factors (TAF1-13), which specifically interact with a variety of core promoter DNA sequences. Here we present the structure of human TFIID in complex with TFIIA and core promoter DNA, determined by single-particle cryo-electron microscopy (cryo-EM) at sub-nanometer resolution. All core promoter elements are contacted by subunits of TFIID, with TAF1 and TAF2 mediating major interactions with the downstream promoter. TFIIA bridges the TBP-TATA complex with lobe B of TFIID. We also present the cryo-EM reconstruction of a fully-assembled human TAF-less PIC. Superposition of common elements between the two structures provides novel insights into the general role of TFIID in promoter recognition, PIC assembly, and transcription initiation.
Initiation of transcription by RNA polymerase II (Pol II) represents a major control point for eukaryotic cells, and its regulation is the primary means of differential gene expression in metazoans1. A prerequisite for Pol II transcription is the recruitment of the general transcription factors TFIIA, -B, -D, -E, -F, and –H, to the core promoter, where they assemble with Pol II into the preinitiation complex (PIC)2. The process is thought to begin with the recruitment of TFIID and TFIIA to the core promoter, followed by TFIIB, TFIIF, and Pol II, and ending with TFIIE and TFIIH2,3.
TFIID is a ~1 megadalton complex consisting of the TATA-binding protein (TBP) and 13 TBP-associated factors (TAF1-13)4,5. TBP and TAFs mediate specific interactions with a variety of core promoter sequences6–10 and other components of the PIC11–14, establishing TFIID as the primary core promoter recognition factor that nucleates PIC assembly. Additionally, TAFs mediate regulatory signals by interacting with transcriptional activators or epigenetic marks, and TFIID has been shown to be required for the initiation of activated transcription5.
Despite its critical role in transcription, little is known regarding the arrangement of subunits within TFIID and the structural bases of their interactions with DNA and the transcriptional machinery. The lack of a recombinant expression system for full TFIID necessitates purification from endogenous sources, which limits the yield that can be used for structural studies. There are crystallographic structures for domains of several TFIID subunits15–25, but only low-resolution electron microscopy (EM) structures of the TFIID holocomplex and subcomplexes25–33.
Here, we present the cryo-EM structure of human TFIID bound to TFIIA and core promoter DNA, determined by single particle cryo-EM at 7–16 Å resolution. The structure reveals the position of the TBP-TFIIA-TATA subcomplex and defines the path and register for promoter DNA. Our study also shows the locations of TAF1, -2, -6, -7, and -8, and implicates specific elements within TAF1 and TAF2 in mediating interactions with downstream core promoter DNA. We also present the cryo-EM reconstruction of a human TAF-less PIC containing Pol II, promoter DNA, TBP, TFIIA, -B, -F, -S, -E, and -H. By superimposing the common elements between the two structures, we propose a model for the complete TFIID-based PIC that provides novel insights into the intertwined roles of TFIID in promoter recognition, PIC assembly, and transcription initiation.
Overall structure of promoter-bound TFIID
Human TFIID has a horseshoe shape, with lobes A, B and C surrounding a central cavity26,29. Our previous cryo-EM studies revealed that human TFIID adopts two major conformations, termed the canonical and rearranged states, that differ in the position of lobe A33. In the canonical state, lobe A is attached to lobe C, while in the rearranged state, which is the conformation in which TFIID binds promoter DNA, lobe A adopts a position proximal to lobe B. To reduce the conformational and compositional heterogeneity limiting the resolution of our previous studies (~30 Å), we purified human promoter-bound TFIID complexes using the super core promoter (SCP) sequence. This composite promoter was designed to maximize transcriptional output by increasing the affinity of TFIID for DNA through the presence of several naturally occurring promoter motifs (TATA, Inr, MTE, and DPE)34. Purification of SCP-bound TFIID in the presence of TFIIA (see Methods) resulted in more homogeneous TFIID-IIA-SCP complexes, which we then used in single-particle cryo-EM to obtain a reconstruction with an overall resolution of 10.2 Å (Extended Data Fig. 1).
The shape of the human promoter-bound TFIID is consistent with previous lower resolution reconstructions26,29,33. In our present structure, lobe A appears separated into a smaller lobe (lobe A1) that is more stably positioned with respect to the BC core, and a highly flexible lobe A2 (Fig. 1a and Extended Data Fig. 1e). In order to improve the resolution of the more stable core (comprising lobes A1, B and C), lobe A2 was excluded from the references used during subsequent three-dimensional classification and refinement (see Methods). This procedure led to an improved reconstruction of the promoter-bound TFIID core with an overall resolution of 8.7 Å (Fig. 1b and Extended Data Fig. 2a, c–e).
TFIIA-TBP-TATA subcomplex and DNA path
We could easily localize the TBP-TFIIA-TATA ternary complex within lobe A1 by rigid body docking of the crystal structure19 into the cryo-EM density (Fig. 1c and Supplementary Video). This result agrees with previous EM studies using gold labeling that localized the TATA-containing upstream promoter region to lobes A and B, and the downstream region to lobe C of the rearranged TFIID33. The position of TBP and TFIIA in our structure, together with previous lower-resolution reconstructions of promoter-bound TFIID-IIA and of TFIID alone33 (Extended Data Fig. 3a) are consistent with the proposal that TBP resides in the mobile lobe A and thus changes position during TFIID rearrangement.
In our promoter-bound complex, TFIIA appears to serve as a bridge between TBP and lobe B of TFIID. Interestingly, DNAse I footprinting of TFIID-bound SCP showed that only in the presence of TFIIA is the TATA box protected33, supporting the idea that TFIIA is essential for TBP positioning for DNA engagement in the rearranged state. The tip of the TFIIA four-helix bundle is oriented towards lobe B, in agreement with the finding that mutations within this region of TFIIA affect its interaction with TFIID35 (Fig. 1c and Extended Data Fig. 3b). A density connecting the TFIIA β-barrel with lobe B of TFIID and the DNA near position −16 (Fig. 1b, bottom and Extended Data Fig. 3b) cannot reasonably accommodate any TFIIA portions unmodeled in the crystal structure. On the other hand, footprinting has shown a TFIID-dependent protection from DNase I cleavage in this region of promoter DNA33, suggesting the connecting density likely corresponds to one of the TAFs in lobe B.
We could model the SCP DNA from −40 to +42 base pairs (bp) relative to the transcription start site (TSS), including all SCP motifs (TATA, Inr, MTE, and DPE), using the bent TATA DNA as an anchoring point for assigning the base pair register (Fig. 1b and Supplementary Video). The MPE-Fe cleavage pattern of SCP DNA bound to TFIID-IIA33 can be mapped onto the structure with high correspondence between protected sequences and protein contacts (Extended Data Fig. 3c). Mapping the downstream core element (DCE)9 onto our structure strongly suggests that TFIID uses very similar protein elements to interact with this alternative promoter motif (Extended Data Fig. 3d).
Lobe C architecture and downstream promoter binding
The structural stability of lobe C and bound downstream promoter DNA relative to the rest of the complex allowed us to improve the resolution in this region (to ~7–12 Å, with an 8.2 Å average) through further local 3D classification and alignment against a masked reference (Extended Data Fig. 2b–f). The crystal structure of a human TAF1-TAF7 complex24 that includes the highly-conserved central and N-terminal fragments of TAF1 and TAF7, respectively, could be unambiguously docked as a rigid body into the density adjacent to the downstream core promoter (Fig. 2a and Supplementary Video), in agreement with its reported ability to bind DNA with a preference for the downstream sequence of the SCP24. The docking reveals that TAF1 is the primary mediator of downstream promoter binding, contributing contacts that span 34 bp of DNA (positions −3 to +31; Fig. 2a). The winged helix (WH) domain of TAF1 forms a major interaction at the junction of the MTE and DPE promoter motifs (Fig. 2a–c). Superposition of the TAF1 WH-DNA complex with other DNA-binding WH proteins confirms that it shares a common mode of DNA recognition36 (Extended Data Fig. 4a). The third α-helix (α3) of this domain inserts into the major groove, positioning three conserved positively charged residues (R864, K865, K868) for specific interaction with the MTE (Fig. 2b–c and Extended Data Fig. 4b), which supports the finding that mutation of these residues to alanine ablates binding of the TAF1-TAF7 module to promoter DNA24. Additional conserved positively charged residues within the extended β-wing (R875) and N-terminal end of the first α-helix (α1; K818) of the WH domain form additional promoter contacts within the minor groove of the DPE and near the upstream boundary of the MTE, respectively (Fig. 2b–c and Extended Data Fig. 4b). There is a small protein density contacting the minor groove of the Inr (Fig. 2a, d). We propose that it corresponds to the portion of TAF1 between residues 993–1075, which is disordered in the crystal structure but is well conserved among metazoans and is predicted to be ~50% α-helical (Extended Data Fig. 4c–e). The residues near both termini of this missing stretch also appear to form contacts with the DNA between the Inr and MTE (Fig. 2d).
Superposition of the previous low-resolution cryo-EM reconstruction of TFIID in the canonical state33 with our promoter-bound TFIID structure reveals that, in the former, lobe A is attached to lobe C near the newly-identified position of the TAF1/TAF7 subcomplex (Fig. 2e). The N-terminal domain of TAF1 (TAND) contains two subdomains known to bind the concave and convex surfaces of TBP, respectively, thereby competing with TBP binding to DNA and TFIIA23,37. It is likely that in the canonical state of TFIID, TBP is at least partially inhibited from binding to promoter DNA through interactions with the TAND. In the rearranged promoter-bound state of TFIID, however, TBP is at the opposite end of the core promoter from the identified region of TAF1. Binding of TFIIA to the convex side of TBP likely plays a role in releasing TBP from inhibition by the TAND23,37. Additionally, TFIIA contributes to the localization of TBP in the rearranged state of TFIID through its interaction with lobe B. Thus, our studies suggest that the conformational rearrangement of TFIID and the binding of TFIIA are coupled and play critical roles in modulating the handoff of TBP to the upstream promoter region.
Previous studies suggested that TAF1 interacts with promoter DNA as a complex with TAF28. The conserved N-terminal portion of TAF2 is homologous to M1-family aminopeptidases38 and we could unambiguously assign the density adjacent to TAF1 and the downstream promoter to the TAF2 aminopeptidase-like domain (APD) by fitting the structure of the human endoplasmic reticulum aminopeptidase (ERAP1)39 into our map (Extended Data Fig. 5a–c). We were able to generate a complete atomic model of the TAF2 APD (residues 27–975) through homology modeling and flexible fitting into the cryo-EM density (Fig. 3a, b, Extended Data Fig. 5a–d, Supplementary Video, and Methods). All TAF2 interactions with promoter DNA are mediated by APD domain 3, with the primary contact involving a highly conserved loop of the β-sandwich that interacts with the MTE (Fig. 3b, c and Extended Data Fig. 5e). Contacts with TAF1 are exclusively contributed by APD domain 2, involving the regions from ~467 to ~514 and from ~558 to ~561 of TAF2 (Fig. 3c, d).
Structural TAFs and unassigned density
We were able to assign the majority of the remaining lobe C density to two copies of the C-terminal HEAT repeat domain of TAF6 (TAF6C) by fitting the crystal structure of the A. locustae orthologue22 (Fig. 4a, Extended Data Fig. 6a, b, and Supplemental Video). We propose that this part of TAF6 forms a homodimer that effectively bridges the downstream promoter-interacting TAFs (TAF1, -2 and -7) with lobe B (Fig. 4b, e). The TAF6C density, at < 9 Å resolution, was sufficient to unambiguously confirm the alignment with all ten α-helices in the crystal structure (Fig. 4a), and the density at the C-terminal region of both TAF6C copies is indicative of the presence of additional C-terminal α-helices, which are predicted to exist in the human protein but are missing in the crystallized orthologue (Extended Data Fig. 6b, c). We were unable to detect density near either copy of the TAF6C homodimer for the N-terminal histone fold domain of TAF6, which forms a heterodimer with the histone fold domain of TAF915. This result, which suggests that the TAF6 histone fold is flexibly attached and not critical to the structural integrity of the core TFIID, agrees with the finding that the human isoform TAF6δ, which lacks a critical part of its histone fold domain, integrates into an active TFIID complex that retains all TAFs except TAF940.
After accounting for the portions of TAF2 and TAF6 that we could model into lobe C, there remains clear density for two additional α-helices bridging TAF2 and TAF6 that we were not able to assign to either of these TAFs (Extended Data Fig. 6d). We propose that these helices are contributed by the C-terminal region of TAF8 because: (1) TAF8 associates directly with TAF2 and mediates its nuclear import and incorporation into TFIID through its C-terminal region25; (2) a fragment of the region critical for TAF2 binding (residues ~140–200)25 is predicted to harbor a ~26 residue α-helix, the length of the longer helical density we observe bridging TAF2 and TAF6 (Fig. 4c–e, Extended Data Fig. 6e, and Supplemental Video); (3) TAF8 exhibits robust crosslinking to TAF6 within reconstituted TFIID subcomplexes25.
We were unable to localize the positions for the rest of the TAFs (TAF3, -4, -5, -9, -10, -11, -12, and -13) within our promoter-bound TFIID structure. These TAFs must therefore reside within lobes A2 and B, which are not yet resolved at enough resolution for reliable identification of subunits via docking of the available atomic models. The combined volume of these unassigned lobes is consistent with the ~300 kDa of structured TAF domains that have yet to be localized (Extended Data Table 1), considering that much of TFIID is predicted to be intrinsically disordered. A previously described recombinant 5TAF subcomplex contains two copies each of TAF4, -5, -6, -9, and -1232,41. TAF6 is the only component of 5TAF that we were able to localize in our map. While lobe B is contiguous with TAF6C, the density for lobe B is not large enough to accommodate two copies of each TAF in the 5TAF subcomplex. Additionally, we do not observe density within lobe B for the distinctive WD40 beta-propeller domain of TAF5 nor the TAF6-TAF9 histone-fold heterodimer, both of which were proposed to contact TAF6C through opposing interfaces within the recombinant 5TAF subcomplex. We therefore conclude that the components of 5TAF are likely divided between lobes B and A2 in the full TFIID complex.
TAF-less PIC structure and full PIC model
In order to gain structural insight into the full PIC assembly, we solved the cryo-EM structure of a simplified, TAF-less PIC containing TBP, Pol II, TFIIA, -B, -F, -S, -E, -H, and SCP DNA. This human TAF-less PIC cryo-EM reconstruction is similar to the one we previously reported using negative stain EM42, but promoter DNA is now visible (Fig. 5a and Supplemental Video). By superimposing the common elements between the TFIID-IIA-SCP complex and the TAF-less PIC (i.e. TBP, TFIIA, promoter DNA), we were able to generate a model of a complete TFIID-based PIC (Fig. 5b and Supplemental Video). Overall the two structures fit well with each other, with significant shape complementarity and minimal steric clashes, which are, however, of potential functional relevance. Superposition of the unmasked TFIID-IIA-SCP reconstruction (including lobe A2) onto our model of the TFIID-based PIC indicates that the observed range of positions for the flexible lobe A2 is overall compatible with the model, without any major clashes (Extended Data Fig. 7a). The proximity of TFIIF and TFIIE to lobe B of TFIID in our model supports the finding that these factors can interact with TAFs11–13, thus implicating TAFs in the recruitment of PIC components (Fig. 5b, c). The unidentified density emanating from TFIIA and TFIID’s lobe B and contacting the DNA downstream of the TATA sequence in the TFIID-IIA-SCP reconstruction overlaps the promoter binding site of the RAP30 WH domain of TFIIF in the TAF-less PIC, suggesting that a structural reorganization occurs in this region upon TFIIF recruitment to the PIC (Fig. 5c).
Our model shows that Pol II docks between the up- and downstream promoter-binding regions of TFIID. The protein density bound to the Inr promoter element, which we attribute to TAF1, docks into the cleft of Pol II, adjacent to its RPB1, RPB2, and RPB5 subunits (Extended Data Fig. 7b). Downstream of the Inr, TAF1, TAF2 and the XPB subunit of TFIIH make complementary promoter contacts on opposite faces of the DNA duplex (Fig. 5b, right). Minor clashes between TAF1 and RPB1, -2, and -5 signify that this region of TFIID undergoes structural rearrangement upon loading of Pol II onto promoter DNA. Additionally, the path of the promoter DNA in the TFIID-IIA-SCP complex deviates from that seen in the TAF-less PIC, especially downstream of the TSS, further supporting a structural rearrangement in TFIID and downstream promoter DNA during PIC assembly (Extended Data Fig. 7c).
As Pol II reads through the DNA downstream of the TSS, TAF1 and TAF2 must disengage from the downstream promoter DNA before Pol II clears the promoter. Indeed, it has been found that upon recruitment of Pol II, promoter-bound TFIID undergoes an isomerization in which the TFIID contacts with promoter DNA downstream of the +10 position are released concomitantly with the engagement of the promoter DNA with Pol II upstream of this position43. Those findings can be mapped onto our model (Fig. 5d) and agree with our structure-based proposal of a reorganization in the downstream region of the PIC. It has also been recently demonstrated that a chemical inhibitor of this isomerization interacts with the intrinsically disordered region of TAF2 and prevents the first round of transcription initiation by blocking the initial recruitment of Pol II44. However, the inhibitor has no effect on reinitiation of transcription, suggesting that the isomerization does not occur during reloading of Pol II. We propose that the isomerization of promoter-bound TFIID required for Pol II recruitment, its engagement with promoter DNA, and its clearance of the promoter during transcription initiation, largely involves the release of downstream promoter contacts by TAF1 and TAF2. Since the same isomerization does not take place during reinitiation, it is likely that TAF1 and TAF2 do not re-form some of these promoter contacts following the first round of initiation. The release of TAF7 from TFIID following PIC assembly has been shown to be required for transcription initiation45, and could potentially serve as the mechanism for preventing re-engagement of the promoter DNA by TAF1 and TAF2.
Recent cryo-EM studies have revealed the binding site of the yeast Mediator complex on Pol II within a minimal transcription initiation complex46,47. Superposition of this structure with our TFIID-based PIC model shows that TFIID and Mediator occupy opposite faces of Pol II (Extended Data Fig. 7d, e and Supplemental Video).
Role of TFIID in transcription initiation
Our structures suggest that a primary function of TFIID during PIC assembly is the proper positioning of TBP on the upstream promoter, which ultimately determines the placement of Pol II relative to the TSS. For the majority of human promoters, which lack a canonical TATA sequence48–50, accurate loading of TBP would be ensured by the TAF subunits of TFIID, which collectively act as a molecular ruler to position TBP at a location on the upstream promoter that is precisely defined by the downstream promoter binding sites of TAF1-TAF2 and the length of the BC core of TFIID. TAFs may also facilitate PIC assembly by contributing to the incorporation of TFIIF and TFIIE to the growing PIC. In order to accommodate the recruitment of Pol II and its subsequent engagement with promoter DNA, an isomerization occurs in which TAF1 and TAF2 likely release some of their contacts with the downstream DNA. Our model suggests that TAFs are generally required for initial PIC assembly and first round of transcription initiation, but that at least some TAFs may be dispensable for the reloading of Pol II.
TAFs are likely also critical for providing additional levels of control of transcription initiation. For instance, competition from the TAND of TAF1 with the binding of TFIIA and DNA to TBP, which we propose involves the conformational rearrangement of TFIID, may present additional opportunities for regulating PIC assembly. The timing and rate of PIC assembly at the promoter will ultimately be regulated through combinatorial interactions involving TAFs, variable promoter sequences, activators, and epigenetic marks. The demonstrated flexible character of TFIID is likely to be an important property for integrating regulatory cues and allowing sequential conformational states that provide checkpoints throughout the processes of PIC assembly and transcription initiation. The structures presented here offer a structural framework for understanding the complex mechanism underlying TFIID function, shedding new light into the overlapping roles of TFIID as both a coactivator and a general platform for PIC assembly in the coordination of transcription initiation.
Methods
No statistical methods were used to predetermine sample size. The experiments were not randomized. The investigators were not blinded to allocation during experiments and outcome assessment.
Preparation of transcription complexes for cryo-EM
TFIID, Pol II, and TFIIH were immunopurified from HeLa cell nuclear extracts following previously established protocols51,52. The human TFIIA used in the reconstitution of both the TFIID-IIA-SCP complex and TAF-less PIC was recombinantly expressed and purified first as three separate polypeptides (TFIIAα 2–58, TFIIAβ 303–376, and TFIIAγ 2–109) from Escherichia coli, then reconstituted into the conserved three-subunit core TFIIA similarly as in Bleichenbacher, et al. (2003)19. The C-terminal DNA-binding core of human TBP (residues 159–339), and full-length TFIIB, TFIIE, TFIIF, and TFIIS were used in the reconstitution of the TAF-less PIC and were recombinantly expressed and purified from Escherichia coli.
The sequence of the promoter DNA construct used in this study was based on the SCP34, except that a BREu element was introduced upstream of the TATA box and an EcoRI restriction enzyme site was included downstream of the DPE element for purification purposes (template 5′-ACTGGGGAATTCCATGGTCCGTAGGCACGTCTGCTCGGCTCGAGTGTTCGATCGCGA CTGAGGACGAACGCGCCCCCACCCCCTTTTATAGGCGCCCTTC-3′; nontemplate 5′-GAAGGGCGCCTATAAAAGGGGGTGGGGGCGCGTTCGTCCTCAGTCGCGATCGAACA CTCGAGCCGAGCAGACGTGCCTACGGACCATGGAATTCCCCAGT-3′). The template strand was modified with a biotin tag at the 5′ end (Integrated DNA Technologies). To generate the duplexed DNA, equimolar amounts of the template and non-template strand oligonucleotides were mixed at a final concentration of 25 μM in water, and the annealing reaction was heated to 95 °C for 5 minutes then allowed to slowly cool to room temperature over 2–3 hours.
To assemble TFIID-IIA-SCP complex, 5.0 pmol TFIIA was first added to 2.5 pmol purified HeLa TFIID in assembly buffer (20 mM HEPES pH 7.9, 0.2 mM EDTA, 10% glycerol, 6 mM MgCl2, 80 mM KCl, 1mM DTT, 0.05% NP-40) and incubated for 5 minutes at 37 °C. A limiting amount of biotinylated SCP DNA (1 pmol) was then added and the assembly reaction was incubated for 10 minutes at 37 °C. The reaction was added to 0.25 μl Streptavidin Mag Sepharose magnetic beads (GE Healthcare) and incubated at 28 °C for 15 minutes. The beads were then washed three times with washing buffer (10 mM HEPES pH 7.9, 3% trehalose, 8 mM MgCl2, 100 mM KCl, 1 mM DTT, 0.025% NP-40). The promoter-bound complex was then eluted by incubating in 3.6 ul of elution buffer (10 mM HEPES pH 7.9, 10 mM MgCl2, 3% trehalose, 50 mM KCl, 1 mM DTT, 0.05% NP-40, 1 unit μl−1 EcoRI-HF (New England BioLabs)) for 30 minutes at 37 °C.
The TAF-less PIC was assembled similarly as before42, except for an additional incubation of TFIIS at a final concentration of 200 nM with the purified PIC prior to application to the grid. We included TFIIS in our initiation assemblies due to its novel role in active PIC formation besides that in elongation53.
Following the restriction digest elution, purified TFIID-IIA-SCP complex or TAF-less PIC was crosslinked with 0.01% glutaraldehyde for 5 minutes on ice, then used immediately for cryo-EM sample preparation.
Electron Microscopy
Cryo-EM samples were prepared on continuous carbon coated C-flat holey carbon grids (Protochips). Grids were plasma cleaned for 10 s in air using a Solarus Plasma Cleaner (Gatan) operating at 10 Watts. Immediately following crosslinking, 4 μl of purified TFIID-IIA-SCP complex or TAF-less PIC was added to the plasma-cleaned grid and loaded into a Vitrobot (FEI). The sample was incubated on the grid for 10 minutes at 4 °C and 100% relative humidity to enhance its absorption onto the carbon substrate, then was blotted and immediately plunge-frozen in liquid ethane. Frozen grids were transferred to a 626 Cryo-Transfer Holder (Gatan) and loaded into a Titan electron microscope (FEI) operating at 300 keV. Images were recorded with a K2 direct electron detector (Gatan) operating in counting mode at a calibrated magnification of 37,879 (1.32 Å pixel−1) and a defocus range of −2 μm to −4 μm, using the Leginon data collection software for semi-automated acquisition targeting. 20-frame exposures were taken at 0.5 s per frame (10 s total exposure time), using a dose rate of 8 e− pixel−1 s−1 (4.6 e− Å−2 s−1 or 2.3 e− Å−2 per frame), corresponding to a total dose of 46 e− Å−2 per micrograph.
Image Processing
The exposure frames were aligned using MotionCorr54 to correct for specimen motion, and the average of the aligned frames was used for initial processing. The CTF parameters of the micrographs were estimated using CTFFIND355. For the TFIID-IIA-SCP complex, RELION56 (version 1.4-beta) was used for automatic selection of 203,163 particles from 1,253 micrographs (Extended Data Fig. 1a). For the TAF-less PIC, 245,501 particles were automatically selected from 855 micrographs using a difference of Gaussians (DoG) particle picker57 within the Appion image processing environment58. All two- and three-dimensional classification and refinement steps were performed within RELION56 (version 1.4-beta).
For the TFIID-IIA-SCP complex, the initial set of 203,163 particles was subjected to an initial three-dimensional classification, with the negative stain reconstruction of the same complex (in which the nucleic acid is not visible) low-pass filtered to 60 Å used as an initial reference (Extended Data Fig. 1b). Three out of five classes in this classification, corresponding to 121,459 particle images, were indicative of promoter-bound complexes and were selected for further processing. Reference-free two-dimensional classification of this set of images was used to select for 56,457 high-quality particle images. The 56,457-particle set was then subjected to three-dimensional refinement and particle polishing procedure within RELION59 to correct for individual particle motion and beam-induced radiation damage of the sample. Due to the low contrast inherent in the images of this sample, the per-frame B-factor plot that is used to model the beam-induced radiation damage was too noisy to use for modeling. Instead, we generated an idealized curve for the dose-dependent B-factor based on cryo-EM data collected on microtubules under similar imaging conditions (Extended Data Fig. 1c), which we applied to our data during the particle polishing step. The resulting set of 56,457 “polished” particles (56k set) was used in all subsequent three-dimensional classification and refinement.
An initial three-dimensional refinement of the 56k set resulted in a reconstruction of the promoter-bound TFIID-IIA complex at 10.2 Å resolution (Extended Data Fig. 1d, e). All resolutions reported herein correspond to the gold-standard FSC = 0.143 criterion60. Local resolution estimation indicated that the density for lobe A2 of TFIID was at much lower resolution than the promoter-bound BC-core (Extended Data Fig. 1e). In order to improve the reconstruction of the promoter-bound BC-core of TFIID, the orientations of the particle images were locally-refined against a reference in which a mask was applied around the BC-core density, effectively excluding the contribution of lobe A2 signal from the alignment. The images were then three-dimensionally classified within the same mask, and one class with 22,050 particles exhibited the lowest error in angular and translational alignment (Extended Data Fig. 2a). This set of 22,050 particle images was then subjected to three-dimensional refinement without using any mask, followed by a local refinement against a reference with a mask around the BC-core. This procedure resulted in an improved reconstruction of the BC-core with an overall resolution of 8.7 Å (Extended Data Fig. 2c).
Three-dimensional classification and local-resolution analysis of the BC-core density indicated further conformational heterogeneity, which could be largely characterized as mobility of lobe B, TBP/TFIIA module, and upstream promoter DNA relative to lobe C and bound downstream promoter DNA (Extended Data Fig. 2d–f). Therefore, we employed a similar strategy to improve the reconstruction of lobe C and bound downstream promoter DNA (Extended Data Fig. 2b). The resulting lobe C density incorporated 28,448 particle images and had an overall resolution of 8.2 Å (Extended Data Fig. 2c).
For the TAF-less PIC, all particle picks were used for an initial three-dimensional classification, using the previously published negative stain reconstruction of the TFIIH-containing PIC (EMDB code 2308), low-pass filtered to 60 Å, as an initial reference. A single class corresponded to the fully assembled, TFIIH-containing TAF-less PIC, comprising 24,290 particle images. A three-dimensional refinement of this set of particles yielded the final reconstruction at 7.2 Å resolution.
Local resolution estimations were performed using the Bsoft software package61, and all final volumes shown in this paper have been automatically sharpened using the post-processing program within RELION and then filtered according to local resolution using the blocfilt program within Bsoft.
Structural modelling
For the TFIID-IIA-SCP complex, densities were initially assigned to specific components by rigid-body docking of known crystal structures (TBP, TFIIA, TATA DNA, TAF1, TAF7) or homologous structures (TAF2 and TAF6) using UCSF Chimera62, ADP_EM63, or Situs64. These structures were used as starting point for flexible refinement using iMODFIT65 when necessary. Reliable homology models were generated with either the SWISS-MODEL server66 or I-TASSER server67.
The TBP-TFIIA-TATA DNA complex was the first structure to be fitted into the density, which was accomplished by docking the crystal structure of this complex19 as a single rigid body. The model of the entire SCP was then generated by extrapolating B-form DNA from TBP-bound TATA box sequence present in the fitted crystal structure, followed by manual bending of the DNA structure using the 3D-DART server68, and finally by flexible fitting of the DNA into the cryo-EM density using iMODFIT65.
The density for the TAF1-TAF7 promoter-binding module was initially identified within the promoter-bound BC-core of TFIID using the unbiased 6D global docking search algorithm implemented in Situs, and the rigid body docking of the crystal structure (PDB code 4RGW)24 was further refined using the higher-resolution lobe C density within Chimera. The secondary and three-dimensional structure prediction for TAF1 993–1074 (putative Inr-binding domain) was performed using the I-TASSER server67.
The density for TAF2 was initially identified through manual docking of the crystal structure of the human endoplasmic reticulum aminopeptidase I (ERAPI, PDB code 2YD0)39 within UCSF Chimera. Homology modelling of the TAF2 APD began with the building of APD domains 1 and 2 within the SWISS-MODEL server66, using the structures of Leukotriene A-4 hydrolase (PDB code 3U9W: 21% sequence identity, 31% similarity, and 79% coverage) and endoplasmic reticulum aminopeptidase 2 (PDB code 3SE6: 16% identity, 30% similarity, and 88% coverage) as templates. While the first 26 N-terminal residues and an insertion (residues 88–124) of APD domain 1 could not be modeled by homology, there are two nearby unassigned densities in contact with APD domain 4 that are likely to account this missing modeling part. TAF2 APD domain 3 was then modeled using templates derived from aminopeptidase N (PDB code 3B34 and 4QME). Despite the high confidence secondary structure prediction score of the 16 predicted alpha helices in APD domain 4, the low sequence identities (between 10 and 19%), coverages (between 50 and 86%), and the conformational variability of the model templates precluded that a single model fit well in the armadillo fold visible in the density. However, we were able to accommodate the distinctive armadillo curvature by merging and flexibly fitting fragments from 2 to 6 helices extracted from different models. The templates of these models were the structures of aminopeptidase N (PDB code 3B34 and 4QME), deoxyhypusine hydroxylase (PDB code 4D4Z), AP2 clathrin adaptor (PDB code 1GW5), and hypothetical protein yibA (PDB code 1OYZ). The missing loops between fragments and domains were modeled ab initio using RCD69 to obtain a complete model. Finally, the full TAF2 APD model was relaxed with Chiron70 and PyRosetta71 to prevent clashes and to improve geometry.
The DUF1546 domain of TAF6 (TAF6C) was modeled from the crystal structure of the Antonospora locustae orthologue (PDB code 4ATG: 23% identity and 81% coverage)22. Two copies of this well conserved structure containing HEAT repeats were easily located by rigid-body docking in the cryo-EM density using Situs64. In the contact region between TAF6 and the TAF2 APD domain 4, there is obvious density for two alpha helices, which we predict to be derived from TAF8 based on biochemical data and secondary structure prediction.
The model for the TAF-less PIC is based on the previously published model using the negative stain EM reconstruction of the PIC containing TBP, TFIIA, SCP DNA, TFIIB, Pol II, TFIIF, TFIIE, and TFIIH. The model for TFIIS was based on a combination of the yeast Pol II-TFIIS complex (PDB code 1Y1V) and the free human TFIIS domain II (PDB code 3NDQ). The model of the TFIID-based PIC was generated by superimposing the promoter DNA downstream of the TATA sequence in the TFIID-IIA-SCP structure with that in the TAF-less PIC structure within UCSF Chimera.
All multi-sequence alignments were performed using Clustal Omega72. Secondary structure predictions were performed with the PSIPRED server73, except where noted. All molecular graphics and analyses were performed with the UCSF Chimera package62.
Extended Data
Extended Data Table 1.
TFIID subunit | Length (residues) | M.W. (kDa) | Expected # of copies | Structured domains* | Span (residues) | M.W. (kDa) |
---|---|---|---|---|---|---|
TBP | 339 | 38 | 1 | DNA bindinga | 159–338 | 20 |
| ||||||
TAF1 | 1872 | 213 | 1 | TANDb | 26–87 | 6 |
DUF3591a | 600–1109 | 59 | ||||
Double bromob | 1359–1625 | 31 | ||||
| ||||||
TAF2 | 1119 | 137 | 1 | Aminopeptidasea | 18–975 | 110 |
| ||||||
TAF3 | 929 | 104 | 1 | Histone foldb | 5–89 | 10 |
Plant homeob | 847–921 | 9 | ||||
| ||||||
TAF4 | 1085 | 110 | 2 | TAFHb,c | 582–678 | 11 |
Histone foldb,c | 870–943 | 9 | ||||
| ||||||
TAF5 | 800 | 87 | 2 | N-terminal 1b,c | 91–124 | 4 |
N-terminal 2b,c | 194–340 | 18 | ||||
WD40 repeatb,c | 460–739 | 31 | ||||
| ||||||
TAF6 | 677 | 73 | 2 | Histone foldb,c | 8–77 | 8 |
HEAT-like repeata,c | 212–436 | 25 | ||||
| ||||||
TAF7 | 349 | 40 | 1 | TAF1-interactinga | 11–154 | 17 |
| ||||||
TAF8 | 310 | 34 | 1 | Histone foldb | 28–120 | 11 |
| ||||||
TAF9 | 264 | 29 | 2 | Histone foldb,c | 13–80 | 8 |
| ||||||
TAF10 | 218 | 22 | 1 | Histone foldb | 113–212 | 11 |
| ||||||
TAF11 | 211 | 23 | 1 | Histone foldb | 113–201 | 10 |
| ||||||
TAF12 | 161 | 18 | 2 | Histone foldb,c | 57–128 | 9 |
| ||||||
TAF13 | 124 | 14 | 1 | Histone foldb | 31–75 | 5 |
Subset of structured domains† | Total M.W. ‡ (kDa) |
---|---|
(a) Fitted domains | 260 |
(b) Unassigned domains | 290 |
(c) Present in the recombinant 5TAF | 250 |
M.W., molecular weight calculated from amino acid sequence; kDa, kilodalton
”Structured domains” indicates domains that have a known structure or are predicted to be structured by sequence homology.
The structured domains comprising each subset are indicated by superscripted letters in the larger table above, corresponding to the letter label of that subset (“a”, “b”, or “c”). Note that each domain is included in either the fitted domains subset or unassigned domains subset, based on whether or not they have been modeled in the present study, respectively. Additionally, domains present in the recombinant 5TAF complex32 comprise subset “c”.
Total M.W. for domain subsets corresponds to the total mass of structured domains and takes into account the expected number of copies for each corresponding subunit
Supplementary Material
Acknowledgments
We thank P. Grob, S. Howes, and R. Zhang for electron microscopy support; T. Houweling for computer support; S. Scheres for technical advice on image processing; A. Patel for useful discussion; C. Inouye for providing us with recombinant TFIIF; S. Zheng for providing TAF4 mAb; D. King for providing peptides. We are thankful to R. Tjian and J. Kadonaga for their comments on the manuscript. This work was funded by NIGMS (GM63072 to E.N.) and Spanish Ministry of Economy and Competitiveness (BFU2013-44306P to P.C). R.L was supported by the NIGMS Molecular Biophysics Training Grant (GM008295). E.N. is a Howard Hughes Medical Institute Investigator.
Footnotes
Supplementary Information is linked to the online version of the paper at www.nature.com/nature.
Author Contributions
R.L. and Y.H. designed and carried out the experiments; J.R.L. and P.C. performed structural modeling ; J.F. purified the TFIID, Pol II, TFIIE, and TFIIH; R.L., Y.H., and E.N. analyzed the data and wrote the paper.
Cryo-EM density maps have been deposited in the Electron Microscopy Data Bank (EMDB) under accession numbers EMD-3304 (TFIID-IIA-SCP complex), EMD-3305 (locally refined BC-core of TFIID-IIA-SCP complex), EMD-3306 (locally refined lobe C of TFIID-IIA-SCP complex) and EMD-3307 (TAF-less PIC). Model coordinates have been deposited in the Protein Data Bank (PDB) under accession number 5FUR.
The authors declare no competing financial interests.
Readers are welcome to comment on the online version of the paper.
References
- 1.Levine M, Cattoglio C, Tjian R. Looping Back to Leap Forward: Transcription Enters a New Era. Cell. 2014;157:13–25. doi: 10.1016/j.cell.2014.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Thomas MC, Chiang CM. The general transcription machinery and general cofactors. Critical reviews in biochemistry and molecular biology. 2006;41:105–178. doi: 10.1080/10409230600648736. [DOI] [PubMed] [Google Scholar]
- 3.Buratowski S, Hahn S, Guarente L, Sharp PA. Five intermediate complexes in transcription initiation by RNA polymerase II. Cell. 1989;56:549–561. doi: 10.1016/0092-8674(89)90578-3. [DOI] [PubMed] [Google Scholar]
- 4.Burley SK, Roeder RG. Biochemistry and structural biology of transcription factor IID (TFIID) Annual review of biochemistry. 1996;65:769–799. doi: 10.1146/annurev.bi.65.070196.004005. [DOI] [PubMed] [Google Scholar]
- 5.Albright SR, Tjian R. TAFs revisited: more data reveal new twists and confirm old ideas. Gene. 2000;242:1–13. doi: 10.1016/s0378-1119(99)00495-3. [DOI] [PubMed] [Google Scholar]
- 6.Verrijzer CP, Chen JL, Yokomori K, Tjian R. Binding of TAFs to core elements directs promoter selectivity by RNA polymerase II. Cell. 1995;81:1115–1125. doi: 10.1016/s0092-8674(05)80016-9. [DOI] [PubMed] [Google Scholar]
- 7.Burke TW, Kadonaga JT. The downstream core promoter element, DPE, is conserved from Drosophila to humans and is recognized by TAFII60 of Drosophila. Genes & Development. 1997;11:3020–3031. doi: 10.1101/gad.11.22.3020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chalkley GE, Verrijzer CP. DNA binding site selection by RNA polymerase II TAFs: a TAF(II)250-TAF(II)150 complex recognizes the initiator. EMBO J. 1999;18:4835–4845. doi: 10.1093/emboj/18.17.4835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lee DH, et al. Functional characterization of core promoter elements: the downstream core element is recognized by TAF1. Molecular and cellular biology. 2005;25:9674–9686. doi: 10.1128/MCB.25.21.9674-9686.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Theisen JW, Lim CY, Kadonaga JT. Three key subregions contribute to the function of the downstream RNA polymerase II core promoter. Molecular and cellular biology. 2010;30:3471–3479. doi: 10.1128/MCB.00053-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hisatake K, et al. Evolutionary conservation of human TATA-binding-polypeptide-associated factors TAFII31 and TAFII80 and interactions of TAFII80 with other TAFs and with general transcription factors. Proc Natl Acad Sci U S A. 1995;92:8195–8199. doi: 10.1073/pnas.92.18.8195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ruppert S, Tjian R. Human TAFII250 interacts with RAP74: implications for RNA polymerase II initiation. Genes Dev. 1995;9:2747–2755. doi: 10.1101/gad.9.22.2747. [DOI] [PubMed] [Google Scholar]
- 13.Dubrovskaya V, et al. Distinct domains of hTAFII100 are required for functional interaction with transcription factor TFIIF beta (RAP30) and incorporation into the TFIID complex. EMBO J. 1996;15:3702–3712. [PMC free article] [PubMed] [Google Scholar]
- 14.Wu SY, Chiang CM. TATA-binding protein-associated factors enhance the recruitment of RNA polymerase II by transcriptional activators. J Biol Chem. 2001;276:34235–34243. doi: 10.1074/jbc.M102463200. [DOI] [PubMed] [Google Scholar]
- 15.Xie X, et al. Structural similarity between TAFs and the heterotetrameric core of the histone octamer. Nature. 1996;380:316–322. doi: 10.1038/380316a0. [DOI] [PubMed] [Google Scholar]
- 16.Birck C, et al. Human TAF(II)28 and TAF(II)18 interact through a histone fold encoded by atypical evolutionary conserved motifs also found in the SPT3 family. Cell. 1998;94:239–249. doi: 10.1016/s0092-8674(00)81423-3. [DOI] [PubMed] [Google Scholar]
- 17.Jacobson RH, Ladurner AG, King DS, Tjian R. Structure and function of a human TAFII250 double bromodomain module. Science. 2000;288:1422–1425. doi: 10.1126/science.288.5470.1422. [DOI] [PubMed] [Google Scholar]
- 18.Werten S, et al. Crystal structure of a subcomplex of human transcription factor TFIID formed by TATA binding protein-associated factors hTAF4 (hTAF(II)135) and hTAF12 (hTAF(II)20) J Biol Chem. 2002;277:45502–45509. doi: 10.1074/jbc.M206587200. [DOI] [PubMed] [Google Scholar]
- 19.Bleichenbacher M, Tan S, Richmond TJ. Novel interactions between the components of human and yeast TFIIA/TBP/DNA complexes. Journal of molecular biology. 2003;332:783–793. doi: 10.1016/s0022-2836(03)00887-8. [DOI] [PubMed] [Google Scholar]
- 20.Bhattacharya S, Takada S, Jacobson RH. Structural analysis and dimerization potential of the human TAF5 subunit of TFIID. Proc Natl Acad Sci U S A. 2007;104:1189–1194. doi: 10.1073/pnas.0610297104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang X, et al. Conserved region I of human coactivator TAF4 binds to a short hydrophobic motif present in transcriptional regulators. Proc Natl Acad Sci U S A. 2007;104:7839–7844. doi: 10.1073/pnas.0608570104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Scheer E, Delbac F, Tora L, Moras D, Romier C. TFIID TAF6-TAF9 complex formation involves the HEAT repeat-containing C-terminal domain of TAF6 and is modulated by TAF5 protein. J Biol Chem. 2012;287:27580–27592. doi: 10.1074/jbc.M112.379206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Anandapadamanaban M, et al. High-resolution structure of TBP with TAF1 reveals anchoring patterns in transcriptional regulation. Nat Struct Mol Biol. 2013;20:1008–1014. doi: 10.1038/nsmb.2611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang H, Curran EC, Hinds TR, Wang EH, Zheng N. Crystal structure of a TAF1-TAF7 complex in human transcription factor IID reveals a promoter binding module. Cell Res. 2014;24:1433–1444. doi: 10.1038/cr.2014.148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Trowitzsch S, et al. Cytoplasmic TAF2-TAF8-TAF10 complex provides evidence for nuclear holo-TFIID assembly from preformed submodules. Nat Commun. 2015;6:6011. doi: 10.1038/ncomms7011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Andel F, 3rd, Ladurner AG, Inouye C, Tjian R, Nogales E. Three-dimensional structure of the human TFIID-IIA-IIB complex. Science. 1999;286:2153–2156. doi: 10.1126/science.286.5447.2153. [DOI] [PubMed] [Google Scholar]
- 27.Brand M, Leurent C, Mallouh V, Tora L, Schultz P. Three-dimensional structures of the TAFII-containing complexes TFIID and TFTC. Science. 1999;286:2151–2153. doi: 10.1126/science.286.5447.2151. [DOI] [PubMed] [Google Scholar]
- 28.Leurent C, et al. Mapping histone fold TAFs within yeast TFIID. EMBO J. 2002;21:3424–3433. doi: 10.1093/emboj/cdf342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Grob P, et al. Cryo-electron microscopy studies of human TFIID: conformational breathing in the integration of gene regulatory cues. Structure. 2006;14:511–520. doi: 10.1016/j.str.2005.11.020. [DOI] [PubMed] [Google Scholar]
- 30.Liu WL, et al. Structural changes in TAF4b-TFIID correlate with promoter selectivity. Mol Cell. 2008;29:81–91. doi: 10.1016/j.molcel.2007.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Papai G, et al. TFIIA and the transactivator Rap1 cooperate to commit TFIID for transcription initiation. Nature. 2010;465:956–960. doi: 10.1038/nature09080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bieniossek C, et al. The architecture of human general transcription factor TFIID core complex. Nature. 2013;493:699–702. doi: 10.1038/nature11791. [DOI] [PubMed] [Google Scholar]
- 33.Cianfrocco Michael A, et al. Human TFIID Binds to Core Promoter DNA in a Reorganized Structural State. Cell. 2013;152:120–131. doi: 10.1016/j.cell.2012.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Juven-Gershon T, Cheng S, Kadonaga JT. Rational design of a super core promoter that enhances gene expression. Nature Methods. 2006;3:917–922. doi: 10.1038/nmeth937. [DOI] [PubMed] [Google Scholar]
- 35.Kraemer SM, Ranallo RT, Ogg RC, Stargell LA. TFIIA interacts with TFIID via association with TATA-binding protein and TAF40. Molecular and cellular biology. 2001;21:1737–1746. doi: 10.1128/MCB.21.5.1737-1746.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gajiwala KS, Burley SK. Winged helix proteins. Curr Opin Struct Biol. 2000;10:110–116. doi: 10.1016/s0959-440x(99)00057-3. [DOI] [PubMed] [Google Scholar]
- 37.Kokubo T, Swanson MJ, Nishikawa JI, Hinnebusch AG, Nakatani Y. The yeast TAF145 inhibitory domain and TFIIA competitively bind to TATA-binding protein. Molecular and cellular biology. 1998;18:1003–1012. doi: 10.1128/mcb.18.2.1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Malkowska M, Kokoszynska K, Rychlewski L, Wyrwicz L. Structural bioinformatics of the general transcription factor TFIID. Biochimie. 2013;95:680–691. doi: 10.1016/j.biochi.2012.10.024. [DOI] [PubMed] [Google Scholar]
- 39.Kochan G, et al. Crystal structures of the endoplasmic reticulum aminopeptidase-1 (ERAP1) reveal the molecular basis for N-terminal peptide trimming. Proc Natl Acad Sci U S A. 2011;108:7745–7750. doi: 10.1073/pnas.1101262108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bell B, Scheer E, Tora L. Identification of hTAF(II)80 delta links apoptotic signaling pathways to transcription factor TFIID function. Mol Cell. 2001;8:591–600. doi: 10.1016/s1097-2765(01)00325-2. [DOI] [PubMed] [Google Scholar]
- 41.Wright KJ, Marr MT, 2nd, Tjian R. TAF4 nucleates a core subcomplex of TFIID and mediates activated transcription from a TATA-less promoter. Proc Natl Acad Sci U S A. 2006;103:12347–12352. doi: 10.1073/pnas.0605499103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.He Y, Fang J, Taatjes DJ, Nogales E. Structural visualization of key steps in human transcription initiation. Nature. 2013;495:481–486. doi: 10.1038/nature11991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Yakovchuk P, Gilman B, Goodrich JA, Kugel JF. RNA polymerase II and TAFs undergo a slow isomerization after the polymerase is recruited to promoter-bound TFIID. J Mol Biol. 2010;397:57–68. doi: 10.1016/j.jmb.2010.01.025. [DOI] [PubMed] [Google Scholar]
- 44.Zhang Z, et al. Chemical perturbation of an intrinsically disordered region of TFIID distinguishes two modes of transcription initiation. Elife. 2015;4 doi: 10.7554/eLife.07777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gegonne A, Devaiah BN, Singer DS. TAF7: traffic controller in transcription initiation. Transcription. 2013;4:29–33. doi: 10.4161/trns.22842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Plaschka C, et al. Architecture of the RNA polymerase II-Mediator core initiation complex. Nature. 2015;518:376–380. doi: 10.1038/nature14229. [DOI] [PubMed] [Google Scholar]
- 47.Tsai KL, et al. Subunit architecture and functional modular rearrangements of the transcriptional mediator complex. Cell. 2014;157:1430–1444. doi: 10.1016/j.cell.2014.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Kim TH, et al. A high-resolution map of active promoters in the human genome. Nature. 2005;436:876–880. doi: 10.1038/nature03877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Carninci P, et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat Genet. 2006;38:626–635. doi: 10.1038/ng1789. [DOI] [PubMed] [Google Scholar]
- 50.Cooper SJ, Trinklein ND, Anton ED, Nguyen L, Myers RM. Comprehensive analysis of transcriptional promoter structure and function in 1% of the human genome. Genome research. 2006;16:1–10. doi: 10.1101/gr.4222606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Pal M, Ponticelli AS, Luse DS. The role of the transcription bubble and TFIIB in promoter clearance by RNA polymerase II. Molecular cell. 2005;19:101–110. doi: 10.1016/j.molcel.2005.05.024. [DOI] [PubMed] [Google Scholar]
- 52.Revyakin A, et al. Transcription initiation by human RNA polymerase II visualized at single-molecule resolution. Genes Dev. 2012;26:1691–1702. doi: 10.1101/gad.194936.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kim B, et al. The transcription elongation factor TFIIS is a component of RNA polymerase II preinitiation complexes. Proc Natl Acad Sci U S A. 2007;104:16068–16073. doi: 10.1073/pnas.0704573104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Li X, et al. Electron counting and beam-induced motion correction enable near-atomic-resolution single-particle cryo-EM. Nat Methods. 2013;10:584–590. doi: 10.1038/nmeth.2472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Mindell JA, Grigorieff N. Accurate determination of local defocus and specimen tilt in electron microscopy. J Struct Biol. 2003;142:334–347. doi: 10.1016/s1047-8477(03)00069-8. [DOI] [PubMed] [Google Scholar]
- 56.Scheres SH. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J Struct Biol. 2012;180:519–530. doi: 10.1016/j.jsb.2012.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Voss NR, Yoshioka CK, Radermacher M, Potter CS, Carragher B. DoG Picker and TiltPicker: software tools to facilitate particle selection in single particle electron microscopy. J Struct Biol. 2009;166:205–213. doi: 10.1016/j.jsb.2009.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Lander GC, et al. Appion: an integrated, database-driven pipeline to facilitate EM image processing. J Struct Biol. 2009;166:95–102. doi: 10.1016/j.jsb.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Scheres SH. Beam-induced motion correction for sub-megadalton cryo-EM particles. Elife. 2014;3 doi: 10.7554/eLife.03665. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Henderson R, et al. Outcome of the first electron microscopy validation task force meeting. Structure. 2012;20:205–214. doi: 10.1016/j.str.2011.12.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Heymann JB. Bsoft: image and molecular processing in electron microscopy. J Struct Biol. 2001;133:156–169. doi: 10.1006/jsbi.2001.4339. [DOI] [PubMed] [Google Scholar]
- 62.Pettersen EF, et al. UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 63.Garzon JI, Kovacs J, Abagyan R, Chacon P. ADP_EM: fast exhaustive multi-resolution docking for high-throughput coverage. Bioinformatics. 2007;23:427–433. doi: 10.1093/bioinformatics/btl625. [DOI] [PubMed] [Google Scholar]
- 64.Wriggers W. Using Situs for the integration of multi-resolution structures. Biophys Rev. 2010;2:21–27. doi: 10.1007/s12551-009-0026-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lopez-Blanco JR, Chacon P. iMODFIT: efficient and robust flexible fitting based on vibrational analysis in internal coordinates. Journal of structural biology. 2013;184:261–270. doi: 10.1016/j.jsb.2013.08.010. [DOI] [PubMed] [Google Scholar]
- 66.Biasini M, et al. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic acids research. 2014;42:W252–258. doi: 10.1093/nar/gku340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Yang J, et al. The I-TASSER Suite: protein structure and function prediction. Nat Methods. 2015;12:7–8. doi: 10.1038/nmeth.3213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.van Dijk M, Bonvin AM. 3D-DART: a DNA structure modelling server. Nucleic acids research. 2009;37:W235–239. doi: 10.1093/nar/gkp287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Chys P, Chacón P. Random coordinate descent with spinor-matrices and geometric filters for efficient loop closure. Journal of Chemical Theory and Computation. 2013;9:1821–1829. doi: 10.1021/ct300977f. [DOI] [PubMed] [Google Scholar]
- 70.Ramachandran S, Kota P, Ding F, Dokholyan NV. Automated minimization of steric clashes in protein structures. Proteins. 2011;79:261–270. doi: 10.1002/prot.22879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Chaudhury S, Lyskov S, Gray JJ. PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta. Bioinformatics. 2010;26:689–691. doi: 10.1093/bioinformatics/btq007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Sievers F, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Buchan DW, Minneci F, Nugent TC, Bryson K, Jones DT. Scalable web services for the PSIPRED Protein Analysis Workbench. Nucleic Acids Res. 2013;41:W349–357. doi: 10.1093/nar/gkt381. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.