Skip to main content
eLife logoLink to eLife
. 2020 Nov 16;9:e61503. doi: 10.7554/eLife.61503

Structure of the human core transcription-export complex reveals a hub for multivalent interactions

Thomas Pühringer 1,, Ulrich Hohmann 1,2,, Laura Fin 1, Belén Pacheco-Fiallos 1, Ulla Schellhaas 1, Julius Brennecke 2, Clemens Plaschka 1,
Editors: Karsten Weis3, James L Manley4
PMCID: PMC7744094  PMID: 33191911

Abstract

The export of mRNA from nucleus to cytoplasm requires the conserved and essential transcription and export (TREX) complex (THO–UAP56/DDX39B–ALYREF). TREX selectively binds mRNA maturation marks and licenses mRNA for nuclear export by loading the export factor NXF1–NXT1. How TREX integrates these marks and achieves high selectivity for mature mRNA is poorly understood. Here, we report the cryo-electron microscopy structure of the human THO–UAP56/DDX39B complex at 3.3 Å resolution. The seven-subunit THO–UAP56/DDX39B complex multimerizes into a 28-subunit tetrameric assembly, suggesting that selective recognition of mature mRNA is facilitated by the simultaneous sensing of multiple, spatially distant mRNA regions and maturation marks. Two UAP56/DDX39B RNA helicases are juxtaposed at each end of the tetramer, which would allow one bivalent ALYREF protein to bridge adjacent helicases and regulate the TREX–mRNA interaction. Our structural and biochemical results suggest a conserved model for TREX complex function that depends on multivalent interactions between proteins and mRNA.

Research organism: Human

eLife digest

The DNA of human and other eukaryotic cells is stored inside a compartment called the nucleus. DNA carries the genetic code and provides a blueprint for all of the cell’s proteins. However, protein production occurs outside the nucleus, in the main body of the cell. To transmit genetic information from one compartment to the other, the DNA sequences are first transcribed into another molecule called messenger RNA, or mRNA for short. Once made, mRNA exits the nucleus and enters the cell’s main body to encounter the machinery that translates its sequence into a protein.

Before mRNA can exit the nucleus, it must first undergo a series of modifications, which result in the mRNA molecule being successively bound to specific proteins. Once mRNA has passed through these steps, it is recognized by the transcription-and-export complex, or TREX for short, which is comprised of several proteins. When TREX binds to mRNA, it adds on a final protein which allows the mRNA molecule to be transported out of the nucleus. However, it remained unclear how TREX selects the completed mRNA-protein complexes that are ready for export while at the same time recognizing the wide variety of mRNA molecules produced by cells.

Now, Pühringer and Hohmann et al. have identified the first three-dimensional structure of the core of the human TREX complex using a technique called cryo-electron microscopy. This revealed that the seven proteins of the TREX core assemble into a large complex that has four copies of each protein. The structure suggests that TREX can bind to mRNA and its attached proteins in various ways. These different binding arrangements may help the complex select which mRNA molecules are fully modified and ready to be exported. The structure also sheds light on how mutations in this complex can lead to diseases such as Beaulieu–Boycott–Innes syndrome (BBIS).

This work will help guide future research into the activity of TREX, including how its structure changes when it binds to mRNA and deposits the final transport protein. Identifying these structures will make it easier to design experiments that target specific aspects of TREX activity and provide new insights into how these complexes work.

Introduction

Eukaryotic protein-coding mRNA is matured in the nucleus before it is exported and translated in the cytoplasm. During maturation, mRNA is capped, spliced, and poly-adenylated to form fully processed and packaged messenger ribonucleoprotein complexes (mRNPs) (Köhler and Hurt, 2007; Singh et al., 2015; Heath et al., 2016; Stewart, 2019; Xie and Ren, 2019). The transcription and export (TREX) complex is recruited during transcription (Heath et al., 2016; Viphakone et al., 2019) to maturing mRNPs through mRNA and protein interactions at the mRNP 5’-end, splice junctions, and mRNP 3’-end (Cheng et al., 2006; Merz et al., 2007; Gromadzka et al., 2016; Shi et al., 2017). TREX subsequently loads the global mRNA-export factor NXF1–NXT1, to license mRNAs for export (Strässer and Hurt, 2001; Köhler and Hurt, 2007; Hautbergue et al., 2008; Taniguchi and Ohno, 2008). By chaperoning the nascent mRNA, TREX also inhibits the formation of harmful DNA-RNA hybrids, called R-loops, and protects genome integrity (Luna et al., 2019; Pérez-Calero et al., 2020).

The TREX complex is found in all eukaryotes and contains the multi-subunit THO complex, the DEXD-box RNA helicase UAP56/DDX39B (yeast Sub2), and an RNA export adapter such as ALYREF (yeast Yra1) (Strässer et al., 2002). The human THO complex comprises six subunits, THOC1, −2, –3, −5, –6, and −7, of which four have known counterparts in the yeast Saccharomyces cerevisiae (Sc): THOC1 (yeast Hpr1), −2 (yeast Tho2), −3 (yeast Tex3), and −7 (yeast Mft1) (Heath et al., 2016; Mitchell et al., 2019). Additional TREX interactors include SARNP (yeast Tho1), the mammalian protein ZC3H11A and ALYREF-like proteins UIF, LUZP4, POLDIP3, and CHTOP (Dufu et al., 2010; Heath et al., 2016). In this study we focus on the conserved TREX complex (Heath et al., 2016; Xie and Ren, 2019): THO–UAP56/DDX39B–ALYREF, and hereafter refer to UAP56/DDX39B as UAP56.

In the current model of mRNA export, the THO complex is recruited to the mRNP and delivers UAP56, which subsequently ‘clamps’ the mRNA. THO–UAP56 binds the export adapter ALYREF and together they promote loading of the export factor NXF1–NXT1 onto mRNA (Hautbergue et al., 2008; Köhler and Hurt, 2007; Strässer and Hurt, 2001; Taniguchi and Ohno, 2008). However, the structural basis for TREX activities remains poorly understood despite active study (Peña et al., 2012; Pérez-Alvarado et al., 2003; Ren et al., 2017; Shi et al., 2004; Xie and Ren, 2019).

Here, we present the cryo-electron microscopy (cryo-EM) structure of the human THO–UAP56 complex at 3.3 Å resolution. We resolved the THO architecture, its binding mode to UAP56, and can explain mutations linked to human disease (Heath et al., 2016). Our results show that THO–UAP56 adopts a tetrameric 28-subunit architecture, suggesting how TREX may bind multiple mRNA and mRNP regions at the same time. Taken together, the data reveal a conserved mechanism for TREX function that depends on multivalent protein–mRNA interactions.

Results and discussion

To gain structural insights into TREX function, we prepared the human THO complex by co-expressing its six subunits in insect cells (THOC1, −2 residues 1–1203, −3, –5, −6,–7) and added recombinant UAP56 to reconstitute the THO–UAP56 complex (Figure 1—figure supplement 1a, Materials and methods). The complex was imaged under cryogenic conditions using a K3 direct electron detector, yielding ~1.6 million single-particle images. Unsupervised 3D particle sorting and focused refinements resulted in cryo-EM densities of THO–UAP56 at nominal resolutions between 3.3 and 4.7 Å (maps A-E) (Figure 1—figure supplements 14, Materials and methods). The densities enabled us to build a near-complete atomic model of a 14-subunit THO–UAP56 dimer, which assembles into a 28-subunit tetramer with a total molecular weight of 1.8 MDa (Figures 1a–c and 2, Figure 1—figure supplements 5 and 6, Video 1). The cryo-EM data showed that two THO–UAP56 tetramers associate loosely to form an octamer, however, we could not observe direct molecular contacts between the two tetramers (Figure 1—figure supplement 1c,d). To determine the in vivo oligomeric state of THO–UAP56, we in addition purified the endogenous complex from human K562 cells and calculated negative stain 2D class averages (Figure 1—figure supplement 1f–h). The endogenous THO–UAP56 complex formed a tetramer, indistinguishable from the modeled tetramer structure, suggesting that this state exists in vivo (Figure 1—figure supplement 1h).

Figure 1. Human THO–UAP56 complex structure.

(a) Domain organization of human THO–UAP56 complex subunits. Solid and dashed lines indicate atomic and backbone regions of the THO–UAP56 structure, respectively. The UAP56 RecA1 lobe is not observed in the structure and indicated with a light gray line. Orthologous yeast protein names are in parenthesis. Color code used throughout. CC, coiled coil; tRWD-N and -C, N- and C-terminal tandem RWD. (b) Surface representation of the THO–UAP56 tetramer structure. Dimer 1 monomers 1A (dark turquoise) and 1B (light turquoise), and dimer 2 monomers 2A (dark blue), 2B (light blue) are indicated. (c) Human THO–UAP56 structure front and left side views. The UAP56 RecA1 lobe was not visible in the cryo-EM density, but is modeled here by superimposing a UAP56 (yeast Sub2) homology model based on a low-resolution THO–Sub2 crystal structure (Ren et al., 2017) (PDB ID 5SUQ) and is indicated with an asterisk. The THOC2 CD domain is disordered (Peña et al., 2012), was truncated for recombinant THO–UAP56 production, and is shown as a dashed circle.

Figure 1.

Figure 1—figure supplement 1. Biochemical characterization and EM of recombinant and endogenous THO–UAP56 complexes.

Figure 1—figure supplement 1.

(a) Protein analysis of the recombinant human THO–UAP56 complex (SDS-PAGE stained with Coomassie blue). (b) Cryo-EM micrograph of the recombinant THO–UAP56 complex. Scale bar, 50 nm. (c) Representative cryo-EM 2D class averages of THO–UAP56 reveal flexibility between and within two adjacent tetramers. (d) Composite cryo-EM density of the THO–UAP56 complex shown in two orthogonal views, front and left side. Colors indicate the respective cryo-EM densities used for modeling (A, dark violet; B, cyan; C, light green; D, dark green; E, magenta). Map A was low-pass filtered to 30 Å (gray, transparent) to show the full tetramer density (front view) and the location of the second tetramer (left side view), which is highly mobile relative to the first. The sharpened densities are shown and were aligned using overlapping regions. See Materials and methods, and Figure 1—figure supplements 2 and 3 for details on image processing. (e) Composite cryo-EM density of THO–UAP56 superimposed on a ribbon model of the final THO–UAP56 model, colored as in Figure 1. (f) The endogenous THO complex forms an oligomer in vivo that does not depend on RNA. THOC1–GFP was expressed ectopically in K562 cells using a lentivirus (lanes 4–6) and purified via the GFP-tag in a single step in absence (lane 5) or presence of Benzonase to degrade nucleic acid (lane 6). Untagged THOC1 did not bind the GFP-beads (lanes 1–3). Proteins where visualized by western blot using an anti-THOC1 antibody (see Materials and methods). (g) Protein analysis of the endogenous human THO–UAP56 complex purified in two steps (SDS-PAGE, Silver stain). See Materials and methods for details. (h) Comparison of representative negative stain EM 2D class averages of the endogenous THO–UAP56 complex (top) to corresponding views of the THO–UAP56 tetramer structure (bottom).

Figure 1—figure supplement 2. THO–UAP56 cryo-EM image classification.

Figure 1—figure supplement 2.

Three-dimensional image classification tree of the THO–UAP56 cryo-EM data set using an initial cryoSPARC 2.0 (Punjani et al., 2017) model determined from 30,000 particles (Materials and methods). The combined two data sets yielded 1,570,265 THO–UAP56 picked octamer particles using Warp 1.0.7 (Tegunov and Cramer, 2019), from which a total of 3,140,530 tetramer units (100%) were extracted and subsequently analyzed using RELION 3.1 (Scheres, 2012). Classes are colored in gray and visualized at the same density threshold. The percentage of tetramer units contributing to each class is provided. The type of mask and overall resolution is indicated for each 3D refinement subsequent to classification and maps are colored as in Figure 1—figure supplement 1d. For additional details see the Materials and methods.

Figure 1—figure supplement 3. THO–UAP56 cryo-EM reconstructions.

Figure 1—figure supplement 3.

(a) Gold-standard Fourier shell correlation (FSC = 0.143) of the respective A, B, C, D, and E cryo-EM reconstructions. (b) Orientation distribution plots for all particles contributing to the respective A, B, C, D, and E cryo-EM reconstructions. (c) Front and left side views of the composite THO–UAP56 complex cryo-EM density (maps A, B, C, D, E) colored by local resolution as determined by ResMap (Kucukelbir et al., 2014). (d) Representative regions of sharpened THO–UAP56 densities are superimposed on the refined coordinate model. Segments from THOC1 (residues 316–330, helix α13; map B), THOC2 (residues 576–590, α31; map D), THOC5 tRWD-C (residues 469–495, helix α14; map B), and THOC6 (residues 278–286, blade 6 β3; map B) are shown.

Figure 1—figure supplement 4. THO–UAP56 cryo-EM data collection and refinement statistics.

Figure 1—figure supplement 4.

(a) Cryo-EM data collection and refinement statistics of the THO–UAP56 structure. Maps A and E were used to position THO–UAP56 monomers 1B and 2A, and THOC5 tRWD–THOC6 from monomer 1B, respectively. (b) Fourier shell correlation between the map B and D cryo-EM densities and the respective refined coordinate model parts, using phenix.mtriage (Afonine et al., 2018).

Figure 1—figure supplement 5. Gallery of THO–UAP56 subunits and their cryo-EM densities.

Figure 1—figure supplement 5.

(a) Front view of the THO–UAP56 complex structure, indicating monomers 1A and 2B that are shown individually in panel b. (b) Gallery of THO–UAP56 complex subunits shown superimposed on their respective cryo-EM densities (maps A-E). Unmodeled densities are shown and selected domains and secondary structure elements are labelled.

Figure 1—figure supplement 6. Back view and surface charge distribution of THO–UAP56.

Figure 1—figure supplement 6.

(a) Human THO–UAP56 structure viewed from the back. The insets show surface representations of the THO–UAP56 tetramer structure (front and back view) and are colored as in Figure 1b. (b) Surface representations of the THO–UAP56 tetramer structure (front, left side, and back view), colored according to electrostatic surface potential. This analysis indicates a positive patch at the THOC1–THOC5–THOC7 interface, which is distant from the THOC2–UAP56 interface, and may interact with nucleic acid.

Figure 2. THO–UAP56 tetramer, dimer, and monomer organization.

Overview of the THO–UAP56 architecture from the front, showing the asymmetric dimers 1 and 2 as ribbons and surfaces, respectively on the left. THO–UAP56 monomers 1B (middle) and 1A (right) are shown as ribbons. THOC5, −6, and −7 assume different positions in monomers A and B, shown in the right panel by superposing monomer 1B (transparent ribbons) on monomer 1A via their THOC2 subunits. This movement is indicated with gray arrows. Colors as in Figure 1.

Figure 2.

Figure 2—figure supplement 1. Biochemical probing of THO complex oligomerization, the THO–UAP56 interface, and mapping of disease mutations.

Figure 2—figure supplement 1.

(a) Recombinant human THO (top), THO∆THOC6 (middle), THOC1/2/3 (bottom) complex migration in sucrose density gradients (SDS-PAGE stained with Coomassie blue) indicates an oligomeric state consistent with tetramer, dimer, and monomer architectures, respectively. (b) Location of human disease-associated mutations in THOC2 (Kumar et al., 2020; Kumar et al., 2018; Kumar et al., 2015) and THOC6 (FORGE Canada Consortium et al., 2013; Casey et al., 2016; Anazi et al., 2016; Care4Rare Canada Consortium et al., 2017; Mattioli et al., 2019). Several THOC6 mutations were shown to affect its stability and lead to the loss of interaction with THO subunits (Mattioli et al., 2019), suggesting that stabilization of the THO tetramer is needed for its full activity. In particular, two THOC6 premature stop codon mutations (at residues Y45 and R87) would truncate the THOC6 regions at the tetramer interface, and therefore disrupt this interaction. The mapped THOC2 mutations disrupt local secondary structure or alter the protein surface, but do not affect interfaces with THOC1, THOC3, and UAP56. The THO–UAP56 monomer 1A is colored as in Figure 1 with mutated residues marked as red spheres at their Cα atoms. Monomers 1B, 2A, 2B and associated mutations are colored in gray. The asterisk indicates mutations that result in premature stop codons. (c) Analysis of the human THO–UAP56 complex interaction in a pulldown experiment using the 6-subunit THO complex, THO∆THOC6, or THOC1/2/3 (SDS-PAGE stained with Coomassie blue). The THO complex and its variants were immobilized on Ni-NTA beads via a 10xhistidine tag on the THOC2 subunit. The asterisk indicates contaminants and degradation products. See Materials and methods for details. (d) Ribbon model showing THOC2 MIF4G domain residues (sticks) mutated in panel e to probe the UAP56 interface in a top view. (e) As in panel c, but comparing the THO complex and three THO variants containing THOC2–UAP56 interface mutants (THOM1: THOC2 residues Y551A, K554S, R555S, K558S; THOM2: THOC2 residues K589A, Y590S, N592A; THOM3: combined THOM1 and THOM2 mutants) for their UAP56 affinity. See Materials and methods for details. (f) The human export factor NXF1–NXT1 binds the core THOC1/2/3–UAP56 complex in a pulldown experiment. NXF1–NXT1 was immobilized on amylose beads via an MBP-tagged NUP214 FG-repeat peptide (Materials and methods) and bound THO, THO∆THOC6, or core THOC1/2/3 complex together with UAP56 (SDS-PAGE stained with Coomassie blue). The asterisk indicates contaminants and degradation products. See Materials and methods for details.

Video 1. 360 degree rotation of the human THO–UAP56 complex, colored as in Figure 1c.

Download video file (10.7MB, mp4)

THO–UAP56 structure

The THO–UAP56 28-subunit tetramer consists of two asymmetric dimers, 1 and 2, each comprising two seven-subunit THO–UAP56 monomers, A and B. Using this nomenclature, the tetramer contains monomers 1A, 1B, 2A, and 2B (Figures 1b,c and 2). Each monomer is partitioned into two regions, one formed by THOC1, −2, –3, and UAP56 and the second by THOC5, −6, and −7. THOC1, −2, –3, and UAP56 adopt the same architecture in all monomers, whereas THOC5, −6, –7 assume different conformations to assemble the dimer via THOC5 and THOC7 and the tetramer via THOC5 and THOC6. THOC5 and THOC7 form a parallel coiled coil, whose C-terminal ends from monomer A and B meet in a four-helix bundle, yielding the dimerization region (Figures 2 and 3a,b). The dimer is further stabilized by contacts between the N-terminal THOC5 tandem RWD (tRWD-N) domains from monomers A and B that pack against each other. The THOC5–THOC7 coiled coil is interrupted by a ‘hinge’ in each monomer, which generates three mobile sub-regions in the 14-subunit THO–UAP56 dimer: monomer A, monomer B, and the above mentioned dimerization region (Figures 2 and 3a). Notably, the THOC5–THOC7 coiled coil measures ~ 350 Å across the human dimer, suggesting that distal positioning of monomers A and B may play a role in TREX function.

Figure 3. Details of THO–UAP56 tetramer and dimer interfaces.

Figure 3.

(a) Ribbon model of the THO–UAP56 complex dimer 1, indicating dimer and tetramer interfaces. Colors for monomer A subunit UAP56 and monomer B are as in Figure 1, monomer A THOC1 (yellow), THOC2 (pale yellow), THOC3 (orange), THOC5 (salmon), THOC6 (maroon), and THOC7 (red). The black line marks the tetramer interface. The THOC5 tRWD-C domain and THOC6 from monomer 2A are shown as transparent ribbons and stabilize the tetramer. (b) THO–UAP56 dimerization domain. Monomer A THOC5 helix α5 and THOC7 α4 make a parallel coiled coil that forms a four-helix bundle with equivalent helices from monomer B. The four-helix bundle is asymmetric (note the α5* position between THOC51A and THOC51B) due to dimerization of the adjacent THOC5 tRWD-N domains. View and colors as in panel a. (c) The THOC5–THOC7 coiled coil binds THOC1 and the THOC2 anchor helices and adopts different positions in presence (left, monomer 1B) or absence (right, monomer 1A) of THOC62A from the neighboring THO–UAP56 monomer 2A. Interacting THOC62A β-propeller blades 1, 5, 6, and seven are labelled. Monomers A and B are superimposed (right) on their THOC2 subunits, and monomer B is transparent. Colors as in panel a.

The THO–UAP56 tetramer assembles from dimer 1 and 2 via two interfaces that involve the THOC5 and THOC6 subunits. The first interface is formed by homotypic interactions between the THOC5 tRWD-C domains from monomers 1A and 2A. The second is formed by the THOC6 β-propeller from monomers 1A and 2A that interact with the THOC1, −5, and −7 subunits from monomers 2B and 1B, respectively (Figure 3a–c). THOC6 thereby associates with the neighboring dimer and causes THOC1 and the THOC5–THOC7 coiled coil to adopt different positions (Figure 3c). This interaction stabilizes the tetramer and at the same time leads to the asymmetry within the THO–UAP56 dimer (Figure 3c). In agreement with this architecture, the recombinant THO complex lacking THOC6 assembles into a dimer rather than a tetramer, shown by its slowed migration in sucrose density gradients (Figure 2—figure supplement 1a). Deletion of the THOC5 and THOC7 subunits leads to loss of THOC6, as expected, and results in a monomeric THOC1/2/3 complex (Figure 2—figure supplement 1a). Nine residues in THOC6 are known to be mutated in human disease (Heath et al., 2016), all of which map to the THOC6 β-propeller. These mutations are predicted to destabilize the β-propeller fold or the THOC5–THOC7 interaction and would thereby disrupt THO tetramerization (Figure 2—figure supplement 1b). Indeed, four mutations were previously shown to reduce nuclear THOC6 levels and perturb THOC6 interaction with the THO complex (Mattioli et al., 2019).

The core of the THO–UAP56 monomer is formed by the THOC2 subunit, which comprises an extended helical repeat with five distinct domains that we name ‘anchor’, ‘bow’, ‘MIF4G’, ‘stern’, and the disordered ‘charged domain’ (CD, truncated for recombinant expression) (Peña et al., 2012Figures 1a and 4a). The N-terminal THOC2 anchor helices bind α-helices THOC5 α3 and THOC7 α2 and connect via a disordered linker to the THOC2 bow domain. The THOC2 bow is sandwiched by the helical THOC1 ‘dock’ domain (Figure 3b). Hence, the THOC1, −5, and −7 subunits jointly bind THOC2, and THOC1 orients the extended and mobile THOC2 helical repeat (bow, MIF4G, stern) (Figures 1c and 2, Figure 1—figure supplements 1d and 3c). The THOC2 bow and adjacent MIF4G domain bind the THOC3 β-propeller (Figure 4a), and the THOC2 MIF4G domain makes additional, extensive contacts to the UAP56 helicase (Figures 1a and 4a).

Figure 4. THOC2–UAP56 interface and TREX model.

(a) Details of THOC2 regions (bow, MIF4G, stern, and CD) and their interactions with THOC1, THOC3, and UAP56. THOC3 β-propeller blades 4 and 5 bind the THOC2 bow domain. The UAP56 N-terminal RecA1 lobe is mobile and modeled as for Figure 1, indicated with an asterisk. Unmodeled density (map D) was putatively assigned to THOC1 and binds along the THOC2 bow, MIF4G and stern domains. THOC2 is colored in shades of green, other subunits as in Figure 1. (b) TREX–mRNA complex model. The model suggests that ALYREF may license mRNA export by bridging two adjacent UAP56 helicases through (i) its N- and C-UBMs and (ii) charge complementary between acidic and basic residues of one ALYREF UBM, bound to UAP561A, and the juxtaposed UAP562B. The model was obtained by superimposing the human homology model of the yeast Sub2–Yra1–RNA crystal structure (Ren et al., 2017) (PDB ID 5SUP) on the UAP56 RecA2 lobe, in each monomer. The asterisk indicates modeled elements (UAP56 RecA1, mRNA, ALYREF). THO–UAP56 monomers 1A and 2B are positioned in proximity, according to 3D variability analysis (Figure 4—figure supplement 1d,e). THOC3, −5, –7, the THOC21A bow, THOC11A, and the THOC12B N-terminus were omitted for clarity. Colors as in Figure 3, ALYREF (violet), RNA (black). RRM, RNA-recognition motif; N- and C-UBM, N- and C-terminal UAP56-binding motifs. (c) Cartoon schematic of TREX-dependent mRNA-export licensing. Multivalent interactions between THO–UAP56, mRNA, and mRNP proteins may be required for (i) mRNP recruitment and mRNA binding. ALYREF may bridge two proximal UAP56 helicases in the assembled TREX–mRNA complex, to facilitate (ii) loading of multiple NXF1/NXT1 export factor complexes onto nascent mRNA and license it for export. The curved lines indicate flexibility. The UAP56 RecA1 lobe is mobile in absence of mRNA, indicated by dashed lines and the transparent surface.

Figure 4.

Figure 4—figure supplement 1. Details of THO–UAP56 flexibility and the UAP56–ALYREF interaction.

Figure 4—figure supplement 1.

(a) Domain organization of human ALYREF (yeast Yra1). UBM, UAP56 Binding Motif; RBD, RNA Binding Domain; RRM, RNA-Recognition Motif. The RBD contains multiple RG- motifs. (b) Amino acid sequence alignment comparing N-terminal (left) and C-terminal (right) UBMs of ALYREF from Homo sapiens (Hs), Danio rerio (Dr), Drosophila melanogaster (Dm), Caenorhabditis elegans (Ce), Arabidopsis thaliana (At), Saccharomyces cerevisiae (Sc), and Schizosaccharomyces pombe (Sp). The alignment was created using Clustal Omega (Sievers et al., 2011) and visualized with ESPript 3 (Robert and Gouet, 2014). A secondary structure assignment, based on PSIPRED (Buchan et al., 2013) for the N-UBM and the crystal structure of Yra1 C-UBM bound to Sub2 (PDB ID 5SUP) for the C-UBM, is shown above the alignment. Purple stars indicate residues predicted to not bind the UAP56 RecA1 lobe. Invariant and conserved residues are indicated with a red background or red font, respectively. (c) Amino acid sequence alignment as in panel b but comparing N-UBMs of human ALYREF, POLDIP3, UIF and LUZP4 on the top, and the C-UBMs of human ALYREF and CHTOP below. (d) Principal component analysis (PCA) of THO–UAP56 cryo-EM data reveals flexibility. Densities for the start (green) and endpoint (gray) of the first two PCA components are shown from front and left side views. Component two is additionally shown from a top view, as in panel e. Dimers 1 and 2, monomers A and B, and the pseudo two-fold symmetry axis are indicated. (e) Modeling of THO–UAP56 monomers 1A and 2B into densities from PCA component two shows that proximal UAP56 helicases approach each other (ribbon model, colored as in Figure 2). Conformation two is the conformation of the THO–UAP56 structure, while conformations 1 and 3 reflect the starting and endpoints for the PCA component two movement shown in panel d. The UAP56 RecA1 lobes are not resolved in our structure and are indicated as dashed circles. The inset shows a surface representation of the THO–UAP tetramer, colored as in Figure 3a, with the subunits shown below depicted as ribbons. THOC5, −6 and −7 were omitted for clarity. (f) Details of the TREX–mRNA complex model shown in Figure 4b (ribbon model, colored as in Figure 4b). Side chains of ALYREF C-UBM residues in stick representation, with conserved acidic residues labelled. Conserved, basic residues on the neighboring UAP562B RecA2 lobe are shown as gray sticks and are labelled. (g) The DEXD-box RNA helicase RecA2 lobe is frequently bound by other proteins. Surface view of RNA-bound UAP56 (homology model based on the Sub2–Yra1–RNA crystal structure [Ren et al., 2017] [PDB ID 5SUP]), with RNA shown as ribbons. RecA1 lobe (purple), RecA2 lobe (pink), RNA (black) and UAP56 RecA2 lobe residues highlighted in panel f (gray). Superimposed on UAP56, and shown as Cα trace, are Edc3 (residues 88–116, Dhh1-bound conformation, PDB ID 4BRU), 4ET (residues 216–238, DDX6-bound conformation, PDB ID 5ANR), DDX6 C-terminus (residues 461–469, PDB ID 4CT4) and CWC22 (residues 131–136, eIF4AIII-bound conformation, PDB ID 4CB9). Other DEXD-box helicases, except for UAP56, were omitted for clarity.
Figure 4—figure supplement 2. ATPase assay and structural comparisons of the DEXD-box RNA helicase–MIF4G domain interaction.

Figure 4—figure supplement 2.

(a) Human UAP56 ATPase activity is accelerated in vitro by the THOC1/2/3 core complex. Standard deviations are calculated from three independent experiments. For details see the Materials and methods. (b) The DEXD-box helicase RecA2–MIF4G domain interaction is broadly conserved. UAP56 (pink, RecA2) and the interacting THOC2 MIF4G (green, MIF4G α29–36; yellow, α28, α41, α46) were superimposed on DDX6 (light gray) and its interacting CNOT1 MIF4G (dark gray, PDB ID 4CT4) (left) or with eIF4AIII (light gray) and its interacting CWC22 MIF4G (dark gray, PDB ID 4CB9) (right) via their RecA2 domains, respectively. Additional contacts are observed between CNOT1 and CWC22 MIF4G domains and the RecA1 lobe of the respective DEXD-box helicase. Note that a transient interaction between RecA1–THOC2 might not be observed in a cryo-EM experiment, whereas such transient interactions may be trapped in crystal structures.
Figure 4—figure supplement 3. Revised yeast THO–Sub2 model and conservation of the THO architecture.

Figure 4—figure supplement 3.

(a) Ribbon model of the published chimeric yeast THO-Sub2 crystal structure (Ren et al., 2017), which contained Saccharomyces bayanus Tex1, while the other subunits were from Saccharomyces cerevisiae. (Sub2, RecA1, purple; RecA2 pink; unassigned subunits, gray; Tex1, green; PDB ID 5SUQ). Phosphotungsten (KEG) clusters are shown as surfaces (yellow). (b) The published yeast THO–Sub2 crystal structure contained a dimer. Superposition of model residues 8–5289 (corresponding to monomer A, orange) onto model residues 6170–8550 of the unassigned subunits. Related phosphotungsten clusters are shown as surfaces in dark yellow (N8701 and A501) and light yellow (M8701 and M8702). The left inset shows that unmodeled density in the final (2Fo − Fc) map contoured at 1.0σ of monomer B was independently built in the superimposed monomer A (model residues 482–492, orange). The right inset shows a closeup of KEG N8701 (yellow, surface representation) that is located at the previously unmodeled monomer B Sub2 RecA2 lobe, but distant from monomer A. In support of this architecture, the endogenous Saccharomyces cerevisiae THO complex was also reported to form dimers using negative stain EM (Peña et al., 2012). (c) Asymmetric unit of the revised yeast THO–Sub2 crystal structure. The previously built backbone model is colored in gray. The second Sub2 (RecA1, purple; RecA2 pink), Tex1 (green) and Tho2 C-terminus (orange) are shown. The two dimers in the asymmetric unit are shown, dimer one as a surface and dimer two as a ribbon model. Consistent with the model, we observed no clashes in the completed asymmetric unit. (d) Model of the yeast Saccharomyces cerevisiae TREX complex dimer in front and left side views. The model suggests that yeast Yra1, similar to human ALYREF, may bridge two Sub2 helicases that are located ~80 Å apart. The shown homology model is based on the combined revised chimeric yeast THO–Sub2 complex crystal structure (for details see a–c, Materials and methods) and the Sub2–RNA–Yra1 structure (Ren et al., 2017). The Tho2 ‘anchor’ domain is putatively assigned based on secondary structure prediction and homology to the human monomer B THOC2 structure (Figure 1—figure supplement 5b). Colors as in Figure 1, Yra1 (violet), RNA (black). (e) Superposition of the human THO–UAP56 complex monomer 1A (colored as in Figure 4a) with yeast THO–Sub2 monomer A on their THOC2 (yeast Tho2) subunits (gray; Mft1–Thp2, light blue; putative anchor Tho2, light green). Note the good fit of the THOC2 MIF4G domain (between joints 1 and 2), and the structural differences towards the THOC2 Stern (left) and the base of the THOC5–THOC7 coiled coil (right) (indicated by arrows). Tex1, THOC3, and the THOC5 tRWD domain were omitted for clarity. (f) The THO complex architecture was adapted during evolution. The THOC5 tRWD domain and THOC6, which stabilize the tetramer, are found in complex eukaryotes. The heatmap is colored according to sequence identity to the human THO subunits. Shown are Danio rerio (Dr), Drosophila melanogaster (Dm), Caenorhabditis elegans (Ce), Arabidopsis thaliana (At) Saccharomyces cerevisiae (Sc), and Schizosaccharomyces pombe (Sp).

UAP56 contains two ATPase lobes, RecA1 and RecA2. We observed density for the UAP56 RecA2 lobe, but not RecA1, which remains mobile in the absence of RNA and ATP (Shi et al., 2004). Consistent with the structure, THO oligomeric state did not impact UAP56 binding in a pulldown experiment (Figure 2—figure supplement 1c), whereas point mutations in the THOC2 MIF4G domain at the UAP56 interface did impair UAP56 binding (Figure 2—figure supplement 1d,e). The THOC2 MIF4G domain adjoins the curved THOC2 stern domain, which in turn connects to the disordered CD implicated in nucleic acid binding (Peña et al., 2012). Near the THOC2 stern, we observed weak density for the THOC1 C-terminus (Figure 4a), which can bind the mRNA-export factor in yeast and humans (Hobeika et al., 2007; Viphakone et al., 2012). The close positioning of UAP56, the THOC1 C-terminus, and the THOC2 CD suggests that this region is involved in export factor loading. Supporting this model, we observed that the recombinant core THOC1/2/3–UAP56 complex is sufficient to bind the NXF1–NXT1 export factor in vitro (Figure 2—figure supplement 1f). Previous data showed that the isolated THOC5 subunit can also bind NXF1–NXT1 in vitro (Katahira et al., 2009; Viphakone et al., 2012). However, THOC5 did not contribute to NXF1–NXT1 binding in the context of multi-subunit THO–UAP56 complexes (Figure 2—figure supplement 1f), consistent with its distant location from the putative export factor loading site in the THO–UAP56 structure (Figures 1c and 4a).

The THO–UAP56 tetramer is ~260 Å long,~290 Å high, and ~150 Å wide. Subunits involved in mRNA and export factor binding are located at the ends of the complex (THOC1, −2, –3, and UAP56), away from those involved in oligomerization (THOC5, −6, and −7) (Figures 1c and 2, Figure 1—figure supplement 6a). Whereas all THO subunits are essential genes (Dempster et al., 2019), the requirement for THOC5, −6, and −7 suggests that THO oligomerization may be needed for normal function. The extended shape and flexibility of the THO–UAP56 tetramer (Figure 4—figure supplement 1d) could allow for multiple interactions with spatially distant mRNA regions and mRNP maturation marks, such as the cap-binding complex or exon-junction complex (Cheng et al., 2006; Merz et al., 2007) and facilitate NXF1–NXT1 loading from multiple sites.

TREX complex and ALYREF function

ALYREF (yeast Yra1) is part of the conserved TREX complex (Chávez et al., 2000; Strässer et al., 2002), is required for transfer of mRNA to the export factor NXF1–NXT1 (Hautbergue et al., 2008; Taniguchi and Ohno, 2008), and is essential for viability in yeast (Strässer and Hurt, 2000; Stutz et al., 2000) and humans (Dempster et al., 2019). ALYREF contains two UAP56-binding motifs (UBMs) at its N- and C-terminus that can separately bind UAP56 (Hautbergue et al., 2009Figure 4—figure supplement 1a,b). Between the UBMs is an RG-rich region, followed by a central RNA-recognition motif (RRM) domain, and a second RG-rich region (Gromadzka et al., 2016). To gain insight into ALYREF function, we prepared a homology model of UAP56 bound to RNA, an ATP-analog, and the C-terminal ALYREF UBM (C-UBM) based on a yeast crystal structure (Ren et al., 2017). We then superimposed this model onto the human THO–UAP56 tetramer structure via the four UAP56 RecA2 lobes (Figure 4b, Figure 4—figure supplement 1d,e). The resultant model reveals how ALYREF may function in the TREX complex bound to mRNA.

First, the model suggests that a single ALYREF protein may reach across the two THO dimers to bind two juxtaposed UAP56 helicases. The ALYREF N- and C-UBMs would each bind one UAP56 RecA1 lobe (Hautbergue et al., 2009), located 20 Å apart, from THO–UAP56 monomers 1A and 2B (or 2A and 1B) (Figure 4b). Alternatively, ALYREF could bridge two UAP56 proteins within one dimer, which are ~200 Å apart, or possibly multiple THO–UAP56 tetramers. Bridging by ALYREF can explain why deletion of either N- or C-UBM leads to growth defects in yeast (Strässer and Hurt, 2001; Zenklusen et al., 2001) and suggests that the UBMs and associated RG-rich domains contribute to the avidity of TREX–mRNA binding, consistent with biochemical data (Hautbergue et al., 2008; Ren et al., 2017). Furthermore, RNA binding by UAP56 (monomers 1A and 2A) would orient the mobile RecA1 lobe and project conserved negatively charged residues of the interacting ALYREF UBM towards a conserved patch of positively charged residues of the juxtaposed UAP56 RecA2 lobe (from monomers 1B and 2B) (Figure 4b). This RecA2 surface is frequently bound by regulators in other DEXD-box helicases (Linder and Jankowsky, 2011; Buchwald et al., 2013; Sharif et al., 2013; Mathys et al., 2014; Figure 4—figure supplement 1g) and suggests how a single UBM could be sufficient for partial ALYREF function (Strässer and Hurt, 2001; Zenklusen et al., 2001; Golovanov et al., 2006; Gromadzka et al., 2016). Notably, these two models of ALYREF function are not mutually exclusive and collectively suggest that the ALYREF UBMs bridge UAP56 helicases, which may correctly position other ALYREF regions for export factor loading.

The functionally related ALYREF-like proteins contain one (LUZP4, POLDIP3, UIF) or two (CHTOP) closely spaced UBMs (Gromadzka et al., 2016; Hautbergue et al., 2009; Viphakone et al., 2015Figure 4—figure supplement 1c), and thus are unlikely to bridge two helicase RecA1 lobes in the THO–UAP56 structure (Figure 4—figure supplement 1c). Nevertheless, we speculate that the THO–UAP56 tetramer increases the affinity of mRNA-export adaptors to their target mRNAs through avidity, by providing multiple UAP56 binding sites in close proximity.

UAP56 is a member of the DEXD-box RNA helicase family. These helicases function as ‘clampases’ that couple ATP hydrolysis to RNA release and can be regulated by interacting proteins (Linder and Jankowsky, 2011). Biochemical data from yeast (Ren et al., 2017) and humans (Taniguchi and Ohno, 2008; Shen et al., 2007) show that ALYREF weakly promotes UAP56 ATPase activity, and the yeast THO complex further stimulates this (Ren et al., 2017). We observed that the human THOC1/2/3 core complex is sufficient to promote UAP56 ATPase activity in vitro, consistent with our structure and other MIF4G–DEXD-box helicase systems (Hilbert et al., 2011; Montpetit et al., 2011Figure 4—figure supplement 2a,b). Taken together, we propose that THO and ALYREF regulate the activity of two bridged UAP56 helicases within the TREX–mRNA complex to control export factor loading onto mRNA (Hautbergue et al., 2008; Taniguchi and Ohno, 2008).

Conserved THO–UAP56 dimerization

To gain insights into conservation of the TREX complex, we re-analyzed a previously reported 6.0 Å resolution crystal structure of the six-subunit yeast Sc THO–Sub2 complex (Ren et al., 2017Figure 4—figure supplement 3, Materials and methods). The yeast structure was proposed to be a six-subunit monomer, within which a single Tex1 (THOC3) and a single Sub2 (UAP56) subunit were assigned, but not Tho2, Hpr1, Thp2, or Mft1. Surprisingly, our revised yeast model revealed that the crystal contained a 12-subunit THO–Sub2 dimer, which could better explain the published electron density and phosphotungsten cluster positions (Figure 4—figure supplement 3a–c, Video 2). We could assign two copies of Tho2, Hpr1, the Thp2–Mft1 coiled coil, and place one additional copy of the Tex1 and Sub2 subunits based on (i) comparisons to the human THO–UAP56 structure, (ii) the known homologies of yeast Hpr1, Tho2, Tex1, and Mft1 with human THOC1, −2,–3, and −7, and (iii) the predicted Thp2 coiled coil (Söding et al., 2005Figure 4—figure supplement 3d, Materials and methods). The revised model allowed us to identify Thp2 and confirm Mft1 as the respective yeast homologs of the THOC5 and THOC7 coiled coils. This indicates an unexpectedly high degree of structural conservation of the five-subunit THO complex monomer (THOC1, −2, –3, −5, –7) and its dimerization via coiled coils. Notably, the THO–Sub2 model shows that the two Sub2 helicases are located ~80 Å apart. Both bind to their respective Tho2 MIF4G domain in monomers A and B, analogous to the human THOC2–UAP56 interaction. Thus, also in yeast a single Yra1 could bridge two Sub2 helicases via its N- and C-UBM regions, mirroring human ALYREF (Figure 4—figure supplement 3d).

Video 2. 360 degree rotation of the yeast THO–Sub2 complex, colored as in Figure 4—figure supplement 3d.

Download video file (5.4MB, mp4)

Model of mRNA-export licensing

Our results suggest a unified model for TREX function through multimerization of the THO–UAP56 complex (Figure 4c). THO–UAP56 forms a dimer in yeast (Figure 4—figure supplement 3) and a constitutive tetramer in humans, mediated by the additional THOC5 tRWD domain and THOC6 subunit (Figure 1, Figure 1—figure supplement 1f–h, Figure 4—figure supplement 3f). In humans, THO–UAP56 tetramerization may have evolved in response to an increased complexity in gene architecture and mRNP composition (Singh et al., 2015). The conserved THO–UAP56 multimerization, extended architecture, and flexibility (Figure 1—figure supplement 1b–d, Figure 4—figure supplement 1d) indicate that one complex can bind multiple mRNP maturation marks and mRNA regions simultaneously. Subsequent to THO–UAP56 recruitment to the mRNP, a single ALYREF (or yeast Yra1) molecule could bridge two juxtaposed UAP56–mRNA complexes and, potentially, multimerize itself through its RG-rich regions (Gromadzka et al., 2016). In the assembled TREX–mRNP complex UAP56 ATPase activity is stimulated and, together with ALYREF or alternative export adaptors, may load the export factor NXF1–NXT1 from multiple sites within the complex. Taken together, the data support a model where requirements for both the selectivity and efficiency of mRNP recognition and mRNA-export licensing are fulfilled through multivalent interactions between proteins and mRNA.

Materials and methods

Key resources table.

Reagent type (species)
or resource
Designation Source or reference Identifiers Additional information
Gene (H. sapiens) THOC1 Uniprot Q96FV9
Gene (H. sapiens) THOC2 Uniprot Q8NI27
Gene (H. sapiens) THOC3 Uniprot Q96J01
Gene (H. sapiens) THOC5 Uniprot Q13769
Gene (H. sapiens) THOC6 Uniprot Q8NI27
Gene (H. sapiens) THOC7 Uniprot Q6I9Y2
Gene (H. sapiens) UAP56/DDX39B Uniprot Q13838
Gene (H. sapiens) NXF1 Uniprot Q9UBU9
Gene (H. sapiens) NXT1 Uniprot Q9UKK6
Gene (H. sapiens) NUP214 Uniprot P35658
Strain, strain
background
(Escherichia coli)
DH10EmBacY Geneva Biotech DH10EmBacY Strain propagated locally
from commercial source stock
Strain, strain
background
(E. coli)
BL21(DE3)RIL Agilent 230245 Strain propagated locally
from commercial source stock
Cell line
(Spodoptera frugiperda)
Sf9 Thermo Fisher Scientific IPLB-Sf-21-AE;
RRID:CVCL_0549
Strain propagated locally
from commercial source stock
Cell line
(Trichoplusia ni)
High Five Thermo Fisher
Scientific
BTI-TN-5B1-4;
RRID:CVCL_C190
Strain propagated locally
from commercial source stock
Cell line (Lenti-X) Lenti-X Takara Cat. #632180
Cell line (K562) K562 DSMZ ACC 10, RRID:CVCL_0004
Antibody anti-THOC1
(Polyclonal rabbit)
Merk HPA019096 (dilution 1:500)
Antibody anti-mouse IgG-HRP
(Polyclonal goat)
Promega W4021 (dilution 1:10000)
Recombinant
DNA reagent
pACEBac1 Geneva Biotech pACEBac1
Recombinant
DNA reagent
pOPINB Addgene #41142
Recombinant
DNA reagent
Lentiviral vector Addgene #31485
Software, algorithm hhpred doi:10.1093/nar/gki408 RRID:SCR_010276 https://toolkit.tuebingen.mpg.de/tools/hhpred
Software, algorithm WARP doi: 10.1038/s41592-019-0580-y http://www.warpem.com/warp/
Software, algorithm cryoSPARC doi:10.1038/nmeth.4169 RRID:SCR_016501 https://cryosparc.com/
Software, algorithm Relion doi:10.7554/eLife.42166 RRID:SCR_016274 https://www3.mrc-lmb.cam.ac.uk/relion/
Software, algorithm ResMap doi:10.1038/nmeth.2727 http://resmap.sourceforge.net
Software, algorithm Modeller doi:10.1002/cpbi.3 RRID:SCR_008395 https://salilab.org/modeller/
Software, algorithm COOT doi:10.1107/S0907444910007493 RRID:SCR_014222 https://www2.mrc-lmb.cam.ac.uk/personal/pemsley/coot/
Software, algorithm Phenix doi:10.1107/S2059798318006551 RRID:SCR_014224 https://www.phenix-online.org/
Software, algorithm USCF ChimeraX doi:10.1002/pro.3235 RRID:SCR_015872 https://www.rbvi.ucsf.edu/chimerax/
Software, algorithm Pymol Schrödinger LLC, 2020 RRID:SCR_000305 https://pymol.org/2/

Vectors and sequences

For heterologous co-expression of the human 6-subunit THO complex in insect cells, the open reading frames (ORFs) of THOC1, −2 residues 1–1203, −3, –5, −6, –7 with an N-terminal 10xhistidine tag on THOC2, an N-terminal 3xV5 tag on THOC1, and a TwinStrepII tag on THOC3 were cloned into a modified pACEBac1 vector (Geneva Biotech) by Golden Gate cloning (Engler et al., 2008). THOC2 was truncated for co-expression at its disordered C-terminus (residues 1204–1593) for an improved biochemical behavior (Figure 1—figure supplement 1a). The human UAP56 ORF was cloned into a pOPINB vector with an N-terminal 6xhistidine tag for expression in E. coli. For endogenous purification of THO–UAP56 from human K562 cells, the THOC1 cDNA and a C-terminal 3C-AID-GFP tag were cloned into a lentiviral vector backbone (Addgene plasmid #31485), yielding a plasmid containing pRRL-SFFV-THOC1-3C-AID-GFP. The human heterodimer NXF1–NXT1 ORFs with an N-terminal 10xhistidine-3xflag tag on NXF1 and, separately, the human NUP214 FG-repeat (residues 1916–2033) ORF with an N-terminal Maltose-binding protein fusion and C-terminal 10xhistidine tag were cloned into modified pACEBac1 vectors as done for the THO complex.

Preparation and reconstitution of the THO-UAP56 complex

Recombinant THO complex was co-expressed in insect cells using a plasmid containing all six subunits (THOC1, −2 residues 1–1203, −3, –5, −6, –7). The THO plasmid was electroporated into DH10EMBacY cells to generate bacmids (Trowitzsch et al., 2010) that were then transfected into Spodoptera frugiperda Sf9 cells to generate a V0 virus. The V0 virus was further amplified in Sf9 cells to yield V1 virus. For protein expression, 750 mL Trichoplusia ni Hi5 cells at a density of 1 × 106 cells were infected with 1 mL V1 virus and harvested after 3 days by centrifugation. Harvested cells were resuspended in buffer A (50 mM Tris-HCl pH 8.0, 300 mM NaCl, 5% (w/v) glycerol, 20 mM imidazole, 1 mM dithiothreitol (DTT), 0.5 mM PMSF, cOmplete EDTA-free protease inhibitor cocktail (Roche)) and lysed by sonication. The lysate was clarified by ultracentrifugation and the supernatant loaded on a HisTrap HP 5 mL column (GE Healthcare), equilibrated in buffer B (10 mM Tris-HCl pH 8.0, 300 mM NaCl, 5% (w/v) glycerol, 20 mM imidazole, 1 mM DTT). The column was washed with a linear gradient from 20 mM to 100 mM imidazole and eluted with buffer B containing 350 mM imidazole. Peak fractions were diluted 1:1 with buffer B lacking NaCl and imidazole and applied to anion exchange using a HiTrapQ HP 5 mL column (GE Healthcare), equilibrated in buffer C (25 mM HEPES pH 7.9, 150 mM NaCl, 5% (w/v) glycerol, 2.5 mM DTT). The complex was eluted with a linear gradient of buffer C from 150 to 1000 mM NaCl. Fractions containing the THO complex were concentrated, loaded on a HiLoad 16/600 Superdex 200 pg column (GE Healthcare) equilibrated in buffer D (25 mM HEPES pH 7.9, 250 mM NaCl, 5% (w/v) glycerol, 1 mM TCEP). The purified THO complex was concentrated to 4 mg mL−1, flash frozen and stored at −80°C. Variant THO complexes, lacking subunits THOC6 or containing mutations, were purified as described above. The THOC1/2/3 complex was purified as above, except that buffer A and B contained 500 mM NaCl, the complex was eluted from the HisTrap HP 5 mL column using a linear gradient from 0–50% with buffer B containing 500 mM NaCl and 250 mM Imidazole, and that buffer D lacked TCEP. The identity of all THO subunits was confirmed by mass spectrometry.

Full-length human UAP56 was expressed in E. coli BL21 DE3 RIL cells grown in autoinduction media at 37°C for 16 hr. Cells were lysed by sonication in buffer E (25 mM HEPES pH 7.9, 500 mM NaCl, 5% (w/v) glycerol, 20 mM imidazole, 1 mM DTT, 0.5 mM PMSF, 0.1% (v/v) Tween-20, cOmplete EDTA-free protease inhibitor cocktail). The supernatant was clarified by centrifugation and loaded on a HisTrap HP 5 mL column, equilibrated in buffer F (25 mM HEPES pH 7.9, 500 mM NaCl, 5% (w/v) glycerol, 20 mM imidazole, 1 mM DTT). The column was washed with buffer F and eluted with a linear gradient from 20 to 250 mM imidazole. The 6xhistidine tag was cleaved by PreScission protease at 4°C overnight, and UAP56 was then diluted to 50 mM NaCl using buffer F lacking NaCl and imidazole and applied to anion exchange using a HiTrapQ HP 5 mL, equilibrated in buffer G (25 mM HEPES pH 7.9, 50 mM NaCl, 5% (w/v) glycerol, 2.5 mM DTT). The protein was eluted with a linear gradient from 50 to 400 mM NaCl. Fractions containing UAP56 were concentrated and loaded on a HiLoad 16/600 Superdex 75 pg column, equilibrated in buffer H (25 mM HEPES pH 7.9, 100 mM NaCl, 5% (w/v) glycerol, 2.5 mM DTT). Purified UAP56 was concentrated to 11 mg mL−1, flash frozen and stored at −80°C.

To reconstitute the THO–UAP56 complex, the THO complex was mixed with a two-fold molar excess of UAP56 and incubated at 20°C for 30 min in buffer I (25 mM HEPES pH 7.9, 100 mM NaCl, 5% (w/v) glycerol, 1 mM TCEP). We used GraFix (Kastner et al., 2008) to obtain a homogenous and crosslinked THO–UAP56 complex for EM studies. 90 µg of THO–UAP56 complex were loaded onto a 4 mL 10–40% (w/v) sucrose gradient in buffer J (25 mM HEPES pH 7.9, 50 mM KCl, 1 mM TCEP, 0–0.05% glutaraldehyde) and ultracentrifuged at 91,100 x g in a SW60 Ti swing bucket rotor (Beckman) for 20 hr 30 min at 4°C. Peak fractions were pooled and the crosslinking reaction was quenched for 15 min using a final concentration of 50 mM Lysine. The sample was then buffer-exchanged using Zeba Spin 7K MWCO Desalting Columns that were equilibrated in buffer K (25 mM HEPES pH 7.9, 50 mM KCl, 2% (w/v) glycerol, 1 mM TCEP). The sample was concentrated using a Vivaspin 500 centrifugal concentrator (MWCO 100 kDa) and immediately used for EM studies.

NXF1–NXT1 virus generation and co-expression in insect cells were performed as for the THO complex (see above). Cells were harvested by centrifugation and resuspended in buffer L (50 mM HEPES pH 7.9, 300 mM NaCl, 10% (w/v) glycerol, 1 mM DTT, cOmplete EDTA-free protease inhibitor cocktail) before sonication. The cell lysate was ultracentrifuged and the supernatant applied to a HisTrap HP 5 mL column, equilibrated in buffer M (25 mM HEPES pH 7.9, 300 mM NaCl, 10% (w/v) glycerol, 30 mM imidazole, 1 mM DTT). The column was washed with 45 mM imidazole and eluted with buffer M containing 300 mM imidazole. Peak fractions were diluted 1:2 with buffer M lacking imidazole and containing 5% (w/v) glycerol and 10 mM NaCl, and were loaded on a HiTrap Heparin HP 5 mL column (GE Healthcare), equilibrated in buffer N (25 mM HEPES pH 7.9, 50 mM NaCl, 5% (w/v) glycerol, 1 mM DTT). The complex was eluted with a linear gradient of buffer N from 50 to 1000 mM NaCl. Fractions containing the NXF1–NXT1 heterodimer were concentrated to 1 mg mL−1, flash frozen, and stored at −80°C.

The NUP214 FG-repeat (residues 1916–2033) was expressed in insect cells as done for the THO complex above. The harvested cells were lysed by sonication in buffer A, and applied to a HisTrap HP 5 mL column, equilibrated in buffer B. The column was washed with buffer B and then with buffer B containing 600 mM NaCl. The protein was eluted in buffer B containing 300 mM imidazole and dialyzed overnight against 1 L buffer C containing 200 mM NaCl. The dialyzed protein was loaded onto a HiLoad 16/600 Superdex 200 pg column, equilibrated in buffer C containing 200 mM NaCl. Fractions containing the NUP214 FG-repeat protein were pooled, concentrated to 14 mg mL−1, flash frozen, and stored at −80°C.

Endogenous THOC1 overexpression and nuclear extract preparation

For endogenous purification of THO–UAP56, lentiviral particles carrying the THOC1-3C-AID-GFP construct were generated using Lenti-X cells (Takara) by polyethylenimine transfection (Polysciences) of the viral plasmid and helper plasmids pCMVR8.74 (Addgene plasmid #22036) and pCMV-VSV-G (Addgene plasmid #8454), according to standard procedures. K562 (DSMZ) cells were infected at limiting dilutions and GFP-positive cells were isolated using a BD FACSAria III cell sorter (BD Biosciences). Viral integration was confirmed by immunoblotting for THOC1 and GFP (Figure 1—figure supplement 1f). To prepare nuclear extract (NE), 15 L of human K562 cells were grown to a density of 1 × 106 cells mL−1 at 37°C, stirred at 80 rpm and 5% CO2. The NE was prepared as previously described (Mayeda and Krainer, 1999) and dialyzed against buffer O (20 mM HEPES, pH 7.9, 100 mM KCl, 20% (w/v) glycerol, 0.2 mM EDTA, 2 mM DTT). Lenti-X (Takara) and K562 (DSMZ) cells tested negative for mycoplasma.

Western blot

NE from wild type and THOC1-3C-AID-GFP containing human K562 cells was diluted 1:1 with buffer P (40 mM HEPES pH 7.9, 15 mM KCl, 6 mM MgCl2, 2 mM DTT). For immunoprecipitation, the NE was first incubated for 1 hr at 4°C with or without 2 µg Benzonase per mL NE, as indicated in Figure 1—figure supplement 1f, and then incubated on a rotating wheel for 2.5 hr at 4°C with GFP-Trap Agarose resin (Chromotek) that was previously equilibrated with buffer Q (20 mM HEPES pH 7.9, 100 mM KCl, 2 mM MgCl2, 8% (w/v) glycerol, 0.05% (v/v) Igepal CA-630, 1 mM DTT). After five washes, the beads were eluted by boiling and the samples were applied to SDS–PAGE, transferred onto a PVDF membrane (ThermoScientific) and probed with anti-THOC1 primary antibody (HPA019096, Merk, dilution 1:500). Goat anti-mouse IgG-HRP (W4021, Promega, dilution 1:10000) was used as a secondary antibody. Antibody detection was performed with Amersham ECL Select Western Blotting Detection Reagent (GE Healthcare) and a ChemiDoc MP imaging system (Bio-Rad Laboratories).

Endogenous THO–UAP56 purification

The THOC1-3C-AID-GFP K562 NE was treated with 1 µg Benzonase per mL NE for 12 hr at 4°C, to release THO–UAP56 complexes from the pellet fraction and substantially increase the total yield. THOC1 was immunoprecipitated as described above (Western blot) and eluted for 2 hr at 4°C with 3C PreScission Protease. The eluate was loaded onto a 10–50% w/v sucrose step gradient containing 0–0.05% glutaraldehyde in buffer T (20 mM HEPES pH 7.9, 100 mM KCl, 2 mM MgCl2, 2 mM TCEP), and centrifuged for 16 hr at 105,600 x g in a SW60 Ti rotor (Beckman coulter). Fractions containing the THO–UAP56 complex were quenched for 15 min using a final concentration of 50 mM lysine, pooled, concentrated in a 0.5 mL 100 kDa MWCO Amicon concentrator (Sigma) and immediately used for EM grid preparation.

ATPase assay

We measured steady-state UAP56 ATPase activity using a NADH-coupled ATPase assay, essentially as described (Montpetit et al., 2012). The assay was set up at the final concentrations of 5 U/mL rabbit muscle pyruvate kinase, Type III (Sigma-Aldrich), 5 U/mL rabbit muscle L-lactic dehydrogenase, Type XI (Sigma-Aldrich), 500 µM phosphoenolpyruvate and 50 µM NADH. Reactions were prepared in buffer R (25 mM HEPES pH 7.9, 100 mM NaCl, 25 mM KCl, 10 mM MgCl2, 5% (w/v) glycerol, 1 mM ATP) and contained the indicated combinations of proteins and RNA at the following concentrations: 2 µM UAP56, 2 µM THOC1/2/3, 10 µM poly-uridine 15 nucleotide RNA. The decay of NADH emission signal was monitored over time at 37°C in a PHERAstar FS (BMG LABTECH), using a 0.03–100 µM NADH dilution series as calibration standard. Average UAP56 ATPase rates from triplicate experiments were calculated from linear slopes of NADH decay as hydrolyzed molecules of ATP s−1 per enzyme.

Pulldown assay

To test whether THO complex oligomeric state influences the THO–UAP56 interaction, we immobilized equimolar amounts of the recombinant THOC1/2/3 complex (3.3 µg), THO∆THOC6 complex (4.5 µg) and THO complex (5 µg) via the 10xhistidine tag on THOC2 on magnetic nickel beads (Promega) together with 1 µg recombinant human UAP56 (two-fold molar excess over THO) for one hour at 4°C in buffer S (25 mM HEPES pH 7.9, 50 mM KCl, 5% (w/v) glycerol, 20 mM imidazole, 0.01% (v/v) Igepal CA-630). The beads were washed five times with 1 mL buffer S, and proteins were eluted for one hour at 20°C using 500 mM imidazole in buffer S and visualized by SDS-PAGE (Coomassie blue) (Figure 2—figure supplement 1c). To probe the THOC2–UAP56 interface, 5 µg of THO complex or three variants (THOM1: THOC2 Y551A, K554S, R555S, K558S; THOM2: THOC2 K589A, Y590S, N592A; THOM3: combined THOM1 and THOM2 mutants) were immobilized, incubated with 1 µg UAP56, washed in buffer T (25 mM HEPES pH 7.9, 100 mM KCl, 5% (w/v) glycerol, 20 mM imidazole, 0.01% (v/v) Igepal CA-630), eluted and visualized by SDS-PAGE (Coomassie blue) (Figure 2—figure supplement 1e). To assess binding between the NXF1–NXT1 export factor and the THO complex, we immobilized 4 µg MBP-tagged NUP214 FG-repeat peptide on amylose resin (New England Biolabs) together with NXF1–NXT1 (1:1 molar ratio) in buffer U (20 mM HEPES pH 7.9, 100 mM NaCl, 10% (w/v) glycerol, 0.1% (v/v) Igepal CA-630). Each binding reaction additionally contained equimolar amounts of THO, THO∆THOC6, and THOC1/2/3 complex relative to NXF1–NXT1 and these were incubated together at 4°C on a rotating wheel. The beads were washed three times with 1 mL buffer U, and eluted for 1 hr on ice in buffer U containing 12 mM maltose. The elutions were visualized by SDS-PAGE (Coomassie blue). Each pulldown was repeated in triplicates.

Negative stain electron microscopy of the endogenous THO–UAP56 complex

For negative stain EM imaging of the endogenous THO–UAP56 complex, copper grids were coated with a ~ 5 nm homemade carbon film and glow-discharged. 4 µL sample was applied to the grid and incubated for 1 min. The grid was blotted and washed four times with 4 µL distilled water, stained for 1 min in 4 µL 2% (w/v) uranyl-acetate solution and blotted until dry. 1660 micrographs were acquired using SerialEM (Mastronarde, 2005) on a FEI Tecnai G2 20 transmission electron microscope (Eagle 4 k HS CCD camera) operated at 200 keV at a nominal magnification of 50,000x (2.21 Å pixel−1) and a defocus range of –1 µm to –1.5 µm. 60938 particles were picked and extracted with a 2562 pixel box size using WARP 1.07 (Tegunov and Cramer, 2019) and transferred to RELION 3.1 (Scheres, 2012) for 2D classification using default settings.

Cryo-electron microscopy of the recombinant THO–UAP56 complex

4 µL concentrated and crosslinked THO–UAP56 complexes were applied to glow-discharged Cu R1.2/1.3 300 mesh holey carbon grids (Quantifoil). Grids were blotted at 4°C and 70% humidity and plunged into liquid ethane using a Leica EM GP. Cryo-EM data was recorded using SerialEM (Mastronarde, 2005) in two sessions (data sets 1 and 2) on a FEI Titan Krios G3i operated at 300 keV, equipped with Gatan K3 direct electron detector. Datasets 1 (7482 movies) and 2 (18821 movies) were acquired with a defocus range of –0.4 to –3.7 µm at a nominal magnification of 105,000x (0.86 Å pixel−1). The camera was operated in ‘super-resolution’ mode (0.43 Å pixel−1) with an exposure time of 3.8 s and 33 frames per micrograph, a dose rate of 9.73 e pixel−1 s−1 and a total dose of 50 e Å−2.

Image processing

Movie alignment with 4 × 6 patches, dose-weighting, and contrast transfer function (CTF) parameter estimation were all carried out in WARP 1.07 (Tegunov and Cramer, 2019) for the separate datasets 1 and 2. Automated particle picking was performed with a re-trained neural network in WARP 1.07 (Tegunov and Cramer, 2019), yielding 414,082 and 1,156,183 particles for datasets 1 and 2, respectively. The particles were then extracted, normalized, and Fourier cropped to 1.34 Å pixel−1 with a 4402 pixel box size in RELION 3.1 (Scheres, 2012). The gold-standard Fourier shell correlation (FSC) (0.143 criterion) was used to determine resolution, and B-factors were estimated and applied in RELION 3.1 (Scheres, 2012).

The initial 3D model of the THO–UAP56 complex was determined from the first 30,000 particles from dataset one with an ab initio refinement in cryoSPARC 2.0 (Punjani et al., 2017) using default parameters and two classes. The resultant class one was filtered to 60 Å and used as reference for separate 3D refinements in RELION 3.1 (Scheres, 2012) of all particles from each dataset, in order to align all particle images to a common reference. To increase data set size, we separately extracted two THO–UAP56 tetramer units from each octamer using the particle extraction parameters as above. This was possible since the octamer apparently does not show particle orientations with two tetramers on top of each other (Figure 1—figure supplement 1c), yielding 828,164 and 2,312,366 tetramer units for datasets 1 and 2, respectively. Subsequent 3D classification was carried out without image alignment to identify homogenous particle groups (Figure 1—figure supplement 2). To identify high-quality tetramer units we classified each dataset in 3D into eight classes using a soft-edged mask in the shape of the THO–UAP56 complex monomers 1A and 2B generated with the volume eraser in UCSF Chimera (Pettersen et al., 2004) and RELION 3.1 (Scheres, 2012). 3D class 8 (Round 1a) and class 7 (Round 1b) were selected for their excellent density quality, resulting in 195,098 high-quality tetramer units for subsequent processing, resulting in the final density maps A-E (Figure 1—figure supplements 24). First, we carried out a 3D refinement of the high-quality tetramer data set with the THO–UAP56 monomer 1A/2B mask, yielding a density (map B) with an overall resolution of 3.3 Å and a B-factor of −132 Å2. To better resolve the connecting density to THO–UAP56 monomers 1B and 2A, we prepared a soft mask enveloping the complete tetramer, yielding a refinement from the same particles to 3.9 Å and a B-factor of −176 Å2 (map A). THO–UAP56 monomer 1A THOC2, −3, –5, −7 and UAP56 regions remained poorly resolved, and were subjected to a focused refinement with a soft-edged mask surrounding these parts, to a resolution of 4.6 Å and a B-factor of −235 Å2 (map D). THO–UAP56 monomer 1B THOC5 and THOC6 as well as the four-helix bundle connecting to monomers 1A and 1B were improved in a focused refinement of this region to resolution of 4.7 Å and a B-factor of −244 Å2 (map E). To locally improve the density of THO–UAP56 monomer 2B THOC2–THOC3–UAP56, we performed an additional round of 3D classification (Round 2) yielding a subset of particles that could be subsequently refined to a resolution of 4.7 Å and a B-factor of −160 Å2 (map C).

We used ResMap (Kucukelbir et al., 2014) to estimate the local resolution (Figure 1—figure supplement 3c) and performed 3D variability analysis of the high-quality tetramer units (6.2% of the total data) in cryoSPARC 2.0 (Punjani et al., 2017Figure 4—figure supplement 1d).

Structural modeling

We prepared a composite THO–UAP56 complex model by combining the cryo-EM densities A-E (Figure 1—figure supplements 1d, 2 and 3). The structure was manually built in COOT (Emsley and Cowtan, 2004) using THOC2 MIF4G domain and THOC3 homology models and the previously determined crystal structure of the UAP56 RecA2 lobe (Shi et al., 2004). The model coordinates were refined into the respective sharpened maps B and D in PHENIX (Adams et al., 2010) using the phenix.real_space_refine routine, applying secondary structure and rotamer restraints. We first built monomers 1A and 2B. THOC12B (residues 10–392), THOC22B (residues 164–288), THOC52B (residues 47–227), THOC72B (residues 22–181), THOC51A (tRWD), and THOC61A were modeled into map B. The THOC22B anchor helices were putatively assigned and modeled as poly-alanine with unknown register. To build the THOC21A MIF4G domain (residues 535–687) we first prepared a homology model of the CWC22 MIF4G-like domain crystal structure (Buchwald et al., 2013) (PDB ID 4C9B), and then fitted this into map D and manually adjusted and extended the model to THOC21A residue 288. The THOC21A C-terminal residues 688–1175 (stern) were assigned based on density connectivity in map D, and were modeled as poly-alanine owing to a lower local resolution of ~5–6 Å (Figure 1—figure supplement 3c). Note that the connectivity between THOC21A residues 688–895 remains uncertain. The THOC31A homology model was generated with MODELLER (Webb and Sali, 2017) based on the WDR61 crystal structure (Xu and Min, 2011) (PDB ID 3OW8), rigid-body fitted into map D, and refined in real space using the Relax protocol in ROSETTA (Tyka et al., 2011). The human UAP561A RecA2 structure (Shi et al., 2004) (PDB ID 1XTJ) was rigid-body fitted into map D and the interface with THOC2 was adjusted. To extend monomer 2B, THOC21A (residues 240–1175)–THOC31A–UAP561A were superimposed on THOC22B as a rigid body, and this fit is in agreement with map C. To extend the monomer 1A, refined THOC12B, THOC22B (residues 164–287), THOC52B (residues 47–227), and THOC72B were fitted into map D as rigid bodies. Further, THOC51A (tRWD), and THOC61A were rigid body fitted into map B, into the density of monomer 1B, and adjusted at the THOC51A–THOC51B tRWD domain interface. To generate the complete THO–UAP56 complex model the refined monomer 1A, monomer 1B THOC5 tRWD and THOC6, and monomer 2B models were fitted into map A into the symmetry related positions of monomers 2A, 2B, and 1B, respectively, using COOT (Emsley and Cowtan, 2004). The THOC51A/1B–THOC71A/1B four-helix bundle was modeled as poly-alanine into maps A and E and refined, and then copied as a rigid body into the THOC52A/2B–THOC72A/2B position. ADP refinement was carried out in a composite THO–UAP56 map, generated from the individual maps A-E using phenix.combine_focused_maps (Adams et al., 2010). The final model comprises 28 proteins.

Based on our THO–UAP56 model, we revised the 6.0 Å resolution model of the chimeric yeast THO–Sub2 (PDB ID 5SUQ) crystal structure (Figure 4—figure supplement 3a, Supplementary file 1). We first defined a putative THO–Sub2 monomer A model that comprised residues 8–5289 (chain M) from the crystal structure. We then superimposed this monomer A model onto residues 6170–8550 (chain M) in COOT (Emsley and Cowtan, 2004), showing an excellent fit to the electron density and revealing the presence of a second monomer B. Four pieces of additional evidence validate this re-interpretation of the yeast crystal structure. First, placement of monomer A into the monomer B position could explain unmodeled density (Figure 4—figure supplement 3b). Second, the positions of phosphotungsten are consistent with a dimer architecture: M8701 and M8702 are now bound to equivalent positions in monomer A and B; N8701 binds the monomer B Sub2 RecA2 lobe in an equivalent position to A501 (Figure 4—figure supplement 3b). Third, weak density is observed in the location of the second Sub2 RecA2 lobe. Fourth, completion of the yeast THO-Sub2 dimer model with the monomer B Tho2 Stern, Tex1 and Sub2 fits into the asymmetric unit (Figure 4—figure supplement 3c) and is compatible with the observed crystal packing. The Tho2 C-terminus, corresponding to the human THOC2 Stern, is not resolved in monomer B. This might be due to the flexibility observed between Tho2 MIF4G and Stern domains (joint 2, Figure 4—figure supplement 3d) and the lack of stabilizing crystal packing contacts that are present in monomer A. We also see no density for monomer B Tex1, however, based on the 1:1 stoichiometry of Tex1 (or THOC3) in yeast (Ren et al., 2017) and human THO complexes (Figure 1, Figure 1—figure supplement 1), its presence is likely. Overall, minor differences remain between the yeast and human THO–UAP56 monomer models that may stem from species differences and/or the unique THO–Sub2 architecture captured in the crystal (Ren et al., 2017).

Figures were made with UCSF Chimera X (Goddard et al., 2018) and PyMol (https://www.pymol.org).

Acknowledgements

We thank the Plaschka group for their help and discussions; the Protein Technologies facility at the Vienna BioCenter Core Facilities GmbH (VBCF), a member of the Vienna BioCenter (VBC), for assistance with protein production; the VBCF Electron Microscopy Facility, in particular T Heuser and H Kotisch, for support and maintaining facilities; V-V Hodirnau at Institute of Science and Technology Austria EM facility for cryo-EM data collection; J Zuber and M Hinterndorfer for help with human cell culture and cloning; R Zimmermann and his team for computation support; M Novatchkova for bioinformatics analysis; K Mechtler and his team for mass spectrometry; the in-house Molecular Biology Service for reagents; and C Bernecky, A Pauli, R Pavri, and A Stark for discussions. We also thank C Bernecky, M Elmaghraby, D Haselbach, A Pauli, G Riddihough (Life Science Editors), A Stark, M Vorländer, and members of the Plaschka group for critical reading of the manuscript. We thank BP-F for drawing Figure 4c. UH was supported by JB and an SNF Early Postdoc Mobility fellowship (P2GEP3_188343). JB was supported by the Austrian Academy of Sciences, the European Research Council (ERC-2015-CoG 682181), and the Austrian Science Fund (F4303 and W1207). CP was supported by Boehringer Ingelheim and the European Research Council (ERC-2020-STG 949081 RNApaxport).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Clemens Plaschka, Email: clemens.plaschka@imp.ac.at.

Karsten Weis, ETH Zurich, Switzerland.

James L Manley, Columbia University, United States.

Funding Information

This paper was supported by the following grants:

  • Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung P2GEP3_188343 to Ulrich Hohmann.

  • H2020 European Research Council ERC-2015-CoG 682181 to Julius Brennecke.

  • Austrian Science Fund F4303 and W1207 to Julius Brennecke.

  • Österreichischen Akademie der Wissenschaften to Julius Brennecke.

  • Austrian Academy of Sciences to Julius Brennecke.

  • Boehringer Ingelheim to Clemens Plaschka.

  • H2020 European Research Council ERC-2020-STG 949081 to Clemens Plaschka.

Additional information

Competing interests

No competing interests declared.

Author contributions

Formal analysis, Investigation, Visualization, Writing - review and editing.

Formal analysis, Investigation, Visualization, Writing - review and editing.

Investigation.

Investigation, Visualization.

Investigation.

Supervision, Funding acquisition, Writing - review and editing.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Investigation, Visualization, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Supplementary file 1. Coordinate file of the revised yeast THO–Sub2 complex.
elife-61503-supp1.pdb (1.8MB, pdb)
Transparent reporting form

Data availability

Three-dimensional cryo-EM density maps A, B, C, D, and E have been deposited in the Electron Microscopy Data Bank under the accession numbers EMD-11853, EMD-11857, EMD-11854, EMD-11855, EMD-11856, respectively. The coordinate file of the human THO-UAP56 complex has been deposited in the Protein Data Bank under the accession number 7APK.

The following datasets were generated:

Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. human THO-UAP56 complex. RCSB Protein Data Bank. 7APK

Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map A. Electron Microscopy Data Bank. EMD-11853

Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map B. Electron Microscopy Data Bank. EMD-11857

Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map C. Electron Microscopy Data Bank. EMD-11854

Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density maps D. Electron Microscopy Data Bank. EMD-11855

Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density maps E. Electron Microscopy Data Bank. EMD-11856

References

  1. Adams PD, Afonine PV, Bunkóczi G, Chen VB, Davis IW, Echols N, Headd JJ, Hung L-W, Kapral GJ, Grosse-Kunstleve RW, McCoy AJ, Moriarty NW, Oeffner R, Read RJ, Richardson DC, Richardson JS, Terwilliger TC, Zwart PH. PHENIX : a comprehensive Python-based system for macromolecular structure solution. Acta Crystallographica Section D Biological Crystallography. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Afonine PV, Klaholz BP, Moriarty NW, Poon BK, Sobolev OV, Terwilliger TC, Adams PD, Urzhumtsev A. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallographica Section D Structural Biology. 2018;74:814–840. doi: 10.1107/S2059798318009324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Anazi S, Alshammari M, Moneis D, Abouelhoda M, Ibrahim N, Alkuraya FS. Confirming the candidacy of THOC6 in the etiology of intellectual disability. American Journal of Medical Genetics. Part A. 2016;170A:1367–1369. doi: 10.1002/ajmg.a.37549. [DOI] [PubMed] [Google Scholar]
  4. Buchan DW, Minneci F, Nugent TC, Bryson K, Jones DT. Scalable web services for the PSIPRED protein analysis workbench. Nucleic Acids Research. 2013;41:W349–W357. doi: 10.1093/nar/gkt381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Buchwald G, Schussler S, Basquin C, Le Hir H, Conti E. Crystal structure of the human eIF4AIII-CWC22 complex shows how a DEAD-box protein is inhibited by a MIF4G domain. PNAS. 2013;110:E4611–E4618. doi: 10.1073/pnas.1314684110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Care4Rare Canada Consortium. Amos JS, Huang L, Thevenon J, Kariminedjad A, Beaulieu CL, Masurel-Paulet A, Najmabadi H, Fattahi Z, Beheshtian M, Tonekaboni SH, Tang S, Helbig KL, Alcaraz W, Rivière JB, Faivre L, Innes AM, Lebel RR, Boycott KM. Autosomal recessive mutations in THOC6 cause intellectual disability: syndrome delineation requiring forward and reverse phenotyping. Clinical Genetics. 2017;91:92–99. doi: 10.1111/cge.12793. [DOI] [PubMed] [Google Scholar]
  7. Casey J, Jenkinson A, Magee A, Ennis S, Monavari A, Green A, Lynch SA, Crushell E, Hughes J. Beaulieu-Boycott-Innes syndrome: an intellectual disability syndrome with characteristic facies. Clinical Dysmorphology. 2016;25:146–151. doi: 10.1097/MCD.0000000000000134. [DOI] [PubMed] [Google Scholar]
  8. Chávez S, Beilharz T, Rondón AG, Erdjument-Bromage H, Tempst P, Svejstrup JQ, Lithgow T, Aguilera A. A protein complex containing Tho2, Hpr1, Mft1 and a novel protein, Thp2, connects transcription elongation with mitotic recombination in Saccharomyces cerevisiae. The EMBO Journal. 2000;19:5824–5834. doi: 10.1093/emboj/19.21.5824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Cheng H, Dufu K, Lee CS, Hsu JL, Dias A, Reed R. Human mRNA export machinery recruited to the 5' end of mRNA. Cell. 2006;127:1389–1400. doi: 10.1016/j.cell.2006.10.044. [DOI] [PubMed] [Google Scholar]
  10. Dempster JM, Pacini C, Pantel S, Behan FM, Green T, Krill-Burger J, Beaver CM, Younger ST, Zhivich V, Najgebauer H, Allen F, Gonçalves E, Shepherd R, Doench JG, Yusa K, Vazquez F, Parts L, Boehm JS, Golub TR, Hahn WC, Root DE, Garnett MJ, Tsherniak A, Iorio F. Agreement between two large pan-cancer CRISPR-Cas9 gene dependency data sets. Nature Communications. 2019;10:5817. doi: 10.1038/s41467-019-13805-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Dufu K, Livingstone MJ, Seebacher J, Gygi SP, Wilson SA, Reed R. ATP is required for interactions between UAP56 and two conserved mRNA export proteins, aly and CIP29, to assemble the TREX complex. Genes & Development. 2010;24:2043–2053. doi: 10.1101/gad.1898610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallographica. Section D, Biological Crystallography. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
  13. Engler C, Kandzia R, Marillonnet S. A one pot, one step, precision cloning method with high throughput capability. PLOS ONE. 2008;3:e3647. doi: 10.1371/journal.pone.0003647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. FORGE Canada Consortium. Beaulieu CL, Huang L, Innes AM, Akimenko MA, Puffenberger EG, Schwartz C, Jerry P, Ober C, Hegele RA, McLeod DR, Schwartzentruber J, Majewski J, Bulman DE, Parboosingh JS, Boycott KM. Intellectual disability associated with a homozygous missense mutation in THOC6. Orphanet Journal of Rare Diseases. 2013;8:62. doi: 10.1186/1750-1172-8-62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH, Ferrin TE. UCSF ChimeraX: meeting modern challenges in visualization and analysis. Protein Science. 2018;27:14–25. doi: 10.1002/pro.3235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Golovanov AP, Hautbergue GM, Tintaru AM, Lian LY, Wilson SA. The solution structure of REF2-I reveals interdomain interactions and regions involved in binding mRNA export factors and RNA. RNA. 2006;12:1933–1948. doi: 10.1261/rna.212106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gromadzka AM, Steckelberg A-L, Singh KK, Hofmann K, Gehring NH. A short conserved motif in ALYREF directs cap- and EJC-dependent assembly of export complexes on spliced mRNAs. Nucleic Acids Research. 2016;44:2348–2361. doi: 10.1093/nar/gkw009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hautbergue GM, Hung ML, Golovanov AP, Lian LY, Wilson SA. Mutually exclusive interactions drive handover of mRNA from export adaptors to TAP. PNAS. 2008;105:5154–5159. doi: 10.1073/pnas.0709167105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hautbergue GM, Hung ML, Walsh MJ, Snijders AP, Chang CT, Jones R, Ponting CP, Dickman MJ, Wilson SA. UIF, a new mRNA export adaptor that works together with REF/ALY, requires FACT for recruitment to mRNA. Current Biology. 2009;19:1918–1924. doi: 10.1016/j.cub.2009.09.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Heath CG, Viphakone N, Wilson SA. The role of TREX in gene expression and disease. Biochemical Journal. 2016;473:2911–2935. doi: 10.1042/BCJ20160010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hilbert M, Kebbel F, Gubaev A, Klostermeier D. eIF4G stimulates the activity of the DEAD box protein eIF4A by a conformational guidance mechanism. Nucleic Acids Research. 2011;39:2260–2270. doi: 10.1093/nar/gkq1127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hobeika M, Brockmann C, Iglesias N, Gwizdek C, Neuhaus D, Stutz F, Stewart M, Divita G, Dargemont C. Coordination of Hpr1 and ubiquitin binding by the UBA domain of the mRNA export factor Mex67. Molecular Biology of the Cell. 2007;18:2561–2568. doi: 10.1091/mbc.e07-02-0153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kastner B, Fischer N, Golas MM, Sander B, Dube P, Boehringer D, Hartmuth K, Deckert J, Hauer F, Wolf E, Uchtenhagen H, Urlaub H, Herzog F, Peters JM, Poerschke D, Lührmann R, Stark H. GraFix: sample preparation for single-particle electron cryomicroscopy. Nature Methods. 2008;5:53–55. doi: 10.1038/nmeth1139. [DOI] [PubMed] [Google Scholar]
  24. Katahira J, Inoue H, Hurt E, Yoneda Y. Adaptor aly and co-adaptor Thoc5 function in the Tap-p15-mediated nuclear export of HSP70 mRNA. The EMBO Journal. 2009;28:556–567. doi: 10.1038/emboj.2009.5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Köhler A, Hurt E. Exporting RNA from the nucleus to the cytoplasm. Nature Reviews Molecular Cell Biology. 2007;8:761–773. doi: 10.1038/nrm2255. [DOI] [PubMed] [Google Scholar]
  26. Kucukelbir A, Sigworth FJ, Tagare HD. Quantifying the local resolution of cryo-EM density maps. Nature Methods. 2014;11:63–65. doi: 10.1038/nmeth.2727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kumar R, Corbett MA, van Bon BW, Woenig JA, Weir L, Douglas E, Friend KL, Gardner A, Shaw M, Jolly LA, Tan C, Hunter MF, Hackett A, Field M, Palmer EE, Leffler M, Rogers C, Boyle J, Bienek M, Jensen C, Van Buggenhout G, Van Esch H, Hoffmann K, Raynaud M, Zhao H, Reed R, Hu H, Haas SA, Haan E, Kalscheuer VM, Gecz J. THOC2 mutations implicate mRNA-Export pathway in X-Linked intellectual disability. The American Journal of Human Genetics. 2015;97:302–310. doi: 10.1016/j.ajhg.2015.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kumar R, Gardner A, Homan CC, Douglas E, Mefford H, Wieczorek D, Lüdecke HJ, Stark Z, Sadedin S, Nowak CB, Douglas J, Parsons G, Mark P, Loidi L, Herman GE, Mihalic Mosher T, Gillespie MK, Brady L, Tarnopolsky M, Madrigal I, Eiris J, Domènech Salgado L, Rabionet R, Strom TM, Ishihara N, Inagaki H, Kurahashi H, Dudding-Byth T, Palmer EE, Field M, Gecz J, Broad CMG Severe neurocognitive and growth disorders due to variation in THOC2, an essential component of nuclear mRNA export machinery. Human Mutation. 2018;39:1126–1138. doi: 10.1002/humu.23557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kumar R, Palmer E, Gardner AE, Carroll R, Banka S, Abdelhadi O, Donnai D, Elgersma Y, Curry CJ, Gardham A, Suri M, Malla R, Brady LI, Tarnopolsky M, Azmanov DN, Atkinson V, Black M, Baynam G, Dreyer L, Hayeems RZ, Marshall CR, Costain G, Wessels MW, Baptista J, Drummond J, Leffler M, Field M, Gecz J. Expanding clinical presentations due to variations in THOC2 mRNA nuclear export factor. Frontiers in Molecular Neuroscience. 2020;13:12. doi: 10.3389/fnmol.2020.00012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Linder P, Jankowsky E. From unwinding to clamping - the DEAD box RNA helicase family. Nature Reviews Molecular Cell Biology. 2011;12:505–516. doi: 10.1038/nrm3154. [DOI] [PubMed] [Google Scholar]
  31. Luna R, Rondón AG, Pérez-Calero C, Salas-Armenteros I, Aguilera A. The THO complex as a paradigm for the prevention of cotranscriptional R-Loops. Cold Spring Harbor Symposia on Quantitative Biology; 2019. pp. 105–114. [DOI] [PubMed] [Google Scholar]
  32. Mastronarde DN. Automated electron microscope tomography using robust prediction of specimen movements. Journal of Structural Biology. 2005;152:36–51. doi: 10.1016/j.jsb.2005.07.007. [DOI] [PubMed] [Google Scholar]
  33. Mathys H, Basquin J, Ozgur S, Czarnocki-Cieciura M, Bonneau F, Aartse A, Dziembowski A, Nowotny M, Conti E, Filipowicz W. Structural and biochemical insights to the role of the CCR4-NOT complex and DDX6 ATPase in microRNA repression. Molecular Cell. 2014;54:751–765. doi: 10.1016/j.molcel.2014.03.036. [DOI] [PubMed] [Google Scholar]
  34. Mattioli F, Isidor B, Abdul-Rahman O, Gunter A, Huang L, Kumar R, Beaulieu C, Gecz J, Innes M, Mandel JL, Piton A. Clinical and functional characterization of recurrent missense variants implicated in THOC6-related intellectual disability. Human Molecular Genetics. 2019;28:952–960. doi: 10.1093/hmg/ddy391. [DOI] [PubMed] [Google Scholar]
  35. Mayeda A, Krainer AR. Preparation of HeLa cell nuclear and cytosolic S100 extracts for in vitro splicing. Methods in Molecular Biology. 1999;118:309–314. doi: 10.1385/1-59259-676-2:309. [DOI] [PubMed] [Google Scholar]
  36. Merz C, Urlaub H, Will CL, Lührmann R. Protein composition of human mRNPs spliced in vitro and differential requirements for mRNP protein recruitment. RNA. 2007;13:116–128. doi: 10.1261/rna.336807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Mitchell AL, Attwood TK, Babbitt PC, Blum M, Bork P, Bridge A, Brown SD, Chang HY, El-Gebali S, Fraser MI, Gough J, Haft DR, Huang H, Letunic I, Lopez R, Luciani A, Madeira F, Marchler-Bauer A, Mi H, Natale DA, Necci M, Nuka G, Orengo C, Pandurangan AP, Paysan-Lafosse T, Pesseat S, Potter SC, Qureshi MA, Rawlings ND, Redaschi N, Richardson LJ, Rivoire C, Salazar GA, Sangrador-Vegas A, Sigrist CJA, Sillitoe I, Sutton GG, Thanki N, Thomas PD, Tosatto SCE, Yong SY, Finn RD. InterPro in 2019: improving coverage, classification and access to protein sequence annotations. Nucleic Acids Research. 2019;47:D351–D360. doi: 10.1093/nar/gky1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Montpetit B, Thomsen ND, Helmke KJ, Seeliger MA, Berger JM, Weis K. A conserved mechanism of DEAD-box ATPase activation by nucleoporins and InsP6 in mRNA export. Nature. 2011;472:238–242. doi: 10.1038/nature09862. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Montpetit B, Seeliger MA, Weis K. Analysis of DEAD-box proteins in mRNA export. Methods in Enzymology. 2012;511:239–254. doi: 10.1016/B978-0-12-396546-2.00011-5. [DOI] [PubMed] [Google Scholar]
  40. Peña A, Gewartowski K, Mroczek S, Cuéllar J, Szykowska A, Prokop A, Czarnocki-Cieciura M, Piwowarski J, Tous C, Aguilera A, Carrascosa JL, Valpuesta JM, Dziembowski A. Architecture and nucleic acids recognition mechanism of the THO complex, an mRNP assembly factor. The EMBO Journal. 2012;31:1605–1616. doi: 10.1038/emboj.2012.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Pérez-Alvarado GC, Martínez-Yamout M, Allen MM, Grosschedl R, Dyson HJ, Wright PE. Structure of the nuclear factor ALY: insights into post-transcriptional regulatory and mRNA nuclear export processes. Biochemistry. 2003;42:7348–7357. doi: 10.1021/bi034062o. [DOI] [PubMed] [Google Scholar]
  42. Pérez-Calero C, Bayona-Feliu A, Xue X, Barroso SI, Muñoz S, González-Basallote VM, Sung P, Aguilera A. UAP56/DDX39B is a major cotranscriptional RNA-DNA helicase that unwinds harmful R loops genome-wide. Genes & Development. 2020;34:898–912. doi: 10.1101/gad.336024.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE. UCSF Chimera?A visualization system for exploratory research and analysis. Journal of Computational Chemistry. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
  44. Punjani A, Rubinstein JL, Fleet DJ, Brubaker MA. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nature Methods. 2017;14:290–296. doi: 10.1038/nmeth.4169. [DOI] [PubMed] [Google Scholar]
  45. Ren Y, Schmiege P, Blobel G. Structural and biochemical analyses of the DEAD-box ATPase Sub2 in association with THO or Yra1. eLife. 2017;6:e20070. doi: 10.7554/eLife.20070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Robert X, Gouet P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Research. 2014;42:W320–W324. doi: 10.1093/nar/gku316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Scheres SH. RELION: implementation of a bayesian approach to cryo-EM structure determination. Journal of Structural Biology. 2012;180:519–530. doi: 10.1016/j.jsb.2012.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Sharif H, Ozgur S, Sharma K, Basquin C, Urlaub H, Conti E. Structural analysis of the yeast Dhh1-Pat1 complex reveals how Dhh1 engages Pat1, Edc3 and RNA in mutually exclusive interactions. Nucleic Acids Research. 2013;41:8377–8390. doi: 10.1093/nar/gkt600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Shen J, Zhang L, Zhao R. Biochemical characterization of the ATPase and helicase activity of UAP56, an essential pre-mRNA splicing and mRNA export factor. Journal of Biological Chemistry. 2007;282:22544–22550. doi: 10.1074/jbc.M702304200. [DOI] [PubMed] [Google Scholar]
  50. Shi H, Cordin O, Minder CM, Linder P, Xu RM. Crystal structure of the human ATP-dependent splicing and export factor UAP56. PNAS. 2004;101:17628–17633. doi: 10.1073/pnas.0408172101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Shi M, Zhang H, Wu X, He Z, Wang L, Yin S, Tian B, Li G, Cheng H. ALYREF mainly binds to the 5' and the 3' regions of the mRNA in vivo. Nucleic Acids Research. 2017;45:9640–9653. doi: 10.1093/nar/gkx597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG. Fast, scalable generation of high-quality protein multiple sequence alignments using clustal omega. Molecular Systems Biology. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Singh G, Pratt G, Yeo GW, Moore MJ. The clothes make the mRNA: past and present trends in mRNP fashion. Annual Review of Biochemistry. 2015;84:325–354. doi: 10.1146/annurev-biochem-080111-092106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Research. 2005;33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Stewart M. Polyadenylation and nuclear export of mRNAs. Journal of Biological Chemistry. 2019;294:2977–2987. doi: 10.1074/jbc.REV118.005594. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Strässer K, Masuda S, Mason P, Pfannstiel J, Oppizzi M, Rodriguez-Navarro S, Rondón AG, Aguilera A, Struhl K, Reed R, Hurt E. TREX is a conserved complex coupling transcription with messenger RNA export. Nature. 2002;417:304–308. doi: 10.1038/nature746. [DOI] [PubMed] [Google Scholar]
  57. Strässer K, Hurt E. Yra1p, a conserved nuclear RNA-binding protein, interacts directly with Mex67p and is required for mRNA export. The EMBO Journal. 2000;19:410–420. doi: 10.1093/emboj/19.3.410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Strässer K, Hurt E. Splicing factor Sub2p is required for nuclear mRNA export through its interaction with Yra1p. Nature. 2001;413:648–652. doi: 10.1038/35098113. [DOI] [PubMed] [Google Scholar]
  59. Stutz F, Bachi A, Doerks T, Braun IC, Séraphin B, Wilm M, Bork P, Izaurralde E. REF, an evolutionary conserved family of hnRNP-like proteins, interacts with TAP/Mex67p and participates in mRNA nuclear export. RNA. 2000;6:638–650. doi: 10.1017/S1355838200000078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Taniguchi I, Ohno M. ATP-dependent recruitment of export factor aly/REF onto intronless mRNAs by RNA helicase UAP56. Molecular and Cellular Biology. 2008;28:601–608. doi: 10.1128/MCB.01341-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Tegunov D, Cramer P. Real-time cryo-electron microscopy data preprocessing with warp. Nature Methods. 2019;16:1146–1152. doi: 10.1038/s41592-019-0580-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Trowitzsch S, Bieniossek C, Nie Y, Garzoni F, Berger I. New baculovirus expression tools for recombinant protein complex production. Journal of Structural Biology. 2010;172:45–54. doi: 10.1016/j.jsb.2010.02.010. [DOI] [PubMed] [Google Scholar]
  63. Tyka MD, Keedy DA, André I, Dimaio F, Song Y, Richardson DC, Richardson JS, Baker D. Alternate states of proteins revealed by detailed energy landscape mapping. Journal of Molecular Biology. 2011;405:607–618. doi: 10.1016/j.jmb.2010.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Viphakone N, Hautbergue GM, Walsh M, Chang C-T, Holland A, Folco EG, Reed R, Wilson SA. TREX exposes the RNA-binding domain of Nxf1 to enable mRNA export. Nature Communications. 2012;3:2005. doi: 10.1038/ncomms2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Viphakone N, Cumberbatch MG, Livingstone MJ, Heath PR, Dickman MJ, Catto JW, Wilson SA. Luzp4 defines a new mRNA export pathway in cancer cells. Nucleic Acids Research. 2015;43:2353–2366. doi: 10.1093/nar/gkv070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Viphakone N, Sudbery I, Griffith L, Heath CG, Sims D, Wilson SA. Co-transcriptional loading of RNA export factors shapes the human transcriptome. Molecular Cell. 2019;75:310–323. doi: 10.1016/j.molcel.2019.04.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Webb B, Sali A. Protein structure modeling with MODELLER. Methods in Molecular Biology. 2017;1654:39–54. doi: 10.1007/978-1-4939-7231-9_4. [DOI] [PubMed] [Google Scholar]
  68. Xie Y, Ren Y. Mechanisms of nuclear mRNA export: a structural perspective. Traffic. 2019;20:829–840. doi: 10.1111/tra.12691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Xu C, Min J. Structure and function of WD40 domain proteins. Protein & Cell. 2011;2:202–214. doi: 10.1007/s13238-011-1018-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Zenklusen D, Vinciguerra P, Strahm Y, Stutz F. The yeast hnRNP-Like proteins Yra1p and Yra2p participate in mRNA export through interaction with Mex67p. Molecular and Cellular Biology. 2001;21:4219–4232. doi: 10.1128/MCB.21.13.4219-4232.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Karsten Weis1

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The manuscript describes the structure of the human THO-UAP56 complex determined by cryo-EM analysis. This is the first high-resolution structure of this large complex and this study will be of significance to the mRNP biogenesis and "R loop" fields providing an important basis for future mechanistic studies.

Decision letter after peer review:

Thank you for submitting your article "Human THO-UAP56 complex structure reveals multivalent interactions aid mRNA nuclear export" for consideration by eLife. Your article has been reviewed by two peer reviewers, and the evaluation has been overseen by a Reviewing Editor and James Manley as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

The manuscript entitled, "Human THO-UAP56 complex structure reveals multivalent interactions aid mRNA nuclear export", by Pühringer et al. describes the structure of the human THO-UAP56 complex determined by cryo-EM analysis. The six human THO proteins and the UAP56 helicase build up a 28-subunit tetramer assembly consisting of two asymmetric dimers. Modeling the export adapter ALY/REF into the structure suggests that a single ALY/REF protein bridges two proximal UAP56 helicases. The authors do not present biochemical experiments, e.g., using structure-based mutants, to underline the functional significance of the determined structure or test the validity of the proposed mechanism. However, since this is the first high-resolution structure of the large THO-UAP56 complex this study will be of significance to the mRNP biogenesis and "R loop" fields and will provide an important basis for new mechanistic studies.

Essential revisions:

1) The role of CHTOP

An interesting observation from this work is that THO-Sub2 from yeast is dimeric whereas human THO-UAP56 is a tetramer. Whilst the present structure explains this difference in molecular terms it doesn't really address the purpose of the tetramerisation. Did the authors consider the action of the mRNA export co-adaptor and TREX subunit CHTOP in the human mRNA export process? CHTOP binds the central NTF2L domain of NXF1 and together with ALYREF drives NXF1 into an open conformation which allows it to bind RNA (https://pubmed.ncbi.nlm.nih.gov/22893130/) and (https://pubmed.ncbi.nlm.nih.gov/23299939/) and (https://pubmed.ncbi.nlm.nih.gov/31104896/). Importantly, CHTOP has tandem UAP56 binding motifs separated by only a few amino acids so if it were to bind two UAP56 molecules they would presumably need to be very close, as two of them are in this structure. Are the UAP56 molecules close enough though to simultaneously bind the CHTOP tandem UBMs? The authors should look into this. CHTOP also activates the helicase and ATPase activities of UAP56 just like ALYREF. Efficient loading of NXF1 onto the mRNP needs both ALYREF and CHTOP, so this tetrameric structure might provide that key platform allowing one UAP56 juxtaposed dimer to deliver ALYREF to RNA and the other to deliver CHTOP.

In yeast, the central NTF2L domain has an unusual insertion which allows it to directly bind RNA (https://pubmed.ncbi.nlm.nih.gov/17434126/) not present in human NXF1 and there are no reports of co-adaptor like activities in yeast binding this NTF2L domain so perhaps THO-Sub2 is dimeric because it doesn't need to recruit a CHTOP like molecule, just Yra1p? The authors should consider these ideas in their revision and certainly explore how CHTOP might bind UAP56 in their structure given its tightly juxtaposed tandem arrangement of UBMs.

2) The title is grammatically awkward and should be changed.

3) It would be easier to understand the structure of the complex, if the monomer was discussed first, followed by a description of the tetramer.

eLife. 2020 Nov 16;9:e61503. doi: 10.7554/eLife.61503.sa2

Author response


Essential revisions:

1) The role of CHTOP

An interesting observation from this work is that THO-Sub2 from yeast is dimeric whereas human THO-UAP56 is a tetramer. Whilst the present structure explains this difference in molecular terms it doesn't really address the purpose of the tetramerisation. Did the authors consider the action of the mRNA export co-adaptor and TREX subunit CHTOP in the human mRNA export process? CHTOP binds the central NTF2L domain of NXF1 and together with ALYREF drives NXF1 into an open conformation which allows it to bind RNA (https://pubmed.ncbi.nlm.nih.gov/22893130/) and (https://pubmed.ncbi.nlm.nih.gov/23299939/) and (https://pubmed.ncbi.nlm.nih.gov/31104896/). Importantly, CHTOP has tandem UAP56 binding motifs separated by only a few amino acids so if it were to bind two UAP56 molecules they would presumably need to be very close, as two of them are in this structure. Are the UAP56 molecules close enough though to simultaneously bind the CHTOP tandem UBMs? The authors should look into this. CHTOP also activates the helicase and ATPase activities of UAP56 just like ALYREF. Efficient loading of NXF1 onto the mRNP needs both ALYREF and CHTOP, so this tetrameric structure might provide that key platform allowing one UAP56 juxtaposed dimer to deliver ALYREF to RNA and the other to deliver CHTOP.

In yeast, the central NTF2L domain has an unusual insertion which allows it to directly bind RNA (https://pubmed.ncbi.nlm.nih.gov/17434126/) not present in human NXF1 and there are no reports of co-adaptor like activities in yeast binding this NTF2L domain so perhaps THO-Sub2 is dimeric because it doesn't need to recruit a CHTOP like molecule, just Yra1p? The authors should consider these ideas in their revision and certainly explore how CHTOP might bind UAP56 in their structure given its tightly juxtaposed tandem arrangement of UBMs.

We thank the reviewers for their suggestions and hypothesis regarding CHTOP, and apologize for its omission in the earlier text. We now mention CHTOP and the additional UBM-containing export adapters in the Introduction and expanded their discussion in the subsection “TREX complex and ALYREF function”. We wish to stress that in this study we primarily focused on ALYREF due to its high conservation from yeast to humans, ubiquitous expression in human tissues, and the available structural and biochemical data from yeast and human systems.

The tandem UBMs in CHTOP are very interesting, but we think that they are very unlikely to bridge two UAP56 helicases within the THO–UAP56 tetramer (now also included in Figure 4—figure supplement 2C as suggested below). A secondary structure prediction using PsiPred (Buchan and Jones, 2019) suggests that both CHTOP UBMs are part of one long helix. A comparison of the CHTOP UBMs to the Yra1 C-terminal UBM bound to Sub2 (PDB-ID: 5sup, Ren, Schmiege and Blobel, 2017) reveals a spacing of seven residues between the anchoring tyrosine of the first UBM (CHTOP Tyr223) and the first tightly bound residue of the second UBM (CHTOP Leu231). Due to the rotation of the two UBMs along the helical axis it may indeed be possible that two UAP56 molecules bind simultaneously to CHTOP, however, the spacing is too close to span the distance between the RecA1 lobes of two UAP56 molecules in the context of the THO–UAP56 tetramer. The THO–UAP56 tetramer architecture does, however, suggest that the multiplicity of UBM binding sites may increase the affinity of UBM-containing proteins for their target mRNA substrates. We summarized these ideas in the subsection “TREX complex and ALYREF function”.

As the reviewers point out, we were also very intrigued to find a tetrameric human and dimeric yeast THO architecture. The reason for this architectural difference remains unclear, though we speculate it may reflect the complexity in gene architecture in complex eukaryotes. For the reasons below, we do not think that the dimer/tetramer organisation reflects a dependence on other export adapters, like CHTOP, however, the tetramer may have allowed for their emergence. First, the loop in the NTF2L domain, residues 407 and 436, in S. cerevisiae Mex67p is present only in S. cerevisiae, but absent in S. pombe, worms, plants, insects and vertebrates. In contrast, the 6-subunit tetrameric THO assembly is present in insects, worms, and plants but not in S. pombe (which lacks the THOC5 tRWD domain as well as THOC6, resulting in a dimeric THO complex). Second, CHTOP is not conserved outside of vertebrates. Thus, the presence of a tetrameric THO complex and CHTOP as well as the absence of the S. cerevisiae-specific loop in Mex67 do not seem to be related. We hope that knowledge of the tetrameric THO complex will inform further experiments to study the function of CHTOP. Within our group we are excited to address the potential roles of further export adapters and, in particular, the recruitment of NXF1 to the THO complex in future studies.

2) The title is grammatically awkward and should be changed.

We have changed the title in our revised manuscript. It now reads: “Structure of the human core transcription-export complex reveals a hub for multivalent interactions”. We reason that since all TREX complexes are thought to contain THO and UAP56, but as the reviewers point out, not necessarily ALYREF, we can consider THO–UAP56 the TREX core.

3) It would be easier to understand the structure of the complex, if the monomer was discussed first, followed by a description of the tetramer.

We thank the reviewers for this suggestion. While preparing the current manuscript we discussed and tested various orders of describing the THO-UAP complex structure. We believe this to be the more accessible route (tetramer to monomer), but we appreciate that parts of the structural description in the subsection “THO–UAP56 structure” were not easy to follow. To address this, we edited several key parts in the revised version to improve readability (subsection “THO–UAP56 structure”). In the new version, we first briefly describe the monomer, before delving into the dimer/tetramer organisation, and then returning to the monomer. Starting out with the detailed dimer/tetramer organization allows us to later focus fully on the monomer, the more conserved THOC1, -2, -3, UAP56 subunits, NXF1, and ALYREF, and we hope the reviewers agree with this revised version.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. human THO-UAP56 complex. RCSB Protein Data Bank. 7APK [DOI] [PMC free article] [PubMed]
    2. Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map A. Electron Microscopy Data Bank. EMD-11853
    3. Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map B. Electron Microscopy Data Bank. EMD-11857
    4. Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map C. Electron Microscopy Data Bank. EMD-11854
    5. Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density maps D. Electron Microscopy Data Bank. EMD-11855
    6. Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density maps E. Electron Microscopy Data Bank. EMD-11856

    Supplementary Materials

    Supplementary file 1. Coordinate file of the revised yeast THO–Sub2 complex.
    elife-61503-supp1.pdb (1.8MB, pdb)
    Transparent reporting form

    Data Availability Statement

    Three-dimensional cryo-EM density maps A, B, C, D, and E have been deposited in the Electron Microscopy Data Bank under the accession numbers EMD-11853, EMD-11857, EMD-11854, EMD-11855, EMD-11856, respectively. The coordinate file of the human THO-UAP56 complex has been deposited in the Protein Data Bank under the accession number 7APK.

    The following datasets were generated:

    Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. human THO-UAP56 complex. RCSB Protein Data Bank. 7APK

    Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map A. Electron Microscopy Data Bank. EMD-11853

    Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map B. Electron Microscopy Data Bank. EMD-11857

    Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density map C. Electron Microscopy Data Bank. EMD-11854

    Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density maps D. Electron Microscopy Data Bank. EMD-11855

    Pühringer T, Hohmann U, Fin L, Pacheco-Fiallos B, Schellhaas U, Brennecke J, Plaschka C. 2020. Three-dimensional cryo-EM density maps E. Electron Microscopy Data Bank. EMD-11856


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES