Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2012 Nov 28;288(4):2111–2117. doi: 10.1074/jbc.M112.430678

Yeast Nab3 Protein Contains a Self-assembly Domain Found in Human Heterogeneous Nuclear Ribonucleoprotein-C (hnRNP-C) That Is Necessary for Transcription Termination*

Travis J Loya 1, Thomas W O'Rourke 1, Daniel Reines 1,1
PMCID: PMC3554884  PMID: 23192344

Background: Termination factor Nab3 binds RNA, but its function and structure are poorly understood.

Results: The carboxyl terminus of Nab3 functions as a self-assembly region important for termination.

Conclusion: Nab3 resembles hnRNP-C in this new function, indicating that it is related to mammalian RNA-packaging proteins.

Significance: Nab3 coats RNA to form a higher order assembly due to interactions among RNA, RNA polymerase, and hnRNPs.

Keywords: Polyglutamine, RNA-binding Protein, RNA-Protein Interaction, Transcription Termination, Yeast, Nab3, hnRNP

Abstract

Nab3 is an RNA-binding protein whose function is important for terminating transcription by RNA polymerase II. It co-assembles with Nrd1, and the resulting heterodimer of these heterogeneous nuclear ribonucleoprotein-C (hnRNP)-like proteins interacts with the nascent transcript and RNA polymerase II. Previous genetic analysis showed that a short carboxyl-terminal region of Nab3 is functionally important for termination and is located far from the Nab3 RNA recognition domain in the primary sequence. The domain is structurally homologous to hnRNP-C from higher organisms. Here we provide biochemical evidence that this short region is sufficient to enable self-assembly of Nab3 into a tetrameric form in a manner similar to the cognate region of human hnRNP-C. Within this region, there is a stretch of low complexity protein sequence (16 glutamines) adjacent to a putative α-helix that potentiates the ability of the conserved region to self-assemble. The glutamine stretch and the final 18 amino acids of Nab3 are both important for termination in living yeast cells. The findings herein describe an additional avenue by which these hnRNP-like proteins can polymerize on target transcripts. This process is independent of, but acts in concert with, the interactions of the proteins with RNA and RNA polymerase and extends the relationship of Nab3 as a functional orthologue of a higher eukaryotic hnRNP.

Introduction

Saccharomyces cerevisiae Nab3 is an hnRNP2-like, RNA-binding protein that is involved in the termination of transcription of small, noncoding RNAs such as snRNAs and cryptic unstable (CUT) RNAs such as that involved in regulation of the IMD2 gene (1, 2). IMD2 encodes IMP dehydrogenase, an enzyme important for GTP synthesis. Cellular GTP levels control its expression. When GTP is abundant, RNA polymerase II initiates transcription at an upstream location where the incorporation of tandem guanines is called for by the template. Just downstream, a terminator is encountered that results in a short CUT RNA. When GTP concentrations are too low to support this start site choice, RNA polymerase II initiates with an adenylate at +1 that is downstream of the terminator. This terminator bypass results in synthesis of IMD2 mRNA in GTP-depleted states.

During termination, Nab3 associates with Nrd1, another hnRNP-like protein that also contains a conserved RNA recognition motif (RRM) (3). One model of transcription termination suggests that this complex associates with the repeated carboxyl-terminal domain of the large subunit of RNA polymerase II through the Nrd1 subunit (4). As well, Nab3 and Nrd1 recognize specific RNA sequences in target RNA enabling the proteins to associate directly with RNA (4). The number and arrangement of such sites vary from substrate to substrate. Thus, nascent RNA destined for termination by this mechanism accumulates a coating of Nrd1 and Nab3 proteins during RNA chain elongation, marking it for the specific mode of termination that characterizes this class of RNA. The process also involves the putative RNA helicase Sen1 and the TRAMP complex, both of which interact with the Nrd1/Nab3 proteins to activate termination and RNA processing (2).

Whereas Nrd1 appears to be the yeast orthologue of human hnRNPs SCAF8 and SCAF4 (35) and it contains a CTD-interacting domain and an RRM, Nab3 is less obviously related to a particular mammalian hnRNP protein based upon sequence homology. Like a number of other hnRNPs, it contains amino acid repeats and tracts of homopolymeric residues including aspartic acid/glutamic acid and proline/glutamine rich domains that have not yet been assigned function. Polyglutamine stretches are of biomedical interest because a variety of proteins suffer mutations that lengthen the homopolymeric tract resulting in multiple human diseases (6). Whereas long tracts are associated with protein aggregation, wild type runs of polyglutamine in proteins are important for protein function where they can augment protein-protein interactions depending upon their length and the sequences adjacent to them (6, 7). In particular, polyglutamine regions can stabilize the co-assembly of adjacent coiled-coil structures (8, 9).

Little is known of the function of most of the essential Nab3 polypeptide outside of the RRM and Nrd1-interaction domains, including the numerous runs of consecutive glutamines and other amino acids. Furthermore, the protein is difficult to express in recombinant form to study its function as a specialized ribonucleoprotein complex (10).

Recently, we revealed a role for a small segment of the carboxyl-terminal “tail” of Nab3 in transcription termination at the CUT associated with IMD2 regulation (1, 11). From spontaneous mutations in yeast, we learned that the final 19 amino acids at the Nab3 carboxyl terminus are important for function. Interestingly, deletion of the last three amino acids recapitulated the phenotype observed when the entire tail was lost, and even deletion of only the last amino acid of Nab3 resulted in significant loss of function (11). Furthermore, this region shares structural homology with an α-helical segment of mammalian hnRNP-C1 (and hnRNP-C2) that enables it to tetramerize (12). Curiously, Nab3 possesses a tract of 16 glutamines of unknown function adjacent to this hnRNP-C-like tail. Polyglutamine tracts in proteins, and other simple amino acid repeats, have been implicated in the function of adjacent secondary structural elements and protein-protein interaction domains (9).

Here we show that the yeast Nab3 tail domain is sufficient to enable assembly of recombinant fusion proteins into higher order multimers, much as the hnRNP-C region does. The stretch of glutamines adjacent to the terminal α-helix facilitates its role as a multimerization domain and incapacitates the role of Nab3 in termination when lost. These results provide evidence in favor of a model that suggests that deposition of multiple Nrd1-Nab3 heterodimers onto RNA substrates is important for the packaging of substrate RNAs and for the optimal functioning of these yeast hnRNP-like proteins in transcription termination and RNA degradation/processing.

MATERIALS AND METHODS

Plasmid Construction, Protein Expression, and Purification

Using oligonucleotides or PCR, portions of the carboxyl-terminal sequences of Nab3 were added to the carboxyl terminus of thioredoxin for expression in Escherichia coli from the pET32a vector. The oligonucleotide pair 5′-gatccggcgggcaacaacgtgcagagcctgctggatagcctggcgaaactgcagaaataac-3′ and 5′-tcgagttatttctgcagtttcgccaggctatccagcaggctctgcacgttgttgcccgccg-3′ encoding the final 18 amino acids of Nab3 were annealed, phosphorylated, and ligated into the XhoI and BglII sites of pET32a to generate pET32a18aa. A PCR product encoding the final 40 amino acids was generated from the plasmid pRS315Nab3FL (11) using oligonucleotides 5′-atatagatctacaccaacctccgcc-3′ and 5′-atatctcgagctatttttgtagttttgctaaac-3′, digested with BglII and XhoI, and inserted into similarly cut pET32a to yield pET32a40aa. pET32a-tov1 and pET32a-tov4 were generated by inserting into pET32a XhoI- and BglII-digested PCR products generated from genomic DNA from tov1 and tov4 yeast strains (11), respectively, using the oligonucleotides 5′-atatagatctacaccaacctccgcc-3′ and 5′-atatctcgagctatttttgtagttttgctaaac-3′.

Plasmids were introduced into BL21(DE3) E. coli, and protein expression was induced with isopropylthiogalactoside for 3 h at 37 °C. Lysozyme-treated cells were broken by sonication, and the lysate was clarified by centrifugation for 30 min at 27,000 × g. The resulting His6-tagged thioredoxin fusion proteins were bound to immobilized nickel at 10 mm imidazole and eluted at 250 mm. They were then dialyzed into 20 mm Tris, pH 7.5, 50 mm NaCl, 1 mm EDTA and chromatographed onto QAE-Sephadex. Flow-through protein was collected and concentrated by centrifugal filtration as needed. Where indicated, the Nab3 polypeptides were excised from the carboxyl terminus of the thioredoxin fusion protein using thrombin. A second round of nickel chromatography was used to separate the His6-tagged thioredoxin from the free Nab3 polypeptide.

Mutagenesis, Yeast Strain Construction, and Flow Cytometry

To generate the L800A and Q801A point mutants, an intermediate plasmid containing a portion of NAB3 (pBS3′Nab3 (11)) was linearized with XhoI and BspMI, and the oligonucleotide pairs 5′-aataatgttcaaagtctattagatagtttagcaaaagctcaaaaatagc-3′ and 5′-tcgagctatttttgagcttttgctaaactatctaatagactttgaacat-3′ (for L800A) or 5′-aataatgttcaaagtctattagatagtttagcaaaactagctaaatagc-3′ and 5′-tcgagctatttagctagttttgctaaactatctaatagactttgaacat-3′ (for Q801A) were annealed, phosphorylated, and ligated into it. The resulting Nab3 region was excised from the respective intermediates with HindIII and XhoI and inserted into similarly cut pRS315Nab3 to generate pRS315-Nab3-L800A and pRS315-Nab3-Q801A.

pRS315-Nab3 was the starting plasmid from which the Gln769 to Gln783 polyglutamine stretch was deleted by in vitro replication with Phusion DNA polymerase (New England Biolabs) and mutagenic oligonucleotides 5′-cctgctggcaataatgttcaaagtctatta-3′ and 5′-aggtggcggaggttggtgtgac-3′. Replicated DNA was used to transform E. coli, and the loss of the region encoding the glutamine stretch in the new plasmid (pRS315-Nab3-NoQ) was confirmed by DNA sequencing.

The strains used in this study are presented in Table 1. The LEU2-marked plasmids pRS315-Nab3-L800A, pRS315-Nab3-Q801A, and pRS315-Nab3-NoQ were introduced independently into a Δnab3 strain that also contained the plasmid pRS316-nab3-11 (DY30229 (11)). Following growth on SC ura leu plates, colonies were grown at 30º on SC leu and tested for loss of the URA3-bearing plasmid by growth on 5-fluoroorotic acid. Following this plasmid shuffle, yeast were then transformed with the GAL-terminator-GFP reporter plasmid described previously (11) and were named DY3126, DY3127, and DY3129. The resulting yeast strains, and a control strain expressing wild type NAB3 (DY3036 (11)), were assayed at an excitation wavelength of 488 nm on a LSRII (BD Bioscience) flow cytometer as described (11). The lithium acetate method of transformation was used throughout (13).

TABLE 1.

Yeast strains used in this study

Strain Genotype Reference
DY30229 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pRS316-nab3–11 (URA3)] 11
DY3034 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pRS315-NAB3 (LEU2)] 11
DY3036 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pREF-GFP kan (URA3)] [pRS315-NAB3FL (LEU2)] 11
DY3126 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pREF-GFP kan (URA3)] [pRS315-nab3-Q801A (LEU2)] This study
DY3127 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pREF-GFP kan (URA3)] [pRS315-nab3-L800A (LEU2)] This study
DY3128 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pRS315-nab3-NoQ (LEU2)] This study
DY3129 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pREF-GFP kan (URA3)] [pRS315-nab3-NoQ (LEU2)] This study
DY3132 MATα his3Δ1 leu2Δ0 (lys2, met15 status unknown) nab3::kanMX ura3Δ0 [pRS315-nab3Δ40 (LEU2)] This study
Protein Cross-linking

Cross-linking reactions were prepared as described by Whitson et al. (12). Briefly, purified recombinant proteins (0.5–1 μg/μl) were incubated for 30 min at 22 °C with freshly dissolved 5 mm (unless otherwise indicated) bis(sulfosuccinimidyl)suberate (BS3; Thermo Scientific) in 20 mm Tris, pH 7.5, 50 mm NaCl, 1 mm EDTA. Reactions were terminated by adjustment to 50 mm glycine. Proteins were precipitated with 10% (v/v) trichloroacetic acid and resolved on SDS gels composed of a resolving slab of 9.5% polyacrylamide layered upon an 11% gel to better separate monomeric from polymeric species. Where indicated, 16% polyacrylamide Tris-Tricine gels were used to resolve small proteins as described by Schagger (14). Gels were stained with fresh Coomassie Brilliant Blue (0.2% w/v).

Gel Filtration

Purified thioredoxin-Nab3 40-amino acid fusion protein (2 mg in 0.1–0.4 ml) was applied to a Superdex 200 10/300 column in 20 mm Tris, pH 7.5, 200 mm NaCl using an AKTA chromatography system at a flow rate of 0.5 ml/min, and 0.5-ml fractions were collected. The column was calibrated with bovine thyroglobulin (670 kDa), bovine γ-globulin (158 kDa), chicken ovalbumin (44 kDa), horse myoglobin (17 kDa), and vitamin B12 (1.4 kDa).

RESULTS

Using chemical cross-linking, a 27-residue α-helix from hnRNP-C was shown to self-assemble into a tetrameric bundle when appended to a heterologous protein (E. coli thioredoxin (12)). Because the carboxyl-terminal tail of Nab3 is structurally related to the hnRNP-C self-assembling helix (11), we employed this assay to test whether the cognate region of the yeast protein could similarly polymerize. A family of recombinant fusion proteins in which various portions of the Nab3 carboxyl terminus were fused to thioredoxin was expressed in bacteria and purified. The homobifunctional reagent BS3 was used to cross-link the purified proteins, and the resulting multimers were separated by SDS-PAGE. The 18-amino acid terminus of Nab3 (residues 785–802; Fig. 1, Nab3–18aa) was able to facilitate intermolecular cross-linking when attached to thioredoxin (Fig. 2A, Nab3–18). More effective cross-linking was observed when the carboxyl-terminal 40 amino acids of Nab3 (Fig. 1, Nab3–40aa) were added to the carboxyl terminus of thioredoxin (Fig. 2A, Nab3–40). This construct included a stretch of 16 glutamines immediately adjacent to the 18-amino acid tail that was identified as important by mutation (11). The pattern of higher order multimers resembled what was seen for the cross-linking of a thioredoxin fusion protein to which the human hnRNP-C assembly helix was attached (Fig. 2A, hnRNP-C-27), as described by Whitson et al. (12). These cross-linking data are evidence that the Nab3 tail is functionally similar to the hnRNP-C assembly domain and that it is likely to possess α-helical character. The similarity in disposition of hydrophobic and hydrophilic residues seen when the primary sequence of yeast Nab3 and human hnRNP-C were displayed in a helical wheel representation is also consistent with this interpretation (Fig. 1).

FIGURE 1.

FIGURE 1.

Sequence of the Nab3 carboxyl terminus. Alignment of the carboxyl-terminal 40 amino acids of S. cerevisiae Nab3 with the self-assembling region of human hnRNP-C is shown. The cognate species numbering is used, and only a portion of the native proteins are shown. Dots ( … ) signify that the polypeptide continues in that direction. The structural homology identified by the PHYRE algorithm is boxed. Derivatives of the Nab3 terminus previously identified by mutation are shown below the alignment and include the part of this region remaining in the tov1 and tov4 mutants. The peptides corresponding to these yeast sequences were appended to thioredoxin for analysis of self-association. Helical wheel representations for the primary sequences of yeast Nab3 and human hnRNP-C were generated using the Helical Wheel Applet.

FIGURE 2.

FIGURE 2.

Cross-linking of recombinant thioredoxin-Nab3 fusion proteins. A, purified proteins (20 μg each) representing thioredoxin (TRX) expressed from unaltered pET32a expression vector (polylinker), or thioredoxin fused to Nab3 carboxyl-terminal 18 or 40 amino acids, or 27 amino acids from hnRNP-C were incubated in the presence (+) or absence (−) of BS3 cross-linking reagent. Proteins were separated on a 9.5% SDS-polyacrylamide gel overlaying an 11% slab and were stained with Coomassie Blue. (Note: thioredoxin lacking added Nab3 amino acids contains a carboxyl-terminal extension encoded by the polylinker and hence is larger than the fusion protein containing the 18-amino acid Nab3 extension which is inserted further upstream in the polylinker.) B, cross-linking of recombinant fusion proteins containing 13 (tov4) or 21 (tov1) Nab3 amino acids fused to thioredoxin was run on an SDS-PAGE as in A.

The structure of hnRNP-C self-assembly domain and the helical wheel modeling of the related Nab3 region suggest that this protein-protein interaction results from a packing of hydrophobic faces of the helices into a bundle. This implies the interaction would be relatively insensitive to high ionic strength. To test this, we repeated the cross-linking assay in 0.5 m and 1 m NaCl. The thioredoxin-Nab3 40 amino acid fusion protein could be readily cross-linked at these high ionic strengths (data not shown).

A spontaneous mutation in Nab3 (tov1) changed the final glutamine in the 16-residue repeat to a stop codon, resulting in a Nab3 protein that lacks this residue and the carboxyl-terminal 18 amino acid tail (Fig. 1, Nab3-tov1) (11). To test whether this derivative of Nab3 could self-associate, we generated a thioredoxin fusion derivative containing only the 21 amino acids (763–783) of Nab3 on the amino-terminal side of the stop codon. In other words, we tested a fusion protein with the last 40 amino acids of Nab3 from which the 18-amino acid tail was removed, leaving 15 glutamines at the fusion protein carboxyl terminus. This protein also generated multimers (Fig. 2B, tov1). Similarly, we recapitulated a second tov mutant, tov4, which had a nonsense mutation within the glutamine repeat tract. The construct had a thioredoxin-Nab3 fusion polypeptide that extends only 7 residues into the glutamine repeat (residues 763–775; Fig. 1, Nab3-tov4). This derivative also assembled into higher order species as determined by chemical cross-linking (Fig. 2B, tov4). For the two tov mutant derivatives of Nab3, as well as the complete 40-amino acid tail, a small amount of the dimeric form was detectable in the absence of chemical cross-linking reagent (Fig. 2B, − lanes). More of this denaturation-resistant form was observed for the 40-amino acid piece of Nab3 versus the 13- or 21-amino acid segments representing tov mutants. We conclude that both the terminal 18 amino acids and the adjacent 16 glutamines of Nab3 contribute independently to the ability to self-associate. Together they provide a multimerization interface such as that seen for hnRNP-C.

To confirm that multimerization was due to the Nab3 portion of the fusion proteins, the 18- and 40-amino acid portions of Nab3 were removed from thioredoxin by digestion with thrombin at a site engineered into the expression plasmid. Free thioredoxin was separated from the cleaved Nab3 tails by metal affinity chromatography, and the small isolated tails were cross-linked with BS3 and analyzed by Tris-Tricine PAGE. Both the 18- and 40-amino acid pieces of Nab3 formed higher order polypeptides following cross-linking, with the larger 40-amino acid piece doing so more robustly (Fig. 3). Interestingly, multimers of the 40-amino acid piece were apparent in the absence of cross-linking reagent, similar to what has been seen for the 27-amino acid α-helix from human hnRNP-C and other polyglutamine-containing proteins (12, 15). This independently confirmed that the carboxyl-terminal region of Nab3 can self-associate and that the glutamine tract augments this property.

FIGURE 3.

FIGURE 3.

Cross-linking of Nab3 tail peptides. The Nab3 18- or 40-amino acid carboxyl-terminal peptide expressed in E. coli was cleaved from its thioredoxin tag by cleavage with thrombin. The released thioredoxin was removed by nickel affinity chromatography, and it (right) or the free tail Nab3 peptides (left) were cross-linked with the indicated concentration of BS3. Polypeptides were separated on a 16% polyacrylamide Tris-Tricine gel and stained.

As an additional test of the ability of the Nab3 tail to self- assemble, we subjected the thioredoxin-Nab3 40-amino acid fusion protein to gel filtration on Superdex 200. Two protein peaks were resolved in migration positions consistent with the sizes of monomer and tetramer based on the positions of reference size markers (data not shown). When the monomeric peak was pooled and gel-filtered a second time, it again was separable as two peaks, consistent with an equilibrium between monomeric and assembled versions of the fusion protein (Fig. 4).

FIGURE 4.

FIGURE 4.

Gel filtration of a thioredoxin-Nab3 fusion protein. Purified thioredoxin (TRX) fused to Nab3 carboxyl-terminal 40 amino acids was subjected to gel filtration as described under “Materials and Methods.” The protein-containing fractions (112) were concentrated by precipitation (400 μl each, except for monomer peak fractions 7–9 which were 12 μl each) and run on SDS-PAGE (11%) gel and stained with Coomassie Blue. The migration positions of reference proteins and their molecular masses are indicated above their cognate elution position. The peak positions of the monomeric and tetrameric forms of the fusion proteins are also indicated.

We next tested whether yeast could grow following removal of the carboxyl-terminal 40 amino acids of Nab3 that contains the glutamine stretch and the hnRNP-C homology region (Fig. 1). Through plasmid shuffling, we obtained a strain of yeast containing such an allele of Nab3 (nab3Δ40) on a plasmid covering a deletion of chromosomal NAB3. This strain was viable but grew slowly, as observed previously for the spontaneous mutant of Nab3 (tov1) that lacked its carboxyl-terminal 19 residues (Fig. 5A, nab3Δ40 and nab3Δ19). Similarly, we generated a strain containing mutated NAB3 in which only the codons for the 16-glutamine tract were deleted in the context of an otherwise wild type gene. This strain grew as well as wild type (Fig. 5A, nab3ΔQ(16)). A more extensive removal of the carboxyl-terminal 134 amino acids of Nab3 proved to be lethal (data not shown), indicating another important function for this protein exists slightly upstream in the protein.

FIGURE 5.

FIGURE 5.

Growth and in vivo termination activity of Nab3 tail mutants. A, growth of tail deletion mutants. Yeast lacking their chromosomal copy of NAB3 and containing a plasmid with either wild type NAB3 (DY3034), NAB3 lacking the 16-glutamine stretch (nab3ΔQ(16); DY3128), the nab3Δ19 deletion allele (DY2233), or the nab3Δ40 deletion allele (DY3132), were diluted to an A at 600 nm of 0.05, serially 10-fold diluted, and grown on SC leu medium at 30 °C. B, flow cytometry on yeast expressing GFP from a reporter plasmid. Yeast strains DY3129 (green), DY3127 (orange), DY3126 (blue), and DY3036 (red) were grown in SC leu ura medium and induced with galactose. 30,000 cells of each strain were subjected to flow cytometry as described under “Materials and Methods,” and the cell count was plotted versus the GFP intensity. The mean intensity is indicated next to the NAB3 status of each strain.

Loss of the 18-amino acid tail from Nab3 negatively impacts its termination activity (11). To test whether the glutamines adjacent to this domain were important for the biological function of Nab3, we returned to a flow cytometry assay previously used to evaluate termination efficiency in living cells (11). In this assay, the inducible GAL1 promoter is placed upstream of the green fluorescent protein (GFP) reading frame and introduced into yeast on a plasmid. Galactose-treated cells are strongly fluorescent unless the IMD2 terminator is inserted between the promoter and GFP. Mutations in the terminator sequence, or in genes encoding trans-acting factors involved in termination (e.g. NAB3, SSU72, or PCF11), restore fluorescence by abrogating terminator function (11).

We designed a plasmid-borne version of NAB3 deleted for the sequence encoding the 16 glutamines adjacent to its hnRNP-C-like α-helix. This plasmid was introduced into a yeast strain lacking chromosomal NAB3, thereby making it the sole source of this protein. To complete the strain, we also transformed cells with the GAL1 promoter-IMD2 terminator-GFP reporter plasmid. If the glutamine stretch is important for termination, cells should show increased fluorescence as gauged by flow cytometry compared with a strain with wild type NAB3 (11). Indeed, when grown on galactose, loss of the 16 glutamines resulted in readily detectable terminator readthrough (Fig. 5B, green) compared with cells expressing wild type Nab3 (red). (A perfectly functional terminator results in a population of cells that show a low level of autofluorescence with a peak centered at ≈102.)

Deletion of the last three amino acids of Nab3 compromised termination as scored by this flow cytometry technique (11). When the penultimate residue (Gln801; Fig. 1) was mutated to alanine (blue), the Nab3 termination function was compromised to an extent similar to the loss of the glutamine tract. When the residue before it (Leu800) was mutated to alanine, termination function was severely compromised resulting in GFP intensities as strong as that observed from loss of the carboxyl-terminal three amino acids ((11) Fig. 5B, orange). These data underscore the importance of the very carboxyl-terminal sequence of Nab3 and the polyglutamine stretch. The difference in severity between the alanine substitutions in residues 800 versus 801 may be related to structural modeling, which suggested that Leu800 ends an α-helix whereas Gln801 is not in that helix (11).

DISCUSSION

Mutations in Nab3 that impaired its termination function have focused attention on the extreme carboxyl terminus of the protein (11). This domain has structural homology with an α-helical region of hnRNP-C that was shown to enable this mammalian protein to tetramerize in its role as an RNA packaging protein (12). Adjacent to that is an unusually long stretch of glutamines. Here we show that this region of Nab3 is functionally similar to the cognate region of human hnRNP-C in that both self-assemble into tetramers. This strengthens the case that Nab3 is a yeast orthologue of mammalian hnRNP-C, a suggestion that initially arose due to homology between the RRM domains of the two proteins (16). A Nrd1 binding region near the Nab3 RRM has also been described (10), but other functions of this large yeast protein have remained mysterious. Study of full-length recombinant protein has proven difficult due to its unusual sequence composition which seems responsible for its extreme lability (data not shown (10)).

Although hnRNP-C1 and -C2 (293 and 306 residues, respectively) and Nab3 (802 residues) share similar RNA binding and tetramerization domains, the positioning of these motifs in the respective primary sequences differs between the proteins, and the surrounding sequences are not closely related. Thus, although the RRM and assembly modules are conserved between the yeast and human proteins, the overall protein structure and function are more divergent. Whereas both polypeptides serve to size-select newly made RNAs, they do so in opposite ways. hnRNP-C binds and measures relatively long RNAs for processing and export (17) whereas Nab3 preferentially facilitates termination of short RNAs in yeast (18). hnRNP-C is highly abundant (≈106/cell (19)), whereas the level of Nab3 is more modest (<6,000 copies/cell (20)). Genetic probing revealed that the self-assembly region of Nab3 is important for the biological function of the protein (11). The significance of the cognate region in hnRNP-C was shown biochemically, due in part to the ability to reconstitute RNA-protein particles with recombinant full-length hnRNP-C (21). hnRNP-C does not have a polyglutamine stretch and is toxic when expressed in S. cerevisiae (22).

Low complexity protein sequences, including homopolymeric runs of certain amino acids, are distributed nonrandomly in proteins. RNA-binding proteins are overrepresented in this group (23). The sequences tend to be involved in protein-protein interactions resulting in aggregation or assembly of dynamic polymers including hydrogels. This property has been suggested to be functionally important for the reversible formation of cytoplasmic granules that harbor RNA-binding proteins (23).

Nab3 contains numerous regions of low complexity sequence across its length, including 12 consecutive aspartates and 10 consecutive glutamates that are embedded in a stretch of 30 residues that are one or the other of these residues. Nab3 also has numerous uninterrupted glutamine runs including three stretches of 6 and one of 16. Our results suggest a role for the latter, which is the longest of the Nab3 homopolymeric tracts and which is immediately adjacent to the hnRNP-C self-assembly region important for transcription termination. This glutamine stretch augments the Nab3 protein-protein interaction, as has been seen elsewhere for polyglutamine tracts (9), and is consistent with suggestions that polyglutamine tracts form an extended helix when placed next to sequences with an α-helical character (9). Whether this is the case here will require more detailed structural analysis. Polyglutamine stretches are seen in human diseases in which the repeat expands by mutation. In general, the repeat expansions in humans are not sufficient for pathology. Instead, it seems that the gain of function in such mutant proteins results from enhancement of its native function rather than acquisition of a new role (6). This report provides some insight into a role for the natural function of a 16-glutamine stretch in a yeast protein whose structure is not well understood. We also learned there is a region neighboring the low complexity hnRNP-C-like multimerization tail that is essential for cell viability. Thus, there is another important domain in this unexplored region of Nab3. Whether its role is related to that of the extreme carboxyl-terminal tail remains to be elucidated.

These findings suggest an extension of current models positing that the Nab3/Nrd1 termination machinery forms a network with nascent RNA and the RNA polymerase II CTD (10, 24). Based upon the roles of the Nab3 glutamine tracts in both termination and protein-protein interaction, this nuclear complex could resemble the higher order assemblage seen in cytoplasmic RNA-binding proteins that contain low complexity primary amino acid sequences (23, 25). Self-association of Nab3 subunits provides an additional tether between the Nab3-Nrd1 heterodimers as they form an array on RNA substrates in which they recognize multiple consensus sequence motifs (10). The number and positioning of Nab3-Nrd1 heterodimers attracted to a single RNA could be governed, not only by the quantity of consensus sites on the RNA, but also through stabilization by the additional association of multiple Nrd1 subunits with the repeats in the CTD of RNA polymerase II (10). Reversible assembly may regulate the ability of the Nrd1-Nab3 complex to terminate transcription and guide substrate RNAs through their respective processing or degradative pathways. Determination of the structure and stoichiometry of such a hypothetical (Nab3-Nrd1)n-RNA complex likely depends upon the specific RNA substrate. Future analysis of these interactions will be aided by in vitro reconstitution from purified RNA and protein components.

Acknowledgments

We thank Dr. A. Krezel for samples of thioredoxin-hnRNP-C fusion protein, Drs. X. Cheng and G. Conn for helpful discussions, and Dr. G. Conn and his laboratory for access to their AKTA.

*

This work was supported, in whole or in part, by National Institutes of Health Grant GM46331.

2
The abbreviations used are:
hnRNP
heterogeneous nuclear ribonucleoprotein
BS3
bis(sulfosuccinimidyl)suberate
CTD
carboxyl-terminal domain
CUT
cryptic unstable transcripts
RRM
RNA recognition motif
Tricine
N-[2-hydroxy-1,1-bis(hydroxymethyl)ethyl]glycine.

REFERENCES

  • 1. Jenks M. H., O'Rourke T. W., Reines D. (2008) Properties of an intergenic terminator and start site switch that regulate IMD2 transcription in yeast. Mol. Cell. Biol. 28, 3883–3893 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Kuehner J. N., Pearson E. L., Moore C. (2011) Unravelling the means to an end: RNA polymerase II transcription termination. Nat. Rev. Mol. Cell Biol. 12, 283–294 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Steinmetz E. J., Brow D. A. (1996) Repression of gene expression by an exogenous sequence element acting in concert with a heterogeneous nuclear ribonucleoprotein-like protein, Nrd1, and the putative helicase Sen1. Mol. Cell. Biol. 16, 6993–7003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Steinmetz E. J., Brow D. A. (1998) Control of pre-mRNA accumulation by the essential yeast protein Nrd1 requires high-affinity transcript binding and a domain implicated in RNA polymerase II association. Proc. Natl. Acad. Sci. U.S.A. 95, 6699–6704 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Yuryev A., Patturajan M., Litingtung Y., Joshi R. V., Gentile C., Gebara M., Corden J. L. (1996) The C-terminal domain of the largest subunit of RNA polymerase II interacts with a novel set of serine/arginine-rich proteins. Proc. Natl. Acad. Sci. U.S.A. 93, 6975–6980 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Orr H. T. (2012) Polyglutamine neurodegeneration: expanded glutamines enhance native functions. Curr. Opin. Genet. Dev. 22, 251–255 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Wetzel R. (2012) Physical chemistry of polyglutamine: intriguing tales of a monotonous sequence. J. Mol. Biol. 421, 466–490 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Bhattacharyya A., Thakur A. K., Chellgren V. M., Thiagarajan G., Williams A. D., Chellgren B. W., Creamer T. P., Wetzel R. (2006) Oligoproline effects on polyglutamine conformation and aggregation. J. Mol. Biol. 355, 524–535 [DOI] [PubMed] [Google Scholar]
  • 9. Schaefer M. H., Wanker E. E., Andrade-Navarro M. A. (2012) Evolution and function of CAG/polyglutamine repeats in protein-protein interaction networks. Nucleic Acids Res. 40, 4273–4287 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Carroll K. L., Ghirlando R., Ames J. M., Corden J. L. (2007) Interaction of yeast RNA-binding proteins Nrd1 and Nab3 with RNA polymerase II terminator elements. RNA 13, 361–373 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Loya T. J., O'Rourke T. W., Reines D. (2012) A genetic screen for terminator function in yeast identifies a role for a new functional domain in termination factor Nab3. Nucleic Acids Res. 40, 7476–7491 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Whitson S. R., LeStourgeon W. M., Krezel A. M. (2005) Solution structure of the symmetric coiled coil tetramer formed by the oligomerization domain of hnRNP-C: implications for biological function. J. Mol. Biol. 350, 319–337 [DOI] [PubMed] [Google Scholar]
  • 13. Shaw R. J., Wilson J. L., Smith K. T., Reines D. (2001) Regulation of an IMP dehydrogenase gene and its overexpression in drug-sensitive transcription elongation mutants of yeast. J. Biol. Chem. 276, 32905–32916 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Schägger H. (2006) Tricine-SDS-PAGE. Nat. Protoc. 1, 16–22 [DOI] [PubMed] [Google Scholar]
  • 15. Alberti S., Halfmann R., King O., Kapila A., Lindquist S. (2009) A systematic survey identifies prions and illuminates sequence features of prionogenic proteins. Cell 137, 146–158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Wilson S. M., Datar K. V., Paddy M. R., Swedlow J. R., Swanson M. S. (1994) Characterization of nuclear polyadenylated RNA-binding proteins in Saccharomyces cerevisiae. J. Cell Biol. 127, 1173–1184 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. McCloskey A., Taniguchi I., Shinmyozu K., Ohno M. (2012) hnRNP-C tetramer measures RNA length to classify RNA polymerase II transcripts for export. Science 335, 1643–1646 [DOI] [PubMed] [Google Scholar]
  • 18. Lykke-Andersen S., Jensen T. H. (2007) Overlapping pathways dictate termination of RNA polymerase II transcription. Biochimie 89, 1177–1182 [DOI] [PubMed] [Google Scholar]
  • 19. Beck M., Schmidt A., Malmstroem J., Claassen M., Ori A., Szymborska A., Herzog F., Rinner O., Ellenberg J., Aebersold R. (2011) The quantitative proteome of a human cell line. Mol. Syst. Biol. 7, 549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Ghaemmaghami S., Huh W. K., Bower K., Howson R. W., Belle A., Dephoure N., O'Shea E. K., Weissman J. S. (2003) Global analysis of protein expression in yeast. Nature 425, 737–741 [DOI] [PubMed] [Google Scholar]
  • 21. McAfee J. G., Soltaninassab S. R., Lindsay M. E., LeStourgeon W. M. (1996) Proteins C1 and C2 of heterogeneous nuclear ribonucleoprotein complexes bind RNA in a highly cooperative fashion: support for their contiguous deposition on pre-mRNA during transcription. Biochemistry 35, 1212–1222 [DOI] [PubMed] [Google Scholar]
  • 22. Tan J. H., Kajiwara Y., Shahied L., Li J., McAfee J. G., LeStourgeon W. M. (2001) The bZIP-like motif of hnRNP-C directs the nuclear accumulation of pre-mRNA and lethality in yeast. J. Mol. Biol. 305, 829–838 [DOI] [PubMed] [Google Scholar]
  • 23. Kato M., Han T. W., Xie S., Shi K., Du X., Wu L. C., Mirzaei H., Goldsmith E. J., Longgood J., Pei J., Grishin N. V., Frantz D. E., Schneider J. W., Chen S., Li L., Sawaya M. R., Eisenberg D., Tycko R., McKnight S. L. (2012) Cell-free formation of RNA granules: low complexity sequence domains form dynamic fibers within hydrogels. Cell 149, 753–767 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Vasiljeva L., Kim M., Mutschler H., Buratowski S., Meinhart A. (2008) The Nrd1-Nab3-Sen1 termination complex interacts with the Ser5-phosphorylated RNA polymerase II C-terminal domain. Nat. Struct. Mol. Biol. 15, 795–804 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Han T. W., Kato M., Xie S., Wu L. C., Mirzaei H., Pei J., Chen M., Xie Y., Allen J., Xiao G., McKnight S. L. (2012) Cell-free formation of RNA granules: bound RNAs identify features and components of cellular assemblies. Cell 149, 768–779 [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES