Abstract
The characterization of transcription factor complexes and their binding sites in the genome by affinity purification has yielded tremendous new insights into how genes are regulated. The affinity purification requires either the use of antibodies raised against the factor of interest itself or by high-affinity binding of a C- or N-terminally added tag sequence to the factor. Unfortunately, fusing extra amino acids to the termini of a factor can interfere with its biological function or the tag may be inaccessible inside the protein. Here, we describe an effective solution to that problem by integrating the ‘tag’ close to the nuclear localization sequence domain of the factor. We demonstrate the effectiveness of this approach with the transcription factors Fli-1 and Irf2bp2, which cannot be tagged at their extremities without loss of function. This resulted in the identification of novel proteins partners and a new hypothesis on the contribution of Fli-1 to hematopoiesis.
INTRODUCTION
Transcription factors (TFs) regulate gene expression through their recruitment to gene regulatory sequences (1). They often function as protein complexes cooperating with other TFs or cofactors to regulate many biological processes, such as cellular proliferation and differentiation. For example, protein complexes containing the Ldb1 TF have been shown to control erythroid differentiation by regulating the expression of key erythroid-specific genes (2).
Much of our current knowledge of the molecular mechanisms TF use to regulate gene expression comes from the identification of their genomic binding sites by chromatin immunoprecipitation (ChIP) experiments and the identification of their protein partners by pull-down assays usually followed by mass spectrometry (MS) analysis to determine the identity of the co-precipitated factors. These approaches rely on the efficient and specific purification of the proteins and DNA bound by the factor of interest using antibodies. The availability of high-affinity antibodies against particular TFs is, therefore, critical for experimental success. These experiments are usually single-step purifications and/or are performed on low number of cells. The antibodies should therefore be efficient and very specific to obtain a high signal-to-noise ratio to allow the identification of true DNA/protein or protein/protein interactions. However, suitable antibodies are often not available at all or perform suboptimally. A popular alternative to antibodies is therefore the generation of a fusion between a small epitope ‘tag’ sequence and the protein of interest because purification strategies for these are readily available. These short peptide sequences, which are either recognized by high-affinity antibodies or by streptavidin (biotag), have been widely used alone or in combination to characterize TF complexes and genome-wide binding sites (3–5). The peptide tag is fused to either the N-terminal or to the C-terminal end of the protein, however, the addition of extra amino acids to one or both termini can disrupt protein function and/or its stability, as exemplified by the Myef2 protein (6). Because most proteins are modular in structure, an alternative strategy to circumvent problems with terminal tagging would be to integrate the tag sequence next to a domain within the protein (7,8). Several constraints need to be respected for this approach. Most importantly, the tag should not be integrated in a functional domain of the protein, which is often not well defined. Moreover, the tag should be positioned in a region of the protein that is expected to be highly exposed to the cellular milieu in order to promote recognition by antibodies or by the BirA enzyme. Again, such information is usually not available. We therefore thought of using a domain that is almost ubiquitously present and accessible in TFs, namely, the nuclear localization signal (NLS).TFs contain a NLS recognized by the importin α/importin β heterodimers that transport the protein from the cytoplasm through the nuclear pore into the nucleus (9). This domain will be exposed in all cells where the TF is active, although it can be regulated by post-translational modifications (e.g. phosphorylation) or by NLS masking. A well-studied example of the latter is the control of NF-κB nuclear import that is regulated by its interaction with IκB, which masks the NF-κB NLS to prevent its nuclear import (10). Together with structural studies of the FUS NLS (11), the data indicate that the NLS forms an exposed site on the protein that can be recognized by the importin complex.
Here, we address the possibility to make use of the exposed NLS for tagging purposes by integrating a tag sequence close to the NLS as an alternative for the classical C-/N-terminal approach and used two ‘difficult’ proteins, Fli-1 and Irf2bp2, to test this strategy. A 3×Flag-biotin peptide was integrated close to the NLS of these TFs, whose C-/N-terminal tagging disrupt their function (data not shown). Their expression in an erythroid progenitor cell line (which also expresses these protein endogenously) showed that their function is unaffected. We then used the NLS-tagged Fli-1 protein to identify its protein partners by MS analysis in erythroid cells for the first time and found novel protein partners belonging to the key erythroid Ldb1 TF complex.
MATERIALS AND METHODS
Plasmid constructs
Expression vectors for 3×Flag-Bio(NLS)-Fli-1 and 3×Flag-Bio(NLS)-Irf2bp2 were obtained by stepwise insertion of Fli-1 and Irf2bp2 cDNA parts into a modified pBud plasmid containing the 3×Flag sequence. First, the N-terminal coding part up to the NLS sequence was inserted followed by the insertion of the C-terminal coding part containing the bio-tag. The different coding parts have been obtained by polymerase chain reaction (PCR) amplification from a MEL cells cDNA library using the primers listed in Supplementary Table S2. Constructs were verified by Sanger sequencing before transfection in MEL/BirA cells (12). Clones were selected using neomycin at 1 mg/ml. Only clones expressing the same level as the endogenous protein were used for analysis.
Cell culture
MEL cell lines were maintained in Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum and penicillin/streptomycin. Cells were induced by culturing them in presence of 2% of di-methyl sulphoxide (DMSO) for 4 days.
Nuclear extracts
Cells were washed twice with 1× phosphate buffered saline (PBS) and resuspended in 8 ml of cold buffer A (10 mM HEPES-KOH pH 7.9, 1.5 mM MgCl2, 10 mM KCl, 0.5 mM DTT, 1× Complete ethylenediaminetetraacetic acid (EDTA)-free protease inhibitor mix (Roche)). After 10 min of incubation in ice, lysates were vortexed and spun down 10 s at 13 200 revolutions per minute (rpm) at 4°C. Pellets were resuspended in 1 volume of buffer C (20 mM HEPES-KOH pH 7.9, 25% glycerol, 420 mM KCl, 1.5 mM MgCl2, 0.2 mM EDTA, 0.5 mM DTT, 1× Complete EDTA-free protease inhibitor mix (Roche)) and incubated 20 min in ice. Lysates were vortexed and pelleted 2 min at 13 200 rpm at 4°C. The supernatant was used for further experiments.
Co-immunoprecipitation
Co-immunoprecipitation experiments were performed starting from 1 mg (2 mg for MS) of nuclear extracts prepared as mentioned above and diluted to 150 mM KCl with Heng0 buffer (20 mM HEPES-KOH pH 7.9, 20% glycerol, 0.25 mM EDTA pH 8.0, 0.05% NP-40, 1× Complete EDTA-free protease inhibitor mix (Roche)). Note that 10 μl of magnetic beads (Dynabeads M-280, Life Science) (blocked during 1 h in presence of 0.2 μg/μl Chicken Egg Albumin) coated with streptavidin for 1 mg of proteins are added to the nuclear extracts together with 100 U benzonase (Roche). After 3 h of incubation at room temperature, the beads were washed five times with Heng150 (20 mM HEPES-KOH pH 7.9, 20% glycerol, 0.25 mM EDTA pH 8.0, 0.2% NP-40, 150 mM KCl, 1× Complete EDTA-free protease inhibitor mix (Roche)) for 5 min at room temperature and directly resuspended in 2× Laemmli buffer (120 mM Tris-HCl, pH 6.8, 4% sodium dodecyl sulphate (SDS), 20% (w/v) glycerol, 0.01% bromophenol blue, 400 mM DTT). Finally, the beads were boiled 5 min and discarded. Co-immunoprecipitation experiments using the anti-Flag antibody (M2, Sigma-Aldrich) were performed as mentioned above except that the nuclear extracts were incubated with 10 μg of antibody and 10 μl of magnetic beads coated with protein G for 4 h at room temperature.
Western blot
Nuclear extracts or immunoprecipitated proteins were boiled in Laemmli buffer and loaded onto NuPAGE precast 4–12% gradient Bis-Tris acrylamide gels (Invitrogen). Proteins were transferred to nitrocellulose membranes and probed for the protein of interest using the antibodies mentioned below. Fluorescently labeled secondary antibodies (Licor) were used for visualization and membranes were scanned on an Odyssey Imaging System.
The following primary antibodies were used : Rabbit anti-Fli-1 (Abcam, ab-15289), Rat anti-Gata-1 (Santa Cruz Biotechnology, N6), Rabbit anti-Irf2bp2 (Absea), Rabbit anti-Klf1 (5-V) (kindly provided by Dr. Sjaak Philipsen), Mouse anti-Vcp (Abcam).
Quantitative reverse transcriptase-PCR (RT-PCR)
Total RNA was extracted from the different cell lines using TRI Reagent (Sigma). First-strand cDNA was synthesized using the SuperScript II First Strand Synthesis System (Invitrogen) and oligo-dT primers (Invitrogen). Real-time PCR was performed using Platinium Taq Polymerase and SYBR green (Invitrogen) cDNA on a Bio-Rad CFX96 PCR System. Ribonuclease/angiogenin 1 (Rnh1) was amplified in parallel for normalization purposes. Primer sequences are listed in Supplementary Table S3.
ChIP followed by real-time PCR
Note that 1 × 107 cells were cross-linked 50 min at room temperature using DSG (Di(N-succinimidyl) glutarate) 50 mM (Proteochem), washed twice with 1× PBS followed by an additional cross-linking step of 10 min using 1% formaldehyde (Merck). The cross-linking reaction was stopped by adding 0.125 M Glycine. Cells were washed twice with 1× cold PBS and resuspended in sonication buffer (Tris-HCl 10 mM pH8.0, EDTA 1 mM, EGTA 0.5 mM). After 10 min of incubation in ice, chromatin was sonicated for 33 cycles (15 s ON, 30 s OFF Amp 9) using the Soniprep 150 (Beun de Ronde) to get fragments of 500–800 bp. The sonicated chromatin from 106 cells was then diluted 20 times in ChIP dilution buffer (0.01% SDS, 1.1% Triton X-100, 1.2 mM EDTA, 16.7 mM Tris-HCl pH 8.0, 167 mM NaCl) and incubated overnight in presence of 10 μl of magnetic beads (Dynabeads M-280, Life Science) (blocked for 1 h with 1.5% fish skin gelatin and 200 ng sonicated salmon sperm DNA). The beads were then washed once with Low Salt Buffer (0.1% SDS, 1% Triton X-100, 2 mM EDTA, 20 mM Tris-HCl pH 8.0, 150 mM NaCl), once with High Salt Buffer (0.1% SDS, 1% Triton X-100, 2 mM EDTA, 20 mM Tris-HCl pH 8.0, 500 mM NaCl), once with LiCl Buffer (0.25 M LiCl, 1% NP-40, 1% deoxycholixacid sodium salt, 1 mM EDTA, 10 mM Tris-HCl pH 8.0) and twice with TE buffer (10 mM Tris-HCl pH 8.0, 1 mM EDTA). The beads were then decrosslinked in presence of Elution Buffer (1% SDS, 0.1 M NaHCO3) during 4 h at 65°C and treated for 1 h with 20 μg proteinase K. Finally, DNA was purified.
ChIP using the anti-Fli-1 (Santa Cruz Biotechnology, sc-356) was performed using the same protocol as described above except for the following differences. The diluted sonicated chromatin was incubated for 30 min with agarose beads coated with protein A and G (Santa Cruz Biotechnology). After spinning 1 min at 1000 rpm at 4°C, the chromatin of 106 cells was incubated overnight at 4°C with either 10 μg of anti-Fli-1 antibody or the same amount of Rabbit immunoglobulin G (IgG; Santa Cruz Biotechnology). Agarose beads coated with protein A and G were then added and the samples were incubated for 1 h at 4°C on a rotating wheel.
Immunofluorescence
Cells were sediment for 15 min on a poly-L-lysine coated glass slide for 15 min at room temperature and fixed for 15 min with 2% PBS/paraformaldehyde at room temperature. Cells were then quickly washed three times with PBS/0.1% Triton-X100 and twice with the same buffer for 10 min. After an extra quick wash with PBS/0.5% bovine serum albumin (BSA)/0.15% Glycine, cells were incubated for 2 h at room temperature with streptavidin conjugated with Alexa Fluor 488 diluted 250 times with PBS/0.5% BSA/0.15% Glycine. Cells were quickly washed three times with PBS/0.1% Triton-X100, washed for 10 min with same buffer and finally quickly washed with 1× PBS. A total of 10 μl prolong GOLD were added to the cells and a coverslip was mounted. After 24 h overnight at room temperature, slides were visualized in the confocal microscope Lyca SP5.
Mass spectrometry
The beads containing bound TF complexes (see Co-immunoprecipitation section) were washed twice with 50 mM NH4HCO3 and incubated overnight at 37°C with shaking with 0.1 μg of trypsin/50 μl beads. The digests are analyzed by nanoflow liquid chromatography-tandem MS on a LTQ-Orbitrap (Thermo) mass spectrometer coupled to an 1100 series LC pump and autosampler (Agilent) operating in positive mode and equipped with a nanospray source. Peptide mixtures were trapped on a ReproSil C18 reversed phase column (Dr. Maisch GmbH; column dimensions 1.5 cm _ 100 μm, packed in-house) with a flow rate of 8 l/min. Peptide separation was performed using a ReproSil C18 reversed phase column (Dr. Maisch GmbH; column dimensions 15 cm _ 50 μm, packed in-house) using a linear gradient from 0% to 80% B (A = 0.1 M formic acid; B = 80% (v/v) acetonitrile, 0.1 M formic acid) over 70 min with a constant flow rate of 200 nl/min using a splitter. The column eluent was directly sprayed into the electron spray ionization (ESI) source of the mass spectrometer (13). Mass spectra were acquired in continuum mode; while fragmentation of the peptides was performed in data-dependent mode. Peak lists were automatically created from raw data files using the Mascot Distiller software (version 2.1; MatrixScience). The Mascot search algorithm (version 2.2, MatrixScience) was used for searching against the NCBInr database (latest NCBInr release; taxonomy: Mus musculus). The peptide tolerance was typically set to 10 ppm and the fragment ion tolerance to 0.8 Da. A maximum number of two missed cleavages by trypsin were allowed and carbamidomethylated cysteine and oxidized methionine were set as fixed and variable modifications, respectively. The Mascot score cut-off value for a positive protein hit was set to 80 (5).
RESULTS
Generation of MEL cells expressing NLS-tagged Fli-1 and Irf2bp2 proteins
Fli-1 is expressed as a 51 and 48 kDa protein isoform that belongs to ETS family of TFs. Like all ETS TFs, it contains an ETS DNA binding domain of around 85 amino acids involved in the recognition of a GGAA core consensus sequence (14). Fli-1 contains two NLSs: the first NLS is located between amino acids 62 and 126 while the second is localized between amino acids 277 and 360 that correspond to the ETS domain (Figure 1a) (15). We therefore decided to integrate the popular triple FLAG (3×FLAG) and a biotin tag (biotag) between amino acids 60 and 61 in a region that does not contain any known domain critical for Fli-1 function (Figure 1a).
Irf2bp2 is a 60 kDa nuclear protein found to function as a co-repressor of the IRF2 TF (16). The Irf2bp2 NLS has been mapped between amino acids 333 and 422. Teng et al. showed that nuclear localization of Irf2bp2 is dependent on the phosphorylation of the serine 360 residue (17). We therefore decided to integrate the 3×FLAG and a biotag between amino acids 331 and 332, which do not belong to any important domain of the Irf2bp2 protein (Figure 1b).
Fli-1 and the Irf2bp2 cDNA containing the tag in the aforementioned positions were cloned in a modified pBud vector that allows cDNA expression driven by a EF1α housekeeping gene promoter. These constructs were transfected into MEL cells that express the BirA biotin ligase (MEL/BirA) at level sufficient to biotinylate TFs carrying a biotag (2,12). Stable clones expressing the NLS-tagged proteins were selected based on their expression levels. Total protein extracts were analyzed by western blot using either an anti-Fli-1 antibody (Figure 1c, top panel), an anti-Irf2bp2 antibody (Figure 1d, top panel) or directly using streptavidin (Figure 1c and d, bottom panel).
Several clones expressing 3×Flag-Bio(NLS)-Fli-1 or 3×Flag-Bio(NLS)-Irf2bp2 were obtained (data not shown). Figure 1c and d show that MEL/BioFli-1 and MEL/BioIrf2bp2 cells express the NLS-tagged proteins at similar levels as the endogenous protein. Moreover, streptavidin western blot revealed the presence of specific bands at the expected size of 3×Flag-Bio(NLS)-Fli-1 or 3×Flag-Bio(NLS)-Irf2bp2 in these stable cell lines (Figure 1c and d, bottom panel).
These data show that the NLS-tagged factors can be expressed at the same level as the endogenous proteins. Moreover, tag integration near the NLS allows recognition and biotinylation of the proteins by BirA biotin ligase.
NLS-tagged Fli-1 and Irf2bp2 are properly localized in the nucleus
The integration of the tag sequences close to the NLS of Fli-1 or Irf2bp2 proteins may disrupt their nuclear localization. Hence, we first performed immunofluorescence assays using streptavidin conjugated with Alexa-Fluor 488 and Dapi in MEL/BirA, MEL/BioFli-1 and MEL/BioIrf2bp2 cells.
While no signal was observed in the nucleus using Streptavidin in MEL/BirA control cells (Figure 2b and c), Figure 2e and f show a clear signal, which merged with DAPI in MEL/BioFli-1 cells indicating that the NLS-tagged Fli-1 protein is, as expected, localized in the nucleus. Similarly, the NLS-tagged Irf2bp2 protein is also localized in the nucleus (Figure 2h and i).
To determine if the NLS-tagged proteins have a similar nuclear localization as the endogenous protein, we compare the ratio between the exogenous and the endogenous protein in total protein and nuclear protein extracts of MEL/BioFli-1 and MEL/BioIrf2bp2 cells. Western blot analyses shown in Figure 2j (compare lanes 2 and 4) and k (compare lanes 2 and 4) indicate that the ratio between the NLS-tagged proteins and the endogenous protein is similar in total protein extracts compared to nuclear extracts showing that the NLS-tagged proteins have a similar nuclear localization as the endogenous protein.
Altogether, these results indicate that integration of a 3×Flag and a biotag sequence close to the NLS of Fli-1 and Irf2bp2 proteins does not disrupt their nuclear localization.
NLS-tagged Fli-1 and Irf2bp2 interact with known protein partners of the endogenous protein
Even though NLS-tagged proteins are properly localized in the nucleus, integration of a short peptide sequence within the Fli-1 and Irf2bp2 proteins could possibly affect their interaction with other proteins. To test this possibility, we investigated whether 3×Flag-Bio(NLS)-Fli-1 or 3×Flag-Bio(NLS)-Irf2bp2 are able to interact with known protein partners of the endogenous proteins.
We first tested whether the NLS-tagged Fli-1 protein interacts with Gata-1 and Klf1, two known Fli-1 protein partners identified in MEL cells and megakaryocytes (18,19). Streptavidin-IPs were performed in MEL/BirA control cells, which only express the BirA biotin ligase and in MEL/BioFli-1, which express the NLS-tagged Fli-1 protein. The absence of the NLS-tagged Fli-1 protein in the unbound fraction shows that more than 90% of the NLS-tagged Fli-1 protein is biotinylated (Figure 3a, compare lanes 2 and 4). Besides, while the streptavidin-IP in MEL/BirA cells does not pull-down Gata-1 or Klf1 (lane 3), the same experiment in MEL/BioFli-1 cells precipitates Gata-1 (line 4, middle panel) and Klf1 (lane 4, bottom panel) together with 3×Flag-Bio(NLS)-Fli-1 (line 4, top panel). The same experiments using the antibody against the 3×Flag tag in MEL/BirA and MEL/BioFli-1 cells show that Gata-1 and Klf1 are also specifically precipitated in MEL/BioFli-1 cells (Figure 3b). These results show that, like the endogenous Fli-1 protein, NLS-tagged Fli-1 interacts with Gata-1 and Klf1 despite the presence of the internal tag sequences.
As mentioned before, Irf2bp2 was first discovered as a protein partner and corepressor of the IRF2 TF (16). To check if the NLS-tagged Irf2bp2 also interacts with Irf2, similar experiments as above were performed in MEL/BirA and MEL/BioIrf2bp2 cells. The results shown in Figure 3c and d indicate that NLS-tagged Irf2bp2 indeed interacts with Irf2 in MEL cells.
Taken together, these results show that the integration of short sequences close to the Fli-1 and Irf2bp2 NLS does not disrupt their interaction with other proteins.
NLS-tagged Fli-1 and Irf2bp2 are recruited to known binding regions of the endogenous Fli-1 and Irf2bp2 proteins
TFs modulate gene expression through their binding to gene regulatory regions. We therefore also investigated whether the integration of a tag close to the NLS does not disrupt the DNA binding pattern of NLS-tagged Fli-1 and Irf2bp2 by performing ChIP experiments using streptavidin-conjugated beads (BioChIP).
Some target regions of Fli-1 have been reported in MEL cells using the anti-Fli-1 antibody. Fli-1 was shown to be recruited to its own promoter (20,21), to the nip7 gene promoter, which encodes a protein involved in ribosome biogenesis (22). We confirmed the recruitment of Fli-1 to these specific regions and also to the tgfb1 locus by performing ChIP experiments using the anti-Fli-1 antibody in MEL cells. Although the efficiency of the ChIP is variable between the biological triplicates, we observed a higher enrichment with the anti-Fli-1 antibody compared to the Rabbit IgG for all these three regions (Figure 4a). We then tested whether 3×Flag-Bio(NLS)-Fli-1 could also be recruited to these loci in MEL/BioFli-1 cells. BioChIP experiments performed on these cells and MEL/BirA control cells showed a specific enrichment in these three loci only in MEL/BioFli-1 cells, with no enrichment observed in the β-amylase control locus (Figure 4b). Besides, the efficiency of the BioChIP is more reproducible than the ChIP with the anti-Fli-1 antibody. These results indicate that NLS-tagged Fli-1 is also recruited to the fli-1 and nip7 promoters as well as the tgfb1 locus in MEL cells.
We recently determined the genome-wide Irf2bp2 binding sites by ChIP-Seq in MEL cells using an antibody raised against the endogenous protein (Soler, in preparation). Interestingly, Irf2bp2 is also recruited to the Fli-1 occupied loci mentioned above and also binds to two known enhancers of Myb (23) (Figure 4c). Similar to Fli-1, we performed BioChIP experiments in MEL/BioIrf2bp2 and MEL/BirA control cells to verify whether the NLS-tagged Irf2bp2 protein is also recruited to these specific loci. Figure 4d indeed shows the recruitment of NLS-tagged Irf2bp2 to five known binding regions of the endogenous Irf2bp2 protein.
Altogether, these results show that the presence of a short tag sequence close to the Fli-1 and Irf2bp2 NLS does not affect their recruitment to gene regulatory regions.
NLS-tagged Fli-1 inhibits MEL cells differentiation
Fli-1 overexpression and depletion experiments performed in mouse and human cells demonstrated that Fli-1 is a repressor of erythroid differentiation (21,22,24). Specifically in MEL cells, the overexpression of Fli-1 represses erythroid differentiation of these cells (21). To further validate the functionality of the NLS-tagged Fli-1 protein, we examined the potential of 3×Flag-Bio(NLS)-Fli-1 to repress the DMSO-induced erythroid differentiation of MEL cells.
As observed in Figure 5a, the 4-day DMSO treatment does not affect the expression of the NLS-tagged Fli-1. Control cells obtained a red color, a well-known sign of hemoglobin synthesis and erythroid differentiation (Figure 5b, left tube). In contrast, MEL/BioFli-1 cells remain white after the same treatment indicating that a functional 3×Flag-Bio(NLS)-Fli-1 is able to inhibit erythroid differentiation.
To confirm these observations, we measured the induction of β-globin and Alas2 expression, two established markers for terminal erythroid differentiation, by RT-qPCR experiments. In contrast to MEL BirA control cells, MEL/BioFli-1 cells show either an impaired (Alas2) or no (β-globin) induction of erythroid gene expression upon treatment while both cell type show a decrease of Sfpi-1 gene expression, an early effect of DMSO in MEL cells (25) (Figure 5c).
Altogether, these results demonstrate that NLS-tagged Fli-1, similar to the endogenous Fli-1 protein, represses the erythroid differentiation of MEL cells despite the integration of a short tag close to its NLS.
Fli-1 interacts with several proteins belonging to the Ldb1 complex
The previous experiments described above show that the integration of a short epitope tag sequence next to the NLS of Fli-1 and Irf2bp2 does not affect their function. To take advantage of the epitope tag for protein purification, we used the MEL/BioFli-1 cells to purify Fli-1 protein partners in MEL cells by performing immunoprecipitation experiments using streptavidin-conjugated beads followed by MS in both MEL/BirA control cells and MEL/BioFli-1 cells.
The full list of the identified proteins interacting with Fli-1 is shown in Supplementary Table S1. According to the MS results, Fli-1 interacts with 99 proteins. Among these proteins, some have been already identified by other groups in different cellular model system. For example, we found among the 99 interacting proteins, the ETS TF ETV6 (also called TEL) whose interaction with Fli-1 has been identified in human K562 cells (26). Moreover, we also found Fli-1 to interact with the Run×1 TF, an interaction found to be important for megakaryopoiesis (27) (Figure 6a). These results further validate the functionality of the NLS-tagged Fli-1 as a valuable tool to study TFs functions.
Strikingly, among the other proteins we found interacting with Fli-1, several members of the Ldb1 TF complex were detected (Figure 6b). Among these proteins, we found the core complex members Gata-1 (whose interaction was confirmed by streptavidin-IP experiments (Figure 3a), Tal1/Scl, Ldb1 itself, Tcf3/E2A and several Ssbp proteins). All Ldb1 complex members were found interacting specifically in 3×Flag-Bio(NLS)-Fli-1 cells (although Gata-1 was also found in the MEL/BirA purification, the Mascott score was less than 2-fold lower as compared to 3×Flag-Bio(NLS)-Fli-1 cells). We next performed new Flag-IP experiments in MEL/BirA and MEL/BioFli-1 cells to confirm the interaction between Fli-1 and Ldb1. As shown in Figure 6c, Ldb1 is pulled-down together with NLS-tagged Fli-1 while no signal corresponding to the Ldb1 protein was detected in MEL/BirA control cells validating our MS data. We also performed co-immunoprecipitation experiments using MEL cells expressing a bio-tagged Ldb1 (12) to check whether BioLdb1 interacts with Fli-1. As shown in Figure 6d, Fli-1 is precipitated more with bio-Ldb1 than in MEL/BirA control cells while similar amount of proteins were used in both cells.
In summary, these data indicate that the NLS-tagged Fli-1 protein is a functional protein that can be efficiently purified, allowing us to identify for the first time the critical erythroid Ldb1 TF complex as a Fli-1 protein interacting partner.
DISCUSSION
Here, we describe a focused strategy of epitope tagging by integrating the sequence close to the NLS (‘NLS-tagging’) of two very different TFs that could not be effectively tagged at their N- or C-terminal end (data not shown). We have shown that the NLS-tagged proteins still interact with known protein partners, that they are still recruited to regulatory sequences bound by the endogenous proteins and that the NLS-tagged Fli-1, like the endogenous protein, is still able to repress the erythroid differentiation of MEL cells. Therefore, we propose NLS-tagging as an effective and fast alternative strategy to the common N- or C-terminal tag integration methods for the characterization of protein complexes formed by TFs and the identification of their DNA binding regions.
The NLS-tagging strategy allowed us to show that Fli-1 interacts with several members of the Ldb1 complex. This complex is mainly known for its positive regulation of erythropoiesis by being recruited to regulatory sequences of erythroid-specific genes (2). This TF complex is also formed in megakaryocytes where it regulates the expression of megakaryocytic genes and the megakaryocyte differentiation (28). These data highlight similar and opposite contributions of Fli-1 and the Ldb1 complex during megakaryopoiesis and erythropoiesis, respectively, resulting in a new hypothesis about the Fli-1 contribution during hematopoiesis. Altogether, these data show that our NLS-tagging strategy can greatly facilitate the elucidation of TF function during developmental processes, exemplified here by new insights in the contribution of the Fli-1 protein to hematopoiesis.
Epitope tag exposure is critical to efficient recognition of the tag by antibodies (or streptavidin in the case of the bio-tag). We chose close proximity to the NLS sequence as a suitable strategy here as the NLS sequence is an often highly exposed domain of the protein. Although we tested this on only two proteins, we expect this to work with many if not all TFs. As a consequence, however, the strategy we propose can only be readily used for TFs or other nuclear proteins with a known NLS, although the NLS can often be recognized from the amino acid sequence. There are several such sequences, the classical sequence PKKKRKV (29) for which the consensus K-K/R-X-K/R was proposed or the bipartite-type sequence KR[PAATKKAGQA]KKKK (30). Other sequences, such as the PY-NLS motif (proline-tyrosine pairing), have also been identified as a bona fide NLS binding importin β2 (31). In addition, the NLS of TF are usually present either in the DNA-binding domain or its vicinity making this region a likely target to integrate the tag (32). Other sequences involved in protein localization could also be chosen for internal tagging of nuclear proteins with an unknown NLS sequence or cytoplasmic proteins. For example, the NES (nuclear export signal) has to be recognized by proteins, such as CRM1, to export proteins from the nucleus to cytoplasm and as a consequence, this sequence should be exposed (33). Moreover, proteins have often modular structure. As we could see for Fli-1 and Irf2bp2 (Figure 1), they are composed of different functional domains important for their biological functions. Any place where domains are linked is potentially a tag sequence integration site. However, some will not work properly as they may be inside the protein. Finally, structural knowledge of the targeted proteins (i.e. from crystallography studies) would clearly be advantageous when choosing where to integrate an internal tag sequence.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
Acknowledgments
We thank the members of the FGG laboratory and of the Cell Biology Department for helpful discussions.
Author contributions: G.G. and F.G.G. conceived and designed the experiments. G.G., R.S., A.C., D.H.W.D. and E.S. performed the experiments. G.G., R.S., D.H.W.D., J.A.A.D., E.S. and F.G.G. collected and analyzed the data. D.H. revised the manuscript. G.G., R.S. and F.G.G. wrote the paper.
FUNDING
SyBOSS EU consortium [#050040212, coordinator F. Stewart, Dresden to F.G.G.]; Netherlands Genomics Initiative [NGI, MEC #242129, coordinator F. Grosveld]. IUAP7-7 DevRepair project funding by Belspo, Belgium [to D.H. and F.G.]; FWO-V [GA09411 to D.H.]. Erasmus Medical Center [to G.G. and A.C.]. SyBoSS EU consortium [to R.S.]. SyBoSS, the Dutch Genomics Initiative (MEC) and the Netherlands Institute for Regenerative Medicine (NIRM). Funding for open access charge: Netherlands Genomics Initiative [NGI, MEC #242129, coordinator F. Grosveld].
Conflict of interest statement. None declared.
REFERENCES
- 1.Farnham P.J. Insights from genomic profiling of transcription factors. Nat. Rev. Genet. 2009;10:605–616. doi: 10.1038/nrg2636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Soler E., Andrieu-Soler C., de Boer E., Bryne J.C., Thongjuea S., Stadhouders R., Palstra R.J., Stevens M., Kockx C., van Ijcken W., et al. The genome-wide dynamics of the binding of Ldb1 complexes during erythroid differentiation. Genes Dev. 2010;24:277–289. doi: 10.1101/gad.551810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Grosveld F., Rodriguez P., Meier N., Krpic S., Pourfarzad F., Papadopoulos P., Kolodziej K., Patrinos G.P., Hostert A., Strouboulis J. Isolation and characterization of hematopoietic transcription factor complexes by in vivo biotinylation tagging and mass spectrometry. Ann. N. Y. Acad. Sci. 2005;1054:55–67. doi: 10.1196/annals.1345.008. [DOI] [PubMed] [Google Scholar]
- 4.Pilon A.M., Ajay S.S., Kumar S.A., Steiner L.A., Cherukuri P.F., Wincovitch S., Anderson S.M., Center N.C.S., Mullikin J.C., Gallagher P.G., et al. Genome-wide ChIP-Seq reveals a dramatic shift in the binding of the transcription factor erythroid Kruppel-like factor during erythrocyte differentiation. Blood. 2011;118:e139–148. doi: 10.1182/blood-2011-05-355107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Soler E., Andrieu-Soler C., Boer E., Bryne J.C., Thongjuea S., Rijkers E., Demmers J., Ijcken W., Grosveld F. A systems approach to analyze transcription factors in mammalian cells. Methods. 2011;53:151–162. doi: 10.1016/j.ymeth.2010.08.002. [DOI] [PubMed] [Google Scholar]
- 6.van Riel B., Pakozdi T., Brouwer R., Monteiro R., Tuladhar K., Franke V., Bryne J.C., Jorna R., Rijkers E.J., van Ijcken W., et al. A novel complex, RUNX1-MYEF2, represses hematopoietic genes in erythroid cells. Mol. Cell. Biol. 2012;32:3814–3822. doi: 10.1128/MCB.05938-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bailey J., Manoil C. Genome-wide internal tagging of bacterial exported proteins. Nat. Biotechnol. 2002;20:839–842. doi: 10.1038/nbt715. [DOI] [PubMed] [Google Scholar]
- 8.Gauss R., Trautwein M., Sommer T., Spang A. New modules for the repeated internal and N-terminal epitope tagging of genes in Saccharomyces cerevisiae. Yeast. 2005;22:1–12. doi: 10.1002/yea.1187. [DOI] [PubMed] [Google Scholar]
- 9.Goldfarb D.S., Corbett A.H., Mason D.A., Harreman M.T., Adam S.A. Importin alpha: a multipurpose nuclear-transport receptor. Trends Cell. Biol. 2004;14:505–514. doi: 10.1016/j.tcb.2004.07.016. [DOI] [PubMed] [Google Scholar]
- 10.Huxford T., Huang D.B., Malek S., Ghosh G. The crystal structure of the IkappaBalpha/NF-kappaB complex reveals mechanisms of NF-kappaB inactivation. Cell. 1998;95:759–770. doi: 10.1016/s0092-8674(00)81699-2. [DOI] [PubMed] [Google Scholar]
- 11.Niu C., Zhang J., Gao F., Yang L., Jia M., Zhu H., Gong W. FUS-NLS/Transportin 1 complex structure provides insights into the nuclear targeting mechanism of FUS and the implications in ALS. PLoS ONE. 2012;7:e47056. doi: 10.1371/journal.pone.0047056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Meier N., Krpic S., Rodriguez P., Strouboulis J., Monti M., Krijgsveld J., Gering M., Patient R., Hostert A., Grosveld F. Novel binding partners of Ldb1 are required for haematopoietic development. Development. 2006;133:4913–4923. doi: 10.1242/dev.02656. [DOI] [PubMed] [Google Scholar]
- 13.Wilm M., Shevchenko A., Houthaeve T., Breit S., Schweigerer L., Fotsis T., Mann M. Femtomole sequencing of proteins from polyacrylamide gels by nano-electrospray mass spectrometry. Nature. 1996;379:466–469. doi: 10.1038/379466a0. [DOI] [PubMed] [Google Scholar]
- 14.Wei G.H., Badis G., Berger M.F., Kivioja T., Palin K., Enge M., Bonke M., Jolma A., Varjosalo M., Gehrke A.R., et al. Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo. EMBO J. 2010;29:2147–2160. doi: 10.1038/emboj.2010.106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hu W., Philips A.S., Kwok J.C., Eisbacher M., Chong B.H. Identification of nuclear import and export signals within Fli-1: roles of the nuclear import signals in Fli-1-dependent activation of megakaryocyte-specific promoters. Mol. Cell. Biol. 2005;25:3087–3108. doi: 10.1128/MCB.25.8.3087-3108.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Childs K.S., Goodbourn S. Identification of novel co-repressor molecules for interferon regulatory factor-2. Nucleic Acids Res. 2003;31:3016–3026. doi: 10.1093/nar/gkg431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Teng A.C., Al-Montashiri N.A., Cheng B.L., Lou P., Ozmizrak P., Chen H.H., Stewart A.F. Identification of a phosphorylation-dependent nuclear localization motif in interferon regulatory factor 2 binding protein 2. PLoS ONE. 2011;6:e24100. doi: 10.1371/journal.pone.0024100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Eisbacher M., Holmes M.L., Newton A., Hogg P.J., Khachigian L.M., Crossley M., Chong B.H. Protein-protein interaction between Fli-1 and GATA-1 mediates synergistic expression of megakaryocyte-specific genes through cooperative DNA binding. Mol. Cell. Biol. 2003;23:3427–3441. doi: 10.1128/MCB.23.10.3427-3441.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Starck J., Cohet N., Gonnet C., Sarrazin S., Doubeikovskaia Z., Doubeikovski A., Verger A., Duterque-Coquillaud M., Morle F. Functional cross-antagonism between transcription factors FLI-1 and EKLF. Mol. Cell. Biol. 2003;23:1390–1402. doi: 10.1128/MCB.23.4.1390-1402.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bouilloux F., Juban G., Cohet N., Buet D., Guyot B., Vainchenker W., Louache F., Morle F. EKLF restricts megakaryocytic differentiation at the benefit of erythrocytic differentiation. Blood. 2008;112:576–584. doi: 10.1182/blood-2007-07-098996. [DOI] [PubMed] [Google Scholar]
- 21.Starck J., Doubeikovski A., Sarrazin S., Gonnet C., Rao G., Skoultchi A., Godet J., Dusanter-Fourt I., Morle F. Spi-1/PU.1 is a positive regulator of the Fli-1 gene involved in inhibition of erythroid differentiation in friend erythroleukemic cell lines. Mol. Cell. Biol. 1999;19:121–135. doi: 10.1128/mcb.19.1.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Juban G., Giraud G., Guyot B., Belin S., Diaz J.J., Starck J., Guillouf C., Moreau-Gachelin F., Morle F. Spi-1 and Fli-1 directly activate common target genes involved in ribosome biogenesis in Friend erythroleukemic cells. Mol. Cell. Biol. 2009;29:2852–2864. doi: 10.1128/MCB.01435-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Stadhouders R., Thongjuea S., Andrieu-Soler C., Palstra R.J., Bryne J.C., van den Heuvel A., Stevens M., de Boer E., Kockx C., van der Sloot A., et al. Dynamic long-range chromatin interactions control Myb proto-oncogene transcription during erythroid development. EMBO J. 2012;31:986–999. doi: 10.1038/emboj.2011.450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Athanasiou M., Mavrothalassitis G., Sun-Hoffman L., Blair D.G. FLI-1 is a suppressor of erythroid differentiation in human hematopoietic cells. Leukemia. 2000;14:439–445. doi: 10.1038/sj.leu.2401689. [DOI] [PubMed] [Google Scholar]
- 25.Schuetze S., Paul R., Gliniak B.C., Kabat D. Role of the PU.1 transcription factor in controlling differentiation of Friend erythroleukemia cells. Mol. Cell. Biol. 1992;12:2967–2975. doi: 10.1128/mcb.12.7.2967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kwiatkowski B.A., Bastian L.S., Bauer T.R., Jr, Tsai S., Zielinska-Kwiatkowska A.G., Hickstein D.D. The ets family member Tel binds to the Fli-1 oncoprotein and inhibits its transcriptional activity. J. Biol. Chem. 1998;273:17525–17530. doi: 10.1074/jbc.273.28.17525. [DOI] [PubMed] [Google Scholar]
- 27.Huang H., Yu M., Akie T.E., Moran T.B., Woo A.J., Tu N., Waldon Z., Lin Y.Y., Steen H., Cantor A.B. Differentiation-dependent interactions between RUNX-1 and FLI-1 during megakaryocyte development. Mol. Cell. Biol. 2009;29:4103–4115. doi: 10.1128/MCB.00090-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hamlett I., Draper J., Strouboulis J., Iborra F., Porcher C., Vyas P. Characterization of megakaryocyte GATA1-interacting proteins: the corepressor ETO2 and GATA1 interact to regulate terminal megakaryocyte maturation. Blood. 2008;112:2738–2749. doi: 10.1182/blood-2008-03-146605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kalderon D., Roberts B.L., Richardson W.D., Smith A.E. A short amino acid sequence able to specify nuclear location. Cell. 1984;39:499–509. doi: 10.1016/0092-8674(84)90457-4. [DOI] [PubMed] [Google Scholar]
- 30.Dingwall C., Robbins J., Dilworth S.M., Roberts B., Richardson W.D. The nucleoplasmin nuclear location sequence is larger and more complex than that of SV-40 large T antigen. J. Cell. Biol. 1988;107:841–849. doi: 10.1083/jcb.107.3.841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lee B.J., Cansizoglu A.E., Suel K.E., Louis T.H., Zhang Z., Chook Y.M. Rules for nuclear localization sequence recognition by karyopherin beta 2. Cell. 2006;126:543–558. doi: 10.1016/j.cell.2006.05.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Boulikas T. Putative nuclear localization signals (NLS) in protein transcription factors. J. Cell. Biochem. 1994;55:32–58. doi: 10.1002/jcb.240550106. [DOI] [PubMed] [Google Scholar]
- 33.Yoneda Y. Nucleocytoplasmic protein traffic and its significance to cell function. Genes Cells. 2000;5:777–787. doi: 10.1046/j.1365-2443.2000.00366.x. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.