Abstract
Telomere repeat binding proteins (TRBs) belong to a family of proteins possessing a Myb-like domain which binds to telomeric repeats. Three members of this family (TRB1, TRB2, TRB3) from Arabidopsis thaliana have already been described as associated with terminal telomeric repeats (telomeres) or short interstitial telomeric repeats in gene promoters (telo-boxes). They are also known to interact with several protein complexes: telomerase, Polycomb repressive complex 2 (PRC2) E(z) subunits and the PEAT complex (PWOs-EPCRs-ARIDs-TRBs). Here we characterize two novel members of the TRB family (TRB4 and TRB5). Our wide phylogenetic analyses have shown that TRB proteins evolved in the plant kingdom after the transition to a terrestrial habitat in Streptophyta, and consequently TRBs diversified in seed plants. TRB4-5 share common TRB motifs while differing in several others and seem to have an earlier phylogenetic origin than TRB1-3. Their common Myb-like domains bind long arrays of telomeric repeats in vitro, and we have determined the minimal recognition motif of all TRBs as one telo-box. Our data indicate that despite the distinct localization patterns of TRB1-3 and TRB4-5 in situ, all members of TRB family mutually interact and also bind to telomerase/PRC2/PEAT complexes. Additionally, we have detected novel interactions between TRB4-5 and EMF2 and VRN2, which are Su(z)12 subunits of PRC2.
Supplementary Information
The online version contains supplementary material available at 10.1007/s11103-023-01348-2.
Keywords: Telomere repeat binding, TRB, PRC2, PEAT, TERT, Telomeric
Key message
TRB proteins bind short/long telomeric repeats and attract telomerase, PRC2 or PEAT complexes. Here we show the unique features of novel members of TRB family that have earlier phylogenetic origin.
Supplementary Information
The online version contains supplementary material available at 10.1007/s11103-023-01348-2.
Introduction
Telomere repeat binding proteins (TRBs) were originally characterized as proteins in Arabidopsis thaliana with a binding affinity to telomeric DNA sequences proportional to the number of telomeric repeats (Schrumpfová et al. 2004). They belong to the plant-specific Single myb histone 1 (SMH) family with an N-terminal Myb-like domain (Myb-like), a central histone-like (H1/5-like) domain, and a coiled-coil domain near the C-terminus (Marian et al. 2003). Three members of the TRB family (TRB1, TRB2 and TRB3) in A. thaliana exhibit self-interactions and mutual interactions in the yeast-two hybrid (Y2H) system (Kuchař and Fajkus 2004; Schrumpfová et al. 2004). They bind plant telomeric repeats (TTTAGGG)n through the Myb-like domain (Mozgová et al. 2008), while the H1/5-like domain is responsible for dimerization with other TRB proteins (Schrumpfová et al. 2008).
TRB1-3 proteins are proposed to participate in telomerase biogenesis. They interact directly with the catalytic protein subunit of telomerase (TERT) (Schrumpfová et al. 2014) and mediate interactions between TERT and Recombination UV B-like (RUVBL) proteins (Schořová et al. 2019), homologs of the essential mammalian telomerase assembly components Pontin and Reptin (Venteicher et al. 2008). Nuclear and predominantly nucleolar localization of TRB1-3 interacting with TERT and RUVBLs, as well as with the plant ortholog of dyskerin, CBF5 (Lermontova et al. 2007), was observed using Bimolecular fluorescence complementation (BiFC) (Sweetlove and Gutierrez 2019). Moreover, the TRB1 protein interacts via its H1/5-like domain with Protection of telomeres 1 (POT1b) (Schrumpfová et al. 2008), an A. thaliana homolog of the G-overhang binding protein Pot1, a core component of mammalian telomere cap complex, Shelterin (Tani and Murata 2005; Surovtseva et al. 2007). Additionally, in situ co-localization of TRB1 with telomeric DNA repeats has been detected in plant cells (Schrumpfová et al. 2014; Dreissig et al. 2017).
Telomere shortening was observed in trb1 mutants in the A. thaliana ecotype Columbia, with otherwise-stable telomere lengths (Shakirov and Shippen 2004; Schrumpfová et al. 2014). In contrast, telomere extension was detected in trb2 knockout mutants of the A. thaliana ecotype Wassilewskija, which exhibits telomere length polymorphism in wild-type plants (Shakirov and Shippen 2004; Maillet et al. 2006; Lee and Cho 2016). Triple homozygous mutant plants, containing the alleles from Columbia (trb1 and trb3) and from Wassilewskija (trb2), exhibit telomere shortening (Zhou et al. 2018).
In multicellular organisms, Polycomb repressive complex 1 (PRC1) and PRC2 repress target genes through histone modification and chromatin compaction. In Drosophila melanogaster, four core PRC2 subunits are present: the histone methyltransferase Enhancer of zeste [E(z)], Suppressor of zeste 12 [Su(z)12], Extra sex combs (Esc), and the histone-binding nucleosome remodelling factor 55 kDa (Nurf55). The E(z) homologs in A. thaliana, named CURLY LEAF (CLF) and SWINGER (SWN), are implicated in sporophyte development (reviewed in Mozgova and Hennig 2015). The PRC2 complex primarily methylates histone H3 on lysine 27 (H3K27me3), a mark of transcriptionally silent chromatin. TRB1–3 interact with the PRC2 proteins CLF and SWN (Zhou et al. 2018). We have shown that TRB1 proteins are not only associated with long arrays of telomeric repeats but also with interstitially located short telomeric sequences telo-box motifs, especially in the promoters of translation machinery genes (Schrumpfová et al. 2016). It was further shown that these telo-boxes are part of the cis-regulatory elements that may relate to PRC2 recruitment (Zhou et al. 2016, 2018).
Besides the PRC2 complex, TRB1-3 are components of the PEAT complex (PWOs-EPCRs-ARIDs-TRBs) mediating histone acetylation/deacetylation and heterochromatin condensation. They potentially regulate the RNA-directed DNA methylation (RdDM) pathway (Tan et al. 2018; Tsuzuki and Wierzbicki 2018). The involvement of TRB proteins in histone deacetylation supports the previous observation that TRB2 directly interacts with histone deacetylases (HDT4 and HDA6) (Lee and Cho 2016). Recently, it has also been proposed that histone 1 (H1) selectively prevents H3K27me3 accumulation at telomeres and large pericentromeric interstitial telomeric repeat (ITR) domains by restricting DNA accessibility to TRB proteins (Teano et al. 2020).
Here we show that the plant-specific TRB proteins can be recognized in lower plants, such as Streptophytic algae, as well as in higher plants. In seed plants, TRB proteins are divided into three main lineages. We speculate that due to whole genome duplication (WGD) in A. thaliana, three ancestral TRB proteins have multiplied to the current five TRB members. We characterize new members of TRB family in A. thaliana (TRB4 and TRB5) and demonstrate that all members of the TRB family can bind long arrays of telomeric repeats with high specificity. We defined the minimal recognition motif for all TRBs as one telo-box. Even though TRB4 and TRB5 share very high sequence and structural homology with TRB1, TRB2 and TRB3, they differ in terms of the surface of their Myb-domains and their cellular localization. We provide evidence that TRB4-5 mutually interact with other members of the TRB family and physically interact with TERT, POT1a/b, SWN/CLF, and PWO1-3. Novel interactions were also detected between TRB4-5 and EMF2/VRN2, which represent the Su(z)12 subunits of PRC2. Completing TRB family analysis permits further exploration of the biological roles of these important plant-specific proteins.
Materials and methods
Primers
The sequences of all primers and probes used in this study are provided in Supplemental Table 1.
Plant material
For transient assays, Nicotiana benthamiana plants were grown in soil in LD conditions up to 4 weeks and subsequently used for Agrobacterium tumefaciens infiltration.
Phylogenetic analyses
We combined two homology searches based on A. thaliana TRB1-5. First, we searched completely sequenced genomes using Phytozome v12 and second using BLASTP from available databases (NCBI) and publicly available sequences.
Protein sequences were aligned using the Clustal Omega (Sievers et al. 2011) algorithm in the Mobyle platform (Néron et al. 2009), with homology detection by HMM–HMM comparisons. We screened data after alignment in the BioEdit program (Hall 1999).
Maximum likelihood (ML) analyses of the matrices were performed in RAxML 8.2.4 (Stamatakis 2014) to examine differences in optimality between alternative topologies. 1000 replications were run for bootstrap values. Phylogenetic trees were constructed and modified with iTOL v3.4 (Letunic and Bork 2016). The MEME search was set to identify domains and conserved amino acid (aa) sequence motifs under these conditions: a maximum of 15 motifs for each protein with a wide sequence motif from 2 to 50 and a total number of sites from 2 to 600 MEME 4.11.2 (Bailey et al. 2009). The evolutionarily conserved aa residues were visualized using ConSurf 2016 (Ashkenazy et al. 2016).
Analysis of protein structures
The AlphaFold (Jumper et al 2021; Varadi et al 2022) and SwissModel (Waterhouse et al 2018) tools were used to generate in silico protein models. Structural models were compared as previously described (Palecek & Gruber 2015). All structures including electrostatic potential of their molecular surfaces were visualized using PyMOL Molecular Graphics System, Version 2.4.1, Schrödinger, LLC.
Cloning
For yeast two-hybrid assays (Y2H), most of the Y2H constructs in pGADT7-DEST or pGBKT7-DEST (Horák et al. 2008) were prepared previously: TERT constructs (RID1, TEN, Fw1N, Fw3_NLS, Fw3N and RT) were reported in Majerská et al. 2017, TRB1-3 were reported in Schrumpfová et al. 2014, RUVBL1 and RUVBL2A were reported in Schořová et al. 2019, POT1a and POT1b were reported in Majerská et al. 2017, SWNΔSET and CLFΔSET were reported in Chanvivattana et al. 2004 and Hohenstatt et al. 2018). PWO1 was reported in Hohenstatt et al. 2018, VRN2 and EMF2 were reported in Lindner et al. 2013. The coding sequences of PWO2 and the fragment of PWO3 were cloned in the pGBKT7 and pGADT7 vectors (Clontech), passing through pDONR221 (Invitrogen) as described in Hohenstatt et al. 2018.
For the TRB4 construct, the cloned cDNA sequence of TRB4 (G60951 from Arabidopsis Biological Resource Center; ABRC, https://abrc.osu.edu/) in pENTR223 was used as the entry clone. Site-directed mutagenesis using the QuikChange XL Site-Directed Mutagenesis Kit (Agilent Technologies) was performed following the manufacturer’s instructions to remove the mutation V122A in the protein encoded by pENTR223-TRB4 as described in Wiese et al. 2021. To generate an entry vector containing the cDNA sequence of the TRB5 gene, the total RNA from 7-day-old A. thaliana Col-0 seedlings isolated by TRI reagent (Molecular Research Center) was used for cDNA preparation using M-MuLV Reverse Transcriptase (New England Biolabs). The cDNA sequence of TRB5 was amplified using gene specific Gateway-compatible primers according to the manufacturer’s instructions with primers specified in Supplemental Table 1, and the RT-PCR products were recombined into the Gateway donor vector pDONR207 (Invitrogen). DNA fragments were introduced into the destination Gateway vectors pGBKT7-GW, pGADT7-GW (Addgene) using the LR recombinase reaction (Invitrogen).
For BiFC experiments, the Multisite Gateway® system (Invitrogen) was used to create pBiFCt-2in1 constructs (Grefen and Blatt 2012). The genes encoding TRB1-5 from A. thaliana Col-0 were PCR-amplified from the constructs used in the Y2H system described above by two-step PCR using primers specified in Supplemental Table 1 and Phusion™ High-Fidelity DNA Polymerase (Thermo Fisher Scientific) as described in Wiese et al. 2021. The amplicons with TRB1,2,3,5 genes were cloned into Gateway pDONR221 entry vectors (Thermo Fisher Scientific) carrying either attP1-P4 or attP3-P2 recombination sites using the BP Clonase™ II enzyme mix (Thermo Fisher Scientific). To generate pDONR221 entry clones carrying the TRB4 gene, the In-Fusion® Snap Assembly cloning kit (Takara Bio USA) was used, where PCR-generated TRB4 amplicons with one half of att-sites obtained in the first Phusion PCR step of the Gateway® cloning were fused with linearized PCR-generated pDONR221 backbone with appropriate attL sites by recognizing 15-base pairs (bp) overlaps at their ends according to the manufacturer’s instructions. All entry clones were subsequently used in LR Clonase™ II Plus (Thermo Fisher Scientific) reactions to create pBiFCt-2in1-CC and pBiFCt-2in1-NN expression constructs harboring two protein coding regions C- or N-terminally fused to either the N- or C-terminal eYFP halves (e.g., TRB1-nYFP and TRB1-cYFP). After verification by Sanger sequencing (Macrogene), the constructs were used for transient expression in N. benthamiana leaves.
For transient expression of TRB1-5 fused with GFP in N. benthamiana leaf epidermal cells, the specific entry clones described above (TRB1-3, 5 in pDONR207 and TRB4 in pENTR223) were used in LR Clonase™ II (Thermo Fisher Scientific) reactions to create the pGWB6 (N-terminal GFP fusion under the 35S promoter) expression vectors (Nakagawa et al. 2007). To label nucleolus or nucleoplasm, we co-transfected N. benthamiana leaf epidermal cells with constructs expressing Fibrillarin1-mRFP (Koroleva et al. 2009) or SRp34-mRFP (Lorković et al. 2004; Koroleva et al. 2009), respectively.
For protein expression in Escherichia coli, constructs in pDONR207 (for TRB1 and TRB5) or in pENTR223 (for TRB4) were used as donor vectors for LR Clonase™ reactions (Thermo Fisher Scientific), where genes coding proteins of interest (TRB1, 4 and 5) were transferred into the destination vector pHGWA (Busso et al. 2005).
Yeast two-hybrid assays
Y2H was performed using the Matchmaker TM GAL4-based two-hybrid system (Clontech) as described in Schrumpfová et al. 2014. Successful co-transformation of each bait/prey combination into Saccharomyces cerevisiae PJ69-4a was confirmed on SD plates lacking Leu and Trp, and interactions assessed on SD medium lacking Leu, Trp and His (with or without 1 mM or 3 mM 3-aminotriazol (3-AT)) or SD medium lacking Leu, Trp and Ade at 30 °C. Co-transformation with an empty vector and homodimerization of the TRB1 protein served as a negative and positive control, respectively (Schrumpfová et al. 2014). Each combination was co-transformed at least three times, and at least three independent drop tests were carried out. Protein expression was verified as described in Schořová et al. 2019.
In vitro translation and co-immunoprecipitation
Proteins were expressed from constructs similar to those used in the yeast two-hybrid system with a hemagglutinin tag (pGADT7-DEST; VRN2, EMF2 and POT1a proteins) or a myc-tag (pGBKT7-DEST; TRB proteins) using a TNT Quick Coupled Transcription/Translation system (Promega) in 50 µl reaction volumes according to the manufacturer’s instructions. The VRN2, EMF2 or POT1a proteins were radioactively labelled using 35S-Met. The co-immunoprecipitation procedure was performed as described by Schrumpfová et al. 2008. Input, Unbound and Bound fractions were separated by 10% SDS – PAGE, and analyzed using a FLA7000 imager (Fuji-film).
Transient heterologous expression
A. tumefaciens competent cells (strain GV3101) were transformed with selected expression clones and selected on YEB medium supplemented with gentamycin (50 µg/mL), rifampicin (50 µg/mL), and a vector-specific selection agent (pBiFCt-2in1: spectinomycin 100 µg/mL, pGWB6: kanamycin and hygromycin both 50 µg/mL, pROK2: kanamycin 50 µg/mL) at 28 °C for 48 h. Colonies were inoculated in the same media lacking agar and grown overnight at 28 °C. Bacterial cells of overnight cultures were pelleted by centrifugation (5 min at 1620 g), washed twice, re-suspended, and diluted to an OD600 of 0.5 with infiltration medium (10 mM MES pH 5.6, 10 mM MgCl2 and 200 µM acetosyringone). A suspension of Agrobacterium cells carrying the p19 repressor plasmid was added in a 1:1 ratio with Agrobacterium suspensions harboring plasmids of interest to suppress gene silencing and enhance transient expression (Gehl et al. 2009). Mixed suspensions were incubated with moderate shaking for 1.5–2 h at room temperature and subsequently injected into the abaxial side of leaves of 4-week-old N. benthamiana plant. On the third day after infiltration, tobacco epidermal cells were microscopically analyzed. Microscopy images were acquired using the Zeiss LSM880 laser scanning microscope (Axio Observer Z1, inverted) with Definite Focus 2 (excitation 488 nm for GFP/YFP and 561 nm for mRFP). Images were processed using Fiji/ImageJ (Schindelin et al. 2012) and Adobe Photoshop CS6 (Adobe, San Jose, CA, USA) software.
Nuclei isolation and immunofluorescence
Isolation of nuclei was performed using 10-day-old seedlings as described in (McKeown et al. 2008). Nuclei were centrifuged for 20 min at 350 g and 4 °C, resuspended in 1 × PBS and spotted onto slides. Nuclei were briefly dried at 4 °C, then fixed in 4% paraformaldehyde in 1 × PBS, 0.5% Triton-X for 10 min. Nuclei were rinsed three times in 1 × PBS, blocked in 5% goat serum with 0.05% Tween-20 for 30 min at RT and incubated overnight with antibody anti-TRB 5.2 (Schrumpfová et al. 2014) diluted 1:300 in 5% BSA in 1 × PBS. Nuclei were washed three times for 5 min in 1 × PBS supplemented with 0,05% Tween-20 (PBST), then incubated 1 h with an anti-mouse Alexa 488 antibody, Invitrogen (1:750 dilution). Slides were washed three times for 5 min in 1 × PBST, then dehydrated in ethanol series as described in Kutashev et al. 2021. Coverslips were mounted in 4′,6-Diamidine-2′-phenylindole dihydrochloride (DAPI), 2 µg/mL in Vectashield and imaged using fluorescence using a Zeiss AxioImager Z1 epifluorescence microscope.
Expression of TRB1, 4 and 5 in E. coli
Proteins fused with His-tags were expressed in E. coli (BL21(DE3) RIPL) from the destination vector pHGWA. BL21(DE3) RIPL were grown in Luria–Bertani medium supplemented with ampicillin and chloramphenicol to final concentration 100 ug/ml and 12,5 ug/ml at 37 °C until OD600 reached 0.5. Cells were cultured for 4 h at 25 °C after the addition of IPTG to a final concentration of 0.5 mM. Cells were collected by centrifugation (8,000 g, 8 min, 4 °C). The cell pellet was dissolved in lysis buffer containing 50 mM sodium phosphate (pH 8.0), 300 mM NaCl, 10 mM imidazole, 5 mM β-mercaptoethanol, and protease inhibitor cocktail cOmplete tablets EDTA-free (Roche). The cell suspension was sonicated on ice for 6 min of process time with 1 s pulse and 2 s pause (Misonix). Cell lysate supernatant was collected after centrifugation at 20,000 g, 4 °C for 1 h. Proteins were purified by immobilized-metal affinity chromatography using TALON® metal affinity resin (Clontech), where filtered supernatant (0.45 μm filter) was mixed with TALON® beads and incubated for 1 h. The proteins of interest were eluted with 100 mM imidazole in the same buffer. These proteins were then concentrated, and the buffer was exchanged for 50 mM sodium phosphate (pH 7.0), 300 mM NaCl by ultrafiltration (Amicon 3 K/30 K, Millipore). The concentration of purified proteins was determined using the Bradford assay. We evaluated protein purity using SDS–polyacrylamide gel electrophoresis, with gels stained by Bio-Safe Coomassie G250 (Bio-Rad). To ensure successful purification, the purified proteins were detected by specific monoclonal mouse anti-polyhistidine antibody as described previously (Schrumpfová et al. 2004) and by matrix-assisted laser desorption-ionization tandem mass spectrometry (MALDI-MS/MS) (CEITEC Proteomics Core Facility).
Electrophoretic mobility shift assay (EMSA)
DNA probes and competitors used are described in Supplemental Table 1. To reduce a non-specific DNA–protein binding, 10 pmol of purified TRB1, 4 or 5 were preincubated with 1, 10, or 100 pmol of a specific competitor (oligodeoxynucleotides of non-telomeric or competitor telomeric sequence in double-stranded form, as indicated in Results) for 20 min in EMSA buffer (10 mM Tris–HCl pH 8.0, 1 mM EDTA, 1 mM dithiothreitol, 50 mM NaCl, and 5% w/v glycerol). Probes were end-labelled using [γ-32P]ATP and polynucleotide kinase (New England Biolabs), according to (Sambrook et al. 1989). A probe (1 pmol) was added to the reaction mixture on ice and incubated for 20 min before the mixture was loaded onto a 7.5% w/v non-denaturing polyacrylamide gel (AA:BIS = 37.5:1, 0.5 × TBE, 1.5 mm thick). Electrophoresis was at 15 °C for 3 h at 10 V/cm in a precooled buffer. Signals of labelled oligonucleotides were detected using a phosphorimager FLA-7000IP (Fujifilm).
Results
Sequence and structural divergences in the TRB family
Two decades ago, in silico analysis already predicted that the A. thaliana SMH family contains five TRB members (Marian et al. 2003). We used a combination of recent genome and transcriptome annotations (Lamesch et al. 2012; Cheng et al. 2017) and predicted sequence and structural similarity to characterize the TRB family members TRB4 and TRB5. These were each found to contain an N-terminal Myb-like domain, a central H1/5-like domain, and a C-terminal coiled-coil domain, similar to previously characterized family members TRB1-3 (Fig. 1A) (Marian et al. 2003; Schrumpfová et al. 2004; Mozgová et al. 2008).
We then performed a phylogenetic reconstruction of TRB proteins from the Brassicaceae family based on a matrix including 52 TRBs, with 357 aligned positions. The tree shows that in A. thaliana, as well as in other Brassicaceae, TRB4 and TRB5 are grouped in a monophyletic lineage that is distant from TRB1-3 (Fig. 1B, Supplemental Table 2).
The Myb-like domain is composed of a helix-turn-helix (HTH) motif (Ogata et al. 1992; Bilaud et al. 1996). Even though the domain composition of SMH proteins is unique to the plant lineage, the Myb-like domain shows high aa sequence conservation across the plant and animal kingdoms (Fig. 1C). Predicted three-dimensional (3D) models of A. thaliana TRB Myb-like structural features were overlaid with the X-ray diffraction-resolved crystal structure of human Telomeric repeat-binding factor 2 (hTRF2) (Fig. 1D) and it was found that the 3D structure of the Myb-like domain is well conserved in both plant and animal kingdoms. Interestingly, all TRBs show a slight difference in the composition of the second helix. However, the aa residues mediating the interaction between hTRF2 and the human telomeric repeat sequence (TTAGGG) are fully conserved in A. thaliana even though the plant telomeric sequence slightly differs (TTTAGGG) from the human one (Fig. 1C, D).
The central linker histone globular domain (H1/5-like domain) adopts a winged-helix fold including a HTH motif and a “wing” defined by two β-loops (Ramakrishnan et al. 1993). Using the SWISS-MODEL tool, the Xenopus laevis (Xl) H1 domain (XlH1.0) appeared to be the closest template to A. thaliana TRBs H1/5-like domain (Bednar et al. 2017). We compared the 3D structure of the histone globular domain from XlH1.0-B, obtained by cryo-electron microscopy (Cryo-EM) and X-ray crystallography (Bednar et al. 2017), with the AlphaFold protein structure predictions of the H1/5-like domain of A. thaliana TRBs (Varadi et al. 2022). Although the sequence conservation between the XlH1 domain and the H1/5-like domain of A. thaliana TRBs is lower than in the Myb-like domain, the 3D structure of the H1/5-like domain in TRBs is highly similar (Fig. 1C, D).
The C-terminal coiled-coil domains usually contain a repeated pattern of hydrophobic and charged aa residues, referred to as a heptad repeat (Lupas and Gruber 2005). This repeating pattern enables two helices to wrap/coil around each other. The conservation of hydrophobic residues (Alanine (A), Isoleucine (I), Valine (V), Phenylalanine (F) or Methionine (M)) and an acidic residue (Glutamic acid (E)) is obvious in all coiled-coil domains from A. thaliana TRBs. Only one long α‐helix is predicted by AlphaFold in TRB4-5, while the coiled-coil domains from TRB1 and TRB2 seem to have an additional short α‐helix (Fig. 1C, D). However, these predictions await experimental verification.
The most divergent of the three domains in TRBs (Myb-like, H1/5-like and coiled-coil) is the coiled-coil domain. Our alignments suggest that although TRB4 and TRB5 are distant from TRB1-3 members in terms of sequence, they are folded into similar three-dimensional structures, with only minor differences.
The evolution of TRBs within the plant lineage
We performed comprehensive phylogenetic analysis to investigate whether TRBs are conserved across lower and higher plants, and whether TRB4-5 form a distinct group to TRB1-3 in all species, as observed in the higher plant A. thaliana. We used a data set of 268 proteins and 599 aligned positions and found that TRB proteins first evolved in Streptophyta in Klebsormidiophyceae, although TRBs are missing in the other Streptophyte algae studied, including Charophyceae and Zygnematophyceae. In Klebsormidium nites only one TRB homolog was identified. Following the evolutionary tree, an increasing number of TRB homologues were found in Bryophyta and Tracheophyta. There are three homologues of TRB in mosses Sphagnum fallax and Physcomitrium patens, and these TRBs share very high sequence similarities. Contrastingly, there is only one homologue in the moss Ceratodon purpureus. Perhaps due to limited sequence information available for the genomes of hornworts or liverworts, TRB was not found in these lineages. In seed plants, which have undergone more rounds of whole genome duplication events (WGDs) than Bryophyta and Lycophyta (Clark and Donoghue 2018), predominantly three TRB proteins were recognized. Within Brassicaceae, which has undergone an additional recent round of WGD (Walden et al. 2020), five TRB homologs were revealed (Fig. 2A).
Next, the MEME search (Bailey et al. 2009) was used to identify the 15 most typical motifs in TRBs across the plant kingdom (Supplemental Fig. 1). For ease of presentation, a simplified tree was used with 83 representatives from 33 families and 26 orders, each family having only one member (Fig. 2B, Supplemental Table 3). TRB proteins were divided into three main lineages named TRB_A, TRB_B and TRB_C, into the latter of which the TRB_D sub-lineage is integrated. The lineage TRB_A includes Streptophyte algae, Bryophyta and Lycophyta and diverges from seed plant lineages TRB_B and TRB_C in terms of the length of introns, disordered parts of proteins or additional motifs. A clear diversification of TRBs into monocots and dicots was revealed in both TRB_B and TRB_C lineages. In seed plants, the TRB_B lineage seems to be less abundant than the TRB_C lineage.
Consistent with previous findings, the canonical N-terminal Myb-like (motif 1), a central H1/5-like (motifs 2, 4 and 7), and a C-terminal coiled-coil (motif 3) domains include the top conserved motifs present in all TRBs (Fig. 2B). Within the lineage TRB_A, a unique aa motif was detected in Bryophytes (motif 15; EEREH). Interestingly, the coiled-coil domains of the proteins from TRB_B lineage contain the specific motif 11 (EDTDS), but most of the proteins from the TRB_A and TRB_C lineage contain the motif 6 (DAEAA) instead. However, motif 6 is not present in the A. thaliana protein TRB5 that has diverged from TRB4. The A. thaliana proteins TRB4 and TRB5 (belonging to the TRB_B lineage) lack motif 8 (DDVKI) adjacent to the coiled-coil domain. In contrast, this motif is present within proteins of the TRB_C lineage, including A. thaliana TRB1-3 proteins.
The TRB_D sub-lineage is embedded within TRB_C lineage but supported by a high 99% bootstrap value. TRB2 and TRB3 from A. thaliana are nested within the TRB_D sub-lineage. This lineage lacks motif 5 (MSVMA), a motif adjacent to the Myb-like domain, which is present in the rest of the TRBs in the TRB_C lineage. Similarly to Brassicaceae, divergence to the TRB_D sub-lineage was detected within several other dicot families (e.g., Rhamnaceae - Ziziphus jujuba, Cucurbitaceae - Cucumis sativus, Euphorbiaceae - Ricinus communis, Rutaceae - Citrus sinensis, Malvaceae - Glycine max, Cleomaceae - Tarenaya hassleriana).
In general, TRBs were detected in lower and higher plants. TRBs in Streptophyte algae, Lycophyta and Bryophyta (grouped in TRB_A lineage) are more closely related to A. thaliana TRB4 and TRB5 proteins (TRB_B lineage) than to A. thaliana TRB1-3 (TRB_C lineage). Lineage TRB_B and TRB_C differ in several motifs accompanying the canonical H1/5-like or coiled-coil domains. The specificity of the sub-lineage TRB_D, embedded into TRB_C lineage, is highlighted by its lack of a specific motif 5 following the canonical Myb-like domain.
The TRB4-5 from dicots differ in solution accessible surface of the Myb-like domain
The N-terminal Myb-like domain is the most conspicuous structural unit in TRBs. In order to compare the structural characteristics of this, we further examined the sequence conservation and estimated evolutionary conservation of predicted 3D structures. The TRB groups (TRB_A, B, C, D), established in Fig. 2B, were subdivided into Lycophytes, Bryophytes, Monocots and Dicots. Consensus sequences of the Myb-like domain in each individual group are visualized in Fig. 3A.
The evolutionary dynamics of aa substitutions in A. thaliana TRB4, 1 and 2, representing Dicots from lineages TRB_B, C and D, respectively, were visualized using ConSurf 2016 (Ashkenazy et al. 2016). Visualization has shown a very strong conservation of the DNA-binding surface in all lineages (Supplemental Fig. 2), consistent with the above analysis (Figs. 1 and 2). In comparison, opposite solution-accessible surface was less conserved; in Dicots from the TRB_D lineage, conservation was lower than in Dicots from TRB_B and TRB_C lineages (Fig. 3B, Supplemental Fig. 2). Dicots in all three lineages exhibit a variant unstructured N-terminus of the MYB-domain which is conserved within the lineage. The aa at the N-terminal position 3 in Dicots from the TRB_B lineage, including TRB4-5 from Arabidopsis, show significantly conserved substitutions of Alanine (A) residues to the polar uncharged aa Asparagine (N) (Fig. 3, Inverted triangle). The unstructured C-terminal tail of the Myb-like domain in the TRB_B lineage (Fig. 3, Triangle, Supplemental Fig. 2A) shows less conservation than in TRB_C and TRB_D lineages, where these residues are part of the third helix.
Surface models showing the charge of the Myb-like domain were visualized using PyMol viewer (Fig. 3C, Supplemental Fig. 2C, F). In the TRB_B lineage, visualizations reveal a negatively charged aa (E/D) at position 14 (Fig. 3, Rhombus) flanking conserved EEE motif (Fig. 3, Circle), whereas proteins in the TRB_A and TRB_C lineages possess uncharged Alanine (A) at the position 14. Interestingly, the aa (positively charged aa K/R at position 17) proximal to the E/D motif, are replaced to uncharged (L) in Dicots from TRB_B lineage (Fig. 3C, Trapezoid).
These data indicate that the E/D motif at position 14 together with aa at position 17 are responsible for the additional areas of negative charge on the solution-accessible surface of the Myb-domain in Dicots from TRB_B lineage.
Even one telomeric unit is sufficient for TRB binding
Our previous findings revealed that the N-terminal Myb-like domains of TRB1-3 are responsible for specific recognition of long arrays of telomeric DNA (Schrumpfová et al. 2004; Mozgová et al. 2008). The DNA-binding preference for oligodeoxynucleotide (oligo) substrates was tested using the electrophoretic mobility shift assay (EMSA). Oligo sequences were designed to assess the effect of TRB4 and TRB5 binding to long arrays of telomeric sequences as well as to interstitially located telo-boxes (Fig. 4A, Supplemental Table 1). Either a tetramer of telomeric sequences or a telo-box sequence (1.2 telomeric units) flanked with non-telomeric DNA were used as the labelled probes. A non-telomeric oligonucleotide, added in 1, 20 and 100-fold excess, served as competitor DNA and vice versa. Parallel experiments with the TRB1 protein were used for comparison.
The results obtained from these experiments clearly show that TRB4 and TRB5 do indeed preferentially bind long arrays of telomeric sequences or telo-boxes positioned within a non-telomeric DNA sequence (Fig. 4B–D). TRBs bind to telomeric dsDNA in a similar mode as was described for TRB1-3, forming a high-molecular-weight complex which does not migrate into the gel (Schrumpfová et al. 2004; Mozgová et al. 2008). The binding of all three TRB proteins to the telomeric sequence is highly specific, as even a 100-fold higher concentration of non-telomeric competitor does not prevent the formation of protein-telomeric DNA complex.
Our data indicate that TRB4-5, as well as other TRBs, are capable of binding long arrays of telomeric sequences or short motifs with as little as one telomeric repeat. This is in contrast to human TRF1/TRF2, which bind two telomeric repeats as preformed dimers (Court et al. 2005) and also to previous predictions (Hofr et al. 2009). Having defined the TRB minimal recognition motif as one telo-box, the binding properties of TRBs are poised for further investigation.
Unlike other TRB family members, TRB5 is preferentially localized in the cytoplasm
To compare Arabidopsis TRB proteins further, we examined the subcellular localization of native TRB proteins in Arabidopsis cells, using a mouse monoclonal antibody developed in our laboratory specific for the conserved section of the Myb-like domain found in the TRB family. The anti-TRB 5.2 antibody recognizes all five members of TRB family (Schrumpfová et al. 2014). Nuclei isolated from 10-day-old seedlings were subjected to immunofluorescence using this anti-TRB antibody combined with DAPI staining. We observed a speckled distribution of TRBs in the nucleus and nucleolus (Fig. 5A).
As the anti-TRB 5.2 antibody does not distinguish the localization of individual proteins, we proceeded to the subcellular localization of individual members of TRB family using TRBs fused with green fluorescent protein (GFP). Confocal microscopy showed that TRB1-3 fused with GFP expressed from pGWB6 after transient Agrobacterium-mediated transformation in N. benthamiana leaf epidermal cells are localized mainly in nucleoli and nucleoplasmic fluorescence foci, as was described previously (Schrumpfová et al. 2014; Zhou et al. 2016). Interestingly, TRB4 is distributed not only in nucleoli and nucleoplasmic fluorescence foci of different sizes, but also throughout the nucleoplasm. Conversely, TRB5 fused with GFP is localized mainly in the cytoplasm with only minor localization in the nucleolus and nucleoplasm (Fig. 5B, Supplemental Figs. 3, 4).
Co-expression of GFP-TRBs with the nucleolar marker Fibrillarin1 or nucleoplasm marker Serine-arginine-rich proteins 34 (SRp34) fused with a monomeric red fluorescent protein (mRFP) after transient Agrobacterium-mediated transformation in N. benthamiana leaf epidermal cells verified the subnuclear localizations of the TRB4 and TRB5 described above. Using Fibrillarin-mRFP we clearly identified the stronger localization of GFP-TRB4 in the nucleolus, but the weak nucleolar localization of GFP-TRB5. The relative positioning of GFP-TRB and the nucleoplasm marker SRp34-mRFP also supports our observation that the GFP-TRB5 is preferentially localized in the cytoplasm (Fig. 5C).
TRB4-5 fused with GFP show a distinct localization in the plant cell compared to TRB1-3, proteins with a later evolutionary origin. In particular, GFP-TRB5 manifests strong cytoplasmic localization, suggesting a possible specific functional role or spatiotemporal regulation.
Dimerization of TRB proteins
To shed light on the conservation of mutual interactions between TRB proteins from Arabidopsis, we analyzed interactions between all members of TRB family. To date, self and mutual dimerization of TRB1-3 has been investigated by Y2H or by co-immunoprecipitation experiments (Co-IP) (Kuchař and Fajkus 2004; Schrumpfová et al. 2004, 2008; Mozgová et al. 2008), however knowledge of the precise subcellular localization of mutual TRB interactions is missing.
First, the interactions of TRB4 and TRB5 with other TRB family members were investigated using a GAL4 based Y2H assay. Mutual interactions between TRB family members from Arabidopsis appear to be conserved, as both TRB4 and TRB5, interact with all TRB family members. Moreover, TRB4-5 form self-dimers in a similar way to TRB1-3 (Fig. 6A).
Next, BiFC assays with equal protein levels were used to detect self- and mutual interactions of TRBs at the level of cellular compartments. The TRB coding sequences were introduced into 2in1 multiple expression cassettes within a single vector backbone with an internal marker for transformation and expression-mRFP1 (Grefen and Blatt 2012). TRBs were fused with an N- and/or C-terminal half of yellow fluorescent protein (nYFP, cYFP) as both N-terminal and C-terminal fusions. A list of all constructs is provided in Supplemental Table 4. Fluorescence was detected using confocal laser scanning microscopy after transient Agrobacterium-mediated transformation in N. benthamiana epidermal cells. Our observations show self- and mutual interactions of TRB1-4 in the nucleolus and/or in nucleoplasmic fluorescence foci that were of different numbers and sizes. Interestingly, the TRB5 homodimeric interaction was found to be clearly cytoplasmic with a reduced nuclear speckle size compared to other TRB interactions. Additionally, TRB1-3, but not TRB4, interact moderately with TRB5 in the nucleolus, as well as in more prominent nucleoplasmic fluorescence foci (Fig. 6B, Supplemental Figs. 5, 6).
Overall, our characterization of mutual TRB interactions revealed that all TRB members have the ability to form self-dimers and mutually interact. However, TRB5 homodimeric interactions are predominantly localized in the cytoplasm, which differs from TRB1-4 homodimeric interactions, which are nucleolar or localized in nucleoplasmic fluorescence foci.
Novel interaction partners of TRB proteins—PRC2 subunits EMF2 and VRN2
To further elucidate the conservation of protein interaction partners throughout the TRB family, we looked for interactions between TRB4 and TRB5 with TERT, RUVBLs, POT1b, PWO1-3, SWN, CLF interactors, which have already been described as interacting with TRB1-3 (Schrumpfová et al. 2008, 2014; Hohenstatt et al. 2018; Zhou et al. 2018; Tan et al. 2018; Schořová et al. 2019; Mikulski et al. 2019). These interactions were investigated using a GAL4-based Y2H assay, and TRB interactions with newly identified protein interactors were verified by Co-IP.
Our results revealed that TRB4-5 interact with the N-terminal domains of TERT called telomerase essential N-terminal (TEN, 1 – 233 aa) and RNA interaction domain 1 (RID1, 1 – 271 aa) fragments, in a similar manner to that described for TRB1-3 (Schrumpfová et al. 2014). Interactions of TRBs with TERT seem to be mediated only via the N-terminal domain of TERT, as the centrally positioned Telomerase RNA-binding domain (TRBD) or reverse transcriptase domain (RT) domain do not show interactions (Supplemental Fig. 7). We also observed interactions of TRB4-5 with RUVBL1, RUVBL2A and POT1b in the same manner as was observed for TRB1-3. Moreover, Y2H experiments detected novel interactions between TRB4-5 with a second homolog of the human POT1 protein in A. thaliana - POT1a. These interactions were further confirmed by the Co-IP of these proteins (Fig. 7A, Supplemental Fig. 8A).
Furthermore, TRB4-5 interact with PWOs, members of the PEAT complex, as was already observed for TRB1-3 (Tan et al. 2018). A strong interaction was detected between TRB4-5 and PWO1 full-length (Supplemental Fig. 8B) or PWO1 N-terminal sections, including PROLINE-TRYPTOPHANE-TRYPTOPHANE-PROLINE (PWWP) domain (1–223 aa) (Fig. 7B). PWO2 and 3 interact with TRB4 via their N-terminal parts (1–216 and 1–204 aa, respectively). Similarly, we detected the interactions of the N-terminal parts of PWO1 and PWO3 with TRB5.
To identify TRB interactions with E(z) homologs, members of the PRC2 complex, we used SWN and CLF proteins without the SET [Su(var)3–9, E(z), Trx] domain that confers histone methyltransferase activity (SWNΔSET and CLFΔSET) (Chanvivattana et al. 2004; Hohenstatt et al. 2018). The SET-domain does not seem to be involved in interactions of SWN and CLF with TRB4-5 as Y2H experiments identified interactions between TRB4-5 and SWNΔSET and CLFΔSET (Fig. 7C).
It was shown recently that the rice homologue of TRB, Telomere repeat-binding factor 2 (TRBF2), and rice Su(z)12 homologues of the PRC2 complex named EMBRYONIC FLOWER 2b (EMF2b) interact with each other (Xuan et al. 2022). We observed novel interactions, not only between Arabidopsis TRB4-5 and EMF2, but also between TRB4-5 and other Su(z)12 homologue VERNALIZATION 2 (VRN2). The observed interactions from Y2H system were verified by Co-IP (Fig. 7C).
This broad screen of TRB4-5 protein interaction partners suggests a role of TRBs in various protein complexes. TRBs might contribute to the recruitment of these complexes to telomeric DNA repeats (e.g., Telomerase complex to telomeres) (Fig. 8A) or to telo-boxes localized in the promoter regions of the various genes (e.g., PEAT complex or PRC2 subunits to telo-boxes) (Fig. 8C, D, F).
Discussion
Although the TRBs were originally characterized as being associated with long arrays of telomeric repeats (Schrumpfová et al. 2004, 2014; Mozgová et al. 2008; Dvořáčková et al. 2010; Dreissig et al. 2017) (Fig. 8A), recent observations indicate broad engagement of TRB proteins in various cellular pathways. The most important TRB functions (Fig. 8B–F) include interactions with the telomerase complex (Schrumpfová et al. 2014; Schořová et al. 2019), association with telo-boxes in the promoters mainly of translation machinery genes (Schrumpfová et al. 2016), recruitment of PRC2 and PEAT complexes to telo-boxes (Zhou et al. 2018; Tan et al. 2018; Tsuzuki and Wierzbicki 2018; Mikulski et al. 2019) or antagonisms between TRB1 and H1 at long interstitial telomeric DNA repeats (Teano et al. 2020).
TRB4-5 are evolutionarily closer to TRBs from lower plants
Our phylogeny indicates that TRB proteins with N-terminal Myb-domains are conserved in plants and probably arose in Streptophyte algae within the family Klebsormidiophyceae. This corresponds to the transition of plants to a terrestrial habitat 800 Mya ago (Cheng et al. 2019) (Fig. 2). However, other groups of Streptophyte algae do not have TRB proteins, favoring a birth and death model of gene evolution (Nei et al., 2005; Eirín- López et al., 2012) or total divergence (Pinho and Hey 2010), such as to the TRF-like (TRFL) genes with a C-terminal Myb-domain (Karamysheva et al. 2004; Hwang and Cho 2007; Fulcher and Riha 2016; Majerská et al. 2017).
Recurrent gene duplications over many generations have created orthologs and paralogs in the plant genome, giving rise to several new protein functions. In green algae, the majority of gene families contain only one gene (Clark and Donoghue 2018; Qiao et al. 2019), consistent with this, only one copy of the TRB protein was detected in Streptophyte algae. An increased number of TRB homologs were found in Embryophyta, which could be a result of WGD within these derived groups. In Mosses (taxon Bryophyta), the genome was duplicated in several independent events. Two WGD events in Sphagnum and Physcomitrium around 200 Mya and 120 Mya, respectively (Lang et al. 2018; Clark and Donoghue 2018), resulted in three almost identical TRB proteins clustered in the lineage TRB_A. The moss Ceratodon also underwent WGD, but unlike Sphagnum and Physcomitrium, the TRB duplication was not detected. No TRBs were found in hornworts and liverworts, but this may be due to the limited data available on these genomes.
Our results suggest diversification of the TRB gene family in seed plants which is linked with multiple subsequent or independent events of WGD. Lineage TRB_B seems to have evolved in the ancient past and is closely related to TRB_A in Bryophytes. The sub-lineage TRB_D is present in dicots in many families and embedded within the TRB_C lineage.
Within Brassicaceae, nested WGDs resulted in multiple TRB homologs. A. thaliana has five TRBs with closely related proteins TRB2-3 and TRB4-5. However, within Brassicales the situation differs. The genome of Carica papaya (Caricaceae) possesses only two TRB proteins, each belonging to one of the lineages TRB_B or TRB_C lineages. This observation is in agreement with an older paleoploidy event in the Brassicaceae lineage (β) that is not shared by C. papaya but is shared by all other analyzed Brassicaceae species (Dassanayake et al. 2011; Wang et al. 2011). As summarized by Rockinger et al. 2016, the ancestor of all Caricaceae underwent only a single WGD event, in comparison to the ancestor of A. thaliana, which underwent a more recent, additional round of WGD (α) (Bowers et al. 2003; Kagale et al. 2014). Similarly, in monocots, where several independent WGDs have occurred, an increased number of TRB proteins were found, e.g., six TRBs were detected in Zea mays (Fig. 2B).
The complexity of plant genomes is extremely high, and annotations of plant genomes are undergoing intensive improvement at present (Kress et al. 2022). Further revision and reinvestigation of the TRB evolutionary tree may help to elucidate species-specific TRB variants such as the one identified in Malus domestica.
Conserved structure of individual domains
Although TRB4-5 are grouped in a monophyletic lineage that is distant to TRB1-3 proteins, our assessment of predicted structural models implies that all TRB family members are folded into similar three-dimensional structures. The TRB Myb-like domain is very closely related to that of other telomere-binding proteins, including human TRF1 and TRF2 (Chong et al. 1995; van Steensel et al. 1998; Smogorzewska and de Lange 2004). TRF1 and TRF2 bind to DNA as preformed homodimers (Bianchi et al. 1997, 1999; Broccoli et al. 1997). It was suggested that in vitro the Myb-like binding domain of TRF1 binds to DNA essentially independently of the rest of the protein (König et al. 1998; Bianchi et al. 1999; Court et al. 2005). Specificity in DNA recognition of TRF1 and TRF2 is achieved by several direct contacts from aa side chains to the DNA, mainly via helix 3 and an extended N-terminal arm (Hanaoka et al. 2005; Court et al. 2005). Predicted models of the plant Myb-like domain (Fig. 1) show that the overall 3D structure is preserved in both plant and animal kingdoms, and that the surface mediating protein-DNA interactions are fully conserved. However, the surface side and extended N-terminal arm of the Myb-domain of proteins from the TRB_B family differ to those of the TRB_A or TRB_C families (Fig. 3).
Consistent with the conserved features of the Myb domain, our EMSA results showed the conserved ability of TRB4-5 proteins to bind telomeric sequences. Notably, not all proteins with Myb-like domains are able to bind telomeric repeats, as Arabidopsis proteins from the TRFL family, with this motif at the C-terminus, need an accessory Myb-extension domain for telomeric dsDNA interactions in vitro (Karamysheva et al. 2004; Ko et al. 2008). Moreover, unlike TRBs, only Arabidopsis plants deficient in one particular member (Hwang and Cho 2007), but not plants deficient for all six TRFL proteins with Myb-extension domain, exhibit changes in telomere length (Fulcher and Riha 2016). Our EMSA results showed the ability of TRB4-5 proteins to bind longer telomeric tracts as well as short telo-boxes, containing roughly one telomeric repeat flanked by non-telomeric sequence (Fig. 4). These observations suggest that TRB4-5 proteins may be associated with cis-regulatory elements in the promoter regions of Arabidopsis genes as was described for TRB1-3 (Zhou et al. 2016; Schrumpfová et al. 2016). The presence of only one telomeric repeat in the telo-box raises the question of whether TRB proteins operate on promoter regions as monomers/dimers or multimers. Based on our quantitative DNA-binding study, we propose that TRB1 and TRB3 bind long telomeric DNA arrays with the stoichiometry of one protein monomer per one telomeric repeat (Hofr et al. 2009), but the stoichiometry of TRBs in regulatory complexes associated with telo-boxes needs further elucidation.
The sequence-specific interaction between telomeric dsDNA and TRB1-3 is mediated predominantly by the Myb-like domain, although additional domains from TRBs can also contribute to non-specific DNA interactions (Schrumpfová et al. 2004; Mozgová et al. 2008; Hofr et al. 2009). The H1/5-like domains of TRB proteins belong to the same group as central globular domain of core H1 histones, the incorporation of which directly influences the physicochemical properties of the chromatin fiber and further modulates nucleosome distribution, chromatin compaction and contributes to the local variation in transcriptional activity by affecting the accessibility of transcription factors and RNA polymerases to chromatin (Fan and Roberts 2006; Zhou et al. 2015; Hergeth and Schneider 2015; Bednar et al. 2017; Fyodorov et al. 2018). The H1/5-like domain of TRB mediates non-specific DNA interactions (Mozgová et al. 2008) as well as interactions with the other members of TRB family and also with the POT1b protein (Schrumpfová et al. 2008). The globular domain of H1 adopts a winged-helix fold with a “wing” defined by two β-sheets. 3D model predictions suggest only minor differences between TRB4-5 and TRB1-3 within the H1/5-like domain, as the loop between two antiparallel β-sheets in the H1/5-like domain of TRB4-5 is longer than in TRB1-3.
Coiled-coil domains are structural motifs that consist of two or more α-helical peptides that are wrapped around each other in a superhelical fashion that may mediate interactions between proteins (Lupas and Gruber 2005; Apostolovic et al. 2010). Only minor differences were predicted for the 3D structure of the coiled coil domain of TRB4-5 and TRB1-3. However, the motifs at the C-terminal sections of the coiled-coil domains in TRBs from TRB_B differ from those motifs at the C-terminal parts of TRB_C lineages, and clearly distinguish these two evolutionarily distinct groups.
TRB4-5 differ in subcellular localization from TRB1-3
In previous studies, it was shown that TRB1-3 are highly dynamic DNA-binding proteins with cell-cycle regulated localization. During interphase, GFP-fused TRB1-3 proteins are preferentially localized in the nucleus, with a strong nucleolar signal and relatively strong nuclear speckles of different sizes (Dvořáčková et al. 2010; Schrumpfová et al. 2014; Zhou et al. 2018). A similar pattern was observed in transiently or stably transformed Arabidopsis cells (Dvořáčková et al. 2010) or in tobacco epidermal cells (Schrumpfová et al. 2014) despite the fact that tobacco telomeres are dispersed throughout the nucleus (so-called non-Rabl chromosome configuration), while Arabidopsis telomeres are clustered around the nucleolus (rosette-like chromosome configuration) (Shan et al. 2021). Our results show that even native TRBs in isolated Arabidopsis nuclei can be detected with a specific antibody, and are localized in the speckles. Some of these speckles in the vicinity of the Arabidopsis nucleoli might be telomeric (Dvořáčková et al. 2010), or might be Cajal bodies (Dvořáčková 2010), as was demonstrated for speckles detected in tobacco nuclei.
Here we show that TRB4 and TRB5 fused to GFP have, in contrast to TRB1-3, a distinct subcellular localization pattern (Fig. 5B, C). In addition, unlike all other members of the TRB family, TRB5 is preferentially localized in the cytoplasm. It remains to be clarified whether it is sequestered there or plays a specific functional role.
Our Y2H or BiFC assays proved that TRB4-5 could form homo- and hetero-dimers, as observed in TRB1-3 (Kuchař and Fajkus 2004; Schrumpfová et al. 2004). In addition to dimers, TRB1-3 are capable of forming both homo- and heterotypic multimers via their H1/5-like domain (Schrumpfová et al. 2004; Mozgová et al. 2008; Hofr et al. 2009). Similar multimerization can also be assumed for TRB4 and TRB5, as we observed the formation of high molecular weight complexes for TRB4-5 in EMSAs, which did not migrate into the gel (Warren et al. 2003).
Despite distinct subcellular localization of GFP-TRB4 and GFP-TRB5, we observed mutual interactions between all TRB family members using BiFC, predominantly in the nuclear speckles (Fig. 6B). Only the TRB5 homodimeric interaction was found to be clearly cytoplasmic with nuclear speckles of decreased size compared to the size of speckles in the other homodimeric or mutual TRB interactions. Our BiFC assays showed that TRB1-3, but not TRB4, have the ability to drag TRB5 into the nucleolus, as TRB4 does not interact with TRB5 in the nucleolus. We can assume that TRB proteins form various heteromers in different subcellular compartments that might possess different functions related to distinct biochemical pathways.
Interconnection of TRBs with various protein complexes
Even though TRB4-5 show slightly distinct localization patterns compared to TRB1-3, it appears that all TRBs may interact with similar partners in vitro (Fig. 7). TRB4-5 proteins directly interact with the N-terminal domains of TERT as was also shown for TRB1-3 proteins (Schrumpfová et al. 2014). Additionally, TRB4-5 interact with both RUVBL1 and RUVBL2A proteins, which may imply that they may mediate interactions in the trimeric complex RUVBL-TRB-TERT as was proposed for TRB1-3 (Schořová et al. 2019). Telomerase might also be modulated by POT1 proteins: A. thaliana POT1a positively regulates telomerase activity, POT1b is proposed to negatively regulate telomerase and promote chromosome end protection (Beilstein et al. 2015). We observed not only the expected interaction between TRB4-5 and POT1b (Schrumpfová et al. 2008), but also revealed novel interactions of TRB proteins with POT1a. Altogether, we can assume that TRB4-5 are associated with the telomerase complex as was proved for TRB1-3.
In addition to telomeres (Schrumpfová et al. 2014), TRB1-3 regulate the PRC2 target genes (Zhou et al. 2016, 2018). Interestingly, H3K27me3 is present at telomeres of Arabidopsis (Adamusová et al. 2020; Vaquero-Sedas and Vega-Palas 2023). The observation that not only TRB1-3 (Zhou et al. 2018), but also TRB4-5, physically interact with homologs of E(z) subunit of PRC2 complex named CLF and SWN suggests that all TRBs can target PRC2 to Polycomb response elements (PREs) including telo-boxes (Deng et al. 2013; Zhou et al. 2016; Godwin and Farrona 2022). Moreover, the novel interaction between TRBs and Su(z)12 A. thaliana homologues EMF2 and VRN2 described here, tightly interconnects all TRBs with the core PRC2 components. These observations support the recently published observation that O. sativa single Myb transcription factor TRBF2 forms phase-separated droplets, which aggregate with PRC2 via rice OsCLF and OsEMF2 (Xuan et al. 2022).
In Arabidopsis, the PWO1 protein interacts with all three E(z) homologs, including CLF and SWN through its conserved N-terminal PWWP domain (Mikulski et al. 2019). Furthermore, PWO1-3 are associated with members of the PEAT complex, which was recently identified as being able to silence transposable elements (Tan et al. 2018). Our data suggest that the interaction between TRBs and PWOs is not restricted to only TRB1 and 2 (Tan et al. 2018), instead other TRB members, including TRB4-5, interact with N-terminal part of PWO1 and PWO3 from A. thaliana, containing PWWP domains. Additionally, the N-terminal part of PWO2 is recognized by Arabidopsis TRB4. Interestingly, in tobacco PWO1 tethers CLF to nuclear speckles (Hohenstatt et al. 2018; Mikulski et al. 2019) which have a similar distribution to speckles of TRB proteins.
Overall, we conclude that TRB proteins, including the newly characterized TRB4-5, are associated with several complexes, including telomerase, PRC2 or PEAT complexes. However, TRBs are unlikely to be permanently associated with all of these complexes (Tan et al. 2018; Schubert 2019), and we might speculate that TRB subunits are partially interchangeable within these complexes.
It should be noted that the number of identified interaction partners of TRBs in Arabidopsis may increase in the future, as these promiscuous proteins may play a role in various additional biochemical processes that are not yet elucidated. For example, the A. thaliana TRB1 gene is responsive to several types of hormones, including jasmonate (JA) (Yanhui et al. 2006); TRB homologue from apple dynamically modulates JA-mediated accumulation of anthocyanin and proanthocyanidin (An et al. 2021); soybean TRB homologue was identified as candidate gene regulating total soluble sugar in soybean seeds (Xu et al. 2022).
Conclusion
Proteins from the TRB family are plant-specific and apparently first evolved in lower plants. We speculate, that due to WGDs one ancestral TRB was multiplied to the current five TRB members in A. thaliana increasing the potential of diversification of their particular functions. Further research is needed to confirm whether newly described TRB4-5 proteins specifically target to telomeric sequences located terminally (Schrumpfová et al. 2004, 2014; Mozgová et al. 2008; Dvořáčková et al. 2010; Dreissig et al. 2017) or interstitially (Schrumpfová et al. 2016) via their Myb-like domain like other members of this family TRB1-3. Additionally, the versatile interactions of all TRBs members with other proteins contribute to the multiple functions that they adopt in the cell nucleus, including participation in telomerase biogenesis (Schrumpfová et al. 2014; Schořová et al. 2019), recruitment of PRC2 or PEAT complexes (Zhou et al. 2016, 2018; Tan et al. 2018; Tsuzuki and Wierzbicki 2018; Mikulski et al. 2019) or competing with H1 for binding to interstitially localized telomeric sequences (Teano et al. 2020). The cytoplasmic localization of TRB5 and its implications deserve further investigation. As additional functions and interaction partners of TRBs are discovered, it can be expected that research in plants will lead to a better understanding of the mode of action of the different TRBs and also to the elucidation of novel functions.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
This work was supported by the Czech Science Foundation projects 21-15841S (P.P.S., M.K., D.H.) and 20-01331X (A.K), and Czech Ministry of Education, Youth and Sports grant number LTAUSA18115 (L.S.). The Plant Sciences Core Facility of CEITEC Masaryk University is acknowledged for its technical support. We acknowledge the Imaging Facility of the Institute of Experimental Botany AS CR supported by the MEYS CR (LM2018129 Czech-BioImaging) and IEB AS CR.
We acknowledge Sara Farrona (Plant and AgriBiosciences Centre, National University Ireland, IR) for providing us with SWN, CLF and EMF2 constructs. Moreover, we acknowledge Eva Sykorova (Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic) for providing us constructs with POT1a-b and TERT fragments and Ali Pendle (John Innes Centre, Norwich, United Kingdom) for the sub-nuclear localization markers. We also acknowledge Yvona Stražická and Michal Franek for the help with the immunofluorescence on Arabidopsis nuclei.
We acknowledge Leon Jenner and Iva Mozgová for proofreading and manuscript editing.
The access to the computing and storage facilities owned by parties and projects contributing to the National Grid Infrastructure MetaCentrum provided under the programme Projects of Large Infrastructure for Research, Development, and Innovations (LM2010005) was highly appreciated, as was the access to the CERIT-SC computing and storage facilities provided under the programme Center CERIT Scientific Cloud, part of the Operational Program Research and Development for Innovations, reg. no. CZ.1.05/3.2.00/08.0144.
Author contribution
Petra Procházková Schrumpfová and David Honys designed the study, supervised the project. Petra Procházková Schrumpfová wrote the manuscript with support from David Honys, Jan Paleček, Daniel Schubert. Lenka Steinbachová, Zuzana Gadiou, Alžbeta Kusová, Tereza Přerovská, Claire Jourdain and Tino Stricker performed cloning, Lenka Steinbachová performed localization and BiFC, Alžbeta Kusová and Ahmed Khan performed Y2H, Tereza Přerovská performed EMSA, Gabriela Rigóová performed Co-IP, Jan Paleček analyzed 3D structures, Lenka Záveská Drábková performed evolution analysis. Petra Procházková Schrumpfová conducted experiments and analyzed data.
Funding
Open access publishing supported by the National Technical Library in Prague.
Data availability
All data generated or analysed during this study are included in this published article.
Declaration
Competing interest
The authors report no competing interests.
Footnotes
Accession numbers
TRB1 (AT1G49950); TRB2, formerly TBP3 (AT5G67580); TRB3, formerly TBP2 (AT3G49850); TRB4 AT1G17520; TRB5 AT1G72740; RUVBL1 (AT5G22330); RUVBL2A (AT5G67630); TERT (AT5G16850); POT1a (AT2G05210); POT1b (AT5G06310); Fibrillarin1 (AT5G52470); SRp34 (AT1G02840); EMF2 (AT5G51230); VRN2 (AT4G16845); CLF (AT2G23380); SWN (AT4G02020); PWO1 also named as PWWP1 (AT3G03140); PWO2 also named as PWWP2 (AT1G51745); PWO3 also named as PWWP3 (AT3G21295).
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Alžbeta Kusová and Lenka Steinbachová shared co-first authorship.
References
- Adamusová K, Khosravi S, Fujimoto S, Houben A, Matsunaga S, Fajkus J, Fojtová M. Two combinatorial patterns of telomere histone marks in plants with canonical and non-canonical telomere repeats. Plant J. 2020;102:678–687. doi: 10.1111/tpj.14653. [DOI] [PubMed] [Google Scholar]
- An J-P, Xu R-R, Liu X, Zhang J-C, Wang X-F, You C-X, Hao Y-J. Jasmonate induces biosynthesis of anthocyanin and proanthocyanidin in apple by mediating the JAZ1–TRB1–MYB9 complex. Plant J. 2021;106:1414–1430. doi: 10.1111/tpj.15245. [DOI] [PubMed] [Google Scholar]
- Apostolovic B, Danial M, Klok H-A. Coiled coils: attractive protein folding motifs for the fabrication of self-assembled, responsive and bioactive materials. Chem Soc Rev. 2010;39:3541–3575. doi: 10.1039/B914339B. [DOI] [PubMed] [Google Scholar]
- Ashkenazy H, Abadi S, Martz E, Chay O, Mayrose I, Pupko T, Ben-Tal N. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44:W344–W350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37:W202–208. doi: 10.1093/nar/gkp335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bednar J, Garcia-Saez I, Boopathi R, Cutter AR, Papai G, Reymer A, Syed SH, Lone IN, Tonchev O, Crucifix C, Menoni H, Papin C, Skoufias DA, Kurumizaka H, Lavery R, Hamiche A, Hayes JJ, Schultz P, Angelov D, Petosa C, Dimitrov S. Structure and dynamics of a 197 bp nucleosome in complex with linker histone H1. Mol Cell. 2017;66:384–397.e8. doi: 10.1016/j.molcel.2017.04.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beilstein MA, Renfrew KB, Song X, Shakirov EV, Zanis MJ, Shippen DE. Evolution of the telomere-associated protein POT1a in Arabidopsis thaliana is characterized by positive selection to reinforce protein-protein interaction. Mol Biol Evol. 2015;32:1329–1341. doi: 10.1093/molbev/msv025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bianchi A, Smith S, Chong L, Elias P, de Lange T. TRF1 is a dimer and bends telomeric DNA. EMBO J. 1997;16:1785–1794. doi: 10.1093/emboj/16.7.1785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bianchi A, Stansel RM, Fairall L, Griffith JD, Rhodes D, de Lange T. TRF1 binds a bipartite telomeric site with extreme spatial flexibility. EMBO J. 1999;18:5735–5744. doi: 10.1093/emboj/18.20.5735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bilaud T, Koering CE, Binet-Brasselet E, Ancelin K, Pollice A, Gasser SM, Gilson E. The telobox, a Myb-Related telomeric DNA binding motif found in proteins from yeast, plants and human. Nucleic Acids Res. 1996;24:1294–1303. doi: 10.1093/nar/24.7.1294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowers JE, Chapman BA, Rong J, Paterson AH. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003;422:433–438. doi: 10.1038/nature01521. [DOI] [PubMed] [Google Scholar]
- Broccoli D, Smogorzewska A, Chong L, de Lange T. Human telomeres contain two distinct Myb–related proteins, TRF1 and TRF2. Nat Genet. 1997;17:231–235. doi: 10.1038/ng1097-231. [DOI] [PubMed] [Google Scholar]
- Busso D, Delagoutte-Busso B, Moras D. Construction of a set Gateway-based destination vectors for high-throughput cloning and expression screening in Escherichia coli. Anal Biochem. 2005;343:313–321. doi: 10.1016/j.ab.2005.05.015. [DOI] [PubMed] [Google Scholar]
- Chanvivattana Y, Bishopp A, Schubert D, Stock C, Moon Y-H, Sung ZR, Goodrich J. Interaction of Polycomb-group proteins controlling flowering in Arabidopsis. Development. 2004;131:5263–5276. doi: 10.1242/dev.01400. [DOI] [PubMed] [Google Scholar]
- Cheng C-Y, Krishnakumar V, Chan AP, Thibaud-Nissen F, Schobel S, Town CD. Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. Plant. 2017;J89:789–804. doi: 10.1111/tpj.13415. [DOI] [PubMed] [Google Scholar]
- Cheng S, Xian W, Fu Y, Marin B, Keller J, Wu T, Sun W, Li X, Xu Y, Zhang Y, Wittek S, Reder T, Günther G, Gontcharov A, Wang S, Li L, Liu X, Wang J, Yang H, Xu X, Delaux P-M, Melkonian B, Wong GK-S, Melkonian M. Genomes of subaerial zygnematophyceae provide insights into land plant evolution. Cell. 2019;179:1057–1067.e14. doi: 10.1016/j.cell.2019.10.019. [DOI] [PubMed] [Google Scholar]
- Chong L, van Steensel B, Broccoli D, Erdjument-Bromage H, Hanish J, Tempst P, de Lange T. A Human Telomeric Protein Science. 1995;270:1663–1667. doi: 10.1126/science.270.5242.1663. [DOI] [PubMed] [Google Scholar]
- Clark JW, Donoghue PCJ. Whole-genome duplication and plant macroevolution. Trends Plant Sci. 2018;23:933–945. doi: 10.1016/j.tplants.2018.07.006. [DOI] [PubMed] [Google Scholar]
- Court R, Chapman L, Fairall L, Rhodes D. How the human telomeric proteins TRF1 and TRF2 recognize telomeric DNA: A view from high-resolution crystal structures. EMBO Rep. 2005;6:39–45. doi: 10.1038/sj.embor.7400314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dassanayake M, Oh D-H, Haas JS, Hernandez A, Hong H, Ali S, Yun D-J, Bressan RA, Zhu J-K, Bohnert HJ, Cheeseman JM. The genome of the extremophile crucifer Thellungiella parvula. Nat Genet. 2011;43:913–918. doi: 10.1038/ng.889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deng W, Buzas DM, Ying H, Robertson M, Taylor J, Peacock WJ, Dennis ES, Helliwell C. Arabidopsis Polycomb Repressive Complex 2 binding sites contain putative GAGA factor binding motifs within coding regions of genes. BMC Genom. 2013;14:593. doi: 10.1186/1471-2164-14-593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dreissig S, Schiml S, Schindele P, Weiss O, Rutten T, Schubert V, Gladilin E, Mette MF, Puchta H, Houben A. Live-cell CRISPR imaging in plants reveals dynamic telomere movements. Plant J. 2017;91:565–573. doi: 10.1111/tpj.13601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dvořáčková M (2010) Analysis of Arabidopsis telomere-associated proteins in vivo. Dissertation, Masaryk University, Faculty of Science
- Dvořáčková M, Rossignol P, Shaw PJ, Koroleva OA, Doonan JH, Fajkus J. AtTRB1, a telomeric DNA-binding protein from Arabidopsis, is concentrated in the nucleolus and shows highly dynamic association with chromatin. Plant J. 2010;61:637–649. doi: 10.1111/j.1365-313X.2009.04094.x. [DOI] [PubMed] [Google Scholar]
- Eirin-Lopez JM, Rebordinos L, Rooney AP and & Rozas J. (2012) The birth-and-death evolution of multigene families revisited. Genome Dyn 7:170–196. 10.1159/000337119. [DOI] [PubMed]
- Fan L, Roberts VA. Complex of linker histone H5 with the nucleosome and its implications for chromatin packing. Proc Natl Acad Sci. 2006;103:8384–8389. doi: 10.1073/pnas.0508951103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fulcher N, Riha K. Using Centromere Mediated Genome Elimination to Elucidate the Functional Redundancy of Candidate Telomere Binding Proteins in Arabidopsis thaliana. Front Genet. 2016;6:349. doi: 10.3389/fgene.2015.00349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fyodorov DV, Zhou B-R, Skoultchi AI, Bai Y. Emerging roles of linker histones in regulating chromatin structure and function. Nat Rev Mol Cell Biol. 2018;19:192–206. doi: 10.1038/nrm.2017.94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gehl C, Waadt R, Kudla J, Mendel R-R, Hänsch R. New GATEWAY vectors for high throughput analyses of protein-protein interactions by bimolecular fluorescence complementation. Mol Plant. 2009;2:1051–1058. doi: 10.1093/mp/ssp040. [DOI] [PubMed] [Google Scholar]
- Godwin J, Farrona S. The importance of networking: Plant polycomb repressive complex 2 and its interactors. Epigenomes. 2022;6:8. doi: 10.3390/epigenomes6010008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grefen C, Blatt MR. A 2in1 cloning system enables ratiometric bimolecular fluorescence complementation (rBiFC) Biotechniques. 2012;53:311–314. doi: 10.2144/000113941. [DOI] [PubMed] [Google Scholar]
- Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999;41:95–98. [Google Scholar]
- Hanaoka S, Nagadoi A, Nishimura Y. Comparison between TRF2 and TRF1 of their telomeric DNA-bound structures and DNA-binding activities. Protein Sci. 2005;14:119–130. doi: 10.1110/ps.04983705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hergeth SP, Schneider R (2015) The H1 linker histones: Multifunctional proteins beyond the nucleosomal core particle. EMBO Rep 16:1439–1453. 10.15252/embr.201540749 [DOI] [PMC free article] [PubMed]
- Hofr C, Sultesová P, Zimmermann M, Mozgová I, Procházková Schrumpfová P, Wimmerová M, Fajkus J. Single-Myb-histone proteins from Arabidopsis thaliana: A quantitative study of telomere-binding specificity and kinetics. Biochem J. 2009;419:221–230. doi: 10.1042/BJ20082195. [DOI] [PubMed] [Google Scholar]
- Hohenstatt ML, Mikulski P, Komarynets O, Klose C, Kycia I, Jeltsch A, Farrona S, Schubert D. PWWP-DOMAIN INTERACTOR OF POLYCOMBS1 interacts with polycomb-group proteins and histones and regulates Arabidopsis flowering and development. Plant Cell. 2018;30:117–133. doi: 10.1105/tpc.17.00117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Horák J, Grefen C, Berendzen KW, Hahn A, Stierhof Y-D, Stadelhofer B, Stahl M, Koncz C, Harter K. The Arabidopsis thaliana response regulator ARR22 is a putative AHP phospho-histidine phosphatase expressed in the chalaza of developing seeds. BMC Plant Biol. 2008;8:77. doi: 10.1186/1471-2229-8-77. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hwang MG, Cho MH. Arabidopsis thaliana telomeric DNA-binding protein 1 is required for telomere length homeostasis and its Myb-extension domain stabilizes plant telomeric DNA binding. Nucleic Acids Res. 2007;35:1333–1342. doi: 10.1093/nar/gkm043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Zidek A, Potapenko A, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kagale S, Robinson SJ, Nixon J, Xiao R, Huebert T, Condie J, Kessler D, Clarke WE, Edger PP, Links MG, Sharpe AG, Parkin IAP. Polyploid Evolution of the Brassicaceae during the Cenozoic Era. Plant Cell. 2014;26:2777–2791. doi: 10.1105/tpc.114.126391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karamysheva ZN, Surovtseva YV, Vespa L, Shakirov EV, Shippen DE. A C-terminal Myb extension domain defines a novel family of double-strand telomeric DNA-binding proteins in Arabidopsis. J Biol Chem. 2004;279:47799–47807. doi: 10.1074/jbc.M407938200. [DOI] [PubMed] [Google Scholar]
- Ko S, Jun S-H, Bae H, Byun J-S, Han W, Park H, Yang SW, Park S-Y, Jeon YH, Cheong C, Kim WT, Lee W, Cho H-S. Structure of the DNA-binding domain of NgTRF1 reveals unique features of plant telomere-binding proteins. Nucleic Acids Res. 2008;36:2739–2755. doi: 10.1093/nar/gkn030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- König P, Fairall L, Rhodes D. Sequence-specific DNA recognition by the Myb-like domain of the human telomere binding protein TRF1: A model for the protein-DNA complex. Nucleic Acids Res. 1998;26:1731–1740. doi: 10.1093/nar/26.7.1731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koroleva OA, Calder G, Pendle AF, Kim SH, Lewandowska D, Simpson CG, Jones IM, Brown JWS, Shaw PJ. Dynamic behavior of Arabidopsis eIF4A-III, putative core protein of exon junction complex: Fast relocation to nucleolus and splicing speckles under hypoxia. Plant Cell. 2009;21:1592–1606. doi: 10.1105/tpc.108.060434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kress WJ, Soltis DE, Kersey PJ, Wegrzyn JL, Leebens-Mack JH, Gostel MR, Liu X, Soltis PS (2022) Green plant genomes: What we know in an era of rapidly expanding opportunities. Proc Natl Acad Sci 119:e2115640118. 10.1073/pnas.2115640118 [DOI] [PMC free article] [PubMed]
- Kuchař M, Fajkus J. Interactions of putative telomere-binding proteins in Arabidopsis thaliana: identification of functional TRF2 homolog in plants. FEBS Lett. 2004;578:311–315. doi: 10.1016/j.febslet.2004.11.021. [DOI] [PubMed] [Google Scholar]
- Kutashev KO, Franek M, Diamanti K, Komorowski J, Olšinová M, Dvořáčková M. Nucleolar rDNA folds into condensed foci with a specific combination of epigenetic marks. Plant J. 2021;105:1534–1548. doi: 10.1111/tpj.15130. [DOI] [PubMed] [Google Scholar]
- Lamesch P, Berardini TZ, Li D, Swarbreck D, Wilks C, Sasidharan R, Muller R, Dreher K, Alexander DL, Garcia-Hernandez M, Karthikeyan AS, Lee CH, Nelson WD, Ploetz L, Singh S, Wensel A, Huala E. The Arabidopsis Information Resource (TAIR): Improved gene annotation and new tools. Nucleic Acids Res. 2012;40:D1202–D1210. doi: 10.1093/nar/gkr1090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lang D, Ullrich KK, Murat F, Fuchs J, Jenkins J, Haas FB, Piednoel M, Gundlach H, Van Bel M, Meyberg R, Vives C, Morata J, Symeonidi A, Hiss M, Muchero W, Kamisugi Y, Saleh O, Blanc G, Decker EL, van Gessel N, Grimwood J, Hayes RD, Graham SW, Gunter LE, McDaniel SF, Hoernstein SNW, Larsson A, Li F-W, Perroud P-F, Phillips J, Ranjan P, Rokshar DS, Rothfels CJ, Schneider L, Shu S, Stevenson DW, Thümmler F, Tillich M, Villarreal Aguilar JC, Widiez T, Wong GK-S, Wymore A, Zhang Y, Zimmer AD, Quatrano RS, Mayer KFX, Goodstein D, Casacuberta JM, Vandepoele K, Reski R, Cuming AC, Tuskan GA, Maumus F, Salse J, Schmutz J, Rensing SA. The Physcomitrella patens chromosome-scale assembly reveals moss genome structure and evolution. Plant J. 2018;93:515–533. doi: 10.1111/tpj.13801. [DOI] [PubMed] [Google Scholar]
- Lee WK, Cho MH. Telomere-binding protein regulates the chromosome ends through the interaction with histone deacetylases in Arabidopsis thaliana. Nucleic Acids Res. 2016;44:4610–4624. doi: 10.1093/nar/gkw067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lermontova I, Schubert V, Börnke F, Macas J, Schubert I. Arabidopsis CBF5 interacts with the H/ACA snoRNP assembly factor NAF1. Plant Mol Biol. 2007;65:615–626. doi: 10.1007/s11103-007-9226-z. [DOI] [PubMed] [Google Scholar]
- Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44:W242–W245. doi: 10.1093/nar/gkw290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindner M, Simonini S, Kooiker M, Gagliardini V, Somssich M, Hohenstatt M, Simon R, Grossniklaus U, Kater MM. TAF13 interacts with PRC2 members and is essential for Arabidopsis seed development. Dev Biol. 2013;379:28–37. doi: 10.1016/j.ydbio.2013.03.005. [DOI] [PubMed] [Google Scholar]
- Lorković ZJ, Hilscher J, Barta A. Use of fluorescent protein tags to study nuclear organization of the spliceosomal machinery in transiently transformed living plant cells. MBoC. 2004;15:3233–3243. doi: 10.1091/mbc.e04-01-0055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lupas AN, Gruber M. The structure of α-helical coiled coils. Adv Protein Chem. 2005;70:37–38. doi: 10.1016/S0065-3233(05)70003-6. [DOI] [PubMed] [Google Scholar]
- Maillet G, White CI, Gallego ME. Telomere-length regulation in inter-ecotype crosses of Arabidopsis. Plant Mol Biol. 2006;62:859–866. doi: 10.1007/s11103-006-9061-7. [DOI] [PubMed] [Google Scholar]
- Majerská J, Schrumpfová PP, Dokládal L, Schořová Š, Stejskal K, Obořil M, Honys D, Kozáková L, Polanská PS, Sýkorová E. Tandem affinity purification of AtTERT reveals putative interaction partners of plant telomerase in vivo. Protoplasma. 2017;254:1547–1562. doi: 10.1007/s00709-016-1042-3. [DOI] [PubMed] [Google Scholar]
- Marian CO, Bordoli SJ, Goltz M, Santarella RA, Jackson LP, Danilevskaya O, Beckstette M, Meeley R, Bass HW. The maize single myb histone 1 Gene, Smh1, belongs to a novel gene family and encodes a protein that binds telomere DNA repeats in vitro. Plant Physiol. 2003;133:1336–1350. doi: 10.1104/pp.103.026856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKeown P, Pendle AF, Shaw PJ (2008) Preparation of Arabidopsis nuclei and nucleoli. In: Hancock R (ed) The nucleus. Methods in molecular biology, vol 463. Humana Press, Totowa, NJ, pp 67–75. 10.1007/978-1-59745-406-3_5 [DOI] [PubMed]
- Mikulski P, Hohenstatt ML, Farrona S, Smaczniak C, Stahl Y, Kalyanikrishna null, Kaufmann K, Angenent G, Schubert D, The chromatin-associated protein PWO1 interacts with plant nuclear lamin-like components to regulate nuclear size. Plant Cell. 2019;31:1141–1154. doi: 10.1105/tpc.18.00663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mozgova I, Hennig L. The polycomb group protein regulatory network. Annu Rev Plant Biol. 2015;66:269–296. doi: 10.1146/annurev-arplant-043014-115627. [DOI] [PubMed] [Google Scholar]
- Mozgová I, Schrumpfová PP, Hofr C, Fajkus J. Functional characterization of domains in AtTRB1, a putative telomere-binding protein in Arabidopsis thaliana. Phytochemistry. 2008;69:1814–1819. doi: 10.1016/j.phytochem.2008.04.001. [DOI] [PubMed] [Google Scholar]
- Nakagawa T, Kurose T, Hino T, Tanaka K, Kawamukai M, Niwa Y, Toyooka K, Matsuoka K, Jinbo T, Kimura T. Development of series of gateway binary vectors, pGWBs, for realizing efficient construction of fusion genes for plant transformation. J Biosci Bioeng. 2007;104:34–41. doi: 10.1263/jbb.104.34. [DOI] [PubMed] [Google Scholar]
- Nei M, Rooney AP (2005) Concerted and birth-and-death evolution in multigene families. Annu Rev Genet 39:121–152. 10.1146/annurev.genet.39.073003.112240 [DOI] [PMC free article] [PubMed]
- Néron B, Ménager H, Maufrais C, Joly N, Maupetit J, Letort S, Carrere S, Tuffery P, Letondal C. Mobyle: a new full web bioinformatics framework. Bioinformatics. 2009;25:3005–3011. doi: 10.1093/bioinformatics/btp493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ogata K, Hojo H, Aimoto S, Nakai T, Nakamura H, Sarai A, Ishii S, Nishimura Y. Solution structure of a DNA-binding unit of Myb: a helix-turn-helix-related motif with conserved tryptophans forming a hydrophobic core. Proc Natl Acad Sci. 1992;89:6428–6432. doi: 10.1073/pnas.89.14.6428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Palecek JJ, Gruber S. Kite proteins: A superfamily of SMC/Kleisin partners conserved across bacteria, archaea, and eukaryotes. Structure. 2015;23:2183–2190. doi: 10.1016/j.str.2015.10.004. [DOI] [PubMed] [Google Scholar]
- Pinho C, Hey J. Divergence with gene flow: Models and data. Annu Rev Ecol Evol Syst. 2010;41(1):215–230. doi: 10.1146/annurev-ecolsys-102209-144644. [DOI] [Google Scholar]
- Qiao X, Li Q, Yin H, Qi K, Li L, Wang R, Zhang S, Paterson AH. Gene duplication and evolution in recurring polyploidization–diploidization cycles in plants. Genome Biol. 2019;20:38. doi: 10.1186/s13059-019-1650-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramakrishnan V, Finch JT, Graziano V, Lee PL, Sweet RM. Crystal structure of globular domain of histone H5 and its implications for nucleosome binding. Nature. 1993;362:219–223. doi: 10.1038/362219a0. [DOI] [PubMed] [Google Scholar]
- Rockinger A, Sousa A, Carvalho FA, Renner SS. Chromosome number reduction in the sister clade of Carica papaya with concomitant genome size doubling. Am J Bot. 2016;103:1082–1088. doi: 10.3732/ajb.1600134. [DOI] [PubMed] [Google Scholar]
- Sambrook J, Fritsch EF, Maniatis T. Molecular Cloning: A Laboratory Manual. 2. NY, Cold Spring Laboratory Press: Cold Spring Harbor; 1989. [Google Scholar]
- Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez J-Y, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schořová Š, Fajkus J, Záveská Drábková L, Honys D, Schrumpfová PP. The plant Pontin and Reptin homologues, RuvBL1 and RuvBL2a, colocalize with TERT and TRB proteins in vivo, and participate in telomerase biogenesis. Plant J. 2019;98:195–212. doi: 10.1111/tpj.14306. [DOI] [PubMed] [Google Scholar]
- Schrumpfová P, Kuchar M, Miková G, Skrísovská L, Kubicárová T, Fajkus J. Characterization of two Arabidopsis thaliana myb-like proteins showing affinity to telomeric DNA sequence. Genome. 2004;47:316–324. doi: 10.1139/g03-136. [DOI] [PubMed] [Google Scholar]
- Schrumpfová P, Vychodilová I, Dvořáčková M, Majerská J, Dokládal L, Schořová Š, Fajkus J. Telomere repeat binding proteins are functional components of Arabidopsis telomeres and interact with telomerase. Plant J. 2014;77:770–781. doi: 10.1111/tpj.12428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schrumpfová PP, Kuchař M, Paleček J, Fajkus J. Mapping of interaction domains of putative telomere-binding proteins AtTRB1 and AtPOT1b from Arabidopsis thaliana. FEBS Lett. 2008;582:1400–1406. doi: 10.1016/j.febslet.2008.03.034. [DOI] [PubMed] [Google Scholar]
- Schrumpfová PP, Vychodilová I, Hapala J, Schořová Š, Dvořáček V, Fajkus J. Telomere binding protein TRB1 is associated with promoters of translation machinery genes in vivo. Plant Mol Biol. 2016;90:189–206. doi: 10.1007/s11103-015-0409-8. [DOI] [PubMed] [Google Scholar]
- Schubert D (2019) Evolution of Polycomb-group function in the green lineage. F1000Res Faculty Rev-268. 10.12688/f1000research.16986.1 [DOI] [PMC free article] [PubMed]
- Shakirov EV, Shippen DE. Length regulation and dynamics of individual telomere tracts in wild-type Arabidopsis. Plant Cell. 2004;16:1959–1967. doi: 10.1105/tpc.104.023093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shan W, Kubová M, Mandáková T, Lysak MA. Nuclear organization in crucifer genomes: Nucleolus-associated telomere clustering is not a universal interphase configuration in Brassicaceae. Plant J. 2021;108:528–540. doi: 10.1111/tpj.15459. [DOI] [PubMed] [Google Scholar]
- Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smogorzewska A, de Lange T. Regulation of Telomerase by Telomeric Proteins. Annu Rev Biochem. 2004;73:177–208. doi: 10.1146/annurev.biochem.73.071403.160049. [DOI] [PubMed] [Google Scholar]
- Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Surovtseva YV, Shakirov EV, Vespa L, Osbun N, Song X, Shippen DE. Arabidopsis POT1 associates with the telomerase RNP and is required for telomere maintenance. EMBO J. 2007;26:3653–3661. doi: 10.1038/sj.emboj.7601792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sweetlove L, Gutierrez C. The journey to the end of the chromosome: delivering active telomerase to telomeres in plants. Plant J. 2019;98:193–194. doi: 10.1111/tpj.14328. [DOI] [PubMed] [Google Scholar]
- Tan L-M, Zhang C-J, Hou X-M, Shao C-R, Lu Y-J, Zhou J-X, Li Y-Q, Li L, Chen S, He X-J (2018) The PEAT protein complexes are required for histone deacetylation and heterochromatin silencing. EMBO J 37:e98770. 10.15252/embj.201798770 [DOI] [PMC free article] [PubMed]
- Tani A, Murata M. Alternative splicing of Pot1 (Protection of telomere)-like genes in Arabidopsis thaliana. Genes Genet Syst. 2005;80:41–48. doi: 10.1266/ggs.80.41. [DOI] [PubMed] [Google Scholar]
- Teano G, Concia L, Carron L, Wolff L, Adamusová K, Fojtová M, Bourge M, Kramdi A, Colot V, Grossniklaus U, Bowler C, Baroux C, Carbone A, Probst AV, Schrumpfová PP, Fajkus J, Amiard S, Grob S, Bourbousse C, Barneche F (2020) Histone H1 protects telomeric repeats from H3K27me3 invasion in Arabidopsis. bioRxiv. 10.1101/2020.11.28.402172
- Tsuzuki M, Wierzbicki AT (2018) Buried in PEAT—discovery of a new silencing complex with opposing activities. EMBO J 37:e100573. 10.15252/embj.2018100573 [DOI] [PMC free article] [PubMed]
- van Steensel B, Smogorzewska A, de Lange T. TRF2 Protects Human Telomeres from End-to-End Fusions. Cell. 1998;92:401–413. doi: 10.1016/S0092-8674(00)80932-0. [DOI] [PubMed] [Google Scholar]
- Vaquero-Sedas and Vega-Palas Epigenetic nature of Arabidopsis thaliana telomeres. Plant Physiol. 2023;191:47–55. doi: 10.1093/plphys/kiac471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, Yuan D, Stroe O, Wood G, Laydon A, et al. AlphaFold Protein Structure Database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022;50:D439–D444. doi: 10.1093/nar/gkab1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Venteicher AS, Meng Z, Mason PJ, Veenstra TD, Artandi SE. Identification of ATPases pontin and reptin as telomerase components essential for holoenzyme assembly. Cell. 2008;132:945–957. doi: 10.1016/j.cell.2008.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walden N, German DA, Wolf EM, Kiefer M, Rigault P, Huang X-C, Kiefer C, Schmickl R, Franzke A, Neuffer B, Mummenhoff K, Koch MA. Nested whole-genome duplications coincide with diversification and high morphological disparity in Brassicaceae. Nat Commun. 2020;11:3795. doi: 10.1038/s41467-020-17605-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang X, Wang H, Wang J, Sun R, Wu J, Liu S, Bai Y, Mun J-H, Bancroft I, Cheng F, Huang S, Li X, Hua W, Wang J, Wang X, Freeling M, Pires JC, Paterson AH, Chalhoub B, Wang B, Hayward A, Sharpe AG, Park B-S, Weisshaar B, Liu B, Li B, Liu B, Tong C, Song C, Duran C, Peng C, Geng C, Koh C, Lin C, Edwards D, Mu D, Shen D, Soumpourou E, Li F, Fraser F, Conant G, Lassalle G, King GJ, Bonnema G, Tang H, Wang H, Belcram H, Zhou H, Hirakawa H, Abe H, Guo H, Wang H, Jin H, Parkin IAP, Batley J, Kim J-S, Just J, Li J, Xu J, Deng J, Kim JA, Li J, Yu J, Meng J, Wang J, Min J, Poulain J, Wang J, Hatakeyama K, Wu K, Wang L, Fang L, Trick M, Links MG, Zhao M, Jin M, Ramchiary N, Drou N, Berkman PJ, Cai Q, Huang Q, Li R, Tabata S, Cheng S, Zhang S, Zhang S, Huang S, Sato S, Sun S, Kwon S-J, Choi S-R, Lee T-H, Fan W, Zhao X, Tan X, Xu X, Wang Y, Qiu Y, Yin Y, Li Y, Du Y, Liao Y, Lim Y, Narusaka Y, Wang Y, Wang Z, Li Z, Wang Z, Xiong Z, Zhang Z. The genome of the mesopolyploid crop species Brassica rapa. Nat Genet. 2011;43:1035–1039. doi: 10.1038/ng.919. [DOI] [PubMed] [Google Scholar]
- Warren CM, Krzesinski PR, Greaser ML. Vertical agarose gel electrophoresis and electroblotting of high-molecular-weight proteins. Electrophoresis. 2003;24:1695–1702. doi: 10.1002/elps.200305392. [DOI] [PubMed] [Google Scholar]
- Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46:W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiese AJ, Steinbachová L, Timofejeva L, Čermák V, Klodová B, Ganji RS, Limones-Mendez M, Bokvaj P, Hafidh S, Potěšil D, Honys D. Arabidopsis bZIP18 and bZIP52 accumulate in nuclei following heat stress where they regulate the expression of a similar set of genes. Int J Mol Sci. 2021;22:530. doi: 10.3390/ijms22020530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu W, Liu H, Li S, Zhang W, Wang Q, Zhang H, Liu X, Cui X, Chen X, Tang W, Li Y, Zhu Y, Chen H. GWAS and identification of candidate genes associated with seed soluble sugar content in vegetable soybean. Agronomy. 2022;12:1470. doi: 10.3390/agronomy12061470. [DOI] [Google Scholar]
- Xuan H, Liu Y, Zhao J, Shi N, Li Y, Zhou Y, Pi L, Li S, Xu G, Yang H (2022) Phase-separated TRB-PRC2 aggregates contribute to Polycomb silencing in plants. bioRxiv. 10.1101/2022.03.27.485997
- Yanhui C, Xiaoyuan Y, Kun H, Meihua L, Jigang L, Zhaofeng G, Zhiqiang L, Yunfei Z, Xiaoxiao W, Xiaoming Q, Yunping S, Li Z, Xiaohui D, Jingchu L, Xing-Wang D, Zhangliang C, Hongya G, Li-Jia Q. The MYB transcription factor superfamily of Arabidopsis: Expression analysis and phylogenetic comparison with the rice MYB family. Plant Mol Biol. 2006;60:107–124. doi: 10.1007/s11103-005-2910-y. [DOI] [PubMed] [Google Scholar]
- Zhou B-R, Jiang J, Feng H, Ghirlando R, Xiao TS, Bai Y. Structural mechanisms of nucleosome recognition by linker histones. Mol Cell. 2015;59:628–638. doi: 10.1016/j.molcel.2015.06.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Y, Hartwig B, James GV, Schneeberger K, Turck F. Complementary Activities of TELOMERE REPEAT BINDING Proteins and Polycomb Group Complexes in Transcriptional Regulation of Target Genes. Plant Cell. 2016;28:87–101. doi: 10.1105/tpc.15.00787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Y, Wang Y, Krause K, Yang T, Dongus JA, Zhang Y, Turck F. Telobox motifs recruit CLF/SWN–PRC2 for H3K27me3 deposition via TRB factors in Arabidopsis. Nat Genet. 2018;50:638–644. doi: 10.1038/s41588-018-0109-9. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data generated or analysed during this study are included in this published article.