Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2019 Jun 11;9:8474. doi: 10.1038/s41598-019-44836-6

Benefits of using genomic insulators flanking transgenes to increase expression and avoid positional effects

Ana Pérez-González 1, Elena Caro 1,
PMCID: PMC6560062  PMID: 31186481

Abstract

For more than 20 years, plant biologists have tried to achieve complete control of transgene expression. Until the techniques to target transgenes to safe harbor sites in the genome become routine, flanking transgenes with genetic insulators, DNA sequences that create independent domains of gene expression, can help avoid positional effects and stabilize their expression. We have, for the first time, compared the effect of three insulator sequences previously described in the literature and one never tested before. Our results indicate that their use increases transgene expression, but only the last one reduces variability between lines and between individuals. We have analyzed the integration of insulator-flanked T-DNAs using whole genome re-sequencing (to our knowledge, also for the first time) and found data suggesting that chiMARs can shelter transgene insertions from neighboring repressive epigenetic states. Finally, we could also observe a loss of accuracy of the RB insertion in the lines harboring insulators, evidenced by a high frequency of truncation of T-DNAs and of insertion of vector backbone that, however, did not affect transgene expression. Our data supports that the effect of each genetic insulator is different and their use in transgenic constructs should depend on the needs of each specific experiment.

Subject terms: Molecular engineering in plants, Reporter genes

Introduction

Due to the random nature of transgene insertion in the majority of higher eukaryotes, transgenic DNA may integrate into regions of the genome that are transcriptionally repressed (heterochromatin), which can result in many cases in transgene silencing. Additionally, transgenes may be incorporated near endogenous regulatory elements, such as transcriptional enhancers or repressors, which can cause their miss-expression (reviewed by1).

Chromatin insulator sequences, or boundary elements, are DNA sequences with the capacity to define a chromatin domain because of two key activities, the first is the ability to interfere with enhancer-promoter communication when placed between the two (enhancer blocking activity) and the second one is the ability to protect a flanked transgene from position-dependent silencing (barrier activity)2.

These boundary elements have been characterized extensively in animals. In plants, possibly the best studied elements with potential applications are scaffold or matrix attachment regions (S/MARs), which have been suggested to trigger the formation of chromatin loops, and thus delimit the boundaries of discrete chromosomal domains3. Much of the research carried out concerning the use of transgene-flanking MARs as genetic insulators has shown that the use of these elements often results in an increase in the level of transgene expression and/or a reduction in plant-to-plant variability4.

One of the most studied MARs is the one localized upstream the chicken lysozyme gene (chiMARs)5. Its role as insulator was shown in studies with animal cell lines where its presence near a reporter gene produced an increase in transgene expression and a decrease in variability among different lines6. The use of the chiMARs in plant constructs has been somehow controversial, leading to reports with different conclusions. The chicken sequence was shown to be able to bind to the tobacco nuclear matrix and that when it flanked a T-DNA containing a GUS reporter gene, the variability of its expression decreased in full-grown primary transformants of tobacco7. The same group later found that a significant reduction in variation of gene expression was conferred upon the GUS gene driven by the double cauliflower mosaic virus 35S promoter, but not to the NPTII gene, driven by the nopaline synthase (pNOS) promoter, also in tobacco8. These results could, however, not be replicated in Arabidopsis thaliana first generation plants, where the chiMARs was found to have no influence on the level or variability of expression of transgenes driven by the 35S promoter9. In fact, later studies applying different transformation methods reported no boost effect on transgene expression of Arabidopsis wild type plants10, but an increase in silencing mutant backgrounds3.

In 199611, it was shown that stably transformed tobacco cell lines in which a GUS reporter gene was flanked by the tobacco MAR isolated from a genomic clone containing a root specific gene (Rb7)12 produced more than 140 times more GUS enzyme activity than control transformants without it. However, the use of Rb7 did not reduce variation between different transformants.

The effect of the Rb7 MAR increasing transgene expression on tobacco cell lines was also reported in 200313, that analyzed in depth the specificity of the results depending on the promoter used. They reported that highly active promoters exhibited significant increases in GUS activity in constructs flanked by Rb7 compared to controls, but its presence did not significantly increase GUS activity when driven by weak promoters. Importantly, most transgenes flanked by the insulator showed a large reduction in the number of low expressing GUS transformants, suggesting that MARs can reduce the frequency of gene silencing.

Following that line, the effects of Rb7 were tested in conjunction with regulated transcription using a doxycycline-inducible luciferase transgene within tobacco cell cultures14. The Rb7 lines showed higher reporter gene expression levels and avoided silencing apparition in the absence of active transcription from condensed chromatin spreading.

Another well characterized genetic insulator, defined initially by its ability to block interactions between enhancers and promoters when positioned between them, is the petunia transformation boost sequence (TBS)15. This sequence has been shown to function in Arabidopsis and tobacco plants, and a detailed analysis of the motifs it contains showed that several specific regions are required for maximum enhancer-blocking function16.

It was only a few years ago that another work showed that the TBS could similarly function in synthetic constructs sheltering transgenes promoters from the host plant genome regulatory elements. The TBS sequence was found to produce enhanced transgene expression in tobacco plants, but did not prevent gene silencing in transformants with multiple and rearranged gene copies17.

It has been almost 25 years since the description of the first insulators and new examples are still being discovered nowadays18, but their use is not common practice in plant genetic engineering. This is in part due to the trouble that cloning them through traditional methods entails, and because the reports on their effects are scattered over different systems, organisms and transformation methods that do not allow for a clear comparison between them.

Targeting transgenes to a specific integration site in the plant genome might reduce chromosomal position effects, but until there are routine efficient techniques for directed gene targeting in plants, another alternative method needs to be developed for that purpose.

With the advent of modular cloning techniques that allow rapid and straight forward generation of multigene constructs, the incorporation of genetic insulators to the flanks of T-DNAs is no longer a problem. Therefore, we decided to perform a systematic and parallel study comparing the activity and effectivity of incorporating different boundary elements flanking transgenes as a strategy in T-DNA design to maximize and stabilize transgene expression. We have, moreover, used whole genome re-sequencing for the molecular characterization of the insertion of insulator-flanked T-DNAs, finding interesting results that point to previously unknown functions of the barrier sequences.

Material and Methods

Modular cloning

Modular pieces AtS/MAR10 and Rb7 were amplified by PCR using Phusion High-Fidelity DNA Polymerase (NEB) from A. thaliana and N. tabacum genomic DNA using primers 359/348 and 269/270, respectively; chiMARs and TBS were amplified using KAPA2G Fast HotStart DNA Polymerase (Sigma) from chicken liver tissue and P. hybrida genomic DNA using primers 1724/1725/1726/1727 and 275/276/277/278, respectively (Figure S1, Table 1).

Table 1.

List of primers used.

1724 GCGCCGTCTCGCTCGGGAGGCTCAGAAAACGGCAGTTGG
1725 GCGCCGTCTCGACCGCTCTAGGAAATTTAAGG
1726 GCGCCGTCTCGCGGTGCTCAGTAAGGCGGGT
1727 GCGCCGTCTCGCTCGAGCGCACACCAGAGCCTACACCTG
275 GCGCCGTCTCGCTCGGGAGTTCCTAACACCTGGAGAACC
276 GCGCCGTCTCGGCGACCAAAGTGTGCAGGCT
277 GCGCCGTCTCGTCGCCCCTTGGCTGTGAAAA
278 GCGCCGTCTCGCTCGAGCGAAGTTGTAATGAGTTGCTGGC
359 GCGCCGTCTCGCTCGGGAGTGGCTATTGTTGTTATCATCA
348 GCGCCGTCTCGCTCAAGCGGGGTTTAGCCATTAACATCGT
269 GCGCCGTCTCGCTCGGGAGTCGATTAAAAATCCCAATTATATTTGG
270 GCGCCGTCTCGCTCGAGCGACTATTTTCAGAAGAAGTTCCCAA

Modular pieces were cloned into pFranki (chiMARs and TBS) or into GoldenBraid pUPD2 (Rb7 and AtS/MAR10) vectors, as described in (Sarrion-Perdigones et al., 2011)19. pFranki is a home-made vector adapted to clone pieces originally designed for GB2.0 so they can be compatible with GB3.0 and MoClo cloning systems. pFranki vector is composed by the cloning cassette of the GoldenBraid pUPD vector and the backbone of the pUPD2 vector. To generate transcriptional units, MoClo Level 1 destination vectors were used (pICH47732-L1P1, pICH47742-L1P2, pICH47751-L1P3, pICH47761-L1P4). Insulators modular pieces were cloned into L1P1 and L1P4 in all cases. Luciferase transcriptional unit was cloned into L1P2 vector using the following modular pieces: pICH85281 (pMAS), pICSL80001 (luciferase CDS), pICH41421 (tNOS) (Engler et al., 2014)20. Bialaphos resistance cassette (pICSL70005) was cloned into L1P3. Level 2 destination vector pAGM467321 was used for multigene assembly, and a rule of 2:1 molar ratio of inserts:acceptor was applied for adding Level 1 plasmids to the reaction. Level 1 and Level 2 digestion/ligation reactions were performed in a thermocycler as follows: 20 seconds at 37 °C, [3 minutes at 37 °C, 4 minutes at 16 °C] for 26 cycles, 5 minutes at 50 °C, 5 minutes at 80 °C, hold 16 °C (adapted from21). E. coli DH5α quimiocompetent cells were transformed with the ligation products from both levels and grown in LB medium containing X-Gal (20 µg/mL) (Duchefa) and IPTG (1 mM) (Anatrace), supplemented with ampicillin (100 µg/mL) (Formedium) for GB pUPD and MoClo Level 1, chloramphenicol (50 µg/mL) (Formedium) for GB pUPD2 and pFranki, or kanamycin (50 µg/mL) (IBIAN Technologies) for MoClo Level 2. Sequencing (Macrogen) was done previously to plant transformation for correct sequence confirmation.

Plant transformation

Level 2 transformation plasmids were introduced into Agrobacterium tumefaciens LBA4404 quimiocompetent cells and plated in LB medium supplemented with Rifampicin (25 µg/mL) (Sigma-Aldrich), Streptomycin (100 µg/mL) (sigma-Aldrich) and Kanamycin (50 µg/ml). A single transformant colony was grown in 200 mL LB medium supplemented with the same antibiotics at 28 °C under constant shaking to perform Col0 plant transformation22.

Plant growth conditions and selection

T1 seeds were put into soil and grown in an environment controlled room (FitoClima HP, Aralab) under 16/8 hours light/dark conditions, at 22 °C and 65% RH. After 10–20 days, seedlings were sprayed with Basta herbicide (200 mg/L). Resistant plants were grown in the same conditions for T2 seeds recovering.

Seedlings were grown in plates in MS medium23 with 1% sucrose, supplemented with 6 µg/mL of DL-Phosphinothricin (Basta) herbicide (DL-Phosphinothricin, Sigma-Aldrich) for selection when needed, in a growth chamber under 16/8 hours light/dark conditions at 22 °C.

Luciferase reporter assay

For luciferase imaging, 16 seedlings per line were sowed in plates to analyze LUC activity. D-Luciferin Firefly, potassium salt (Biosynth) was dissolved in sterile H2O with 0.01% Triton X-100 to a final concentration of 0.2 µM and sprayed over. After 6 minutes in the dark, luciferase activity was measured in a NightOWL II LB 983 (Berthold Technologies), with 3 minutes of exposition.

Whole Genome Re-sequencing

Isolation of Arabidopsis genomic DNA was performed using a DNeasy Plant Mini Kit (Qiagen). Samples were sent to Novogene Co., Ltd. for library construction and sequencing. There, genomic DNA of each sample was randomly sheared into short fragments of about 350 bp. These fragments were subjected to library construction using the Illumina TruSeq Library Construction Kit, strictly following manufacturer’s instructions. As followed by end-repairing, dA-tailing and further ligation with Illumina adapters, the required fragments (between 300 bp and 500 bp) were selected by PCR and amplified. After gel electrophoresis and subsequent purification, the required fragments were obtained for library construction.

Quality control of the constructed libraries were performed afterwards. Qubit 2.0 fluorometer (Life Technologies) was used to determine the concentration of the DNA libraries. After that, a dilution to 1 ng/µl was done and the Agilent 2100 bioanalyzer was used to assess the insert size. Finally, a quantitative real-time PCR (qPCR) was performed to detect the effective concentration of each library. Pair-end sequencing was performed on the Illumina platform, with the read length of 150 bp at each end.

Bisulfite conversion and sequencing

Genomic DNA of 12 days-old plants of line chiMARs 6.13 was extracted using a DNeasy Plant Mini Kit (Qiagen). Bisulfite treatment was done using the EZ DNA Methylation Gold kit (Zymo Research) following the manufacturer’s instructions. Amplification from converted DNA was performed with NXT Taq PCR kit (EURx) using primers 642 (AATTTCCCGGACGTAGCGTA) and 635 (ATCCAAGCTTTCAAGCCACAC). PCR fragments were checked on an 1% agarose gel for size verification. 4 µl of PCR product was cloned into pGEM-T Easy (Promega) and transformed into chemically competent E. coli DH5α cells. Nine clones were selected for the analysis. Plasmid DNA of each clone was sent for sequencing (GATC), and results were checked using Geneious version 10.2.2 software24. Comparison of the converted clones to the original unconverted sequences was done using CyMate software25, to count the converted/unconverted cytosines at each site. Percenatge of DNA methylation was calculated as (number of methylated C residues in each context (CG, CHG or CHH)/total number of C residues in that context) * 100.

Results

Since the advent of plant genetic transformation, plant biologists have tried to maximize transgene expression level and minimize variability by flanking transgenes with genetic insulators. There are numerous studies that describe the use of a certain insulator sequence in a host organism and analyze different aspects of its barrier and enhancer-blocking ability, but they are performed in such diverse conditions that do not allow for comparison and their results are sometimes contradictory. Our work consists on the use four different insulator sequences flanking a LUC transgene with the aim of conducting a definitive parallel and systematic analysis of their effect on transgene integration, expression level and variance in Arabidopsis seedlings.

Taking advantage of the capacities of modular cloning systems, we generated five identical constructs harboring the firefly luciferase transgene driven by the constitutive mannopine synthase Agrobacterium gene promoter (pMAS) and followed by the Basta resistance selection marker cassette. One of these constructs was used as a control, and the other four were flanked by different sequences reported in the literature to have some type of insulator activity (Fig. 1A). The insulator sequences used in this work were the MAR located next to the tobacco root specific gene Rb7 (Rb7)12, the chicken lysozyme A MAR region (chiMARs)5, the petunia transformation booster sequence (TBS)15 and one of the scaffold/matrix attachment region sequences isolated from Arabidopsis chromosome 4 (AtS/MAR10)26.

Figure 1.

Figure 1

Analysis of insulator effect over LUC activity. (A) Schematic representation of the constructs used for studying the effect genomic insulators flanking transgenes. The above scheme represents the construct used as a control (LUC) while the scheme below represents the four constructs flanked by the four different insulators. pMAS: mannopine synthase gene promoter; LUC: firefly luciferase; Tnos: nopaline synthase terminator: pNos: nopaline synthase promoter; Tocs: octopine synthase terminator. “Insulator” represents Rb7, chiMAR, TBS or AtS/MAR10. (B) Time course of LUC activity when expressed under the pMAS promoter. Lines were assayed for LUC imaging at 12, 22 and 28 days-old. Results for control line LUC 14.9 (indicated with an arrow) are shown, but similar data was obtained for the rest of the lines. d.o.s: day-old seedlings; cps: counts per second. (C) Box and whisker plots showing LUC activity. ** Represents Student’s test significant differences (p < 0.005); *** represents Student’s test highly significant differences (p < 0.001); cps: counts per second. (D) LUC activity imaging of the T3 homozygous lines, eight lines per construct.

The pMAS promoter is known to be most active in the roots of the emerging seedlings and also very active in the cotyledons and lower leaves, with progressively less signal towards the apex of the shoot27,28. Accordingly, a time course study of the LUC expression conferred by the pMAS showed that its activity was maximum in young seedlings (Fig. 1B). Given these results, for the following experiments, LUC activity was always measured in 12 day old seedlings. Eight 3:1 segregating Arabidopsis Col0 T2 lines were randomly selected and a 100% Basta resistant T3 line coming from each of them was used for LUC activity imaging to assess their levels of transgene expression (Fig. 1C). Our results confirmed previous reports, indicating that all constructs flanked by insulator elements led to plants with increased transgene expression than the control (Fig. 1D).

Another property of insulator sequences is their ability to decrease variability between transgenic lines transformed with the same construct reducing the positional effects. When the transgene was flanked by Rb7, chiMARs or TBS, the increase in LUC expression described above was accompanied also by a statistically significant increase in the coefficient of variation between lines, which measures the extent of variation in relation to the mean within a population (Fig. 2A, B). Line 40.01 from AtS/MAR10 behaved very differently from the rest in terms of expression (Fig. 1D). We confirmed it was an outlier (expression value above Q3 + 1.5 × InterQuartileRange) and thus, did not consider it for this analysis. When the outlier line data was removed, the presence of AtS/MAR10 flanking the transgene led to the opposite effect than the rest of insulators, a statistically significant reduction in the coefficient of variation between lines, or what is the same, a reduction in inter-line variation (Fig. 2A, B).

Figure 2.

Figure 2

Analysis of insulator effect over inter-line, inter-individual and inter-generation variation of LUC activity. (A) Scattergrams showing LUC activity in the selected eight lines obtained after transformation with each construct. The CV of each population was calculated as (standard deviation/mean) * 100. (B) Comparison of the inter-line coefficient of variation. ** Represents Student’s test significant differences (p < 0.005); *** represents Student’s test highly significant differences (p < 0.001); CV: coefficient of variation; cps: counts per second. (C) Scattergrams showing LUC activity in 16 seedlings of the eight selected lines obtained after transformation with each construct. CV was determined for each line and calculated as (standard deviation/mean) * 100. cps/cm2: counts per second/cm2. The arrow in the AtS/MAR10 graph represents the outlier line. (D) Comparison of the inter-individual coefficients of variation. CV for each insulator was calculated as (standard deviation/mean) * 100. A great variance was overserved for the insulated lines compared to the control except for AtS/MAR10, that showed a small variation similar to the control, in agreement with the Student’s test. *** Represents highly significant differences (p < 0.001). (E) Box and whisker plots showing LUC activity in T2 and T3 generations of the 8 selected lines obtained after transformation eighth each construct. cps: counts per second.

To analyze the level of variation between genetically identical individuals within a population, we measured the expression of 16 seedlings from each line, and evaluated the effect of insulators on inter-individual (intra-line) variation (Fig. 2C). For Rb7, chiMARs and TBS, the increase in expression induced was not homogeneous between individuals and, as a result, there was a greater variance in these lines compared to the control. For AtS/MAR10, there was a small variance, similar to that of the control with no insulator (CV around 25%) (Fig. 2D).

Next, we compared LUC expression in segregating lines from the T2 generation with homozygous lines from the T3 generation, in an effort to establish if, in our system in study, LUC expression was dependent on gene dosage. Our experiments confirm an increase in expression in all T3 lines compared to T2, consistent with the establishment of homozygous populations (Fig. 2E). Rb7 and AtS/MAR10 lines were the ones where expression increased most in the transition to T3 (T2/T3 expression ratio of 4.1 and 5.5 respectively versus 2.5 of Control, 1.8 of chiMARs and 3.1 of TBS).

In an effort to further characterize the insulator lines in more detail than previous works, we proceeded to perform whole genome re-sequencing (WGR) in some of the lines obtained by transformation with each construct (Fig. 3A). The results allowed us to select 21 lines with a single T-DNA insertion locus. Even though all the lines showed a 3:1 Basta resistance segregation in the T2, we found three T3 lines in which there were multiple insertions in different chromosomes, suggesting that some of them were not leading to proper transgene expression. An interesting finding was that AtS/MAR10 40.01, the outlier line that showed abnormally high LUC expression, had two insertions in the same region of chromosome 1, what could explain its behavior as a single locus in our segregation analysis and the reported increased transgene expression. The WGR data also allowed us to map the T-DNA insertion site of each line and to identify the deletions in the host genome associated with the insertion (Figure S2, Fig. 3B and Table 2). Surprisingly, integration was not homogeneous among all chromosomes (we found none of the mapped insertions to be located in chromosome 2), and for Rb7 lines there was a clear preference for insertion within chromosome 3 (60%, 3 out of 5 lines) and with the T-DNA in the 3′->5′ direction (100%, 5 out of 5 lines), while for the rest of the lines chromosome 3 integrations and reverse T-DNA insertions only represented a 31% in each case (5 out of 16 for each) (Table 2).

Figure 3.

Figure 3

Analysis of insulator effect over T-DNA insertion. (A) Scheme of the WGR pipeline. (B) Representation of the T-DNA insertion sites mapped within the five Arabidopsis chromosomes. (C) Graph showing LUC activity versus chromatin state30 at T-DNA integration site. cps: counts per second.

Table 2.

Details of the T-DNA insertions for single-copy lines based on WGR results.

Line Chr Insertion site (coordinates TAIR10) T-DNA direction Deletion of host genome at insertion site Chromatin State30 Figure S
Control 1009 1 17,979,345 3′ → 5′ 3 bp 1 1.1
Control 1314 3 2,976,299 5′ → 3′ 63 bp 3 1.2
Control 1409 5 561,006 5′ → 3′ 0 bp 5 1.3
Control 3414 3 22,164,245 5′ → 3′ 35 bp 3 1.4
AtS/MAR10 0101 4 317,714 5′ → 3′ 2 bp 7 1.5
AtS/MAR10 1607 1 9,473,679 5′ → 3′ 4256 bp 6 1.6
AtS/MAR10 2308 1 27,233,367 3′ → 5′ 12 bp 1 1.7
AtS/MAR10 4101 3 22,571,686 5′ → 3′ 15 bp 6 1.8
AtS/MAR10 4203 4 1,427,639 5′ → 3′ 26 bp 3 1.9
Rb7 0502 5 25,137,027 3′ → 5′ 48 bp 1 1.10
Rb7 0902 1 2,741,409 3′ → 5′ 14 bp 7 1.11
Rb7 1001 3 3,646,853 3′ → 5′ 26 bp 6 1.12
Rb7 2707 3 19,513,996 3′ → 5′ 91 bp 2 1.13
Rb7 4202 3 6,147,763 3′ → 5′ 34 bp 2 1.14
TBS 0403 3 1,999,288 5′ → 3′ 2 bp 4 1.15
TBS 1109 1 10,441,945 5′ → 3′ 30 bp 6 1.16
TBS 1503 5 16,979,834 3′ → 5′ 27 bp 2 1.17
TBS 1806 1 30,225,399 3′ → 5′ 21 bp 1 1.18
chiMARs 0112 3 1,192,380 5′ → 3′ 96 bp 1 1.19
chiMARs 0613 1 21,638,034 5′ → 3′ 11 bp 8 1.20
chiMARs 0711 4 5,558,851 3′ → 5′ 1 bp 8 1.21

The existence of a selection bias towards T-DNA integrations in euchromatin where the transgenes used for selection of transformants are efficiently expressed has been reported previously in the literature29. This was the case for most of the insertions we mapped (insertion sites in euchromatin, chromatin states 1 to 7 as described in30, Fig. 3C), and when we plotted LUC activity versus state of the chromatin at the T-DNA insertion site, we could observe that lines grouped high or low depending on the construct they belonged to, and not left or right depending on the chromatin state where the T-DNA integration was located (Fig. 3C). However, 2 lines carrying the chiMARs insulator presented T-DNA insertions in regions of the host genome featuring “chromatin state 8”, described as an A/T rich heterochromatic region characterized by methylated DNA and chromatin modifications such as H3K9me2 and H3K27me130.

We performed an analysis of the DNA methylation levels in the junction between the host genome and the T-DNA insertion for chiMAR line 6.13 and our results show that the DNA at the insertion site is indeed heavily methylated while the DNA of the T-DNA remains devoid of this chromatin modification even in the T3 generation, consistent with a boundary role of the insulator avoiding the repressive mark spreading (Fig. 4).

Figure 4.

Figure 4

Analysis of the DNA methylation levels in the junction between the host genome and the T-DNA for line chiMAR 6.13. Upper panel: schematic representation of the junction site. Middle panel: graphical output of the methylation analysis (CyMate software) in 12 day-old seedlings of chiMARs 6.13 line. Red circles represent CG sites, blue squares represent CHG sites and green triangles represent CHH sites. Filled symbols indicate methylated cytosines while empty ones represent non methylated cytosines. Lower panel: the graph shows the DNA methylation quantification of CG (red bars), CHG (blue bars) and CHH (green bars) cytosine contexts for the flanking sequence (left) and the T-DNA (right).

The data from WGR also allowed us to characterize the genomic sequence generated as a result of the T-DNA integration, and we could observe that for 8 out of 17 of the lines that contained insulator sequences, we had evidence of a lack of precision in the insertion of the RB, while that was not the case for any of the 4 control lines (Fig. 5). 3 out of 5 of the AtS/MAR10 lines contained vector backbone DNA (from outside the T-DNA region) integrated into the plant genome, while 3 out of 5 of the Rb7 lines, one AtS/MAR10 and one TBS line showed different degrees of truncation of the inserted T-DNA in the right border region. There was no evidence of truncation in the LB for any of the lines analyzed.

Figure 5.

Figure 5

Characterization of the genomic sequences generated as a result of the T-DNA integrations. Upper panel: Table showing details of T-DNA 5′ and 3′ insertion sites. Chr: chromosome; Coord: coordinates. Lower panel: Schematic representation of the transformation vector genome insertion site for each line.

Discussion

Effect of insulators on transgene expression level and variation between lines

Most previous works have reported positive evidence of the effects of insulators on transgene expression, although some works can be found in the literature that report no such effect. The experiments were, however, very diverse in terms of species (some experiments had been done in tobacco and others in Arabidopsis) and in terms of method of transformation (some performed in primary transformants after regeneration and some in floral-dipped Arabidopsis).

It was an important motivation for the present study to compare the effects of the different isolators in the same conditions: organism, developmental stage and transformation method. Our results do in fact support most results from literature, since we detect an increase in expression for lines where LUC is flanked by any of the four insulators, and previous negative results could reflect a dependency of the function of insulators on the experimental conditions.

Noteworthy, the use of AtS/MAR10, that had never been tested before for insulator activity, resulted in a moderate but very consistent increase in LUC expression.

In our hands, neither chiMARs, Rb7 nor TBS had an effect on reducing inter-line or inter-individual variation, in fact they increased them significantly. However, previous studies on the effect of chiMARs had highlighted its effect on the reduction of expression variability among transgenic lines7,8. This inconsistence could derive from a few factors in which our study differs basically from these other works. First, in our system we have used the pMAS promoter (versus the p35S used by Mlynarova et al.7,8) which never reaches such high levels of expression as the p35S, but that results in normally distributed expression levels in populations of transformants9. It might be possible that the chiMARs works reducing the variance of strong promoters but its effect is not so apparent in promoters with an intrinsically low level of variation such as pMAS, like Mankin et al.13 described for Rb7. Second, in our study we have analyzed expression in homozygous T3 lines, that are already established lines with low variance in comparison with the T1 transformants analyzed by Mlynarova et al.7,8. It is interesting to note that the levels of variability between lines in the LUC control are in the same range as the variability between genetically identical individuals (around 30%), supporting the consistency and small intrinsic variance of our experimental set up in which we analyze T3.

In fact, it is striking that AtS/MAR10 is able to diminish inter-line variance, proving efficient in modifying both of the parameters measured, increasing transgene expression and reducing variability between lines, what makes it the best performing of the insulators analyzed.

Effect of insulators on T-DNA insertion

Two interesting observations have been made regarding the effect of insulators on the insertion of T-DNAs. On the one hand, it is reported that T-DNA integrations recovered by selection are mostly located in “open chromatin” or euchromatin, while, without selection, integration is biased towards regions with marks of heterochromatin29. This is explained by the silencing of the selection genes when integration takes place within heterochromatin, a phenomenon that prevents transformant recovery. Our results show the ability of chiMAR to shelter T-DNAs from heterochromatin spreading and to allow for transgene expression regardless of the position effect.

On the other hand, the observation of an increased frequency of truncated T-DNAs in the lines containing insulators had been reported before by Li et al.31. Our results can be interpreted in the light of a role of insulators in the protection of transgenes at the right border end of the T-DNA from deletions. This would also explain the low correlation of expression between reporter genes located within the same T-DNA observed in many previous studies, and shown to improve by the use of insulators flanking them8.

An in silico analysis of the insulators sequences using NonB DB32 showed that the 5′ region of the AtS/MAR10 contains an inverted repeat and a mirror repeat rich in purines, features that lead to the formation of cruciform and triplex structures, respectively, which have been associated with genomic instability33. Future experiments could be directed at understanding the role of these repeats in the insertion of truncated T-DNAs or vector backbone in constructs harboring AtS/MAR10.

As a general conclusion, we can state that there are many different insulators described in the literature with very different properties. Their functions might reflect differences in their action mechanisms and their use in transgenic constructs should depend on the needs of a specific experiment.

In our experimental setup, the best performing insulators were Rb7 in terms of increase of transgene expression, and AtS/MAR10 in terms of reducing variance.

Plant biologists should invest more efforts in the development of technologies that can render transgenes with high and stable expression with rapidity and ease. The future of synthetic biology and biotechnology projects depends on our ability to stabilize transgene expression and alleviate interference with the host genome regulation. In this work we show that the use of genetic insulators can help achieve these objectives with their simple addition at the flanks of the constructs used for transformation.

Supplementary information

Acknowledgements

Technical help from Cristina Vaca and Diana Coroian is greatly appreciated.

Author Contributions

A.P. and E.C. designed the experiments and analyzed the results. A.P. carried out the experiments. E.C. wrote the manuscript.

Data Availability

All materials, data and associated protocols are available to readers.

Competing Interests

The authors declare no competing interests.

Footnotes

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information accompanies this paper at 10.1038/s41598-019-44836-6.

References

  • 1.Pérez-González, A. & Caro, E. In Systems Biology Application in Synthetic Biology 79–89, 10.1007/978-81-322-2809-7_7 (Springer India, 2016).
  • 2.Matzat LH, Lei EP. Surviving an identity crisis: a revised view of chromatin insulators in the genomics era. Biochim. Biophys. Acta. 2014;1839:203–14. doi: 10.1016/j.bbagrm.2013.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Butaye KMJ, et al. Stable high-level transgene expression in Arabidopsis thaliana using gene silencing mutants and matrix attachment regions. Plant J. 2004;39:440–449. doi: 10.1111/j.1365-313X.2004.02144.x. [DOI] [PubMed] [Google Scholar]
  • 4.Butaye KMJ, Cammue BPA, Delauré SL, De Bolle MFC. Approaches to Minimize Variation of Transgene Expression in Plants. Mol. Breed. 2005;16:79–91. doi: 10.1007/s11032-005-4929-9. [DOI] [Google Scholar]
  • 5.Loc PV, Strätling WH. The matrix attachment regions of the chicken lysozyme gene co-map with the boundaries of the chromatin domain. EMBO J. 1988;7:655–64. doi: 10.1002/j.1460-2075.1988.tb02860.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Stief A, Winter DM, Strätling WH, Sippel AE. A nuclear DNA attachment element mediates elevated and position-independent gene activity. Nature. 1989;341:343–345. doi: 10.1038/341343a0. [DOI] [PubMed] [Google Scholar]
  • 7.Mlynarova L, et al. Reduced Position Effect in Mature Transgenic Plants Conferred by the Chicken Lysozyme Matrix-Associated Region. Plant Cell. 1994;6:417–426. doi: 10.1105/tpc.6.3.417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Mlynarova L, Jansen RC, Conner AJ, Stiekema WJ, Nap JP. The MAR-Mediated Reduction in Position Effect Can Be Uncoupled from Copy Number-Dependent Expression in Transgenic Plants. Plant Cell. 1995;7:599–609. doi: 10.1105/tpc.7.5.599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.De Bolle MFC, et al. Analysis of the influence of promoter elements and a matrix attachment region on the inter-individual variation of transgene expression in populations of Arabidopsis thaliana. Plant Sci. 2003;165:169–179. doi: 10.1016/S0168-9452(03)00156-0. [DOI] [Google Scholar]
  • 10.De Bolle MFC, et al. The influence of matrix attachment regions on transgene expression in Arabidopsis thaliana wild type and gene silencing mutants. Plant Mol. Biol. 2007;63:533–543. doi: 10.1007/s11103-006-9107-x. [DOI] [PubMed] [Google Scholar]
  • 11.Allen GC, et al. High-Level Transgene Expression in Plant Cells: Effects of a Strong Scaffold Attachment Region from Tobacco. Plant Cell. 1996;8:899–913. doi: 10.1105/tpc.8.5.899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hall G, Allen GC, Loer DS, Thompson WF, Spiker S. Nuclear scaffolds and scaffold-attachment regions in higher plants. Proc. Natl. Acad. Sci. USA. 1991;88:9320–4. doi: 10.1073/pnas.88.20.9320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mankin SL, Allen GC, Phelan T, Spiker S, Thompson WF. Elevation of Transgene Expression Level by Flanking Matrix Attachment Regions (MAR) is Promoter Dependent: A Study of the Interactions of Six Promoters with the RB7 3′ MAR. Transgenic Res. 2003;12:3–12. doi: 10.1023/A:1022194120518. [DOI] [PubMed] [Google Scholar]
  • 14.Abranches R, Shultz RW, Thompson WF, Allen GC. Matrix attachment regions and regulated transcription increase and stabilize transgene expression. Plant Biotechnol. J. 2005;3:535–543. doi: 10.1111/j.1467-7652.2005.00144.x. [DOI] [PubMed] [Google Scholar]
  • 15.Hily JM, Singer SD, Yang Y, Liu Z. A transformation booster sequence (TBS) from Petunia hybrida functions as an enhancer-blocking insulator in Arabidopsis thaliana. Plant Cell Rep. 2009;28:1095–1104. doi: 10.1007/s00299-009-0700-8. [DOI] [PubMed] [Google Scholar]
  • 16.Singer SD, Hily JM, Cox KD. Analysis of the enhancer-blocking function of the TBS element from Petunia hybrida in transgenic Arabidopsis thaliana and Nicotiana tabacum. Plant Cell Rep. 2011;30:2013–2025. doi: 10.1007/s00299-011-1109-8. [DOI] [PubMed] [Google Scholar]
  • 17.Dietz-Pfeilstetter A, Arndt N, Manske U. Effects of a petunia scaffold/matrix attachment region on copy number dependency and stability of transgene expression in Nicotiana tabacum. Transgenic Res. 2016;25:149–162. doi: 10.1007/s11248-015-9924-2. [DOI] [PubMed] [Google Scholar]
  • 18.Singh R, Yadav R, Amla DV, Sanyal I. Characterization and functional validation of two scaffold attachment regions (SARs) from Cicer arietinum (L.) Plant Cell. Tissue Organ Cult. 2016;125:135–148. doi: 10.1007/s11240-015-0935-8. [DOI] [Google Scholar]
  • 19.10.1371/journal.pone.0021622.
  • 20.Singh R, Yadav R, Amla DV, Sanyal I. Characterization and functional validation of two scaffold attachment regions (SARs) from Cicer arietinum (L.) Plant Cell. Tissue Organ Cult. 2016;125:135–148. doi: 10.1007/s11240-015-0935-8. [DOI] [Google Scholar]
  • 21.Weber E, Engler C, Gruetzner R, Werner S, Marillonnet S. A Modular Cloning System for Standardized Assembly of Multigene Constructs. PLoS One. 2011;6:e16765. doi: 10.1371/journal.pone.0016765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Clough SJ, Bent AF. Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 1998;16:735–43. doi: 10.1046/j.1365-313x.1998.00343.x. [DOI] [PubMed] [Google Scholar]
  • 23.Murashige T, Skoog F. A Revised Medium for Rapid Growth and Bio Assays with Tobacco Tissue Cultures. Physiol. Plant. 1962;15:473–497. doi: 10.1111/j.1399-3054.1962.tb08052.x. [DOI] [Google Scholar]
  • 24.Kearse M, et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–1649. doi: 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Hetzl J, Foerster AM, Raidl G, Scheid OM. CyMATE: a new tool for methylation analysis of plant genomic DNA after bisulphite sequencing. Plant J. 2007;51:526–536. doi: 10.1111/j.1365-313X.2007.03152.x. [DOI] [PubMed] [Google Scholar]
  • 26.Pascuzzi PE, et al. In vivo mapping of arabidopsis scaffold/matrix attachment regions reveals link to nucleosome-disfavoring poly(dA:dT) tracts. Plant Cell. 2014;26:102–20. doi: 10.1105/tpc.113.121194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Langridge WHR, Fitzgerald KJ, Koncz C, Schell J, Szalay AA. Dual promoter of Agrobacterium tumefaciens mannopine synthase genes is regulated by plant growth hormones. Proc. Natl. Acad. Sci. 1989;86:3219–3223. doi: 10.1073/pnas.86.9.3219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Leung J, Fukuda H, Wing D, Schell J, Masterson R. Functional analysis of cis-elements, auxin response and early developmental profiles of the mannopine synthase bidirectional promoter. MGG Mol. Gen. Genet. 1991;230:463–474. doi: 10.1007/BF00280304. [DOI] [PubMed] [Google Scholar]
  • 29.Francis KE, Spiker S. Identification of Arabidopsis thaliana transformants without selection reveals a high occurrence of silenced T-DNA integrations. Plant J. 2005;41:464–477. doi: 10.1111/j.1365-313X.2004.02312.x. [DOI] [PubMed] [Google Scholar]
  • 30.Sequeira-Mendes J, et al. The Functional Topography of the Arabidopsis Genome Is Organized in a Reduced Number of Linear Motifs of Chromatin States. Plant Cell. 2014;26:2351–2366. doi: 10.1105/tpc.114.124578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Li J, Brunner AM, Meilan R, Strauss SH. Matrix attachment region elements have small and variable effects on transgene expression and stability in field-grown Populus. Plant Biotechnol. J. 2008;6:887–896. doi: 10.1111/j.1467-7652.2008.00369.x. [DOI] [PubMed] [Google Scholar]
  • 32.Cer RZ, et al. Non-B DB v2.0: A database of predicted non-B DNA-forming motifs and its associated tools. Nucleic Acids Res. 2013;41:D94–D100. doi: 10.1093/nar/gks955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zhao J, Bacolla A, Wang G, Vasquez KM. Non-B DNA structure-induced genetic instability and evolution. Cellular and Molecular Life Sciences. 2010;67:43–62. doi: 10.1007/s00018-009-0131-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

All materials, data and associated protocols are available to readers.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES