Skip to main content
Wellcome Open Research logoLink to Wellcome Open Research
. 2021 Nov 19;6:265. Originally published 2021 Oct 13. [Version 2] doi: 10.12688/wellcomeopenres.17229.2

MUC4 is not expressed in cell lines used for live cell imaging

Naouel Athmane 1, Iain Williamson 1, Shelagh Boyle 1, Simon C Biddie 1, Wendy A Bickmore 1,a
PMCID: PMC8567686  PMID: 34796278

Version Changes

Revised. Amendments from Version 1

Legend for Figure 4E corrected and information about other genes containing tandem repeats has been added to the Discussion.

Abstract

Background: The ability to visualise specific mammalian gene loci in living cells is important for understanding the dynamic processes linked to transcription. However, some of the tools used to target mammalian genes for live cell imaging, such as dCas9, have been reported to themselves impede processes linked to transcription. The MUC4 gene is a popular target for live cell imaging studies due to the repetitive nature of sequences within some exons of this gene.

Methods: We set out to compare the impact of dCas9 and TALE-based imaging tools on MUC4 expression, including in human cell lines previously reported as expressing MUC4.

Results: We were unable to detect MUC4 mRNA in these cell lines. Moreover, analysis of publicly available data for histone modifications associated with transcription, and data for transcription itself, indicate that neither MUC4, nor any of the mucin gene family are significantly expressed in the cell lines where dCas9 targeting has been reported to repress MUC4 and MUC1 expression, or in the cell lines where dCas13 has been used to report MUC4 RNA detection in live cells.

Conclusions: Methods for visualising specific gene loci and gene transcripts in live human cells are very challenging. Our data suggest that care should be given to the choice of the most appropriate cell lines for these analyses and that orthogonal methods of assaying gene expression be carefully compared.

Keywords: dCas9, imaging, mucin, transcription

Introduction

Live cell imaging of actively transcribing genes allows investigation of the changes in chromatin dynamics associated with gene expression. However, achieving this involves multiple technical challenges and has, to date, resulted in contradictory conclusions likely due to the different approaches used for visualising the loci under investigation.

Live imaging studies of genomic loci in mammalian cells have mainly targeted repetitive elements. Initial studies relied on the insertion of repetitive arrays of bacterial operators (LacO, tetO, CuO) into the mouse or human genome and visualisation using the corresponding binding proteins (lacI, TetR, CymR) ( Alexander et al., 2019; Chubb et al., 2002; Pollex & Heard, 2019). Repeats are easier to visualise because they help to accumulate many fluorescent molecules in one spot, producing a high signal to noise ratio. The orthogonal ANCHOR system uses insertion of a short ANCH sequence and then spreading of the bacterial ParB protein, seeded at these binding sites, to enhance the signal ( Germier et al., 2017). However, these approaches raise concerns about alteration of the chromatin and epigenetic state of the tagged loci caused by the insertion of bacterial DNA sequences. ( Jacome & Fernandez-Capetillo, 2011; Osorio-Valeriano et al., 2019).

More recently, directed binding by catalytically dead Cas9 (dCas9) and transcription-activator like effectors (TALEs) have been used to visualise telomeres, centromeres, sub-telomeric and pericentromeric repeats ( Ma et al., 2016) or SINE elements in mammalian cells ( Chen et al., 2013; Knight et al., 2015; Miyanari et al., 2013). These approaches do not rely on modifying the genomic locus under investigation.

However, the goal has been to image specific gene loci – not just repetitive elements. Various approaches, such as the use of multiple sgRNAs and fluorescently-tagged dCas9 ( Gu et al., 2018), or the recruitment of multiple tagged proteins to sgRNAs ( Cheng et al., 2016; Qin et al., 2017) have been used to overcome the signal:noise problem. In addition, the MUC4 gene has become a popular locus targeted for live cell visualisation due to a repetitive region in one of its exons ( Chen et al., 2013).

However, dCas9 has been reported to inhibit both gene expression when bound at a gene locus ( Chen et al., 2013), and the binding of endogenous transcription factors ( Gao et al., 2014) ( Shariati et al., 2019). Understanding how current live cell imaging tools impact gene expression is therefore an important aim. We set out to compare dCas9 and TALE based imaging to visualise the MUC4 locus in human cell lines in order to assess the impact of dCas9 and TALE binding on the ability of the targeted locus to be transcribed. However, this work led us to question whether the cell lines commonly used for imaging MUC4 are an appropriate background on which to investigate MUC4 expression.

Our experimental data and our bioinformatic analysis of publicly available data suggest that neither MUC4, MUC1 or indeed any member of the mucin gene family, are expressed at significant levels in the cell lines that have been previously used either to study dCas9-mediated repression of MUC4/ MUC1 expression, or to visualise MUC4 RNA in the nucleus.

Methods

Cell culture and transfections

U2OS cells were grown in McCoy’s 5A supplemented with 10% fetal bovine serum (FBS) and Penicillin/Streptomycin. Transfections were performed using Lipofectamine 3000 reagent (Invitrogen) following the manufacturer’s recommendations. Briefly, 90% confluent U2OS were transfected in a 6-well plate with 1μg of plasmid and 3,75μl of lipofectamine and 4 μl of P3000 reagent. dCas9-EGFP or MUC4 targeting TALE-EGFP were transiently transfected into U2OS cells prior to imaging and RNA extraction. Transient transfection of sgRNAs in pSLQ1651 was monitored by mCherry expression.

DNA FISH

FISH on metaphase arrested U2OS cells was performed as previously described ( Fantes et al., 1992). Fosmid WI2-1916J7 (chr3:195764450-195798680; hg38) was used to detect MUC4 exon 2 and was directly labelled with ChromaTide Alexa Fluor 594-5-dUTP (Thermofisher scientific C11400) by nick translation. 200ng of labelled probe were used per slide, with 8ug human CotI DNA (Invitrogen, cat#18440-016) and 10ug sonicated salmon sperm DNA (Sigma, cat#31149) and denatured in hybridization mix at 70°C for five minutes, then preannealed at 37°C for 15 minutes. The probe was then hybridized to the denatured slides in a humid chamber at 37°C overnight (approximately 16 hours). Slides were washed for 4x3 minutes in 2xSSC at 45°C, then 0.1xSSC at 60°C. Slides were counterstained in 0.5 µg/ml DAPI and mounted using Vectashield prior to imaging.

dCas9 and TALE MUC4 targeting constructs

A TALE binding domain targeting MUC4 was assembled simultaneously using Golden Gate Assembly of Esp3I digested fragments. Four modules from RVD encoding plasmids (Addgene kit #1000000024) based on the MUC4 targeting RVDI to IV described by Ren et al. (2017) were assembled into a pTAL-spec-puro-eGFPmodified vector by thermocycling ((37°C 10 mins, 16°C 10 mins)x12), 36°C 15mins, 80°C 5 mins. RVDI(CCTG), RVDII (TCAC), RVDIII (CGAC), RVD IV (ACT). Golden Gate products were transformed into E. coli and selected on spectinomycin plates. Colonies screened for fully assembled TALEs by PCR were confirmed by Sanger sequencing and by a diagnostic digestion with NotI and BamHI.

The sgRNA targeting MUC4 expressed under control of the U6 promoter (pSLQ 1661-sgMUC4 – E3(F+E)) was generated by Chen et al. (2013) and ordered from Addgene (addgene #51025). This was co-transfected into U2OS cells together with dCas9-EGFP-NLS plasmid.

RNA extraction and cDNA synthesis

To quantify gene expression when targeting MUC4, dCas9-EGFP and TALE-EGFP expressing cells were harvested 24h after gRNA transfection. RNA was extracted from approximately 1×10 6 cell pellets using the RNeasy mini kit (Qiagen 74106) as per the manufacturer’s instruction, including an on-column DNase digestion (Qiagen 79254), eluted in 20 μL ddH2O and quantified using the Qubit RNA broad range assay (ThermoFisher Q10210) with the Qubit 4 fluorometer. cDNA was synthesised from 2 μg RNA using Superscript II reverse transcriptase (Invitrogen 18064071) primed with random hexamers in a final volume of 20µl (Promega C1181) as per the manufacturer’s instructions.

RT-qPCR

For real-time (q)RT-PCR analysis of MUC4 expression in U2OS cells ( Figure 3A and B), qPCR was carried out on the BioRad CFX96 Real-Time System as follows: For a final volume of 10 µl, three technical replicates were prepared together for each sample: 17.5 µl of Light cycler 480 SYBR green I master (Roche 04887352001) + 10.5µl 1 µM primer mix of forward and reverse primer + 7 µl cDNA (diluted 1:4). A standard curve was included for each primer set. Thermal cycler conditions were 44 PCR cycles (95° for 5 min, 95° for 10s, 60° for 10s, 72° for 20s). Primer sequences used for qRT-PCR were:

MUC4 ( Chen et al., 2013) ( Figure 1B)

Figure 1. The MUC4 locus.

Figure 1.

( A) UCSC Genome Browser screen shot of the MUC4/MUC20 region on human chromosome 3. The location of the two repeated regions in exon2 (E2) and intron3 (I3) of MUC4 are indicated. The target sequence (black line) and the PAM (red line) of sgMUC4-E3 for targeting of dCas9 to E2 of MUC4 are shown. At the bottom, the position of the probe used for FISH is indicated. ( B) Location of primers (arrowheads) located within exon 17 used for RT-PCR analysis of MUC4 expression by Chen et al. (2013). ( C) Location of primers (arrowheads) spanning an exon-exon junction used here for RT-PCR analysis of MUC4 mRNA expression (mRNA transcript variant used for design: NCBI Reference Sequence: NM_004532.6).

Fw: 5’ TCAATGGTGGTCGTGTGATT 3’

Rv: 5’ AAGTCGGTGCAGCTGTCTCT 3’

β-actin

Fw 5’ CATGTACGTTGCTATCCAGGC 3’

Rv 5’ CTCCTTAATGTCACGCACGAT 3’

Gene expression data were analysed for transfected cells (TALE or dCas9) vs mock transfected cells, normalised to β-actin by the (2 ΔΔC T ) method ( Livak & Schmittgen, 2001) as follows:

ΔΔCT=[(CTMUC4CTβactin)transfected(CTMUC4CTβactin)mock]

RT-PCR

First strand product was amplified by PCR using primers spanning between exon 3 and exon 4 of MUC4 for 35 cycles (95°C for 20 s, 60°C for 30 s, 72°C for 30 s) and products visualised by agarose gel electrophoresis ( Figure 3C). A -RT control was included. Primer sequences were:

MUC4 ( Figure 1C):

Fw: 5’ CACAACCTCCCAGACCATCAT 3’

Rv: 5’ GGAAGAGGGAAACTCCTCTCTCA 3’

β-actin:

Fw 5’ AGAGCTATGAGCTGCCTGACG 3’

Rv 5’ TGTGTTGGCATAGAGGTCTTTACG 3’

Image acquisition

U2OS cells growing on slides were fixed with 4 % paraformaledhyde, permeabilised with Triton X-100 and DAPI stained 24h following transfection with dCas9 or TALE constructs. Slides were imaged using a Photometrics Coolsnap HQ2 CCD camera and a Zeiss AxioImager A1 fluorescence microscope with a Plan Apochromat 100x 1.4NA objective, a Nikon Intensilight Mercury based light source (Nikon UK Ltd, Kingston-on-Thames, UK) and Chroma #89014ET (3 colour) single excitation and emission filters (Chroma Technology Corp., Rockingham, VT) with the excitation and emission filters installed in Prior motorised filter wheels. A piezoelectrically driven objective mount (PIFOC model P-721, Physik Instrumente GmbH & Co, Karlsruhe) was used to control movement in the z dimension. Step size for z stacks was set to 0.2 μm. Hardware control and image capture were performed the acquisition module or Nikon Nis-Elements software (Nikon UK Ltd, Kingston-on-Thames, UK).

Bioinformatic analysis of gene expression

ENCODE chromatin immunoprecipitation (ChIP)-seq data from U2OS cells were obtained from NCBI GEO: H3K4me3 (GSM871043), H3K36me3 (GSM788076).

For mature and nascent RNA-seq analysis from cell lines, publicly available data were obtained as indicated in Table 1. FastQ files were aligned to human genome hg38 using Bowtie.2 with default settings. BAM files were then used to generate BigWig files using bamCoverage with normalisation across samples by scaling to 1X genome size.

Table 1. Sources of mRNA and nascent RNA-seq data for Capan-1 and 2 cells, U2OS, RPE1, HeLa and HT1080 cells.

Cell
line
RNA GSE Series SRA Number
Mature mRNA
Capan-1 mRNA GSE79669 SRR3308945
Capan-2 mRNA GSE79669 SRR3308945
U-2OS mRNA GSE162163 SRR13142368
RPE1 mRNA GSE98541 SRR5508027
HeLa mRNA GSE90235 SRR5048095
HT1080 mRNA GSE78653 SRR3192620
Nascent RNA
U-2OS 4SU-seq (4-thiouridine) GSE162264 SRR13159400
RPE1 EU-seq (5-ethynyl uridine) GSE137448 SRR10119526
HeLa 4SU-seq (4-thiouridine) GSE128753 SRR8775198

To analyse mucin gene family expression, transcripts per million (TPM) data from RNA-seq datasets for 934 cell lines were obtained from the EBI-EMBL Expression Atlas data release 37. A heatmap was generated using pheatmap with TPM counts expressed as a Z-score. Cell lines were assigned an organ origin type with clustering using the default clustering method.

Results

Detection of the MUC4 locus in U2OS cells

MUC4 is the most common mammalian gene targeted for visualisation in live cells as its coding sequence has a variable number (>100) of a 48 nt tandem repeat in exon 2. This 7.5 to 19 kb repeat region results in the translation of a 550 to 930 kDa protein ( Chaturvedi et al., 2008). The MUC4 locus also contains 90 repeats of a 15 bp sequence in intron 3 ( Chen et al., 2013) ( Figure 1).

Because dCas9 and TALE targeting approaches have been shown to have very different efficiencies with regard to the synthetic activation and repression of gene loci ( Gao et al., 2014), we aimed to target dCas9-GFP or TALE-GFP to the MUC4 locus in human cell lines with the aim of assaying the chromatin dynamics of this locus when transcriptionally active.

There have been various reports visualising MUC4 in live U2OS cells ( Chen et al., 2013; Qin et al., 2017). We therefore chose this bone osteosarcoma epithelial cell line for our initial studies. We targeted dCas9, to the MUC4 locus using a sgRNA ( Figure 1A) previously designed for exon 2 ( Chen et al., 2013). We also designed TALE proteins that target exon 2 of MUC4.

U2OS cells are reported to be hypertriploid. We confirmed this for chromosome 3q, where MUC4 is located, using DNA FISH ( Figure 2A) with a probe that encompasses exon 3 ( Figure 1A). MUC4-specific dCas9+sgRNA as well as TALE-GFP constructs were then transfected into U2OS cells. Foci were detected with both dCas9 and TALES. However, whilst 3 spots per nucleus were detected by the dCas9, consistent with the presence of three copies of the locus detected by DNA FISH ( Figure 2B), up to 6 spots per nucleus were detected with the TALE-GFPs leading us to question what the TALE constructs were detecting.

Figure 2. Detection of the MUC4 locus.

Figure 2.

A) Representative FISH image of a U2OS DAPI-stained metaphase chromosomes hybridised with the MUC4 fosmid probe (red) and indicating three copies of the locus. Scale bar= 10μm. The graph to the right shows the number of loci detected per spread (n=23 metaphase spreads). B) Images of GFP fluorescence in fixed U2OS cells transiently expressing either dCas9-gRNA or TALE constructs specific to the MUC4 locus. The graphs to the right quantify the number of spots observed in transfected cells for either dCas9 or TALE.

MUC4 is not expressed in U2OS, RPE or HeLa cells

Since it has been previously reported that targeting dCas9 to exon 2 of MUC4 leads to partial repression of MUC4 expression ( Chen et al., 2013), we wished to assay whether MUC4 expression is impacted similarly by the binding of TALEs or dCas9 in U2OS cells. We performed real-time (q)RT-PCR using the previously reported MUC4 qRT-PCR primers ( Chen et al., 2013). Whilst there was some modest reduction in the concentration of the MUC4 amplicons from dCas9 and TALE-transfected cells relative to mock transfected cells, this was variable between biological replicates, especially for dCas9 ( Figure 3A). However, Ct values for MUC4 amplification were very high compared with the β-actin control ( Figure 3B) suggesting that MUC4 expression levels may be very low in this cell line and therefore that the qRT-PCR results may be unreliable. We also noted that the previously reported ( Chen et al., 2013) MUC4 qRT-PCR primers are located entirely within exon 17 ( Figure 1B) making it hard to exclude genomic DNA contamination, or to distinguish spliced from unspliced transcripts. Indeed, we obtained the same Ct values with these MUC4 primers in U2OS RNA samples + or -RT.

Figure 3. No evidence of MUC4 expression in U2OS cells.

Figure 3.

A) qRT-PCR data for MUC4 showing amplification levels in cells transfected with dCas9 and TALEs targeting MUC4, relative to mock transfected cells (2 -ΔΔCt) and normalised to β-actin. Primers are those described by Chen et al. (2013) and shown in Figure 1B. Data show means (+/- stdev) from three biological replicates. B) Graph showing the rise in amplified product concentration with increasing cycle number for β-actin and for MUC4. Data shown are from three technical replicates of one of the three biological replicates used in ( A). C) RT-PCR using the primers shown in Figure 1C to detect the expression of MUC4 (top) in RNA prepared from U2OS cells and human colon mucosa. Amplification of β-actin (bottom) acts as a positive control. - reverse transcriptase (RT) and DNAse I untreated samples act as controls for the presence of genomic DNA contamination in RNA samples. D and E) UCSC Genome Browser screen shot of the genomic regions containing the MUC20/MUC4 ( D) and MUC1 ( E) loci and adjacent non-mucin genes. Shown below is the ENCODE H3K36me3 ChIP-seq track from U2OS cells (GEO Accession number GSM788076). Genome co-ordinates are from the hg19 assembly of the human genome.

We therefore also designed RT-qPCR primers which span across the exon 3-exon 4 junction of the MUC4 mRNA ( Figure 1C). MUC4 encodes for a mucin, a transmembrane glycoprotein which is an important constituent of mucus. It is expressed by epithelial cells in the airway, the cervix, and the colon and is aberrantly expressed in some cancers ( Chaturvedi et al., 2008). We therefore used RNA from human colonic mucosa tissue as a positive control for MUC4 expression. RT-PCR detects a strong ~110bp MUC4 band in the colonic mucosa sample but not in U2OS cells ( Figure 3C). There are multiple alternatively spliced isoforms of MUC4 ( Figure 1), therefore it is possible that we were unable to detect the isoforms expressed in U2OS cells using the primers designed for RT-PCR.

Tri-methylation of histone H3K36 (H3K36me3) occurs co-transcriptionally and is enriched over the exons of expressed genes ( de Almeida et al., 2011; Kim et al., 2011). Analysis of ENCODE ChIP-seq data for H3K36me3 in U2OS cells shows an absence of H3K36me3 from MUC4, from the adjacent MUC20 gene on chromosome 3 ( Figure 3D) and from MUC1 located on human 1 ( Figure 3E) in contrast to the neighbouring non-mucin genes. We therefore conclude that the mucin gene family, and particularly MUC4 is not expressed in U2OS cells.

The live cell imaging study of the MUC4 locus ( Chen et al., 2013) used the human retinal pigment epithelium (RPE) cell line, and reported that targeting dCas9 to exon 2 of MUC4 led to partial repression of MUC4 expression as assayed by qRT-PCR. Very significant (70–80%) repression of expression was also reported for targeting of MUC1 with dCas9. This implies expression of the mucin genes in this cell line, which is surprising given the origin of these cells from the pigmented epithelium at the back of the eye.

To investigate this further, we searched publicly available RNA-seq datasets from RPE cells. No mature MUC4, MUC20 ( Figure 4A) or MUC1 ( Figure 4B) mRNAs were detected in these RPE datasets ( Figure 4A), making it hard to understand how meaningful the repression, reported as a consequence of dCas9 targeting at these loci, is. Consistent with previous reports ( Jonckheere et al., 2004) MUC4 expression was detected in the pancreatic adenocarcinoma cell lines CAPAN-1 and CAPAN-2.

Figure 4. RNA-seq analysis for the mucin genes.

Figure 4.

RNA-seq data from mature mRNA at the ( A) MUC4/MUC20 locus and ( B) MUC1 locus for human cell lines; CAPAN-1 and CAPAN-2 pancreatic adenocarcinoma, RPE1, U2OS, HeLa and HT1080 fibrosarcomma cell lines. Data sources are detailed in Table 1. C) Heatmap showing z-scores for transcripts per million (TPM) for the mucin gene family obtained from RNA-seq datasets for 934 cell lines from the EBI-EMBL Expression Atlas data release 37. Cell lines were assigned an organ origin type with clustering using the default clustering method. Zoom in shows data for MUC4. D and E) Nascent (4SU) RNA-seq data at the ( D) MUC4/MUC20 and ( E) MUC1 loci in U2OS, HeLa and RPE cell lines. Data sources are detailed in Table 1.

Using publicly available datasets, we also found no evidence of mRNA expression from these mucin genes in U2OS cells or in HeLa cells ( Figure 4A and B), even though visualisation of MUC4 transcripts using dCas13 and single molecule FISH has been recently reported in live and fixed HeLa cells, respectively ( Yang et al., 2019).

Analysis of RNA-seq data for 934 cell lines, including U2OS and CAPAN cells ( Figure 4C), confirmed highly restricted expression of the entire mucin gene family, with MUC4 expression detected in a small number of cell lines, including CAPAN-1 and 2, of gastrointestinal, urological and pancreatic origin. No expression of any mucin gene was detected in U2OS cells.

To ascertain if despite the absence of stable mucin mRNAs, there might still be transcription from the MUC4/MUC20 loci in the cell lines examined, we assessed nascent RNA-seq (4-thiouridine/4SU-seq) data from U2OS and HeLa cells and 5-ethynyl uridine/EU-seq from RPE cells. No evidence for nascent transcription was detected from MUC4/MUC20 in data from any of these cell lines ( Figure 4D).

Discussion

The ability to detect endogenous gene loci in mammalian cells is an important goal and, the ability to study these genes during the act of transcription is key to understanding both the chromatin dynamics associated with transcription and the spatial organisation of these genes relative to the components of the transcriptional machinery. Whilst many groups are exploring ways to improve the signal:noise problems inherent in visualising a single-copy gene, the mammalian MUC4 gene could be an excellent model since the repetitive nature of the sequences in exon 2 and intron 3 maximises the detection of fluorescent signal from molecules targeted to this locus – e.g. through dCas9 or TALEs ( Chen et al., 2013) ( Figure 1). A similar approach may be feasible for other genes, such as FLG (encoding filaggrin), which also contain tandem repeats ( Brown et al., 2012).

There are several studies reporting repression of the MUC4 (and MUC1) genes upon binding by dCas9 ( Chen et al., 2013) and visualisation of MUC4 transcripts using dCas13 ( Yang et al., 2019). However, our data reported here suggest a re-examination of the conclusions from these reports is required in order to understand, for example, whether these discrepancies arise from breakthrough transcription in a very small fraction of cells, from differences between batches of cell lines, or from differences in the ability to detect transcription using different methods. Our data also suggest that other cell lines that robustly express mucin genes might be a better system to employ for live cell imaging studies.

Data availability

Underlying data

NCBI Gene Expression Omnibus: Sources of H3K4me3. Accession number GSM871043; https://identifiers.org/geo:GSM871043.

NCBI Gene Expression Omnibus: Sources of H3K36me3. Accession number GSM788076; https://identifiers.org/geo:GSM788076.

NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for Capan-1 and 2 cells. Accession number GSE79669; https://identifiers.org/geo:GSE79669.

NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for U2OS. Accession number GSE162163; https://identifiers.org/geo:GSE162163.

NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for RPE1. Accession number GSE98541; https://identifiers.org/geo:GSE98541.

NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for HeLa. Accession number GSE90235; https://identifiers.org/geo:GSE90235.

NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for HT1080. Accession number GSE78653; https://identifiers.org/geo:GSE78653.

NCBI Gene Expression Omnibus: Sources of nascent RNA-seq data for U-2OS. Accession number GSE162264; https://identifiers.org/geo:GSE162264.

NCBI Gene Expression Omnibus: Sources of nascent RNA-seq data for RPE1. Accession number GSE137448; https://identifiers.org/geo:GSE137448.

NCBI Gene Expression Omnibus: Sources of nascent RNA-seq data for HeLa. Accession number GSE128753; https://identifiers.org/geo:GSE128753.

Acknowledgements

NA conducted most of the experiments, data analysis and presentation, and was responsible for methodology and original draft preparation, IW assisted with and validated RT-PCR, SB assisted with FISH and karyotyping, SCB conducted analysis of publicly available datasets, WAB was responsible for conceptualization and funding acquisition, data presentation and wrote the manuscript. We thank Ling-Ling Chen (Shanghai Institute of Biochemistry and Cell Biology) and Bo Huang (University of California, San Francisco) for helpful discussion and for sharing data with us.

Funding Statement

NA was supported by a PhD studentship from the MRC National Productivity Investment Fund. SCB is supported by a postdoctoral clinical lectureship from NES/ Chief Scientist Office. WAB is supported by Wellcome [212203, <a href=https://doi.org/10.35802/217120>https://doi.org/10.35802/217120</a>] and by an MRC MRC University Unit grant (MC_UU_00007/2).

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 2; peer review: 2 approved]

References

  1. Alexander JM, Guan J, Li B, et al. : Live-cell imaging reveals enhancer-dependent Sox2 transcription in the absence of enhancer proximity. eLife. 2019;8:e41769. 10.7554/eLife.41769 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Brown SJ, Kroboth K, Sandilands A, et al. : Intragenic copy number variation within filaggrin contributes to the risk of atopic dermatitis with a dose-dependent effect. J Invest Dermatol. 2012;132(1):98–104. 10.1038/jid.2011.342 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Chaturvedi P, Singh AP, Batra SK: Structure, evolution, and biology of the MUC4 mucin. FASEB J. 2008;22(4):966–981. 10.1096/fj.07-9673rev [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chen B, Gilbert LA, Cimini BA, et al. : Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell. 2013;155(7):1479–1491. 10.1016/j.cell.2013.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cheng AW, Jillette N, Lee P, et al. : Casilio: A versatile CRISPR-Cas9-Pumilio hybrid for gene regulation and genomic labeling. Cell Res. 2016;26(2):254–257. 10.1038/cr.2016.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chubb JR, Boyle S, Perry P, et al. : Chromatin motion is constrained by association with nuclear compartments in human cells. Curr Biol. 2002;12(6):439–445. 10.1016/s0960-9822(02)00695-4 [DOI] [PubMed] [Google Scholar]
  7. de Almeida SF, Grosso AR, Koch F, et al. : Splicing enhances recruitment of methyltransferase HYPB/Setd2 and methylation of histone H3 Lys36. Nat Struct Mol Biol. 2011;18(9):977–983. 10.1038/nsmb.2123 [DOI] [PubMed] [Google Scholar]
  8. Fantes JA, Bickmore WA, Fletcher JM, et al. : Submicroscopic deletions at the WAGR locus, revealed by nonradioactive in situ hybridization. Am J Hum Genet. 1992;51(6):1286–1294. [PMC free article] [PubMed] [Google Scholar]
  9. Gao X, Tsang JCH, Gaba F, et al. : Comparison of TALE designer transcription factors and the CRISPR/dCas9 in regulation of gene expression by targeting enhancers. Nucl Acids Res. 2014;42(20):e155. 10.1093/nar/gku836 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Germier T, Kocanova S, Walther N, et al. : Real-Time Imaging of a Single Gene Reveals Transcription-Initiated Local Confinement. Biophys J. 2017;113(7):1383–1394. 10.1016/j.bpj.2017.08.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Gu B, Swigut T, Spencley A, et al. : Transcription-coupled changes in nuclear mobility of mammalian cis-regulatory elements. Science. 2018;359(6379):1050–1055. 10.1126/science.aao3136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Jacome A, Fernandez-Capetillo O: Lac operator repeats generate a traceable fragile site in mammalian cells. EMBO Rep. 2011;12(10):1032–1038. 10.1038/embor.2011.158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Jonckheere N, Perrais M, Mariette C, et al. : A role for human MUC4 mucin gene, the ErbB2 ligand, as a target of TGF-beta in pancreatic carcinogenesis. Oncogene. 2004;23(34):5729–5738. 10.1038/sj.onc.1207769 [DOI] [PubMed] [Google Scholar]
  14. Kim S, Kim H, Fong N, et al. : Pre-mRNA splicing is a determinant of histone H3K36 methylation. Proc Natl Acad Sci U S A. 2011;108(33):13564–13569. 10.1073/pnas.1109475108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Knight SC, Xie L, Deng W, et al. : Dynamics of CRISPR-Cas9 genome interrogation in living cells. Science. 2015;350(6262):823–826. 10.1126/science.aac6572 [DOI] [PubMed] [Google Scholar]
  16. Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001;25(4):402–408. 10.1006/meth.2001.1262 [DOI] [PubMed] [Google Scholar]
  17. Ma H, Tu LC, Naseri A, et al. : Multiplexed labeling of genomic loci with dCas9 and engineered sgRNAs using CRISPRainbow. Nat Biotechnol. 2016;34(5):528–30. 10.1038/nbt.3526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Miyanari Y, Ziegler-Birling C, Torres-Padilla ME: Live visualization of chromatin dynamics with fluorescent TALEs. Nat Struct Mol Biol. 2013;20(11):1321–1324. 10.1038/nsmb.2680 [DOI] [PubMed] [Google Scholar]
  19. Osorio-Valeriano M, Altegoer F, Steinchen W, et al. : ParB-type DNA Segregation Proteins Are CTP-Dependent Molecular Switches. Cell. 2019;179(7):1512–1524.e15. 10.1016/j.cell.2019.11.015 [DOI] [PubMed] [Google Scholar]
  20. Pollex T, Heard E: Nuclear positioning and pairing of X-chromosome inactivation centers are not primary determinants during initiation of random X-inactivation. Nat Genet. 2019;51(2):285–295. 10.1038/s41588-018-0305-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Qin P, Parlak M, Kuscu C, et al. : Live cell imaging of low- and non-repetitive chromosome loci using CRISPR-Cas9. Nat Comms. 2017;8:14725. 10.1038/ncomms14725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ren R, Deng L, Xue Y, et al. : Visualization of aging-associated chromatin alterations with an engineered TALE system. Cell Res. 2017;27(4):483–504. 10.1038/cr.2017.18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Shariati SA, Dominguez A, Xie S, et al. : Reversible Disruption of Specific Transcription Factor-DNA Interactions Using CRISPR/Cas9. Mol Cell. 2019;74(3):622–633.e4. 10.1016/j.molcel.2019.04.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Yang LZ, Wang Y, Li SQ, et al. : Dynamic Imaging of RNA in Living Cells by CRISPR-Cas13 Systems. Mol Cell. 2019;76(6):981–997.e7. 10.1016/j.molcel.2019.10.024 [DOI] [PubMed] [Google Scholar]
Wellcome Open Res. 2021 Nov 3. doi: 10.21956/wellcomeopenres.19041.r46425

Reviewer response for version 1

Jonathan Chubb 1

This is a very interesting contribution to the technology development of Cas9/Talen tools for imaging and controlling gene expression and chromatin dynamics. A variety of approaches have been used over the past 35 years, starting with the imaging of DAPI and EtBr stained nuclei, through to the integration of repeats of bacterial repeats and their detection using repeat binding proteins. These methods have limits in resolution and potential disruption of the underlying template. The potential to detect single loci and their products without disrupting the underlying locus promises more precise measurements of a more physiological chromatin state. Many initial studies using Talens and Cas9 tools have focussed on the MUC4 gene, as its repetitive nature allows the accumulation of more fluorescence at the site of transcription. This new study indicates that MUC4 is not expressed in most of the cell lines under study, which limits the interpretation of any chromatin dynamics inferred, and the magnitude of any unwanted effects on the endogenous locus inferred using Cas9 and Talens to image chromatin.

The manuscript is generally well written, and the conclusions unequivocal. The statistical analysis is limited, but the data are so clear, this probably warrants little consideration. The text in the legend for Figure 4E is missing and should be added. Perhaps also some discussion of an alternative target for imaging, such as an abundantly expressed gene with lots of repeats, such as Filaggrin.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Partly

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Single cell gene expression, imaging, development.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2021 Nov 17.
Wendy Bickmore 1

We thank the reviewer for pointing out the omission in the legend to Figure 4E, this has now been corrected in a revised version. We also thank the reviewer for pointing out other genes with internal tandem repeats, and we have now added a statement about alternative gene targets such as Filaggrin .

Wellcome Open Res. 2021 Oct 22. doi: 10.21956/wellcomeopenres.19041.r46423

Reviewer response for version 1

Thomas Sexton 1

This clearly written report convincingly shows lack of MUC4 expression in the cell lines used in previous live imaging studies, and the authors rightly highlight the care needed in developing and validating such studies in the future.

It would be nice if the authors could be a bit more explicit in their take on what this means for the potential impact of dCas9 and/or TALEs on expression of the genes they are being used to image. Are there other cases unrelated to MUC4 in the literature, showing transcription perturbation by these tools, and are they more believable? Or do the authors believe that the impact of the tools on transcription has not yet been adequately tested?

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Molecular biology; nuclear organisation

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2021 Nov 16.
Wendy Bickmore 1

We thank this reviewer for these comments. We do indeed think that the impact of dCas9 or TAL effector tools on DNA templated transactions, including transcription, have not been systematically tested to date.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Availability Statement

    Underlying data

    NCBI Gene Expression Omnibus: Sources of H3K4me3. Accession number GSM871043; https://identifiers.org/geo:GSM871043.

    NCBI Gene Expression Omnibus: Sources of H3K36me3. Accession number GSM788076; https://identifiers.org/geo:GSM788076.

    NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for Capan-1 and 2 cells. Accession number GSE79669; https://identifiers.org/geo:GSE79669.

    NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for U2OS. Accession number GSE162163; https://identifiers.org/geo:GSE162163.

    NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for RPE1. Accession number GSE98541; https://identifiers.org/geo:GSE98541.

    NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for HeLa. Accession number GSE90235; https://identifiers.org/geo:GSE90235.

    NCBI Gene Expression Omnibus: Sources of mRNA RNA-seq data for HT1080. Accession number GSE78653; https://identifiers.org/geo:GSE78653.

    NCBI Gene Expression Omnibus: Sources of nascent RNA-seq data for U-2OS. Accession number GSE162264; https://identifiers.org/geo:GSE162264.

    NCBI Gene Expression Omnibus: Sources of nascent RNA-seq data for RPE1. Accession number GSE137448; https://identifiers.org/geo:GSE137448.

    NCBI Gene Expression Omnibus: Sources of nascent RNA-seq data for HeLa. Accession number GSE128753; https://identifiers.org/geo:GSE128753.


    Articles from Wellcome Open Research are provided here courtesy of The Wellcome Trust

    RESOURCES