Abstract
A recent study proposed that severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) hijacks the LINE-1 (L1) retrotransposition machinery to integrate into the DNA of infected cells. If confirmed, this finding could have significant clinical implications. Here, we apply deep (>50×) long-read Oxford Nanopore Technologies (ONT) sequencing to HEK293T cells infected with SARS-CoV-2 and do not find the virus integrated into the genome. By examining ONT data from separate HEK293T cultivars, we completely resolve 78 L1 insertions arising in vitro in the absence of L1 overexpression systems. ONT sequencing applied to hepatitis B virus (HBV)-positive liver cancer tissues located a single HBV insertion. These experiments demonstrate reliable resolution of retrotransposon and exogenous virus insertions by ONT sequencing. That we find no evidence of SARS-CoV-2 integration suggests that such events are, at most, extremely rare in vivo and therefore are unlikely to drive oncogenesis or explain post-recovery detection of the virus.
Keywords: COVID-19, SARS-CoV-2, LINE-1, L1, retrotransposon, hepatitis B virus
Graphical abstract
In response to a recent claim that SARS-CoV-2 can be incorporated into human DNA by LINE-1 retrotransposon proteins, Smits et al. apply long-read DNA sequencing to cultured HEK293T cells infected with SARS-CoV-2. They find no evidence for genomic integration of the virus, despite demonstrated availability of the LINE-1 protein machinery.
Introduction
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a positive-sense single-stranded ∼30-kbp polyadenylated RNA betacoronavirus (V’kovski et al., 2021; Wu et al., 2020). SARS-CoV-2 does not encode a reverse transcriptase (RT) and therefore is not expected to integrate into genomic DNA as part of its life cycle. This assumption is of fundamental importance to the accurate diagnosis and potential long-term clinical consequences of SARS-CoV-2 infection, as demonstrated by other viruses known to incorporate into genomic DNA, such as human immunodeficiency virus 1 (HIV-1) and hepatitis B virus (HBV) (Bill and Summers, 2004; Fujimoto et al., 2012; Jiang et al., 2012; Nagaya et al., 1987). Notably, a recent work by Zhang et al. reported potential evidence of SARS-CoV-2 integration into the genome of infected human cells (Zhang et al., 2021). Prior analyses of mammalian genome sequences, as well as in vivo and in vitro experimental data, indicate single-stranded RNA viruses can act as templates for endogenous RTs (Belyi et al., 2010; Feschotte and Gilbert, 2012; Geuking et al., 2009; Horie et al., 2010; Kawasaki et al., 2021; Klenerman et al., 1997). These studies provide a conceptual basis to further investigate the genomic integration of SARS-CoV-2, as pursued by Zhang et al.
LINE-1 (L1) retrotransposons reside in all mammalian genomes (Kazazian and Moran, 2017). In humans, L1 transcribes a bicistronic mRNA encoding two proteins, namely, ORF1p and ORF2p, which are essential to L1 mobility (Moran et al., 1996). ORF2p possesses endonuclease (EN) and RT activities and exhibits a strong cis preference for reverse transcription of L1 mRNA (Doucet et al., 2015; Garcia-Perez et al., 2007; Kulpa and Moran, 2006; Monot et al., 2013; Moran et al., 1996; Wei et al., 2001). Nonetheless, the L1 protein machinery can trans mobilize polyadenylated cellular RNAs, especially those produced by Alu and SINE-VNTR-Alu (SVA) retrotransposons but also including protein-coding gene mRNAs (Dewannieux et al., 2003; Esnault et al., 2000; Garcia-Perez et al., 2007; Hancks et al., 2011; Raiz et al., 2012). Somatic L1 mobilization in cis is observed in embryonic cells, the neuronal lineage, and various cancers (Evrony et al., 2015; Feusier et al., 2019; Rodriguez-Martin et al., 2020; Sanchez-Luque et al., 2019; Schauer et al., 2018; Scott et al., 2016). In contrast, somatic L1-mediated trans mobilization is apparently rare in vivo (Evrony et al., 2015; Rodriguez-Martin et al., 2020; Sanchez-Luque et al., 2019) and is likely repressed by various mechanisms (Ahl et al., 2015; Deniz et al., 2019; Doucet et al., 2015; Ewing et al., 2020; Sanchez-Luque et al., 2019). Although Alu and, to a lesser extent, SVA are readily mobilized in cultured cell assays by L1 proteins, the same machinery produces less than one non-retrotransposon cellular RNA insertion for every 2,000 L1 insertions (Dewannieux et al., 2003; Hancks et al., 2011; Wei et al., 2001). Both cis and trans L1-mediated insertions incorporate target site duplications (TSDs) and a 3′ polyA tract and integrate at the degenerate L1 EN motif 5′-TTTT/AA (Dewannieux et al., 2003; Esnault et al., 2000; Garcia-Perez et al., 2007; Gilbert et al., 2005; Hancks et al., 2009; Jurka, 1997; Moran et al., 1996; Raiz et al., 2012). These sequence hallmarks can together discriminate artifacts from genuine insertions (Faulkner and Billon, 2018).
In their work, Zhang et al. overexpressed L1 in HEK293T cells, infected the cells with SARS-CoV-2, and identified DNA fragments of the virus through PCR amplification. These results, alongside other less direct (Kazachenka and Kassiotis, 2021; Yan et al., 2021) analyses, were interpreted as evidence of SARS-CoV-2 genomic integration. Crucially, Zhang et al. then detected 63 putative SARS-CoV-2 integrants by Oxford Nanopore Technologies (ONT) long-read sequencing. Of these integrants, only one on chromosome X was spanned by an ONT read aligned to one locus and was flanked by potential TSDs (Figure 1 A). However, this SARS-CoV-2 integrant did not incorporate a 3′ polyA tract, as is expected for an L1-mediated insertion, and involved an unusual 28-kb internal deletion of the SARS-CoV-2 sequence. Overall, the SARS-CoV-2 integrants reported by Zhang et al. were 26-fold enriched in exons, despite the L1 EN showing no preference for these regions (Flasch et al., 2019; Sultana et al., 2019). Zhang et al. also used Illumina short-read sequencing to map putative SARS-CoV-2 integration junctions in HEK293T cells without L1 overexpression. A lack of spanning reads and the tendency of Illumina library preparation to produce artifacts (Treiber and Waddell, 2017) leave this analysis open to interpretation.
The application of ONT sequencing to HEK293T cells nonetheless held conceptual merit. ONT reads can span germline and somatic retrotransposition events end-to-end and resolve the sequence hallmarks of L1-mediated integration (Ewing et al., 2020; Siudeja et al., 2021). Through this approach, we previously found two somatic L1 insertions in the liver tumor sample of an individual positive for hepatitis C virus (HCV), a ∼10-kbp positive-sense single-stranded non-polyadenylated RNA virus (Lauer and Walker, 2001), including one PCR-validated L1 insertion spanned by a single ONT read (Ewing et al., 2020; Shukla et al., 2013). Moreover, HEK293T cells are arguably a context favorable to L1 activity. They express L1 ORF1p (Philippe et al., 2016), accommodate engineered L1-mediated retrotransposition in cis and in trans (Hancks et al., 2011; Kubo et al., 2006; Niewiadomska et al., 2007; Sanchez-Luque et al., 2019), and support SARS-CoV-2 viral replication (Figure S1). Endogenous L1-mediated insertions can be detected in cell culture by genomic analysis of separate cultivars derived from a common population (Klawitter et al., 2016; Nguyen et al., 2018). Based on this experimental rationale, we sought to replicate the central findings of Zhang et al. and, after deeply ONT sequencing SARS-CoV-2-infected HEK293T cells, we did not detect SARS-CoV-2 genomic integration.
Results
We ONT sequenced (∼54× genome-wide depth, read length N50 of ∼39 kbp) genomic DNA harvested from HEK293T cells infected with SARS-CoV-2 at a multiplicity of infection (MOI) of 1.0, as well as mock-infected cells (∼28× depth, N50 of ∼47 kbp) (Figures 2 A and S1; Table S1). As positive controls, we ONT sequenced the tumor and non-tumor liver tissue of a HBV-positive hepatocellular carcinoma patient (Shukla et al., 2013). HBV is a DNA virus known to be integrated into sites of genomic damage by DNA double-strand break repair (Bill and Summers, 2004). As negative controls, we used the aforementioned HCV-positive hepatocellular carcinoma samples and a normal liver sample (Ewing et al., 2020; Table S1). We viewed HCV-infected samples as a suitable negative control because HCV and SARS-CoV-2 are both positive-sense single-stranded RNA viruses, and yet HCV is not polyadenylated and is therefore unlikely to attract the L1 machinery and has not been found to integrate into infected hepatocytes or liver tumor genomes (Fujimoto et al., 2012; Lauer and Walker, 2001). To these data, we added those of Zhang et al. and then used the transposons from Long DNA Reads (TLDR) (Ewing et al., 2020) software to call SARS-CoV-2, HBV, HCV, and retrotransposon insertions spanned by at least one uniquely aligned ONT read. TLDR detected no SARS-CoV-2, HBV, or HCV insertions.
In total, TLDR identified 575 non-reference human-specific L1 (L1HS) insertions, which were typically flanked by TSDs with a median length of 14 bp (Figure 2B; Table S2). No tumor-specific L1 insertions were found, apart from the two previously detected in the HCV-infected liver tumor (Ewing et al., 2020; Shukla et al., 2013). Seventy-eight L1 insertions were found only in our SARS-CoV-2-infected HEK293T cells (n = 66) or the mock infected control (n = 12) and produced TSDs with a median length of 14 bp (Figure 2B). Of the 78 events, 69 (88.5%) were detected by a single spanning read and 13 carried a 3′ transduction (Holmes et al., 1994; Moran et al., 1999; Table S2). After random downsampling, the more deeply sequenced SARS-CoV-2-infected HEK293T cells still had more than 2-fold more putative cultivar-specific L1 insertions than the mock-infected HEK293T cells. Next, we chose at random 6/69 L1 insertions detected by one spanning read for manual curation and PCR validation. All 6 L1 insertions bore a TSD and a 3′ polyA tract and integrated at a degenerate L1 EN motif (Figures 2D, 2E, and S2A–S2D). Three insertions were 5′ inverted (Kazazian et al., 1988; Ostertag and Kazazian, 2001; Figures 2E, S2C, and S2D) and one carried a 3′ transduction (Holmes et al., 1994) traced to a mobile (Rodriguez-Martin et al., 2020) full-length non-reference L1HS (Figure S2C). Three insertions PCR amplified in the SARS-CoV-2- and mock-infected samples (Figures 2D, 2E, and S2A) and three did not amplify in either sample (Figures S2B–S2D). The 6 integration sites were on average spanned by 86 reads not containing the L1 insertion (Figure S2E), a ratio (1:86) suggesting the L1s were absent from most cells. An additional analysis revealed 293 putative Alu (n = 291) and SVA (n = 2) insertions specific to either one of the HEK293T populations, with 290 of these found in the SARS-CoV-2 infected cells and 275 (93.9%) detected by a single spanning read (Table S2). The median TSD size for this cohort was 13 bp (Figure 2C). Altogether, these and earlier (Ewing et al., 2020; Siudeja et al., 2021) experiments show that lone spanning ONT reads can recover bona fide retrotransposition events and highlight endogenous L1 activity in HEK293T cells lacking L1 overexpression systems.
We next tested whether our computational analysis parameters excluded genuine HBV, HCV, or SARS-CoV-2 insertions. We directly aligned our ONT reads to the genome of the SARS-CoV-2 isolate (QLD002, GISAID EPI_ISL_407896) used here, as well as to a geographically diverse set of HBV and HCV genomes (Table S1) and a highly mobile L1HS sequence (Dombroski et al., 1993). In total, 3.6% of our ONT sequence bases aligned to L1HS, whereas no alignments to the SARS-CoV-2 or HCV genomes were observed (Figure 3 A). One read from the HBV-infected non-tumor liver sample aligned to 2,770 bp of a HBV genotype B isolate, and the remaining 2,901 bp aligned to an intergenic region of chromosome 2 (Figure 3B; Table S2). To validate this HBV insertion, we PCR amplified and capillary sequenced its 3′ junction (Figure 3B). The HBV sequence was linearized and rearranged (Figure 3B) as per prior reports (Fujimoto et al., 2012; Jiang et al., 2012; Nagaya et al., 1987). Direct inspection of ONT read alignments thus recovered a HBV integrant, which are found in ∼1 per 101–104 infected hepatocytes (Mason et al., 2016; Tu et al., 2018), and yet did not reveal reads alignable to the SARS-CoV-2 genome in our ONT datasets.
Reanalyzing the ONT data generated by Zhang et al., we found 555 reads (out of ∼12 million) that generated an alignment of ≥100 bp to the SARS-CoV-2 genome (Figure 3A). These reads (median length of 924 bp) were however 65.6% shorter than the overall dataset (2,686 kbp) and were comprised of a much higher average proportion of SARS-CoV-2 sequence (52.3%) than the proportion of L1HS sequence found in reads aligned to L1HS (17.1%). Of the 555 reads, 79 generated an alignment of ≥100 bp to the human genome, including 1 matching the aforementioned integrant on chromosome X that lacked a 3′ polyA tract (Figure 1A). An analysis of the corresponding 79 human genome alignment breakpoints, which could be interpreted as putative SARS-CoV-2 insertion points, as per Zhang et al., indicated 36.2% were exonic (Figure 3C). By comparison, 1.1% of all non-reference L1-mediated insertions reported here were exonic (Figure 3C), as were 1.3% and 1.7% of cultivar-specific L1 and Alu insertions, respectively (Figures 2B and 2C). Finally, we investigated why TLDR called neither of the two SARS-CoV-2 insertions highlighted by Zhang et al. (Figure 1) and found ambiguity in the ONT read alignments that supported these examples, leading to their exclusion. Specifically, the putative chromosome X insertion (Figure 1A) was filtered due to its short 5′ genomic flank alignment, whereas the chromosome 22 example was filtered because the corresponding ONT read alignment was marked as supplementary to another alignment on chromosome 1 (Figure 1B). These analyses confirmed that SARS-CoV-2 alignable reads were present in the Zhang et al. ONT dataset, and yet these reads were unusually short and could include molecular artifacts interpreted by Zhang et al. as SARS-CoV-2 integrants.
Discussion
We did not observe L1-mediated SARS-CoV-2 genomic integration in HEK293T cells, despite the availability of the L1 machinery (Hancks et al., 2011; Kubo et al., 2006; Niewiadomska et al., 2007; Philippe et al., 2016; Sanchez-Luque et al., 2019). We did detect L1, Alu, and SVA retrotransposition events. The higher number of L1 and Alu insertions found in our SARS-CoV-2-infected HEK293T cells is of potential interest given that viral infection can repress host factors limiting L1 activity (Hrecka et al., 2011; Laguette et al., 2011; Zhao et al., 2013). This preliminary finding perhaps indicates SARS-CoV-2 infection could increase L1 or Alu retrotransposition in vitro, a possibility requiring experimental confirmation. The comparative rarity of SVA insertions and the absence of SARS-CoV-2 insertions is however congruent with the relative frequencies of L1, Alu, SVA, and non-retrotransposon cellular RNA insertions driven by L1 proteins in prior cultured cell assays (Dewannieux et al., 2003; Hancks et al., 2011; Wei et al., 2001).
Our approach has several notable differences and caveats when compared to that of Zhang et al.. Each study used different SARS-CoV-2 isolates, and here, the MOI (1.0) was double that of Zhang et al. (MOI of 0.5). The high-molecular-weight DNA extraction method, ONT library preparation kit, and depth and quality of sequencing applied to HEK293T cells by Zhang et al. (standard isopropanol precipitation, SQK-LSK109 kit; ∼21× depth, N50 of ∼11 kbp) and here (Nanobind kit, SQK-LSK110 kit; ∼54× depth, N50 of ∼39 kbp) differed. Nevertheless, the DNA extraction protocols of each study would limit retention of extrachromosomal SARS-CoV-2 DNA potentially generated by ectopic L1 reverse transcription (Dhellin et al., 1997). The origins of the ONT reads aligned to the SARS-CoV-2 genome reported by Zhang et al. are therefore unclear in our view. Zhang et al. ONT sequenced only HEK293T cells transfected with an L1 expression plasmid, which human cells would not carry in vivo. We did not analyze SARS-CoV-2 patient samples, although, arguably, HEK293T cells present an environment far more conducive to L1 activity than those cells accessed in vivo by SARS-CoV-2 (Sungnak et al., 2020; Wiersinga et al., 2020). Widespread cell death post-infection also reduces the probability SARS-CoV-2 integrants would persist in the body (Karki et al., 2021; Varga et al., 2020). This view aligns with a very recent report of negligible SARS-CoV-2 DNA being detected by PCR in coronavirus disease 2019 (COVID-19) patient nasal swabs (Briggs et al., 2021).
Finally, the incredible enrichment reported by Zhang et al. for putative SARS-CoV-2 insertions in exons, which this and prior studies (Flasch et al., 2019; Sultana et al., 2019) have shown are not preferred by the L1 EN, contradicts the involvement of L1 in the events interpreted by Zhang et al. as SARS-CoV-2 genomic integrants. We conclude the L1 cis preference strongly disfavors SARS-CoV-2 retrotransposition, making the phenomenon mechanistically plausible but likely very rare, as for other polyadenylated non-retrotransposon cellular RNAs (Dewannieux et al., 2003; Doucet et al., 2015; Esnault et al., 2000; Garcia-Perez et al., 2007; Hancks et al., 2011; Kulpa and Moran, 2006; Monot et al., 2013; Moran et al., 1996; Wei et al., 2001).
STAR★Methods
Key resources table
REAGENT OR RESOURCE | SOURCE | IDENTIFIER |
---|---|---|
Bacterial and virus strains | ||
SARS-CoV-2 isolate hCoV-19/Australia/QLD02/2020 | Queensland Health Forensic and Scientific Services | EPI_ISL_407896 |
Biological samples | ||
Snap frozen hepatocellular carcinoma and matched non-tumor liver tissue | Centre Hépatobiliaire, Paul-Brousse Hospital | HCC32 |
Chemicals, peptides, and recombinant proteins | ||
SYBR Safe DNA Gel Stain | Invitrogen | S33102 |
Agarose | Bioline | BIO-41026 |
Carboxymethylcellulose | Sigma-Aldrich | C4888 |
KPL Milk Diluent/Blocking Solution Concentrate | SeraCare | 5140-0011 |
IRDye 800CW Goat anti-Mouse IgG antibody | LI-COR | 926-32210 |
Anti-SARS-CoV-2-spike antibody | Amarilla et al., 2021 | CR3022 |
Critical commercial assays | ||
Ligation Sequencing Kit | Oxford Nanopore Technologies | SQK-LSK110 |
Qubit dsDNA HS Assay Kit | Invitrogen | Q32851 |
Nanobind CBB Big DNA Kit | Circulomics | NB-900-001-01 |
Short Read Eliminator XS Kit | Circulomics | SS-100-121-01 |
Expand Long Template PCR System | Merck | 11681834001 |
MyTaq DNA Polymerase | Bioline | BIO-21105 |
Deposited data | ||
Nanopore WGS of mock and SARS-CoV-2 infected HEK293T cells, and HBV infected samples from liver cancer patient HCC32 | This paper | ENA: PRJEB44816 |
Nanopore WGS of SARS-CoV-2 infected HEK293T cells overexpressing L1 | Zhang et al., 2021 | SRA: PRJNA721333 |
Nanopore WGS of normal liver and HCV infected samples from liver cancer patient HCC33 | Ewing et al., 2020 | SRA: PRJNA629858 |
Experimental models: Cell lines | ||
Human embryonic kidney 293T (HEK293T) cells | ATCC | CRL-1568 |
Cercopithecus aethiops Vero E6 cells | ATCC | CRL-3216 |
Oligonucleotides | ||
Oligo sequences are shown in Table S2. | Integrated DNA Technologies | N/A |
Software and algorithms | ||
TLDR | https://github.com/adamewing/tldr | Ewing et al., 2020 |
Minimap2 | https://github.com/lh3/minimap2 | Li, 2018 |
SAMtools | https://github.com/samtools/ | Li et al., 2009 |
IGV | https://software.broadinstitute.org/software/igv/ | Robinson et al., 2011 |
Primer3 | https://bioinfo.ut.ee/primer3-0.4.0/ | Untergasser et al., 2012 |
Resource availability
Lead contact
Requests for further information and for resources and reagents should be directed to and will be fulfilled by the lead contact, Geoffrey J. Faulkner (faulknergj@gmail.com).
Materials availability
This study did not generate new unique reagents.
Experimental model and subject details
Liver tumor and non-tumor tissue were previously obtained from a HBV-positive patient (HCC32, male, 73yrs) who underwent surgical resection at the Centre Hepatobiliaire, Paul-Brousse Hospital, and made available for research purposes with approval from the French Institute of Medical Research and Health (Reference: 11-047). Further ethics approvals were provided by the Mater Health Services Human Research Ethics Committee (Reference: HREC-15-MHS-52) and the University of Queensland Medical Research Review Committee (Reference: 2014000221). HEK293T and Vero E6 cells were obtained from the American Type Culture Collection (ATCC).
Method details
SARS-CoV-2 infection of HEK293T cells
HEK293T cells and African green monkey kidney cells (Vero E6) were maintained in standard Dulbecco’s Modified Eagle Medium (DMEM). Culture media were supplemented with sodium pyruvate (11mg/L), penicillin (100U/mL), streptomycin (100 μg/mL) (P/S) and 10% fetal calf serum (FCS) (Bovogen, USA). Cells were maintained at 37°C with 5% CO2.
An early Australian SARS-CoV-2 isolate (hCoV-19/Australia/QLD02/2020; GISAID Accession EPI_ISL_407896) was sampled from patient nasopharyngeal aspirates by Queensland Health Forensic and Scientific Services and used to inoculate Vero E6 African green monkey kidney cells (passage 2). A viral stock (passage 3) was then generated on Vero E6 cells and stored at −80°C. Viral titration was determined by immuno-plaque assay (iPA), as previously described (Amarilla et al., 2021). To verify viral replication in HEK293T cells, a growth kinetic was assessed using a MOI of 0.01, 0.1 or 1.0, and showed efficient SARS-CoV-2 replication (Figure S1).
HEK293T viral infection was undertaken as follows: 3 × 106 HEK293T cells were seeded onto 6-well plates pre-coated with polylysine one day before infection. Cells were infected at MOI of 1 in 200μL of DMEM (2% FCS and P/S) and incubated for 30min at 37°C. Plates were rocked every 5min to ensure the monolayer remained covered with inoculum. The inoculum was then removed, and the monolayer washed five times with 1mL of additive-free DMEM. Finally, cells were maintained with 3mL of DMEM (supplemented with 2% fetal bovine serum and P/S) and incubated at 37°C with 5% CO2. Cell supernatant was harvested 0, 1, 2 and 3 days post-infection. The mock infected control differed only in that virus was not added to the inoculum media.
Genomic DNA was extracted from mock and SARS-CoV-2 infected (MOI 1.0) HEK293T cells sampled 2 days post-infection, using a Nanobind CBB Big DNA Kit (Circulomics) following the manufacturer’s instructions for high molecular weight (HMW) DNA extraction. DNA was eluted in elution buffer (10mM Tris-Cl, pH 8.5) and concentration measured by Qubit dsDNA High-Sensitivity Assay Kit on a Qubit Fluorometer (Life Technologies).
Hepatocellular carcinoma sample processing
DNA was extracted from the HCC32 tissues in our earlier study (Shukla et al., 2013) with a DNeasy Blood and Tissue Kit (QIAGEN, Germany) and stored at −80°C. To enrich for HMW DNA, 4.5 μg of DNA from the patient HCC32 tumor and non-tumor liver samples was diluted to 75ng/μL in a 1.5mL Eppendorf DNA LoBind tube and processed with a Short Read Eliminator XS Kit (Circulomics) following the manufacturer’s instructions.
ONT sequencing
DNA libraries were prepared at the Kinghorn Centre for Clinical Genomics (KCCG) using 3-4 μg HMW input DNA, without shearing, and a SQK-LSK110 ligation sequencing kit. 350-500ng of each prepared library was sequenced separately on one PromethION (Oxford Nanopore Technologies) flow cell (FLO-PRO002, R9.4.1 chemistry) (Table S1). SARS-CoV-2 infected HEK293T DNA was sequenced on two flow cells. Flow cells were washed (nuclease flush) and reloaded at 24hr and 48hr with 350-500ng of additional library to maximize output. Bases were called with guppy 4.0.11 (Oxford Nanopore Technologies).
ONT bioinformatic analyses
To call non-reference insertions with TLDR (Ewing et al., 2020), ONT reads generated here, by Zhang et al. (2021), and by our previous ONT study of human tissues (Ewing et al., 2020; Table S1) were aligned to the human reference genome build hg38 using minimap2 (Li, 2018) version 2.17 (index parameter: -x map-ont; alignment parameters: -ax map-ont -L -t 32) and SAMtools (Li et al., 2009) version 1.12. BAM files were then processed as a group with TLDR (Ewing et al., 2020) version 1.1 (parameters -e virus.fa -p 128 -m 1–max_te_len 40000–max_cluster_size 100–min_te_len 100–keep_pickles -n nonref.collection.hg38.chr.bed.gz). The file virus.fa was composed of: representative HBV and HCV isolate genomes (Table S1), the SARS-CoV-2 isolate used here (GISAID Accession EPI_ISL_407896), the L1HS sequence L1.3 (Dombroski et al., 1993) (GenBank Accession L19088), several Alu and SVA subfamily consensus sequences, and a consensus sequence for human endogenous retrovirus K (HERVK), the youngest human long terminal repeat (LTR) retrotransposon family. The file nonref.collection.hg38.chr.bed.gz is a collection of known non-reference retrotransposon insertions available from https://github.com/adamewing/tldr/. The TLDR output table was further processed to remove calls not passing all TLDR filters, representing homopolymer insertions, where MedianMapQ < 50 or family = “NA” or remappable = “FALSE” or UnmapCover < 0.75 or LengthIns < 100 or EndTE-StartTE < 100 or strand = “None” or SpanReads < 1. As 3ʹ truncation is rarely encountered for L1-mediated insertions, calls where EndTE was more than 10bp less than the consensus length were filtered, as were Alu insertions 5ʹ truncated by more than 1bp. The filtered TLDR output table is provided as Table S2. Insertions detected in only our mock or SARS-CoV-2 infected HEK293T datasets, but not in both experiments, and not matching a known non-reference insertion, were designated as putative cultivar-specific insertions (Table S2). Many if not most of these insertions were likely to have occurred in cell culture prior to the cultivars being separated.
To identify L1HS and viral sequences, we directly aligned all reads to the virus.fa file with minimap2 (index parameter: -x map-ont; alignment parameters: -ax map-ont -L -t 32). Reads containing alignments of ≥ 100bp to a sequence present in virus.fa were counted with SAMtools idxstats. Alignments to HBV, HCV or SARS-CoV-2 were excluded if they overlapped by ≥ 10bp with a genomic alignment of ≥ 100bp. Read alignments were visualized with SAMtools view and the Integrative Genomics Viewer (Robinson et al., 2011) version 2.8.6.
PCR validation
We used Primer3 (Untergasser et al., 2012) to design PCR primers for 6 L1 insertions found by a single spanning ONT read, using the reference genome and L1HS sequences as inputs (Table S2). These validation experiments were conducted in three phases. First, we performed an “empty/filled site” PCR using primers positioned on either side of the L1, where the filled site is the L1 allele, and the empty site is the remaining allele(s). Each empty/filled reaction was performed using a DNA Engine Tetrad 2 Thermal Cycler (Bio-Rad) and Expand Long Range Enzyme Mix, with 1X Expand Long Range Buffer with MgCl2, 50pmol of each primer, 0.5mM dNTPs, 5% DMSO, 100ng of template DNA and 1.75U of enzyme, in a 25 μL final volume. PCR cycling conditions were as follows: (92°C, 3min) × 1; (92°C, 30sec; 54-57°C, 30sec; 68°C, 7min) × 10; (92°C, 30sec; 52-55°C, 30sec; 68°C, 7min + 20sec/cycle) × 30; (68°C, 10min; 4°C, hold) × 1. Amplicons were visualized on a 1% agarose gel stained with SYBR SAFE (Invitrogen). GeneRuler™ 1kb plus (Thermo Scientific) was used as the ladder. Second, we combined each empty/filled primer with a primer positioned within the L1 sequence, to amplify the 5ʹ and 3ʹ L1-genome junctions. These reactions were undertaken on a DNA Engine Tetrad 2 Thermal Cycler (Bio-Rad), with MyTaq HS DNA polymerase, 1X MyTaq Reaction Buffer, 10pmol of each primer, 10ng of template DNA, and 2.5U of enzyme, in a 25 μL final volume. PCR cycling conditions were as follows: (95°C, 1min) × 1; (95°C, 15sec; 53-55°C, 15sec; 72°C, 15sec) × 35; (72°C, 5min; 4°C, hold) × 1. Amplicons were visualized on a 1.5% agarose gel stained with SYBR SAFE (Invitrogen). Third, we repeated the 5ʹ L1-genome junction-specific PCR using 200ng template DNA. All PCRs were performed with non-template control, as well as DNA extracted from the same HEK293T cells (SARS-CoV-2 and mock) subjected to genomic analysis. Notably, L1 insertions that did not amplify in either cultivar were still likely to be genuine events as they carried all of the relevant sequence hallmarks of L1-mediated retrotransposition.
PCR primers for the HBV insertion 3ʹ junction (Figure 3B; Table S2) were designed with Primer3 using the reference genome and closest match HBV sequence (GenBank accession AB602818) as inputs. PCR amplification and capillary sequencing was conducted as per the L1 insertions, except using Expand Long Range polymerase (Roche) with 1X Expand Long Range buffer with MgCl2, 10pmol of each primer, 100ng of template DNA, 500μM of PCR Nucleotide Mix, and 3.5U of enzyme, in a 25 μL final volume. PCR cycling conditions were as follows: (92°C, 2min) × 1; (92°C, 15sec; 65°C, 15sec; 68°C, 7:30min) × 10; (92°C, 15sec; 65°C, 15sec; 68°C, 7min+ 20sec per cycle) × 35 (68°C, 10min; 4°C, hold) × 1. Amplicons were visualized on a 1.2% agarose gel.
Amplicons in each experiment were visualized using a GelDoc (Bio-Rad) and, if of the correct size, gel extracted using a QIAGEN MinElute Gel Extraction Kit and capillary sequenced by the Australian Genomics Research Facility (Brisbane).
Quantification and statistical analysis
Error bars and replicate values are defined in figure legends, where appropriate. No statistical tests for significance were conducted.
Acknowledgments
The authors thank the human subjects of this study who donated tissues to the Centre Hépatobiliaire, Paul-Brousse Hospital. We thank S. Richardson and R. Shukla for helpful discussions, K. Chappell and P. Young for project support, and Queensland Health for providing the SARS-CoV-2 virus isolate QLD02. This study was funded by an Australian Government Research Training Program (RTP) Scholarships (N.S.), an NHMRC-ARC Dementia Research Development Fellowship (GNT1108258, G.O.B.), seed funding provided by the Australian Infectious Disease Research Centre to establish SARS-CoV-2 research at the University of Queensland (A.A.K), an Australian Department of Health Medical Research Future Fund (MRFF) Novel Coronavirus Vaccine Development Grant (APP1202445-2020, D.W.), an MRFF Investigator Grant (MRF1175457, A.D.E.), an NHMRC Investigator Grant (GNT1173711, G.J.F.), a CSL Centenary Fellowship (G.J.F.), and the Mater Foundation.
Author contributions
N.S., J.R., G.O.B., A.A.A., P.G., F.J.S.-L., P.A., N.M., and B.L. performed experiments and analyzed data. A.D.E. and G.J.F. performed bioinformatic analysis. J.F., I.W.D., A.A.K., D.W., and G.J.F. provided resources. G.J.F. designed the project and wrote the manuscript.
Declaration of interests
The authors declare no competing interests.
Published: July 28, 2021
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.celrep.2021.109530.
Supplemental information
Data and code availability
Oxford Nanopore Technologies sequencing data generated by this study were deposited in the European Nucleotide Archive (ENA) under project PRJEB44816. TLDR and instructions for its use and application are available at https://github.com/adamewing/TLDR.
References
- Ahl V., Keller H., Schmidt S., Weichenrieder O. Retrotransposition and Crystal Structure of an Alu RNP in the Ribosome-Stalling Conformation. Mol. Cell. 2015;60:715–727. doi: 10.1016/j.molcel.2015.10.003. [DOI] [PubMed] [Google Scholar]
- Amarilla A.A., Modhiran N., Setoh Y.X., Peng N.Y.G., Sng J.D.J., Liang B., McMillan C.L.D., Freney M.E., Cheung S.T.M., Chappell K.J. An optimized high-throughput immuno-plaque assay for SARS-CoV-2. Front. Microbiol. 2021;12:625136. doi: 10.3389/fmicb.2021.625136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Belyi V.A., Levine A.J., Skalka A.M. Unexpected inheritance: multiple integrations of ancient bornavirus and ebolavirus/marburgvirus sequences in vertebrate genomes. PLoS Pathog. 2010;6:e1001030. doi: 10.1371/journal.ppat.1001030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bill C.A., Summers J. Genomic DNA double-strand breaks are targets for hepadnaviral DNA integration. Proc. Natl. Acad. Sci. USA. 2004;101:11135–11140. doi: 10.1073/pnas.0403925101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Briggs E., Ward W., Rey S., Law D., Nelson K., Bois M., Ostrov N., Lee H.H., Laurent J.M., Mita P. Assessment of potential SARS-CoV-2 virus N gene integration into human genome reveals no significant impact on RT-qPCR COVID-19 diagnostic testing. medRxiv. 2021 doi: 10.1101/2021.06.21.21258023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deniz Ö., Frost J.M., Branco M.R. Regulation of transposable elements by DNA modifications. Nat. Rev. Genet. 2019;20:417–431. doi: 10.1038/s41576-019-0106-6. [DOI] [PubMed] [Google Scholar]
- Dewannieux M., Esnault C., Heidmann T. LINE-mediated retrotransposition of marked Alu sequences. Nat. Genet. 2003;35:41–48. doi: 10.1038/ng1223. [DOI] [PubMed] [Google Scholar]
- Dhellin O., Maestre J., Heidmann T. Functional differences between the human LINE retrotransposon and retroviral reverse transcriptases for in vivo mRNA reverse transcription. EMBO J. 1997;16:6590–6602. doi: 10.1093/emboj/16.21.6590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dombroski B.A., Scott A.F., Kazazian H.H., Jr. Two additional potential retrotransposons isolated from a human L1 subfamily that contains an active retrotransposable element. Proc. Natl. Acad. Sci. USA. 1993;90:6513–6517. doi: 10.1073/pnas.90.14.6513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doucet A.J., Wilusz J.E., Miyoshi T., Liu Y., Moran J.V. A 3′ Poly(A) Tract Is Required for LINE-1 Retrotransposition. Mol. Cell. 2015;60:728–741. doi: 10.1016/j.molcel.2015.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Esnault C., Maestre J., Heidmann T. Human LINE retrotransposons generate processed pseudogenes. Nat. Genet. 2000;24:363–367. doi: 10.1038/74184. [DOI] [PubMed] [Google Scholar]
- Evrony G.D., Lee E., Mehta B.K., Benjamini Y., Johnson R.M., Cai X., Yang L., Haseley P., Lehmann H.S., Park P.J., Walsh C.A. Cell lineage analysis in human brain using endogenous retroelements. Neuron. 2015;85:49–59. doi: 10.1016/j.neuron.2014.12.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ewing A.D., Smits N., Sanchez-Luque F.J., Faivre J., Brennan P.M., Richardson S.R., Cheetham S.W., Faulkner G.J. Nanopore Sequencing Enables Comprehensive Transposable Element Epigenomic Profiling. Mol. Cell. 2020;80:915–928.e5. doi: 10.1016/j.molcel.2020.10.024. [DOI] [PubMed] [Google Scholar]
- Faulkner G.J., Billon V. L1 retrotransposition in the soma: a field jumping ahead. Mob. DNA. 2018;9:22. doi: 10.1186/s13100-018-0128-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feschotte C., Gilbert C. Endogenous viruses: insights into viral evolution and impact on host biology. Nat. Rev. Genet. 2012;13:283–296. doi: 10.1038/nrg3199. [DOI] [PubMed] [Google Scholar]
- Feusier J., Watkins W.S., Thomas J., Farrell A., Witherspoon D.J., Baird L., Ha H., Xing J., Jorde L.B. Pedigree-based estimation of human mobile element retrotransposition rates. Genome Res. 2019;29:1567–1577. doi: 10.1101/gr.247965.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flasch D.A., Macia Á., Sánchez L., Ljungman M., Heras S.R., García-Pérez J.L., Wilson T.E., Moran J.V. Genome-wide de novo L1 Retrotransposition Connects Endonuclease Activity with Replication. Cell. 2019;177:837–851. doi: 10.1016/j.cell.2019.02.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fujimoto A., Totoki Y., Abe T., Boroevich K.A., Hosoda F., Nguyen H.H., Aoki M., Hosono N., Kubo M., Miya F. Whole-genome sequencing of liver cancers identifies etiological influences on mutation patterns and recurrent mutations in chromatin regulators. Nat. Genet. 2012;44:760–764. doi: 10.1038/ng.2291. [DOI] [PubMed] [Google Scholar]
- Garcia-Perez J.L., Doucet A.J., Bucheton A., Moran J.V., Gilbert N. Distinct mechanisms for trans-mediated mobilization of cellular RNAs by the LINE-1 reverse transcriptase. Genome Res. 2007;17:602–611. doi: 10.1101/gr.5870107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geuking M.B., Weber J., Dewannieux M., Gorelik E., Heidmann T., Hengartner H., Zinkernagel R.M., Hangartner L. Recombination of retrotransposon and exogenous RNA virus results in nonretroviral cDNA integration. Science. 2009;323:393–396. doi: 10.1126/science.1167375. [DOI] [PubMed] [Google Scholar]
- Gilbert N., Lutz S., Morrish T.A., Moran J.V. Multiple fates of L1 retrotransposition intermediates in cultured human cells. Mol. Cell. Biol. 2005;25:7780–7795. doi: 10.1128/MCB.25.17.7780-7795.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hancks D.C., Ewing A.D., Chen J.E., Tokunaga K., Kazazian H.H., Jr. Exon-trapping mediated by the human retrotransposon SVA. Genome Res. 2009;19:1983–1991. doi: 10.1101/gr.093153.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hancks D.C., Goodier J.L., Mandal P.K., Cheung L.E., Kazazian H.H., Jr. Retrotransposition of marked SVA elements by human L1s in cultured cells. Hum. Mol. Genet. 2011;20:3386–3400. doi: 10.1093/hmg/ddr245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holmes S.E., Dombroski B.A., Krebs C.M., Boehm C.D., Kazazian H.H., Jr. A new retrotransposable human L1 element from the LRE2 locus on chromosome 1q produces a chimaeric insertion. Nat. Genet. 1994;7:143–148. doi: 10.1038/ng0694-143. [DOI] [PubMed] [Google Scholar]
- Horie M., Honda T., Suzuki Y., Kobayashi Y., Daito T., Oshida T., Ikuta K., Jern P., Gojobori T., Coffin J.M., Tomonaga K. Endogenous non-retroviral RNA virus elements in mammalian genomes. Nature. 2010;463:84–87. doi: 10.1038/nature08695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hrecka K., Hao C., Gierszewska M., Swanson S.K., Kesik-Brodacka M., Srivastava S., Florens L., Washburn M.P., Skowronski J. Vpx relieves inhibition of HIV-1 infection of macrophages mediated by the SAMHD1 protein. Nature. 2011;474:658–661. doi: 10.1038/nature10195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang Z., Jhunjhunwala S., Liu J., Haverty P.M., Kennemer M.I., Guan Y., Lee W., Carnevali P., Stinson J., Johnson S. The effects of hepatitis B virus integration into the genomes of hepatocellular carcinoma patients. Genome Res. 2012;22:593–601. doi: 10.1101/gr.133926.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jurka J. Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons. Proc. Natl. Acad. Sci. USA. 1997;94:1872–1877. doi: 10.1073/pnas.94.5.1872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karki R., Sharma B.R., Tuladhar S., Williams E.P., Zalduondo L., Samir P., Zheng M., Sundaram B., Banoth B., Malireddi R.K.S. Synergism of TNF-α and IFN-γ Triggers Inflammatory Cell Death, Tissue Damage, and Mortality in SARS-CoV-2 Infection and Cytokine Shock Syndromes. Cell. 2021;184:149–168. doi: 10.1016/j.cell.2020.11.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kawasaki J., Kojima S., Mukai Y., Tomonaga K., Horie M. 100-My history of bornavirus infections hidden in vertebrate genomes. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2026235118. e2026235118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kazachenka A., Kassiotis G. SARS-CoV-2-Host Chimeric RNA-Sequencing Reads Do Not Necessarily Arise From Virus Integration Into the Host DNA. Front. Microbiol. 2021;12:676693. doi: 10.3389/fmicb.2021.676693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kazazian H.H., Jr., Moran J.V. Mobile DNA in Health and Disease. N. Engl. J. Med. 2017;377:361–370. doi: 10.1056/NEJMra1510092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kazazian H.H., Jr., Wong C., Youssoufian H., Scott A.F., Phillips D.G., Antonarakis S.E. Haemophilia A resulting from de novo insertion of L1 sequences represents a novel mechanism for mutation in man. Nature. 1988;332:164–166. doi: 10.1038/332164a0. [DOI] [PubMed] [Google Scholar]
- Klawitter S., Fuchs N.V., Upton K.R., Muñoz-Lopez M., Shukla R., Wang J., Garcia-Cañadas M., Lopez-Ruiz C., Gerhardt D.J., Sebe A. Reprogramming triggers endogenous L1 and Alu retrotransposition in human induced pluripotent stem cells. Nat. Commun. 2016;7:10286. doi: 10.1038/ncomms10286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klenerman P., Hengartner H., Zinkernagel R.M. A non-retroviral RNA virus persists in DNA form. Nature. 1997;390:298–301. doi: 10.1038/36876. [DOI] [PubMed] [Google Scholar]
- Kubo S., Seleme M.C., Soifer H.S., Perez J.L.G., Moran J.V., Kazazian H.H., Jr., Kasahara N. L1 retrotransposition in nondividing and primary human somatic cells. Proc. Natl. Acad. Sci. USA. 2006;103:8036–8041. doi: 10.1073/pnas.0601954103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kulpa D.A., Moran J.V. Cis-preferential LINE-1 reverse transcriptase activity in ribonucleoprotein particles. Nat. Struct. Mol. Biol. 2006;13:655–660. doi: 10.1038/nsmb1107. [DOI] [PubMed] [Google Scholar]
- Laguette N., Sobhian B., Casartelli N., Ringeard M., Chable-Bessia C., Ségéral E., Yatim A., Emiliani S., Schwartz O., Benkirane M. SAMHD1 is the dendritic- and myeloid-cell-specific HIV-1 restriction factor counteracted by Vpx. Nature. 2011;474:654–657. doi: 10.1038/nature10117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lauer G.M., Walker B.D. Hepatitis C virus infection. N. Engl. J. Med. 2001;345:41–52. doi: 10.1056/NEJM200107053450107. [DOI] [PubMed] [Google Scholar]
- Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mason W.S., Gill U.S., Litwin S., Zhou Y., Peri S., Pop O., Hong M.L.W., Naik S., Quaglia A., Bertoletti A., Kennedy P.T. HBV DNA Integration and Clonal Hepatocyte Expansion in Chronic Hepatitis B Patients Considered Immune Tolerant. Gastroenterology. 2016;151:986–998. doi: 10.1053/j.gastro.2016.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monot C., Kuciak M., Viollet S., Mir A.A., Gabus C., Darlix J.-L., Cristofari G. The specificity and flexibility of l1 reverse transcription priming at imperfect T-tracts. PLoS Genet. 2013;9:e1003499. doi: 10.1371/journal.pgen.1003499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moran J.V., Holmes S.E., Naas T.P., DeBerardinis R.J., Boeke J.D., Kazazian H.H., Jr. High frequency retrotransposition in cultured mammalian cells. Cell. 1996;87:917–927. doi: 10.1016/s0092-8674(00)81998-4. [DOI] [PubMed] [Google Scholar]
- Moran J.V., DeBerardinis R.J., Kazazian H.H., Jr. Exon shuffling by L1 retrotransposition. Science. 1999;283:1530–1534. doi: 10.1126/science.283.5407.1530. [DOI] [PubMed] [Google Scholar]
- Nagaya T., Nakamura T., Tokino T., Tsurimoto T., Imai M., Mayumi T., Kamino K., Yamamura K., Matsubara K. The mode of hepatitis B virus DNA integration in chromosomes of human hepatocellular carcinoma. Genes Dev. 1987;1:773–782. doi: 10.1101/gad.1.8.773. [DOI] [PubMed] [Google Scholar]
- Nguyen T.H.M., Carreira P.E., Sanchez-Luque F.J., Schauer S.N., Fagg A.C., Richardson S.R., Davies C.M., Jesuadian J.S., Kempen M.H.C., Troskie R.-L. L1 Retrotransposon Heterogeneity in Ovarian Tumor Cell Evolution. Cell Rep. 2018;23:3730–3740. doi: 10.1016/j.celrep.2018.05.090. [DOI] [PubMed] [Google Scholar]
- Niewiadomska A.M., Tian C., Tan L., Wang T., Sarkis P.T.N., Yu X.-F. Differential inhibition of long interspersed element 1 by APOBEC3 does not correlate with high-molecular-mass-complex formation or P-body association. J. Virol. 2007;81:9577–9583. doi: 10.1128/JVI.02800-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ostertag E.M., Kazazian H.H., Jr. Twin priming: a proposed mechanism for the creation of inversions in L1 retrotransposition. Genome Res. 2001;11:2059–2065. doi: 10.1101/gr.205701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Philippe C., Vargas-Landin D.B., Doucet A.J., van Essen D., Vera-Otarola J., Kuciak M., Corbin A., Nigumann P., Cristofari G. Activation of individual L1 retrotransposon instances is restricted to cell-type dependent permissive loci. eLife. 2016;5:e13926. doi: 10.7554/eLife.13926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raiz J., Damert A., Chira S., Held U., Klawitter S., Hamdorf M., Löwer J., Strätling W.H., Löwer R., Schumann G.G. The non-autonomous retrotransposon SVA is trans-mobilized by the human LINE-1 protein machinery. Nucleic Acids Res. 2012;40:1666–1683. doi: 10.1093/nar/gkr863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson J.T., Thorvaldsdóttir H., Winckler W., Guttman M., Lander E.S., Getz G., Mesirov J.P. Integrative genomics viewer. Nat. Biotechnol. 2011;29:24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rodriguez-Martin B., Alvarez E.G., Baez-Ortega A., Zamora J., Supek F., Demeulemeester J., Santamarina M., Ju Y.S., Temes J., Garcia-Souto D., PCAWG Structural Variation Working Group. PCAWG Consortium Pan-cancer analysis of whole genomes identifies driver rearrangements promoted by LINE-1 retrotransposition. Nat. Genet. 2020;52:306–319. doi: 10.1038/s41588-019-0562-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sanchez-Luque F.J., Kempen M.H.C., Gerdes P., Vargas-Landin D.B., Richardson S.R., Troskie R.-L., Jesuadian J.S., Cheetham S.W., Carreira P.E., Salvador-Palomeque C. LINE-1 Evasion of Epigenetic Repression in Humans. Mol. Cell. 2019;75:590–604. doi: 10.1016/j.molcel.2019.05.024. [DOI] [PubMed] [Google Scholar]
- Schauer S.N., Carreira P.E., Shukla R., Gerhardt D.J., Gerdes P., Sanchez-Luque F.J., Nicoli P., Kindlova M., Ghisletti S., Santos A.D. L1 retrotransposition is a common feature of mammalian hepatocarcinogenesis. Genome Res. 2018;28:639–653. doi: 10.1101/gr.226993.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scott E.C., Gardner E.J., Masood A., Chuang N.T., Vertino P.M., Devine S.E. A hot L1 retrotransposon evades somatic repression and initiates human colorectal cancer. Genome Res. 2016;26:745–755. doi: 10.1101/gr.201814.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shukla R., Upton K.R., Muñoz-Lopez M., Gerhardt D.J., Fisher M.E., Nguyen T., Brennan P.M., Baillie J.K., Collino A., Ghisletti S. Endogenous retrotransposition activates oncogenic pathways in hepatocellular carcinoma. Cell. 2013;153:101–111. doi: 10.1016/j.cell.2013.02.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siudeja K., van den Beek M., Riddiford N., Boumard B., Wurmser A., Stefanutti M., Lameiras S., Bardin A.J. Unraveling the features of somatic transposition in the Drosophila intestine. EMBO J. 2021;40:e106388. doi: 10.15252/embj.2020106388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sultana T., van Essen D., Siol O., Bailly-Bechet M., Philippe C., Zine El Aabidine A., Pioger L., Nigumann P., Saccani S., Andrau J.-C. The Landscape of L1 Retrotransposons in the Human Genome Is Shaped by Pre-insertion Sequence Biases and Post-insertion Selection. Mol. Cell. 2019;74:555–570. doi: 10.1016/j.molcel.2019.02.036. [DOI] [PubMed] [Google Scholar]
- Sungnak W., Huang N., Bécavin C., Berg M., Queen R., Litvinukova M., Talavera-López C., Maatz H., Reichart D., Sampaziotis F., HCA Lung Biological Network SARS-CoV-2 entry factors are highly expressed in nasal epithelial cells together with innate immune genes. Nat. Med. 2020;26:681–687. doi: 10.1038/s41591-020-0868-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Treiber C.D., Waddell S. Resolving the prevalence of somatic transposition in Drosophila. eLife. 2017;6:e28297. doi: 10.7554/eLife.28297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tu T., Budzinska M.A., Vondran F.W.R., Shackel N.A., Urban S. Hepatitis B virus DNA integration occurs early in the viral life cycle in an in vitro infection model via sodium taurocholate cotransporting polypeptide-dependent uptake of enveloped virus particles. J. Virol. 2018;92:e02007–e02017. doi: 10.1128/JVI.02007-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Untergasser A., Cutcutache I., Koressaar T., Ye J., Faircloth B.C., Remm M., Rozen S.G. Primer3—new capabilities and interfaces. Nucleic Acids Res. 2012;40:e115. doi: 10.1093/nar/gks596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- V’kovski P., Kratzel A., Steiner S., Stalder H., Thiel V. Coronavirus biology and replication: implications for SARS-CoV-2. Nat. Rev. Microbiol. 2021;19:155–170. doi: 10.1038/s41579-020-00468-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Varga Z., Flammer A.J., Steiger P., Haberecker M., Andermatt R., Zinkernagel A.S., Mehra M.R., Schuepbach R.A., Ruschitzka F., Moch H. Endothelial cell infection and endotheliitis in COVID-19. Lancet. 2020;395:1417–1418. doi: 10.1016/S0140-6736(20)30937-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei W., Gilbert N., Ooi S.L., Lawler J.F., Ostertag E.M., Kazazian H.H., Boeke J.D., Moran J.V. Human L1 retrotransposition: cis preference versus trans complementation. Mol. Cell. Biol. 2001;21:1429–1439. doi: 10.1128/MCB.21.4.1429-1439.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiersinga W.J., Rhodes A., Cheng A.C., Peacock S.J., Prescott H.C. Pathophysiology, Transmission, Diagnosis, and Treatment of Coronavirus Disease 2019 (COVID-19): A Review. JAMA. 2020;324:782–793. doi: 10.1001/jama.2020.12839. [DOI] [PubMed] [Google Scholar]
- Wu F., Zhao S., Yu B., Chen Y.-M., Wang W., Song Z.-G., Hu Y., Tao Z.-W., Tian J.-H., Pei Y.-Y. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yan B., Chakravorty S., Mirabelli C., Wang L., Trujillo-Ochoa J.L., Chauss D., Kumar D., Lionakis M.S., Olson M.R., Wobus C.E. Host-Virus Chimeric Events in SARS-CoV-2-Infected Cells Are Infrequent and Artifactual. J. Virol. 2021;95:e0029421. doi: 10.1128/JVI.00294-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang L., Richards A., Barrasa M.I., Hughes S.H., Young R.A., Jaenisch R. Reverse-transcribed SARS-CoV-2 RNA can integrate into the genome of cultured human cells and can be expressed in patient-derived tissues. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2105968118. e2105968118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao K., Du J., Han X., Goodier J.L., Li P., Zhou X., Wei W., Evans S.L., Li L., Zhang W. Modulation of LINE-1 and Alu/SVA retrotransposition by Aicardi-Goutières syndrome-related SAMHD1. Cell Rep. 2013;4:1108–1115. doi: 10.1016/j.celrep.2013.08.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Oxford Nanopore Technologies sequencing data generated by this study were deposited in the European Nucleotide Archive (ENA) under project PRJEB44816. TLDR and instructions for its use and application are available at https://github.com/adamewing/TLDR.