Abstract
The methylation index of the LINE-1 promoter is one of the most commonly used markers for assessing the global level of genome methylation in various human cells and tissues. We developed an NGS-based protocol for DNA methylation analysis of the LINE-1 retrotransposon promoter. This approach allows assessment of the DNA methylation index of 19 CpG sites in the LINE-1 promoter that have the highest tissue- or tumor-specific variability. The method provides a DNA methylation profile for analyzing either the methylation index of each CpG site independently or the mean DNA methylation index across the LINE-1 promoter. The results obtained using the developed method corresponded well to the level of methylation assessed using a commercially available kit for DNA pyrosequencing. In addition, our method provides much more information: 1) the DNA methylation profile of a significant part of the LINE-1 promoter and 2) the level of DNA methylation at individual LINE-1 loci in the genome. The method of targeted bisulfite massive parallel sequencing of the human LINE-1 retrotransposon promoter can be used in large-scale studies of the global level of genome methylation in normal human cells or tumors.
To accomplish this, we modified the targeted massive parallel sequencing method based on 16S Metagenomic Sequencing Library Preparation protocol (Illumina, USA) by:
-
•
Introduction of the stage of bisulfite conversion of DNA.
-
•
Development of specific primers for the LINE-1 sequence.
Keywords: LINE-1 retrotransposon, DNA methylation, Targeted bisulfite massive parallel sequencing
Graphical abstract
Specifications table
Subject area: | Biochemistry, Genetics and Molecular Biology |
More specific subject area: | Epigenetics |
Method name: | NGS-based LINE-1 retrotransposon methylation analysis |
Name and reference of original method: | 16S Metagenomic Sequencing Library Preparation: Preparing 16S Ribosomal RNA Gene Amplicons for the Illumina MiSeq System (Official Illumina Protocol) |
Resource availability: | Reagents:
|
Method details
Overview
The methylation index of the long interspersed nuclear element 1 (LINE-1) retrotransposon, which occupies approximately 17% of the human genome, is often used as a marker of the level of whole-genome methylation. Most often, LINE-1 methylation index analysis is performed by real-time quantitative methylation-specific PCR [6,11], methylation-sensitive high resolution melting analysis [9], or pyrosequencing 3-4 CpG sites in its promoter region [1,3,4,16]. We developed a targeted method for determining the index of methylation in the LINE-1 retrotransposon promoter using targeted bisulfite massive parallel sequencing. In our method, LINE-1 methylation is assessed based on the DNA methylation index of 19 CpG sites in the consensus sequence of the LINE-1 promoter that have the highest tissue- or tumor-specific variability [14]. Our method can potentially have a wide range of applications, from evaluating epigenetic abnormalities in various tissues and tumors to analyzing the epigenetic effects of mutagens and monitoring studies.
Targeted bisulfite massive parallel sequencing
Considering the bisulfite converted consensus sequence of the LINE-1 promoter (GenBank Accession: X58075.1), we designed primers to cover 19 CpG sites: forward 5’-TATTAGGGAGTGTTAGATAGTGGG-3’ and reverse 5’-CCTCTAAACCAAATATAAAATATAATCTC-3’ (Fig. 1a). These 19 CpG sites included the 3 CpG sites covered by the PyroMark LINE‐1 kit (Qiagen, Hilden, Germany). Primers with attached Illumina adapters were synthesized, and the DNA product was amplified by PCR using a modified Illumina 16S rRNA protocol.
Fig. 1.
Method of targeted bisulfite massive parallel sequencing of the human LINE-1 retrotransposon promoter. a. Schematic representation of the targeted bisulfite massive parallel sequencing of 19 CpG sites in the LINE-1 promoter. The red frame indicates the analyzed CpG sites in this study, and the blue frame indicates the most frequently analyzed CpG sites as determined using the Qiagen pyrosequencing LINE-1 kit (explanations are in the text). The schematic profile of the plot was obtained from the LINE-1 methylation index across the 5’-UTR region in analyzed samples. b. Gel image of the results of the amplified LINE-1 sequences for five samples after the first PCR with attached adapters and the same samples with attached indices after the second PCR. M represents Sky-High DNA marker (Biolabmix, Russia) with size in bp indicated.
First, sodium bisulfite conversion of 1000 ng of total genomic DNA was performed using an EZ DNA Methylation-Direct Kit (Zymo Research, USA). First PCR amplifications were performed in 20 µl reaction mixtures. A master mix for each reaction was made using the 2x BioMaster HS-Taq PCR mix (Biolabmix, Russia), primers with attached Illumina adapters, and 60 ng of bisulfite-converted DNA. The cycling conditions were as follows: 95 °C for 5 min, followed by 40 cycles of 95 °C for 30 s, 57 °C for 1 min and 72 °C for 1 min. A final 7 min elongation step was performed at 72 °C.
A second PCR amplification, used to incorporate barcodes and sequencing adapters into the final PCR product, was performed in 25-microliter reaction mixtures using the 2x Phusion high-fidelity DNA polymerase mix (NEB, USA). As a matrix for the reaction, 2 µl (1/10 of the volume) of the unpurified product of the first PCR was used. A separate primer pair containing a unique 6-base barcode from the Nextera XT index kit was added to each well. The cycling conditions were as follows: 95 °C for 3 min, followed by 8 cycles of 95 °C for 30 s, 55 °C for 30 s and 72 °C for 30 s. A final 5 min elongation step was performed at 72 °C. Products from both PCRs were verified by gel electrophoresis (Fig. 1b) with careful examination of the fragment sizes (PCR1 – 311 bp, PCR2 – 380 bp).
The quality and quantity of the amplified libraries were assessed by a Qubit fluorometer (Thermo Fisher, USA) and pooled together in equimolar quantities. Pooled library purified by AMPure XP beads (Beckman Coulter, USA), the size of fragments was assessed using a Bioanalyzer 2100 instrument (Agilent Technologies, USA), and the library was sequenced by MiSeq (Illumina, USA) using a Reagent Micro Kit v2 (300 cycles) in paired-end mode, with 30% PhiX added for clear cluster separation.
The short length of the LINE-1 amplified region (224 bp) made it possible to analyze more than 300 samples in a single MiSeq run, which was significantly more cost-effective than pyrosequencing.
NGS data processing
Sequenced reads were demultiplexed by standard Illumina i7 and i5 barcodes using the Genome Analysis Toolkit (GATK, v4.1.3.0) [10]. Read quality was assessed using FastQC v0.11.8, and residual adapter sequences and low-quality reads were removed with Trimmomatic v0.39 [2]. Read alignment was performed using bwa-meth v0.2.2 (https://github.com/brentp/bwa-meth) with default parameters. To avoid contamination in the library preparation step, we first mapped reads to the human genome assembly (February 2009 release, hg19/GRCh37) with masked repeat sequences, and then, unmapped reads (SAM flag “4”) were extracted using SAMtools v1.12 [8] and aligned to the consensus sequence of CpG-rich region of the LINE-1 element (GenBank Accession: X58075.1) [5] using bwa-meth. Human genome repeats were masked using BEDTools v2.26.0 [12] and RepeatMasker sequences [7], as described previously [15]. DNA methylation data in the CpG context were extracted from obtained BAM files using MethylDackel (https://github.com/dpryan79/MethylDackel). The results are presented as the methylation index, which is the ratio of the number of cytosines to the sum of cytosine and thymine in a given CpG site. The resulting methylation index characterizes the average level of methylation of a part of the LINE-1 sequences in the genome that are amplified using the primers used and have specific CpG sites. Statistical analysis was carried out with Statistica v8.0 (StatSoft) and R statistical software (v. 3.6.1) using Pearson tests for the correlation analysis.
Method validation
The method was tested on DNA samples obtained from various human tissues: chorionic villous trophoblast cells from 42 induced and 193 spontaneous human abortions; tumor (n = 2) and normal tissues (n = 2) of the mammary gland; peripheral blood leukocytes (n = 36); tissues of the carotid artery (n = 38) and great saphenous vein (n = 10); and alpha-actin-2 positive (ACTA2+) cells (n = 8) and macrosialin-positive (CD68+) cells (n = 6) from carotid atherosclerotic lesions.
Commercially available control samples (in 3 replicates) from the Human Methylated & Nonmethylated DNA Set (Zymo Research, Irvine, CA, USA) were used to verify the validity of the assay.
Chorionic villous trophoblast cells were obtained by maceration of chorionic villi in 70% acetic acid. The isolated cells were washed in phosphate buffer. Genomic DNA was isolated from chorionic villous trophoblasts using the phenol-chloroform method. Artery wall tissue and whole blood samples were harvested from patients with atherosclerosis during carotid endarterectomy. ACTA2+ cells were derived from fresh frozen and immunostained samples of atherosclerotic lesions by laser-capture microdissection on a PALM MicroBeam (Carl Zeiss, Germany). DNA extraction and sodium bisulfite conversion were performed using an EZ DNA Methylation-Direct kit (Zymo Research, Irvine, CA, USA). Converted DNA from ACTA+ and CD68+ cells was further amplified using an EpiTect whole bisulfitome kit (Qiagen, Germany).
We were able to assess up to 346 samples in one sequencing run. The average output was 3333 (maximum 9076 and minimum 19) reads per demultiplexed library. Samples with more than 50% of short reads mapped to the human genome (13 out of 346) and those showing less coverage than 1000 reads per sample (85 out of 346) were excluded. Only 70% of the samples (242 of 346) reached the 1000x base coverage threshold to be considered for CpG methylation analysis [13].
The results of the LINE-1 promoter methylation index in hypomethylated (DNMT1−/− and DNMT3b−/− HCT cells) and methylated (MspI-treated HCT cells) control samples showed average LINE-1 methylation indices of 19 ± 6% and 65 ± 13%, respectively. One possible reason for the low LINE-1 methylation index in a methylated control sample is the spontaneous transition of CpG to TpG during evolution, which is one of the mechanisms for controlling the activity of retrotransposons. When analyzing the methylation index, all CpG sites where such a transition occurred is interpreted as having a zero methylation index. This may reduce the average observed methylation index. However, this method can be used to compare the methylation index between different samples and groups of samples, given similar initial evolutionary conditions for all individuals.
In addition to our method, the methylation index of the LINE-1 retrotransposon in 35 analyzed samples was determined by pyrosequencing using the PyroMark Q24 CpG LINE-1 (Qiagen, Germany) assay according to the manufacturer's protocol. The mean LINE-1 methylation index at 19 CpG sites assessed by targeted bisulfite massive parallel sequencing had a strong correlation with the mean LINE-1 methylation index at 3 CpG sites obtained using bisulfite pyrosequencing (Fig. 2a). The correlation between average methylation indices across identical CpG sites measured by different techniques was also high (Fig. 2b). However, the methylation index estimated by pyrosequencing did not correlate well with the methylation index for all individual CpG sites after targeted bisulfite massive parallel sequencing (Fig. 3). This finding indicates that our method provides not only the same information on average values as pyrosequencing but also additional information on individual CpG sites.
Fig. 2.
Relationship between the average LINE-1 methylation index (%) measured by the introduced method (NGS) and bisulfite DNA pyrosequencing.
Fig. 3.
Relationship between methylation indices of individual CpG sites as measured by the introduced method and average LINE-1 methylation index (%) as assessed by bisulfite DNA pyrosequencing (at positions 318, 321, and 328) trophoblast and ACTA2+ cells.
The developed method of targeted bisulfite massive parallel sequencing makes it possible to estimate the methylation index of the LINE-1 retrotransposon quickly and cost-effectively (Table 1). A possible scenario for using our method is to analyze large groups of heterogeneous samples (e.g., the analysis of the LINE-1 methylation index in tumors of different locations in a large number of patients). Our results indicate that despite the relatively good correlation between the LINE-1 methylation indices estimated by targeted bisulfite NGS and pyrosequencing, the correlations for a portion of the CpG sites are quite weak. This indicates that the methylation index of the three CpG sites evaluated by pyrosequencing is insufficient to obtain complete information about the methylation profile of the LINE-1 promoter.
Table 1.
Cost-effectiveness of the analysis of LINE-1 methylation index by targeted bisulfite NGS compared with pyrosequencing (steps after bisulfite conversion of DNA are compared only).
Pyrosequencing | Supplier | Price per 1 sample, $ (96 samples) | ||
---|---|---|---|---|
PyroMark Q24 CpG LINE-1 | Qiagen | 8.5 | ||
PyroMark Binding Buffer | Qiagen | 0.01 | ||
PyroMark Denaturation Sol. | Qiagen | 0.25 | ||
PyroMark Wash Buffer | Qiagen | 0.25 | ||
PyroMark Annealing Buffer | Qiagen | 0.01 | ||
Streptavidin Sepharose® High Performance | Sigma Aldrich | 0.22 | ||
PyroMark Q24 Plate | Qiagen | 0.11 | ||
PyroMark Gold Q24 Reagents | Qiagen | 3.23 | ||
Total | 12.58 | |||
Targeted bisulfite NGS | 300 samples | 200 samples | 100 samples | |
---|---|---|---|---|
PCR 1 | Biolabmix | 0.21 | 0.21 | 0.21 |
PCR 2 | NEB | 2.04 | 2.04 | 2.04 |
AMPure XP | Beckman | 0.01 | 0.01 | 0.02 |
Adapters | Illumina | 0.35 | 0.35 | 0.35 |
Indexes | Illumina | 0.46 | 0.46 | 0.46 |
Reagent Micro Kit v2 (300 cycles) | Illumina | 3.23 | 4.85 | 9.69 |
Total | 6.30 | 7.92 | 12.78 |
Thus, the results of our method correspond well to the methylation index assessed using the commercially available kit for pyrosequencing and provide much more information, allowing for the evaluation of the methylation profile in a large part of the LINE-1 promoter. In addition to estimating the average LINE-1 methylation index in the genome, this method potentially allows the estimation of the level of methylation in individual LINE-1 promoters at individual genome loci containing specific genetic variants. The introduced method of targeted bisulfite massive parallel sequencing of the human LINE-1 retrotransposon promoter can be used in large-scale studies of genome methylation.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
Acknowledgments
The molecular cytogenetic and molecular genetic studies were performed in the “Medical Genomics” Core Facility of the Tomsk National Research Medical Center of the Russian Academy of Sciences using the resources of the biocollection “Biobank of the population of northern Eurasia” of the Research Institute of Medical Genetics, Tomsk NRMC.
Funding
This study was supported by the Russian Science Foundation (project 19-74-10026).
Ethics approval
This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the local Research Ethics Committee of the Research Institute of Medical Genetics, Tomsk NRMC (08.12.2020 / No 166).
Consent to participate
Written informed consent was obtained from the patients for their participation.
References
- 1.Benitez-Trinidad A.B., Medina-Diaz I.M., Bernal-Hernandez Y.Y., Barron-Vivanco B.S., Gonzalez-Arias C.A., Herrera-Moreno J.F., Alvarado-Cruz I., Quintanilla-Vega B., Rojas-Garcia A.E. Relationship between LINE-1 methylation pattern and pesticide exposure in urban sprayers. Food Chem. Toxicol. 2018;113:125–133. doi: 10.1016/j.fct.2018.01.035. [DOI] [PubMed] [Google Scholar]
- 2.Bolger A.M., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bollati V., Fabris S., Pegoraro V., Ronchetti D., Mosca L., Deliliers G.L., Motta V., Bertazzi P.A., Baccarelli A., Neri A. Differential repetitive DNA methylation in multiple myeloma molecular subgroups. Carcinogenesis. 2009;30(8):1330–1335. doi: 10.1093/carcin/bgp149. [DOI] [PubMed] [Google Scholar]
- 4.Cho Y.H., Woo H.D., Jang Y., Porter V., Christensen S., Hamilton R.F., Jr., Chung H.W. The association of LINE-1 hypomethylation with age and centromere positive micronuclei in human lymphocytes. PLoS One. 2015;10(7) doi: 10.1371/journal.pone.0133909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Crowther P.J., Doherty J.P., Linsenmeyer M.E., Williamson M.R., Woodcock D.M. Revised genomic consensus for the hypermethylated CpG island region of the human L1 transposon and integration sites of full length L1 elements from recombinant clones made using methylation-tolerant host strains. Nucleic Acids Res. 1991;19(9):2395–2401. doi: 10.1093/nar/19.9.2395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Furlan C., Polesel J., Barzan L., Franchin G., Sulfaro S., Romeo S., Colizzi F., Rizzo A., Baggio V., Giacomarra V., Dei Tos A.P., Boscolo-Rizzo P., Vaccher E., Dolcetti R., Sigalotti L., Fratta E. Prognostic significance of LINE-1 hypomethylation in oropharyngeal squamous cell carcinoma. Clin. Epigenet. 2017;9:58. doi: 10.1186/s13148-017-0357-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Jurka J. Repbase update: a database and an electronic journal of repetitive elements. Trends Genet. 2000;16(9):418–420. doi: 10.1016/s0168-9525(00)02093-x. [DOI] [PubMed] [Google Scholar]
- 8.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., Genome Project Data Processing S The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Marques-Rocha J.L., Milagro F.I., Mansego M.L., Mourao D.M., Martinez J.A., Bressan J. LINE-1 methylation is positively associated with healthier lifestyle but inversely related to body fat mass in healthy young individuals. Epigenetics. 2016;11(1):49–60. doi: 10.1080/15592294.2015.1135286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.McKenna A., Hanna M., Banks E., Sivachenko A., Cibulskis K., Kernytsky A., Garimella K., Altshuler D., Gabriel S., Daly M., DePristo M.A. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ponomaryova A.A., Rykova E.Y., Azhikina T.L., Bondar A.A., Cheremisina O.V., Rodionov E.O., Boyarko V.V., Laktionov P.P., Cherdyntseva N.V. Long interspersed nuclear element-1 methylation status in the circulating DNA from blood of patients with malignant and chronic inflammatory lung diseases. Eur. J. Cancer Prev. 2021;30(2):127–131. doi: 10.1097/CEJ.0000000000000601. [DOI] [PubMed] [Google Scholar]
- 12.Quinlan A.R., Hall I.M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26(6):841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Roeh S., Wiechmann T., Sauer S., Kodel M., Binder E.B., Provencal N. HAM-TBS: high-accuracy methylation measurements via targeted bisulfite sequencing. Epigenet. Chromatin. 2018;11(1):39. doi: 10.1186/s13072-018-0209-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Sharma A., Jamil M.A., Nuesgen N., Dauksa A., Gulbinas A., Schulz W.A., Oldenburg J., El-Maarri O. Detailed methylation map of LINE-1 5′-promoter region reveals hypomethylated CpG hotspots associated with tumor tissue specificity. Mol. Genet. Genomic Med. 2019;7(5):e601. doi: 10.1002/mgg3.601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Tang Z., Steranka J.P., Ma S., Grivainis M., Rodic N., Huang C.R., Shih I.M., Wang T.L., Boeke J.D., Fenyo D., Burns K.H. Human transposon insertion profiling: Analysis, visualization and identification of somatic LINE-1 insertions in ovarian cancer. Proc. Natl. Acad. Sci. USA. 2017;114(5):E733–E740. doi: 10.1073/pnas.1619797114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Vasilyev S.A., Tolmacheva E.N., Kashevarova A.A., Sazhenova E.A., Lebedev I.N. Methylation status of LINE-1 retrotransposon in chromosomal mosaicism during early stages of human embryonic development. Mole. Biol. 2015;49(1):144–152. [PubMed] [Google Scholar]