Skip to main content
Data in Brief logoLink to Data in Brief
. 2022 Apr 20;42:108187. doi: 10.1016/j.dib.2022.108187

RNA-Seq data of ALKBH5 and FTO double knockout HEK293T human cells

Egor A Smolin 1, Andrey I Buyan 1, Dmitry N Lyabin 1, Ivan V Kulakovskiy 1,, Irina A Eliseeva 1,
PMCID: PMC9062317  PMID: 35516002

Abstract

N6-methyladenosine (m6A) is the most abundant, highly dynamic mRNA modification that regulates mRNA splicing, stability, and translation. The m6A epigenetic mark is erased by RNA demethylases ALKBH5 (AlkB Homolog 5) and FTO (Fat mass and obesity-associated protein). The ALKBH5 and FTO RNA demethylases recognize m6A in similar nucleotide contexts. Therefore, these proteins can partially substitute for each other. To assess the impact of total m6A demethylation failure we performed high-throughput sequencing of cytoplasmic RNA from ALKBH5 and FTO double knockout and wild type HEK293T cells. The RNA-Seq libraries were sequenced on Illumina NextSeq 500, trimmed, and mapped to the human genome. The consequent read counting and differential expression analysis in the R environment detected 5871 differentially expressed and 166 alternatively spliced genes comparing double knockout against wild type HEK293T cells. Raw data are deposited in NCBI Gene Expression Omnibus (GEO) repository under GEO accession GSE198050.

Keywords: m6A methylation, m6A demethylation, Epitranscriptomics, RNA modifications, ALKBH5, FTO, Demethylase, Transcriptome

Specifications Table

Subject Biological sciences
Specific subject area Bioinformatics, Transcriptomics, RNA modifications
Type of data Tables and figures
How the data were acquired RNA-Seq libraries were obtained from cytoplasmic RNA of HEK293T and HEK293TΔFTOΔALKBH5 cells. Sequencing was performed using Illumina NextSeq 500.
Data format Raw and analyzed
Description of data collection Cytoplasmic RNA from wild type and ΔALKBH5ΔFTO HEK293T cells was isolated using TRIzol LS Reagent and purified on Direct-zol RNA Miniprep columns. Libraries were prepared using the NEBNext Ultra Directional RNA Library Prep Kit for Illumina (NEB) and RiboMinus Eukaryote System v2 (Thermo Fisher Scientific) for rRNA depletion. Statistical analysis was performed with R/Bioconductor.
Data source location Institute of Protein Research (Pushchino, Russia)
Data accessibility Analyzed data and raw RNA-Seq data were deposited in the NCBI GEO database under GEO accession GSE198050 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE198050)

Value of the Data

  • N6-methyladenosine (m6A) is the most abundant and dynamic mRNA modification that affects mRNA splicing, stability, and translation [1]. The double knockout ΔALKBH5ΔFTO cells lack two m6A demethylases and provide suitable model to study m6A demethylation failure.

  • The presented RNA-Seq data will be beneficial to study the impact of the impaired m6A RNA demethylation on gene expression.

  • The data will facilitate understanding of individual and shared functional roles of FTO and ALKBH5 if analyzed along with existing RNA-Seq datasets from single demethylase KO cells [e.g. 2,3] and m6A-methylation profiles [e.g. 4,5].

1. Data Description

Here we present the RNA-Seq data for ALKBH5 and FTO double knockout HEK293T cells. The ALKBH5 and FTO RNA demethylases have similar m6A substrate specificity [6] and likely can partially compensate for the absence of each other in the living cell. While there are existing RNA-Seq datasets for individual knockouts and knockdowns of ALKBH5 [e.g. 3] and FTO [2], this is the first transcriptomic profiling of the double knockout cells. We created the knockout cells using the CRISPR/Cas9 approach. The position of gRNAs and the lack of ALKBH5 and FTO in knockout cells are illustrated in Fig. 1.

Fig. 1.

Fig 1

Evaluation of ALKBH5 and FTO knockout. (A) Schematic representation of guide RNA (gRNA) locations. Two gRNAs targeting the third exon of FTO (for nickase CRISPR/Cas9) and two gRNAs targeting the first and third exons of ALKBH5 (for CRISPR/Cas9) are shown. The coordinates are according to the hg38 genome assembly. (B) Western blot of wild type and ΔALKBH5ΔFTO HEK293T cells using anti-ALKBH5, anti-FTO, and anti-ACTB as a loading control.

The RNA-Seq was performed in three replicates for HEK293T and double knockout cells. The libraries were sequenced in a 76 bp single-end run on Illumina NexSeq 500. In all samples, quality-trimming (Q20) was required for less than 1% of reads. The total sequenced library size and mapping statistics are provided in Table 1, with more than 60% of total reads uniquely mapped and included in gene-level read counts. The raw reads (fastq) and gene-level read counts (tab-separated plain text) are available in the NCBI GEO database under GEO accession GSE198050.

Table 1.

RNA-Seq library size and mapping statistics.

Sample name Total library size Uniquely mapped reads Total gene-level read counts
HEK293T, replicate 1 10897264 8956475 7400887
HEK293T, replicate 2 11306936 9195340 7552615
HEK293T, replicate 3 8406417 6862227 5635235
ΔALKBH5ΔFTO, replicate 1 9226839 7324822 5924003
ΔALKBH5ΔFTO, replicate 2 10443280 8153351 6434276
ΔALKBH5ΔFTO, replicate 3 9633924 7664124 6137469

1.1. The presented RNA-Seq data are suitable for analysis of differential gene expression in FTO and ALKBH5 double knockout cells

Gene-level read counting showed good concordance between experimental replicates and separation of wild type HEK293T from double knockout samples, see results of the principal component analysis in Fig. 2A based upon 13718 genes that passed two counts per million in at least 3 of 6 samples. With these genes, we performed differential expression (DE) analysis to estimate overall RNA composition changes and detected 774 genes passing FDR < 0.05 and |log2(Fold Change)| > 1. Regarding the knockout targets, FTO (log2(Fold Change) = -0.9, FDR < 10−40) and ALKBH5 (log2(Fold Change) = -0.4, FDR < 10−18) were significantly downregulated (Fig. 2B,C). Thus, the RNA-Seq data confirm the knockout effect, and are suitable for differential expression analysis.

Fig. 2.

Fig 2

The data are suitable for analysis of differential gene expression under FTO and ALKBH5 knockout. (A) Principal component analysis for three replicates of HEK293T (WT) and three double knockout (KO) samples, X and Y axes correspond to the first two principal components. (B) Barplot illustrating FTO and ALKBH5 expression in counts per million (CPM). (C) Volcano plot of differential expression analysis. The red line indicates 5% FDR, gray lines indicate -1 and 1 log2(Fold Change).

1.2. The presented RNA-Seq data are suitable for detection of alternative splicing events occurring upon FTO and ALKBH5 knockout

RNA N6-methyladenosine demethylation by FTO and ALKBH5 has been shown to affect splicing [2,7]. The presented RNA-Seq data are suitable for the detection of alternative splicing events, as differential exon usage analysis of 8887 multi-exonic genes found 166 cases passing FDR < 0.05 (Fig. 3A). The illustrative examples of isoform switching under FTO and ALKBH5 knockout include RPS24, SETD7, ASPH, KRT8, and SEPT9 genes, the genomic views are shown in Fig. 3B–-F.

Fig. 3.

Fig 3

The data are suitable for analysis of differential exon usage under FTO and ALKBH5 knockout. (A) Volcano plot of differential splicing analysis. The red line indicates 5% FDR, several illustrative examples of alternative splicing are highlighted. (B-F) Genomic views of RNA-Seq profiles for the selected genes with alternative splicing events.

2. Experimental Design, Materials and Methods

2.1. Cell culture and FTO/ALKBH5 knockout

HEK293T cell line (originally obtained from ATCC, American Type Culture Collection) was kindly provided by Dr. Elena Nadezhdina (IPR RAS). The cells were cultivated in Dulbecco's modified eagle medium (DMEM), supplemented with 10% fetal bovine serum (FBS, Capricorn Scientific), 2 mM glutamine, and 1x Antibiotic-Antimycotic (Gibco) at 37°C in a humidified atmosphere containing 5% CO2 and passaged by standard methods.

The gRNAs were designed using CHOPCHOP software [8]. For the FTO knockout, two gRNAs targeting the third exon (top strand: 5′-CCTACAAGTACCTGAACACC-3′ and bottom strand: 5′-CCAATGAGGATGCGAGATAC-3′) were inserted into pSpCas9n(BB)-2A-Puro(PX462) V2.0 (Addgene #62987, Feng Zhang lab [9]). For ALKBH5 deletion, the gRNAs targeting the first and the third exon (5′-GACGTCCCGGGACAACTATA-3′ and 5′-TCGCCTGTCAGGAAACAACA-3′) were inserted into pSpCas9(BB)‐2A‐Puro(pX459) V2.0 (Addgene #62988, Feng Zhang lab [9]). To obtain the ALKBH5 donor vector, upstream and downstream homology arms (about 1000-bp each) were amplified and inserted into the pJet1.2 vector (Table 2).

Table 2.

Primers used for the ALKBH5 donor DNA cloning and amplification. Down – downstream homology arm, up – upstream homology arm, RE – restriction enzyme (bold). For full-length donor DNA amplification, For_up and Rev_down primers were used.

Primer name Sequence RE Cloning strategy
For_down AGGAAACAACAGGGACCCTGCTCTGAAACC Inserted into pJet1.2 at EcoRV in the forward orientation
Rev_down TACCCACTTTCTCTCTCTTGGGCCTCC
For_up AAGCGGCCGCGCGGCACGGCGACACTGTTGG NotI Inserted into pJet1.2 with downstream homology arm at NotI/XhoI
Rev_up TTCTCGAGACGCAGGTCCGTGTAGCCGCTGG XhoI

Cells were transfected with the gRNA expression vectors (for FTO) or with the amplified donor sequence and the gRNA expression vectors (for ALKBH5) using Lipofectamine 3000 (Invitrogen). After puromycin treatment (1 μg/ml) for three days, the cells were allowed to recover for two days, and then the pool of survived cells was cloned by limiting dilution and screened by immunoblotting with anti-ALKBH5 (#ab195377, Abcam), anti-FTO (#31687, CST), and anti-ACTB (loading control, #ab8227, Abcam) antibodies as was described previously [10]. The double knockout ΔALKBH5ΔFTO cells were obtained by consecutive knockout of FTO and ALKBH5, correspondingly.

2.2. RNA isolation and library preparation

The cells were lysed in buffer containing 20 mM Tris-HCl (pH 7.4), 150 mM NaCl, 5 mM MgCl2, 1 mM DTT, 1% Triton X-100, 100 µg/ml cycloheximide (Sigma-Aldrich), 25 U/ml TURBO DNase (Ambion). Cell lysates were incubated on ice for 10 min, triturated ten times through a 26-G needle, and centrifuged at 20,000 g at +4°C for 10 min. Total RNA was isolated using TRIzol LS Reagent (Thermo Fisher Scientific) and was subsequently purified on columns Direct-zol RNA Miniprep (Zymo Research). The 1 µg of the total RNA supplemented with in vitro transcribed spike-in RNAs [10] (7 pg GFP mRNA and 1.2 pg FLuc mRNA) was used for library preparation. The experiment was performed in three replicates for wild type HEK293T and double knockout cells.

The RNA-seq libraries were constructed according to the manufacturer's protocol using the NEBNext Ultra Directional RNA Library Prep Kit for Illumina (NEB) and RiboMinus Eukaryote System v2 (Thermo Fisher Scientific) for rRNA depletion.

2.3. Sequencing and raw data processing

The RNA-Seq libraries were sequenced using Illumina NexSeq 500 (76 bp single-end reads). The reads were trimmed with cutadapt v2.10 [11] with "-j 24 –minimum-length 20 –nextseq-trim=20" parameters. Read mapping was performed using the hg38 genome assembly and GENCODE v29 genome annotation (modified to include spike-ins as additional contigs) was performed with STAR v2.7.6a [12] using default parameters.

2.4. Gene counting, differential expression, and exon usage analyses

Differential expression (DE) analysis was performed to estimate overall transcriptomic changes. The read counting was performed by R featureCounts from RsubRead (v.2.0.1) [13] using GENCODE v29 [14] basic annotation with GTF.featureType = 'gene', and countMultiMappingReads = FALSE to leave only uniquely mapped reads that overlap exons or introns. Next, we used edgeR (v.3.28.1) [15] to identify differentially expressed genes among 13718 genes that passed 2 cpm for at least 3 of 6 samples for DE (default TMM-normalization with calcNormFactors, dispersion estimation with estimateDisp, and test for differential expression with exactTest).

To test the genes for differential exon usage (DEU), the exon-level reads counting was performed with featureCounts allowing multiple feature overlaps (allowMultiOverlap = TRUE). F-test for differential splicing (diffSpliceDGE) was used for detection of significant alternative splicing events.

RNA-Seq profiles for visualization were generated using samtools (v.1.10) [16] view with -f/-F 16 and -q 255 flags to leave only correctly stranded and uniquely mapped reads followed by bedtools (v.2.27.1) [17] genomecov with -split and -scale [106 / total number of uniquely mapped reads]. The genomic views were generated with svist4get (v.1.2.24) [18], the other plots were generated using ggplot2 (v.3.3.5).

Ethics Statements

This work does not contain any studies with humans, animals or social media platforms.

CRediT authorship contribution statement

Egor A. Smolin: Investigation, Writing – review & editing, Visualization. Andrey I. Buyan: Writing – original draft, Software, Visualization, Investigation. Dmitry N. Lyabin: Writing – review & editing, Investigation. Ivan V. Kulakovskiy: Writing – original draft, Formal analysis, Data curation, Supervision. Irina A. Eliseeva: Conceptualization, Writing – original draft, Supervision, Funding acquisition, Data curation.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by Russian Science Foundation [grant number 19-74-10079]. We are grateful to the Skoltech Genomics Core Facility and A.A. Kharkevich Institute for Information Transmission Problems RAS for assistance with high-throughput sequencing.

Contributor Information

Ivan V. Kulakovskiy, Email: ivan.kulakovskiy@gmail.com.

Irina A. Eliseeva, Email: yeliseeva@vega.protres.ru.

Data Availability

References

  • 1.He P.C., He C. m 6 A RNA methylation: from mechanisms to therapeutic potential. EMBO J. 2021;40 doi: 10.15252/embj.2020105977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bartosovic M., Molares H.C., Gregorova P., Hrossova D., Kudla G., Vanacova S. N6-methyladenosine demethylase FTO targets pre-mRNAs and regulates alternative splicing and 3’-end processing. Nucleic Acids Res. 2017;45:11356–11370. doi: 10.1093/nar/gkx778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Wang J., Li Y., Wang P., Han G., Zhang T., Chang J., Yin R., Shan Y., Wen J., Xie X., Feng M., Wang Q., Hu J., Cheng Y., Zhang T., Li Y., Gao Z., Guo C., Wang J., Liang J., Cui M., Gao K., Chai J., Liu W., Cheng H., Li L., Zhou F., Liu L., Luo Y., Li S., Zhang H. Leukemogenic chromatin alterations promote AML leukemia stem cells via a KDM4C-ALKBH5-AXL signaling axis. Cell Stem Cell. 2020;27:81–97. doi: 10.1016/j.stem.2020.04.001. .e8. [DOI] [PubMed] [Google Scholar]
  • 4.Garcia-Campos M.A., Edelheit S., Toth U., Safra M., Shachar R., Viukov S., Winkler R., Nir R., Lasman L., Brandis A., Hanna J.H., Rossmanith W., Schwartz S. Deciphering the “m6A Code” via antibody-independent quantitative profiling. Cell. 2019;178:731–747. doi: 10.1016/j.cell.2019.06.013. .e16. [DOI] [PubMed] [Google Scholar]
  • 5.Zhang Z., Chen L.Q., Zhao Y.L., Yang C.G., Roundtree I.A., Zhang Z., Ren J., Xie W., He C., Luo G.Z. Single-base mapping of m 6 A by an antibody-independent method. Sci. Adv. 2019;5 doi: 10.1126/sciadv.aax0250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zou S., Toh J.D.W., Wong K.H.Q., Gao Y.G., Hong W., Woon E.C.Y. N 6 -Methyladenosine: a conformational marker that regulates the substrate specificity of human demethylases FTO and ALKBH5. Sci. Rep. 2016;6:25677. doi: 10.1038/srep25677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tang C., Klukovich R., Peng H., Wang Z., Yu T., Zhang Y., Zheng H., Klungland A., Yan W. ALKBH5-dependent m6A demethylation controls splicing and stability of long 3’-UTR mRNAs in male germ cells. Proc. Natl. Acad. Sci. U. S. A. 2017;115:E325–E333. doi: 10.1073/pnas.1717794115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Labun K., Montague T.G., Krause M., Torres Cleuren Y.N., Tjeldnes H., Valen E. CHOPCHOP v3: expanding the CRISPR web toolbox beyond genome editing. Nucleic Acids Res. 2019;47:W171–W174. doi: 10.1093/nar/gkz365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ran F.A., Hsu P.D., Wright J., Agarwala V., Scott D.A., Zhang F. Genome engineering using the CRISPR-Cas9 system. Nat. Protoc. 2013;8:2281–2308. doi: 10.1038/nprot.2013.143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Lyabin D.N., Eliseeva I.A., Smolin E.A., Doronin A.N., Budkina K.S., Kulakovskiy I.V., Ovchinnikov L.P. YB-3 substitutes YB-1 in global mRNA binding. RNA Biol. 2020;17:487–499. doi: 10.1080/15476286.2019.1710050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  • 12.Dobin A., Davis C.A., Schlesinger F., Drenkow J., Zaleski C., Jha S., Batut P., Chaisson M., Gingeras T.R. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Liao Y., Smyth G.K., Shi W. The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Res. 2019;47:e47. doi: 10.1093/nar/gkz114. –e47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Frankish A., Diekhans M., Ferreira A.-M., Johnson R., Jungreis I., Loveland J., Mudge J.M., Sisu C., Wright J., Armstrong J., Barnes I., Berry A., Bignell A., Carbonell Sala S., Chrast J., Cunningham F., Di Domenico T., Donaldson S., Fiddes I.T., García Girón C., Gonzalez J.M., Grego T., Hardy M., Hourlier T., Hunt T., Izuogu O.G., Lagarde J., Martin F.J., Martínez L., Mohanan S., Muir P., Navarro F.C.P., Parker A., Pei B., Pozo F., Ruffier M., Schmitt B.M., Stapleton E., Suner M.-M., Sycheva I., Uszczynska-Ratajczak B., Xu J., Yates A., Zerbino D., Zhang Y., Aken B., Choudhary J.S., Gerstein M., Guigó R., Hubbard T.J.P., Kellis M., Paten B., Reymond A., Tress M.L., Flicek P. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019;47:D766–D773. doi: 10.1093/nar/gky955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Robinson M.D., McCarthy D.J., Smyth G.K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. 1000 Genome project data processing subgroup, the sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Quinlan A.R., Hall I.M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Egorov A.A., Sakharova E.A., Anisimova A.S., Dmitriev S.E., Gladyshev V.N., Kulakovskiy I.V. svist4get: a simple visualization tool for genomic tracks from sequencing experiments. BMC Bioinf. 2019;20:113. doi: 10.1186/s12859-019-2706-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement


Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES