Abstract
Background and aims
The study aimed to identify yet unknown and uncharacterized long non-coding RNAs (lncRNAs) in treatment-naïve ulcerative colitis (UC), and to define their possible roles in UC pathogenesis. For that purpose, accurate quantification methods for lncRNA transcript detection, multiple and “stringent” strategies were applied. New insights in the regulation of functional genes and pathways of relevance for UC through expression of lncRNAs are expected.
Methods
The study was based on sequencing data derived from a data set consisting of treatment-naïve UC patients (n = 14) and control subjects (n = 16). Two complementary aligners were used to identify lncRNAs. Several different steps were used to validate differential expression including plotting the reads over the annotation for manual inspection. To help determine potential lncRNA involvement in biological processes, KEGG pathway enrichment was done on protein-coding genes which co-expressed with the lncRNAs.
Results
A total of 99 lncRNAs were identified in UC. The lncRNAs which were not previously characterized (n = 15) in UC or other autoimmune diseases were selected for down-stream analysis. In total, 602 protein-coding genes correlated with the uncharacterized lncRNAs. KEGG pathway enrichment analysis revealed involvement of lncRNAs in two significantly enriched pathways, lipid and atherosclerosis, and T-cell receptor signaling.
Conclusion
This study identified a set of 15 yet uncharacterized lncRNAs which may be of importance for UC pathogenesis. These lncRNAs may serve as potential diagnostic biomarkers and might be of use for the development of UC treatment strategies in the future.
Keywords: Ulcerative colitis, lncRNAs
1. Background
Ulcerative colitis (UC) is a chronic inflamed condition of the colon and rectum and one of the major phenotypes of inflammatory bowel disease (IBD) [1]. Despite the prevalence of UC, the etiology of UC is poorly understood. The UC pathogenesis is complex and an interplay between environmental factors, intestinal microbiome, nutrition and genetic factors [1]. Although heritability plays a potential role, only a small fraction (7.5–22%) of UC risk can be explained by genetic factors alone [2,3]. Genome-wide association studies (GWAS) found several IBD risk loci on the non-coding region of the genome [4]. LncRNAs have not been thoroughly explored in IBD [5] nor has their contribution to the progression of the disease.
LncRNAs play an important role in tumor development and carcinogenesis and have been suggested to be biomarkers for diagnosis and prognosis [[6], [7], [8]]. A growing body of evidence implies a role for lncRNAs in UC [[9], [10], [11]]. The expression of lncRNAs in UC has previously been reported [9,10,12,13]. They are involved in the modulation of the intestinal barrier function [13,14], regulating expression of inflammatory cytokines [15], and polarization of macrophages [16].
LncRNAs, which are RNAs with a length greater than 200 nucleotides, are poorly conserved [17]. Their roles in gene expression regulation are still not well understood [18]. They may or may not be polyadenylated, and 98% are spliced. At least two different alternatives spliced isoforms have been observed in about 25% of all known lncRNAs [19]. LncRNAs share common features as they are expressed at lower levels, are tissue-specific, and have exonic regions with low levels of interspecies sequence conservation [20]. Weak expression makes accurate quantification of lncRNA transcripts particularly challenging. According to ENCODE's own evaluation, less than 1000 lncRNAs are present at greater than one copy per cell in the typical human tissue culture cell lines [21]. In addition, many lncRNA exons overlap protein-coding exons on the same strand making it difficult to determine the origin of the transcript counts. To ensure the veracity of differentially expressed lncRNAs, several complementary methods need to be employed. Determining lncRNA function is difficult, but protein-coding transcripts that co-express with lncRNA transcripts may offer some insight into lncRNA function. Likewise, pathway enrichment of co-expressed protein-coding genes may offer insight into relevant biological pathways involved in UC pathogenesis.
2. Materials and methods
2.1. Patient data
Gene expression data of mucosal gene expression were obtained from the Gene expression Omnibus (GEO) GSE128682 and represent sequencing data obtained from mucosal biopsies of treatment-naïve UC patients (n = 14) and normal control subjects (n = 16) [22].
2.2. Data analysis
A schematic overview of the data analysis methodological approach is shown in Fig. 1. The Gencode v36 (GRCh38.p13) reference genome (https://www.gencodegenes.org/) [23] was used for all alignments, annotations and visualization methods. All tests for differential expression were between UC and normal samples. Both Star aligner and Kallisto were used to align the Illumina generated fastq sequences. Star was used to generate a gene count matrix. Kallisto was used to create a transcript count matrix. DESeq2 was used to find DE genes and DE transcripts with an adjusted p value less than 0.05. LncRNAs were defined as those with transcript type or gene type equals lncRNA in the annotation gtf file. Stringtie v2.0.3 (https://github.com/gpertea/stringtie/releases) [24] was used to create a consensus set of transcripts from the Star aligned bam files. The Ballgown (https://www.bioconductor.org/packages/release/bioc/html/ballgown.html) [25] stattest using transcript FPKM as a metric was used to generate transcript q values from the set of Stringtie consensus transcripts. Granges is a software package that can identify genomic overlaps (https://bioconductor.org/packages/release/bioc/html/GenomicRanges.html) [26]. Granges was used to isolate lncRNA exons that did not overlap with known protein coding exons on the same strand. A matrix of Ballgown unique exon counts was created from the non-overlapping lncRNA exons. DESeq2 was used to identify differentially expressed exons from the exon matrix, adjusted p value < 0.05. Differentially expressed lncRNA met the following conditions: Star(gene) padj <0.05, Kallisto(transcript) padj <0.05, Ballgown (FPKM) qvalue <0.05, and at least one non protein overlapping lncRNA exon with padj <0.05. Only lncRNA transcripts with an average read count greater than 16 were considered.
Fig. 1.
Flow diagram representing the outline of experimental steps. Fastq data was aligned using several methods Star, Kallisto, and Ballgown. Differential expression was estimated by DESeq2 for Star (gene counts) and Kallisto (transcript counts), stattest for Ballgown (FPKM). LncRNA candidates were significantly differentially expressed in all three tests. GRanges was used to find non protein overlapping lncRNA exons. Ballgown unique exon counts and DESeq2 were used to ensure that candidates had at least one differentially expressed non protein overlapping exon. Bam read counts were then plotted over genome annotation to ensure exon read count alignment to annotation and comparison to background noise.
LncRNA annotation is constantly updated, therefore Biomart was used to check that the remaining differentially expressed lncRNA transcript type was currently annotated as lncRNA (https://bioconductor.org/packages/release/bioc/html/biomaRt.html). in latest Ensembl annotation. Finally, each significantly lncRNA was inspected visually (Fig. 2). By using Samtools [27] the read coverage for each candidate lncRNA region was extracted directly from the STAR aligned Bam files. LncRNA transcript read coverage was plotted over the genome reference exon structure using the Gviz [28] package. LncRNAs whose read coverage peaks aligned with reference exons, was greater than the local background, and did not completely overlap non lncRNA reference exons (Fig. 2) were considered as candidates.
Fig. 2.
Visualization of lncRNA candidates. LncRNA gene symbol and location are indicated in figure title. The upper part of the figure shows the Ballgown FPKM values for each lncRNA transcript labeled on the left. Low FPKM values are white/light yellow, higher FPKM values are darker orange/red. The lower part of the figure shows the average read counts for UC and normal controls over the genomic annotation. Read counts are shown on the lower panel y axis. Normal controls read counts are indicated in blue (n = 16) and UC read counts are indicated in red (n = 14). The genomic annotation used to align is shown under the read counts. Transcripts are labeled on the left. LncRNA transcripts in orange are considered valid candidates. Transcripts in grey are lncRNA transcripts that were not considered candidates. Transcripts in black are not annotated as lncRNA.
Co-expression analysis was used to select potential protein coding target transcripts for the fifteen uncharacterized lncRNAs depicted in Table 1. Transcripts with an absolute Pearson correlation greater than 0.85 were selected. If total lncRNA transcript targets were less than thirty, the top thirty most co-related transcripts were taken into consideration. The R ReactomePA package (https://bioconductor.org/packages/release/bioc/html/ReactomePA.html) [29] was used to find significantly enriched KEGG [30] using gene names of the co-expressed uncharacterized lncRNA transcripts.
Table 1.
List of uncharacterized lncRNAs in ulcerative colitis (UC).
| Transcript ID | Gene_Name | Ballgown (qvalue) | Kallisto (FC) | Kallisto (padj) | Star (baseMean) | Star (padj) | Exon (padj) |
|---|---|---|---|---|---|---|---|
| ENST00000669835.1 | AC110611.2 | 2.92E-06 | 1.51 | 2.38E-15 | 22.1 | 1.28E-14 | 1.33E-24 |
| ENST00000669140.1 | AL354743.2 | 0.002 | 1.03 | 3.56E-05 | 33.01 | 0 | 1.12E-09 |
| ENST00000424989.1 | LINC01137 | 0.004 | 0.56 | 0.009 | 86.22 | 0.004 | 1.16E-05 |
| ENST00000606723.2 | U91328.1 | 6.39E-05 | −0.79 | 1.21E-06 | 107.54 | 5.27E-17 | 0 |
| ENST00000553330.1 | LINC02313 | 0.04 | −0.84 | 0 | 20.52 | 7.32E-07 | 0.02 |
| ENST00000447171.2 | AC007255.1 | 0.002 | −0.93 | 3.66E-06 | 155.91 | 7.06E-10 | 8.46E-05 |
| ENST00000661542.1 | AL353572.4 | 2.94E-05 | −1.11 | 7.55E-13 | 35.5 | 5.29E-18 | 6.16E-07 |
| ENST00000658026.1 | LINC02405 | 0.018 | −1.12 | 0 | 82.57 | 1.76E-08 | 0.001 |
| ENST00000451240.1 | AC005550.2 | 0.007 | −1.26 | 1.10E-05 | 170.78 | 1.18E-08 | 0 |
| ENST00000432368.2 | THRB-AS1 | 5.95E-05 | −1.42 | 2.73E-11 | 10.52 | 1.06E-10 | 1.19E-08 |
| ENST00000656535.1 | AC007114.1 | 0.033 | −1.5 | 5.19E-05 | 41.7 | 0 | 0 |
| ENST00000416416.1 | GORAB-AS1 | 0.017 | −1.53 | 5.89E-05 | 45.44 | 2.71E-09 | 0 |
| ENST00000553425.5 | AL121790.2 | 0.001 | −1.66 | 1.90E-07 | 58.31 | 2.26E-07 | 2.60E-06 |
| ENST00000512915.5 | AC098487.1 | 0.036 | −1.98 | 5.37E-05 | 21.08 | 5.78E-14 | 1.10E-07 |
| ENST00000664281.1 | AC116345.4 | 0 | −2.13 | 1.37E-09 | 81.42 | 2.34E-13 | 4.18E-09 |
3. Results
3.1. Differentially expressed lncRNAs in treatment-naïve ulcerative colitis
DESeq2 on the STAR generated gene count matrix gave a total of 8615 differentially expressed (DE) lncRNA genes (padj <0.05) coding for a total of 28182 lncRNA transcripts (Fig. 1). Ballgown stattest (q value < 0.05) using FPKM values from Stringtie consensus sequences gave a total of 5614 DE lncRNA transcripts belonging to 4254 lncRNA genes. DESeq2 was used to perform differential expression analysis on transcripts obtained from the Kallisto aligner, which gave 9599 DE lncRNAs transcripts (padj <0.05) belonging to 6720 lncRNA genes. DESeq2 on the non-protein overlapping exon unique counts matrix resulted in 4073 lncRNA transcripts with at least one DE non-overlapping exon (padj <0.05). Combining the lncRNA results Ballgown (FPKM) qvalue <0.05, Star (gene) padj <0.05, Kallisto (transcript) padj <0.05, and non-protein overlapping exons (Ballgown unique exon count) padj <0.05 resulted in 2150 lncRNA candidates. Of the 2150 candidates, 1239 candidates were verified as biotype lncRNA in the latest ensemble annotation by a BioMart query (Supplementary Table 1). The entire analysis flowchart is shown in Fig. 1.
Visual inspection of lncRNAs candidates was done by plotting read coverage of lncRNAs over the exon structure defined in the genome reference annotation (Fig. 2). A total of 116 lncRNAs transcripts representing 99 lncRNA candidate genes were selected (Supplementary Fig. 1). Seven of the candidate lncRNAs were previously found to be dysregulated in IBD, fourteen have been observed in colorectal cancer, and six were related to inflammation and infection (Supplementary Table 2). All these 99 significantly differentially expressed lncRNAs are depicted in a heat map (Supplementary Fig. 2). Among these 99, fifteen lncRNAs have not been previously described and characterized in UC (Table 1). Principal component analysis (PCA) using the uncharacterized lncRNAs showed a clear separation between UC samples and normal samples. Principal component 1 (PC1) explained 64.1% of the total variance (Fig. 3A).
Fig. 3.
Principal component analysis (PCA). (A) PCA depicting 15 uncharacterized lncRNA transcripts presenting the difference between UC (n = 14; red)) and normal controls (n = 16; green). The first two components explain 64.1% and 8.3% of the variability in the lncRNA expression data. (B) PCA of differentially expressed coding transcripts (n = 686) which correlate to the uncharacterized lncRNAs (n = 15) presenting the difference between UC (n = 14; red) and normal controls (n = 16; green). The first two components explain 74,6% and 5.6% of the variability in the expression data.
3.2. Co-expression of lncRNAs with protein-coding genes
The 15 uncharacterized lncRNAs were then subjected to correlation analysis, which resulted in a total of 602 co-expressed protein-coding genes in correlation analysis (coefficient absolute 0.85 with correlation p-value < 0.05) (Supplementary Table 3). In addition, a PCA was performed on the differentially expressed protein-coding transcripts (n = 686) which correlated with the expression of the uncharacterized lncRNAs (Fig. 3B). Here, principal component (PC1) explained 74.6% of the total variance and a clear separation of UC and normal samples was seen. LncRNAs AC110611.2, GOARB-AS1, AC005550.2, and AC116345.4 were co-expressed with 190, 170,112, and 65 protein-coding transcripts, respectively. Correlation analysis showed that multiple protein-coding transcripts can co-express with a single lncRNA transcript and vice versa (Supplemental Table 3). Among the co-expressed transcripts were several protein-coding genes which related to inflammation and UC progression like interleukin 1B (IL-1B) [31], metalloproteinase 3 (MMP3), metalloproteinase 9 (MMP9) [32], and Vav guanine nucleotide exchange factor 3 (VAV3) [33]. Several pro-inflammatory cytokines such as IL-33, TNFSF10 and IL21R co-expressed with AC110611.2.
3.3. Pathway enrichment analysis
Genes corresponding to the correlated protein-coding transcripts were used for KEGG pathway enrichment. Two significantly enriched pathways with padj and qvalue <0.05 could be identified, the T cell receptor pathway and the lipid and atherosclerosis pathway. Seventeen and twelve genes, which co-expressed with the uncharacterized lncRNAs were found to be enriched in both pathways. Among them VAV3, lymphocyte cytosolic protein 2 (LCP2), and inducible T cell co-stimulator (ICOS), both of which play a role in vascular endothelial cell integrity [34], NK-cell mediated recognition of missing-self targets [35], and effective T-helper-cell responses [36]. To illustrate the correlations an example of a co-expression is shown in Fig. 4.
Fig. 4.
Co-expression plot of KEGG enriched pathways between lncRNA transcripts and their correlated protein coding transcripts. Co-expression plots of lipid and atherosclerosis pathway (A) and T cell receptor signaling (B) are indicated. LncRNA transcripts are listed on the x-axis, correlated protein coding transcripts on the y-axis. Transcript names are followed by a ‘+’ (UC expression greater than N expression) or a ‘-’ (N expression less than UC expression). Red dots indicate lncRNA transcript and protein coding transcript expression are positively correlated. Blue dots indicate where the lncRNA transcript and protein coding transcript expression are negatively correlated. Only correlations with an absolute value greater than 0.85 are shown.
4. Discussion
In this study, differentially expressed lncRNAs in treatment-naïve UC were explored by applying accurate quantification methods for lncRNA transcript detection. This study provides new knowledge of 15 previously uncharacterized lncRNAs which may be involvedin the regulation of the lipid and atherosclerosis and T cell receptor signaling pathways.
Accurate quantification of lncRNA transcripts is challenging. Therefore, several complementary methods along with visual inspection were applied to generate a set of lncRNAs that distinguish between UC samples and controls (Fig. 1). The majority of lncRNA transcripts are expressed at a significantly lower level than protein coding transcripts, making lncRNA transcription levels difficult to distinguish from the background noise [37]. Recent RNAseq studies have shown differences in intra-exonal coverage, which could have aroused from naturally occurring splice variants sharing part of an exon or could have been due to technical errors in library construction or sequencing [38]. In addition, some lncRNA's exons overlap with other non-lncRNA exons, making it difficult to determine the origin of read counts [39]. Lower counts and overlaps present challenges for lncRNA quantification. Therefore, only lncRNAs containing at least one differentially expressed lncRNA exon that did not overlap a protein coding exon were considered for this study. An example is given in Fig. 2, showing the lncRNA myocardial infarction associated transcript (MIAT). The MIAT read counts map well to the MIAT lncRNA exon annotation and aligns to a greater extent than protein-coding exons. This suggests that the majority of read counts come from MIAT exons and not any protein coding exon overlaps. The MIAT read counts in Fig. 2 are greater than the local background. Plotting the read counts over the annotation strengthened the ability to quantify lncRNA accurately.
Initial PCAs of the uncharacterized lncRNAs (Fig. 3A) and the corresponding correlated protein-coding transcripts (Fig. 3B) revealed a clear separation of UC samples from normal samples in both cases. This indicates that the chosen sample size is satisfactory to make assumptions on the significance of the results. KEGG enrichment analysis of uncharacterized lnc transcripts correlating with protein coding genes revealed in only two significantly enriched pathways, the lipid and atherosclerosis pathway, and the T cell receptor signaling pathway. The correlation plots for the two pathways are depicted in Fig. 4.
In the lipid and atherosclerosis pathway, lncRNA AC001611.2 expression correlated positively with four genes (MMP3, MMP9, IL-1 and CXCL3) whereas six other lncRNAs correlated negatively with the same genes (Fig. 4A). Perhaps these lncRNAs are involved in the modulation of inflammatory cytokines production, and immune cells migration during UC by regulating the expression of matrix metallopeptidases. A connection between impaired intestinal integrity, cytokine production, and monocytes migration has been reported to be associated with atherosclerosis [[40], [41], [42]]. A relationship between UC and atherosclerosis has been implicated [[43], [44], [45], [46], [47]]. The reported higher risk of cardiovascular events in UC patients may be pertinent in inflammation-mediated atherosclerosis [[48], [49], [50]] as inflammation and atherosclerosis have been proposed to share similar pathogenesis [51]. Therefore, the identified and previously unknown lncRNAs might qualify for possible new prognostic factors for UC patients with atherosclerosis.
LncRNAs may also play a role in T cell apoptosis during UC. LncRNAs AL354743.2 and LINC0113 correlated positively with the STAT3 transcription factor which induces the transcription of BCL2 and BCL-XL in T cells. The expression of these anti-apoptotic genes can increase the resistance of pathogenic T cells of lamina propria to apoptosis, leading to prolonged inflammation [52].
The T cell receptor-signaling pathway was the second significantly enriched pathway. Several genes involved in this pathway such as PTPRC (CD45), NFATc1, and RASGRP1 were differentially expressed in UC (Fig. 4B). The expression of PTPRC (CD45), a known IBD susceptibility gene, correlated positively with lncRNA AC110611.2 and correlated negatively with six lncRNAs depicted in Fig. 4B. Here, the lncRNAs might contribute to the activation of Cd4+ T cells which are key players in mediating the host protective and homeostatic responses to inflammation [53]. It is interesting to note that these lncRNAs might also play a role in the regulation of the expression of different patterns of alternatively spliced CD45 isoforms that have been shown to be associated with distinct functions [54]. T cell activation of cytokine production is also regulated by the expression of NFATc1 and RASGRP1 both of which correlated positively with lncRNA AC110611.2 and correlated negatively correlated with lncRNAs AC007255.1, AL353572.4, THRB-AS1, AL121790.2, and AC116345. Interestingly, RASGRP1 promotes inflammatory responses by enhancing the production of IL-6 by sponging with miRNA let-7a [55]. IL-6 has been shown to be positively associated with UC development and regulates intestinal barrier function via STAT3 [56].
The expression of lncRNA AC110611.2 correlated with numerous protein-coding transcripts (Supplementary Table 3). Apart from genes involved in the pathways discussed above (Fig. 4), several other genes co-expressed with AC110611.2 including many regulators of inflammatory immune responses such as ICOS, IL-21, Il-21R, and Sema7A [[57], [58], [59]].
Many of the up-regulated lncRNAs found by this methodological approach have been already identified and shown to be associated with IBD pathogenesis (Supplementary Table 2), such as small integral membrane protein 25 (SMIM25) [60], IFNG antisense RNA 1 (IFNG-AS1) [61], and DIO3 opposite strand RNA (DIO3OS) [62]. The observed downregulation of CDKN2B-AS1 is negatively correlated with inflammatory cytokines expression responsible for UC progression [12]. The upregulation of LINC01871 might indicate a dysregulation of T cell inflammatory responses in UC as has been reported for several other autoimmune diseases [63,64]. Overall, our study gives an insight into novel lncRNAs which potentially be involved in intestinal barrier function and immune cell development, activation, and migration. However, loss- and gain-of-function studies are required to verify the biological importance of expression of these lncRNA by in vitro and in vivo experiments. To what extent the uncharacterized lncRNAs contribute to the regulation of the T-cell receptor signaling pathway during UC progression has to be explored in more depth in the future.
5. Conclusion
This study revealed 15 lncRNAs, which have not been functionally annotated previously and which may be involved in the pathogenesis of UC. The applied methodological approaches together with a visual inspection of read counts over the annotation was key to identifying lncRNA's that were differentially regulated. The results may provide new potential diagnostic biomarkers and therapeutic targets for ulcerative colitis which may improve the understanding of the molecular pathogenesis of UC. However, if lncRNAs are going to be of use as future biomarkers for UC, more reliable approaches for lncRNAs profiling and reliable lncRNA quantification methods are required.
Funding
Not applicable
Ethical approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Availability of data
All data are within the manuscript and its Supporting Information files.
CRediT authorship contribution statement
Mithlesh Kumar Ray: Formal analysis, Validation, writing, Software, reviewing the final draft. Christopher G. Fenton: Data curation, Methodology, Investigation, Visualization, Validation, Software, Writing – review & editing. Ruth H. Paulssen: Conceptualization, Investigation, Project administration, Resources, Methodology, Supervision, writing, Writing – original draft.
Declaration of competing interest
The authors declare no conflict of interests regarding the publication of this manuscript.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.ncrna.2022.02.001.
Appendix A. Supplementary data
The following are the Supplementary data to this article:
References
- 1.Kobayashi T., Siegmund B., Le Berre C., Wei S.C., Ferrante M., Shen B., Bernstein C.N., Danese S., Peyrin-Biroulet L., Hibi T. Ulcerative colitis. Nat. Rev. Dis. Prim. 2020;6:74. doi: 10.1038/s41572-020-0205-x. [DOI] [PubMed] [Google Scholar]
- 2.Turpin W., Goethel A., Bedrani L., Croitoru K. Determinants of IBD heritability: genes, bugs, and more. Inflamm. Bowel Dis. 2018;24 doi: 10.1093/ibd/izy085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Saravanarajan K., Douglas A.R., Ismail M.S., Omorogbe J., Semenov S., Muphy G., O'Riordan F., McNamara D., Nakagome S. Genomic profiling of intestinal T-cell receptor repertoires in inflammatory bowel disease. Gene Immun. 2020;21:109–118. doi: 10.1038/s41435-020-0092-x. [DOI] [PubMed] [Google Scholar]
- 4.Momozawa Y., Dmitrieva J., Théâtre E., Deffontaine V., Rahmouni S., Charloteaux B., Crins F., Docampo E., Elansary M., Gori A.-S. IBD risk loci are enriched in multigenic regulatory modules encompassing putative causative genes. Nat. Commun. 2018;9:1–18. doi: 10.1038/s41467-018-04365-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lin L., Zhou G., Chen P., Wang Y., Han J., Chen M., He Y., Zhang S. Which long noncoding RNAs and circular RNAs contribute to inflammatory bowel disease? Cell Death Dis. 2020;11 doi: 10.1038/s41419-020-2657-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Giulietti M., Righetti A., Principato G., Piva F. LncRNA co-expression network analysis reveals novel biomarkers for pancreatic cancer. Carcinogenesis. 2018;39:1016–1025. doi: 10.1093/carcin/bgy069. [DOI] [PubMed] [Google Scholar]
- 7.Li C.Y., Liang G.Y., Yao W.Z., Sui J., Shen X., Zhang Y.Q., Peng H., Hong W.W., Ye Y.C., Zhang Z.Y., Zhang W.H., Yin L.H., Pu Y.P. Integrated analysis of long non-coding RNA competing interactions reveals the potential role in progression of human gastric cancer. Int. J. Oncol. 2016 doi: 10.3892/ijo.2016.3407. [DOI] [PubMed] [Google Scholar]
- 8.Xie Y., Zhang Y., Du L., Jiang X., Yan S., Duan W., Li J., Zhan Y., Wang L., Zhang S., Li S., Wang L., Xu S., Wang C. Circulating long noncoding RNA act as potential novel biomarkers for diagnosis and prognosis of non-small cell lung cancer. Molecular Oncology. 2018;12:648–658. doi: 10.1002/1878-0261.12188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zhu M., Xie J. lncRNA MALAT1 promotes ulcerative colitis by upregulating lncRNA ANRIL. Dig. Dis. Sci. 2020;65:3191–3196. doi: 10.1007/s10620-020-06093-w. [DOI] [PubMed] [Google Scholar]
- 10.Yu T., Meng F., Xie M., Liu H., Zhang L., Chen X. Digestive Diseases; 2020. LncRNA PMS2L2 Downregulates miR-24 through Methylation to Suppress Cell Apoptosis in Ulcerative Colitis. [DOI] [PubMed] [Google Scholar]
- 11.Yarani R., Mirza A.H., Kaur S., Pociot F. The emerging role of lncRNAs in inflammatory bowel disease. Exp. Mol. Med. 2018;50:1–14. doi: 10.1038/s12276-018-0188-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Tian Y., Cui L., Lin C., Wang Y., Liu Z., Miao X. LncRNA CDKN2B-AS1 relieved inflammation of ulcerative colitis via sponging miR-16 and miR-195. Int. Immunopharm. 2020;88 doi: 10.1016/j.intimp.2020.106970. [DOI] [PubMed] [Google Scholar]
- 13.Chen S.W., Wang P.Y., Liu Y.C., Sun L., Zhu J., Zuo S., Ma J., Li T.Y., Zhang J.L., Chen G.W., Wang X., Zhu Q.R., Zheng Y.W., Chen Z.Y., Yao Z.H., Pan Y.S. 2016. Effect of Long Noncoding RNA H19 Overexpression on Intestinal Barrier Function and its Potential Role in the Pathogenesis of Ulcerative Colitis, Inflammatory Bowel Diseases. [DOI] [PubMed] [Google Scholar]
- 14.Rankin C.R., Lokhandwala Z.A., Huang R., Pekow J., Pothoulakis C., Padua D. Life Sciences; 2019. Linear and Circular CDKN2B-AS1 Expression Is Associated with Inflammatory Bowel Disease and Participates in Intestinal Barrier Formation. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Padua D., Mahurkar-Joshi S., Law I.K.M., Polytarchou C., Vu J.P., Pisegna J.R., Shih D., Iliopoulos D., Pothoulakis C. A long noncoding RNA signature for ulcerative colitis identifies IFNG-AS1 as an enhancer of inflammation. Am. J. Physiol. Gastrointest. Liver Physiol. 2016 doi: 10.1152/ajpgi.00212.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Liu R., Tang A., Wang X., Chen X., Zhao L., Xiao Z., Shen S. Inhibition of lncRNA NEAT1 suppresses the inflammatory response in IBD by modulating the intestinal epithelial barrier and by exosome-mediated polarization of macrophages. Int. J. Mol. Med. 2018;42:2903–2913. doi: 10.3892/ijmm.2018.3829. [DOI] [PubMed] [Google Scholar]
- 17.Johnsson P., Lipovich L., Grandér D., Morris K.V. Evolutionary conservation of long non-coding RNAs; Sequence, structure, function. Biochim. Biophys. Acta Gen. Subj. 2014 doi: 10.1016/j.bbagen.2013.10.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Statello L., Guo C.-J., Chen L.-L., Huarte M. Gene regulation by long non-coding RNAs and its biological functions. Nat. Rev. Mol. Cell Biol. 2021;22:96–118. doi: 10.1038/s41580-020-00315-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zacharopoulou E., Ioakeim S., Tzouvala M., Karamanolis G., Vezakis A., Gazouli M. 2018. Correlation of Polymorphisms in Long Non-coding RNAs with the Pathogenesis of Inflammatory Bowel Diseases, Digestive and Liver Disease. [DOI] [PubMed] [Google Scholar]
- 20.Derrien T., Johnson R., Bussotti G., Tanzer A., Djebali S., Tilgner H., Guernec G., Martin D., Merkel A., Knowles D.G., Lagarde J., Veeravalli L., Ruan X., Ruan Y., Lassmann T., Carninci P., Brown J.B., Lipovich L., Gonzalez J.M., Thomas M., Davis C.A., Shiekhattar R., Gingeras T.R., Hubbard T.J., Notredame C., Harrow J., Guigó R. The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 2012 doi: 10.1101/gr.132159.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Palazzo A.F., Lee E.S. Non-coding RNA: what is functional and what is junk? Front. Genet. 2015;5:1–11. doi: 10.3389/fgene.2015.00002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Fenton C.G., Taman H., Florholmen J., Sørbye S.W., Paulssen R.H. Transcriptional signatures that define ulcerative colitis in remission. Inflamm. Bowel Dis. 2021;27:94–105. doi: 10.1093/ibd/izaa075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Frankish A., Diekhans M., Ferreira A.M., Johnson R., Jungreis I., Loveland J., Mudge J.M., Sisu C., Wright J., Armstrong J., Barnes I., Berry A., Bignell A., Carbonell Sala S., Chrast J., Cunningham F., Di Domenico T., Donaldson S., Fiddes I.T., García Girón C., Gonzalez J.M., Grego T., Hardy M., Hourlier T., Hunt T., Izuogu O.G., Lagarde J., Martin F.J., Martínez L., Mohanan S., Muir P., Navarro F.C.P., Parker A., Pei B., Pozo F., Ruffier M., Schmitt B.M., Stapleton E., Suner M.M., Sycheva I., Uszczynska-Ratajczak B., Xu J., Yates A., Zerbino D., Zhang Y., Aken B., Choudhary J.S., Gerstein M., Guigó R., Hubbard T.J.P., Kellis M., Paten B., Reymond A., Tress M.L., Flicek P. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019 doi: 10.1093/nar/gky955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Pertea M., Pertea G.M., Antonescu C.M., Chang T.C., Mendell J.T., Salzberg S.L. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015 doi: 10.1038/nbt.3122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fu J., Frazee A.C., Collado-Torres L., Jaffe A.E., Leek J.T. Ballgown: flexible, isoform-level differential expression analysis. R Package Version. 2018;2 [Google Scholar]
- 26.Lawrence M., Huber W., Pagès H., Aboyoun P., Carlson M., Gentleman R., Morgan M.T., Carey V.J. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 2013 doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hahne F., Ivanek R. Methods in Molecular Biology. 2016. Visualizing genomic data using Gviz and bioconductor. [DOI] [PubMed] [Google Scholar]
- 29.Yu G., He Q.-Y. ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization. Mol. Biosyst. 2016;12:477–479. doi: 10.1039/c5mb00663e. [DOI] [PubMed] [Google Scholar]
- 30.Kanehisa M., Sato Y., Kawashima M., Furumichi M., Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–D462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ranson N., Veldhuis M., Mitchell B., Fanning S., Cook A.L., Kunde D., Eri R. NLRP3-dependent and-independent processing of interleukin (IL)-1β in active ulcerative colitis. Int. J. Mol. Sci. 2019;20:57. doi: 10.3390/ijms20010057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Siloşi I., Boldeanu M.V., Mogoantă S.Ş., Ghiluşi M., Cojocaru M., Biciuşcă V., Cojocaru I.M., Avrămescu C.S., Gheonea D.I., Siloşi C.A. Matrix metalloproteinases (MMP-3 and MMP-9) implication in the pathogenesis of inflammatory bowel disease (IBD) Rom. J. Morphol. Embryol. 2014;55:1317–1324. [PubMed] [Google Scholar]
- 33.Levison S.E., Fisher P., Hankinson J., Zeef L., Eyre S., Ollier W.E., McLaughlin J.T., Brass A., Grencis R.K., Pennock J.L. Genetic analysis of the Trichuris muris-induced model of colitis reveals QTL overlap and a novel gene cluster for establishing colonic inflammation. BMC Genom. 2013;14:1–13. doi: 10.1186/1471-2164-14-127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Hilfenhaus G., Nguyen D.P., Freshman J., Prajapati D., Ma F., Song D., Ziyad S., Cuadrado M., Pellegrini M., Bustelo X.R. Vav3-induced cytoskeletal dynamics contribute to heterotypic properties of endothelial barriers. JCB (J. Cell Biol.) 2018;217:2813–2830. doi: 10.1083/jcb.201706041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lampe K., Endale M., Cashman S., Fang H., Mattner J., Hildeman D., Hoebe K. Slp‐76 is a critical determinant of NK‐cell mediated recognition of missing‐self targets. Eur. J. Immunol. 2015;45:2072–2083. doi: 10.1002/eji.201445352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Tafuri A., Shahinian A., Bladt F., Yoshinaga S.K., Jordana M., Wakeham A., Boucher L.-M., Bouchard D., Chan V.S.F., Duncan G. ICOS is essential for effective T-helper-cell responses. Nature. 2001;409:105–109. doi: 10.1038/35051113. [DOI] [PubMed] [Google Scholar]
- 37.Freedman J.E., Miano J.M., L National heart and blood institute workshop participants*, challenges and opportunities in linking long noncoding RNAs to cardiovascular, lung, and blood diseases. Arterioscler. Thromb. Vasc. Biol. 2017;37:21–25. doi: 10.1161/ATVBAHA.116.308513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lahens N.F., Kavakli I.H., Zhang R., Hayer K., Black M.B., Dueck H., Pizarro A., Kim J., Irizarry R., Thomas R.S. IVT-seq reveals extreme bias in RNA sequencing. Genome Biol. 2014;15:1–15. doi: 10.1186/gb-2014-15-6-r86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Robert C., Watson M. Errors in RNA-Seq quantification affect genes of relevance to human disease. Genome Biol. 2015;16:1–16. doi: 10.1186/s13059-015-0734-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lewis C.V., Taylor W.R. Intestinal barrier dysfunction as a therapeutic target for cardiovascular disease. Am. J. Physiol. Heart Circ. Physiol. 2020;319:H1227–H1233. doi: 10.1152/ajpheart.00612.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Li T., Li X., Feng Y., Dong G., Wang Y., Yang J. Mediators of Inflammation; 2020. The Role of Matrix Metalloproteinase-9 in Atherosclerotic Plaque Instability; p. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Czubkowski P., Osiecki M., Szymańska E., Kierkuś J. The risk of cardiovascular complications in inflammatory bowel disease. Clin. Exp. Med. 2020:1–11. doi: 10.1007/s10238-020-00639-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Dagli N., Poyrazoglu O.K., Ferda Dagli A., Sahbaz F., Karaca I., Ali Kobat M., Bahcecioglu I.H. Is inflammatory bowel disease a risk factor for early atherosclerosis? Angiology. 2010;61:198–204. doi: 10.1177/0003319709333869. [DOI] [PubMed] [Google Scholar]
- 44.Weissman S., Sinh P., Mehta T.I., Thaker R.K., Derman A., Heiberger C., Qureshi N., Amrutiya V., Atoot A., Dave M., Tabibian J.H. Atherosclerotic cardiovascular disease in inflammatory bowel disease: the role of chronic inflammation. World J. Gastrointest. Pathophysiol. 2020;11:104–113. doi: 10.4291/WJGP.V11.I5.104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Roifman I., Sun Y.C., Fedwick J.P., Panaccione R., Buret A.G., Liu H., Rostom A., Anderson T.J., Beck P.L. Evidence of endothelial dysfunction in patients with inflammatory bowel disease. Clin. Gastroenterol. Hepatol. 2009;7:175–182. doi: 10.1016/J.CGH.2008.10.021. [DOI] [PubMed] [Google Scholar]
- 46.Bernstein C.N., Wajda A., Blanchard J.F. The incidence of arterial thromboembolic diseases in inflammatory bowel disease: a population-based study. Clin. Gastroenterol. Hepatol. : Off. Clin. Practice J. Am. Gastroenterological Assoc. 2008;6:41–45. doi: 10.1016/J.CGH.2007.09.016. [DOI] [PubMed] [Google Scholar]
- 47.Yarur A.J., Deshpande A.R., Pechman D.M., Tamariz L., Abreu M.T., Sussman D.A. Inflammatory bowel disease is associated with an increased incidence of cardiovascular events. Am. J. Gastroenterol. 2011;106:741–747. doi: 10.1038/AJG.2011.63. [DOI] [PubMed] [Google Scholar]
- 48.Rungoe C., Basit S., Ranthe M.F., Wohlfahrt J., Langholz E., Jess T. Risk of ischaemic heart disease in patients with inflammatory bowel disease: a nationwide Danish cohort study. Gut. 2013;62:689–694. doi: 10.1136/GUTJNL-2012-303285. [DOI] [PubMed] [Google Scholar]
- 49.Kristensen S.L., Ahlehoff O., Lindhardsen J., Erichsen R., Jensen G.V., Torp-Pedersen C., Nielsen O.H., Gislason G.H., Hansen P.R. Disease activity in inflammatory bowel disease is associated with increased risk of myocardial infarction, stroke and cardiovascular death--a Danish nationwide cohort study. PLoS One. 2013;8 doi: 10.1371/JOURNAL.PONE.0056944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Singh S., Loftus E.V. Cardiovascular risk in inflammatory bowel disease: it's a heartache! Gastroenterology. 2013;145:1484–1486. doi: 10.1053/j.gastro.2013.10.030. [DOI] [PubMed] [Google Scholar]
- 51.Phipps R.P. Atherosclerosis: the emerging role of inflammation and the CD40-CD40 ligand system. Proc. Natl. Acad. Sci. U.S.A. 2000;97:6930–6932. doi: 10.1073/PNAS.97.13.6930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Sugimoto K. Role of STAT3 in inflammatory bowel disease. World J. Gastroenterol. 2008;14:5110–5114. doi: 10.3748/wjg.14.5110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Imam T., Park S., Kaplan M., Olson M. Vol. 9. 2018. effector T Helper Cell Subsets in inflammatory Bowel Diseases; p. 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Romero-Barrios N., Legascue M.F., Benhamed M., Ariel F., Crespi M. Splicing regulation by long noncoding RNAs. Nucleic Acids Res. 2018;46:2169. doi: 10.1093/NAR/GKY095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Wang C., Li X., Yu C., Wang L., Deng R., Liu H., Chen Z., Zhang Y., Fan S., Sun H. Ras guanine nucleotide exchange factor RasGRP1 promotes acute inflammatory response and restricts inflammation-contributed cancer cell growth. bioRxiv. 2021 doi: 10.1038/s41467-022-34659-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Li Y., Jia Y., Cui T., Zhang J. IL-6/STAT3 signaling pathway regulates the proliferation and damage of intestinal epithelial cells in patients with ulcerative colitis via H3K27ac. Exp. Ther. Med. 2021;22:1–9. doi: 10.3892/etm.2021.10322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Yu J., He S., Liu P., Hu Y., Wang L., Wang X., Han Y., Zhu X. Interleukin-21 promotes the development of ulcerative colitis and regulates the proliferation and secretion of follicular T helper cells in the colitides microenvironment. Mol. Med. Rep. 2015;11:1049–1056. doi: 10.3892/mmr.2014.2824. [DOI] [PubMed] [Google Scholar]
- 58.Suzuki K., Okuno T., Yamamoto M., Pasterkamp R.J., Takegahara N., Takamatsu H., Kitao T., Takagi J., Rennert P.D., Kolodkin A.L. Semaphorin 7A initiates T-cell-mediated inflammatory responses through α1β1 integrin. Nature. 2007;446:680–684. doi: 10.1038/nature05652. [DOI] [PubMed] [Google Scholar]
- 59.Körner A., Bernard A., Fitzgerald J.C., Alarcon-Barrera J.C., Kostidis S., Kaussen T., Giera M., Mirakaj V. Sema7A is crucial for resolution of severe inflammation. Proc. Natl. Acad. Sci. Unit. States Am. 2021;118 doi: 10.1073/pnas.2017527118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Haberman Y., Benshoshan M., Di Segni A., Dexheimer P.J., Braun T., Weiss B., Walters T.D., Baldassano R.N., Noe J.D., Markowitz J., Rosh J., Heyman M.B., Griffiths A.M., Crandall W.V., Mack D.R., Baker S.S., Kellermayer R., Patel A., Otley A., Steiner S.J., Gulati A.S., Guthery S.L., Leleiko N., Moulton D., Kirschner B.S., Snapper S., Avivi C., Barshack I., Oliva-Hemker M., Cohen S.A., Keljo D.J., Ziring D., Anikster Y., Aronow B., Hyams J.S., Kugathasan S., Denson L.A. Long ncRNA landscape in the ileum of treatment-naive early-onset crohn disease. Inflamm. Bowel Dis. 2018;24:346–360. doi: 10.1093/ibd/izx013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Mirza A.H., Berthelsen C.H.B., Seemann S.E., Pan X., Frederiksen K.S., Vilien M., Gorodkin J., Pociot F. Transcriptomic landscape of lncRNAs in inflammatory bowel disease. Genome Med. 2015 doi: 10.1186/s13073-015-0162-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Wang S., Hou Y., Chen W., Wang J., Xie W., Zhang X., Zeng L. KIF9-AS1, LINC01272 and DIO3OS lncRNAs as novel biomarkers for inflammatory bowel disease. Mol. Med. Rep. 2018 doi: 10.3892/mmr.2017.8118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Joachims M.L., Khatri B., Tessneer K.L., Stolarczyk A.M., Wiley G.B., Rasmussen A., Guthridge J., James J.A., Scofield R.H., Sivils K.L. 2020. Op0140 Dysregulated Expression of the Long Non-coding RNA, Linc01871, Implicated in Sjögren’s Syndrome Pathogenesis. [Google Scholar]
- 64.Steel K.J.A., Srenathan U., Ridley M., Durham L.E., Wu S.Y., Ryan S.E., Hughes C.D., Chan E., Kirkham B.W., Taams L.S., Polyfunctional Proinflammatory. Tissue-resident memory phenotype and function of synovial interleukin-17A+CD8+ T cells in psoriatic arthritis. Arthritis Rheumatol. 2020 doi: 10.1002/art.41156. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are within the manuscript and its Supporting Information files.




