Abstract
Our genomes contain the blueprint of what makes us human and many indications as to why we develop disease. Until the last 10 years, most studies had focussed on protein‐coding genes, more specifically DNA sequences coding for proteins. However, this represents less than 5% of our genomes. The other 95% is referred to as the ‘dark matter’ of our genomes, our understanding of which is extremely limited. Part of this ‘dark matter’ includes regions that give rise to RNAs that do not code for proteins. A subset of these non‐coding RNAs are long non‐coding RNAs (lncRNAs), which in particular are beginning to be dissected and their importance to human health revealed. To improve our understanding and treatment of disease it is vital that we understand the molecular and cellular function of lncRNAs, and how their misregulation can contribute to disease. It is not yet clear what proportion of lncRNAs is actually functional; conservation during evolution is being used to understand the biological importance of lncRNA. Here, we present key themes within the field of lncRNAs, emphasising the importance of their roles in both the nucleus and the cytoplasm of cells, as well as patterns in their modes of action. We discuss their potential functions in development and disease using examples where we have the greatest understanding. Finally, we emphasise why lncRNAs can serve as biomarkers and discuss their emerging potential for therapy. © 2020 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Keywords: long non‐coding RNA, long intergenic non‐coding RNA, anti‐sense lncRNAs, translation, conservation, stem cells, development, X chromosome inactivation, neurodegenerative disease, cancer, diabetes
Introduction
Only 4% of the human genome codes for proteins, corresponding to ~20 000 protein‐coding genes, whereas ~85% of the genome can be transcribed into RNA. These additional transcriptional events represent part of the ‘dark matter’ of our genome. If these RNAs do not code for proteins, what do they do? Until relatively recently, most research focused on understanding the function and deregulation during disease of the 4% of the genome that codes for proteins. If we are truly to understand the genetic causes of disease, we need to look outside protein‐coding sequences, particularly at regions of non‐coding transcription. These regions produce a range of types and sizes of non‐coding RNAs, the most numerous of which are the long non‐coding RNAs (lncRNAs). Our understanding of lncRNAs has been transformed in the last 10 years. Nonetheless, relatively few lncRNAs have been characterised in detail and even fewer have had their functions characterised. Many lncRNAs have been found to be associated with a range of human diseases, but our understanding remains limited on exactly how these lncRNAs contribute to disease.
What are lncRNAs?
LncRNAs are RNAs of >200 nucleotides (nt) in length that are not thought to code for proteins. Although our appreciation and understanding of lncRNA function and importance has exploded in the last decade, the first lncRNAs were discovered in the 1990s: BC200, H19 1, and Xist 2. In the post‐genomic era, extensive and deep RNA‐Seq has revealed the existence of huge numbers of novel RNA transcripts, including lncRNAs. Many of these novel transcripts are low in abundance and so were not previously identified. Several consortia have been responsible for sequencing RNA from a variety of tissues, cell types, organisms, and disease states, and we now have a much more precise view of which RNA transcripts are expressed, and when and where (GENCODE 3, GTEX 4, FANTOM 5).
LncRNA genes are annotated as such because their RNAs do not contain large open reading frames that encode protein products (>100 amino acids). Of interest, although the number of protein‐coding genes does not substantially vary between Drosophila and human, the number of lncRNA genes does (Figure 1). Therefore, lncRNAs could represent part of the regulation system that enables higher eukaryotes to be more complex.
The lncRNA field now represents one of the most exciting and fast‐moving fields in biology. Although our understanding of lncRNAs has increased, a key question remains: how many lncRNAs are functional, or are the majority of lncRNAs the result of spurious transcription events? Experimental evidence suggests that >80% of lncRNAs possess biochemical activity such as protein binding 6. However, the importance or function of most of these lncRNA–protein interactions has yet to be validated. Although the function of many lncRNAs remains elusive, many have been shown to be associated with human health and disease. A key avenue going forward will be to dissect how lncRNAs that contribute to disease do so mechanistically.
Categories of lncRNA
The molecular nature of lncRNAs varies and has the potential to influence their function and localisation. Approximately 50% of lncRNAs possess a polyA tail and 98% of human lncRNAs are spliced 7, similar to mRNAs. Many lncRNAs also possess m7G caps. These mRNA‐like features contribute to the ability of many lncRNAs to exit the nucleus and enter into pathways in which mRNAs take part. The majority of research over the last 10 years has focused on those lncRNAs that remain in the nucleus, but sequencing RNAs in the cytoplasm has revealed that many lncRNAs are also present in the cytoplasm. The relative abundance of an lncRNA between the nucleus and cytoplasm can help reveal function and interacting partners. Although many similarities exist between lncRNAs and mRNAs, a key molecular difference is their cross‐species conservation; lncRNAs are not well conserved.
LncRNAs can be categorised in several different ways, most simply by their genomic location and structure. These are not mutually exclusive categories, however. This method of categorisation can suggest potential lncRNA function but does not infer mechanism of action. Intergenic lncRNAs (lincRNAs) are those lncRNAs that do not overlap with any other genes (protein‐coding or non‐coding) and are >1 kb away from neighbouring genes (Figure 2A). These are the most straightforward lncRNAs to which to assign function because genomic mutations can be unequivocally assigned [e.g. clustered regularly interspaced short palindromic repeats (CRISPR)]. Many lncRNAs overlap other genes and can either be in the sense or anti‐sense orientation to those other genes. Anti‐sense lncRNAs inherently possess sequence elements that will base pair to other RNAs, that is, mRNAs from protein‐coding genes they are anti‐sense to (Figure 2B). Sense lncRNAs are located within other genes, but in the sense direction (Figure 2C). lncRNAs produced from within the introns of other genes are termed intronic lncRNAs (Figure 2D). The final group are lncRNA genes, which are transcribed from the same region in the genome as another gene, but in the opposite direction: bi‐directional lncRNAs (Figure 2E). Intergenic and anti‐sense are by far the most common types of lncRNAs in humans (Figure 2F).
The link between lncRNA categories and potential function arises from the lncRNA's genomic location relative to potential target genes. The interaction of lncRNAs and target genes could take place at the site of transcription or somewhere else in the cell. Anti‐sense lncRNAs have the potential to regulate their anti‐sense genes at the point of transcription, acting in cis. Equally, lncRNAs can be transported to other locations in the cell, termed trans‐acting lncRNAs (see the glossary for definitions). On the simplest level, this might also be at the point of transcription of the target gene, which is elsewhere in the genome. The ability of lncRNAs to act in trans comes from their ability to base‐pair specifically with other RNAs and DNA, as well as to bind to proteins (Figure 3). These lncRNA complexes can then play roles in a number of gene expression processes and fall into several key mechanistic categories. lncRNAs can act as scaffolds, providing a site for other interactions. More generally, they can act to recruit protein complexes, which can be based on their sequence specificity. Forming lncRNA–protein complexes is an essential aspect for many lncRNA functions characterised so far. Some of these interactions can act as decoys, preventing proteins accessing other RNAs (RBP decoy). Alternatively, lncRNAs can act through specific base‐pairing with other RNAs, such as microRNAs (miRNAs). This can act to sponge miRNAs from other binding events. lncRNA–mRNA interactions can also act to regulate mRNA levels by increasing or decreasing their stability.
One of the most controversial areas of lncRNA biology is whether they are functional at all. Although functions have been characterised for some lncRNAs, they remain the minority. Much evidence suggests that the majority of lncRNAs are not functional 8. Some lncRNAs are thought only to possess activity as a result of their transcription; the lncRNA molecules produced from these transcription events do not have functions.
Conservation and evolution of lncRNAs
Traditionally, evolutionary conservation has been used as a proxy for functionality. However, when we compare lncRNAs to canonical protein‐coding genes, they are found to be poorly conserved at the sequence level. lncRNAs also often lack orthologues in other species, and upon comparison with protein‐coding sequences or mRNA untranslated regions (UTRs), lncRNA exons are found to evolve at a faster rate 9, 10. This provides evidence that lncRNA genes are ‘junk DNA’, and has led to the hypothesis that lncRNA transcripts are the products of pervasive transcription, a common phenomenon often dismissed as ‘biological noise’.
Despite this, lncRNAs are in fact under stronger selective pressure than neutrally evolving sequences such as introns 9, 11, and their apparent lack of sequence conservation cannot be taken at face value. Many of the mechanisms of action of lncRNAs have low sequence constraints. For example, interacting with and sequestering RNA‐binding proteins (RBPs) requires sequence conservation over only small portions of the total sequence, in the range of 10 nt.
Generally, lncRNA genes are also small relative to protein‐coding genes, and this shortening of potential areas of conserved sequence reduces the effectiveness of standard bioinformatic detection methods. Solving this issue is an ever‐growing area of research and includes the adaptation of existing methods and the development of new ones, for example, methods that focus on the promoter regions of lncRNAs 12. Within vertebrates, thousands of homologues for human lincRNAs have now been found with shared expression patterns, despite sharing very short patches of sequence conservation 13. This opens the possibility that far more lncRNAs are in fact functional.
LncRNAs are also found to be syntenically conserved across multiple species. The human lncRNA CHASERR (CHD2 adjacent, suppressive regulatory RNA) is found upstream of Chd2 (chromodomain helicase DNA binding protein 2) in both mice and humans and exhibits further conservation throughout the vertebrate lineage (Table 1). This has provided an ideal model to investigate CHASERR as a potential target to control levels of Chd2, a protein linked to human neuronal diseases ranging from epilepsy to neurodevelopmental delay 14, 15.
Table 1.
LncRNA | Subcellular localisation | Type | Function | Conservation status | Disease | References |
---|---|---|---|---|---|---|
CHASERR/Chaserr | Nuclear | Intergenic | Maintains Chd2 expression levels. | Sequentially and syntenically conserved between humans and mice. Extron–intron structure and some sequence conserved across vertebrates. | Chd2 is implicated in neurological disease. | 14 |
LINC00261 | Nuclear | Intergenic | Negative regulator of cell growth. | Syntenically homologous from humans to sea urchins. Some sequence homology in first exon. | Downregulated in multiple cancers, including endometrial and gastric. | 13, 16 |
Cerox1 (Figure 6B) | Cytoplasmic | Intergenic, bi‐directional | miRNA decoy, regulates abundance of mitochondrial complex 1 transcript. | Sequentially and syntenically conserved between humans and mice. Sequential conservation of exon 2 across mammals. | May have links to neurological diseases. | 18 |
Uchl1‐AS1 (Figures 4 and 6C) | Nuclear/Cytoplasmic | Anti‐sense | Translation regulation (lncRNA–mRNA interaction). | Conserved between humans and mice. | Downregulated in a neurochemical model of Parkinson's disease. | 78 |
PAPAS (Figures 3A and 5A) | Nuclear (Nucleolus) | Anti‐sense | Represses rRNA synthesis at elevated temperatures. | Not examined | Hepatocellular carcinoma, triple negative breast cancer and aging. | 96, 97 |
XIST (Figure 4) | Nuclear (Nuclear matrix) | Intergenic | X Chromosome inactivation. | In all placental mammals, with varying exonic and sequential conservation. | Its disruption can lead to cancer. | 98, 99 |
MEG3 (Figure 5B) | Nuclear | Intergenic | Acts as a tumour suppressor by stimulating p53. Regulates TGF‐β pathway genes. RNA–RNA interactions. | Tertiary structure conservation across most placental mammals. Unreliable detection across marsupials. | Associated with Huntington's disease and diabetic retinopathy. Tumour suppressor. | 100, 101, 102 |
linc‐RoR (Figure 3C) | Cytoplasmic | Intergenic | miRNA sponge, role in maintenance of pluripotency. | No evidence | Promotes oncogenesis in human Esophageal Squamous Cell Carcinoma. | 63, 64 |
lincRNA‐p21 (Figure 6E) | Nuclear | Intergenic |
Interacts with hnRNPK; Repressor of p53‐dependent transcriptional responses. Highly enriched in exosomes |
Alu sequence elements conserved in mouse | Oncogenic role in prostate cancer. | 48, 103, 104 |
HOTAIR | Nuclear | Anti‐sense | Hox gene expression regulation during development | Strong sequence conservation within primates, but poor across mammals. Evidence of structural conservation. | Oncogenic role in cancers including breast, gastric, colorectal and cervical. | 105, 106, 107, 108 |
yylncT (Figure 3C) | Nuclear | Bi‐directional | Required for correct lineage specification during development | A syntenic transcript was detected in E6.5 mouse embryos | Protects development from aberrant de novo methylation. | 62 |
HOTTIP (Figure 5C) |
Nuclear |
Anti‐sense | Participates in the spatial regulation of Hox gene expression during development | Conserved in mice and humans | Lung cancers. | 109, 110 |
NBAT‐1 | Nuclear | Anti‐sense |
Transcription regulation, miRNA sponge. |
Some sequence conservation across mammals. | Significantly lower expression in high‐risk neuroblastoma tumours. Potential biomarker for neuroblastoma progression. | 70 |
LINK‐A/LINC01139 (Figure 7B) | Nuclear/Cytoplasmic | Intergenic | Lipid binding, kinase activation. | No evidence | Upregulated in cancer, correlated with poor prognosis in breast cancer. | 73 |
BC200/ BCYRN1 (Figure 7C) | Cytoplasmic | Intergenic | Translation initiation repression in dendrites (RNPs) | Conserved in primates, has an orthologue in rodents | Implicated in Alzheimer's disease. | 75 |
HTT‐AS (Figure 7D) | Nuclear | Anti‐sense | Transcription regulation | No evidence | Associated with Huntington's disease. | 79 |
Another lncRNA, LINC00261, exhibits a lower level of sequence conservation than CHASERR. It shares only some sequence homology between mammals and fish in the first exon (Table 1). However, LINC00261 is also syntenically homologous in a wide range of species, from humans to sea urchins 13. Found downstream of FOXA2, LINC00261 is downregulated in multiple cancers and its overexpression inhibits cancer cell invasion, proliferation, and migration 16. In many cases, despite a lack of detectable sequence homology, the act of transcription of these syntenic lncRNAs could affect the expression of the neighbouring gene, demonstrating a conserved position and cis‐regulatory function.
For the majority of conserved lncRNAs, only a moderate level of sequence identity is observed 17. This is demonstrated by Cerox1, an intergenic, bi‐directional lncRNA found in mice (Table 1). Cerox1 regulates the abundance of the mitochondrial complex 1 transcript by acting as a miRNA decoy, therefore modulating its activity. Although Cerox1 is conserved at the sequence level and is syntenically homologous with a human homologue, the level of conservation drops rapidly in more distant species. Across eutherian mammals, conservation is found only in the second exon of Cerox1 18.
Of interest, the regulation and tissue specificity of lncRNAs are conserved to a level comparable to mRNAs 13. Specifically, conservation of lncRNA promoters is as strong as that of protein‐coding gene promoters 19. This suggests that selective constraints are often acting at the transcriptional level. However, lincRNA transcription also evolves at a rapid rate; only 72% of human lincRNAs are also expressed in macaque, compared to 98% of human protein‐coding genes in all primates 19.
A further complication in the identification of conserved lncRNAs is their evolution; the origin of lncRNAs is for the most part unknown. They also evolve rapidly, with the majority of lncRNAs found to be lineage specific 13. Due to the low levels of sequence conservation, it is generally assumed that lncRNAs are unlikely to evolve via gene duplication, a common mechanism for protein‐coding genes. Potential routes include protein‐coding genes losing their original function via truncation and loss of coding capacity and becoming lncRNAs. This mechanism is thought to account for a proportion of conserved lncRNAs, including Xist 20. A further possibility is that non‐coding portions of the genome may associate with a promoter, become transcribed, and eventually gain function as an lncRNA.
Subcellular localisation
If an lncRNA is functional, its location in the cell will be important in understanding this function. Initial high‐throughput studies have indicated that lncRNAs were predominantly enriched in the nucleus 7. However, an increasing number of cytoplasmic lncRNAs are now being unearthed 21. All lncRNAs are transcribed in the nucleus and so are present there at some level. An area of active study is to understand which lncRNAs make it out the of nucleus and why. Enrichment of a lncRNA in either the nucleus or the cytoplasm does not preclude it from operating in both compartments. In fact, it is possible for lncRNAs to move between the two compartments in response to signals (e.g. UCHL1‐AS 22).
Nuclear retention of lncRNAs
Many lncRNAs are enriched in the nucleus and a considerable number have been found to function specifically in the nucleus. Within the nucleus, many lncRNAs are specifically localised to nuclear sub‐compartments (Figure 4, Table 1), which argues against them being transcriptional by‐products. In fact, lncRNAs have been found to occupy the nucleolus (PAPAS), nuclear matrix (XIST), nuclear speckles (MALAT1), and nuclear paraspeckles (NEAT1) (Figure 4). Targeting of lncRNAs to such specific locations can be the result of nuclear localisation signals within the lncRNA itself (e.g. BORG and MALAT1 23, 24).
Analysis of nuclear lncRNAs sequences has identified specific motifs enriched in lncRNA correlating with nuclear fate 25, 26. For instance, a 57 nt motif is repeated 18 times in Xist, and a 15 nt C‐rich element is found in 21 different lncRNAs, including MALAT1. Essentially, the presence of several of these motifs in an lncRNA is more likely to facilitate its nuclear retention. Several lncRNA sequence elements, which have evolved from transposable elements (TEs), have also been characterised as sufficient for nuclear localisation (e.g. L2B, MIRb, and MIRc TE 27). Similarly, a less‐stringent analysis revealed the involvement of Alu repeat motifs in nuclear retention of lncRNAs 28. The location and number of these elements as well as flanking sequences have been shown to contribute to the strength of nuclear enrichment.
RNA splicing is important to the export of mRNAs from the nucleus. Unsurprisingly, inefficient splicing of lncRNAs has been linked to their nuclear localisation (e.g. A‐ROD lncRNA), and intron retention contributes to restricting of lncRNAs to the nucleus 29. Weak interactions of lncRNAs with splicing factors (e.g. MALAT‐1 with SRm160 23) is also thought to contribute to nuclear retention. The U1 snRNP complex keeps lncRNAs in the nucleus in two ways 30, 31. LncRNA exons and introns contain U1 recognition sites, which U1 snRNP (small nuclear ribonucleoprotein) and associated factors have been shown to be recruited to and antagonise cytoplasmic export 31. These sequences have been found in nuclear retention elements. It has also been speculated that U1 spliceosome association with U1 recognition sites then contributes to inefficient splicing and remaining bound to chromatin. As a result, polyadenylation of these transcripts is impeded, allowing for lncRNA turnover 30.
Those lncRNAs that do not interact with the mRNA export machinery (e.g. Aly/REF) are more likely to be enriched in the nucleus 32. Xist lncRNA is retained within the nucleus partly as a result of its weak interactions with RNA export proteins such as Nxf1 33. Export of some lncRNAs with limited splicing (e.g. NORAD) is facilitated by splicing‐independent export machinery such as TPR 34.
LncRNAs can be tethered to the nuclear matrix or nuclear (para)speckles (membrane‐less organelles) by adapter proteins. For example, CIZ1 and hnRNPU, which can interact with chromatin, facilitate Xist lncRNA anchoring to the nuclear periphery (Figure 4) 35, 36 and similarly with Bloodlinc 37.
Function of nuclear lncRNAs
As mentioned previously, lncRNAs can interact with DNA, RNA, and proteins. Nuclear lncRNAs tend to engage with DNA and protein (Figure 3) to regulate gene expression in a variety of ways (Figure 5). For instance, lncRNA PAPAS binds both DNA and protein to elicit its function. PAPAS interacts with the CHD4/NuRD (nucleosome remodelling and deacetylation) complex and can engage in RNA–DNA triplex formation to ‘scout’ for its DNA recognition element, effectively guiding chromatin remodellers to sites necessary for gene‐expression regulation 38 (Figure 5A). lncRNA MEG3 interacts with DNA to guide chromatin remodellers, interacts with itself (lncRNA:lncRNA) 39, and acts as a co‐activator of p53, partially regulating p53 target genes 40 (Figure 5B, C). Although lncRNA Jpx competes with DNA for CTCF protein docking, the action of titrating CTCF away from one of the two X chromosomes in mice contributes to X chromosome inactivation (Figure 5D).
Cytoplasmic lncRNAs
Export of lncRNAs to the cytoplasm is usually indicative of a specific function, given that some level of activity is required. A recent study in the human chronic myelogenous leukaemia cell line (K562) revealed that 54% of expressed lncRNAs are detected in the cytoplasm 27. The majority of 5′‐capped, spliced, and polyadenylated lncRNAs ‘get the green light’ to reach the cytoplasm. Methylation of adenosine (N6‐Methyladenosine, m6A) in lncRNAs, like mRNAs, stimulates their export to the cytoplasm. For mRNAs, this is mediated by YTHDC1, which binds to the modified base and helps recruit nuclear export factors (41 and references therein). Although there is no direct evidence, this is probably also the case for lncRNAs. Although the presence of some specific TEs promote nuclear lncRNA retention, some classes of TEs (e.g. the endogenous retrovirus class ERVL‐MaLR) are enriched in cytoplasmic lncRNAs 27. This suggests that there are specific sequence determinants of lncRNAs being localised to the cytoplasm. Of interest, lncRNA localisation to the cytoplasm can be affected by external cues and, therefore, can change. For example, UCHL1‐AS shifts from the nucleus to the cytoplasm upon stress induction with rapamycin in mouse dopaminergic MN9D cells 22.
As in the nucleus, cytoplasmic lncRNAs can form ribonucleoprotein complexes by binding mRNAs and proteins, or by competing with mRNAs for binding to a specific protein (Figure 6A). lnc MyoD, which plays a key role in the regulation of myogenesis, is one such an example. It interacts strongly with IGF2 mRNA‐binding proteins (IMPs), and negatively regulates IMP2‐mediated translation of proliferation genes such as N‐Ras and c‐Myc by antagonising those mRNAs for IMP binding, thus promoting cell cycle exit and differentiation 42.
Some lncRNAs harbour miRNA‐binding sites and by having multiple sites can act as ‘molecular sponges’, sequestering miRNAs and therefore protecting mRNAs from miRNA‐mediated degradation. These lncRNAs are termed competing endogenous RNAs (ceRNAs) 43. One such molecular sponge is the lncRNA regulator of reprogramming (linc‐RoR), which interacts with miR‐145 (Figure 6B).
As already mentioned, many lncRNAs are capped and polyadenylated 7, 17. If these lncRNAs are present in the cytoplasm, they can interact with ribosomes (Figure 6C) and be translated (Figure 6D). In fact, a number of studies in human, mouse, fly, and yeast have detected lncRNAs in ribosome‐bound complexes by ribosome profiling 44, 45, 46. This RNA‐Seq–based method, which detects translation events, has revealed that small open reading frames (smORFs, <100 aa) present in some lncRNAs are actually translated. Given that many cytoplasmic lncRNAs possess the same molecular characteristics as mRNAs, this should not be that surprising. Should these translated lncRNAs be re‐classified as mRNAs? Because lncRNAs by their very nature lack long open reading frames, these translation events generally can produce only small peptides. These lncRNA–ribosome interactions remain controversial, as they could represent non‐specific interactions rather than bonafide translation events.
In addition to lncRNAs being actively engaged by ribosomes, lncRNAs can also associate with mRNAs during translation 22 (Figure 6D). Several lncRNAs are associated with large translation complexes (polysomes), but they themselves are not translated. These polysome‐associated lncRNAs tend to exhibit widespread expression patterns across different human tissues. They are also more likely to be enriched in the cytoplasm compared to free cytoplasmic lncRNAs 27. This suggests that these polysome‐associated lncRNAs are more likely to be functional than those present just in the cytosol.
Cytoplasmic lncRNAs may localise to specific organelles or cytoplasmic structures. Notably, some cytoplasmic lncRNAs are encoded by mitochondrial DNA, and are therefore found and operate in the mitochondria 18, 47. lncRNAs participate in the formation of P‐bodies and extracellular vesicles (EVs), the latter of which are secreted from one cell and received by another, acting as cellular messengers (Figure 5E). It has been suggested that lncRNA molecules with relatively low expression levels (e.g. lincRNA‐p21, HOTAIR) are highly enriched in exosomes (Table 1) 48. The level of secretion of lncRNAs is critical for cell homeostasis, as it has been shown that lncRNA exosome levels reflect the cellular response to DNA damage 48. LincRNA‐p21 transcript levels in exosomes isolated from urine samples of prostate cancer patients appear to be significantly elevated compared to those of patients with benign prostatic hyperplasia 49. Therefore, LincRNA‐p21 could serve as a biomarker for the differential diagnosis between prostate cancer and benign prostatic hyperplasia.
LncRNA roles in development and differentiation
LncRNAs have been found not only to be spatially restricted but also temporally, that is, at certain stages in development. Specific expression of a lncRNA during development suggests that it may have an important function at that time. Several such lncRNAs have been characterised. Probably the most well‐known lncRNA is XIST (Table 1), a nuclear‐retained lncRNA that is 5′ capped, polyadenylated, and alternatively spliced (~19.2 kb in human) 50. XIST is important during early development, functioning in X chromosome inactivation (XCI) as part of dosage compensation. Mutations in mouse Xist result in an inability to undergo XCI and embryonic lethality 51, 52.
To ensure an equal dosage of X‐linked genes between females and males, one of a female's two X chromosomes is inactivated. Post‐differentiation, Xist is expressed specifically in females. In humans, XIST is lowly expressed from day 4 shortly after zygote formation, increasing at days 6 and 7 post‐fertilisation with random XCI taking place on day 7 53. The precise timing of XCI differs in diverse placental mammals 54, as does Xist's precise temporal expression. Studies in human and mouse stem cells suggest that Xist recruits protein partners, forming several lncRNA–protein complexes and covering one X chromosome to inactivate it. Xist recruits SPEN to initiate gene silencing 55, 56, 57, 58, and then, via hnRNPK, PRC1, and PRC2 complexes, to deposit repressive chromatin marks to establish and maintain gene silencing (Figure 5A, 59). hnRNPU and CIZ1 tether Xist to the nuclear periphery (Figure 4) 35, 36, 60, 61.
The establishment of the three germ layers (lineage specification) from the embryonic epiblast is a critical point in development. lncRNAs have been found to control the direction of stem cell differentiation and hence contribute to cell identity decisions. For instance, yylncT lncRNA has been detected in the nucleus of day 2 human embryonic stem cells (hESCs) following mesoderm specification 62. Disruption of yylncT transcription or its depletion in differentiating hESCs leads to decreased levels of the transcription factor Brachyury, a master regulator of mesoderm specification, resulting in increased apoptosis and downregulation of key mesoderm driver genes. yylncT lncRNA safeguards both its own and the Brachyury locus from genome‐wide methylation in response to differentiation signals 62.
Cytoplasmic lncRNAs are also important in pluripotency and differentiation by interacting with key pluripotency factors. Linc‐RoR has been shown to crosstalk with some of these factors, namely OCT4, SOX2, and NANOG, specifically in self‐renewing hESCs. In fact, linc‐RoR levels are markedly reduced in differentiated hESCs, implying a key role in pluripotency 63. Linc‐RoR can act as a ceRNA to sequester miR‐145, which negatively modulates OCT4, SOX2, and NANOG mRNA levels (Figure 6B). Essentially, linc‐RoR can block exit from pluripotency via an miR‐145‐mediated OCT4 downregulation pathway in the cytoplasm. Linc‐RoR also deregulates the pluripotency transcription factor SOX9 through competition with miR‐15b, miR‐33a, miR‐129, miR‐145, and miR‐206, in human oesophageal squamous cell carcinoma cells and patients' samples, resulting in Sox9 stabilisation and promotion of oncogenesis 64. Upregulation of linc‐RoR has been linked to the increased occurrence of cancers such as triple negative breast cancer 65, and endometrial 66, nasopharyngeal 67, and liver cancers 68.
Determination of the anterior–posterior axis is mediated by the coordinated expression of Hox genes. Along with Hox protein‐coding genes, Hox lncRNAs have also been shown to participate in regulating this process. HOTTIP is a Hox lncRNA that is functionally conserved across developing mice, chick embryos, and humans. It modulates precise spatial and temporal expression patterns of Hox genes, ensuring proper embryonic development. Chick embryos with reduced HOTTIP expression exhibit defects in limb development 69. HOTTIP lncRNA can recruit histone methyltransferase complexes to deposit gene activation chromatin marks (i.e. H3K4me3) on the 5′ Hox‐A locus (Figure 5). These examples illustrate how important lncRNAs are to development, acting as gatekeepers for cell viability or cell fate specification.
LncRNAs in cancer
It is evident that lncRNAs play decisive regulatory roles at crucial checkpoints between proliferation and differentiation. When this regulation is disrupted, disease is the inevitable consequence. Although early work uncovered lncRNAs associated with cancer, there are now functionally characterised lncRNAs whose contribution to cancer progression and phenotype has been elucidated at a more mechanistic level.
NBAT‐1 is a well‐studied lncRNA with a defined role in cancer, the mechanism of action for which has recently been elucidated. Like many lncRNAs, NBAT‐1 exhibits tissue specificity and is expressed mainly in the brain, breast, and ovary. Notably, some regions of NBAT‐1 have substantial sequence conservation across mammals, suggesting an evolutionarily conserved function 70. Cancer patients with high NBAT‐1 expression have been associated with good prognosis, whereas patients with low NBAT‐1 expression have been associated with poor prognosis 70. NBAT‐1 contains a high‐risk associated single nucleotide polymorphism (SNP) within its intron, which contributes to regulating its expression levels (A/A genotype higher NBAT‐1 expression, G/G genotype lower NBAT‐1 expression). In high‐risk patients, DNA methylation functions to inactivate the NBAT‐1 promoter, leading to downregulation of expression and consequently to cell proliferation and invasion. The SNP and NBAT‐1 promoter may together be involved in higher order regulatory interactions.
Suppression of NBAT‐1 by knockdown in the SH‐SY5Y human neuroblastoma cell line results in increased cancer cell viability and invasiveness, whereas overexpression leads to decreased cell proliferation and invasion. Furthermore, mouse xenografts developed from NBAT‐1–depleted cells showed a substantial increase in growth rate. RNA‐seq of NBAT‐1 knockdown in SH‐SY5Y identified several genes associated with cell proliferation and migration that are regulated by NBAT‐1. Among these are VCAN, SOX9, and OSMR, which contribute to cancer progression 71, 72. NBAT‐1 lncRNA interacts with the PRC2 complex member EZH2 to suppress target genes implicated in cell proliferation and cell migration via chromatin level regulation 70. Taken together, these findings suggest that NBAT‐1 is a ‘protector’ against neuroblastoma (Figure 7A).
On the other hand, some lncRNAs that are misregulated in cancer act to promote tumour growth and resistance to chemotherapy. One such example is LINK‐A (LINC01139; long intergenic non‐coding RNA for kinase activation). The LINK‐A locus is amplified in multiple cancer types, and high expression levels of LINK‐A are correlated with poor prognosis in breast cancer patients (Figure 7B) 73. Of interest, LINK‐A is cytoplasmic in adenocarcinoma lung cells and SK.N.SH cells, but nuclear in HeLa and NHEK cells 21. This suggests that it shuttles between the nucleus and cytoplasm. LINK‐A belongs to the category of lipid‐binding lncRNAs, and specifically binds to phosphatidylcholine, AKT, and PIP3, resulting in AKT hyperactivation that leads to tumorigenesis and resistance to AKT inhibitors 73. LINK‐A interacts with PIP3 through a 60 nt RNA oligonucleotide motif (nt 1081–1140), containing a stem‐loop structure essential for this interaction. Furthermore, LINK‐A–PIP3 interaction confers resistance to anti‐tumour drugs such as perifosine and MK2206.
Mechanisms of action at the molecular level and association with cancer have been studied for a substantial number of lncRNAs so far. There seem to be two distinct patterns of action: they can either inhibit cancer cell proliferation and invasion, or promote tumour growth and metastasis.
LncRNAs and neurodegeneration
About 40% of human lncRNAs (equivalent to 4000‐20 000 lncRNA genes) are specifically expressed in the brain 7, where they display an exquisite spatiotemporal expression profile as well as cellular localisation variability 74. lncRNAs are involved in key regulatory mechanisms during various stages of neurogenesis and synaptic plasticity. Therefore, it is not surprising that the misregulation of several lncRNAs contributes to the development of neurodegenerative diseases.
Primate‐specific brain cytoplasmic RNA BC200 (or BCYRN1) and its rodent orthologue (BC1) are cytoplasmic anti‐sense lncRNAs highly expressed in the brain (Table 1) and have been implicated in Alzheimer's disease (AD). They both regulate mRNA translation, selectively target to somatodendritic domains of human neurons, and contribute to the maintenance of long‐term synaptic plasticity 75. Specifically, they have been reported to repress translation initiation in dendrites 76 (Figure 7C). BC200 exhibits a diffuse localisation in dendritic domains of neuropils and its levels are reduced during ageing. In AD, BC200 is upregulated in specific areas of the neocortex, where it forms characteristic clusters mislocalised in cell soma rather than in the synapse 75. It seems to disrupt mRNA localisation to the synapse, which is required for proper synapse function 77. Levels of BC200 across different parts of the brain correlate with the severity of the disease in these brain regions 75, further suggesting an important role.
The recently characterised lncRNA UCHL1‐AS has been associated with Parkinson's disease and exhibits a dynamic subcellular localisation (Table 1). UCHL1‐AS is in the anti‐sense orientation to UCHL1, which produces a neuron‐restricted protein, essential for development. UCHL1‐AS is conserved between mouse and human. It is highly expressed in the murine ventral midbrain and in mouse MN9D dopaminergic cells 22. Normally UCHL1‐AS accumulates in the nucleus of dopaminergic neurons. Upon cellular stress induced by rapamycin (inhibitor of cap‐dependent translation), UCHL1‐AS translocates to the cytoplasm where it promotes translation of UCHL1 mRNA by increasing its association with heavy polysomes 22. UCHL1‐AS is downregulated in a mouse neurochemical model of Parkinson's disease 78.
Huntington's disease (HD) is caused by a CAG trinucleotide repeat expansion in exon 1 of the huntingtin gene, HTT 79. Several well‐studied lncRNAs, such as NEAT1 and MEG3, have been associated with HD 80. A recently discovered anti‐sense lncRNA, HTT‐AS, regulates the expression of HTT 79. HTT‐AS is 5′ capped, polyadenylated, and can be alternatively spliced 79. HTT‐AS levels are reduced by 50% in HD brains compared to controls, and it negatively regulates HTT expression. Knockdown of HTT‐AS leads to a 20% increase of HTT, whereas overexpression of HTT‐AS decreases endogenous HTT expression by 25% 79 (Figure 7D).
These three lncRNAs highlight how lncRNA misregulation is implicated in neurodegeneration. Each of them acts at a different level of gene expression regulation, a key feature of lncRNAs in general.
LncRNAs as biomarkers and therapeutic targets
Despite the small number of functionally characterised lncRNAs, their potential as biomarkers or therapeutic agents is quickly being realised. Following the huge progress in elucidating the mechanisms of action and the association of lncRNAs with disease, the scientific community is now keen to investigate the possibility of using lncRNAs as prognostic or diagnostic biomarkers. This approach is based on the initial observations that the levels of certain lncRNAs in blood samples correlates with the occurrence of a disease. In a preclinical setting, lncRNAs have been suggested as potential diagnostic or prognostic biomarkers affecting a variety of tissues including the heart (MALAT1), reproductive system (H19), muscle (linc‐MD1), and several cancer types (MALAT1, H19, MEG3, HOTAIR) (reviewed in 81, 82).
LncRNAs are ideal biomarker candidates because of their high tissue specificity. This may enable the detection of metastasis, or perhaps help alleviate the complexity imposed by cancer heterogeneity 81. As mentioned, lncRNAs can be encapsulated in EVs and so have the potential to end up in the blood stream, also making them ideal biomarkers 83. The temporal restriction of some lncRNAs could add another layer to biomarker specificity, or even allow for the tracking of disease progression. Several lncRNAs have already been explored for these reasons, but none have reached clinical trials yet.
GAS5 lncRNA is being currently studied at a pre‐clinical level as a potential biomarker for type 2 diabetes (T2D) and coronary artery disease (CAD). GAS5 is readily detected in human serum and its levels are correlated to the prevalence/onset/appearance of T2D. Specifically, GAS5 is significantly downregulated in diabetic patients compared to healthy controls 84, and significantly lower in patients with CAD compared to patients with diabetes mellitus 85.
NBAT1 also has the potential to serve as a prognostic/diagnostic marker for different types of cancer. Although it is not being tested in clinical trials yet, there is substantial pre‐clinical evidence from patient samples. It is differentially expressed in transcriptomic profiling of 15 neuroblastomas of different types 70. NBAT‐1 can be utilised for differential diagnosis, because patients with high NBAT‐1 are associated with good prognosis and vice versa 70. NBAT‐1 levels are considerably lower in bladder cancer samples compared to healthy controls 86.
SNGH1 could represent a novel lncRNA biomarker of tumorigenesis for a number of tissue biotypes including brain, breast, lung, liver, and ovary 87. SNGH1 lncRNA can be detected at a higher level in tissues from patients with colorectal carcinoma compared to healthy individuals. It is important to note that high levels correlate with tumour malignancy, reflected in poorer patient survival compared to those with lower SNGH1 levels. In cell lines, enhanced apoptosis and diminished cell proliferation were observed when SNGH1 was decreased, suggesting it might represent a therapeutic target as well as acting as a potential biomarker.
RNA therapeutics is a newly emerging but rapidly expanding field. Most approaches so far have included targeting an offending RNA molecule with small molecule drugs or using anti‐sense oligos to specifically interfere with RNA function. Using RNA molecules as drugs is also being explored by industry. Targeting the biogenesis of lncRNAs is currently being pursued in cancer cell lines 88, 89. This approach has been employed previously with miRNAs (reviewed in 90).
LncRNAs themselves could act as therapeutic molecules. One approach has been described whereby lncRNAs harbouring tuneable sequence elements (known as SINE‐UPs) can be tailored to target mRNAs and specifically increase their translation. This highly customisable approach could be adapted for any disease where physiological protein levels are disrupted in a cell, such as Friedreich's ataxia 91. These elements on specific lncRNAs can also be multiplexed to target a series of transcripts in parallel, achieving multiple target regulation.
Therapeutic strategies to modulate lncRNA activity have now made it to clinical trials. Abivax has developed a small molecule that has entered phase 2 clinical trials for ulcerative colitis and moderate to severe stage rheumatoid arthritis patients 92, 93, 94. It increases splicing of lncRNA 0599‐205, upregulating production of miR‐124 due to this lncRNA harbouring one of three miR‐124 loci in the genome 95. miR‐124 has been shown previously to reduce pro‐inflammatory cytokines and also regulate innate and adaptive immunity. Thus, by increasing the levels of spliced lncRNA 0599‐205, miR‐124 production is increased and inflammation in arthritis is reduced 95.
It is too early to speculate on whether lncRNA therapeutics will be widely applicable. Nonetheless, there is growing interest in developing and adapting lncRNAs from a preclinical to a clinical setting. The more we understand the function of lncRNAs and how they contribute to disease, the more they can be exploited as therapeutic targets.
Concluding remarks
There are more than 16 000 lncRNA genes in the human genome, but we currently understand the function of only ~50. Our improved appreciation of lncRNAs is changing the way we think about our genomes and genetic diseases. The simplistic view that lncRNAs remain in the nucleus to regulate transcription has been expanded in the last 5 years to appreciate the various ways both cytoplasmic and nuclear lncRNAs can act. In fact, their movement between cell compartments may be dynamic, and their roles can vary depending on their cellular location.
The low abundance and highly specific expression patterns of lncRNAs have made studying them difficult. Function has also been incorrectly assigned to lncRNAs based on CRISPR experiments, and now siRNAs are the preferred method to downregulate lncRNAs without creating genomic alterations, which may be lncRNA independent. Not all lncRNAs possess a function and determining which do and which do not is a substantial body of research. Conservation analysis will contribute to this understanding and identify model systems in which research can be performed. Those lncRNAs whose function has been determined have been found to contribute to development, neural function, and cancer (Figure 7).
We have also made significant progress from simple disease association to mechanistic insights to reveal how lncRNAs contribute to disease phenotypes. This in turn allows lncRNAs to be used as therapeutic targets as well as biomarkers. Given the number of human lncRNAs, they provide a vast screening space whose potential has yet to be fully realised. Further work to precisely map mutations to genetic diseases will no doubt uncover many more lncRNAs whose disruption gives rise to disease. Studies such as the 100 000 Genomes Project will be key to uncovering precise links with pathology.
Author contributions statement
IT, KD, IB, and JA were all involved in writing the review and had final approval of the submitted and published versions.
Glossary of terms
Homology: similarity of a nucleotide or amino acid sequence between species. In cis: function of a transcript at or very close to the site of its transcription. In trans: function of a transcript away from the site of its transcription. Orthologues: homologous genes found in different species following a speciation event, for which gene sequences and main function are conserved. Ribonucleoprotein complexes (RNPs): complexes formed by RNAs interacting with RNA‐binding proteins. Ribosome profiling/Ribo‐Seq: high‐throughput sequencing of small RNA fragments‐protected by ribosomes. Synteny: physical co‐localisation of genetic loci on the same chromosome within an individual or species. ENCODE project: ENCyclopedia Of DNA Elements project is a public research project aiming to identify functional elements in the human genome. m 7 G cap: methyl guanosine nucleotide added at 5′ end of mRNA to protect it from degradation and facilitate ribosome binding during translation. CNS: central nervous system.
Abbreviations
AD, Alzheimer's disease; CAD, coronary artery disease; ceRNA, competing endogenous RNA; EV, extracellular vesicle; HD, Huntington's disease; hESC, human embryonic stem cell; IMP, IGF2 mRNA‐binding protein; lincRNA, long intergenic non‐coding RNA; lncRNA, long non‐coding RNA; m6A, N6‐Methyladenosine; nt, nucleotide; RBP, RNA‐binding protein; smORF, small open reading frame; snRNP, small nuclear ribonucleoprotein; SNP, single nucleotide polymorphism; T2D, type 2 diabetes; TE, transposable element; UTR, untranslated region; XCI, X chromosome inactivation.
Acknowledgements
IT is funded from BBSRC DTP BB/M011151/1. KD is funded by a Leeds Anniversary Research Scholarship (LARS). IB is funded by Faculty of Biological Sciences and JA's University of Leeds start‐up fund. JA is funded by the University of Leeds (University Academic Fellow scheme), MRC (MR/N000471/1), and BBSRC (BB/S007407/1).
No conflicts of interest were declared.
References
- 1. Brannan CI, Dees EC, Ingram RS, et al. The product of the H19 gene may function as an RNA. Mol Cell Biol 1990; 10 : 28–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Brown CJ, Ballabio A, Rupert JL, et al. A gene from the region of the human X inactivation centre is expressed exclusively from the inactive X chromosome. Nature 1991; 349 : 38–44. [DOI] [PubMed] [Google Scholar]
- 3. Frankish A, Diekhans M, Ferreira AM, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res 2019; 47 : D766–D773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Consortium GT. The genotype‐tissue expression (GTEx) project. Nat Genet 2013; 45 : 580–585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Lizio M, Harshbarger J, Shimoji H, et al. Gateways to the FANTOM5 promoter level mammalian expression atlas. Genome Biol 2015; 16 : 22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature 2012; 489 : 57–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Derrien T, Johnson R, Bussotti G, et al. The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res 2012; 22 : 1775–1789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Goudarzi M, Berg K, Pieper LM, et al. Individual long non‐coding RNAs have no overt functions in zebrafish embryogenesis, viability and fertility. Elife 2019; 8: pii: e40815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Young RS, Marques AC, Tibbit C, et al. Identification and properties of 1,119 candidate lincRNA loci in the Drosophila melanogaster genome. Genome Biol Evol 2012; 4 : 427–442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Marques AC, Ponting CP. Catalogues of mammalian long noncoding RNAs: modest conservation and incompleteness. Genome Biol 2009; 10 : R124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Haerty W, Ponting CP. Mutations within lncRNAs are effectively selected against in fruitfly but not in human. Genome Biol 2013; 14 : R49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Noviello TMR, Di Liddo A, Ventola GM, et al. Detection of long non‐coding RNA homology, a comparative study on alignment and alignment‐free metrics. BMC Bioinformatics 2018; 19 : 407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Hezroni H, Koppstein D, Schwartz MG, et al. Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep 2015; 11 : 1110–1122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Rom A, Melamed L, Gil N, et al. Regulation of CHD2 expression by the Chaserr long noncoding RNA gene is essential for viability. Nat Commun 2019; 10 : 5092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Lamar KJ, Carvill GL. Chromatin remodeling proteins in epilepsy: lessons from CHD2‐associated epilepsy. Front Mol Neurosci 2018; 11 : 208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Fang Q, Sang L, Du S. Long noncoding RNA LINC00261 regulates endometrial carcinoma progression by modulating miRNA/FOXO1 expression. Cell Biochem Funct 2018; 36 : 323–330. [DOI] [PubMed] [Google Scholar]
- 17. Cabili MN, Trapnell C, Goff L, et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev 2011; 25 : 1915–1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Sirey TM, Roberts K, Haerty W, et al. The long non‐coding RNA Cerox1 is a post transcriptional regulator of mitochondrial complex I catalytic activity. Elife 2019; 8: pii: e45051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Necsulea A, Soumillon M, Warnefors M, et al. The evolution of lncRNA repertoires and expression patterns in tetrapods. Nature 2014; 505 : 635–640. [DOI] [PubMed] [Google Scholar]
- 20. Hezroni H, Ben‐Tov Perry R, Meir Z, et al. A subset of conserved mammalian long non‐coding RNAs are fossils of ancestral protein‐coding genes. Genome Biol 2017; 18: 162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Mas‐Ponte D, Carlevaro‐Fita J, Palumbo E, et al. LncATLAS database for subcellular localization of long noncoding RNAs. RNA 2017; 23 : 1080–1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Carrieri C, Cimatti L, Biagioli M, et al. Long non‐coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature 2012; 491 : 454–457. [DOI] [PubMed] [Google Scholar]
- 23. Miyagawa R, Tano K, Mizuno R, et al. Identification of cis‐ and trans‐acting factors involved in the localization of MALAT‐1 noncoding RNA to nuclear speckles. RNA 2012; 18 : 738–751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Zhang B, Gunawardane L, Niazi F, et al. A novel RNA motif mediates the strict nuclear localization of a long noncoding RNA. Mol Cell Biol 2014; 34 : 2318–2329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Shukla CJ, McCorkindale AL, Gerhardinger C, et al. High‐throughput identification of RNA nuclear enrichment sequences. EMBO J 2018; 37 : e98452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Kirk JM, Kim SO, Inoue K, et al. Functional classification of long non‐coding RNAs by k‐mer content. Nat Genet 2018; 50 : 1474–1482. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Carlevaro‐Fita J, Rahim A, Guigo R, et al. Cytoplasmic long noncoding RNAs are frequently bound to and degraded at ribosomes in human cells. RNA 2016; 22 : 867–882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Lubelsky Y, Ulitsky I. Sequences enriched in Alu repeats drive nuclear localization of long RNAs in human cells. Nature 2018; 555 : 107–111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Ntini E, Louloupi A, Liz J, et al. Long ncRNA A‐ROD activates its target gene DKK1 at its release from chromatin. Nat Commun 2018; 9 : 1636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Yin Y, Yuyang Lu J, Zhang X, et al. U1 snRNP regulates chromatin retention of noncoding RNAs. bioRxiv 2018: 310433. 10.1101/310433 not peer reviewed. [DOI] [PubMed] [Google Scholar]
- 31. Azam S, Hou S, Zhu B, et al. Nuclear retention element recruits U1 snRNP components to restrain spliced lncRNAs in the nucleus. RNA Biol 2019; 16 : 1001–1009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Zhou Z, Luo MJ, Straesser K, et al. The protein Aly links pre‐messenger‐RNA splicing to nuclear export in metazoans. Nature 2000; 407 : 401–405. [DOI] [PubMed] [Google Scholar]
- 33. Cohen HR, Panning B. XIST RNA exhibits nuclear retention and exhibits reduced association with the export factor TAP/NXF1. Chromosoma 2007; 116 : 373–383. [DOI] [PubMed] [Google Scholar]
- 34. Lee ES, Wolf EJ, Smith HW, et al. TPR is required for the nuclear export of mRNAs and lncRNAs from intronless and intron‐poor genes. bioRxiv 2019: 740498. 10.1101/740498 not peer reviewed. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Ridings‐Figueroa R, Stewart ER, Nesterova TB, et al. The nuclear matrix protein CIZ1 facilitates localization of Xist RNA to the inactive X‐chromosome territory. Genes Dev 2017; 31 : 876–888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Sunwoo H, Colognori D, Froberg JE, et al. Repeat E anchors Xist RNA to the inactive X chromosomal compartment through CDKN1A‐interacting protein (CIZ1). Proc Natl Acad Sci U S A 2017; 114 : 10654–10659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Alvarez‐Dominguez JR, Knoll M, Gromatzky AA, et al. The super‐enhancer‐derived alncRNA‐EC7/Bloodlinc potentiates red blood cell development in trans. Cell Rep 2017; 19 : 2503–2514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Zhao Z, Senturk N, Song C, et al. lncRNA PAPAS tethered to the rDNA enhancer recruits hypophosphorylated CHD4/NuRD to repress rRNA synthesis at elevated temperatures. Genes Dev 2018; 32 : 836–848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Uroda T, Anastasakou E, Rossi A, et al. Conserved pseudoknots in lncRNA MEG3 are essential for stimulation of the p53 pathway. Mol Cell 2019; 75 : 982–995.e989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Zhu J, Liu S, Ye F, et al. Long noncoding RNA MEG3 interacts with p53 protein and regulates partial p53 target genes in hepatoma cells. PLoS One 2015; 10 : e0139790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Roundtree IA, Luo GZ, Zhang Z, et al. YTHDC1 mediates nuclear export of N(6)‐methyladenosine methylated mRNAs. Elife 2017; 6: pii: e31311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Gong C, Li Z, Ramanujan K, et al. A long non‐coding RNA, LncMyoD, regulates skeletal muscle differentiation by blocking IMP2‐mediated mRNA translation. Dev Cell 2015; 34 : 181–191. [DOI] [PubMed] [Google Scholar]
- 43. Salmena L, Poliseno L, Tay Y, et al. A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language? Cell 2011; 146 : 353–358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Ingolia NT, Ghaemmaghami S, Newman JR, et al. Genome‐wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 2009; 324 : 218–223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Aspden JL, Eyre‐Walker YC, Phillips RJ, et al. Extensive translation of small Open Reading Frames revealed by Poly‐Ribo‐Seq. Elife 2014; 3 : e03528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Duncan C, Mata J. Ribosome profiling for the analysis of translation during yeast meiosis. Methods Mol Biol 2017; 1471 : 99–122. [DOI] [PubMed] [Google Scholar]
- 47. Mercer TR, Neph S, Dinger ME, et al. The human mitochondrial transcriptome. Cell 2011; 146 : 645–658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Gezer U, Özgür E, Cetinkaya M, et al. Long non‐coding RNAs with low expression levels in cells are enriched in secreted exosomes. Cell Biol Int 2014; 38 : 1076–1079. [DOI] [PubMed] [Google Scholar]
- 49. Işın M, Uysaler E, Özgür E, et al. Exosomal lncRNA‐p21 levels may help to distinguish prostate cancer from benign disease. Front Genet 2015; 6 : 168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Brown CJ, Hendrich BD, Rupert JL, et al. The human XIST gene: analysis of a 17 kb inactive X‐specific RNA that contains conserved repeats and is highly localized within the nucleus. Cell 1992; 71 : 527–542. [DOI] [PubMed] [Google Scholar]
- 51. Penny GD, Kay GF, Sheardown SA, et al. Requirement for Xist in X chromosome inactivation. Nature 1996; 379 : 131–137. [DOI] [PubMed] [Google Scholar]
- 52. Marahrens Y, Panning B, Dausman J, et al. Xist‐deficient mice are defective in dosage compensation but not spermatogenesis. Genes Dev 1997; 11 : 156–166. [DOI] [PubMed] [Google Scholar]
- 53. Petropoulos S, Edsgard D, Reinius B, et al. Single‐cell RNA‐Seq reveals lineage and X chromosome dynamics in human preimplantation embryos. Cell 2016; 165 : 1012–1026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Vallot C, Ouimette JF, Rougeulle C. Establishment of X chromosome inactivation and epigenomic features of the inactive X depend on cellular contexts. Bioessays 2016; 38 : 869–880. [DOI] [PubMed] [Google Scholar]
- 55. Monfort A, Di Minin G, Postlmayr A, et al. Identification of Spen as a crucial factor for Xist function through forward genetic screening in haploid embryonic stem cells. Cell Rep 2015; 12 : 554–561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. McHugh CA, Chen CK, Chow A, et al. The Xist lncRNA interacts directly with SHARP to silence transcription through HDAC3. Nature 2015; 521 : 232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Moindrot B, Cerase A, Coker H, et al. A pooled shRNA screen identifies Rbm15, Spen, and Wtap as factors required for Xist RNA‐mediated silencing. Cell Rep 2015; 12 : 562–572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Minajigi A, Froberg JE, Wei CY, et al. A comprehensive Xist interactome reveals cohesin repulsion and an RNA‐directed chromosome conformation. Science 2015; 349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Almeida M, Pintacuda G, Masui O, et al. PCGF3/5‐PRC1 initiates Polycomb recruitment in X chromosome inactivation. Science 2017; 356 : 1081–1084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Hasegawa Y, Brockdorff N, Kawano S, et al. The matrix protein hnRNP U is required for chromosomal localization of Xist RNA. Dev Cell 2010; 19 : 469–476. [DOI] [PubMed] [Google Scholar]
- 61. Sakaguchi T, Hasegawa Y, Brockdorff N, et al. Control of chromosomal localization of Xist by hnRNP U family molecules. Dev Cell 2016; 39 : 11–12. [DOI] [PubMed] [Google Scholar]
- 62. Frank S, Ahuja G, Bartsch D, et al. yylncT defines a class of divergently transcribed lncRNAs and safeguards the T‐mediated mesodermal commitment of human PSCs. Cell Stem Cell 2019; 24 : 318–327 e318. [DOI] [PubMed] [Google Scholar]
- 63. Wang Y, Xu Z, Jiang J, et al. Endogenous miRNA sponge lincRNA‐RoR regulates Oct4, Nanog, and Sox2 in human embryonic stem cell self‐renewal. Dev Cell 2013; 25 : 69–80. [DOI] [PubMed] [Google Scholar]
- 64. Wang L, Yu X, Zhang Z, et al. Linc‐ROR promotes esophageal squamous cell carcinoma progression through the derepression of SOX9. J Exp Clin Cancer Res 2017; 36 : 182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Eades G, Wolfson B, Zhang Y, et al. lincRNA‐RoR and miR‐145 regulate invasion in triple‐negative breast cancer via targeting ARF6. Mol Cancer Res 2015; 13 : 330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Zhou X, Gao Q, Wang J, et al. Linc‐RNA‐RoR acts as a “sponge” against mediation of the differentiation of endometrial cancer stem cells by microRNA‐145. Gynecol Oncol 2014; 133 : 333–339. [DOI] [PubMed] [Google Scholar]
- 67. Li L, Gu M, You B, et al. Long non‐coding RNA ROR promotes proliferation, migration and chemoresistance of nasopharyngeal carcinoma. Cancer Sci 2016; 107 : 1215–1222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Li C, Lu L, Feng B, et al. The lincRNA‐ROR/miR‐145 axis promotes invasion and metastasis in hepatocellular carcinoma via induction of epithelial‐mesenchymal transition by targeting ZEB2. Sci Rep 2017; 7 : 4637. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 69. Wang KC, Yang YW, Liu B, et al. A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 2011; 472 : 120–124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Pandey GK, Mitra S, Subhash S, et al. The risk‐associated long noncoding RNA NBAT‐1 controls neuroblastoma progression by regulating cell proliferation and neuronal differentiation. Cancer Cell 2014; 26 : 722–737. [DOI] [PubMed] [Google Scholar]
- 71. Kopp JL, von Figura G, Mayes E, et al. Identification of Sox9‐dependent acinar‐to‐ductal reprogramming as the principal mechanism for initiation of pancreatic ductal adenocarcinoma. Cancer Cell 2012; 22 : 737–750. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. West NR, Murphy LC, Watson PH. Oncostatin M suppresses oestrogen receptor‐alpha expression and is associated with poor outcome in human breast cancer. Endocr Relat Cancer 2012; 19 : 181–195. [DOI] [PubMed] [Google Scholar]
- 73. Lin A, Hu Q, Li C, et al. The LINK‐A lncRNA interacts with PtdIns(3,4,5)P3 to hyperactivate AKT and confer resistance to AKT inhibitors. Nat Cell Biol 2017; 19 : 238–251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Ponting CP, Oliver PL, Reik W. Evolution and functions of long noncoding RNAs. Cell 2009; 136 : 629–641. [DOI] [PubMed] [Google Scholar]
- 75. Mus E, Hof PR, Tiedge H. Dendritic BC200 RNA in aging and in Alzheimer's disease. Proc Natl Acad Sci U S A 2007; 104 : 10679–10684. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Wang H, Iacoangeli A, Popp S, et al. Dendritic BC1 RNA: functional role in regulation of translation initiation. J Neurosci 2002; 22 : 10232–10241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Sosińska P, Mikuła‐Pietrasik J, Książek K. The double‐edged sword of long non‐coding RNA: the role of human brain‐specific BC200 RNA in translational control, neurodegenerative diseases, and cancer. Mutat Res Rev Mutat Res 2015; 766 : 58–67. [DOI] [PubMed] [Google Scholar]
- 78. Carrieri C, Forrest AR, Santoro C, et al. Expression analysis of the long non‐coding RNA antisense to Uchl1 (AS Uchl1) during dopaminergic cells' differentiation in vitro and in neurochemical models of Parkinson's disease. Front Cell Neurosci 2015; 9 : 114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Chung DW, Rudnicki DD, Yu L, et al. A natural antisense transcript at the Huntington's disease repeat locus regulates HTT expression. Hum Mol Genet 2011; 20 : 3467–3477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. Johnson R. Long non‐coding RNAs in Huntington's disease neurodegeneration. Neurobiol Dis 2012; 46 : 245–254. [DOI] [PubMed] [Google Scholar]
- 81. Wu T, Du Y. LncRNAs: from basic research to medical application. Int J Biol Sci 2017; 13 : 295–307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82. Prabhakar B, Zhong XB, Rasmussen TP. Exploiting long noncoding RNAs as pharmacological targets to modulate epigenetic diseases. Yale J Biol Med 2017; 90 : 73–86. [PMC free article] [PubMed] [Google Scholar]
- 83. Hulstaert E, Morlion A, Avila Cobos F, et al. Charting extracellular transcriptomes in the human biofluid RNA atlas. bioRxiv 2019: 823369. 10.1101/823369 not peer reviewed. [DOI] [PubMed] [Google Scholar]
- 84. Carter G, Miladinovic B, Patel AA, et al. Circulating long noncoding RNA GAS5 levels are correlated to prevalence of type 2 diabetes mellitus. BBA Clin 2015; 4 : 102–107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85. Yin Q, Wu A, Liu M. Plasma long non‐coding RNA (lncRNA) GAS5 is a new biomarker for coronary artery disease. Med Sci Monit 2017; 23 : 6042–6048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86. Liu Z, Xie D, Zhang H. Long noncoding RNA neuroblastoma‐associated transcript 1 gene inhibits malignant cellular phenotypes of bladder cancer through miR‐21/SOCS6 axis. Cell Death Dis 2018; 9 : 1042. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 87. Thin KZ, Tu JC, Raveendran S. Long non‐coding SNHG1 in cancer. Clin Chim Acta 2019; 494 : 38–47. [DOI] [PubMed] [Google Scholar]
- 88. Velagapudi SP, Gallo SM, Disney MD. Sequence‐based design of bioactive small molecules that target precursor microRNAs. Nat Chem Biol 2014; 10 : 291–297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89. Velagapudi SP, Cameron MD, Haga CL, et al. Design of a small molecule against an oncogenic noncoding RNA. Proc Natl Acad Sci U S A 2016; 113 : 5898–5903. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Disney MD, Dwyer BG, Childs‐Disney JL. Drugging the RNA world. Cold Spring Harb Perspect Biol 2018; 10 : a034769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91. Bon C, Luffarelli R, Russo R, et al. SINEUP non‐coding RNAs rescue defective frataxin expression and activity in a cellular model of Friedreich's Ataxia. Nucleic Acids Res 2019; 47 : 10728–10743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Exploiting long noncoding RNAs as pharmacological targets to modulate epigenetic diseases. [Accessed 25 November 2019]. Available from: https://ClinicalTrials.gov/show/NCT04049448 [PMC free article] [PubMed]
- 93.Study of two doses of ABX464 in participants with moderate to severe rheumatoid rrthritis. [Accessed 25 November 2019]. Available from: https://ClinicalTrials.gov/show/NCT03813199
- 94.Efficacy and safety stud of ABX464 as maintenance therapy in patients with moderate to severe ulcerative colitis. [Accessed 25 November 2019]. Available from: https://ClinicalTrials.gov/show/NCT04023396
- 95. Vautrin A, Manchon L, Garcel A, et al. Both anti‐inflammatory and antiviral properties of novel drug candidate ABX464 are mediated by modulation of RNA splicing. Sci Rep 2019; 9 : 792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96. Kong Y, Geng C, Dong Q. LncRNA PAPAS may promote triple‐negative breast cancer by downregulating miR‐34a. J Int Med Res 2019; 47 : 3709–3718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97. Bierhoff H, Dammert MA, Brocks D, et al. Quiescence‐induced LncRNAs trigger H4K20 trimethylation and transcriptional silencing. Mol Cell 2014; 54 : 675–682. [DOI] [PubMed] [Google Scholar]
- 98. Yildirim E, Kirby JE, Brown DE, et al. Xist RNA is a potent suppressor of hematologic cancer in mice. Cell 2013; 152 : 727–742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99. Chen YK, Yen Y. The ambivalent role of lncRNA Xist in carcinogenesis. Stem Cell Rev Rep 2019; 15 : 314–323. [DOI] [PubMed] [Google Scholar]
- 100. Chanda K, Das S, Chakraborty J, et al. Altered levels of long ncRNAs Meg3 and Neat1 in cell and animal models of Huntington's disease. RNA Biol 2018; 15 : 1348–1363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101. Zhou Y, Zhang X, Klibanski A. MEG3 noncoding RNA: a tumor suppressor. J Mol Endocrinol 2012; 48 : R45–R53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102. Zhang D, Qin H, Leng Y, et al. LncRNA MEG3 overexpression inhibits the development of diabetic retinopathy by regulating TGF‐beta1 and VEGF. Exp Ther Med 2018; 16 : 2337–2342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103. Huarte M, Guttman M, Feldser D, et al. A large intergenic noncoding RNA induced by p53 mediates global gene repression in the p53 response. Cell 2010; 142 : 409–419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104. Chillón I, Pyle AM. Inverted repeat Alu elements in the human lincRNA‐p21 adopt a conserved secondary structure that regulates RNA function. Nucleic Acids Res 2016; 44 : 9462–9471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105. Hajjari M, Salavaty A. HOTAIR: an oncogenic long non‐coding RNA in different cancers. Cancer Biol Med 2015; 12 : 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106. Bhan A, Mandal SS. LncRNA HOTAIR: a master regulator of chromatin dynamics and cancer. Biochim Biophys Acta 1856; 2015 : 151–164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107. He S, Liu S, Zhu H. The sequence, structure and evolutionary features of HOTAIR in mammals. BMC Evol Biol 2011; 11 : 102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108. Rinn JL, Kertesz M, Wang JK, et al. Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell 2007; 129 : 1311–1323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109. Sun Y, Zhou Y, Bai Y, et al. A long non‐coding RNA HOTTIP expression is associated with disease progression and predicts outcome in small cell lung cancer patients. Mol Cancer 2017; 16 : 162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110. Navarro A, Moises J, Santasusagna S, et al. Clinical significance of long non‐coding RNA HOTTIP in early‐stage non‐small‐cell lung cancer. BMC Pulm Med 2019; 19 : 55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111. Zerbino DR, Achuthan P, Akanni W, et al. Ensembl 2018. Nucleic Acids Res 2018; 46 : D754–D761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112. Thurmond J, Goodman JL, Strelets VB, et al. FlyBase 2.0: the next generation. Nucleic Acids Res 2019; 47 : D759–D765. [DOI] [PMC free article] [PubMed] [Google Scholar]