Abstract
Retinoblastoma is a rare pediatric tumor of the retina, caused by the homozygous loss of the Retinoblastoma 1 (RB1) tumor suppressor gene. Previous microarray studies have identified changes in the expression profiles of coding genes; however, our understanding of how non-coding genes change in this tumor is absent. This is an important area of research, as in many adult malignancies, non-coding genes including LNC-RNAs are used as biomarkers to predict outcome and/or relapse. To establish a complete and in-depth RNA profile, of both coding and non-coding genes, in Retinoblastoma tumors, we conducted RNA-seq from a cohort of tumors and normal retina controls. This analysis identified widespread transcriptional changes in the levels of both coding and non-coding genes. Unexpectedly, we also found rare RNA fusion products resulting from genomic alterations, specific to Retinoblastoma tumor samples. We then determined whether these gene expression changes, of both coding and non-coding genes, were also found in a completely independent Retinoblastoma cohort. Using our dataset, we then profiled the potential effects of deregulated LNC-RNAs on the expression of neighboring genes, the entire genome, and on mRNAs that contain a putative area of homology. This analysis showed that most deregulated LNC-RNAs do not act locally to change the transcriptional environment, but potentially function to modulate genes at distant sites. From this analysis, we selected a strongly down-regulated LNC-RNA in Retinoblastoma, DRAIC, and found that restoring DRAIC RNA levels significantly slowed the growth of the Y79 Retinoblastoma cell line. Collectively, our work has generated the first non-coding RNA profile of Retinoblastoma tumors and has found that these tumors show widespread transcriptional deregulation.
Keywords: retinoblastoma, LNC-RNAs, RNA-sequencing, LNC targeting, DRAIC
Introduction
The protein produced from the Retinoblastoma 1 (RB1) tumor suppressor gene, pRB (1), functions as a molecular scaffold, to bind to and repress the activator E2 promoter binding factor (E2F) transcription factors 1-3 (E2F1-3) (2). The activator E2F's bind to the promoters of genes necessary for cell cycle progression (3) and apoptosis and induce their expression (2). In this context, pRB plays a central role in controlling the cell cycle and acts a critical check-point to prevent cells with DNA damage or other abnormal features from continuing to divide (4). This activity has led to the widespread inactivation of the pRB/E2F in cancer cells, and a number of genomic events disrupt pRB function (1, 5) or lead to constitutive phosphorylation, and inactivation pRB (6–10). Loss of pRB-activity occurs in the vast majority of tumor types, however background mutations, and the very high frequency of pRB and p53 co-inactivation has made understanding the transcriptional changes solely driven by pRB-loss in tumors difficult to determine.
To identify transcriptional changes to both messenger RNA (mRNA) and long non-coding RNAs (LNC-RNAs) dependent on pRB-loss, we used RNA-sequencing (RNA-seq) to profile Retinoblastoma tumors and normal retinal tissue from patients. Retinoblastoma is a rare pediatric tumor of the Retina that affects around 6,000–8,000 children every year (11, 12). There is currently no targeted therapy for Retinoblastoma (13–16) and many patients suffer secondary tumors (17, 18), or developmental defects, due to treatment. In over 95% of cases, the development of this tumor is driven the homozygous deletion of the Retinoblastoma 1 (RB1) tumor suppressor gene, on chromosome 13p14 (1). The remaining ~5% of Retinoblastoma's are caused by the amplification of the V-Myc Avian Myelocytomatosis Viral Oncogene Neuroblastoma Derived Homolog (N-MYC) gene (19, 20). Loss of TP53 or the upregulation of negative regulators of p53, including MDMX, is also common in Retinoblastoma (21).
As these tumors are collected from pediatric patients, they contain very few genomic alterations and represent an ideal resource for understanding the coding and non-coding transcriptional effects driven by pRB-loss. Determining how loss of pRB changes cells is important for expanding our treatment options for patients with Retinoblastomas, and may provide insights into how RB1 contributes to additional tumor types (22). To expand on previous microarray studies (14, 23–25), we conducted the first in-depth RNA-sequencing (RNA-seq) analysis of Retinoblastoma. This analysis builds upon the mRNA profiling of earlier studies and expands the coverage of non-coding changes.
Results
RNA-Sequencing From Retinoblastoma and Retinal Tissue
To determine how the loss of the Retinoblastoma 1 (RB1) tumor-suppressor gene changes the expression of coding and non-coding RNAs in tumors, we conducted RNA-sequencing (RNA-seq) from 7 Retinoblastoma patients. The tumors were removed from patients between 8 months and 6 years of age, with a mean age of 2 years and 8 months. Importantly, the gender demographics of these patients is equally split between males (4) and females (3), and between invasive (3), and non-invasive (4) tumors. A complete list of the patient and control sample demographics is included in Table 1. We also conducted RNA-seq on control Retinal samples and incorporated additional public retinal datasets to build a transcriptional profile of normal retina (Supplementary Tables 1, 2). The RNA-seq profiles of these samples were then separated, and clustered to measure transcriptional changes in Retinoblastoma tumors (Figure 1A). Using normalized values, we found significant and recurrent changes in both coding and long non-coding RNAs (LNC-RNAs) within the tumors (Figure 1B).
Table 1.
Patient ID | RNASeq_Ref ID | Pathology Ref ID | Age at the time of enucleation | Histo pathology report | Sex | Unilateral/Bilateral | Chemo status before enucleation | Invasiveness |
---|---|---|---|---|---|---|---|---|
C1 | 48428 | 12 years Retina | 12 Y | N/A | Unknown | N/A | N/A | N/A |
1 | 48429 | 67/13 | 6 Y | Retinoblastoma, poorly differentiated, focal invasion of choroid measuring more than 3mm. There is prelaminar and laminar invasion of the optic nerve present. Hyperchromatic cells are seen in the trabecular meshwork and over the iris surface. The post laminar portion and the surgical end of optic nerve are free from tumor cells. | Male | OD (IIRC Grade E) | No | Invasive |
2 | 48430 | 1563/15 | 3 Y | Retinoblastoma, poorly differentiated, endophytic growth pattern. No invasion of choroid. No invasion of optic nerve. | Male | OS (IIRC Grade E) | No | Non-Invasive |
C2 | 48431 | 5 years Retina | 5 Y | N/A | Unknown | N/A | N/A | N/A |
3 | 48432 | 1561-15 | 2 Y | Retinoblastoma well differentiated. No invasion of iris.No invasion of choroid and optic nerve. | Female | OD (IIRC Grade E) | No | Non-Invasive |
4 | 48433 | 1630-15 | 3 Y | Retinoblastoma, poorly differentiated. No invasion of choroid. No invasion of optic nerve. | Female | OD (IIRC Grade D) | No | Non-Invasive |
5 | 48434 | 647-13 | 1 Y | Retinoblastoma, well differentiated, focal retinal pigment epithelial invasion, there is no invasion of choroid. Preliminar invasion of optic nerve is seen. The laminar, post laminar and surgical end of optic nerve is free from tumor. | Female | OD | Yes (Two cycles) | Invasive |
6 | 48435 | 1548-14 | 3 Y | Retinoblastoma poorly differentiated. | Male | OS (IIRC Grade D) | No | Non-Invasive |
There is no invasion of choroid. There is No invasion of optic nerve. No invasion into iris stroma. | ||||||||
C3 | 48436 | 22 years Retina | 22 Y | N/A | Unknown | N/A | N/A | N/A |
7 | 48437 | 768-13 | 8 Months | Retinoblastoma, well differentiated, no invasion of choroid. There is prelaminar, laminar, post laminar invasion of optic nerve. Separate section at the surgical end of optic nerve is free from tumor cells. There is no scleral invasion. | Male | OD (IIRC Grade E) | No | Invasive |
We then used Gene Ontology (GO) analysis (26, 27) to identify biological processes altered in Retinoblastoma tumors. In accordance with previous microarray studies, we found DNA replication and mitotic progression genes were strongly upregulated and retina specific genes downregulated in tumors (Supplementary Figures 1A–C). In addition, the majority of upregulated genes (>3-fold) were either E2F (28, 29) or MYC target genes (Supplementary Figures 1D,E) (30). To independently test the levels of coding gene expression, we used RT-PCR to measure the levels of a subset of E2F-target genes from an independent cohort of tumors and retina controls (Supplementary Figure 2). This analysis confirmed that E2F target genes are consistently increased in RB1-mutant Retinoblastomas (Figure 1C). We next compared our Retinoblastoma transcriptional profile to datasets from other pediactric tumors. From this analysis, we found very limited overlap suggesting the gene signature of Retinoblastoma is unique (Supplementary Figure 3A).
Aberrant RNA Processing Events in Retinoblastoma
Previous microarray studies of Retinoblastoma tumors have provided important insights into the coding RNA changes in these tumors. However, these technologies are limited in their capacity to detect RNA fusion products. To address this, we searched for RNA-fusions within our RNA-seq dataset and identified 22 putative RNA fusion events. From this collection, we selected two putative RNA hybrids for detailed examination, using RT-PCR. For this RT-PCR based evaluation, we used 10 normal retinal tissue samples and a larger cohort (>13) of Retinoblastoma tumors (Figure 1D). From this work, we were able to detect RNA from the LSG1-TMEM44 RNA fusion in 73% of tumors, whilst only 1 control sample had detectable levels (Figures 1D,E, Supplementary Figure 3B). In contrast, the CIRBP-C19orf24 fusion occurred in 31% of tumors but was not found in control samples (Figures 1D,F, Supplementary Figure 3B). We then sequenced these RT-PCR products and confirmed the RNA fusion (Figures 1E,F). From this analysis, we have identified and confirmed the presence of RNA-fusions in Retinoblastoma.
Non-coding RNA Changes in Tumors
Very little is known about how loss of the pRB tumor-suppressor gene affects the transcription of LNC-RNAs in tumors. To address this, we next analyzed the expression patterns of LNC-RNAs in Retinoblastoma and found widespread alterations in LNC-RNA levels (Figure 2A). Subsets of LNC-RNAs were both up and downregulated in tumors and could be used to sub-classify tumors from normal tissue (Figure 2A). This analysis identified groups of LNC-RNAs that show significant alterations in their levels (Figure 2B). As many of these genes are poorly characterized, we chose to measure the levels of a subset of these de-regulated LNC-RNAs in an independent tumor and control cohort. We measured each candidate LNC-RNA using RT-PCR and found that our results correlated well with our RNA-seq analysis. Up-regulated LNC-RNAs (LNC00152 and LNC01053) were consistently significantly increased (Figures 2C,D), whilst down-regulated LNC-RNAs (LNC01137 and LNC00575) were decreased in 2 of the 3 tumors (Figures 2E,F). Predicting how LNC-RNAs may function in tumors is challenging and is dependent on a number of factors. We next investigated how the expression of each deregulated LNC-RNA correlated with the expression of genes in the entire genome, locally, or in trans. We first compared the genomic loci of significantly up or downregulated LNC-RNAs, with the expression of deregulated coding genes that displayed reciprocal patterns. From this assay, we found that de-regulated LNC-RNAs (either up or downregulated) showed statistically significant anti-correlates with coding genes, however, these patterns were not restricted by chromosomal location (Figure 2G). This analysis suggested that the majority of transcriptionally altered LNC-RNAs may not affect the local expression of genes.
Despite this global pattern, we did identify a subset of LNC-RNAs, whose changes in expression correlated with that of their neighboring gene. In particular, reduced expression of the LNC-RNAs: CERNA1, AC025259.3, AL139220.2, and AC090360.1, all correlated with lower mRNA levels of the genes that they overlap with, including: GNB5, NR4A1, SLC6A9, and RBFADN (Figure 3A). Interestingly, when we measured adjoining LNC-RNA: mRNA pairs, we observed two different patterns. In the cases of NUP50-AS1 and LNC01431, we found that LNC-RNA downregulation was associated with increased expression of their neighboring genes, NUP50 and NXT1 (Figure 3B, Black and Blue bars). More commonly however, we found that reduced LNC-RNA levels, AC103706.1, EPB41L4A-AS2, AC110716.2, AL391121.1, and LNC02091, was correlated with lower mRNA levels of their adjoining genes, namely: ST3GAL1, EPB41L4A, NMNAT3, TRIM8, and DOC2B (Figure 3B). Analysis of the correlation in expression of proximal LNC-RNAs and their neighboring genes identified both anti (AC027287.2-NEUROD4) and positively correlated associations (AL354872.2-CTH, AL023284.4-MAP7, LNC00575-SCD5, and LNC01844-ARHGAP26) (Figure 3C). Collectively, this analysis identified a subset of LNC-RNAs and their neighboring genes that have altered expression patterns in Retinoblastoma.
The majority of LNC-RNA expression changes in these tumors do not correlate with genes geographically associated with them. We next examined whether LNC-RNAs with altered mRNA levels correlate with the expression any other genes across the genome. To do this, we stratified the LNC-RNAs and measured the average correlation of changed mRNAs. From this analysis, we identified several LNC-RNAs that show both average positive and negative relationships with the altered mRNAs (Figure 3D). We next selected 10 of these LNC-RNAs for closer examination and found that each LNC-RNA was significantly correlated to many deregulated genes in Retinoblastoma (Figure 3E). Collectively, this analysis shows that LNC-RNAs have diverse and putatively complex relationships with the expression of their neighboring genes and genes throughout the genome.
Non-coding RNA Changes and Their Putative Effects in trans
We next investigated the putative interaction between deregulated LNC-RNAs in Retinoblastoma and mRNAs that contain regions that could be putatively bound by the LNC-RNA. To evaluate this, we identified gene expression changes in coding genes that had highly homologous regions to those of de-regulated LNC-RNAs. We then computationally measured the potential for RNA-RNA duplexes to form, between the coding gene, and the LNC-RNA (31). Our analysis used the RNA-RNA interaction database (31).
This analysis of the putative targets of upregulated LNC-RNAs in Retinoblastoma tumors identified a group of 68 genes that are increased and 6 genes that are decreased (2-fold changed and p > 0.05) (Figure 4A, UP: Dark green balls, DOWN: Bright red balls). Gene Ontology of these increased mRNAs (>2-fold), that are putatively targeted by upregulated LNC-RNAs, showed that they function in processes supporting cell division and microtubule function (Figure 4B). In the genes downregulated (2-fold lower), we identified enrichments in genes modulating cellular adhesion and retinal development (Figure 4C). We next determined the putative targets for the LNC-RNAs downregulated in Retinoblastoma tumors, using the same stringency parameters. From this, we found 54 putative mRNAs that are increased and 8 genes with decreased expression (2-fold changed and p > 0.05) (Figure 4D, UP: Dark green balls, DOWN: Bright red balls). Gene Ontology analysis of these increased mRNAs, that are putatively targeted by down-regulated LNC-RNAs, showed that cell attachments and proliferation factors are the molecular processes putatively changed by lower LNC-RNA levels (Figure 4E). Putative LNC-RNA targets with lower expression levels in Retinoblastoma tumors are involved in muscle associated processes (Sarcolemma) and Endocytosis (Figure 4F). Collectively, this analysis highlights the potential complexity of LNC-RNA function in Retinoblastoma.
We next investigated how modulating the levels of a down-regulated LNC-RNA in Retinoblastoma affected the growth of Y79 Retinoblastoma cells. For this, we selected the DRAIC LNC-RNA, as DRAIC levels are strongly decreased in tumors (Figure 5A) and in our independent Retinoblastoma cohort (Figure 5B). We next tested how DRAIC over-expression contributed to the growth of Retinoblastoma cells. To do this, we transfected cells with either empty pCDNA plasmids or pcDNA-DRAIC over-expression plasmids (Figure 5C) (32) and measured Y79 growth. From these experiments, we found that DRAIC over-expression significantly slows Y79 growth (Figure 5D), without promoting cell death.
Discussion
Inactivation of the RB/E2F pathway is a common feature of cancer (22). In this analysis, we sought to understand how the homozygous deletion of the RB1 gene changed the transcriptional profile of both coding and non-coding genes in the rare pediatric tumor, Retinoblastoma. To do this, we compared the gene expression of Retinoblastoma tumors with that from normal retinal tissue using RNA-sequencing. This represents the first time that such an approach has been used to generate a complete transcriptional profile of these tumors and builds upon previous coding RNA studies (23–25, 33). Our work expands our coverage of the genome for Retinoblastoma and includes the expression from non-coding regions of the genome, including long non-coding RNAs (LNC-RNAs), anti-sense RNAs, and transposable elements. This new in-depth analysis of RNA changes, coupled with our recent proteomic profiling of this cohort of Retinoblastoma (34), provides a new resource for the Retinoblastoma and RB-loss research community.
In agreement with previous microarray based studies (23–25, 33), we find very strong transcriptional upregulation of E2F target genes involved in cell cycle progression and apoptosis. In contrast, p53 and MYC target genes are only mildly affected. Most of the down regulated genes are associated with retina cell differentiation and ocular development. These results are consistent with retinal cells when they abnormally differentiate and cause lineage specific gene expression changes.
As RNA-seq provides a unique platform for detecting rare fusion or trans-spliced transcripts, we investigated the frequency of these events in Retinoblastoma. There are multiple potential mechanisms for generating the hybrid RNAs, including chromosomal translocation, deletions or inversions, gene fusion, or aberrant splicing events. We independently evaluated each dataset to identify potential fusion transcripts, as these have been shown to have potential oncogenic (35) or predictive roles (36) in different malignancies. Our analysis revealed a small subset of putative fusion transcripts in our tumor cohort between consecutive genes, where transcripts of the two genes are transcribed into one chimeric transcript by splicing out the intergenic region. Further independent testing of these candidate genes, using RT-PCR, found 2 bone fide fusion transcripts that occurred at high frequencies in Retinoblastoma tumors, relative to normal retinal tissue. Although much more work is needed to test the possible function of these RNAs, this work suggests that RNA-fusion products are common in Retinoblastoma tumors.
This work represents the first genome-wide profiles of non-coding RNA changes in Retinoblastoma tumors. Our profiles identified significant and widespread differences in the levels of anti-sense, LNC-RNAs and transposable elements in tumor samples, compared to normal retinal tissue. Although there is some inter-tumoral variation between LNC-RNA levels, the majority show consistent changes which could be independently confirmed in an unrelated Retinoblastoma cohort. Determining the potential roles of LNC-RNAs can be challenging, as they can potentially function via many mechanisms to affect local or distant gene expression or directly interact with other RNAs to alter their stability and/or translation capacity. To address this complex question, we conducted an exhaustive analysis of how altered LNC-RNAs may contribute to the relative mRNA levels in Retinoblastoma.
First, we investigated how each altered LNC-RNA correlated with deregulated mRNA levels, based on chromosome location. These predictions suggest that although each LNC-RNA did show anti-correlates with several mRNAs, these were spread throughout the genome, suggesting that most of the changed LNC-RNAs in Retinoblastoma do not function in cis. In agreement with this hypothesis, we detected only 16 LNC-RNAs that showed tight expression correlations with their overlapping, adjoining or proximal gene partners. Second, by generating correlation efficiencies for each of the 400 most deregulated LNC-RNAs in our tumors to the mRNA changes, we found that many LNC-RNAs showed a relative positive or negative relationship to mRNA levels. Third, using new LNC-RNA: mRNA interaction platforms followed by gene ontology analysis of these putative interactions, we found that many up and down regulated LNC-RNAs have strong potential interactions with genes involved in cellular processes that sustain Retinoblastoma growth including cell division and cell attachment. These data provide a complete analysis of the transcriptional changes for both coding and non-coding genes in the pediatric Retinoblastoma tumor.
To measure how altered LNC-RNAs may contribute to Retinoblastoma growth, we evaluated how re-expressing a down-regulated LNC-RNA in Y79 Retinoblastoma cell lines altered cell number. DRAIC expression slowed Y79 cell growth compared to controls, in support of previous data from lung cancer, showing that reducing the expression of DRAIC back to physiological levels changes the proliferation potential of cells. These experiments show that modulating the levels the DRAIC LNC-RNA does affect Retinoblastoma cells.
Methods
Patient Samples for RNA-seq
The 2 normal Retinas were obtained from donor eye balls from the PU Shah eye bank. The age was 12 and 22 years, respectively. The Retinoblastoma tumors were obtained from informed consent provided by the parents of the children undergoing treatment at Sankara Nethralaya, India. The protocol was approved by institutional review board on Ethical practices for research at Sankara Nethralaya. The pathological characterizations of the tumors are presented in Table 2. For RNA-sequencing studies, the tumors and retinas were stored at −80°C until further use. The medical record pathology numbers of these samples were de-identified before the tumors were released by the hospital. Each sample was then assigned an identification number by the hospital. The RNA was then used for RNA-sequencing. The samples were named from 48,428 to 48,437.
Table 2.
Target | Forward primer | Reverse primer | |
---|---|---|---|
1 | ORC1 | CCCTATCAGTGGGGGACAGA | ATGGGGAGTAGAGGTCGCTT |
2 | SMC2 | TCAGCCAGATGTATTGCACCA | CACATGAACGTTGTCAGGGC |
3 | SMC4 | CCTGTTGTCATGCACTGGACT | TCGGTCATCTTTTTCGCCCA |
4 | TOP2A | AGCTGGATCAGTGGCTGAAAT | GCCTGGTACCAAACTGACCA |
5 | MCM4 | TGTTTGCTCACAATGATCTCG | CGAATAGGCACAGCTCGATA |
6 | E2F1 | ACTCCTCGCAGATCGTCATCATCT | GGACGTTGGTGATGTCATAGATGCG |
7 | CENPE | GATTTGGATGAATTTGAGGCTCT | ACTTCTGCATGCTTAACTAAATTCT |
8 | LNC00505 | GTGAAGACCCCCTTTCCCAC | AGATGCTGGCTAGTTTGGGG |
9 | LNC01053 | CTGTCCCTATCTGGAGCCCT | GAAAGATGTGTGCGTGACCG |
10 | LNC00152 | CCAGCACCTCTACCTGTTGC | GCCAGACAAATGGGAAACCG |
11 | LNC01155 | ACCTTCTTGGCCCTGCTTAG | TAGAGGGTGGCCCTTAGTGA |
12 | LNC-PINT | CGCAGAGGGACAAATCCAGT | CCCCGGAGAGCAATGAGTTT |
13 | LNC00844 | TCTGATAGGAGGATGGGGGTG | CTTAGCCATGCAAGAAACCTCC |
14 | LNC01137 | AGCGATCTTGGGGGAAGTTG | GGGTTAGGGAGTGGCATCAC |
15 | LNC00575 | GAATTGGCACAGATCCAGAGC | GGTTCTCATCCCCTGAAGTCT |
16 | LNC00606 | CACTGCTTTGGTCAGGGAGT | ACTCTCCTTGTCAGCGGTTG |
17 | β-actin | TCACCCACACTGTGCCCATCTACG | CAGCGGAACCGCTCATTGCCAATG |
18 | LSG1_TMEM44 | AAGCTGCTGTACTGCCATCCTC | CAGGAGGAGGCGCAGATC |
19 | CIRBP_C19orf24 | AGGACTCGGGGAAGGGTG | ATTCCGGGACTCAAACACCG |
20 | DRAIC | TGAACTCAACTCCTGAGAAGGAC | CGCTCTCAGACTCTTCAGTTCTC |
21 | β-actin | TCACCCACACTGTGCCCATCTACG | CAGCGGAACCGCTCATTGCCAATG |
RNA Extraction for RNA-seq
The Retinae and tumors were thawed, and RNA was extracted using TRIzol reagent (#15596026, Thermo Fisher Scientific) according to the manufacturer's instructions. The samples were homogenized in TRIzol reagent and incubated at room temperature for 5 min. Next, chloroform was added, and the tubes were mixed vigorously for 15 s followed by incubation at room temperature for 2–3 min. Next, the samples were centrifuged at 4°C for 15 min at 12,000 g. RNA, which is part of the aqueous phase, is collected cautiously and is precipitated using isopropanol, followed by 75% ethanol washes. After the washes, RNA pellet is dried completely (5–10 min) before re-suspending in nuclease free water. Total RNA thus extracted is qualified on 2,200 Tape Station using the RNA specific Tapes and reagents (#5067-5576, 5577, Agilent Technologies). Quantification of the RNA was done using Qubit 3.0 Fluorimeter (#Q33216, Thermo Fisher Scientific) with an RNA HS Assay (#Q32852, Thermo Fisher Scientific).
RNA Library Preparation and RNA-seq
Once the RNA passed QC, it was taken up for library preparation using the TruSeq RNA Library Prep Kit V2 (#RS-122-2001/RS-122-2002, Illumina Inc.). During the library preparation, the Ribosomal RNA (rRNA) were removed using biotinylated, rRNA specific oligo's combined with Ribo-zero rRNA removal beads (#MRZG12324, Illumina Inc.). Following purification, the Ribo-depleted RNA is subjected to fragmentation in the presence of divalent cations at elevated temperatures. The cleaved RNA fragments were copied into first strand cDNA using reverse transcriptase and random primers. Then, second strand cDNA synthesis was performed, using DNA polymerase I and RNase H. These cDNA fragments will go through the process of repairing the ends, addition of a single “A” base, followed by ligation of the adapters. The ligation product was purified using Agencourt AMPure XP beads (#A63882, Beckman Coulter) and enriched using PCR to create the final cDNA library for sequencing. After a final cleanup with Agencourt AMPure XP beads, the library was quantified using Qubit DNA Assay and the fragments were assessed using DNA Tape Station D1000 Screen Tape (#5067-5582,5583, Agilent Technologies). The quantified libraries were then clonally amplified on a cBOT and sequenced on the HiSeq2500 with 100 bp paired end chemistry. 5′AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT3′
The TruSeq Universal Adapter. The TruSeq Illumina Index Adapter consisted of following sequence: 5′GATCGGAAGAGCACACGTCTGAACTCCAGTCAC[INDEX]ATCTCGTATGCCGTCTTCTGCTTG3′.
Bioinformatics
The raw fastq files generated in sequencing were first subjected to a quality check to filter out substandard data. During quality control, the following parameters were checked: base quality score distribution, sequence quality score distribution, average base content per read, GC distribution in the reads, PCR amplification issue, check for over-represented sequences and finally, adapter trimming. Based on the QC report, the reads having a phred quality score of Q30 were either removed or trimmed as per requirement. The adapter trimming was done using fastq-mcf v-1.04.676 and cutadapt v-1.8dev (ftp://ftp.ensembl.org/pub/release75/gtf/homo_sapiens/Homo_sapiens.GRCh37). The processed data were then checked for the presence of unwanted sequences or contamination (e.g., non-polyA tailed RNAs, mitochondrial genome sequences, ribosomal RNAs, transfer RNAs, adapter sequences, and others), and those contaminations were removed using Bowtie2 v-2.2.4 (37). The first step of analyzing the processed RNA-seq data is to align them to the reference human genome. To do this, the paired-end reads were aligned to reference human genome, GRCh37/hg19 (2), using HISAT v-0.1.7 (38). The aligned reads were then subjected to expression estimation of individual genes and transcripts by calculating the FPKM values (Fragments Per Kilo base of transcript per Million mapped reads) using cufflinks v-2.2.1 (39). Finally, the differential expressions of genes were carried out using cuffdiff v-2.2.1 & DESEQ2 (40). Genes with a log2 fold change of more than 1 or less than −1 and a p < 0.05 were termed as differentially expressed genes.
Global changes in mRNA and biological triplicate samples per experimental condition were analyzed using RNA-Seq, specifically with the consensus Bioconductor (41, 42) workflow; FASTQC for trimming, genomic alignment in Rsubread, and differential expression with limma (43) and visualization (pheatmap) and volcano plots were generated in ggplot2.
Subsequently, the Pearson's correlation was measured for each of the top differentially expressed mRNAs or lncRNAs, and a cross-correlation of q values was generated. These significant correlations, >0.3 or <-0.3, were also visualized in pheatmap with the chromosome and genomic start location used to annotate each species. Finally, the 10 lncRNA with the greatest sum of correlations were selected and the mRNA correlations illustrated with the most significant mRNA species identified.
LNC-RNA Target Prediction
LNC-RNAs that were statistically significantly up or downregulated in Retinoblastoma tumors, compared to normal retinal tissue, were profiled to determine their putative mRNA targets. LNC-RNAs were retained for this analysis if they had SUMENERGY scores of above 2,000 for at least 10 targets and all putative substrates with SUMENERGY above 1,500 were included. This step was taken to reduce the number of weak substrates included in the Gene Ontology. LNC-RNAs that did not have this criteria were excluded, as were mRNAs that were predicted in >4 LNC-RNAs as this was taken as a false positive. LNC-RNAs were entered into the RNA-RNA interaction database (human) and the predicted targets were extracted. These mRNAs, their associated SUMENERGY score and change in expression, were used to sub-group the putative LNC-RNA targets in the tumor cohort. These groups were then used in Gene Ontology analysis using DAVID (44, 45).
LNC-RNA Neighboring Gene Analysis
LNC-RNAs are known to regulate gene expression in cis-acting and trans-acting ways. We carried out neighboring gene analysis and LNC RNA-mRNA interaction analysis to identify the probable targets of LNC-RNA. LNC-RNAs identified to be differentially expressed were classified based on genomic location, and this was done with any other gene (irrespective of coding nature). The following distance parameters were used: Overlap is within any other gene, Adjoining is within 5 kb of another gene, Proximal is within 10 kb of another gene, Others >10 kb of another gene. Those neighboring protein-coding genes that were found to be significantly expressed were considered to be influenced by LNC-RNA.
Tumors and RNA Extraction Method for RT-PCR
Retinoblastoma samples were provided by the Cooperative Human Tissue Network, a National Cancer Institute supported resource. Other investigators may have received samples from these same tissue specimens. Normal retinal tissue was obtained from warm autopsy samples from Mohamed Abdel-Rahman at the Ohio State University. RNA was extracted from each of the three retinoblastoma tissue samples and four retina tissue samples that were received snap frozen. To each sample, 1 mL TRIzol Reagent (Invitrogen) was added to 10 mg of tissue and the samples were homogenized using a homogenizer and incubated at room temperature for 5 min. 0.2 mL of chloroform was added to the samples and incubated at room temperature for 2 min, then the samples were centrifuged for 15 min at 12,000 × g at 4°C. The upper aqueous phase containing the RNA was transferred to a new tube and 0.5 mL of isopropanol was added, then this mixture was incubated at room temperature for 10 min. The samples were centrifuged for 10 min at 12,000 × g at 4°C and a white pellet formed. The supernatant was discarded and the pellet re-suspended in 1 mL of 75% ethanol. The sample was briefly vortexed and then centrifuged for 5 min at 7,500 × g at 4°C. The supernatant was discarded and the RNA pellet air dried for 10 min. The pellet was re-suspended in 20 μL of RNase-free water and incubated in a 60°C water bath for 15 min. The RNA was then quantified using a BioTek Epoch Microplate Spectrophotometer.
Fusion Transcript Detection
EricScript framework (Version: 0.5.5b) is used for the discovery of fusion transcripts in paired end RNA-seq data (cite PMID: 23093608). Following steps are performed by the EricScript: (1) Mapping of the reads against the transcriptome (downloaded from the EricScript website, Homo sapiens, Ensembl version 73, Genome Build hg19/GRCh37), (2) Identification of discordant alignments and building of the exon junction reference, (3) Recalibration of the exon junction reference, (4) Scoring and filtering of the candidate gene fusions. The data is visualized using a custom webserver based on The JBrowse Genome Browser (PMID: 19570905). A simple perl script was written to filter through the list of candidate fusion transcripts and explored using the JBrowse Genome Browser (http://hegemon.ucsd.edu/RB-p1/fusion/fusion.php).The list was then manually filtered using BLAST on the NCBI website.
cDNA Synthesis
cDNA was prepared from 400 ng of RNA using the High Capacity cDNA Reverse Transcription kit (Applied Biosystems), according to manufacturer's protocol.
Quantitative RT-PCR
The expression of the specified target genes was quantified by q-RT-PCR, using Fast Start Universal SYBR Green Master with ROX (Roche), and was normalized to the expression of β-actin and normal samples. Custom designed primers were used.
Y79 Transfection
2 x 105 Y79 cells were seeded in a 24-well plate. Six wells each were transfected with pcDNA-DRAIC and pcDNA3-EmptyVector were transfected using Lipofectamine2000, according to manufacturer's protocol. The plasmids were a kind gift of Anindya Dutta (32).
Cell Culture
The Y79 cell line was grown in RPMI medium supplemented with 20% FBS and 1X penicillin-streptomycin and cultured at 37°C, 5% CO2 incubator as per ATCC guidelines.
Cell Growth Assay
Fifteen microlitre aliquot of cells are taken and diluted 1:1 with Trypan Blue solution. Cells are counted using a haemocytometer and the total number of cells are calculated.
Data Access
Complete RNA-seq data from tumors and control retinal tissue uploaded to the NCBI GEO datasets (GSE125903).
Author Contributions
KD extracted RNA from tumor and normal retinal samples. KD and SR generated cDNA and conducted RT-PCR from extracted RNA for second independent cohort analysis. RK tumor collection and data handling. PR and VK are the surgeons who obtained tumors. SK conducted the pathology on the tumor samples. KS, MB, and DS ran the bioinformatics analysis. LS conducted RNA fusion product analysis and MC conducted LNC-RNA correlation analysis. SE contributed to data handling and designed the experiments. WM designed experiments, helped with analysis, and wrote the manuscript.
Conflict of Interest Statement
KS was employed by the company MedGenome. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors would like acknowledge the support of a K22CA204352 (WM), a Research Associateship DNT (LS), a DBT-COE grant (BT/01/CEIB/11/V/16) (SE), a Scientific and Engineering Research Board (DS) (EMR/2015/000607) (SE), a SERB grant (SE), and with the support in part by grant P30 CA016058 from the National Cancer Institute.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2019.00221/full#supplementary-material
References
- 1.Friend SH, Bernards R, Rogelj S, Weinberg RA, Rapaport JM, Albert DM, et al. A human DNA segment with properties of the gene that predisposes to retinoblastoma and osteosarcoma. Nature. (1986) 323:643–6. 10.1038/323643a0 [DOI] [PubMed] [Google Scholar]
- 2.Hiebert SW, Chellappan SP, Horowitz JM, Nevins JR. The interaction of RB with E2F coincides with an inhibition of the transcriptional activity of E2F. Genes Dev. (1992) 6:177–85. 10.1101/gad.6.2.177 [DOI] [PubMed] [Google Scholar]
- 3.Almasan A, Yin Y, Kelly RE, Lee EY, Bradley A, Li W, et al. Deficiency of retinoblastoma protein leads to inappropriate S-phase entry, activation of E2F-responsive genes, and apoptosis. Proc Natl Acad Sci USA. (1995) 92:5436–40. 10.1073/pnas.92.12.5436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gonzalo S, Garcia-Cao M, Fraga MF, Schotta G, Peters AH, Cotter SE, et al. Role of the RB1 family in stabilizing histone methylation at constitutive heterochromatin. Nat Cell Biol. (2005) 7:420–8. 10.1038/ncb1235 [DOI] [PubMed] [Google Scholar]
- 5.Dyson N, Howley PM, Munger K, Harlow E. The human papilloma virus-16 E7 oncoprotein is able to bind to the retinoblastoma gene product. Science. (1989) 243:934–7. 10.1126/science.2537532 [DOI] [PubMed] [Google Scholar]
- 6.Cairns P, Mao L, Merlo A, Lee DJ, Schwab D, Eby Y, et al. Rates of p16 (MTS1) mutations in primary tumors with 9p loss. Science. (1994) 265:415–7. 10.1126/science.8023167 [DOI] [PubMed] [Google Scholar]
- 7.Connell-Crowley L, Harper JW, Goodrich DW. Cyclin D1/Cdk4 regulates retinoblastoma protein-mediated cell cycle arrest by site-specific phosphorylation. Mol Biol Cell. (1997) 8:287–301. 10.1091/mbc.8.2.287 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Harbour JW, Luo RX, Dei Santi A, Postigo AA, Dean DC. Cdk phosphorylation triggers sequential intramolecular interactions that progressively block Rb functions as cells move through G1. Cell. (1999) 98:859–69. 10.1016/S0092-8674(00)81519-6 [DOI] [PubMed] [Google Scholar]
- 9.Kamb A, Shattuck-Eidens D, Eeles R, Liu Q, Gruis NA, Ding W, et al. Analysis of the p16 gene (CDKN2) as a candidate for the chromosome 9p melanoma susceptibility locus. Nat Genet. (1994) 8:23–6. 10.1038/ng0994-22 [DOI] [PubMed] [Google Scholar]
- 10.Khatib ZA, Matsushime H, Valentine M, Shapiro DN, Sherr CJ, Look AT. Coamplification of the CDK4 gene with MDM2 and GLI in human sarcomas. Cancer Res. (1993) 53:5535–41. [PubMed] [Google Scholar]
- 11.Broaddus E, Topham A, Singh AD. Incidence of retinoblastoma in the USA: 1975-2004. Br J Ophthalmol. (2009) 93:21–3. 10.1136/bjo.2008.138750 [DOI] [PubMed] [Google Scholar]
- 12.Cobrinik D. Retinoblastoma progression. EBioMedicine. (2015) 2:623–4. 10.1016/j.ebiom.2015.07.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Pritchard EM, Stewart E, Zhu FY, Bradley C, Griffiths L, Yang L, et al. Pharmacokinetics and efficacy of the spleen tyrosine kinase inhibitor r406 after ocular delivery for retinoblastoma. Pharm Res. (2014) 31:3060–72. 10.1007/s11095-014-1399-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zhang J, Benavente CA, McEvoy J, Flores-Otero J, Ding L, Chen X, et al. A novel retinoblastoma therapy from genomic and epigenetic analyses. Nature. (2012) 481:329–34. 10.1038/nature10733 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Antczak C, Kloepping C, Radu C, Genski T, Muller-Kuhrt L, Siems K, et al. Revisiting old drugs as novel agents for retinoblastoma: in vitro and in vivo antitumor activity of cardenolides. Invest Ophthalmol Vis Sci. (2009) 50:3065–73. 10.1167/iovs.08-3158 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Assayag F, Nicolas A, Vacher S, Dehainault C, Bieche I, Meseure D, et al. Combination of carboplatin and bevacizumab is an efficient therapeutic approach in retinoblastoma patient-derived xenografts. Invest Ophthalmol Vis Sci. (2016) 57:4916–26. 10.1167/iovs.15-18725 [DOI] [PubMed] [Google Scholar]
- 17.Eng C, Li FP, Abramson DH, Ellsworth RM, Wong FL, Goldman MB, et al. Mortality from second tumors among long-term survivors of retinoblastoma. J Natl Cancer Inst. (1993) 85:1121–8. 10.1093/jnci/85.14.1121 [DOI] [PubMed] [Google Scholar]
- 18.Nahum MP, Gdal-On M, Kuten A, Herzl G, Horovitz Y, Weyl B, et al. Long-term follow-up of children with retinoblastoma. Pediatr Hematol Oncol. (2001) 18:173–9. 10.1080/08880010151114769 [DOI] [PubMed] [Google Scholar]
- 19.Seshadri R, Matthews C, Norris MD, Brian MJ. N-myc amplified in retinoblastoma cell line FMC-RB1. Cancer Genet Cytogenet. (1988) 33:25–7. 10.1016/0165-4608(88)90045-3 [DOI] [PubMed] [Google Scholar]
- 20.Lee WH, Murphree AL, Benedict WF. Expression and amplification of the N-myc gene in primary retinoblastoma. Nature. (1984) 309:458–60. 10.1038/309458a0 [DOI] [PubMed] [Google Scholar]
- 21.Laurie NA, Donovan SL, Shih CS, Zhang J, Mills N, Fuller C, et al. Inactivation of the p53 pathway in retinoblastoma. Nature. (2006) 444:61–6. 10.1038/nature05194 [DOI] [PubMed] [Google Scholar]
- 22.Dyson NJ. RB1: a prototype tumor suppressor and an enigma. Genes Dev. (2016) 30:1492–502. 10.1101/gad.282145.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ganguly A, Shields CL. Differential gene expression profile of retinoblastoma compared to normal retina. Mol Vis. (2010) 16:1292–303. [PMC free article] [PubMed] [Google Scholar]
- 24.Kapatai G, Brundler MA, Jenkinson H, Kearns P, Parulekar M, Peet AC, et al. Gene expression profiling identifies different sub-types of retinoblastoma. Br J Cancer. (2013) 109:512–25. 10.1038/bjc.2013.283 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Chakraborty S, Khare S, Dorairaj SK, Prabhakaran VC, Prakash DR, Kumar A. Identification of genes associated with tumorigenesis of retinoblastoma by microarray analysis. Genomics. (2007) 90:344–53. 10.1016/j.ygeno.2007.05.002 [DOI] [PubMed] [Google Scholar]
- 26.Mi H, Huang X, Muruganujan A, Tang H, Mills C, Kang D, et al. PANTHER version 11: expanded annotation data from gene ontology and reactome pathways, and data analysis tool enhancements. Nucleic Acids Res. (2017) 45:D183–9. 10.1093/nar/gkw1138 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mi H, Thomas P. PANTHER pathway: an ontology-based pathway database coupled with data analysis tools. Methods Mol Biol. (2009) 563:123–40. 10.1007/978-1-60761-175-2_7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Herschkowitz JI, He X, Fan C, Perou CM. The functional loss of the retinoblastoma tumour suppressor is a common event in basal-like and luminal B breast carcinomas. Breast Cancer Res. (2008) 10:R75. 10.1186/bcr2142 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Nicolay BN, Danielian PS, Kottakis F, Lapek JD, Jr, Sanidas I, Miles WO, et al. Proteomic analysis of pRb loss highlights a signature of decreased mitochondrial oxidative phosphorylation. Genes Dev. (2015) 29:1875–89. 10.1101/gad.264127.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zeller KI, Jegga AG, Aronow BJ, O'Donnell KA, Dang CV. An integrated database of genes responsive to the Myc oncogenic transcription factor: identification of direct genomic targets. Genome Biol. (2003) 4:R69. 10.1186/gb-2003-4-10-r69 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Terai G, Iwakiri J, Kameda T, Hamada M, Asai K. Comprehensive prediction of lncRNA-RNA interactions in human transcriptome. BMC Genomics. (2016) 17(Suppl. 1):12. 10.1186/s12864-015-2307-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Sakurai K, Reon BJ, Anaya J, Dutta A. The lncRNA DRAIC/PCAT29 Locus constitutes a tumor-suppressive nexus. Mol Cancer Res. (2015) 13:828–38. 10.1158/1541-7786.MCR-15-0016-T [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.McEvoy J, Flores-Otero J, Zhang J, Nemeth K, Brennan R, Bradley C, et al. Coexpression of normally incompatible developmental pathways in retinoblastoma genesis. Cancer Cell. (2011) 20:260–75. 10.1016/j.ccr.2011.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Danda R, Ganapathy K, Sathe G, Madugundu AK, Ramachandran S, Krishnan UM, et al. Proteomic profiling of retinoblastoma by high resolution mass spectrometry. Clin Proteomics. (2016) 13:29. 10.1186/s12014-016-9128-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Mitani K, Sato Y, Tojo A, Ishikawa F, Kobayashi Y, Miura Y, et al. Philadelphia chromosome positive B-cell type malignant lymphoma expressing an aberrant 190 kDa bcr-abl protein. Br J Haematol. (1990) 76:221–5. 10.1111/j.1365-2141.1990.tb07875.x [DOI] [PubMed] [Google Scholar]
- 36.Yang Z, Yu L, Wang Z. PCA3 and TMPRSS2-ERG gene fusions as diagnostic biomarkers for prostate cancer. Chin J Cancer Res. (2016) 28:65–71. 10.3978/j.issn.1000-9604.2016.01.05 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. (2012) 9:357–9. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. (2015) 12:357–60. 10.1038/nmeth.3317 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. (2012) 7:562–78. 10.1038/nprot.2012.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Anders S, McCarthy DJ, Chen Y, Okoniewski M, Smyth GK, Huber W, et al. Count-based differential expression analysis of RNA sequencing data using R and Bioconductor. Nat Protoc. (2013) 8:1765–86. 10.1038/nprot.2013.099 [DOI] [PubMed] [Google Scholar]
- 41.Soneson C, Love MI, Robinson MD. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res. (2015) 4:1521. 10.12688/f1000research.7563.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Love MI, Anders S, Kim V, Huber W. RNA-Seq workflow: gene-level exploratory analysis and differential expression. F1000Res. (2015) 4:1070. 10.12688/f1000research.7035.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. (2015) 43:e47. 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. (2009) 4:44–57. 10.1038/nprot.2008.211 [DOI] [PubMed] [Google Scholar]
- 45.Huang da W, Sherman BT, Zheng X, Yang J, Imamichi T, Stephens R, et al. Extracting biological meaning from large gene lists with DAVID. Curr Protoc Bioinformatics. (2009) 13:13 11. 10.1002/0471250953.bi1311s27 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.