Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Apr 16;15(4):e0225357. doi: 10.1371/journal.pone.0225357

The novel microRNAs hsa-miR-nov7 and hsa-miR-nov3 are over-expressed in locally advanced breast cancer

Deepak Poduval 1, Zuzana Sichmanova 1, Anne Hege Straume 1,¤, Per Eystein Lønning 1,2, Stian Knappskog 1,2,*
Editor: Bernard Mari3
PMCID: PMC7162276  PMID: 32298266

Abstract

miRNAs are an important class of small non-coding RNAs, which play a versatile role in gene regulation at the post-transcriptional level. Expression of miRNAs is often deregulated in human cancers. We analyzed small RNA massive parallel sequencing data from 50 locally advanced breast cancers aiming to identify novel breast cancer related miRNAs. We successfully predicted 10 novel miRNAs, out of which 2 (hsa-miR-nov3 and hsa-miR-nov7) were recurrent. Applying high sensitivity qPCR, we detected these two microRNAs in 206 and 214 out of 223 patients in the study from which the initial cohort of 50 samples were drawn. We found hsa-miR-nov3 and hsa-miR-nov7 both to be overexpressed in tumor versus normal breast tissue in a separate set of 13 patients (p = 0.009 and p = 0.016, respectively) from whom both tumor tissue and normal tissue were available. We observed hsa-miR-nov3 to be expressed at higher levels in ER-positive compared to ER-negative tumors (p = 0.037). Further stratifications revealed particularly low levels in the her2-like and basal-like cancers compared to other subtypes (p = 0.009 and 0.040, respectively). We predicted target genes for the 2 microRNAs and identified inversely correlated genes in mRNA expression array data available from 203 out of the 223 patients. Applying the KEGG and GO annotations to target genes revealed pathways essential to cell development, communication and homeostasis. Although a weak association between high expression levels of hsa-miR-nov7 and poor survival was observed, this did not reach statistical significance. hsa-miR-nov3 expression levels had no impact on patient survival.

Introduction

miRNAs are an important class of small non-coding RNAs, playing a versatile role in the gene regulation at the post–transcriptional level [15]. These molecules have proven to be involved in vital cellular functions, such as development, differentiation and metabolism [68]. In recent years there has been increased focus on the role of miRNAs in cancer [9], and the implementation of next generation sequencing (NGS) has led to the identification of multiple novel miRNAs as well as linked individual miRNA expression and combined signatures to tumor characteristics [10]. Currently there are 2656 distinct human miRNAs identified in the miRbase v22 [11], including more than 700 found to be deregulated in cancers [12].

Breast cancer is the most common malignancy in women. While outcome has improved significantly over the last three decades, resistance to therapy still presents a major challenge causing breast cancer related deaths [13]. As for chemoresistance in general, the underlying biological mechanisms remain poorly understood [14].

Merging evidence has indicated miRNA deregulation to play a role in breast cancer biology and outcome. Dysregulation of miRNAs may affect signal transduction pathways by targeting oncogenes and tumor suppressor genes [15], important to cancer development, progression, metastasis and potentially therapy response [16, 17]. Thus, while miR-10b, miR-125b, and miR-145 are generally downregulated, other miRNAs, like miR-21 and miR-155, are generally upregulated in breast cancer as compared to normal breast tissue [18]. Further, several miRNAs have revealed strong associations to clinical parameters [19, 20]: For example, differential expression of miR-210, miR-21, miR-106b*, miR-197, miR-let-7i, and miR-210, have been identified as a signature with prognostic value and also linked to invasiveness [21]. Moreover, miR-21 has been found linked to breast cancer metastasis and poor survival [22], while mir-29a overexpression has been shown to reduce the growth rate of breast cancer cells [23]. Given that many of the observed miRNA alterations are strongly cancer specific, this has inspired investigations into the potential use of miRNA as diagnostic biomarkers. Since miRNA are relatively stable molecules, they may be particularly attractive biomarkers to screen for in liquid biopsies (for original references, see [24])

miRNAs are also known to be differentially regulated across different subclasses of breast cancer. E.g. while members of the mir-181 family are up regulated in breast cancer in general, miR-181c in particular is activated by the expression of HER2 gene [25]. Also, miR-140 has been found suppressed by estrogen stimulation in ERα-positive breast cancer cells, most likely due to ER response elements in the flanking element of the miR-140 promoter [26].

In the present study, we analyzed global miRNA expression in 50 locally advanced breast cancers using NGS, aiming to identify novel, potentially breast cancer specific miRNAs. We identified and validated two novel miRNAs (one not previously described and one not previously reported in breast cancer), and subsequently evaluated their expression in an extended patient series (n = 223), by high sensitivity qPCR. Both were found over-expressed in breast cancer as compared to normal breast tissue. Considering different breast cancer subtypes, hsa-miR-nov3 was expressed at particular high levels in ER-positive tumors contrasting lower levels in basal-like and Her2-like tumors. No similar patterns were observed for hsa-miR-nov7.

Materials and methods

Patients

In the present work we have analyzed biopsy material from two breast cancer studies.

1) In the first study, incisional biopsies were collected before chemotherapy from 223 patients with locally advanced breast cancer in a prospective study designed to identify the response to epirubicin (n = 109) and paclitaxel (n = 114) monotherapy. Primary response to therapy as well as long-term follow up (>10 years or death) was recorded for all patients. This cohort has been described in detail previously [27].

2) In the second study, tumor breast tissue and normal breast tissue from tumor bearing and non-tumor bearing quadrants were collected from 46 anonymous breast cancer patients undergoing mastectomy, with the purpose of determining tissue estrogens. This cohort is described in detail in [28].

Using NGS, we analyzed miRNA expression in 50 patients from study 1). Next, candidate miRNAs were quantified using qPCR in all 223 patients from study 1), as well as 13 randomly selected patients from study 2), where RNA was available from tumor tissue and matching normal breast tissue (7 ER-positive and 6 ER-negative tumors). In addition, mRNA expression array data was available for 203 out of the 223 patients in study 1).

All patients provided written informed consent, and the studies conducted in accordance to national laws, regulation and ethical permissions (Norwegian health region West; REK Vest).

Tissue sampling and RNA extraction

Tissue samples were snap-frozen in liquid nitrogen in the operating theatre and stored in liquid nitrogen until further processing. Total RNA was extracted from the biopsies using miRvanaTM kit (ThermoFisher), according to the manufacturer’s instructions. RNA integrity and concentration were determined using Bioanalyzer 2000 and Nanodrop ND2000 spectrophotometer, respectively.

miRNA-sequencing

Sample preparation and single-end sequencing were performed at the core facility of the Norwegian Genomics Consortium in Oslo, on Illumina HiSeq 2500, 1x50bp. De-multiplexing was performed using the Illumina CASAVA software. FastQC was run on all samples with the main purpose to assess sequence quality. The raw data are available through the Gene Expression Omnibus (accession: GSE145151).

Novel miRNA prediction

The raw sequencing files (fastq) were processed using the novel miRNA prediction algorithm mirdeep v2.0.0.5 [10]. Potential novel miRNAs were identified using the human reference genome (hg19) and already identified miRNAs from humans and other hominids from miRbase 20 [29]. In the mirdeep2 algorithm, filtering parameters randfold P-value less than 0.05 and scores greater than or equal to 10 were applied. Precursor structures obtained after filtering were manually identified based on the presence of 1–2 mismatches in the stem region, a loop sequence of 4–8 nt, and the presence of mature sequence in the stem region (See S1 File.) [30].

Validation of predicted novel miRNAs

Validation of the predicted novel miRNAs was performed by qPCR-based amplification of the miRNAs, with subsequent cloning and capillary sequencing of the products, to pinpoint the exact size and sequence of the miRNAs (see sections below for details).

cDNA synthesis and qPCR

cDNA from miRNAs was prepared using Exiqon’s Universal cDNA synthesis kit II, with 20 ng of total RNA as input. qPCR was performed using Exiqon’s miRCURY LNA Universal RT microRNA PCR system, with custom Pick-&-Mix ready to use PCR plates with an inter-plate calibrator, on a LightCycler 480 instrument (Roche). Relative expression levels for each sample were calculated by dividing the expression of the gene of interest on the average expression of two reference miRNAs: miR-16-5p and miR-30b-5p.

miRNA cloning and capillary sequencing

End products from custom miRNA specific qPCR were cloned into pCR 2.1 TOPO-TA vector (Life Technologies) by TOPO-TA cloning according to the manufacturer’s instructions. The generated plasmids were amplified by transformation and cultivation of E. coli TOP10 cells (Life Technologies). The plasmids were then isolated using the Qiagen miniprep kit according to the manufacturer’s instructions.

Sequencing was performed using the BigDye v.1.3 system (Applied Biosystems) and the primers following thermocycling conditions as previously described [31]. Capillary electrophoresis and data collection were performed on an automated capillary sequencer (ABI3700).

Target prediction and pathway analysis

Target prediction was performed using the offline algorithm miRanda [32, 33] and the online algorithms miRDB [34] and TargetScanHuman Custom (Release 5.2) [35].

miRanda predicts gene targets based on position specific sequence complementarity between miRNA and mRNA using weighted dynamic programming, an extension of the Smith-Waterman algorithm [36]. Also, the miRanda algorithm uses the free energy estimation between duplex of miRNA: mRNA (Vienna algorithm [37]) as an additional filter.

The miRDB is an online database of animal miRNA targets, which uses SVM (Support Vector Machine) machine-learning algorithm trained with miRNA-target binding data from already known and validated miRNA-mRNA interactions [34, 38].

TargetScanHuman Custom predicts biological miRNA targets by searching for match for the seed region of the given miRNA that is present in the conserved 8-mer and 7-mer sites [35]. It also identifies sites with conserved 3' pairing from the mismatches in the seed region [39, 40].

An in-house pan-cancer panel of 283 tumor suppressor genes was used to filter target genes of interest. The panel was generated based on the tumor suppressors within the CGPv2/3-panels [41], Roche’s Comprehensive Cancer Design as well as a manual literature search (S1 Table).

Further, we used GATHER, a functional gene enrichment tool, which integrates various available biological databases to find functional molecular patterns, in order to find biological context from the target gene list [42]. With the help of GATHER, we did KEGG pathway [43], and GO (gene ontology) enrichment analyses for the common genes predicted by all three prediction algorithms. Further, validations were performed using DAVID [44] and topGO [45].

mRNA expression

In the interest of validating miRNA targets, we analyzed inverse correlations between miRNA expression and mRNA levels. mRNA expression levels were extracted from microarray analyses performed on a Human HT-12-v4 BeadChip (Illumina) after labeling (Ambion; Aros Applied Biotechnology). Illumina BeadArray Reader (Illumina) and the Bead Scan Software (Illumina) were used to scan BeadChips. Expression signals from the beads were normalized and further processed as previously described [46]. We re-annotated the data set using illuminaHumanv4.db from AnnotationDbi package, built under Bioconductor 3.3 in R [47], to select only probes with “Perfect” annotation[48]. The probes represented 21043 identified and unique genes (13340 represented by single probe and 7703 represented by multiple probes). In the cases of multiple probes targeting the same gene, we calculated fold difference for these probes. This was done to avoid losing potentially relevant biological information if expression of one probe was significantly higher that expression of another. However, for no genes did we find a fold difference >2 fold. Therefore, the mean expression for each such gene, was calculated based on the values form each probe, weighted according to the number of beads per probe.

Statistics

Expression levels of miRNAs in tumor versus normal tissue were compared by Wilcoxon rank tests for paired samples. Inverse correlations between miRNA expression and mRNA expression were assessed by Spearman tests. The potential impact of the novel miRNAs on long-term outcome (relapse-free survival and disease-specific survival) in breast cancer patients was calculated by Log-rank tests and illustrated by Kaplan-Meier curves, using the SPSS software v.19. All p-values are reported as two-sided.

Results

Novel miRNA prediction

In order to identify novel miRNAs, 50 patients with locally advanced breast cancer (from study 1, see materials and methods) were subject to global miRNA-sequencing using massive parallel sequencing. On average, the dataset resulted in 3 million reads per sample. Using the miRNA identifier module in miRDeep2, we detected 10 novel miRNAs (Table 1). Eight out of these 10 miRNAs were detected in a single sample only, while two were expressed in two or more patients and therefore regarded as the most reliable predictions. These two miRNAs, here temporarily named hsa-miR-nov3 and hsa-miR-nov7, were found in tumor samples from 2 and from 6 patients, respectively. For both of these novel miRNAs, we identified precursor structures with not more than one or two mismatches in the stem region, as well as the presence of mature miRNA sequences (Fig 1; S1 Fig). Therefore, we selected these two miRNAs for further analyses. Cross-checking the miRCarta database [49], no hits were found for either of the two, but notably, while this work was conducted, hsa-miR-nov7 was identified by another team in lymphomas, and reported as miR-10393-3p [50].

Table 1. Novel miRNA sequences as predicted by mirdeep v2.0.0.5 from massive parallel sequencing of total miRNA in 50 locally advanced breast cancers.

miRNA Co-ordinate Mature sequence Strand Number of samples
hsa-miR-nov2 chr2:36662749..36662809 AAAAACTGCGATTACTTTTGCA - 1
hsa-miR-nov3 chr3:186505088..186505149 AAAGCAGGATTCAGACTACAATAT + 2
hsa-miR-nov3_2 chr3:132393169..132393224 CAAAAACTGCAATTACTTTTGC + 1
hsa-miR-nov4 chr4:155140075..155140134 AAAAGTAATCGCTGTTTTTG + 1
hsa-miR-nov7 chr7:138728845..138728903 AATTACAGATTGTCTCAGAGA - 6
hsa-miR-nov8 chr8:116546693..116546762 TTAGAGCTTCAACCTCCAGTGTGA - 1
hsa-miR-nov10 chr10:31840034..31840078 CGCGGGTGCTTACTGACCCT + 1
hsa-miR-nov10_2 chr10:72163928..72163994 GCGGCGGCGGCGGCGGCG + 1
hsa-miR-nov17 chr17:36760852..36760906 CCCAGCCCCACGCGTCCCCATG - 1
hsa-miR-nov20 chr20:26189318..26189366 TGGCCGAGCGCGGCTCGTCGCC - 1

Fig 1. Predicted novel miRNAs.

Fig 1

Depiction of novel miRNAs (A) hsa-miR-nov3 and (B) hsa-miR-nov7, identified by miRDeep2, showing (i) predicted mature and star sequences, exp, probabilistic model expected from Drosha/Dicer processing and obs, observed sequences from sequencing data (ii) density plot for read counts for mature and star sequences as well as (iii) miRNA secondary structure.

In-vitro validation of novel micro RNAs

Next, we aimed to validate our in-silico predictions and confirm that the sequences from which we identified hsa-miR-nov3 and hsa-miR-nov7 represented bona-fide novel miRNAs expressed in the patients. Utilizing total RNA from the patients found to express the two predicted novel miRNAs, we performed global poly-adenylation and cDNA synthesis followed by miRNA-specific qPCR amplification. For both miRNAs we observed positive qPCR reactions. Further, end products of the qPCRs were then ligated into carrier-plasmids and sequenced. We confirmed the resulting plasmids to contain the predicted miRNA sequences. Further, in both cases, the sequences were flanked by a poly A tail, confirming that the original molecules used as input in the poly-adenylation were present as short 22nt RNAs (Fig 2). Thus, we confirmed the presence of miRNAs with the exact sequence as predicted from the NGS-based data.

Fig 2. miRNA sequences.

Fig 2

Chromatogram of capillary-sequenced qPCR products after hsa-miR-nov3 (A) and hsa-miR-nov7 (B) amplification. Highlighted background indicates the 22nt miRNA-sequence region (reverse complementary), followed by the Adenine homopolymer indicating in vitro adenylation at the expected site, confirming the exact size and sequence of the predicted miRNAs.

Overexpression of hsa-miR-nov7 and hsa-miR-nov3 in breast cancer

Given that the sensitivity for the novel miRNAs was better in the qPCR than in the miRNA massive parallel sequencing (MPS) analysis, we aimed to assess whether the miRNAs were expressed in a limited number of breast cancer patients only (as indicated by their detection in 2 and 6 out of 50 patients in the MPS analysis), or if they were detectable in a higher fraction of patients, when applying a more sensitive detection method. We therefore performed qPCR to quantify the expression levels of hsa-miR-nov7 and hsa-miR-nov3 in tumor tissue across the entire cohort of patients from study 1 (n = 223). With this method, we detected hsa-miR-nov7 and hsa-miR-nov3 in 206 and 214 samples out of total 223 samples respectively, albeit at variable levels (Fig 3).

Fig 3. Expression of novel miRNAs in breast cancer tissue.

Fig 3

Bars indicate the relative expression of hsa-miR-nov3 (A) and hsa-miR-nov7 (B) in 223 breast cancer patients.

Interestingly, while no difference in the expression levels of hsa-miR-nov7 was observed between breast cancer subgroups, we found a significant difference in the expression levels of has-miR-nov3 related to estrogen receptor status. Thus, the expression levels of has-miR-nov3 were higher in ER-positive as compared to ER-negative tumors (p = 0.037; Fig 4A). Further, assessing the expression levels of the two miRNAs in mRNA-based subclasses of breast cancer according to the Perou classification [51], comparing all five classes, we observed a significant difference between the subtypes with respect to miR-nov3 expression (p = 0.041; Kruskal-Wallis test; Fig 4C). We found hsa-miR-nov3 levels to be lower in HER2 like (p = 0.009; Mann-Whitney test) and basal-like (p = 0.04; Mann-Whitney) tumors as compared to tumors of the other classes.

Fig 4. Expression of novel miRNAs in breast cancer tissue.

Fig 4

Expression levels stratified by ER-status (A, B) and by expression subtypes (C, D).

Following the finding that the two miRNAs were detectable in more than 90 percent of patients, in order to assess whether the expression of these miRNAs were tumor specific we compared the levels of hsa-miR-nov7 and hsa-miR-nov3 expression in breast cancer tissue versus normal breast tissue. For this purpose, we randomly selected 13 patients from a study where samples of breast tumor tissue and matching normal tissue from a non-tumor bearing quadrant of the same breast were available (study 2, see materials and methods) [28]. We detected expression of the novel miRNAs in both tumor- and normal tissue samples for all 13 patients. Notably, we found hsa-miR-nov3 expression to be elevated in tumor compared to normal tissue in 10 out of the 13 patients (p = 0.009; Wilcoxon test; Fig 5A). Similar findings were observed for hsa-miR-nov7 with elevated expression in 10 out of 13 tumors (Wilcoxon: p = 0.016; Fig 5B). The level of overexpression (i.e. the ratio of expression levels in tumor versus normal tissue) for the two miRNAs did not correlate to each other (p>0.2; Spearman).

Fig 5. Expression of novel miRNAs in breast cancer tissue.

Fig 5

Bars indicate the ratio of expression in tumour tissue vs. matched normal breast tissue in 13 breast cancer patients, for hsa-miR-nov3 (A) and hsa-miR-nov7 (B).

Notably, overexpression of hsa-miR-nov7 in tumor versus normal tissue was observed predominantly in ER-positive tumors (overexpression in 7 out of 7 ER-positive tumors, contrasting 3 out of 6 ER-negative tumors; p = 0.070; Fischer exact test).

hsa-miR-nov7 and hsa-miR-nov3 target prediction

Based on our finding of both novel miRNAs to be overexpressed in breast cancer, we next aimed to elucidate the functional roles for hsa-miR-nov7 and hsa-miR-nov3 by identifying potential targets. We performed in silico target predictions using three different algorithms–miRanda, miRDB and TargetScan Human Custom. miRanda, which predicts possible targets from human transcripts in general, predicted 9200 and 12315 target genes for hsa-miR-nov7 and hsa-miR-nov3, respectively. miRDB, which contains curated and possible miRNA targets, predicted 570 and 530 target genes each for hsa-miR-nov7 and hsa-miR-nov3, respectively, while TargetScanHuman custom predicted 633 target genes for hsa-miR-nov7, and 282 target genes for hsa-miR-nov3. For increased stringency in our predictions, we restricted the potential targets to the ones called by all three algorithms (Fig 6). This left a total of 97 and 180 potential targets for hsa-miR-nov3 and hsa-mir-nov7, respectively.

Fig 6. Target genes predicted.

Fig 6

Venn-diagrams illustrating the number of target genes predicted by TargetScan, mirDB and Miranda for the two novel miRNAs hsa-mir-nov3 (A) and hsa-mir-nov7 (B).

The two lists of 97 and 180 predicted gene targets were then used for KEGG pathway analysis and GO enrichment analysis using GATHER. The top 10 KEGG pathways and GO terms for each microRNAs are listed in Table 2. The KEGG and GO annotations for hsa-miR-nov3 showed pathways that are important in cell development, communication and cytoskeletal organization. Similar analysis for hsa-miR-nov7 unveiled pathways playing a vital role in cell functions such as communication and homeostasis. These findings were largely validated by performing the same analyses applying alternative tools (DAVID and topGO; S2 Table).

Table 2. Top 10 (arbitrary cut-off) GO and KEGG annotation.

  • A) GO annotation—hsa-miR-nov3

# Annotation ln(Bayes factor)a neg ln(p value)b FE: neg ln(p value)c FE: neg ln(FDR)d
1 GO:0009653 [3]: morphogenesis 94.88 7.98 100.5 92.91
2 GO:0007275 [2]: development 87.32 7.58 92.89 85.99
3 GO:0007154 [3]: cell communication 85.41 7.46 90.99 84.5
4 GO:0009887 [4]: organogenesis 74.65 6.99 80.24 74.26
5 GO:0048513 [3]: organ development 74.65 6.99 80.24 74.26
6 GO:0007165 [4]: signal transduction 74.2 6.97 79.77 73.97
7 GO:0007242 [5]: intracellular signaling cascade 66.52 6.59 72.18 66.53
8 GO:0007010 [6]: cytoskeleton organization and biogenesis 55.54 6.04 61.15 55.63
9 GO:0009790 [3]: embryonic development 48.63 5.7 54.28 48.88
10 GO:0006928 [4]: cell motility 47.82 5.65 53.49 48.2
  • B) KEGG annotation—hsa-miR-nov3

# Annotation Total Genes With Ann ln(Bayes factor)a neg ln(p value)b FE: neg ln(p value)c FE: neg ln(FDR)d
1 path:hsa04810: Regulation of actin cytoskeleton 35 9.03 4.07 13.96 9.57
2 path:hsa04010: MAPK signaling pathway 36 6.93 3.75 11.8 8.1
3 path:hsa04510: Focal adhesion 32 4.15 3.26 8.94 5.94
4 path:hsa04110: Cell cycle 18 4.1 3.25 9.08 5.95
5 path:hsa04060: Cytokine-cytokine receptor interaction 33 3.23 3.07 7.97 5.24
6 path:hsa04620: Toll-like receptor signaling pathway 17 2.97 3.01 7.92 5.24
7 path:hsa04210: Apoptosis 16 2.13 2.82 7.06 4.55
8 path:hsa04512: ECM-receptor interaction 14 1.17 2.55 6.09 3.72
9 path:hsa04630: Jak-STAT signaling pathway 21 1.01 2.51 5.77 3.52
10 path:hsa05050: Dentatorubropallidoluysian atrophy (DRPLA) 5 0.7 2.41 5.9 3.59
  • C) GO annotation—hsa-miR-nov7

# Annotation ln(Bayes factor)a neg ln(p value)b FE: neg ln(p value)c FE: neg ln(FDR)d
1 GO:0007154 [3]: cell communication 60.17 6.3 65.79 58.14
2 GO:0007275 [2]: development 54.83 6 60.38 53.43
3 GO:0007165 [4]: signal transduction 50.84 5.81 56.44 49.89
4 GO:0009653 [3]: morphogenesis 48.96 5.72 54.56 48.3
5 GO:0050794 [3]: regulation of cellular process 41.31 5.3 46.94 40.9
6 GO:0009987 [2]: cellular process 40.56 5.26 46.33 40.48
7 GO:0009887 [4]: organogenesis 40.37 5.25 45.94 40.24
8 GO:0048513 [3]: organ development 39.98 5.23 45.54 40.09
9 GO:0007242 [5]: intracellular signaling cascade 39.87 5.22 45.58 40.09
10 GO:0050789 [2]: regulation of biological process 39.18 5.18 44.62 39.27
  • D) KEGG annotation—hsa-miR-nov7

# Annotation Total Genes With Ann ln(Bayes factor)a neg ln(p value)b FE: neg ln(p value)c FE: neg ln(FDR)d
1 path:hsa04630: Jak-STAT signaling pathway 27 5.53 3.5 10.48 6.6
2 path:hsa04350: TGF-beta signaling pathway 18 5.22 3.45 10.3 6.6
3 path:hsa04010: MAPK signaling pathway 33 3.15 3.04 7.94 4.57
4 path:hsa04210: Apoptosis 17 2.51 2.91 7.49 4.27
5 path:hsa04620: Toll-like receptor signaling pathway 17 2.28 2.85 7.25 4.24
6 path:hsa04020: Calcium signaling pathway 4 2.23 2.84 0 0
7 path:hsa00471: D-Glutamine and D-glutamate metabolism 3 1.12 2.54 6.48 3.7
8 path:hsa04510: Focal adhesion 29 0.96 2.49 5.64 3.23
9 path:hsa05030: Amyotrophic lateral sclerosis (ALS) 5 -0.17 0 5.04 2.78
10 path:hsa04512: ECM-receptor interaction 13 -0.28 0 4.61 2.39

a Measure of the strength of annotation

b p-value for the Bayes factor estimate

c p-value for Fishcer’s exact test

d FDR for Fishcer’s exact test

Thus, both these miRNAs implied cell functions that are vital to cancer development and progression.

In order to further substantiate these in-silico predictions, we performed a complete Spearman correlation analysis between the expression levels of hsa-miR-nov7 and hsa-miR-nov3 and mRNA expression array data available for 203 out of the 233 patients in study 1. Assuming the miRNAs, in general, to execute their function by suppressing gene expression (mRNA degradation), we restricted the analysis to genes which were negatively correlated to expression of the miRNAs. The top ranking negatively correlated genes are listed in Table 3. Notably, the only genes with Rho-values < -0.2 were RMND5A for hsa-miR-nov3 and GLUD1 and SASH1 for hsa-miR-nov7. Given that the two novel miRNAs were overexpressed in breast cancer tissue, we went on to restrict the correlation analysis to an in-house list of 283-tumor suppressor genes previously described. Among these tumor suppressors, we found 115 to be negatively correlated to hsa-miR-nov7 and 119 to hsa-miR-nov3 (S3 Table). Assessing the intersection between these negatively correlated tumor suppressor genes and the predicted targets, we obtained a list of one gene for hsa-miR-nov3 (ATRX) and three genes for hsa-miR-nov7 (APC, SFRP2 and CDH11), but the correlations were non-significant in all 4 cases (Table 4, Fig 7).

Table 3. Spearman correlation table for hsa-miR-nov3 and hsa-miR-nov7 and their top 25 target genes (arbitrary cut-off for inclusion in the table; ranked by inverse correlation).

  • A) hsa-miR-nov3

Gene Symbol Estimate P.value Expression (mean)
RMND5A -0.2018 0.0038 14.0750
YES1 -0.1649 0.0184 17.0218
PALM2-AKAP2 -0.1455 0.0378 13.0997
SLC7A1 -0.1224 0.0811 16.8650
RAPGEF5 -0.1208 0.0853 14.9652
CTDSPL2 -0.1196 0.0885 15.4945
SLC4A5 -0.1077 0.1251 15.0101
HIPK1 -0.1046 0.1366 13.3737
ABHD12 -0.0998 0.1555 16.2313
FMNL2 -0.0982 0.1624 16.0939
POU4F1 -0.0933 0.1844 13.4684
RPS6KA3 -0.0905 0.1981 14.6430
LARP1 -0.0890 0.2054 15.0210
WIPI2 -0.0702 0.3184 14.7316
MTCH1 -0.0575 0.4139 18.6604
DIAPH1 -0.0528 0.4530 16.7109
MARCKS -0.0481 0.4946 18.6286
LUZP1 -0.0453 0.5200 17.1097
DNAJC8 -0.0449 0.5238 18.2152
CLOCK -0.0436 0.5354 15.7894
SLAMF6 -0.0415 0.5557 15.4277
CDAN1 -0.0405 0.5655 16.6394
PCDH11X -0.0359 0.6104 13.4661
RYBP -0.0346 0.6234 16.9184
FGF1 -0.0344 0.6249 13.9423
  • B) hsa-miR-nov7

Gene Symbol Estimate P.value Expression (Mean)
GLUD1 -0.2274 0.0011 18.0399
SASH1 -0.2095 0.0026 16.9164
MARK1 -0.1883 0.0070 15.0356
ARID5B -0.1877 0.0072 17.7569
ELOVL5 -0.1854 0.0079 17.5656
PUM1 -0.1707 0.0147 17.8295
PNRC2 -0.1599 0.0224 15.4674
UNC13B -0.1583 0.0238 15.5633
FLRT2 -0.1581 0.0239 15.7323
ZFHX4 -0.1482 0.0344 14.7383
CHIC1 -0.1479 0.0348 13.5807
MAN1A1 -0.1457 0.0375 15.4956
CPEB2 -0.1387 0.0478 14.6995
PDE4D -0.1377 0.0495 13.9823
TMED7 -0.1366 0.0514 17.1083
NDFIP1 -0.1280 0.0680 16.1458
CSMD1 -0.1269 0.0704 13.8158
MITF -0.1187 0.0908 14.0482
ITSN1 -0.1185 0.0915 14.8011
CTDSPL2 -0.1178 0.0932 15.4945
ATAD2B -0.1178 0.0932 14.9892
SFRP2 -0.1129 0.1080 18.4511
DPP10 -0.1119 0.1110 13.4306
BMPR2 -0.1107 0.1149 17.1664
EIF5A2 -0.1100 0.1174 14.5450

Table 4. List of intersection between correlated tumour suppressor genes and the predicted targets of hsa-miR-nov3 and hsa-miR-nov7.

hsa-miR-nov3 hsa-miR-nov7
ATRX APC
CDH11
SFRP2

Fig 7. Correlations to tumor suppressor genes.

Fig 7

Scatter plots showing correlation of target tumor suppressors with A) hsa-miR-nov3 and B) hsa-miR-nov7.

In order to get a broader overview of potential biological function, we selected the 100 gene transcripts with the strongest positive and the top 100 gene transcripts with the strongest negative correlation to the two miRNAs (independent of previous target-predictions) and performed gene ontology analyses. We detected no cancer related pathways or cellular functions to be significantly associated with hsa-miR-nov7 (S4 Table). However, for hsa-miR-nov3, KEGG analysis of the negatively correlated genes revealed associations to Hepatorcellular carcinoma as well as several pathways related to drug metabolism (S5 Table). Notably, when seeking to validate these findings by application of alternative tools (DAVID and topGO) the latter was not validated. (S6 and S7 Tables).

Expression of hsa-miR-nov7 and hsa-miR-nov3 and clinical outcome in breast cancer

Since both hsa-miR-nov7 and hsa-miR-nov3 were overexpressed in the tumor tissue of breast cancer patients, we assessed whether any of the two novel miRNAs were associated to clinical outcomes in study 1 (223 breast cancer patients). Given that these patients were enrolled in a prospective study specifically designed to assess response to primary chemotherapy administered as epirubicin or paclitaxel monotherapy in a neoadjuvant setting [27, 52], we assessed the association of hsa-miR-nov7 and hsa-miR-nov3 levels with primary therapy response and with long term survival (10-years).

We found no association between any of the two novel miRNAs and primary response to either epirubicin or paclitaxel (S8 Table). Regarding survival, we observed a weak association between high levels of hsa-miR-nov7 and poor survival in the paclitaxel treated arm of the study, with the strongest associations observed for relapse free survival, however, none of these associations reached statistical significance (Fig 8). No effect was observed in the epirubicin treated arm. Further, for hsa-miR-nov3, no significant correlation to outcome was recorded.

Fig 8. miRNAs and breast cancer survival.

Fig 8

Kaplan-Meier curves showing (i) disease-specific and (ii) relapse-free survival of locally advanced breast cancer patients treated with epirubicin or paclitaxel monotherapy in the neoadjuvant setting (study 1), with respect to expression levels of (A) hsa-miR-nov3 and (B) hsa-miR-nov7 on all samples.

Given the skewed expression levels between breast cancer subtypes for hsa-miR-nov3, we performed survival analyses stratified for ER-status and subtypes. These analyses revealed no significant associations to survival (Log rank test p-values ranging from 0.09 to 0.98).

Discussion

We investigated whether we could detect novel, previously undescribed miRNAs and, if so, address their potential association to other defined biological parameters and to outcome in a cohort of locally advanced breast cancer. We successfully predicted 10 new miRNAs, out of which 2 were deemed reliable because of their detected presence in more than one patient. Although these two novel miRNAs (preliminary termed hsa-miR-nov7 and hsa-miR-nov3) were only predicted from 8 samples among the 50 initially sequenced biopsies, we found them to be expressed in all patients by highly sensitive qPCR at varying levels. In addition to our in vitro validations, the qPCR detection validated the initial NGS based analysis, detecting these two miRNAs.

Since expression of the two miRNAs was confirmed in breast tumor tissue from the majority of patients analyzed, we went on to assess the relative expression levels in tumor versus matched normal breast tissue, collected from a non-tumor bearing quadrant. Our finding that both novel miRNAs had higher expression levels in tumor than in normal tissue indicates a potential functional role in breast cancer. However, although being overexpressed, the biological role of these two miRNAs in cancer should be interpreted with caution. The expression levels are very low, and it is therefore uncertain whether they will have a major impact on cellular functions. Notably, given our approach and identification of the two miRNAs with low expression level, this indicates that there may currently be a limited potential for new discoveries of miRNAs high expression levels and strong functional roles in breast cancer. However, when assessing the potential functional roles of these microRNAs by in silico prediction of targets followed by validation using correlation to mRNA-array data, the KEGG and GO annotations for these targets revealed cellular functions of potential importance in development and progression of cancer. As such, our present findings may warrant further investigations into the functions of the two miRNAs. Notably, regarding hsa-miR-nov3, it was of particular interest that this miRNA was significantly higher expressed in ER-positive as compared to ER-negative breast cancers. Accordingly, we found relatively high expression levels of hsa-miR-nov3 in tumors of the luminal and normal-like subtypes, contrasting low expression levels in basal-like and her2-like tumors [53, 54]. This finding may indicate a potential role for hsa-miR-nov3 restricted to ER-positive tumors.

Regarding potential specific targets, we narrowed these down by first assessing the intersect of three different target prediction algorithms, and then the intersect of this result with a predefined list of tumor suppressors. Although none of the remaining genes after this filtering had a statistically significant inverse correlation with the miRNAs, we identified some potentially interesting connections: For hsa-miR-nov3, we propose ATRX as a target. This is a gene in the SWI/SNF family, involved in chromatin remodelling, and it has previously been found subject to loss of heterozygosity (LOH) in breast cancer [55]. Importantly, we recently reported mutations in the SWI/SNF family genes to be enriched in relapsed breast cancer as compared to primary cancers [56]. Thus, this supports the hypothesis of a breast cancer promoting function for hsa-miR-nov3. For hsa-miR-nov7, we propose APC, SFRP2, and CDH11 as potential targets. Interestingly, the two former are involved in regulation of the Wnt-signalling pathway [5759] and both have previously been reported as targets for several miRNAs in breast cancer [6062]. Taken together, this may imply a role for hsa-miR-nov7 in Wnt signaling. Notably, during our work with the present project, hsa-miR-nov7, was identified by Lim and colleagues and coined miR-10393-3p [50]. They found this miRNA to target genes involved in chromatin modifications associated with pathogenesis of Diffuse large B-cell lymphoma (DLBCL). While this differs from our present finding, it may likely be explained by tissue specific effects of the miRNA.

Regarding any predictive or prognostic role for the two investigated miRNAs, we found no significant impact on survival. While we recorded a non-significant trend towards an association between miRnov7 expression and overall survival in the paclitaxel arm, further studies on larger patient cohorts are warranted to clarify this issue. Alternatively, the miRNAs could play a role in tumorigenesis but not later tumor progression. As such, the observed overexpression in tumor tissue compared to normal breast tissue may be a remaining signal from tumorigenesis.

Whether cancer related overexpression of the two miRNAs described here is merely consequences of other molecular mechanisms in cancer cells or whether the two miRNAs may be involved in tumorigenesis, but not subsequent cancer progression, remains unknown.

Supporting information

S1 Fig. Predicted novel miRNAs.

Table on the upper left shows miRDeep2 scores and read counts. RNA secondary structure for miRNA on the top right. Color code for depiction as follows mature sequence in red, loop sequence in yellow and purple for star sequences. Density plot in the middle shows distribution of reads in precursor reads predicted. Dotted lines illustrate alignment and mm, number of mismatches. Exp, is potential precursor model predicted by algorithm with taking accounts of stability based on free energy, position and read frequencies according to Dicer/Drosha processing of miRNA. Obs, is position and reads found from deep sequencing data. (A) hsa-miR-nov3 and (B) hsa-miR-nov7.

(DOCX)

S1 Table. In-house pan-cancer panel of 283 tumor suppressor genes.

Panel generated based on CGPv2/3-panels [41], Roche’s Comprehensive Cancer Design along with manual literature search, to filter target genes of interest.

(DOCX)

S2 Table. Predicted miRNA-targets by DAVID and topGO.

(XLSX)

S3 Table. Correlation miRNAs and tumour suppressor genes.

Spearman correlation table for hsa-miR-nov3 (A) and hsa-miR-nov7 (B) inversely correlated tumor suppressor genes.

(DOCX)

S4 Table. Correlations mir7 and gene ontology.

(XLS)

S5 Table. Correlations mir3 and gene ontology.

(XLS)

S6 Table. Negatively correlated genes (validation analyses).

(XLSX)

S7 Table. Positively correlated genes (validation analyses).

(XLSX)

S8 Table. Statistics mir3 and mir7 versus response to treatment.

(XLSX)

S1 File. Supporting information mirDeep.

(DOCX)

Acknowledgments

We thank Beryl Leirvaag and Gjertrud T. Iversen for technical assistance.

Data Availability

All relevant data are within the manuscript and its Supporting Information files and the raw data are available through the Gene Expression Omnibus (accession: GSE145151).

Funding Statement

This work was performed in the Mohn Cancer Research Laboratory. The project was funded by grants from the Trond Mohn Research Foundation, The Norwegian Cancer Society, The Norwegian Research Council and the Norwegian Health Region West.

References

  • 1.Beezhold KJ, Castranova V, Chen F. Microprocessor of microRNAs: regulation and potential for therapeutic intervention. Mol Cancer. 2010;9:134 Epub 2010/06/03. 10.1186/1476-4598-9-134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Lee RC, Feinbaum RL, Ambros V. The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993;75(5):843–54. Epub 1993/12/03. 10.1016/0092-8674(93)90529-y . [DOI] [PubMed] [Google Scholar]
  • 3.Lee Y, Jeon K, Lee JT, Kim S, Kim VN. MicroRNA maturation: stepwise processing and subcellular localization. The EMBO journal. 2002;21(17):4663–70. Epub 2002/08/29. 10.1093/emboj/cdf476 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Reinhart BJ, Slack FJ, Basson M, Pasquinelli AE, Bettinger JC, Rougvie AE, et al. The 21-nucleotide let-7 RNA regulates developmental timing in Caenorhabditis elegans. Nature. 2000;403(6772):901–6. Epub 2000/03/08. 10.1038/35002607 . [DOI] [PubMed] [Google Scholar]
  • 5.Lagos-Quintana M, Rauhut R, Lendeckel W, Tuschl T. Identification of novel genes coding for small expressed RNAs. Science (New York, NY). 2001;294(5543):853–8. Epub 2001/10/27. 10.1126/science.1064921 . [DOI] [PubMed] [Google Scholar]
  • 6.Ambros V, Bartel B, Bartel DP, Burge CB, Carrington JC, Chen X, et al. A uniform system for microRNA annotation. RNA (New York, NY). 2003;9(3):277–9. Epub 2003/02/20. 10.1261/rna.2183803 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ambros V, Lee RC, Lavanway A, Williams PT, Jewell D. MicroRNAs and other tiny endogenous RNAs in C. elegans. Current biology: CB. 2003;13(10):807–18. Epub 2003/05/16. 10.1016/s0960-9822(03)00287-2 . [DOI] [PubMed] [Google Scholar]
  • 8.Aravin AA, Lagos-Quintana M, Yalcin A, Zavolan M, Marks D, Snyder B, et al. The small RNA profile during Drosophila melanogaster development. Developmental cell. 2003;5(2):337–50. Epub 2003/08/16. 10.1016/s1534-5807(03)00228-4 . [DOI] [PubMed] [Google Scholar]
  • 9.Mishra S, Yadav T, Rani V. Exploring miRNA based approaches in cancer diagnostics and therapeutics. Critical reviews in oncology/hematology. 2016;98:12–23. Epub 2015/10/21. 10.1016/j.critrevonc.2015.10.003 . [DOI] [PubMed] [Google Scholar]
  • 10.Friedlander MR, Chen W, Adamidi C, Maaskola J, Einspanier R, Knespel S, et al. Discovering microRNAs from deep sequencing data using miRDeep. Nat Biotechnol. 2008;26(4):407–15. Epub 2008/04/09. 10.1038/nbt1394 . [DOI] [PubMed] [Google Scholar]
  • 11.Kozomara A, Griffiths-Jones S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 2014;42(Database issue):D68–73. Epub 2013/11/28. 10.1093/nar/gkt1181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wang Y, Yu Y, Tsuyada A, Ren X, Wu X, Stubblefield K, et al. Transforming growth factor-beta regulates the sphere-initiating stem cell-like feature in breast cancer through miRNA-181 and ATM. Oncogene. 2011;30(12):1470–80. Epub 2010/11/26. 10.1038/onc.2010.531 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lonning PE, Knappskog S, Staalesen V, Chrisanthar R, Lillehaug JR. Breast cancer prognostication and prediction in the postgenomic era. Ann Oncol. 2007;18(8):1293–306. Epub 2007/02/24. mdm013 [pii] 10.1093/annonc/mdm013 . [DOI] [PubMed] [Google Scholar]
  • 14.Lonning PE, Knappskog S. Mapping genetic alterations causing chemoresistance in cancer: identifying the roads by tracking the drivers. Oncogene. 2013;32:5315–30. 10.1038/onc.2013.48 . [DOI] [PubMed] [Google Scholar]
  • 15.Gotte M, Mohr C, Koo CY, Stock C, Vaske AK, Viola M, et al. miR-145-dependent targeting of junctional adhesion molecule A and modulation of fascin expression are associated with reduced breast cancer cell motility and invasiveness. Oncogene. 2010;29(50):6569–80. Epub 2010/09/08. 10.1038/onc.2010.386 . [DOI] [PubMed] [Google Scholar]
  • 16.Rask L, Balslev E, Jorgensen S, Eriksen J, Flyger H, Moller S, et al. High expression of miR-21 in tumor stroma correlates with increased cancer cell proliferation in human breast cancer. APMIS: acta pathologica, microbiologica, et immunologica Scandinavica. 2011;119(10):663–73. Epub 2011/09/16. 10.1111/j.1600-0463.2011.02782.x . [DOI] [PubMed] [Google Scholar]
  • 17.Harquail J, Benzina S, Robichaud GA. MicroRNAs and breast cancer malignancy: an overview of miRNA-regulated cancer processes leading to metastasis. Cancer biomarkers: section A of Disease markers. 2012;11(6):269–80. Epub 2012/12/19. 10.3233/cbm-120291 . [DOI] [PubMed] [Google Scholar]
  • 18.Iorio MV, Ferracin M, Liu CG, Veronese A, Spizzo R, Sabbioni S, et al. MicroRNA gene expression deregulation in human breast cancer. Cancer Res. 2005;65(16):7065–70. Epub 2005/08/17. 10.1158/0008-5472.CAN-05-1783 . [DOI] [PubMed] [Google Scholar]
  • 19.Nassar FJ, Nasr R, Talhouk R. MicroRNAs as biomarkers for early breast cancer diagnosis, prognosis and therapy prediction. Pharmacology & Therapeutics. 2017;172:34–49. 10.1016/j.pharmthera.2016.11.012. [DOI] [PubMed] [Google Scholar]
  • 20.Goh JN, Loo SY, Datta A, Siveen KS, Yap WN, Cai W, et al. microRNAs in breast cancer: regulatory roles governing the hallmarks of cancer. Biological Reviews. 2016;91(2):409–28. 10.1111/brv.12176 [DOI] [PubMed] [Google Scholar]
  • 21.Volinia S, Galasso M, Sana ME, Wise TF, Palatini J, Huebner K, et al. Breast cancer signatures for invasiveness and prognosis defined by deep sequencing of microRNA. Proc Natl Acad Sci U S A. 2012;109(8):3024–9. 10.1073/pnas.1200010109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Yan LX, Huang XF, Shao Q, Huang MAY, Deng L, Wu QL, et al. MicroRNA miR-21 overexpression in human breast cancer is associated with advanced clinical stage, lymph node metastasis and patient poor prognosis. RNA (New York, NY). 2008;14(11):2348–60. 10.1261/rna.1034808 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Wu JZ, Yang TJ, Lu P, Ma W. Analysis of signaling pathways in recurrent breast cancer. Genetics and Molecular Research. 2014;13(4):10097–104. 10.4238/2014.December.4.4 WOS:000350229200042. [DOI] [PubMed] [Google Scholar]
  • 24.Kashyap D, Kaur H. Cell-free miRNAs as non-invasive biomarkers in breast cancer: Significance in early diagnosis and metastasis prediction. Life Sci. 2020:117417 Epub 2020/02/12. 10.1016/j.lfs.2020.117417 . [DOI] [PubMed] [Google Scholar]
  • 25.Tashkandi H, Shah N, Patel Y, Chen H. Identification of new miRNA biomarkers associated with HER2-positive breast cancers. Oncoscience. 2015;2(11):924–9. Epub 2015/12/24. 10.18632/oncoscience.275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Güllü G. Clinical significance of miR-140-5p and miR-193b expression in patients. 2015;38(1):21–9. 10.1590/s1415-475738120140167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chrisanthar R, Knappskog S, Lokkevik E, Anker G, Ostenstad B, Lundgren S, et al. CHEK2 mutations affecting kinase activity together with mutations in TP53 indicate a functional pathway associated with resistance to epirubicin in primary breast cancer. PLoS ONE. 2008;3(8):e3062 Epub 2008/08/30. 10.1371/journal.pone.0003062 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lonning PE, Helle H, Duong NK, Ekse D, Aas T, Geisler J. Tissue estradiol is selectively elevated in receptor positive breast cancers while tumour estrone is reduced independent of receptor status. J Steroid Biochem Mol Biol. 2009;117(1–3):31–41. Epub 2009/07/14. 10.1016/j.jsbmb.2009.06.005 . [DOI] [PubMed] [Google Scholar]
  • 29.Sasidharan V, Lu YC, Bansal D, Dasari P, Poduval D, Seshasayee A, et al. Identification of neoblast- and regeneration-specific miRNAs in the planarian Schmidtea mediterranea. RNA (New York, NY). 2013;19(10):1394–404. Epub 2013/08/27. 10.1261/rna.038653.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Krishna S, Nair A, Cheedipudi S, Poduval D, Dhawan J, Palakodeti D, et al. Deep sequencing reveals unique small RNA repertoire that is regulated during head regeneration in Hydra magnipapillata. Nucleic Acids Res. 2013;41(1):599–616. Epub 2012/11/21. 10.1093/nar/gks1020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Knappskog S, Chrisanthar R, Lokkevik E, Anker G, Ostenstad B, Lundgren S, et al. Low expression levels of ATM may substitute for CHEK2 /TP53 mutations predicting resistance towards anthracycline and mitomycin chemotherapy in breast cancer. Breast Cancer Res. 2012;14(2):R47 10.1186/bcr3147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Enright AJ, John B, Gaul U, Tuschl T, Sander C, Marks DS. MicroRNA targets in Drosophila. Genome biology. 2003;5(1):R1 Epub 2004/01/08. 10.1186/gb-2003-5-1-r1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.John B, Enright AJ, Aravin A, Tuschl T, Sander C, Marks DS. Human MicroRNA targets. PLoS biology. 2004;2(11):e363 Epub 2004/10/27. 10.1371/journal.pbio.0020363 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wong N, Wang X. miRDB: an online resource for microRNA target prediction and functional annotations. Nucleic Acids Res. 2015;43(Database issue):D146–52. Epub 2014/11/08. 10.1093/nar/gku1104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120(1):15–20. Epub 2005/01/18. 10.1016/j.cell.2004.12.035 . [DOI] [PubMed] [Google Scholar]
  • 36.Smith TF, Waterman MS. Identification of common molecular subsequences. J Mol Biol. 1981;147(1):195–7. Epub 1981/03/25. 10.1016/0022-2836(81)90087-5 . [DOI] [PubMed] [Google Scholar]
  • 37.Wuchty S, Fontana W, Hofacker IL, Schuster P. Complete suboptimal folding of RNA and the stability of secondary structures. Biopolymers. 1999;49(2):145–65. Epub 1999/03/10. . [DOI] [PubMed] [Google Scholar]
  • 38.Wang X, El Naqa IM. Prediction of both conserved and nonconserved microRNA targets in animals. Bioinformatics. 2008;24(3):325–32. Epub 2007/12/01. 10.1093/bioinformatics/btm595 . [DOI] [PubMed] [Google Scholar]
  • 39.Friedman RC, Farh KK, Burge CB, Bartel DP. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 2009;19(1):92–105. Epub 2008/10/29. 10.1101/gr.082701.108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Grimson A, Farh KK, Johnston WK, Garrett-Engele P, Lim LP, Bartel DP. MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Molecular cell. 2007;27(1):91–105. Epub 2007/07/07. 10.1016/j.molcel.2007.06.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Yates LR, Gerstung M, Knappskog S, Desmedt C, Gundem G, Van Loo P, et al. Subclonal diversification of primary breast cancer revealed by multiregion sequencing. Nat Med. 2015;21(7):751–9. 10.1038/nm.3886 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Chang JT, Nevins JR. GATHER: a systems approach to interpreting genomic signatures. Bioinformatics. 2006;22(23):2926–33. Epub 2006/09/27. 10.1093/bioinformatics/btl483 . [DOI] [PubMed] [Google Scholar]
  • 43.Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30. Epub 1999/12/11. 10.1093/nar/28.1.27 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Huang DW, Sherman BT, Tan Q, Collins JR, Alvord WG, Roayaei J, et al. The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists. Genome Biol. 2007;8(9):R183 Epub 2007/09/06. 10.1186/gb-2007-8-9-r183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Alexa A, Rahnenfuhrer J. topGO: Enrichment Analysis for Gene Ontology. 2.38.1 ed2019. p. R package
  • 46.Curtis C. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. 486(7403):346–52. 10.1038/nature10983 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47 Epub 2015/01/22. 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Barbosa-Morais NL, Dunning MJ, Samarajiwa SA, Darot JF, Ritchie ME, Lynch AG, et al. A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data. Nucleic Acids Res. 2010;38(3):e17 Epub 2009/11/20. 10.1093/nar/gkp942 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Backes C, Fehlmann T, Kern F, Kehl T, Lenhof HP, Meese E, et al. miRCarta: a central repository for collecting miRNA candidates. Nucleic Acids Res. 2018;46(D1):D160–D7. Epub 2017/10/17. 10.1093/nar/gkx851 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Lim EL, Trinh DL, Scott DW, Chu A, Krzywinski M, Zhao Y, et al. Comprehensive miRNA sequence analysis reveals survival differences in diffuse large B-cell lymphoma patients. Genome biology. 2015;16:18 Epub 2015/02/28. 10.1186/s13059-014-0568-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, et al. Molecular portraits of human breast tumours. Nature. 2000;406(6797):747–52. http://www.nature.com/nature/journal/v406/n6797/suppinfo/406747a0_S1.html. 10.1038/35021093 [DOI] [PubMed] [Google Scholar]
  • 52.Chrisanthar R, Knappskog S, lokkevik E, Anker G, Ostenstad B, Lundgren S, et al. Predictive and prognostic impact of TP53 mutations and MDM2 promoter genotype in primary breast cancer patients treated with epirubicin or paclitaxel. PLoS ONE. 2011;6(4):e19249 10.1371/journal.pone.0019249 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, et al. Molecular portraits of human breast tumours. Nature. 2000;406(6797):747–52. 10.1038/35021093 . [DOI] [PubMed] [Google Scholar]
  • 54.Sorlie T, Perou CM, Tibshirani R, Aas T, Geisler S, Johnsen H, et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci U S A. 2001;98(19):10869–74. 10.1073/pnas.191367098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Roy D, Guida P, Zhou G, Echiburu-Chau C, Calaf GM. Gene expression profiling of breast cells induced by X-rays and heavy ions. International journal of molecular medicine. 2008;21(5):627–36. Epub 2008/04/22. 10.3892/ijmm.21.5.627 . [DOI] [PubMed] [Google Scholar]
  • 56.Yates LR, Knappskog S, Wedge D, Farmery JHR, Gonzalez S, Martincorena I, et al. Genomic Evolution of Breast Cancer Metastasis and Relapse. Cancer Cell. 2017;32(2):169–84.e7. 10.1016/j.ccell.2017.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.von Marschall Z, Fisher LW. Secreted Frizzled-Related Protein-2 (sFRP2) Augments Canonical Wnt3a-induced Signaling. Biochem Biophys Res Commun. 2010;400(3):299–304. 10.1016/j.bbrc.2010.08.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Rattner A, Hsieh JC, Smallwood PM, Gilbert DJ, Copeland NG, Jenkins NA, et al. A family of secreted proteins contains homology to the cysteine-rich ligand-binding domain of frizzled receptors. Proc Natl Acad Sci U S A. 1997;94(7):2859–63. 10.1073/pnas.94.7.2859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Hankey W, Frankel WL, Groden J. Functions of the APC tumor suppressor protein dependent and independent of canonical WNT signaling: Implications for therapeutic targeting. Cancer metastasis reviews. 2018;37(1):159–72. 10.1007/s10555-017-9725-6 PMC5803335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Isobe T, Hisamori S, Hogan DJ, Zabala M, Hendrickson DG, Dalerba P, et al. miR-142 regulates the tumorigenicity of human breast cancer stem cells through the canonical WNT signaling pathway. eLife. 2014;3 Epub 2014/11/19. 10.7554/eLife.01977 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Tan Z, Zheng H, Liu X, Zhang W, Zhu J, Wu G, et al. MicroRNA-1229 overexpression promotes cell proliferation and tumorigenicity and activates Wnt/beta-catenin signaling in breast cancer. Oncotarget. 2016;7(17):24076–87. Epub 2016/03/19. 10.18632/oncotarget.8119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Liu S, Wang Z, Liu Z, Shi S, Zhang Z, Zhang J, et al. miR-221/222 activate the Wnt/β-catenin signaling to promote triple negative breast cancer. Journal of Molecular Cell Biology. 2018:mjy041–mjy. 10.1093/jmcb/mjy041 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Bernard Mari

7 Jan 2020

PONE-D-19-30537

The novel microRNAs hsa-miR-nov7 and hsa-miR-nov3 are over-expressed in locally advanced breast cancer

PLOS ONE

Dear Dr Knappskog,

Thank you for submitting your manuscript to PLoS ONE. Your manuscript has been reviewed by two experts in the field and their comments are appended below. After reading the reviews and looking at the manuscript, we feel that your study has merit, but is not suitable for publication as it currently stands. Therefore, my decision is "Major Revision”.

You must revise accordingly and explain your revisions in a covering letter if you wish for us to consider your paper further for publication. Note that it will have to go through another round of review.

We invite you to submit a revised version of the manuscript that addresses the concerns raised by the reviewers and myself. Please pay attention to all the suggestions and give them due consideration.

Specifically:

You should answer to the comments of Reviewer 1. As raised by the reviewer, NGS technique can give several biases and validation should include Northern blot, as frequently recommended (see Alles et al. Nucleic Acids Research, 2019, Vol. 47, No. 7 3353–3364). It would be also highly recommended to add additional functional data to explore the potential relevance of these miRNAs, as suggested by the reviewer.

You should also clearly respond to the three major concerns raised by Reviewer 2. Several methodological and statistical points have to be clarified and / or completed.

Finally, I have also personally raised two additional points: first, the authors should submit the whole small RNA-Seq dataset to a public database such as Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/). Second, they should also match (and eventually submit) their prediction with a central repository for collecting miRNA candidates such as miRCarta (Backes et al. Nucleic Acids Res. 2018 Jan 4;46(D1):D160-D167) to check for potential redundancies within their sequences.

We would appreciate receiving your revised manuscript by Feb 21 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Bernard Mari, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

3. Thank you for including your ethics statement: 

"The study involves use of human material from cancer patients. All patients provided written informed consent, and the studies conducted in accordance to national laws, regulation and ethical permissions."

a. Please amend your current ethics statement to include the full name of the ethics committee/institutional review board(s) that approved your specific study.

b. Once you have amended this/these statement(s) in the Methods section of the manuscript, please add the same text to the “Ethics Statement” field of the submission form (via “Edit Submission”).

For additional information about PLOS ONE ethical requirements for human subjects research, please refer to http://journals.plos.org/plosone/s/submission-guidelines#loc-human-subjects-research.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors described the over-expression of two novel miRNAs, hsa-miRNA-nov3 and hsa-miRNA-nov7 in breast cancer. Out of these, one was not reported previously and the other not reported previously in breast cancer. The in-silico analysis to explore the putative functional role is well designed and follows a logical and systematic approach. However, there is no strong experimental evidence of their relevance to biological function, probably because of their low expression levels, as mentioned by authors. Moreover, miRNA nov3 has statistically insignificant association with patient survival, while miR-nov7 has no association with patient survival. Despite the computational approach to explore the potential functional relevance of these miRNAs, the significance of this work in the larger context is weak, provided lack of association between miRNAs and patients’ survival. Moreover, based on the evidence presented, it is hard to distinguish whether miRNAs are involved in tumorigenesis or their over-expression is a mere co-incidental effect of some other molecular pathway.

Also, NGS technique is well used to study miRNAs but it is not a strong system to find novel miRNAs. It is more accurate to use Northern blot to claim novel miRNAs.

Reviewer #2: In this manuscript, the authors have analyzed small RNA sequencing data for 50 locally advanced breast cancers to identify novel miRNAs. 10 novel miRNAs were predicted using mirDeep software and found 2 miRNAs (hsa-miR-nov3 & hsa-miR-nov7) over-expressed in tumor versus normal breast tissue in a separate set of 13 patients. They found hsa-miR-nov3 expressed at higher levels in ER-positive as compared to ER-negative tumors. They also predicted target genes for these 2 microRNAs and identified inversely correlated genes in mRNA expression array data available from 203 out of the 223 patients. Also, they have done KEGG and GO annotations to target genes that revealed pathways essential to cell development, communication and homeostasis. Likewise, they observed a weak association between high expression levels of hsa-miR-nov7 and poor survivals that did not reach statistical significance. As the author has done a very systemic work, but before publication, following issues should be addressed:

1. The author has written the introduction section that describes the role of mirna in cancer. Author has given examples of mirna that are involved in breast cancer. It would be very useful if the author provides some latest study for mirna that have been used as a biomarker in cancer diagnosis.

2. As there are number of gene enrichment tools (like DAVID, GSEA). Authors may also use other tool also for functional gene enrichment.

3. As given in Table 2 and 3, what are the criteria for selecting top 10 GO annotation as well as KEGG annotation. Also, please define how Bayes factor, p-value and FDR value calculated?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Manoj Kumar

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Apr 16;15(4):e0225357. doi: 10.1371/journal.pone.0225357.r002

Author response to Decision Letter 0


20 Feb 2020

Response to Editor and Reviewers regarding manuscript PONE-D-19-30537

“The novel microRNAs hsa-miR-nov7 and hsa-miR-nov3 are over-expressed in locally advanced breast cancer”

In the following, comments from the Editor and Reviewers are given in blue italic, while our responses are given in normal black font.

Comments from the Editor

Specifically:

You should answer to the comments of Reviewer 1. As raised by the reviewer, NGS technique can give several biases and validation should include Northern blot, as frequently recommended (see Alles et al. Nucleic Acids Research, 2019, Vol. 47, No. 7 3353–3364). It would be also highly recommended to add additional functional data to explore the potential relevance of these miRNAs, as suggested by the reviewer.

You should also clearly respond to the three major concerns raised by Reviewer 2. Several methodological and statistical points have to be clarified and / or completed.

Response:

Please see our responses to these issues below, under our responses to the Reviewers’ specific comments.

Finally, I have also personally raised two additional points: first, the authors should submit the whole small RNA-Seq dataset to a public database such as Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/).

Response:

We fully concur with this and have now submitted the dataset as requested. The accession number is GSE145151. This information has now been included in the Materials and Methods section of the revised manuscript.

Second, they should also match (and eventually submit) their prediction with a central repository for collecting miRNA candidates such as miRCarta (Backes et al. Nucleic Acids Res. 2018 Jan 4;46(D1):D160-D167) to check for potential redundancies within their sequences.

Response:

We agree with the Reviewer and have now cross checked the potential presence of our novel miRNAs against human precursor miRNA sequences obtained from miRCarta database (Backes et al., 2018) (curated from miRNA information from databases such as miRbase, miRmaster, TargetScan). Alignment of our sequences against the miRCarta database resulted in no hits for any of our two miRNAs presented in the present manuscript, reinforcing the novelty of the miRNAs.

This cross check and its result is now included in the Results section of the revised manuscript (page 7).

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and

http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Response:

This has now been checked for the revised manuscript.

2. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

Response:

We have now removed the phrase “data not shown” with respect to the assessment of potential correlation between miRNA expression levels and response to therapy. Instead, we now refer to a new Supporting Table (Supporting Table S5) where we specify the statistics performed and the resulting p-values, as well as give the full data set behind the calculations. We have also removed the phrase “data not shown” with respect to survival analyses in subgroups breast cancers. Instead, we now state the range of the non-significant p-values for these analyses.

3. Thank you for including your ethics statement:

"The study involves use of human material from cancer patients. All patients provided written informed consent, and the studies conducted in accordance to national laws, regulation and ethical permissions."

a. Please amend your current ethics statement to include the full name of the ethics committee/institutional review board(s) that approved your specific study.

Response:

We have now included the full name of the REC approving our study and biobanks in the Material and Method section, page 4.

b. Once you have amended this/these statement(s) in the Methods section of the manuscript, please add the same text to the “Ethics Statement” field of the submission form (via “Edit Submission”).

Response:

We have now revised the submission form accordingly.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

5. Review Comments to the Author

Reviewer #1:

The authors described the over-expression of two novel miRNAs, I-miRNA-nov3 and I-miRNA-nov7 in breast cancer. Out of these, one was not reported previously and the other not reported previously in breast cancer. The in-silico analysis to explore the putative functional role is well designed and follows a logical and systematic approach. However, there is no strong experimental evidence of their relevance to biological function, probably because of their low expression levels, as mentioned by authors. Moreover, miRNA nov3 has statistically insignificant association with patient survival, while miR-nov7 has no association with patient survival. Despite the computational approach to explore the potential functional relevance of these miRNAs, the significance of this work in the larger context is weak, provided lack of association between miRNAs and patients’ survival. Moreover, based on the evidence presented, it is hard to distinguish whether miRNAs are involved in tumorigenesis or their over-expression is a mere co-incidental effect of some other molecular pathway.

Response:

We understand these concerns from the reviewer and to a large extent we agree.

Regarding functional role of the two miRNAs, we have now performed additional computational analyses using different tools in order to validate our original findings. This is further described in our response to Reviewer #2’s point 2.

Regarding the functional relevance / biological importance of the two miRNAs, we discussed (as pointed out by the reviewer) that the functions may be weak due to low expression levels. The fact that we performed our open screen and detected these two miRNAs (but not others in more than single patients), we believe to be an important message per se; it gives a message to the scientific community that the likelihood of detecting novel miRNAs in breast cancer with high expression levels and important functions in a majority of patients, may be limited. We have now emphasized this point in the Discussion section of the revised manuscript.

Also, NGS technique is well used to study miRNAs but it is not a strong system to find novel miRNAs. It is more accurate to use Northern blot to claim novel miRNAs.

Response:

This point is well taken. We fully agree that NGS is not an optimal strategy for identification of novel miRNAs on its own, but that it needs subsequent validation. In the present work, we used NGS as an initial screen for potential novel miRNAs, with an open approach, and then moved on to validation. Regarding Northern blot analyses, we agree that this could be one way of validating the presence of miRNAs. The reason for this is two-fold: Northern will give some information about sequence (by binding of a probe with known sequence) and a size estimate (by migration under electrophoresis). In the present study, we have used a validation set-up that we believe is superior to Northern with respect to both these measures: We have poly-A-tagged, amplified, cloned and sequenced the patient’s miRNA. This has enabled us to determine both the exact sequence and the exact size of the novel miRNAs (results shown in Figure 2). We believe this is the best possible validation of their presence and nature. However, when reading the comments from the Reviewer and the Editor, we realize that this point was not properly explained in the original manuscript and have now revised the text to make this more clear for the reader: We have added a new paragraph to the Materials and Methods section (page 5) as well as added sentences specifying this point both in the main text of the Results section and the legend to Figure 2 (page 8).

Reviewer #2:

In this manuscript, the authors have analysed small RNA sequencing data for 50 locally advanced breast cancers to identify novel miRNAs. 10 novel miRNAs were predicted using mirDeep software and found 2 miRNAs (I-miR-nov3 & I-miR-nov7) over-expressed in tumor versus normal breast tissue in a separate set of 13 patients. They found I-miR-nov3 expressed at higher levels in ER-positive as compared to ER-negative tumors. They also predicted target genes for these 2 microRNAs and identified inversely correlated genes in mRNA expression array data available from 203 out of the 223 patients. Also, they have done KEGG and GO annotations to target genes that revealed pathways essential to cell development, communication and homeostasis. Likewise, they observed a weak association between high expression levels of I-miR-nov7 and poor survivals that did not reach statistical significance. As the author has done a very systemic work, but before publication, following issues should be addressed:

1. The author has written the introduction section that describes the role of mirna in cancer. Author has given examples of mirna that are involved in breast cancer. It would be very useful if the author provides some latest study for mirna that have been used as a biomarker in cancer diagnosis.

Response:

This point I well taken. We feel the introduction was well balanced, so we have not added very much new text, but in the revised manuscript, we have now included information about the use of miRNAs a biomarkers for cancer diagnosis, as suggested by the Reviewer, and refer to a recent review with detailed overview of the original literature in the field.

2. As there are number of gene enrichment tools (like DAVID, GSEA). Authors may also use other tool also for functional gene enrichment.

Response:

We fully agree with this and realize we could reach further validations regarding the potential biological roles of the miRNAs by applying additional gene enrichment tools. In the revised manuscript, we have included analyses both by DAVID and topGO. These are now included as validations in the Results section and we have given the detailed output as the new Supporting Table S2. (Note that this new table has caused re-numbering of other Supporting tables).

3. As given in Table 2 and 3, what are the criteria for selecting top 10 GO annotation as well as KEGG annotation. Also, please define how Bayes factor, p-value and FDR value calculated?

Response:

The selection of the top 10 annotations (Table 2) and the top 25 correlations (Table 3) were based on an arbitrary cut-offs, simply for reader friendliness of the Tables. We realize that this was not clear, and have now modified the legends to make this clearer for the reader.

Bayes factor, p-values and FDR were all calculated using the GATHER-algorithm (Chang and Nevins, 2006).

Here, Bayes factor is a measure of the strength of annotation. The actual calculations and the formulas are given in the original publication (Chang and Nevins, 2006); in brief, the comparison is based on a 2x2 table where one dimension is annotations associated with 2 compared groups and the other dimension is the number of genes associated with the annotation. In the output (shown in our table 2), Bayes factor is given in one column, with a p-value related to the Bayes factor estimate in the next column.

In addition, the 2x2 comparison can be assessed by a regular Fischer’s exact test. This is given as “FE” in the output, along with the FDR based on this Fischer’s exact test. Both p-values and the FDR are given as ln of the actual values as default (for a more compact representation).

Note that the precision of the method was also assessed in detail in the original publication (please see figure 2b in the original publication (Chang and Nevins, 2006))

We realize that these points were not evident from the presentation in our original manuscript and that the headings in Table 2 were not explained to the reader. We have now clarified this by inserting explanations as footnotes to Table2.

Attachment

Submitted filename: Response letter to PONE R1.docx

Decision Letter 1

Bernard Mari

17 Mar 2020

The novel microRNAs hsa-miR-nov7 and hsa-miR-nov3 are over-expressed in locally advanced breast cancer

PONE-D-19-30537R1

Dear Dr. Knappskog,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Bernard Mari, Ph.D

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: Authors have addressed all the queries raised and updated the manuscript accordingly. Revised manuscript may now be accepted for publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: Yes: Manoj Kumar

Acceptance letter

Bernard Mari

25 Mar 2020

PONE-D-19-30537R1

The novel microRNAs hsa-miR-nov7 and hsa-miR-nov3 are over-expressed in locally advanced breast cancer

Dear Dr. Knappskog:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Bernard Mari

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Predicted novel miRNAs.

    Table on the upper left shows miRDeep2 scores and read counts. RNA secondary structure for miRNA on the top right. Color code for depiction as follows mature sequence in red, loop sequence in yellow and purple for star sequences. Density plot in the middle shows distribution of reads in precursor reads predicted. Dotted lines illustrate alignment and mm, number of mismatches. Exp, is potential precursor model predicted by algorithm with taking accounts of stability based on free energy, position and read frequencies according to Dicer/Drosha processing of miRNA. Obs, is position and reads found from deep sequencing data. (A) hsa-miR-nov3 and (B) hsa-miR-nov7.

    (DOCX)

    S1 Table. In-house pan-cancer panel of 283 tumor suppressor genes.

    Panel generated based on CGPv2/3-panels [41], Roche’s Comprehensive Cancer Design along with manual literature search, to filter target genes of interest.

    (DOCX)

    S2 Table. Predicted miRNA-targets by DAVID and topGO.

    (XLSX)

    S3 Table. Correlation miRNAs and tumour suppressor genes.

    Spearman correlation table for hsa-miR-nov3 (A) and hsa-miR-nov7 (B) inversely correlated tumor suppressor genes.

    (DOCX)

    S4 Table. Correlations mir7 and gene ontology.

    (XLS)

    S5 Table. Correlations mir3 and gene ontology.

    (XLS)

    S6 Table. Negatively correlated genes (validation analyses).

    (XLSX)

    S7 Table. Positively correlated genes (validation analyses).

    (XLSX)

    S8 Table. Statistics mir3 and mir7 versus response to treatment.

    (XLSX)

    S1 File. Supporting information mirDeep.

    (DOCX)

    Attachment

    Submitted filename: Response letter to PONE R1.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files and the raw data are available through the Gene Expression Omnibus (accession: GSE145151).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES