Abstract
Long non-coding RNAs (lncRNAs) are emerging as a critical regulator in controlling the expression level of genes involved in cell differentiation and development. Primordial follicle activation (PFA) is the first step for follicle maturation, and excessive PFA results in premature ovarian insufficiency (POI). However, the correlation between lncRNA and cell differentiation was largely unknown, especially during PFA. In this study, we observed the expression level of lncRNA was more specific than protein-coding genes in both follicles and granulosa cells, suggesting lncRNA might play a crucial role in follicle development. Hence, a systematical framework was needed to infer the functions of lncRNAs during PFA. Additionally, an increasing number of studies indicate that the subpathway is more precise in reflecting biological processes than the entire pathway. Given the complex expression patterns of lncRNA target genes, target genes were further clustered based on their expression similarity and classification performance to reveal the activated/inhibited gene modules, which intuitively illustrated the diversity of lncRNA regulation. Moreover, the knockdown of SBF2-AS1 in the A549 cell line and ZFAS1 in the SK-Hep1 cell line further validated the function of SBF2-AS1 in regulating the Hippo signaling subpathway and ZFAS1 in the cell cycle subpathway. Overall, our findings demonstrated the importance of subpathway analysis in uncovering the functions of lncRNAs during PFA, and paved new avenues for future lncRNA-associated research.
Keywords: lncRNA, subpathway, target gene cluster, primordial follicle activation, premature ovarian insufficiency
1. Introduction
Follicle maturation refers to the development of primordial follicles into preovulatory follicles, where follicles and granulosa cells undergo extensive gene expression changes [1,2,3]. Primordial follicles form around birth and are non-renewable. The majority of primordial follicles maintain a dormant state. Primordial follicle activation refers to the process by which primordial follicles change from a quiescent to a growing state, i.e., the transition from a primordial follicle to a primary follicle. Primordial follicle activation (PFA) is a prerequisite for folliculogenesis, which refers to the process where follicle size enlarges, and the morphology of granulosa cells becomes cuboidal [4]. Researchers have indicated that excessive activation of the primordial follicles resulted in premature ovarian insufficiency (POI) and, ultimately, female sterility [5]. Studies indicate that POI patients still have some residual primordial follicles in their ovaries, which provides a new direction for the treatment by promoting the activation of these primordial follicles [6,7]. Therefore, understanding the mechanisms driving PFA is of particular importance.
Long non-coding RNAs (lncRNAs) are operationally defined as non-coding RNA transcripts longer than 200 nucleotides (nt). LncRNAs are involved in cell commitment by regulating the expression of certain genes [8,9]. Increasing studies have indicated that lncRNAs showed higher lineage specificity in multiple human organs than protein-coding genes [10]. However, limited reports focus on lncRNA during PFA. Therefore, there is an urgent need to develop computational approaches to explore the function of lncRNA during PFA.
LncRNA exerts its function by regulating hundreds or even thousands of target genes, while these target genes may have opposite expression patterns under certain biological contexts [11,12]. Therefore, it is necessary to divide target genes into different groups based on their expression value.
Pathway analysis is widely used to interpret relevant biological changes under certain contexts [13,14]. However, the entire pathway is often too large to accurately elucidate relevant biological regulation. A local region with specific biological functions is defined as a subpathway and may be more suitable for uncovering relevant biological events. Recently, a growing body of research indicated that subpathway analysis was more effective in elucidating the pathogenesis of disease and identifying cancer risk gene markers [15,16,17]. However, the application of subpathways to elucidate the mechanism of reproductive cell development is still lacking. Moreover, there were limited subpathway analysis studies at the single-cell level.
Collectively, to further improve our understanding of lncRNAs involved in stepwise and follicle maturation, a subpathway and target gene cluster-based approach was applied to predict lncRNA–subpathway connections, which provides new insight into the underlying mechanisms of lncRNAs during PFA more precisely.
2. Results
2.1. LncRNAs Improve Cell Type Classification during Human Follicle Maturation
We obtained follicles and corresponding granulosa cells from the primordial to preovulatory stages in GSE107746 [1]. In order to reveal whether lncRNA is cell-specific during follicle maturation, we performed UMAP analysis to visualize the distribution of follicles and granulosa cells from the primordial to preovulatory stages based on protein-coding genes and lncRNAs. We observed that the lncRNA signature displayed higher cell type specificity than protein-coding genes (Figure 1A). The standard deviation (sd) value of lncRNAs was lower than that of protein-coding genes, indicating that the expression pattern of lncRNA was more consistent in cells (Figure 1B). The number of expressed lncRNAs gradually increased in follicles from the primordial to preovulatory stages (see the method in Figure 1C). In order to rule out the effects of follicular size on the number of expressed protein-coding genes and lncRNAs, the attribute of nFeature_RNA of was calculated, and we found that the number of expressed lncRNAs was independent of follicular size (Figure S1A). In addition, there were fewer overlapping lncRNAs between the follicles and corresponding granulosa cells from the primordial to preovulatory stages (Figure 1D). Collectively, lncRNA showed higher cell type specificity than protein-coding genes during follicle maturation.
Figure 1.
The specificity of lncRNAs during human follicle maturation. (A) The left panel shows the development of human follicle maturation. The right panel shows the UMAP plot of follicles and granulosa cells by protein-coding genes or lncRNAs. (B) Standard deviation analysis showing the distribution differences in each cell type by protein-coding genes and lncRNAs. (C) The number of expressed protein-coding genes and lncRNAs. (D) Upper Venn diagrams show the overlapped protein-coding genes between follicles and granulosa cells, and bottom Venn diagrams show the overlapping lncRNAs. From primordial to preovulatory stages, the turquoise1, dark green, darkkhaki, hotpink and royalblue1 colors indicate follicles. The strong orange, darkcyan, burlywood4, snow4 and slateblue colors indicate granulosa cells.
2.2. Distinct Transcriptomic Expression of lncRNAs during PFA
PFA is a crucial step for female fertility [18], and excessive PFA results in POI. Increasing evidence indicates that PFA is a tightly controlled process [19,20]. However, the exact mechanism of controlling the selective activation of primordial follicles mediated by lncRNAs is unknown. According to Figure S1B, from primordial to primary stages, unions of 648 expressed lncRNAs in follicles and 170 lncRNAs in granulosa cells were acquired. By comparing the expression level of lncRNA between follicles and granulosa cells, we identified 65 upregulated lncRNAs in follicles and 22 upregulated lncRNAs in granulosa cells during PFA (Figure 2A, Table S1). For example, SBF2-AS1, MRVI1-AS1, CASC8, TEX41, CYP1B1-AS1, and OIP5-AS1 were mainly expressed in follicles, whereas WT1-AS, THAP9-AS1, GAS5, MALAT1, ZFAS1, MIR22HG, and SNHG8 had higher expression level in granulosa cells. Among them, CASC8 was specifically expressed in MII oocytes [21]. Moreover, across nine GTEx tissues (heart, liver, spleen, lung, kidney, testis, vagina, uterus, and ovary), we noticed that the majority of these lncRNAs were expressed in ovary tissues (Figure 2B). Hence, the expression of these lncRNAs at the single-cell and bulk levels provided a prelude to indicate the importance of lncRNA during PFA.
Figure 2.
Dynamic expression of lncRNAs in follicles and granulosa cells during PFA. (A) The left heat map shows the typical differentially expressed lncRNAs (Wilcox test, Bonferroni adjust p < 0.05, see Table S1). Red and green represent an increase and the decrease in the lncRNA expression levels, respectively. The bar on the top indicates cell types using lime green (follicles) and grey (granulosa cells). Right dot plot shows the expression pattern in follicles and granulosa cells during PFA. (B) The expression level of lncRNAs across 9 tissues in GTEx project.
2.3. Identifying Primordial Follicle Activation Associated lncRNA Clusters
LncRNAs play crucial roles in gene regulation at post-transcriptional levels [22,23]. In this study, bioinformatics analysis used the competitive endogenous RNA (ceRNA) theory to infer the lncRNA target genes. During PFA, a total of 97 lncRNA-associated target gene pairs in follicles and 47 lncRNA-corresponding target gene pairs in granulosa cells were identified (see details in Section 4). Given the expression diversity of target genes, we applied a four-step computational framework that divided lncRNA target genes into different clusters based on the genes’ PCC similarity and classification performance (AUC value) during PFA (Figure 3A). The number of lncRNA target gene clusters varied from 1 to 6 in follicles and 1 to 11 in granulosa cells during PFA (Figure 3B). LncRNA target genes for each cluster were indicated. For example, SBF2-AS1 was highly expressed in primordial follicles and primary follicles. There were five target gene clusters of SBF2-AS1. Among these clusters, PTGS2 in cluster 1 played a crucial role in the acquisition of oocyte competence [10]. IGF1R and KIT in cluster 2 were fertility-associated genes. In addition, ZFAS1 was mainly expressed in granulosa cells with three target gene clusters (Figure S2). Target genes in cluster 1 were mainly activated in primary granulosa cells, and genes like BECN1, CDKN1B, EIF2B2, and MTRR were POI-related genes [24]. These results suggested that grouping lncRNA target genes into different clusters was beneficial for further analysis of follicle maturation-associated functional gene modules.
Figure 3.
Pipeline of lncRNA target gene clusters. (A) Workflow of lncRNA target gene cluster identification. (B) The number distribution of lncRNA target gene clusters in follicles (upper panel) and granulosa cells (lower panel) during primordial follicle activation. (C) Heat maps of cluster genes of SBF2-AS1, MALAT1, and ZFAS1 in primordial and primary follicles.
2.4. Differences between Subpathway and Pathway during Follicle Maturation
Pathway identification may improve our understanding of lncRNA function. However, the traditional biological pathway is complex and covers multiple genes. A growing number of studies have indicated that the cell fate transition was often caused by changes in local regions (subpathways) within the biological pathway rather than the whole pathway [25,26]. However, the detailed mechanism of subpathway activation required further investigation.
In this study, 725 subpathways were identified from 238 pathways by the R package psSubpathway (Figure 4A). Each pathway contained 1–20 subpathways, which consisted of 3 to 399 genes. Then, a total of 210 subpathways in follicles and 234 subpathways in granulosa cells were significantly activated during PFA ((Formula (2) in Materials and Methods, p < 0.05). Studies have reported that the “cell cycle pathway”, “mTOR signaling pathway”, “ErbB signaling pathway”, and “FoxO signaling pathway” were associated with follicle growth [1,27,28,29,30]. Comparing the ssGSEA (single sample Gene Set Enrichment Analysis) score and pathway specificity score between pathway and corresponding subpathways, we observed high heterogeneity of activation among the pathway and corresponding subpathways in follicles, which was low in granulosa cells (Figure 4B and Figure S3). We noticed that the expression level of ligand/receptor was relatively conserved in granulosa cells in Zhang et al.’s study [1], and the UMAP distribution of granulosa cells was adjacent from the primordial to preovulatory stages using protein-coding genes (Figure 1A), suggesting that the differentiation of granulosa cells did not change much. Hence, we hypothesized that the activation pattern of the subpathway in granulosa cells showed lower heterogeneity, suggesting that the transcriptome changes in granulosa cells were relatively stable in inducing follicle maturation. In summary, these results indicated that it is more precise to depict the follicle fate transition using subpathways.
Figure 4.
The dynamics of subpathways in follicles from primordial to preovulatory stages. (A) Identification of subpathway by k-clique algorithm. The left histogram is the distribution of the number of genes among 725 subpathways ranging from 3 to 399. The middle histogram is the number of subpathways among each pathway ranging from 1 to 20. The right table lists corresponding subpathways for a given pathway. (B) The difference between “Cell cycle”, “mTOR signaling pathway”, “ErbB signaling pathway”, and “FoxO signaling pathway” and their corresponding subpathways in follicles from primordial to preovulatory stages. Take “Cell cycle” pathway as example; the left heat map shows the ssGSEA score between “Cell cycle” pathway and 5 subpathways (04110_3, 04110_4, 04110_6, 04110_8, and 04110_9). The right line chart indicates the dynamic change in pathway specificity; red line represents the entire pathway, whereas other color lines represent corresponding subpathways.
2.5. The lncRNA–Subpathway Network and Its Characteristics during Primordial Follicle Activation
After identifying the subpathways dynamically activated during PFA, we investigated whether the expressed lncRNA-target gene clusters work together to regulate a specific subpathway. In this study, a hypergeometric test was used to measure the statistical significance of shared protein-coding genes between subpathway genes and lncRNA target cluster genes. Then, 91 lncRNAs involved in 210 dynamic subpathways were obtained in follicles, and 17 lncRNAs participating in 234 dynamic subpathways were identified in granulosa cells. We observed subpathways associated with MAPK, Insulin, Ras, Autophagy, Hippo, Cell cycle, and mTOR were activated in follicles and granulosa cells during PFA (Figure 5A). When further ranking these lncRNAs by Formula 5 based on the hypergeometric p-value and the corresponding number of dynamic subpathways, the top 20 lncRNAs (20/91) in follicles (Table S2) and 17 lncRNAs in granulosa cells are shown in Figure 5B. The overlapping genes of lncRNA target genes with subpathway genes and POI genes are illustrated (Figure 5C). We found lncRNAs regulated genes involved in the POI phenomenon, reflecting how these lncRNAs might play an important role in female fertility. Among these POI-related genes in Figure 5C, CDKN1B, FMR1, FOXO3, NOBOX, and LARS2 was candidate pathogenesis for POI [31,32,33,34,35]. Moreover, we observed the target genes of CASC8 were associated with “Thermogenesis” by regulating ACTL6A. FGD5-AS1 was involved in the “MAPK_signaling_pathway (04010_21)” by regulating MAPK genes like CRKL, HSPA8, MAPK8, MAX, and NFATC3. FGD5-AS1 affected the expression of POI-related genes (EIF2B2, EXO1, FMR1, and FOXO3). These findings provided detailed lncRNA–subpathway connections during PFA.
Figure 5.
The connection between lncRNAs and subpathways during PFA. (A) Subpathway enrichment bubble plot during PFA, a larger p-value (−Log10) indicates a higher degree of enrichment. (B) Radar chart displaying the rank of lncRNAs in follicles (upper panel) and granulosa cells (lower panel) during PFA. The direction of the red circle represents the rank from largest to smallest value. Right heat map showing the expression level of top-rank lncRNAs in follicles and granulosa cells. (C) The predicted connection among lncRNAs, target genes, subpathway genes, and POI-related genes.
2.6. Validating the Function of lncRNAs Based on Subpathway Genes after lncRNA Knockdown
In our study, SBF2-AS1 was associated with the “Hippo signaling pathway (04390_1)” and “Regulation of actin cytoskeleton (04810_13)”. ZFAS1 was associated with the “mTOR signaling subpathway (04150_16)” and “Cell cycle (04110_3)” (Figure 5C). In Figure 2B, we can observe that SBF2-AS1 and ZFAS1 were not only expressed in ovarian tissues but also in liver and lung tissues. Through comprehensively searching and reviewing currently available lncRNA knockdown datasets, we obtained SBF2-AS1 knockdown data in the human A549 cell line (GSE103016) and ZFAS1 knockdown data in the human SK-Hep1 cell line (GSE104226) [36]. To further validate the potential function of SBF2-AS1 and ZFAS1, we explored the expression level of subpathway genes after the lncRNA knockdown (Figure 6). Consistently, we noticed that the knockdown of SBF2-AS1 and ZFAS1 induced down-regulation of corresponding subpathway genes. For example, DLG3, FBXW11, BTRC, PPP2R2A, RASSF6, and YAP1 belonged to the Hippo signaling pathway (04390_1). These genes were down-regulated after SBF2-AS1 knockdown, among which PPP2R2A and RASSF6 were target genes of SBF2-AS1 during PFA. These results confirmed the inferred connection between lncRNAs and subpathways.
Figure 6.
Heat map of genes associated with Hippo, regulation of actin cytoskeleton, mTOR, and cell cycle subpathways (A) in control and SBF2-AS1 knockdown A549 cell line (GSE103016) and (B) in control and ZFAS1 knockdown SK-Hep1 cell line (GSE104226). Genes in red are the target gene of lncRNA during PFA.
3. Discussion
LncRNAs are non-coding transcripts that have been studied extensively in the process of cell development [37]. Human follicles were divided into five stages based on follicular size and granulosa cell numbers, i.e., primordial follicles, primary follicles, secondary follicles, antral follicles, and preovulatory follicles [38,39]. PFA is necessary for follicle growth. However, systematic analysis of lncRNAs involved in PFA is still poorly understood. In this study, a comprehensive analysis of the expression level of lncRNAs and protein-coding genes in follicles and granulosa cells, we noticed that lncRNAs were more cell type-specific than protein-coding genes. Additionally, a growing number of studies have investigated the biological changes based on subpathways, which provided a clearer and more defined window to understand the biological process of follicle maturation. To decode the lncRNA mechanism during PFA, we obtained subpathways and further grouped lncRNA target genes into different clusters, where each gene cluster exhibited consistent expression patterns and high classification ability during PFA. The connection between subpathway genes and lncRNA was inferred based on enrichment analysis during PFA.
The development of follicle maturation is associated with multi-factorial and complex steps. Numerous crucial lncRNAs that modulated cell proliferation and differentiation have been identified. For example, SBF2-AS1 and MRVI1-AS1 were upregulated and expressed in follicles, whereas WT1-AS and ZFAS1 were highly expressed in granulosa cells during PFA (Figure 2A). Chen et al. found that the silencing of SBF2-AS1 reduced the proliferative ability of esophageal squamous cell carcinoma (ESCC) [40]. Tuo et al. observed that MRVI1-AS1 promoted the progression of hepatocellular carcinoma (HCC) [41]. Lou et al. reported that the G1-S phase transition increased after the knockdown of GAS5 in prostate cancer [42]. O’Brien indicated that the knockdown of ZFAS1 was associated with decreased cellular proliferation [43]. However, more studies are still required to ascertain lncRNA function during PFA. In this study, we investigated the development of follicles and granulosa cells by performing UMAP analysis and found that lncRNAs could more effectively discriminate cell type than protein-coding genes. We revealed the dynamic expression of lncRNAs in follicles and granulosa cells during PFA (Figure 2A), suggesting the complexity of lncRNAs in related biological processes. For example, we observed that SBF2-AS1, MRVI1-AS1, and CASC8 were highly expressed in follicles, whereas WT1-AS, THAP9-AS1, and GAS5 were upregulated in granulosa cells. SBF2-AS1 was found to be associated with cell proliferation and epithelial-to-mesenchymal transition (EMT) [44,45,46]. Wang et al. found the overexpression of GAS5 contributed to increased expression levels of IL-6 in human granulosa cells (KGN) [42]. Moreover, we divided lncRNA target genes into different clusters based on genes’ PCC similarity and classification performance. Distinct expression patterns of genes in each cluster suggested the functional diversity of lncRNAs in regulating gene expression.
Additionally, a body of studies reported that subpathways were more suitable for interpreting underlying biological processes. In this study, we obtained subpathways and first visualized the heterogeneity of activation patterns among pathways and subpathways using the ssGSEA score and specificity score. We noticed that the activation pattern of the subpathway in follicles showed higher heterogeneity than that in granulosa cells. Although subpathway activation had individual differences, some subpathways associated with cell cycle, mTOR, and Hippo were co-activated between granulosa cells and follicles during PFA. The predicted connection between lncRNAs and subpathways was further validated by lncRNA knockdown cell lines.
Currently, the rate of infertility is high all over the world. Oocyte maturation failure is a contributing factor to female infertility [47]. In vitro maturation (IVM) and ovarian stimulation technology have been proposed to treat oocyte maturation failure by activating dormant ovarian follicles to generate mature eggs for successful reproduction [48,49,50,51]. In this study, we found lncRNAs played an important role in PFA, offering a new avenue for inducing follicle maturation.
In summary, based on subpathway specificity and target gene expression consistency, we built a systematic approach to uncover lncRNA functions during PAF (Figure 7). Our subpathway analysis seemed to be the first implementation associated with follicle maturation. In parallel, the pipeline is also suitable for analyzing lncRNA functions in other patterns of cell fate determination.
Figure 7.
Workflow of identifying PFA-associated lncRNAs. The dark pink color in 1.3 indicates high expression lncRNAs, while the green color indicates low expression lncRNAs.
4. Materials and Methods
4.1. Data Collection and Pre-Processing
The expression dataset of human ovarian follicles and matched granulosa cells from primordial to preovulatory stages at single cell level was downloaded from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/, accessed on 15 September 2022) under accession number GSE107746 [1], including primordial follicles (n = 17), primary follicles (n = 25), secondary follicles (n = 12), antral follicles (n = 23), preovulatory follicles (n = 3), primordial granulosa cells (n = 8), primary granulosa cells (n = 15), secondary granulosa cells (n = 6), antral granulosa cells (n = 24), and preovulatory granulosa cells (n = 18). In this study, 18,797 protein-coding genes and 13,870 lncRNAs were annotated in GTF file downloaded from GENCODE database (https://www.gencodegenes.org/, v19, accessed on 10 March 2023). Fragments per kilobase of transcript per million mapped reads (FPKM) values of protein-coding genes and lncRNAs were normalized by log2(FPKM + 1).
The raw count datasets of 9 human tissues (heart, liver, spleen, lung, kidney, testis, vagina, uterus, and ovary) were downloaded from the Genotype-Tissue Expression (GTEx) database (v8, accessed on 29 March 2022) [52]. The read count datasets were normalized by log2(Count + 1).
4.2. Cell Clustering Analysis
R package “Seurat” v3.2.1 was used to analyze the transcriptome data of human ovarian follicles and matched granulosa cells [53]. UMAP (Uniform Manifold Approximation and Projection) was applied to reduce the dimensionality. We calculated the nFeature_RNA and standard deviation (sd) attributes in each cell based on the expression of protein-coding genes and lncRNAs, respectively.
4.3. Identifying Follicle Maturation Associated Protein-Coding Genes and lncRNAs
To identify protein-coding genes and lncRNAs associated with follicle maturation, two filtering criteria were applied: (1) protein-coding genes/lncRNAs with an average expression value ≥ 1 in any cell type and (2) expressed in ≥70% cells of any follicle or granulosa cell type. Then, from primordial to preovulatory stages, we identified unions of 7730 protein-coding genes and 1497 lncRNAs in follicles, unions of 6522 protein-coding genes, and 729 lncRNAs in granulosa cells. Function ‘FindMarkers’ in Seurat with “wilcox” test differential expression analysis method was used to obtain differentially expressed lncRNAs (Bonferroni adjusted p < 0.05).
4.4. Identification of lncRNA Target Gene Cluster
The 743,029 miRNA–mRNA and 31,344 miRNA–lncRNA interaction pairs in our study were obtained from miRNet database (https://www.mirnet.ca/, accessed on 21 March 2023) [54]. Then, to explore competitive interaction between lncRNA and mRNA, hypergeometric test was used to measure the significance of shared miRNAs between lncRNA and mRNA as follows:
(1) |
where m represents the total number of miRNAs in miRBase v22; t represents the number of miRNAs interacting with mRNAs; n represents the number of miRNAs interacting with the lncRNAs; and r represents the number of common miRNAs simultaneously shared by mRNAs and lncRNAs. We considered p-value < 0.05 to be statistically significant.
Given that lncRNA could regulate multiple genes, we divided lncRNA target genes into different clusters, as described in a previous study [55]. (1) Correlation analysis of related target genes was calculated using Pearson’s correlation coefficients (PCCs) in primordial and primary stages; (2) the hierarchical clustering was applied, where genes () with correlation coefficient above 0.7 were stratified into one group, while those cannot be grouped into any clusters were discarded; (3) the LIBSVM toolbox implemented in R package ‘e1071’ was adopted to perform the support vector machines (SVMs) classification to evaluate the classification performance of each gene clusters between primordial and primary stages; and (4) 5-fold cross validation with 100 repetitions were used to evaluate the classification performance. Subsequently, the mean of the 100 AUC scores was regarded as its final score. Target gene clusters of each lncRNA with average AUC scores larger than 0.8 were retained.
4.5. Obtaining Subpathway Based on k-Clique Algorithm
In this study, subpathways were extracted by R package psSubpathway [56]. In total, 725 subpathways were annotated from 238 pathways (Table S3).
4.6. Subpathways Activity Score
To identify subpathways with dynamic changes during primordial follicle activation, the activity score for subpathways was defined as follows:
(2) |
where n represents the number of genes in each subpathway, denotes the t-score of the ith gene in the subpathway calculated by Student’s t-test analysis for comparing the gene expression profiles between primordial stage and primary stage. Next, random selection of the same number of genes for each candidate pathway with 1000 repetitions was performed to calculate p-value of subpathway. In this study, 214 subpathways in follicles and 295 subpathways in granulosa cells (p-value < 0.05) were associated with PFA.
4.7. Cell Type-Specific Pathway Specificity
The pathway specificity score in each cell type was defined as follows:
(3) |
where represents the average expression level of subpathway i genes in cell type j, and represents the average expression level of subpathway i genes in all cells.
4.8. LncRNA Rank
After evaluating the importance of lncRNAs during primordial follicle activation, we ranked the lncRNAs by lncRNA rank (LR) score to evaluate their associations with dynamic subpathways. For each lncRNA, the score LR was defined as follows:
(4) |
(5) |
where denotes the p-value of overlapping mRNAs between subpathway i genes and the lncRNA target gene clusters calculated by hypergeometric test; denotes the activity score of subpathway i in Formula (2), and n represents the number of all subpathways regulated by this lncRNA.
4.9. Premature Ovarian Insufficiency (POI)-Related Geneset
A total of 127 POI–related genes (Table S4) in Zhao et al.’s study, which summarized genes from databases of NCBI Gene, OMIM, and DrugBank, were used [24].
4.10. Download lncRNA Knockdown Datasets
To further validate the potential function of SBF2-AS1 and ZFAS1 identified in this study, RNA-sequencing data analysis was conducted to infer the dynamic expression of subpathway genes after knockdown of SBF2-AS1 in human A549 cell line (GSE103016) and ZFAS1 in human SK-Hep1 cell line (GSE104226) [36].
4.11. Statistical Analysis
R software (www.rproject.org, version 3.6.3, accessed on 28 October 2020) was used for all statistical analyses. Wilcox test was used to identify differentially expressed lncRNAs. Student’s t-test was performed to uncover dynamic subpathways during PFA. Hypergeometric test was employed to detect the significance of shared miRNAs between lncRNAs and mRNAs, and shared mRNAs between subpathway genes and lncRNA target genes. p-value < 0.05 was used to indicate statistical significance in this study.
Acknowledgments
The results shown here are based upon data generated by Zhang et al., published in Molecular Cell, and Guo et al., published in Frontiers in Genetics.
Supplementary Materials
The supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms241310525/s1.
Author Contributions
L.L. conceived this study. L.Z. and J.Z. wrote the manuscript. L.Z. constructed the workflow and performed most of the analyses. J.Z. conducted the subpathway analysis and corrected grammar errors. Z.W. did the paper survey. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The log2-transformed FPKM of lncRNAs used to support this study was obtained in Table S1 (https://www.cell.com/cms/10.1016/j.molcel.2018.10.029/attachment/7d2c901a-6f57-40ce-a135-cd0842d719bc/mmc2.xlsx, accessed on 10 March 2023) of Zhang et al.’s research. The log2-transformed FPKM of genes was downloaded through GEO accession number GSE107746. The control and SBF2-AS1 knockdown A549 cell lines were obtained through GEO accession number GSE103016. The control and ZFAS1 knockdown SK-Hep1 cell lines were obtained through GEO accession number GSE104226.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This work was supported by the National Key R&D Program of China (2021YFA1102700 to L.L.), the National Natural Science Foundation of China (32070833 and 82101745 to L.L.), and the Natural Science Funds for Distinguished Young Scholar of Guangdong province (2022B1515020110 to L.L).
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Zhang Y., Yan Z., Qin Q., Nisenblat V., Chang H.-M., Yu Y., Wang T., Lu C., Yang M., Yang S., et al. Transcriptome Landscape of Human Folliculogenesis Reveals Oocyte and Granulosa Cell Interactions. Mol. Cell. 2018;72:1021–1034.e4. doi: 10.1016/j.molcel.2018.10.029. [DOI] [PubMed] [Google Scholar]
- 2.Suzumori N., Yan C., Matzuk M.M., Rajkovic A. Nobox is a homeobox-encoding gene preferentially expressed in primordial and growing oocytes. Mech. Dev. 2001;111:137–141. doi: 10.1016/S0925-4773(01)00620-7. [DOI] [PubMed] [Google Scholar]
- 3.Pangas S., Rajkovic A. Transcriptional regulation of early oogenesis: In search of masters. Hum. Reprod. Update. 2005;12:65–76. doi: 10.1093/humupd/dmi033. [DOI] [PubMed] [Google Scholar]
- 4.Zhao Y., Feng H., Zhang Y., Zhang J.V., Wang X., Liu D., Wang T., Li R.H.W., Ng E.H.Y., Yeung W.S.B., et al. Current Understandings of Core Pathways for the Activation of Mammalian Primordial Follicles. Cells. 2021;10:1491. doi: 10.3390/cells10061491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hu Y., Yuan D.-Z., Wu Y., Yu L.-L., Xu L.-Z., Yue L.-M., Liu L., Xu W.-M., Qiao X.-Y., Zeng R.-J., et al. Bisphenol A Initiates Excessive Premature Activation of Primordial Follicles in Mouse Ovaries via the PTEN Signaling Pathway. Reprod. Sci. 2018;25:609–620. doi: 10.1177/1933719117734700. [DOI] [PubMed] [Google Scholar]
- 6.Van Kasteren Y.M., Schoemaker J. Premature ovarian failure: A systematic review on therapeutic interventions to restore ovarian function and achieve pregnancy. Hum. Reprod. Update. 1999;5:483–492. doi: 10.1093/humupd/5.5.483. [DOI] [PubMed] [Google Scholar]
- 7.Dragojevic-Dikic S., Rakic S., Nikolic B., Popovac S. Hormone replacement therapy and successful pregnancy in a patient with premature ovarian failure. Gynecol. Endocrinol. 2009;25:769–772. doi: 10.3109/09513590903004126. [DOI] [PubMed] [Google Scholar]
- 8.Dekker J., Tr M., Me D., Js M. Long non-coding RNAs: Insights into functions. Nat. Rev. Genet. 2009;10:155–159. doi: 10.1038/nrg2521. [DOI] [PubMed] [Google Scholar]
- 9.Fatica A., Bozzoni I. Long non-coding RNAs: New players in cell differentiation and development. Nat. Rev. Genet. 2013;15:7–21. doi: 10.1038/nrg3606. [DOI] [PubMed] [Google Scholar]
- 10.Da Luz C.M., da Broi M.G., Donabela F.C., de Paz C.C.P., Meola J., Navarro P.A. PTGS2 down-regulation in cumulus cells of infertile women with endometriosis. Reprod. Biomed. Online. 2017;35:379–386. doi: 10.1016/j.rbmo.2017.06.021. [DOI] [PubMed] [Google Scholar]
- 11.Liu J., Wang F., Du L., Li J., Yu T., Jin Y., Yan Y., Zhou J., Gu J. Comprehensive Genomic Characterization Analysis of lncRNAs in Cells with Porcine Delta Coronavirus Infection. Front. Microbiol. 2020;10:3036. doi: 10.3389/fmicb.2019.03036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chen W., Lin C., Gong L., Chen J., Liang Y., Zeng P., Diao H. Comprehensive Analysis of the mRNA-lncRNA Co-expression Profile and ceRNA Networks Patterns in Chronic Hepatitis B. Curr. Genomics. 2019;20:231–245. doi: 10.2174/1389202920666190820122126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Maghsoudi Z., Nguyen H., Tavakkoli A., Nguyen T. A comprehensive survey of the approaches for pathway analysis using multi-omics data integration. Brief. Bioinform. 2022;23:bbac435. doi: 10.1093/bib/bbac435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kawatsu T. Review pathway analysis for peptide-mediated electronic coupling in the super-exchange mechanism of ET and EET. Biopolymers. 2012;100:100–113. doi: 10.1002/bip.22142. [DOI] [PubMed] [Google Scholar]
- 15.Li X., Yan M.L., Yu Q. Identification of candidate drugs for the treatment of metastatic osteosarcoma through a sub-pathway analysis method. Oncol. Lett. 2017;13:4378–4384. doi: 10.3892/ol.2017.5953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fang J., Li Z., Xu M., Ji J., Li Y., Zhang L., Chen Y. Identification and Functional Analysis of Individual-Specific Sub-pathways in Lung Adenocarcinoma. Genes. 2022;13:1122. doi: 10.3390/genes13071122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Huo Y., Shao S., Liu E., Li J., Tian Z., Wu X., Zhang S., Stover D., Wu H., Cheng L., et al. Subpathway Analysis of Transcriptome Profiles Reveals New Molecular Mechanisms of Acquired Chemotherapy Resistance in Breast Cancer. Cancers. 2022;14:4878. doi: 10.3390/cancers14194878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Fortune J., Cushman R., Wahl C., Kito S. The primordial to primary follicle transition. Mol. Cell. Endocrinol. 2000;163:53–60. doi: 10.1016/S0303-7207(99)00240-3. [DOI] [PubMed] [Google Scholar]
- 19.Zhang Y., Zhou X., Zhu Y., Wang H., Xu J., Su Y. Current mechanisms of primordial follicle activation and new strategies for fertility preservation. Mol. Hum. Reprod. 2021;27:gaab005. doi: 10.1093/molehr/gaab005. [DOI] [PubMed] [Google Scholar]
- 20.Yang S., Wang S., Luo A., Ding T., Lai Z., Shen W., Ma X., Cao C., Shi L., Jiang J., et al. Expression Patterns and Regulatory Functions of MicroRNAs During the Initiation of Primordial Follicle Development in the Neonatal Mouse Ovary. Biol. Reprod. 2013;89:126. doi: 10.1095/biolreprod.113.107730. [DOI] [PubMed] [Google Scholar]
- 21.Bouckenheimer J., Fauque P., Lecellier C.-H., Bruno C., Commes T., Lemaître J.-M., De Vos J., Assou S. Differential long non-coding RNA expression profiles in human oocytes and cumulus cells. Sci. Rep. 2018;8:1–13. doi: 10.1038/s41598-018-20727-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhang K., Shi Z.-M., Chang Y.-N., Hu Z.-M., Qi H.-X., Hong W. The ways of action of long non-coding RNAs in cytoplasm and nucleus. Gene. 2014;547:1–9. doi: 10.1016/j.gene.2014.06.043. [DOI] [PubMed] [Google Scholar]
- 23.Dykes I.M., Emanueli C. Transcriptional and Post-transcriptional Gene Regulation by Long Non-coding RNA. Genom. Proteom. Bioinform. 2017;15:177–186. doi: 10.1016/j.gpb.2016.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhao H., Shan Y., Ma Z., Yu M., Gong B. A network pharmacology approach to explore active compounds and phar-macological mechanisms of epimedium for treatment of premature ovarian insufficiency. Drug Des. Devel. Ther. 2019;13:2997–3007. doi: 10.2147/DDDT.S207823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Li C., Han J., Yao Q., Zou C., Xu Y., Zhang C., Shang D., Zhou L., Zou C., Sun Z., et al. Subpathway-GM: Identification of metabolic subpathways via joint power of interesting genes and metabolites and their topologies within pathways. Nucleic Acids Res. 2013;41:e101. doi: 10.1093/nar/gkt161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Li C., Li X., Miao Y., Wang Q., Jiang W., Xu C., Li J., Han J., Zhang F., Gong B., et al. SubpathwayMiner: A software package for flexible identification of pathways. Nucleic Acids Res. 2009;37:e131. doi: 10.1093/nar/gkp667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Granados-Aparici S., Hardy K., Franks S., Sharum I.B., Waite S.L., Fenwick M.A. SMAD3 directly regulates cell cycle genes to maintain arrest in granulosa cells of mouse primordial follicles. Sci. Rep. 2019;9:1–12. doi: 10.1038/s41598-019-42878-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhang T., He M., Zhao L., Qin S., Zhu Z., Du X., Zhou B., Yang Y., Liu X., Xia G., et al. HDAC6 regulates primordial follicle activation through mTOR signaling pathway. Cell Death Dis. 2021;12:1–12. doi: 10.1038/s41419-021-03842-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kehoe S., Jewgenow K., Johnston P.R., Mbedi S., Braun B.C. Signalling pathways and mechanistic cues highlighted by transcriptomic analysis of primordial, primary, and secondary ovarian follicles in domestic cat. Sci. Rep. 2021;11:2683. doi: 10.1038/s41598-021-82051-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ting A.Y., Zelinski M.B. Characterization of FOXO1, 3 and 4 transcription factors in ovaries of fetal, prepubertal and adult rhesus macaques†. Biol. Reprod. 2017;96:1052–1059. doi: 10.1093/biolre/iox034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ojeda D., Lakhal B., Fonseca D.J., Braham R., Landolsi H., Mateus H.E., Restrepo C.M., Elghezal H., Saâd A., Laissue P. Sequence analysis of the CDKN1B gene in patients with premature ovarian failure reveals a novel mutation potentially related to the phenotype. Fertil. Steril. 2011;95:2658–2660.e1. doi: 10.1016/j.fertnstert.2011.04.045. [DOI] [PubMed] [Google Scholar]
- 32.Lu C., Lin L., Tan H., Wu H., Sherman S.L., Gao F., Jin P., Chen D. Fragile X premutation RNA is sufficient to cause primary ovarian insufficiency in mice. Hum. Mol. Genet. 2012;21:5039–5047. doi: 10.1093/hmg/dds348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Meng X., Peng L., Wei X., Li S. FOXO3 is a potential biomarker and therapeutic target for premature ovarian insufficiency (Review) Mol. Med. Rep. 2022;27:1–8. doi: 10.3892/mmr.2022.12921. [DOI] [PubMed] [Google Scholar]
- 34.Yao C., Lu L., Ji Y., Zhang Y., Li W., Shi Y., Liu J., Sun M., Xia F. Hypo-Hydroxymethylation of Nobox is Associated with Ovarian Dysfunction in Rat Offspring Exposed to Prenatal Hypoxia. Reprod. Sci. 2022;29:1424–1436. doi: 10.1007/s43032-022-00866-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Neyroud A.S., Rudinger-Thirion J., Frugier M., Riley L.G., Bidet M., Akloul L., Simpson A., Gilot D., Christodoulou J., Ravel C., et al. LARS2 variants can present as premature ovarian insufficiency in the absence of overt hearing loss. Eur. J. Hum. Genet. 2022;31:453–460. doi: 10.1038/s41431-022-01252-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Guo Z.-W., Meng Y., Zhai X.-M., Xie C., Zhao N., Li M., Zhou C.-L., Li K., Liu T.-C., Yang X., et al. Translated Long Non-Coding Ribonucleic Acid ZFAS1 Promotes Cancer Cell Migration by Elevating Reactive Oxygen Species Production in Hepatocellular Carcinoma. Front. Genet. 2019;10:1111. doi: 10.3389/fgene.2019.01111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Yang Q., Wan Q., Zhang L., Li Y., Zhang P., Li D., Feng C., Yi F., Zhang L., Ding X., et al. Analysis of LncRNA expression in cell differentiation. RNA Biol. 2018;15:413–422. doi: 10.1080/15476286.2018.1441665. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dunlop C.E., Anderson R.A. The regulation and assessment of follicular growth. Scand. J. Clin. Lab. Investig. 2014;74:13–17. doi: 10.3109/00365513.2014.936674. [DOI] [PubMed] [Google Scholar]
- 39.Barnett K., Schilling C., Greenfeld C., Tomic D., Flaws J. Ovarian follicle development and transgenic mouse models. Hum. Reprod. Update. 2006;12:537–555. doi: 10.1093/humupd/dml022. [DOI] [PubMed] [Google Scholar]
- 40.Chen R., Xia W., Wang X., Qiu M., Yin R., Wang S., Xi X., Wang J., Xu Y., Dong G., et al. Upregulated long non-coding RNA SBF2-AS1 promotes proliferation in esophageal squamous cell carcinoma. Oncol. Lett. 2018;15:5071–5080. doi: 10.3892/ol.2018.7968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tuo H., Liu R., Wang Y., Yang W., Liu Q. Hypoxia-induced lncRNA MRVI1-AS1 accelerates hepatocellular carcinoma progression by recruiting RNA-binding protein CELF2 to stabilize SKA1 mRNA. World J. Surg. Oncol. 2023;21:1–15. doi: 10.1186/s12957-023-02993-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Luo G., Liu D., Huang C., Wang M., Xiao X., Zeng F., Wang L., Jiang G. LncRNA GAS5 Inhibits Cellular Proliferation by Targeting P27Kip. Mol. Cancer Res. 2017;15:789–799. doi: 10.1158/1541-7786.mcr-16-0331. [DOI] [PubMed] [Google Scholar]
- 43.O’brien S.J., Fiechter C., Burton J., Hallion J., Paas M., Patel A., Patel A., Rochet A., Scheurlen K., Gardner S., et al. Long non-coding RNA ZFAS1 is a major regulator of epithelial-mesenchymal transition through miR-200/ZEB1/E-cadherin, vimentin signaling in colon adenocarcinoma. Cell Death Discov. 2021;7:61. doi: 10.1038/s41420-021-00427-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Zhang Y.-T., Li B.-P., Zhang B., Ma P., Wu Q.-L., Ming L., Xie L.-M. LncRNA SBF2-AS1 promotes hepatocellular carcinoma metastasis by regulating EMT and predicts unfavorable prognosis. Eur. Rev. Med. Pharmacol. Sci. 2018;22:6333–6341. doi: 10.26355/eurrev_201810_16044. [DOI] [PubMed] [Google Scholar]
- 45.Lv J., Qiu M., Xia W., Liu C., Xu Y., Wang J., Leng X., Huang S., Zhu R., Zhao M., et al. High expression of long non-coding RNA SBF2-AS1 promotes proliferation in non-small cell lung cancer. J. Exp. Clin. Cancer Res. 2016;35:1–13. doi: 10.1186/s13046-016-0352-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hussain S.A., Venkatesh T. YBX1/lncRNA SBF2-AS1 interaction regulates proliferation and tamoxifen sensitivity via PI3K/AKT/MTOR signaling in breast cancer cells. Mol. Biol. Rep. 2023;50:3413–3428. doi: 10.1007/s11033-023-08308-5. [DOI] [PubMed] [Google Scholar]
- 47.Sang Q., Zhou Z., Mu J., Wang L. Genetic factors as potential molecular markers of human oocyte and embryo quality. J. Assist. Reprod. Genet. 2021;38:993–1002. doi: 10.1007/s10815-021-02196-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Grynberg M., Jacquesson L., Sifer C. In vitro maturation of oocytes for preserving fertility in autoimmune premature ovarian insufficiency. Fertil. Steril. 2020;114:848–853. doi: 10.1016/j.fertnstert.2020.04.049. [DOI] [PubMed] [Google Scholar]
- 49.La Marca A., Mastellari E. Fertility preservation for genetic diseases leading to premature ovarian insufficiency (POI) J. Assist. Reprod. Genet. 2021;38:759–777. doi: 10.1007/s10815-021-02067-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Pinelli S., Artini P.G., Basile S., Obino M.E.R., Sergiampietri C., Giannarelli D., Simi G., Cela V. Estrogen treatment in infertile women with premature ovarian insufficiency in transitional phase: A retrospective analysis. J. Assist. Reprod. Genet. 2017;35:475–482. doi: 10.1007/s10815-017-1096-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kawamura K., Cheng Y., Suzuki N., Deguchi M., Sato Y., Takae S., Ho C.-H., Kawamura N., Tamura M., Hashimoto S., et al. Hippo signaling disruption and Akt stimulation of ovarian follicles for infertility treatment. Proc. Natl. Acad. Sci. USA. 2013;110:17474–17479. doi: 10.1073/pnas.1312830110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Consortium G. Tex, The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Stuart T., Butler A., Hoffman P., Hafemeister C., Papalexi E., Mauck W.M., 3rd, Hao Y., Stoeckius M., Smibert P., Satija R. Comprehensive Integration of Single-Cell Data. Cell. 2019;177:1888–1902.e21. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Chang L., Zhou G., Soufan O., Xia J. miRNet 2.0: Network-based visual analytics for miRNA functional analysis and systems biology. Nucleic Acids Res. 2020;48:W244–W251. doi: 10.1093/nar/gkaa467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Zhao X.-M., Liu K.-Q., Zhu G., He F., Duval B., Richer J.-M., Huang D.-S., Jiang C.-J., Hao J.-K., Chen L. Identifying cancer-related microRNAs based on gene expression data. Bioinformatics. 2014;31:1226–1234. doi: 10.1093/bioinformatics/btu811. [DOI] [PubMed] [Google Scholar]
- 56.Han J., Han X., Kong Q., Cheng L. psSubpathway: A software package for flexible identification of phenotype-specific sub-pathways in cancer progression. Bioinformatics. 2020;36:2303–2305. doi: 10.1093/bioinformatics/btz894. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The log2-transformed FPKM of lncRNAs used to support this study was obtained in Table S1 (https://www.cell.com/cms/10.1016/j.molcel.2018.10.029/attachment/7d2c901a-6f57-40ce-a135-cd0842d719bc/mmc2.xlsx, accessed on 10 March 2023) of Zhang et al.’s research. The log2-transformed FPKM of genes was downloaded through GEO accession number GSE107746. The control and SBF2-AS1 knockdown A549 cell lines were obtained through GEO accession number GSE103016. The control and ZFAS1 knockdown SK-Hep1 cell lines were obtained through GEO accession number GSE104226.