Abstract
LncRNA is reported to have important role in diabetic nephropathy (DN). Here, we aim to identify key lncRNAs of DN using bioinformatics and systems biological methods. Method: Five microarray data sets from Gene Expression Omnibus (GEO) database were included. Probe sets were re-annotated. In the training set, differential expressed genes (DEGs) were identified. Weighted gene co-expression network analysis (WGCNA) was constructed to screen diabetic-related hub genes and reveal their potential biological function. Two more human data sets and mouse data sets were used as validation sets. Results: A total of 424 DEGs, including 10 lncRNAs, were filtered in the training data set. WGCNA and enrichment analysis of hub genes showed that inflammation and metabolic disorders are prominent in DN. Three key lncRNAs (NR_130134.1, NR_029395.1 and NR_038335.1) were identified. These lncRNAs are also differently expressed in another two human data sets. Functional enrichment of the mouse data sets showed consistent changes with that in human, indicating similar changes in gene expression pattern of DN and confirmed confidence of our analysis. Human podocytes and mesangial cells were culture in vitro. QPCR and fluorescence in situ hybridization were taken out to validate the expression and relationship of key lncRNAs and their related mRNAs. Results were also consistent with our analysis. Conclusions: Inflammation and metabolic disorders are prominent in DN. We identify three lncRNAs that are involved in these processes possibly by interacting with co-expressed mRNAs.
Introduction
Chronic kidney disease (CKD) has become a worldwide public health problem which is associated with high mortality and heavy economic burden1. As the incidence of diabetes increasing, diabetic nephropathy (DN), the major microvascular complication of diabetes mellitus, is becoming the major cause of end-stage renal disease especially in developed countries2. Certain advances about the molecular mechanism of DN have been made during last decades. However, our understanding about the disease is still limited and the treatment is mainly based on control of blood pressure and glucose, and blocking of the renin-angiotensin system3. Previous studies were mainly based on technology of molecular biology which only focused on limited number of molecules. During DN, lots of genes will change in a coordinate way. As a result, genomic approaches are needed to describe the changes in gene expression profiles.
Long noncoding RNA (lncRNA) is a class of transcripts that are longer than 200 nucleotides and cannot coding proteins. The mammalian genomes have plenty of lncRNAs which are engaged in different biological processes by participating DNA methylation, mRNA transcription, alternative splicing, maturation, decay and translation4,5. LncRNAs are often poorly conserved among different species. Previous studies showed some important lncRNAs may be involved in DN, including metastasis-associated lung adenocarcinoma transcript 1 (MALAT1), plasmacytoma variant translocation 1 (PVT1) and taurine up-regulated gene 1 (TUG1)6–8. However, none of the studies analyzed those using methods of bioinformatics and systems biology.
The goal of this study was to identify key lncRNAs and their roles in DN from the aspect of whole gene transcripts analysis. We used data downloaded from Gene Expression Omnibus (GEO) database. A human microarray data set was used as training set. Differential expressed genes (DEGs) were screened. Weighted gene co-expression network analysis (WGCNA), a widely used systems biological method for finding gene sets that are highly correlated with clinical traits, was constructed to screen gene modules that were highly co-regulated and closely related with DN within in modules. Then gene co-expression analysis was used to find hub lncRNAs and reveal their potential biological function by showing lncRNA-mRNA interactions. Two more human data sets and mouse data sets were used as validation sets. We also validated the expression and relationship of the screened lncRNAs and mRNAs using qPCR and Fluorescence in situ hybridization (FISH). To the best of our knowledge, it is the first time of analyzing combined DN data sets to locate key lncRNAs using bioinformatics and systems biological methods.
Results
Multiple data sets were included
Finally, 5 series (GSE30122, GSE47183, GSE47184, GSE20636 and GSE33744, Table 1) were screened out according to our filter criteria9–14. Three of them (GSE30122, GSE47183 and GSE47184) used human samples while the other two of them (GSE20636 and GSE33744) used mouse samples. GSE30122 was used as a training set which contains 10 diabetic tubuli (TD), 24 control tubuli (TC), 9 diabetic glomeruli (GD) and 26 control glomeruli (GC). The other four were used as validation data set.
Table 1.
Series | Platform | Sample size | Diabetic Nephropathy | Tissue | Species | |||
---|---|---|---|---|---|---|---|---|
Diabetic | Control | Glomeruli | Tubuli | Whole Kidney | ||||
GSE30122 | GPL571 | 69 | 19 | 50 | 35 | 34 | — | Homo sapiens |
GSE47183 | GPL11670, GPL14663 | 122 | 14 | — | 14 | — | — | Homo sapiens |
GSE47184 | GPL11670, GPL14663 | 107 | 18 | 4 | — | 22 | — | Homo sapiens |
GSE20636 | GPL1261 | 35 | 19 | 16 | — | — | 35 | Mus musculus |
GSE33744 | GPL1261 | 39 | 21 | 18 | 39 | — | — | Mus musculus |
Differential expressed mRNAs and lncRNAs were screened out
A modified lncRNA reannotation pipeline was used to re-annotate probe sets in each data set (Fig. 1). Characterizations of expression profiles within different groups of the training data set were shown in Fig. 2. Principle component analysis gives us an overview of the internal structure of sample data in four groups. DEGs between TD and TC, GD and GC were shown in volcano plots. DEGs among 4 groups were illustrated and clustered in heatmap. After intersecting DEGs in both TD and GD compared with TC and GC, and removing potential irrelevant genes, 424 genes, including 10 lncRNAs, were finally filtered and selected for subsequent analysis. Venn plot was shown in Fig. 2E. DELs and DEGs were shown in Supplementary Tables S1 and S2.
WGCNA was used to find out diabetic-associated gene modules
Samples were first clustered to remove outlined samples. (A dendrogram was shown in Supplementary Fig. S1). WGCNA was then performed to identify genes that were highly co-expressed across groups. Modules were randomly color-labeled. A total of 24 gene modules were identified (Fig. 3A). Then module eigengene were analyzed for correlation with tissue or disease traits. Module eigengene-to-condition correlation heatmap were shown in Fig. 3B. Since the data set includes samples from both glomeruli and tubuli, we analyzed which gene modules were clustered due to tissue differences. Figure 3C illustrate the gene significance of each module with tissue and the most significant ones were pink (r = −0.7, p = 7e-10) and turquoise (r = −0.85, p = 2e-17). As for diabetic phenotype, the top three correlated modules are lightcyan (r = 0.75, p = 8e-12), black (r = 0.74, p = 2e-11) and purple (r = 0.67, p = 6e-09) (Fig. 3D). Interestingly, their correlations with tissue are relatively weak, indicating important roles in DN process. As a result, they were selected as module of interest and subjected to following analysis. To verify the robustness of WGCNA, we analyzed the module conservation. The multiple scaling plot (Fig. 4A) shows gene clusters labeled by module colors. Topological overlap matrix (TOM) plot were constructed to show the pairwise gene correlation within each module (Fig. 4B). The stability of modules was calculated. Preservation median rank and preservation Z summary score of all modules were presented in Fig. 4C,D. The three diabetic-related modules are relatively stable since their median rank preservation was relatively low and the Z summary statistics were above the threshold of 10. Exact value could be found in Supplementary Table S3.
Hub genes and CNC network construction reveal three key lncRNAs
In selected modules, genes were ranked using formula described in method part and the top 20 were considered as hub genes. CNC network and GO enrichment analysis of hub genes were constructed. Figure 5A,C,E showed that each network contains at least one lncRNA which forms intensive connections with other mRNAs. In lightcyan module (A, B), NR_110501.1 is the key lncRNA. Regulation of protein metabolic process is prominently enriched. In black module (C,D), lncRNA NR_046035.1, NR_130134.1 and NR_029395.1 are involved. DEGs are mostly enriched in immune-related processes, including immune response, inflammatory response and innate immune response. The purple module (E, F) involves lncRNA NR_104036.1, NR_031766.2 and NR_038335.1. Metabolism (cellular response to nutrient levels, response to nutrient levels, cellular response to starvation, protein maturation and response to starvation) and stress (cellular response to extracellular stimulus, cellular response to external stimulus, response to extracellular stimulus and response to acid chemical) -related pathways are intensively enriched. Different functional enrichment indicates that these lncRNAs participated in different biological processes by interacting with various mRNAs. As we noticed that three of the lncRNAs (NR_130134.1, NR_029395.1 and NR_038335.1) were also the DELs shown in Supplementary Table S1, indicating their important roles in DN. Detailed information about the functional enrichment could be found in Supplementary Table S4.
DELs validation in human data sets
To further validate the three key lncRNAs screened from the training data set, we compared them with the DELs in the validation data sets (GSE47183 and GSE47184). Batch effects of expression values were removed (see Supplementary Fig. S2). Box plots of DELs were shown in Fig. 6, among which, the three key lncRNAs of the training data set showed up again.
Changes of gene expression pattern are similar in human and mouse data sets
LncRNAs are poorly conserved in different species. It is difficult to compare lncRNA directly between different species. As a result, we compared DEGs and the gene function changes between human data sets and two mouse data sets (GSE20636 and GSE33744). Differential expression analysis was carried out and 172 recurrent DEGs in mouse data sets were filtered (see Supplementary Table S5). To further explore the similarity of DEGs identified in human and mouse, functional enrichment analysis was also performed. Significant enriched terms or pathways were illustrated in Fig. 7 and also listed in Supplementary Tables S6 and S7. Identical pathways between species were represented in red which indicated similar changes in gene expression pattern of DN and showed confidence of our analysis. A,C,E and G are results of human and B,D,F and H are results of mouse.
Validation of the DELs and related mRNAs
The three key lncRNAs were involved in two networks as shown in Fig. 5. Related genes that were differentially expressed in DN glomerulus were chosen for further validation. We cultured HPCs (human podocytes) and HMCs (human mesangial cells) in vitro and stimulated them with high glucose. Results of qPCR showed that expression of three lncRNAs and their co-expressed genes was similar with results of microarray (Fig. 8A). Next, we transfected siRNAs targeting lncRNA NR_130134.1 and NR_029395.1 into HMCs to further validate their relationship with other DEGs. Generally suppressed expression of most mRNAs in siRNA group suggested the central roles of lncRNAs (Fig. 8B,C). FISH was used to show the expression and localization of three lncRNAs in high glucose-stimulated HMCs (Fig. 8D).
Discussion
At present, high-throughput microarray technology has been used to identify changes in gene expression profiles associated with the onset and progression of DN15,16. During DN, functionally related genes have similar changes in their expression patterns17. However, most studies focused on individual differential expressed genes and ignored the changes in gene interconnection networks. Genome wide transcript analysis can descript changes in gene expression pattern and now has been widely used in disease studies18,19.
LncRNA is a class of molecules that have been reported to participate in the onset and progression of DN6,20,21. Previous studies mainly showed their functions using molecular biologic method. LncRNAs could form complex networks with plenty of genes. Understanding the molecular interactions among key lncRNAs and mRNAs are essential for diagnosis and treatment of DN. As a result, it is necessary to use systems biology method in lncRNA screening and function interpretation of genome-wide expression data sets. However, multiple issues, including different disease staging, system error due to different research batches, different species used in different studies and lack of mouse models that would faithfully simulate changes of DN will lead to different conclusions22. Therefore, a comprehensive analysis of data containing multiple data sets from different species would be necessary.
In this study, we included five GEO data sets. Each of them contains samples more than 30 and derived from human or mice kidney tissue. Among them, gene expression profiling of GSE30122 that includes both tubular and glomerular samples of control or DN patients was used as training dataset. PCA showed that different groups clustered differently, indicating changes in gene expression profiling. As we expected, large number of DEGs were found, including 10 DELs.
WGCNA is a systems biology method used for finding modules of highly correlated genes and identifying phenotype related gene clusters. It is widely used in identifying the correlations between gene sets and clinical characteristics23. By constructing a gene co-expression network, we can find intra-modular hub genes and provide global information about hub gene-associated expression profiles as well as gene functional relationship. Here we use WGCNA to identify gene modules that showed highly correlation patterns with diabetic nephropathy across four groups (DG, TG, GC, TC). A total of 24 modules were identified. To find out modules that best correlated with diabetic state, the module-to-condition correlations were constructed and visualized as heatmap. Glomeruli and tubuli exhibit different expression profiles due to different cellular inhabitants. As a result, it is not surprising that some modules are associated with tissue difference. However, when tissue-caused variations are small, several modules are still significantly correlated with diabetic phenotype. As we know, glomerular damage is significant in diabetic nephropathy. Our results show that some gene modules change in both glomeruli and tubuli, suggesting diabetic tubuli damage should be focused in future study. The top three diabetic-associated modules are colored by lightcyan, purple and black.
Next, we analyzed the network connectivity and module conservation. TOM analysis was used which could give us a reduced view of the whole network. The connectivity of a gene is the sum of its adjacency with other genes. Nodes that are highly interconnected show high topological overlap and are represented by a darker color shade. The rows and columns are ordered by the WGCNA color bar which is shown on the left and top of the TOM plot. Preservation median rank and preservation Z summary score of all modules were presented and results suggested those three modules showed acceptable conservation. As a result, they are selected as module of interest for subsequent analyses.
Genes within a module are closely related in function. The lncRNAs involved is therefore supposed to have the similar functions. To screen the key lncRNAs, we constructed expression network and functional enrichment. In the three modules that are significantly associated with diabetic trait, a total of seven hub lncRNAs were found. From the enrichment analysis, we could see that these lncRNAs in each module participated in different biological processes. DN is known to be a metabolic problem such as energy, protein and lipid disorder24,25. Clinical trials have suggested that nutritional and dietary interventions in DN are an essential aspect of management and patients will benefit from a strict dietary protein and carbohydrates control26. Our results of function enrichment of modules also indicated an important role of nutrition and metabolic process. LncRNAs (especially NR_110501.1, NR_104036.1, NR_031766.2 and NR_038335.1) seem to be a participator of these processes. Besides metabolism problems, immune and inflammatory disorders are also believed to be involved27. Complement and immunoglobulins sometimes can also be detected in diseased glomeruli of DN28. However, due to serious side effects, usage of immunosuppressive drugs is limited in DN therapy. Recently, studies indicated that lncRNA might be a promising novel target for managing DN inflammation20. Here we show that, besides mRNAs, several lncRNAs (NR_046035.1, NR_130134.1 and NR_029395.1) are involved in immune and inflammatory responses of DN, which might hopefully provide us new targets for DN treatment. By comparing the hub genes with the DELs, we found three identical genes: NR_130134.1, NR_029395.1 and NR_038335.1, which might be the key lncRNAs in DN onset.
Then we searched for the DELs in other two human data sets which contain three groups: GD, TC and TD. A total of 14 lncRNAs were screened out. After removing the batch effect, the differences still existed (see Supplementary Fig. S2). What’s worth noticing is that the three key lncRNAs also shows up here, suggesting the robustness of these differences among data sets.
To testify the accuracy of our results, we analyzed two data sets of mice. A total of 172 recurrent DEGs showed up. However, no consistent lncRNAs was reappeared. The reason might involve that lncRNAs are poorly conserved and their sequences among different species are very different. Then we compared the functional enrichment of DEGs. Results showed some identical pathways which were highlighted in red font. These validated the robustness of gene changes found in human data sets and proved the mechanism of DN might be similar among species. However, major differences in gene functions still exist. As a result, when we interpret mechanism of DN using mouse models, we should consider species differences.
HPCs and HMCs are two main kinds of parenchymal cells that have important function in DN. As a result, we cultured them for experimental validation. Results of qPCR and FISH showed that most of the lncRNAs and mRNAs had similar expression pattern with that of microarray, although some might express differently in different cells. Gene interference results also suggested that lncRNAs had close relationship with the expression of related mRNAs. These results indicated that lncRNAs may regulate expression of mRNAs in a direct or indirect way which worth further study.There are some limitations in this study. The data sets included are all microarray data with relatively small sample size. With the development of RNA sequencing technology, the results need to be validated. In another way, lncRNA was defined by reannotation and comparing with lncRNA database. This may lead to some inaccuracy which could not be avoided at present. As a result, more studies would be needed in the future.
In summary, the key finding of this study is that we identified diabetic-related gene expression profiles using bioinformatics method. Inflammation and metabolic disorders seem to be much more prominent biological changes in DN. Three key lncRNAs was found and validated using different methods and in different data sets. They may play important roles in DN by regulating inflammation and metabolism. This may help us understanding the mechanism of DN and provide us new biomarkers for diagnosis, risk stratification and prognosis prediction in DN patients.
Materials and Methods
Material and data
We used “diabetic nephropathy” and “diabetic kidney disease” as key words to retrieve data sets from the National Center of Biotechnology Information (NCBI) GEO database (https://www.ncbi.nlm.nih.gov/geo/), and found 59 potential data series. Data series were reviewed one after another using exclusion criteria below: (1) Sample size larger than 30; (2) Expression profiles derived from kidney tissue, not serum or cell lines; (3) Uniform platform used for quantification; (4) Organisms should be Homo sapiens or Mus musculus.
LncRNA reannotation
A modified lncRNA reannotation pipeline (Fig. 1) was used to re-annotate probe sets in each data set. In summary, four different microarray platforms were used, which were Affymetrix Human Genome U133 Plus 2.0 Array (HGU133Plus2), Affymetrix Human Genome U133A array (HGU133A), Affymetrix Human Geonme U133A 2.0 Array (HGU133A_2) and Affymetrix Mouse Genome 430 2.0 Array (MG430_2). After reannotation, 6160 lncRNAs in HGU133A, 6140 lncRNAs in HGU133A_2, 15268 lncRNAs in HGU133Plus2, and 3882 lncRNAs in MG430_2 were identified.
Differential expression analysis
Raw probe intensities were normalized and background adjusted by using LIMMA R package29. According to the sample information, batch effectors were identified and removed using R sva package30. DEGs were identified based on fold changes and P values. A fold change threshold of ≥2.0 and P value ≤ 0.05 were set for up- and down regulated genes. Differential expression analysis was performed between all comparative conditions. Functional enrichment analyses including Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were assessed using the DAVID database (https://david.abcc.ncifcrf.gov/)31. For functional enrichment analysis of DEGs, the background was set to the total list of genes expressed in humans. The statistically significant threshold level for enrichment analyses was P ≤ 0.05 (Benjamini and Hochberg corrected for multiple comparisons). Hierarchical clustering was performed to determine the expression patterns of DEGs using R package gplots. Principle component analysis (PCA) were also performed using function princomp() and visualized using scatterplot3d package. R 3.3.1 was used for data manipulation and visualization.
WGCNA
WGCNA is a biology method to identify genes with similar expression variation trend across different samples. Samples clustering were performed to demonstrate the relationship between expression profile and clinical traits. After raw data preprocessing, weighted gene co-expressison analysis were performed to identify significant gene modules related to DN according to a previously described algorithm32. Probe sets were first filtered based on the variance of expression value across all samples. Probe sets with duplicated gene symbols were also removed based on expression variance. Totally 8450 genes were finally selected. The R package WGCNA23 was applied for this analysis. Briefly, Person’s correlation coefficients were calculated for selected genes in a pairwise manner yielding a similarity matrix (Sij). The matrix was transformed into an adjacency matrix (aij) using a power function using formula aij = Power (Sij, β) ≡ |Sij| β. Average linkage hierarchical clustering was then performed to identify modules of densely interconnected genes. Genes that were not assigned to specific modules were assigned the color grey. Next, genes within significantly related modules were submitted to online tool DAVID for functional enrichment analysis. All data manipulations were carried out in R 3.3.1.
Hub gene screening and CNC (coding-non-coding network) construction
Genes within significant related modules were ranked using formula , in which let T_s be a shortest path tree rooted at s. p_s (v) = 1 if more than |V(T_s)|/4 paths from s to other vertices in T_s meet at the vertex v; otherwise p_s (v) = 0. The top 20 genes were considered as hub genes. CNC network of related modules was constructed using WGCNA output and visualized using Cytoscape version 3.533.
Validation of DELs (differential expressed lncRNAs)
To validate DELs discovered in the training data set, two data series GSE47183 and GSE47184 were used. After batch effects removal, the expression of important lncRNAs discovered above (DELs in training data set and hub lncRNAs in CNC network) were validated in the two validation sets.
Validation of DEGs in mice
To validate the DEGs identified in human, differential expression analysis was carried out in two data series in mouse (Using the same method above). Finally, 172 recurrent DEGs were filtered. To further explore the similarity of DEGs identified in human and mouse, functional enrichment analysis was also performed with those 172 DEGs.
Cell culture and stimulation
HPCs and HMCs were generous gift of Prof. Fan Yi from Shandong University of China. Cells were maintained in DMEM (glucose concentration was 5.6 mM) with 10% fetal bovine serum at 37 °C with 5% CO2. Cells were passaged when they were at 80% convergence. High glucose (final concentration of 40 mM for 24 h) was used as a stimulator while mannitol of the same concentration was added as osmolality control.
RNA extranction and qPCR
Cells were collected and total RNA were extracted using TRIzol reagent (Invitrogen, Carlsbad, CA, USA). Reverse transcription of mRNA was carried out using HiScript Q RT SuperMix for qPCR (Vazyme, Nanjing, China) according to the manufacturer’s instructions. Relative quantity of lncRNAs or mRNAs was measured by SYBR Green PCR assays by AceQTM qPCR SYBR Green Master Mix (Novland BioPharma, Shanghai, China). GAPDH was used as internal control. Primers were synthesized by GenePharma (Shanghai, China). Primer sequences: NR_130134.1: Forward 5′-TTGCATCCCAGCCCTATTCCTG-3′ Reverse 5′-GCACCCTTGACATTTCCTGTACC-3′; NR_038335.1:Forward 5′-GCTTCCCACACTTCTGACTCG-3′ Reverse 5′-TCAAAGAGCTTCACGACCACC-3′; NR_029395.1: Forward 5′-GGACCCAGCTCACCGTTTTAG-3′ Reverse 5′-CTCCCGTTTACTCCCCGCTA-3′; CD163: Forward 5′-GGGCTAATTCCAGTGCAGGT-3′ Reverse 5′-GCTGACTCATTCCCACGACA-3′; HOPX:Forward 5′-CCGAGGAGGAGACCCAGAAA-3′ Reverse 5′-TGTGACGGATCTGCACTCTG-3′; TDO2:Forward 5′-TCCTCAGGCTATCACTACCTGC-3′ Reverse 5′-ATCTTCGGTATCCAGTGTCGG-3′; CASP1: Forward 5′-ATGCCTGTTCCTGTGATGTGG-3′ Reverse 5′-GGCATCTGCGCTCTACCATC-3′; PTPRC: Forward 5′-TTCTTAGGGACACGGCTGAC-3′ Reverse 5′-TTAAGGTAGGCATCAGTGGGGG-3′; COL6A3: Forward 5′-CAGTTCCCTGTTGTCCGTGA-3′ Reverse 5′-CTGTACTGAGCCACCGCAAT-3′; CST6: Forward 5′-TCCTGACGATGGAGATGGGG-3′ Reverse 5′-GACCTCAAAGTCACAGCGCA-3′; CELF2: Forward 5′-AGCACCAATGCAAACCCTCT-3′ Reverse 5′-CAGAGTCCCGAGAGAGGTCA-3′; KRT7:Forward 5′-TGGAGTGGGAGCCGTGAATA-3′ Reverse 5′-TGCGGTCCGGATGGAATAAG-3′; PYCARD: Forward 5′-TCAGTTTCACACCAGCCTGGAAC-3′ Reverse 5′-AGGTAGGACTGGGACTCCCTTAGG-3′. Data were represented as means ± standard deviation (SD). One-way analysis of variance (ANOVA) followed by Duncan’s multiple range test were used to check the significance of the differences in mean values using Graphpad Prism 5.0 (CA, USA).
Gene interference
Specific siRNAs targeting lncRNAs were synthesized by GenePharma (Shanghai, China). The sequences are: NR_130134.1 5′-CCAGGUGUCGAUAAGUAAUTT-3′, NR_029395.1 5′-GGGCACCUUAGACUCAGAATT-3′. Lipofectamine 3000 (Invitrogen, Carlsbad, CA, USA) was used for transfection.
Fluorescence in situ hybridization
Location and expression of lncRNAs were visualized using Fluorescence in situ hybridization (FISH) which is performed according to previous description34. The probe and kit for FISH was purchased from Ribobio (Guangzhou, China). Generally, cultured cells were fixed in paraformaldehyde for 10 min and incubated in Triton X-100 for 5 min. Slides were blocked using pre-hybridization buffer. Probes were added at 5 nM concentration. Images were captured using Fluorescence microscope (Leica, German) and gathered in Photoshop CS6 (Adobe, USA).
Supplementary information
Acknowledgements
This work was supported by the National Natural Science Foundation of China, grant numbers [81570690, 81873611 and 81700633], Science and Technology Innovation Team of Henan, grant numbers [17IRTSTHN020]; the Youth Foundation of the First Affiliated Hospital of Zhengzhou University to Jin Shang, Foundation and Frontier Technology Research Program of Henan Province, grant numbers [142300410211].
Author Contributions
Conceptualization, Zhanzheng Zhao; Formal analysis, Yumin Jiang and Genyang Cheng; Funding acquisition, Jin Shang and Zhanzheng Zhao; Methodology, Dong Liu and Jing Xiao; Project administration, Zhanzheng Zhao; Supervision, Zhanzheng Zhao; Writing – original draft, Jin Shang; Writing – review & editing, Shuai Wang and Yiqi Duan.
Data Availability
The datasets generated during and/or analyzed during the current study are available in the GEO database (https://www.ncbi.nlm.nih.gov/geo/).
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-019-39298-9.
References
- 1.Saran R, et al. US Renal Data System 2017 Annual Data Report: Epidemiology of Kidney Disease in the United States. Am J Kidney Dis. 2018;71:A7. doi: 10.1053/j.ajkd.2018.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Navarro-Gonzalez JF, Mora-Fernandez C, Muros de Fuentes M, Garcia-Perez J. Inflammatory molecules and pathways in the pathogenesis of diabetic nephropathy. Nat Rev Nephrol. 2011;7:327–340. doi: 10.1038/nrneph.2011.51. [DOI] [PubMed] [Google Scholar]
- 3.Brenner BM, et al. Effects of losartan on renal and cardiovascular outcomes in patients with type 2 diabetes and nephropathy. N Engl J Med. 2001;345:861–869. doi: 10.1056/NEJMoa011161. [DOI] [PubMed] [Google Scholar]
- 4.Liao, Z., Zhao, J. & Yang, Y. Downregulation of lncRNA H19 inhibits the migration and invasion of melanoma cells by inactivating the NFkappaB and PI3K/Akt signaling pathways. Mol Med Rep, 10.3892/mmr.2018.8782 (2018). [DOI] [PubMed]
- 5.Xu, J. H., Chang, W. H., Fu, H. W., Yuan, T. & Chen, P. The mRNA, miRNA and lncRNA networks in hepatocellular carcinoma: An integrative transcriptomic analysis from Gene Expression Omnibus. Mol Med Rep, 10.3892/mmr.2018.8694 (2018). [DOI] [PMC free article] [PubMed]
- 6.Li X, et al. Long noncoding RNA MALAT1 regulates renal tubular epithelial pyroptosis by modulated miR-23c targeting of ELAVL1 in diabetic nephropathy. Exp Cell Res. 2017;350:327–335. doi: 10.1016/j.yexcr.2016.12.006. [DOI] [PubMed] [Google Scholar]
- 7.Hanson RL, et al. Identification of PVT1 as a candidate gene for end-stage renal disease in type 2 diabetes using a pooling-based genome-wide single nucleotide polymorphism association study. Diabetes. 2007;56:975–983. doi: 10.2337/db06-1072. [DOI] [PubMed] [Google Scholar]
- 8.Duan LJ, et al. Long noncoding RNA TUG1 alleviates extracellular matrix accumulation via mediating microRNA-377 targeting of PPARgamma in diabetic nephropathy. Biochem Biophys Res Commun. 2017;484:598–604. doi: 10.1016/j.bbrc.2017.01.145. [DOI] [PubMed] [Google Scholar]
- 9.Woroniecka KI, et al. Transcriptome analysis of human diabetic kidney disease. Diabetes. 2011;60:2354–2369. doi: 10.2337/db10-1181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Na J, Sweetwyne MT, Park AS, Susztak K, Cagan RL. Diet-Induced Podocyte Dysfunction in Drosophila and Mammals. Cell Rep. 2015;12:636–647. doi: 10.1016/j.celrep.2015.06.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ju W, et al. Defining cell-type specificity at the transcriptional level in human disease. Genome Res. 2013;23:1862–1873. doi: 10.1101/gr.155697.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Martini S, et al. Integrative biology identifies shared transcriptional networks in CKD. J Am Soc Nephrol. 2014;25:2559–2572. doi: 10.1681/ASN.2013080906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Yang L, et al. Inflammatory gene expression in OVE26 diabetic kidney during the development of nephropathy. Nephron Exp Nephrol. 2011;119:e8–20. doi: 10.1159/000324407. [DOI] [PubMed] [Google Scholar]
- 14.Hodgin JB, et al. Identification of cross-species shared transcriptional networks of diabetic nephropathy in human and mouse glomeruli. Diabetes. 2013;62:299–308. doi: 10.2337/db11-1667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Fiorentino L, et al. Loss of TIMP3 underlies diabetic nephropathy via FoxO1/STAT1 interplay. EMBO Mol Med. 2013;5:441–455. doi: 10.1002/emmm.201201475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wang, S. et al. Repression of microRNA-382 inhibits glomerular mesangial cell proliferation and extracellular matrix accumulation via FoxO1 in mice with diabetic nephropathy. Cell Prolif, e12462, 10.1111/cpr.12462 (2018). [DOI] [PMC free article] [PubMed] [Retracted]
- 17.Tavazoie S, Hughes JD, Campbell MJ, Cho RJ, Church GM. Systematic determination of genetic network architecture. Nat Genet. 1999;22:281–285. doi: 10.1038/10343. [DOI] [PubMed] [Google Scholar]
- 18.Schenone M, Dancik V, Wagner BK, Clemons PA. Target identification and mechanism of action in chemical biology and drug discovery. Nat Chem Biol. 2013;9:232–240. doi: 10.1038/nchembio.1199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Gluck C, Ko YA, Susztak K. Precision Medicine Approaches to Diabetic Kidney Disease: Tissue as an Issue. Curr Diab Rep. 2017;17:30. doi: 10.1007/s11892-017-0854-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Yi H, et al. LincRNA-Gm4419 knockdown ameliorates NF-kappaB/NLRP3 inflammasome-mediated inflammation in diabetic nephropathy. Cell Death Dis. 2017;8:e2583. doi: 10.1038/cddis.2016.451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Hu M, et al. LncRNA MALAT1 is dysregulated in diabetic nephropathy and involved in high glucose-induced podocyte injury via its interplay with beta-catenin. J Cell Mol Med. 2017;21:2732–2747. doi: 10.1111/jcmm.13189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Breyer MD, et al. Mouse models of diabetic nephropathy. J Am Soc Nephrol. 2005;16:27–45. doi: 10.1681/ASN.2004080648. [DOI] [PubMed] [Google Scholar]
- 23.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhu HG, et al. Efficacy of low-protein diet for diabetic nephropathy: a systematic review of randomized controlled trials. Lipids Health Dis. 2018;17:141. doi: 10.1186/s12944-018-0791-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Herman-Edelstein M, Scherzer P, Tobar A, Levi M, Gafter U. Altered renal lipid metabolism and renal lipid accumulation in human diabetic nephropathy. J Lipid Res. 2014;55:561–572. doi: 10.1194/jlr.P040501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ko, G. J., Kalantar-Zadeh, K., Goldstein-Fuchs, J. & Rhee, C. M. Dietary Approaches in the Management of Diabetic Patients with Kidney Disease. Nutrients9, 10.3390/nu9080824 (2017). [DOI] [PMC free article] [PubMed]
- 27.Wada J, Makino H. Inflammation and the pathogenesis of diabetic nephropathy. Clin Sci (Lond) 2013;124:139–152. doi: 10.1042/CS20120198. [DOI] [PubMed] [Google Scholar]
- 28.Xiao X, et al. Cellular and humoral immune responses in the early stages of diabetic nephropathy in NOD mice. J Autoimmun. 2009;32:85–93. doi: 10.1016/j.jaut.2008.12.003. [DOI] [PubMed] [Google Scholar]
- 29.Ritchie ME, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Chen C, et al. Removing batch effects in analysis of expression microarray data: an evaluation of six batch adjustment methods. PLoS One. 2011;6:e17238. doi: 10.1371/journal.pone.0017238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
- 32.Zhang, B. & Horvath, S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol4, Article17, 10.2202/1544-6115.1128 (2005). [DOI] [PubMed]
- 33.Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zhang, X. et al. Microarray analysis reveals long noncoding RNA SOX2OT as a novel candidate regulator in diabetic nephropathy. Mol Med Rep, 10.3892/mmr.2018.9534 (2018). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated during and/or analyzed during the current study are available in the GEO database (https://www.ncbi.nlm.nih.gov/geo/).