Abstract
This data descriptor is an extended version of the methodologies which have been described in a related paper [1]; its purpose is to disseminate the raw data and analyzed data produced in this experiment. For more insight please see the research article, “competing endogenous RNA expression profiling in pre-eclampsia identifies hsa_circ_0036877 as a potential novel blood biomarker for early pre-eclampsia” (Xiaopeng Hu and Junping Ao, 2018) [1]. Using microarray analysis, we investigated competing endogenous RNA (ceRNA) expression profiles in placentas of women with severe pre-eclampsia (SPE) and normal pregnancies. Competing endogenous RNA (ceRNA) refer to RNA transcripts, such as messenger RNA (mRNA), long noncoding RNA (lncRNA), and circular RNA (circRNA) that can regulate each other by competing for the same pool of miRNAs. CircRNAs, lncRNAs, and mRNAs differentially expressed between human normal and SPE placentas were obtained in the study. Metadata files were submitted to the Gene Expression Omnibus repository (GEO), with GEO accession number GSE102897. These data are potential useful for further study on the pathogenesis of PE and early prediction of PE onset.
Keywords: Competing endogenous RNA, Pre-eclampsia, Expression profiling dataset, circRNA, lncRNA, mRNA
Subject area | Medicine |
More specific subject area | Pre-eclampsia (PE) |
Type of data | Table and figure |
How data was acquired | Microarray |
Data format | Raw and analyzed data |
Parameters for data collection | The human placentas of controls and SPE were randomly devided into 3 groups respectively. |
Description of data collection | Total RNA were extracted from the placental tissue for analysis of their circRNA, lncRNA, and mRNA profiles using Agilent Human (4 × 180K) GeneChip circRNA and lncRNA arrays. |
Data source location | Shanghai, china |
Data accessibility | Data was submitted to the Gene Expression Omnibus repository (GEO) (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE102897), with GEO accession number GSE102897. |
Related research article | Hu X, Ao J, Li X, Zhang H, Wu J, Cheng W. Competing endogenous RNA expression profiling in pre-eclampsia identifies hsa_circ_0036877 as a potential novel blood biomarker for early pre-eclampsia. Clinical epigenetics 2018, 10: 48. |
Value of the Data
|
1. Data
The dataset contains original and analyzed data obtained through the circRNA, lncRNA, mRNA expression microarray analysis of 12 placental samples, which were collected from six patients with SPE and six healthy pregnant women (Fig. 1). The original and normalized data of all SPE/normal placental samples have been deposited in the NCBI GEO repository with accession number GSE102897 for the circRNA, lncRNA and mRNA expression microarrays. The metadata record associated with the samples is shown in Table 1 and demographic characteristics of Participants were shown in related work [1]. Analyzed data is shown in tables (Tables S1–12). Based on the experimental procedure, a total new set of figures (Fig. 2, Fig. 3, Fig. 4, Fig. 5) were generated and quality controls were used to generate the dataset. The ceRNA dataset has little inter-sample variation in the expression of circRNAs, lncRNAs, and mRNAs among the placentas of pregnant women with PE or normal. The findings also suggest that the levels of inter-PE/normal sample variation and/or technical variation were low.
Table 1.
Sample name | Source | Organism | Tissue | Age | Protocol 1 | Protocol 2 | Data |
---|---|---|---|---|---|---|---|
SPE_1 | Severe Preeclampsia | Homo sapiens | Placenta | 25-35y | RNA extraction | circRNA array IncRNA and mRNA array |
GSM2747897 |
SPE_2 | Severe Preeclampsia | Homo sapiens | Placenta | 25-35y | RNA extraction | circRNA array IncRNA and mRNA array |
GSM2747898 |
SPE_3 | Severe Preeclampsia | Homo sapiens | Placenta | 25-35y | RNA extraction | circRNA array IncRNA and mRNA array |
GSM2747899 |
N_1 | Normal human | Homo sapiens | Placenta | 25-35y | RNA extraction | circRNA array IncRNA and mRNA array |
GSM2747900 |
N_2 | Normal human | Homo sapiens | Placenta | 25-35y | RNA extraction | circRNA array IncRNA and mRNA array |
GSM2747901 |
N_3 | Normal human | Homo sapiens | Placenta | 25-35y | RNA extraction | circRNA array IncRNA and mRNA array |
GSM2747902 |
2. Experimental design, materials, and methods
These methods are expanded versions of descriptions in our related work [1].
2.1. Experimental design
As shown in Fig. 1, in this study, we included six patients with SPE and six healthy pregnant women as controls. The placentas of controls and SPE were randomly devided into 3 groups respectively. Total RNA were extracted from the placental tissue for analysis of their circRNA, lncRNA, and mRNA profiles using Agilent Human (4 × 180K) GeneChip circRNA and lncRNA arrays. The GeneChip circRNA array was designed to interrogate all sequences of circRNA in Circbase (88,371) [2], which is the updated international mainstream database of human circRNAs. The lncRNA array used lncRNA and mRNA probes derived from the updated version of a reference genome (human/GRCh38; hg38) [3], covering the following databases of lncRNA: GENCODE V21, Ensembl, UCSC, NONCODE, LNCipedia, and lncRNAdb. All lncRNA and mRNA sequences from the databases were aligned by Biological Information Experts, and then the redundant sequences were eliminated. The precise lncRNA (68,423) and mRNA (18,853) sequences were applied for specific probe design.
2.2. Samples and setting
The Ethics Committee of the International Peace Maternity and Child Health Hospital (Shanghai Jiaotong University, Shanghai, China) approved this research and prior written informed consent was signed by all participants to participate. From December 2015 to November 2017, Placental samples were taken from 6 Normal and 6 SPE Chinese pregnant women at the above-mentioned hospital, whose the demographic characteristics were shown in related work [1]. Each placental sample was stored at −80 °C until use and taken from the same representative region of the central portion of tissue at the lower third of the placenta, near the maternal side, which mainly contained cytotrophoblasts, syncytiotrophoblasts, and villous interstitium. In this research, patients including those with no history of hypertension, diabetes, or kidney disease were evaluated by an experienced obstetrician. SPE is defined as a higher blood pressure >160 mmHg systolic or >110 mmHg diastolic with protein urea of ≥300–5000 mg/24h, in accordance with the guidelines of the American College of Obstetricians and Gynecologists (ACOG) [4].
2.3. RNA extraction from placental samples
Using TAKARA RNAiso Plus#9109, total RNA was extracted following to the manufacturer's protocol.
2.4. RNA amplification and labeling
For RNA amplification and labeling, we used a Low Input Quick Amp WT Labeling Kit (Cat# 5190-2943; Agilent Technologies) according to the manufacturer's protocol and then purified by RNeasy Mini kit (Cat# 74106; QIAGEN, GmBH, Germany).
2.5. Quality control of the microarray
The coefficient of variation (CV) was applied as quality control and validation method for the analysis of microarray assays.
2.6. Microarray analysis
Each slide was hybridized with 1.65 μg of Cy3-labeled cRNA using a Gene Expression Hybridization Kit (Cat# 5188-5242; Agilent Technologies) in a Hybridization Oven (Cat# G2545A; Agilent Technologies) at 55 °C, 20 rpm for 17 hours according to the manufacturer's instructions. After hybridization, slides were washed in stain dishes (Cat# 121; Thermo Shandon, Waltham, MA, USA) with a Gene Expression Wash Buffer Kit (Cat# 5188–5327; Agilent Technologies).
Slides were scanned by an Agilent Microarray Scanner (Cat# G2565CA; Agilent Technologies) with the default settings (Agilent Technologies). Microarray data were feature extracted by Feature Extraction software 10.7 (Agilent Technologies). Raw data were normalized by the quantile algorithm of limma package in R.
To visualize the whole set of significant ceRNAs expression in PE compared to normal participants, hierarchical clustering in MeV_4_9_0 was used. The p-value was calculated by a right-sided hypergeometric test. Benjamini–Hochberg adjustment was used to correct for multiple tests. An adjusted p-value of <0.05 indicated a statistically significant deviation from the expected distribution. Gene-enrichment and functional annotation analyses for the significant probe list were performed using DAVID (http://david.abcc.ncifcrf.gov/home.jsp) [5,6].
3. Technical validation
3.1. Checked the RNA quality of samples
Using a Bioanalyzer 2100 (Agilent, CA, USA), RNA Integrity Number (RIN) was determined to evaluate RNA integrity. All RIN values of placental samples are greater than 7.0 and the ratio of 28S RNA to18S RNA is greater than 1.9.
3.2. Quality check of ceRNA profiling data
For this study, we confirmed the quality of the circRNA profiling data. The following methods were used: the number of circRNAs probes detected (Fig. 2a), the expression density (Fig. 2b)and the distribution of the normalized signal values (max, min, 1st Qu, 3rd Qu, Median) (Fig. 2c) for each group were analyzed; Correlation and hierarchical clustering analysis between groups were performed (Fig. 2d–f). In the same way, we confirmed the quality of the lncRNA and mRNA expression microarray analysis by analysing the number of detected probes (Fig. 3, Fig. 4a), the expression density (Fig. 3, Fig. 4b) and the distribution of the normalized signal values (max, min, 1st Qu, 3rd Qu, Median) (Fig. 3, Fig. 4c) for each group. Accordingly, Correlation and hierarchical clustering analysis between groups were also performed (Fig. 3, Fig. 4d–f).
3.3. Validation of mRNA, lncRNA and circRNA expression
As shown, circRNAs (Fig. 5a, Table S1), lncRNAs (Fig. 5b, Table S2) and genes (Fig. 5c, Table S3) showed significant changes in transcript expression between SPE and Normal placentas.
3.4. Functional enrichment analysis and prediction of ceRNA interaction
By DAVID analysis, Gene-enrichment, functional annotation and Pathway -enrichment were shown in Tables S4–9. By using cytoscape, ceRNA interaction between circRNA, lncRNA and mRNA were predicted, which were shown in Tables S10–12.
Acknowledgments
This work was supported by the National Natural Science Foundation of China (81720108017), National Basic Research Program of China (2017YFA0504201) and the interdisciplinary program of Shanghai Jiao Tong University (ZH2018ZDA31).
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.dib.2019.104795.
Contributor Information
Xiaopeng Hu, Email: huxiaopeng2017@sjtu.edu.cn.
Xinyue Li, Email: 1419974394@qq.com.
Geng G. Tian, Email: gengtian@sjtu.edu.cn.
Huijuan Zhang, Email: zhanghj815@126.com.
Weiwei Cheng, Email: 18017316001@163.com.
Ji Wu, Email: jiwu@sjtu.edu.cn.
Conflict of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Appendix A. Supplementary data
The following are the Supplementary data to this article:
References
- 1.Hu X., Ao J., Li X., Zhang H., Wu J., Cheng W. Competing endogenous RNA expression profiling in pre-eclampsia identifies hsa_circ_0036877 as a potential novel blood biomarker for early pre-eclampsia. Clin. Epigenet. 2018;10:48. doi: 10.1186/s13148-018-0482-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Glazar P., Papavasileiou P., Rajewsky N. circBase: a database for circular RNAs. RNA. 2014;20:1666–1670. doi: 10.1261/rna.043687.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Guo Y., Dai Y., Yu H., Zhao S., Samuels D.C., Shyr Y. Improvements and impacts of GRCh38 human reference on high throughput sequencing data analysis. Genomics. 2017;109:83–90. doi: 10.1016/j.ygeno.2017.01.005. [DOI] [PubMed] [Google Scholar]
- 4.Hypertension in pregnancy, report of the American College of Obstetricians and Gynecologists' task force on hypertension in pregnancy. Obstet. Gynecol. 2013;122:1122–1131. doi: 10.1097/01.AOG.0000437382.03963.88. [DOI] [PubMed] [Google Scholar]
- 5.Huang da W., Sherman B.T., Lempicki R.A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13. doi: 10.1093/nar/gkn923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Huang da W., Sherman B.T., Lempicki R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.