Skip to main content
Breast Cancer : Targets and Therapy logoLink to Breast Cancer : Targets and Therapy
. 2023 Jul 6;15:435–450. doi: 10.2147/BCTT.S408711

Promoter Methylation-Regulated Differentially Expressed Genes in Breast Cancer

Samar Sindi 1,*, Norah Hamdi 1,2,*, Sabah Hassan 1,3, Magdah Ganash 1, Mona Alharbi 1, Najla Alburae 1, Sheren Azhari 1, Shadi Alkhayyat 4, Ayman Linjawi 5, Heba Alkhatabi 6,7, Aisha Elaimi 7,8, Ghadeer Alrefaei 9, Nouf Alsubhi 10, Aziza Alrafiah 7,, Safiah Alhazmi 1,3
PMCID: PMC10332364  PMID: 37434588

Abstract

Background

Breast cancer is one of the most common malignancies among women. Recent studies revealed that differentially methylated regions (DMRs) are implicated in regulating gene expression. The goal of this research was to determine which genes and pathways are dysregulated in breast cancer when their promoters are methylated in an abnormal way, leading to differential expression. Whole-genome bisulfite sequencing was applied to analyze DMRs for eight peripheral blood samples collected from five Saudi females diagnosed with stages I and II of breast cancer aligned with three normal females. Three of those patients and three normal samples were used to determine differentially expressed genes (DEG) using Illumina platform NovaSeq PE150.

Results

Based on ontology (GO) and KEGG pathways, the analysis indicated that DMGs and DEG are closely related to associated processes, such as ubiquitin-protein transferase activity, ubiquitin-mediated proteolysis, and oxidative phosphorylation. The findings indicated a potentially significant association between global hypomethylation and breast cancer in Saudi patients. Our results revealed 81 differentially promoter-methylated and expressed genes. The most significant differentially methylated and expressed genes found in gene ontology (GO) are pumilio RNA binding family member 1 (PUM1) and zinc finger AN1-type containing 2B (ZFAND2B) also known as (AIRAPL).

Conclusion

The essential outcomes of this study suggested that aberrant hypermethylation at crucial genes that have significant parts in the molecular pathways of breast cancer could be used as a potential prognostic biomarker for breast cancer.

Keywords: breast cancer, DNA methylation, biomarker, differentially methylated regions, differentially expressed gene, whole-genome bisulfite sequencing

Introduction

Breast cancer represents the most frequently diagnosed and fatal malignant neoplasm among women worldwide. In 2020, breast cancer accounted for the fifth-highest rate of cancer death.1 Current data indicates that cancer is both a hereditary and epigenetic disease. Many major aspects of tumor biology are controlled by epigenetic modifications, ranging from the growth, and spread of primary tumors to how the immune system responds throughout the neoplasm microenvironment.2 Interestingly, germline mutations are related to 10% of the hereditary type of breast cancer, while about 90% of breast cancer cases without any somatic mutations revealed the epigenetic mechanisms responsible for this alteration.3

DNA methylation is one of the most researched epigenetic changes in humans, occurring by adding a methyl group to the carbon 5-position (C5) of cytosine to form 5-methylcytosine. Essentially, DNA methyltransferases (DNMTs) are a category of enzymes that catalyze this process. These enzymes convert cytosine into 5-methylcytosine by adding a methyl group at the fifth carbon atom from the S-adenosylmethionine (SAM) molecule.4 The differentially methylated regions (DMRs) in the genomic DNA have been reported to be associated with a variety of disorders and are thought to be involved in gene expression processes. As a result, identifying DMRs is an important and vital step in unraveling the causes of illnesses.5 Moreover, the level of global DNA methylation has decreased as a tumor progresses from normal tissue to invasive cancer, but the abundance of local hypermethylated CpG islands at promoters has increased, leading to transcriptional repression of tumor-suppressive genes. Methylation events of promoters and CpG islands in DNA prevent the expression of target genes by regulating the binding of transcriptional modulators to the promoter.6 In blood, both hypomethylation and hypermethylation mechanisms have been proposed to increase breast cancer risk. Hypomethylation may lead to the activation of proto-oncogenes to become oncogenes, whereas hypermethylation in the promoter region of tumor suppressor genes leads to their inactivation.7 Furthermore, peripheral blood or serum can be used to detect DNA methylation for the early diagnosis and treatment of breast cancer. The higher methylation rate in the promoter region was found in CpG islands, which decreases the expression of the corresponding gene.8

Therefore, the current study aimed to determine novel predictive biomarkers for breast cancer through analysis of DMR in promoter regions and gene expression of the whole genome.

Methodology

Study Population

The research was carried out using eight blood samples from five breast cancer patients and three normal women from Jeddah city, Saudi Arabia, while six of them were selected for gene expression. With ages ranging from 29 to 60 years old. Participants’ blood samples were collected in the EDTA Lavender tube (Franklin Lakes, NJ, USA) from the medical clinic’s reference and the King Abdulaziz University Hospital in Jeddah. The biomedical ethics unit at King Abdulaziz University approved the study (No.349-21).

Whole Genome Bisulfite Sequencing (WBI) Using Novaseq 6000

Genomic DNA was isolated from the whole blood using the DNeasy Blood QIAamp Kit (Qiagen, Hilden, Germany) in accordance with the manufacturer’s guidelines and then stored at −80 C. The whole-genome bisulfite sequencing (WGBS; Novaseq 6000 platform, PE150 read length) was performed by WBI with 10× genome coverage and used according to the manufacturer’s instructions.

Transcriptome Sequencing

Whole genomic RNA was isolated from blood using RNA bler-blood kit (Haven, Jeddah, Saudi Arabia) according to the instructions from the manufacturer and then stored at −80 C. To purify messenger RNA using poly-T oligo-attached magnetic beads. The first strand of cDNA was then generated using random hexamer primers. The libraries were sequenced via Illumina platforms NovaSeq PE150 (12G/sample), NOVOGENE (HK) COMPANY LIMITED.

GO and KEGG Enrichment Analysis of DMR-Related Genes and DEG

Gene Ontology (http://www.geneontology.org/) is an important bioinformatics categorization system that aims to standardize the presentation of gene features across all organisms. It includes three primary sections: cellular component, molecular function, and biological process. Bioconductor (2.13) was used to perform gene ontology (GO) enrichment analysis of DMR-related genes with gene length bias-corrected.9

The Kyoto Encyclopedia of Genes and Genomes (KEGG) (http://www.genome.jp/kegg) is a database for figuring out the high-level functions and benefits of biological systems like cells, organisms, and ecosystems from molecular-level data.10 To examine the statistical enrichment of DMR-related genes in KEGG pathways, the KEGG Orthology-Based Annotation System (KOBAS), whereas clusterProfiler (3.8.1) for DEG with a corrected P-value was less than 0.05.

Whole-Genome Bisulfite Sequencing (WGBS) and mRNA Expression Analysis

The DSS software was used to detect differentially methylated regions (DMRs), while the differential expression gene (DEG) analysis was performed using the DESeq2 R package (1.20.0). Genes with P-value ≤0.05 found by DESeq2 were assigned as differentially expressed.

Results

Genome-Wide DNA Methylation Profiling

Table 1 represents the whole-genome bisulfite sequencing data for breast cancer (BC) and normal samples (N), including clean bases, clean reads, CG content, clean_Q30, bisulfite conversion rate, and mC percentages. For mC results, the most important component, the percentages for breast cancer patients were 0.74%, 0.57%, 0.93%, 1.07%, and 1.01% for BC1–BC5, respectively, while the percentages for normal samples were 1.34%, 0.62%, and 0.65% for N1-N3, respectively.

Table 1.

Whole-Genome Bisulfite Sequencing Data for Breast Cancer and Normal Individuals

Samples Clean Base (G) Clean Reads CG (%) Q30 (%) Rate of Bisulfite Conversion (%) Total-mC (%)
BC1 29.37 108284426 21.37 90.29 99.664 0.74
BC2 27.11 100008199 21.05 90.34 99.668 0.57
BC3 32.86 120905471 20.87 90.88 99.653 0.93
BC4 35.66 131353269 20.94 90.88 99.662 1.07
BC5 34.14 125856479 21.12 90.81 99.669 1.01
N1 39.47 145556222 21.24 89.16 99.666 1.34
N2 27.70 102355151 21.08 88.52 99.674 0.62
N3 27.10 99845502 21.20 90.71 99.674 0.65

Notes: Sample: sample name (BC= Breast Cancer, N= normal samples). Clean bases (G): clean read counts multiplied by read length, saved in the G unit. Clean reads: the sequencing read counts after trimming filtering. CG- Content (%): the proportion of G and C in all the bases. Clean_Q30 (%): the proportion of bases with a quality score of more than 20. Bisulfite conversion rate (%): the Bisulfite conversion rate for C converting to T. mC percent (%): percent of methylated cytosines in all the genomic cytosines.

Differentially Methylated Regions (DMRs) in Breast Cancer and Normal Individuals

Figure 1A shows the identification of differentially methylated regions (DMRs) between breast cancer patients and normal samples. The DMRs were mostly located at introns for 258 genes as hypermethylated and 837 genes as hypomethylated (258 Hyper and 837 Hypo) followed by other regions (124 Hyper and 447 Hypo), exons (94 Hyper and 272 Hypo), and regulatory regions such as promoters (92 Hyper and 329 Hypo), TSS (11 Hyper and 27 Hypo), 5′UTRs (18 Hyper and 83 Hypo), 3′UTR (16 Hyper and 40 Hypo) and TES (5 Hyper and 5Hypo).

Figure 1.

Figure 1

DMR distribution in breast cancer samples. (A) Bar graph represents CG context in the different functional regions. (B) Circos plot represents the distribution of Hyper/Hypo DMR methylation level.

The Distribution of Hyper/Hypo DMR Methylation Level

There were 7406 differentially methylated regions (DMRs) discovered, with 2299 being hyper-methylated and 5107 being hypo-methylated regions. The DMR distributions and their significance to the chromosomes are shown in (Figure 1B), and the circos plot represents from outside (the significance and distribution of DMRs on the chromosome) to inside (the distribution of DMRs based on gene regions).

GO and KEGG Enrichment Analysis of DMR-Related Genes

In Table 2, GO enrichment analysis was carried out utilizing the GOseq R program. According to the significance of the p-value, the hypomethylated genes were primarily participating in the cellular component (CC) including the nucleus, nucleoplasm, and nuclear part. Regarding molecular functions (MF), these genes participate in nucleic acid binding, DNA binding, and heterocyclic compound binding. Moreover, in the biological process (BP), the enrichment was primarily participating in ribosomal small subunit export from the nucleus, and regulation of natural killer cells mediated immune response to tumor cells, indicating that hypomethylated genes may play an important function in the cell cycle. The functional enrichment of hypermethylated genes in BP includes negative regulation of neutrophil degradation and blood vessel remodeling, and CC to interferon-beta. In addition, cellular components such as trans-Golgi network membrane, intracellular component, and heterotrimeric G-protein complex were displayed. Moreover, MF enrichment revealed the presence of (R, R)-butanediol dehydrogenase, L-iditol 2-dehydrogenase, and hexitol dehydrogenase. Furthermore, Figure 2A illustrates the most enriched GO terms for breast cancer patients compared to normal individuals in CG with hypomethylation, and Figure 2B illustrates the GO terms in hyper DMR in the gene promoter.

Table 2.

Functional Enrichment Analysis of Aberrant Hypo- and Hyper-Methylated Promoter Genes in Breast Cancer

Term _Type GO_Accession Description p-value DMR Genes Top Three Gene
Hypo-methylated promoter genes
Cellular Component (CC) GO:0005634 Nucleus 2.95E-07 130 LPIN2 / KLF6
MID1IP1
GO:0005654 Nucleoplasm 1.65E-06 80 STAG2/H2BC18
FAM222B
GO:0044428 Nuclear part 1.67E-06 96 NFIX/MAD1L1 H3C12
GO:0031981 Nuclear lumen 3.13E-06 90 RBPJL /MBNL3
GPR50
GO:0043231 Intracellular-membrane-bounded organelle 1.70E-05 175 PMEPA1/MAN1A2
ABHD17A
Molecular Function (MF) GO:0003676 Nucleic acid binding 6.85E-06 83 RPL37/KLF6/RPN1
GO:0003677 DNA-binding 8.83E-05 52 PGR/ ZBTB33
SUV39H1
GO:1901363 Heterocyclic-compound binding 0.00020848 103 TSPYL2/ G6PD
LITAF
GO:0043325 Phosphatidylinositol-3,4-bisphosphate binding 0.00030605 4 OBSCN/ AKT1
PHLDA3
GO:0097159 Organic cyclic compound binding 0.00038202 103 ALG13/ PGR
ZNF394
Biological Process (BP) GO:0000056 Ribosomal small subunit export from the nucleus 0.0050594 2 NUP214/ NUP88
GO:0005979 Regulation of glycogen biosynthetic process 0.0052089 3 INPP5K/AKT1
PASK
GO:0010962 Regulation of glucan biosynthetic process 0.0052089 3 PASK/ INPP5K
AKT1
GO:0032259 Methylation 0.0056769 11 EEF1AKMT2
BCOR/ TRMT10C
GO:0071103 DNA conformation change 0.0056806 10 MIS18BP1
H2BC17
CHTF18
Hyper-methylated promoter genes
Biological Process (BP) GO:0043314 Negative regulation of neutrophil degranulation 9.33E-06 2 ABR/ BCR
GO:0060313 Negative regulation of blood vessel remodeling 2.79E-05 2 ABR/ BCR
GO:0035458 Cellular response to interferon-beta 3.56E-05 3 NDUFA13
IRGM/ FI16
GO:1902564 Negative regulation of neutrophil activation 9.28E-05 2 BCR/ ABR
GO:0035456 Response to interferon-beta 0.00010641 3 IRGM
NDUFA13 / IFI16
Cellular Component (CC) GO:0032588 Trans-Golgi network membrane 0.0028742 3 LGR6/ COG2
GNAS
GO:0044424 Intracellular part 0.0040164 54 PLPP7/ PGS1
CAMSAP1
GO:0005622 Intracellular 0.0040966 54 ZNF444/ LGR6
TOX2
GO:0005834 Heterotrimeric G-protein complex 0.0043621 2 GNB1/GNAS
GO:0005737 Cytoplasm 0.0055811 46 LRRC61/ COG2
UBE2V1
Molecular Function (MF) GO:0000721 (R,R)-butanediol dehydrogenase activity 0.0030803 1 SORD
GO:0003939 L-iditol 2-dehydrogenase activity 0.0030803 1 SORD
GO:0031320 Hexitol dehydrogenase activity 0.0030803 1 SORD
GO:0043914 NADPH:sulfur oxidoreductase activity 0.0030803 1 MICAL2
GO:0043997 Histone acetyltransferase activity (H4-K12 specific) 0.0030803 1 KAT2A

Note: Genes are highlighted in italicized.

Figure 2.

Figure 2

Continued.

Figure 2.

Figure 2

GO and KEGG enrichment analysis of DMR-related genes. (A) The Most Enriched of GO terms in CG Hyper DMR promoter genes. (B) The Most Enriched of GO terms in CG Hypo DMR promoter genes. (C) Statistics of Pathway Enrichment CG Hypo DMR promoter genes. (D) Statistics of Pathway Enrichment CG Hyper DMR promoter genes.

Note: *Is a significantly enriched term.

Regarding Table 3, analysis of enrichment of KEGG pathways revealed that the major hypomethylated genes are the genes that play roles in different pathways such as ribosome biosynthesis, N-Glycan biosynthesis, insulin signaling pathway, viral carcinogenesis, and cancer’s central carbon metabolism (Figure 2C). In addition, eight genes were engaged in cancer pathways. The notch signaling system, regulation of lipolysis in adipocytes, pathways in cancer, metabolic pathways, and morphine addiction were affected by hypermethylated genes (Figure 2D).

Table 3.

KEGG Pathway Enrichment of Aberrantly Methylated Genes in Breast Cancer

Pathway ID Term Number of Methylated Genes P-value Top Three Hypomethylated Gene
Hypomethylated genes
hsa03010 Ribosome 7 0.000127759 MRPL3P1/ RPL27A/RPS2P32
hsa00510 N-Glycan biosynthesis 4 0.000710227 DPM1/ RPN1/ ALG13
hsa04910 Insulin signaling pathway 6 0.000874458 RP6-24A23.6/ EIF4E/RHEB
hsa05203 Viral carcinogenesis 7 0.001245086 H2BC17/ IKBKG/ MAD1L1
hsa05230 Central carbon metabolism in cancer 4 0.00211821 FLT3/ SLC16A3/ AKT1
Hypermethylated genes
hsa04330 Notch signaling pathway 2 0.00436209 NCOR2/ KAT2A
hsa04923 Regulation of lipolysis in adipocytes 2 0.005826959 GNAS/ RP6-24A23.6
hsa05200 Pathways in cancer 4 0.008035652 GNAS/ BCR/ NOS2P2
hsa01100 Metabolic pathways 7 0.011269921 NOS2P2/ PGS/ OXSM
hsa05032 Morphine addiction 2 0.01442998 GNAS/ GNB1

Note: Genes are highlighted in italicized.

The GO and KEGG Analysis of Differentially Expressed Genes

Gene Ontology (GO) enrichment analysis of differentially expressed genes and Kyoto Encyclopedia of Genes and Genomes (KEGG) was implemented by the clusterProfiler R package, in which gene length bias was corrected. GO terms and KEGG pathways with a corrected P value of less than 0.05 were considered significantly enriched by differentially expressed genes. As shown in Table 4, GO of cellular component (CC) results revealed a high enrichment of upregulated differentially expressed genes in chromatin, clear speck, and transcription elongation factor complex, whereas GO of molecular function (MF) results showed that the upregulated DEG were enriched in the activity of DNA-dependent ATPase, ubiquitin-protein transferase, and ubiquitin-like protein transferase. Additionally, the differentially expressed genes were most enriched in the biological processes (BP) including megakaryocyte differentiation, megakaryocyte differentiation, and myeloid cell differentiation (Figure 3A). On the other hand, the downregulated DEG were considerably enriched in CC including the ribosomal subunit, large ribosomal subunit, and the cytosolic ribosome. Meanwhile, these genes were involved in MF such as the structural constituent of ribosome, electron transfer activity, and sulfurtransferase activity. Moreover, BP genes were included in SRP-dependent cotranslational protein targeting to the membrane, cotranslational protein targeting to the membrane, and viral transcription (Figure 3B). As shown in Table 5, the Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis revealed that the upregulated differentially expressed genes are highly enriched in the ubiquitin-mediated proteolysis, leukocyte transendothelial migration, and Ras signaling pathways (Figure 3C). In contrast, the down-regulated differently expressed genes are found in ribosome, oxidative phosphorylation, and Huntington’s disease as presented in (Figure 3D).

Table 4.

Functional Enrichment Analysis of Differentially Expressed Genes

Term_Type GO_ID Description p-value Count Top Three Upregulated Genes
Up-regulated genes
Molecular Function (MF) GO:0008094 DNA-dependent ATPase activity 8.40E-07 15 RAD50/ BPTF/PBRM1
GO:0004842 Ubiquitin-protein transferase activity 0.000045 35 FBXO11/ FBXL13/RNF103
GO:0019787 Ubiquitin-like protein transferase activity 0.000068 36 UBR3/ FBXL4/ TRIM9
Biological Process (BP) GO:0060216 Definitive hemopoiesis 0.000012 7 ZFP36L2/ TEK/TGFBR3
GO:0030219 Megakaryocyte differentiation 0.000134 12 CBFB/ GABPA/ KAT2B
GO:0030099 Myeloid cell differentiation 0.000196 30 ETS1/ IKZF1/ KMT2A
Cellular Component (CC) GO:0000785 Chromatin 0.00022 36 KMT2E/ CENPF/RAD50
GO:0016607 Nuclear speck 0.001656 27 PUM1/ KMT2E/SMC5
GO:0008023 Transcription elongation factor complex 0.004677 7 EPOP/ CTR9/AFF2
Down-regulated genes
Molecular Function (MF) GO:0003735 Structural constituent of ribosome 2.412E-15 31 RPS11/ MRPL14/RPL38
GO:0009055 Electron transfer activity 4.739E-06 15 COX8A/ NDUFS6/CYBA
GO:0016783 Sulfurtransferase activity 0.000035 5 URM1/TSTD1/TRMU
Cellular Component (CC) GO:0044391 Ribosomal subunit 8.167E-14 32 MRPS21/ RPL3L/RPS9
GO:0015934 Large ribosomal subunit 2.840E-13 25 RPL39/ RPL35/RPL34
GO:0022626 Cytosolic ribosome 1.47E-12 24 RPL38/ RPS28/NAA10
Biological Process (BP) GO:0006614 SRP-dependent cotranslational protein targeting membrane 4.993E-14 23 ZFAND2B/ RPS11/RPL38
GO:0006613 Cotranslational protein targeting to membrane 1.7E-13 23 ZFAND2B/ UBA52/RPL41
GO:0019083 Viral transcription 4.71E-13 30 TRIM31/ RPS28/POLR2E

Note: Genes are highlighted in italicized.

Figure 3.

Figure 3

Continued.

Figure 3.

Figure 3

GO and KEGG enrichment analysis of down and up-regulated differentially expressed genes. (A) GO terms in up-regulated differentially expressed genes. (B) GO terms in down-regulated differentially expressed genes. (C) Statistics of Pathway Enrichment in up-regulated differentially expressed genes. (D) Statistics of Pathway Enrichment in down-regulated differentially expressed genes.

Table 5.

Analysis of KEGG Pathways of Up/Down Regulated Differentially Expressed Genes

Pathway ID Term Number of Genes P-value Top Three Upregulated Gene
Upregulated differentially expressed genes
hsa04120 Ubiquitin-mediated proteolysis 12 0.005185 UBR5/CUL4A/KLHL13
hsa04670 Leukocyte transendothelial migration 10 0.0052 PIK3R1/TXK/PTPN11
hsa04014 Ras signaling pathway 16 0.005699 PIK3R1/RASAL2/CHUK
Down-regulated differentially expressed genes
hsa03010 Ribosome 29 1.3199E-14 RPL3L/ MRPL4/MRPS21
hsa00190 Oxidative phosphorylation 24 1.2107E-10 COX8A/NDUFA11/NDUFS6
hsa05016 Huntington disease 29 2.249E-10 BAX/ NDUFS6/NDUFA1

Note: Genes are highlighted in italicized.

Differentially Methylated and Expression Genes

In total, 328 differentially methylated and expressed genes were found between normal and breast cancer samples. Of these 328 genes, 247 genes were found in DMR other than the promoter region, while 81 genes were related to promoter regions (Figure 4). The most significant differentially methylated and expressed genes, with a significant P-value of <0.05, which found in gene ontology (GO) are pumilio RNA binding family member 1 (PUM1) and zinc finger AN1-type containing 2B (ZFAND2B) also known as (AIRAPL). The expression level of PUM1 was significantly high in breast cancer compared to normal samples (with a P value of 9.41E-06) and the DMR level at the promoter region was low in breast cancer compared to normal samples. On the contrary, the degree of expression of AIRAPL was significantly low (hypomethylation) in breast cancer compared to normal samples (with a P value of 0.00067) and the DMR level at promoter region was high (hypermethylation) in breast cancer compared to normal samples.

Figure 4.

Figure 4

Venn diagram represents the number of differential methylated and expressed gene between normal and breast cancer samples.

PUM1 and AIRAPL Promoter Region and Transcription Factors (TFs)

MethPrimer online software (http://www.urogene.org/methprimer/) was used to explore the possible impact of DNA methylation on CpG islands at promoter regions in regulating the expression of PUM1 and AIRAPL. Two expected CpG islands were discovered in promoter regions of both genes (Figure 5A). Moreover, gene Cards (https://www.genecards.org) were used to identify the significant TFs for both genes. Specificity protein 1 (Sp1) was shown to be one of the most significant transcription factors (TFs) of both the PUM1 and AIRAPL genes. It has been found that one TF binding site of SP1 is in the PUM1 promoter region and two of them are in the promoter and first exon of the AIRAPL gene (Figure 5B).

Figure 5.

Figure 5

PUM1 and AIRAPL promoter region and transcription factors. (A) CpG islands in PUM1 and AIRAPL. (B) SP1 binding sites in PUM1 and AIRAPL.

Discussion

Globally, breast cancer has a significant death rate, mostly due to late diagnosis. DNA methylation patterns are connected with gene suppression or activation in numerous kinds of cancer. Emerging research reveals that aberrant DNA methylation is strongly related to the diagnosis and prognosis of breast cancer.11 Moreover, cancers in humans are characterized by widespread changes in DNA methylation patterns. The current study aimed to determine novel prognostic biomarkers for breast cancer in promoter regions. Whole-genome bisulfite sequencing was done to analyze DMRs and DEG for breast cancer-related genes.

According to the present results, 116 million clean readings per sample were acquired using filtering the WBI Novaseq 6000 raw data. About 0.86% of cytosine sites were methylated. This result is lower than a previous study which found that the rate of 5mC ranged from 1.9 to 4.3%).12 The explanation for the greater drop in methylcytosine compared to the previous study is that most of the breast cancer samples in our study were hypomethylated. These changes could be related to aberrant hypo- and hypermethylation distribution in the tumor cells of numerous (CpG) islands, which are mainly associated with gene promoters.13 Consequently, DNA hypomethylation is frequently detected at the early onset of carcinogenesis, although it may also be related to tumor development.14 In addition, DNA hypomethylation is often detected in the early stages of carcinogenesis or aberrant non-cancerous tissue.15 In a previous study, hypomethylation was suggested as a potential risk factor associated with breast cancer among women.16 Similarly, our bioinformatic findings revealed that DMRs are associated with differences in hypomethylation in different regions such as introns, promoters, and other regions, which revealed a correlation between hypomethylation and breast cancer patients in Saudi women since our patients were at an early stage (I and II). Correspondingly, Shen et al evaluated the methylation levels in the placenta in 22 pairs of chromosomes and discovered that chromosome 6 included a high number of DMRs accounting for 15% of all DMRs, while no DMR was detected on chromosome 9. Furthermore, the greatest incidence of hypermethylation and hypomethylation was seen on chromosome 12, while the lowest rate of hypomethylation was observed on chromosome 17.17 On the contrary, Zhou et al determined that chromosome 17 had the highest mean frequency of hypomethylated sites, whereas chromosome 6 had the highest frequency of hypermethylated sites.18 The present study revealed that chromosome 17 had both hypo and hypermethylation events, while chromosome 19 and chromosome X had hypomethylation events only. This suggested that environmental factors and lifestyle have been implicated in the variation in an individual’s DNA methylation patterns.

Previous research highlighted the significance of the GO enrichment analysis of differentially methylated genes and identified relevant GO keywords for cell adhesion-related biological processes, such as hemophilic cell adhesion through plasma membrane adhesion molecules. In addition, genes involved in the sister chromatid cohesion, mitotic nuclear division, chromosomal segregation, apoptotic processes, and cell cycle were hypomethylated.19,20 In this present study, hypermethylation-stimulated biological processes including negative regulation of neutrophil degradation and blood vessel remodeling were identified. Meanwhile, hypomethylation in CC, including the nucleus, nucleoplasm, nuclear part, nuclear lumen, and intracellular membrane-bounded organelle, was detected. Moreover, the most significant functional enrichment analysis (GO) was MF in up/down-regulated genes. Upregulated genes were involved in the activity of DNA-dependent ATPase, ubiquitin-protein transferase, and ubiquitin-like protein transferase, whereas downregulated genes were involved in the structural constituent of ribosome, electron transfer activity, and sulfurtransferase activity. These findings were also reported by Chen et al, who found that the upregulated differentially expressed genes were enriched in transcription factor activity, sequence-specific DNA binding, and nucleic acid binding transcription factor activity, and the downregulated differentially expressed genes were enriched in receptor binding and identical protein binding.21

DNA methylation may be variably regulated in genes that impact signaling pathways such as the cell cycle and the p53 signaling pathway. According to our results, hypomethylated genes are related to pathways that control ribosome biosynthesis, N-Glycan biosynthesis, and the insulin signaling pathway; therefore, it was assumed that glucose metabolism contributed to resistance to anti-cancer drugs and cell proliferation.20 Furthermore, hypermethylated genes contributed to the Notch signaling pathway, regulation of lipolysis in adipocytes, and pathways in cancer which are significantly affected in breast cancer.

Otherwise, the KEGG pathway analyses for upregulated genes were detected in ubiquitin-mediated proteolysis, leukocyte transendothelial migration, and Ras signaling pathway. Downregulated genes were found to be involved in the ribosome, oxidative phosphorylation, and Huntington’s disease. Despite prior results by Zhang et al, the study showed that downregulated genes enriched in cytokine–cytokine receptor interaction, calcium signaling pathway, and T cell receptor signaling pathway.22

The most striking observation to emerge from our data was identifying two coding-protein genes (PUM1 and AIRAPL) which are significantly differentially methylated and expressed genes, with a P value of <0.05, which are also found in gene ontology (GO) analysis. PUM1 is engaged in a variety of physiological processes, including the cell cycle, DNA repair, and cell renewal. It has been demonstrated that PUM1 functions as an oncogene in ovarian cancer and other types of malignancies.23 As per our findings, PUM1 was upregulated in breast cancer lineage cells and tissues.24 The encoded protein of this gene consists of eight repeats and N- and C-terminal flanking regions to form a sequence-specific RNA binding domain that acts as a post-transcriptional repressor by binding to the 3’-UTR of mRNA targets where the Pumilio Response Element (PRE), 5’-UGUANAUA-3’ leading to translational inhibition and mRNA degradation. PUM1 targets 48 genes such as DNA polymerase processivity factor proliferating cell nuclear antigen (PCNA) and Ubiquitin-conjugating enzyme E2 A (UBE2A) which are related to DNA repair and apoptosis, therefore decreasing the expression of PMUI by mediating its methylated prompter could be induced increasing its target gene expression and enhancing apoptosis and responding to anti-cancer drugs, such as cisplatin.25 Research in this area has shown that PUM1 is overexpressed in breast cancer lineage cells and tissues.24

Likewise, AIRAPL is a ubiquitin-binding protein found on the endoplasmic reticulum (ER) membranes necessary for maintaining cellular folding capacity, and its dysfunction leads to accelerated aging and protein aggregation.26 The upregulation of AIRAPL was most significant between normal and breast cancer due to lower methylation levels as hypomethylation. On the other hand, the crystal three-dimensional structure of AIRAPL protein was solved by Simin Rahighi who found that AIRAPL has three molecules; each domain comprises two ubiquitin-interacting motifs in tandem (tUIMs) and zinc motif in the medal.27 As a result, the protein encoded in this gene plays a part in maintaining the homeostasis of target proteins by controlling the translocation and proteasomal degradation of developing proteins at the endoplasmic reticulum. Additionally, it participates in the destruction of target proteins by ubiquitin-mediated proteasomes when signal-mediated translocation to the endoplasmic reticulum is unsuccessful. The zinc-finger domains are crucial in regulating the immunological response, and loss of AIRAPL function accelerates senescence and protein aggregation.26 Our findings showed that the methylation level within the promoter region of this gene was increased in breast cancer patients compared to normal individuals. Consequently, the overexpression of genes was decreased in breast cancer patients compared to the normal. In addition, Specificity protein 1 (Sp1) was shown to be one of the most significant transcription factors (TFs) on the PUM1 and AIRAPL genes. SP1 is a zinc-finger protein belonging to the SP family of transcription factors. The standard sequence of the SP1-binding site in GpC-rich promoter regions is 5’-(GGGGCGGC)-3’ where a high level of DNA methylation at that site could prevent the binding of SP1 to the target sequence resulting in the repression of gene expression. Similarly, a previous study revealed that the CpG island hypermethylation of the proximal promoter region of the FOXF2 inhibited the affinity of SP1 to its binding site resulting in the downregulation of FOXF2 expression.28 In our view, the most compelling explanation for the present set of findings is that the promoted hypomethylation on CpG islands may increase the expression of PUM1 which is contrary to the previous study, while the promoted hypermethylation on CpG islands could decrease the expression of the AIRAPL gene by controlling the affinity of SP1 to its binding site at the promoter region.

Conclusions

In the current study, a potentially significant connection between breast cancer patients and global hypomethylation has been detected. In addition, DMGs and DEG are closely related to ubiquitin-protein transferase activity, ubiquitin-mediated proteolysis, and oxidative phosphorylation. The essential outcomes of this study suggested that aberrant hypermethylation at critical genes that have significant roles in the molecular pathways of breast cancer could be used as a potential prognostic biomarker for breast cancer. The most significantly differentially methylated and expressed genes are PUM1 and AIRAPL. These findings added critical information to our understanding of the epigenetic mechanisms that have the potential to be used as breast cancer biomarkers.

Acknowledgments

The authors wish to express gratitude to the Central Laboratory for Biological Sciences at the Science College at King Abdulaziz University for providing advice and support.

Funding Statement

This research work was funded by Institutional Fund Projects under grant no. (IFPHI-096-247-2020). Therefore, the authors gratefully acknowledge technical and financial support from the Ministry of Education and King Abdulaziz University, DSR, Jeddah, Saudi Arabia.

Participation Approval and Ethical Approval

The investigation was conducted in accordance with the declaration of Helsinki and was approved by the King Abdulaziz University biomedical ethics unit’s ethics committee (No.349-21). The participants signed informed consent forms.

Disclosure

The authors report no conflicts of interest in this work.

References

  • 1.Soerjomataram I, Bray F. Planning for tomorrow: global cancer incidence and the role of prevention 2020–2070. Nat Rev Clin Oncol. 2021;18(10):663–672. doi: 10.1038/s41571-021-00514-z [DOI] [PubMed] [Google Scholar]
  • 2.Garcia-Martinez L, Zhang Y, Nakata Y, Chan HL, Morey L. Epigenetic mechanisms in breast cancer therapy and resistance. Nat Commun. 2021;12(1). doi: 10.1038/s41467-021-22024-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Al-Moghrabi N, Al-Qasem AJS, Aboussekhra A. Methylation-related mutations in the BRCA1 promoter in peripheral blood cells from cancer-free women. Int J Oncol. 2011;39(1):129–135. doi: 10.3892/ijo.2011.1021 [DOI] [PubMed] [Google Scholar]
  • 4.Moore LD, Le T, Fan G. DNA methylation and its basic function. Neuropsychopharmacology. 2013;38(1):23–38. doi: 10.1038/npp.2012.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Chen DP, Lin YC, Fann CSJ. Methods for identifying differentially methylated regions for sequence- and array-based data. Brief Funct Genomics. 2016;15(6):485–490. doi: 10.1093/bfgp/elw018 [DOI] [PubMed] [Google Scholar]
  • 6.Park SM, Choi EY, Bae M, Choi JK, Kim YJ. A long-range interactive DNA methylation marker panel for the promoters of HOXA9 and HOXA10 predicts survival in breast cancer patients. Clin Epigenetics. 2017;9(1). doi: 10.1186/s13148-017-0373-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wielsøe M, Tarantini L, Bollati V, Long M, Bonefeld-Jørgensen EC. DNA methylation level in blood and relations to breast cancer, risk factors and environmental exposure in Greenlandic Inuit women. Basic Clin Pharmacol Toxicol. 2020;127(4):338–350. doi: 10.1111/bcpt.13424 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Xu Z, Sandler DP, Taylor JA. Blood DNA methylation and breast cancer: a prospective case-cohort analysis in the sister study. J Natl Cancer Inst. 2020;112(1):87–94. doi: 10.1093/jnci/djz065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Young MD, Wakefield MJ, Smyth GK, Oshlack A. Open access METHOD gene ontology analysis for RNA-Seq: accounting for selection bias GOseq GOseq Is a Method for GO analysis of RNA-Seq data that takes into account the length bias inherent in RNA-Seq. Genome Biol. 2010;11:1–2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kanehisa M, Araki M, Goto S, et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36(SUPPL. 1). doi: 10.1093/nar/gkm882 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hong MX, Ye Q, Bing ZG, et al. Identification of differentially methylated genes as diagnostic and prognostic biomarkers of breast cancer. World J Surg Oncol. 2021;19(1). doi: 10.1186/s12957-021-02124-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Olivia OY, Jianga P. Genome-wide detection of cytosine methylation by single molecule real-time sequencing. Proc Natl Acad Sci. 2021;118(5):2019768118. doi: 10.1073/pnas.2019768118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Pfeifer GP. Defining driver DNA methylation changes in human cancer. Int J Mol Sci. 2018;19(4):1166. doi: 10.3390/ijms19041166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ehrlich M. DNA hypomethylation in cancer cells. Epigenomics. 2009;1(2):239–259. doi: 10.2217/EPI.09.33 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Park SY, Yoo EJ, Cho NY, Kim N, Kang GH. Comparison of CpG island hypermethylation and repetitive DNA hypomethylation in premalignant stages of gastric cancer, stratified for Helicobacter pylori infection. J Pathol. 2009;219(4):410–416. doi: 10.1002/path.2596 [DOI] [PubMed] [Google Scholar]
  • 16.Choi JY, James SR, Link PA, et al. Association between global DNA hypomethylation in leukocytes and risk of breast cancer. Carcinogenesis. 2009;30(11):1889–1897. doi: 10.1093/carcin/bgp143 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Shen Z, Tang Y, Song Y, Shen W, Zou C. Differences of DNA methylation patterns in the placenta of large for gestational age infant. Medicine. 2020;99(39):e22389. doi: 10.1097/MD.0000000000022389 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhou Q, Chen Q, Chen X, Hao L. Bioinformatics analysis to screen DNA methylation-driven genes for prognosis of patients with bladder cancer. Transl Androl Urol. 2021;10(9):3604–3619. doi: 10.21037/tau-21-326 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ikuno A, Akeda K, Takebayashi SI, Shimaoka M, Okumura K, Sudo A. Genome-wide analysis of DNA methylation profile identifies differentially methylated loci associated with human intervertebral disc degeneration. PLoS One. 2019;14(9):e0222188. doi: 10.1371/journal.pone.0222188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Cai C, Wang W, Tu Z. Aberrantly DNA methylated-differentially expressed genes and pathways in hepatocellular carcinoma. J Cancer. 2019;10(2):355–366. doi: 10.7150/jca.27832 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Chen B, Tang H, Chen X, et al. Transcriptomic analyses identify key differentially expressed genes and clinical outcomes between triple-negative and non-triple-negative breast cancer. Cancer Manag Res. 2019;11:179–190. doi: 10.2147/CMAR.S187151 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zhang R, Chang C, Jin Y, et al. Identification of DNA methylation-regulated differentially expressed genes in RA by integrated analysis of DNA methylation and RNA-Seq data. J Transl Med. 2022;20(1):481. doi: 10.1186/s12967-022-03664-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Yang Y, Su X, Shen K, et al. PUM1 is upregulated by DNA methylation to suppress antitumor immunity and results in poor prognosis in pancreatic cancer. Transl Cancer Res. 2021;10(5):2153–2168. doi: 10.21037/tcr-20-3295 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Silva ILZ, Kohata AA, Shigunov P. Modulation and function of Pumilio proteins in cancer. Semin Cancer Biol. 2022;86:298–309. doi: 10.1016/j.semcancer.2022.03.010 [DOI] [PubMed] [Google Scholar]
  • 25.Yamada T, Imamachi N, Imamura K, et al. Systematic analysis of targets of pumilio-mediated mRNA decay reveals that PUM1 repression by DNA damage activates translesion synthesis. Cell Rep. 2020;31(5):107542. doi: 10.1016/j.celrep.2020.107542 [DOI] [PubMed] [Google Scholar]
  • 26.Glinka T, Alter J, Braunstein I, et al. Signal-peptide-mediated translocation is regulated by a p97-AIRAPL complex. Biochem J. 2014;457(2):253–261. doi: 10.1042/BJ20130710 [DOI] [PubMed] [Google Scholar]
  • 27.Rahighi S, Braunstein I, Ternette N, et al. Selective binding of AIRAPL tandem UIMs to Lys48-linked tri-ubiquitin chains. Structure. 2016;24(3):412–422. doi: 10.1016/j.str.2015.12.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Tian HP, Lun SM, Huang HJ, et al. DNA methylation affects the SP1-regulated transcription of FOXF2 in breast cancer cells. J Biol Chem. 2015;290(31):19173–19183. doi: 10.1074/jbc.M114.636126 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Breast Cancer : Targets and Therapy are provided here courtesy of Dove Press

RESOURCES