Computational and Mathematical Methods in Medicine. 2022 Sep 28;2022:2679050. doi: 10.1155/2022/2679050

A Methylation Diagnostic Model Based on Random Forests and Neural Networks for Asthma Identification

Dong-Dong Li 1,2, Ting Chen 3, You-Liang Ling 1, YongAn Jiang 1, Qiu-Gen Li 1,2
PMCID: PMC9534672  PMID: 36213574

Abstract

Background

Asthma significantly impacts human life and health as a chronic disease. Traditional treatments for asthma have several limitations. Artificial intelligence aids in cancer treatment and may also accelerate our understanding of asthma mechanisms. We aimed to develop a new clinical diagnosis model for asthma using artificial neural networks (ANN).

Methods

Three datasets (GSE85566, GSE40576, and GSE137716) were downloaded from the Gene Expression Omnibus (GEO). Differentially expressed CpGs (DECs) were identified and subjected to Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Random forest (RF) and ANN algorithms were then used to select characteristic DECs and build the clinical model. Two external validation datasets (GSE40576 and GSE137716) were used to validate the diagnostic ability of the model.

Results

Differential analysis with the methylation analysis tool ChAMP identified 121 up-regulated and 20 down-regulated DECs. GO analysis showed enrichment of actin cytoskeleton organization and cell-substrate adhesion, and KEGG analysis showed enrichment of shigellosis and serotonergic synapse. RF analysis identified 10 crucial DECs (cg05075579, cg20434422, cg03907390, cg00712106, cg05696969, cg22862094, cg11733958, cg00328720, cg13570892, and cg03325522), and the ANN built the clinical model from these 10 DECs. In the two external validation datasets, the area under the curve (AUC) was 1.000 for GSE137716 and 0.950 for GSE40576, supporting the reliability of the model.

Conclusion

Our findings provide new methylation markers and clinical diagnostic models for asthma diagnosis and treatment.

1. Introduction

Asthma is a chronic, heterogeneous respiratory disease that affects people of all age groups. In recent years, asthma-related morbidity and mortality have increased year by year. The clinical manifestations of asthma are mainly respiratory symptoms, and the main pathological features include chronic airway inflammation, airway hyperresponsiveness, and airway remodeling [1–3]. Immunoglobulin E (IgE), interleukin-5 (IL-5) and its receptor, and the interleukin-4 (IL-4) receptor are used as molecular targets for the clinical diagnosis of asthma; however, differences between individuals are very large, and the clinical treatment of asthma patients is still inadequate [4, 5].

DNA methylation, a major epigenetic mechanism in humans, has a profound effect on the occurrence and development of various diseases [6, 7]. There is substantial evidence that the mechanisms and characteristics of asthma depend on methylation patterns. Gaffin et al. [8] studied DNA methylation in peripheral blood mononuclear cells and airway epithelial cells of atopic asthmatic, non-atopic asthmatic, and healthy children and confirmed that methylation at multiple CpG sites in the promoter region of the ADRB2 gene was associated with reduced dyspnea in children. DNA methylation may therefore provide new options for asthma treatment [9, 10].

Although multiple studies have attempted to distinguish diseased from healthy individuals by identifying CpG loci, the results have not been encouraging [11]. Reliable quantitative measurement using fewer markers is a viable option. The application of machine learning in medicine has significantly accelerated research into diseases [12, 13], and machine learning can describe the complexity and unpredictability of human diseases, as reported in various studies [14–16]. Cao et al. [17] identified key genes for Th2-high asthma using weighted gene co-expression network analysis. However, there is currently no standard diagnostic model for the screening and early detection of asthma. Machine learning methods such as random forests (RF) and artificial neural networks (ANN) have developed rapidly and are frequently used in biomarker research [18–21].

To our knowledge, this is the first study to analyze the methylation profiles of asthma samples with machine learning (RF and ANN) in order to obtain diagnostic DECs. The aim was to identify asthma from methylation data. The diagnostic performance of the model was evaluated with receiver operating characteristic (ROC) curves, and external validation datasets confirmed its performance. The workflow of the study is shown in Figure 1.

Figure 1. The workflow of our study.

2. Methods and Materials

2.1. Data Acquisition and Preprocessing

The methylation profiles GSE85566 [22] (74 asthma samples and 41 normal samples), GSE40576 [23] (97 asthma samples and 97 normal samples), and GSE137716 [24] (16 asthma samples and 17 normal samples) were downloaded from the Gene Expression Omnibus (GEO) database. Missing values were imputed with the ChAMP package, and the data were then normalized.

2.2. Differential Analysis and Design Grouping of GSE85566 Methylation Expression Profiles

Probes were filtered (p-value < 0.01) with the champ.filter function of the ChAMP package (version 2.24.0). Differential CpG analysis (deltaBeta > 0) was then performed with ChAMP, and a heat map of the top 1000 CpGs was drawn from the results. The thresholds for DECs were deltaBeta < -0.05 and p-value < 10^-8. DECs were matched to gene symbols based on the 450k methylation array annotation for the subsequent GO and KEGG analyses (clusterProfiler, version 4.3.3). All analyses were performed in the R environment.
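The filtering logic can be illustrated with a minimal Python sketch. It is not the ChAMP implementation: the function name `select_decs`, the toy beta values, and the CpG labels are hypothetical, the adjusted p-values are assumed to have been computed elsewhere, and applying the 0.05 cutoff to the absolute value of deltaBeta (so that both up- and down-regulated CpGs can pass) is an assumption made only for illustration.

```python
from statistics import mean

def select_decs(beta_asthma, beta_normal, adj_p, delta_cut=0.05, p_cut=1e-8):
    """Label CpGs that pass both cutoffs as 'UP' or 'DOWN'.

    beta_asthma, beta_normal: dict mapping CpG id -> list of beta values (0..1).
    adj_p: dict mapping CpG id -> adjusted p-value (assumed precomputed).
    """
    selected = {}
    for cpg, p in adj_p.items():
        # deltaBeta: difference in mean methylation between the two groups
        delta = mean(beta_asthma[cpg]) - mean(beta_normal[cpg])
        if p < p_cut and abs(delta) > delta_cut:
            selected[cpg] = "UP" if delta > 0 else "DOWN"
    return selected

# Hypothetical toy data for three CpGs
beta_asthma = {"cgA": [0.80, 0.82, 0.78], "cgB": [0.30, 0.28, 0.32], "cgC": [0.50, 0.51, 0.49]}
beta_normal = {"cgA": [0.60, 0.62, 0.58], "cgB": [0.45, 0.47, 0.43], "cgC": [0.50, 0.50, 0.50]}
adj_p = {"cgA": 1e-10, "cgB": 1e-9, "cgC": 1e-12}

decs = select_decs(beta_asthma, beta_normal, adj_p)
print(decs)
```

In this toy example the third CpG has a very small p-value but almost no difference in mean methylation, so it is excluded by the deltaBeta cutoff.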

2.3. Random Forest (RF) Classification

The DECs obtained by ChAMP were first classified with the R package randomForest (version 4.7.1). The number of variables tried at each split (mtry) was chosen by minimizing the mean error rate (err.rate) of the model over all DECs. In this study, the optimal mtry was 7, and the optimal number of trees was 600. The 10 DECs with the largest mean decrease in the Gini index were selected as asthma-specific candidates. A heat map of these DECs was drawn with pheatmap (version 1.0.12) to show their classification ability.
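To make the Gini-based ranking concrete, the following Python sketch computes the Gini impurity of a set of labels and the impurity decrease produced by one split on one variable. In a random forest, the mean decrease in Gini for a variable is obtained by summing such decreases over all splits that use the variable and averaging over the trees. The function names and the toy data are hypothetical, and the sketch does not reproduce the randomForest package.

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 1 - sum of squared class proportions."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def gini_decrease(values, labels, threshold):
    """Impurity decrease when samples are split at `threshold` on one variable."""
    left = [y for x, y in zip(values, labels) if x <= threshold]
    right = [y for x, y in zip(values, labels) if x > threshold]
    n = len(labels)
    weighted_children = len(left) / n * gini(left) + len(right) / n * gini(right)
    return gini(labels) - weighted_children

# Hypothetical beta values of one CpG in six samples
values = [0.1, 0.2, 0.3, 0.7, 0.8, 0.9]
labels = ["normal", "normal", "normal", "asthma", "asthma", "asthma"]

print(gini(labels))                        # impurity before the split
print(gini_decrease(values, labels, 0.5))  # decrease for a perfectly separating split
```

A CpG that separates the two groups cleanly yields a large decrease at every node where it is used, which is why it rises to the top of the importance ranking.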

2.4. Artificial Neural Network Model Construction

The artificial neural network model of the important candidate variables was constructed with the R package neuralnet (version 1.44.2). As a rule of thumb, the number of hidden neurons should be 2/3 of the size of the input layer plus 2/3 of the size of the output layer, and it should lie between the sizes of the input and output layers. The expression profile data were normalized to the range 0 to 1 and passed to neuralnet. The output classes were set to normal and asthma, and the output of the first hidden layer (the input to the output layer) was taken as the gene weights. Training stopped when the absolute partial derivatives of the error function fell below the threshold of 0.01.
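The scaling to the range 0 to 1 can be sketched as follows. The text does not state which scaling method was used; min-max scaling per variable is assumed here, and the function name and input values are hypothetical.

```python
def min_max_scale(values):
    """Rescale a list of numbers linearly so that min -> 0 and max -> 1."""
    low, high = min(values), max(values)
    if high == low:
        # A constant variable carries no information; map it to 0
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]

# Hypothetical methylation values of one CpG across five samples
raw = [0.35, 0.50, 0.42, 0.65, 0.58]
scaled = min_max_scale(raw)
print(scaled)
```

Scaling each input to a common range keeps any single variable from dominating the weight updates during training.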

2.5. Model Performance Evaluation

Several R packages (R version 4.1.3, https://www.r-project.org) were used to evaluate model performance. Model predictions were assessed with the confusionMatrix function of caret (version 6.0-91). ROC curves and the area under the curve (AUC) were computed with pROC (version 1.18.0) and plotted with ggplot2 (version 3.3.5). For comparison, Classification and Regression Trees (CART; rpart, version 4.1.16), Support Vector Machines (SVM; e1071, version 1.7-9), and eXtreme Gradient Boosting (XGBoost; xgboost, version 1.6.0.1) were applied to the GSE40576 and GSE137716 datasets.
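The quantities reported by a confusion matrix can be illustrated with a short Python sketch. It is a stand-alone illustration, not the caret implementation; the function name and the toy labels are hypothetical, and both classes are assumed to be present in the true labels.

```python
def confusion_metrics(y_true, y_pred, positive="asthma"):
    """Accuracy, sensitivity, and specificity for a two-class problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
    }

# Hypothetical true and predicted labels for eight samples
y_true = ["asthma"] * 4 + ["normal"] * 4
y_pred = ["asthma", "asthma", "asthma", "normal",
          "normal", "normal", "normal", "asthma"]

metrics = confusion_metrics(y_true, y_pred)
print(metrics)
```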

3. Results

3.1. CpGs Landscape of GSE85566

Methylation plays a key role in various diseases [25–27]. The champ.DMP function of the ChAMP package was used to analyze the methylation profile of GSE85566 (74 asthma samples and 41 normal samples), to characterize the methylation structure of asthma samples and to identify differential CpG sites. The heat map of the top 1000 CpGs (asthma and normal samples) is displayed in Figure 2(a). To find methylation targets that differentiate asthma from healthy samples, the DECs (asthma vs. healthy) of this dataset were identified with champ.DMP and are presented in the volcano plot (Figure 2(b)). With thresholds of adj.P.Val < 10^-8 and deltaBeta < -0.05, 121 up-regulated and 20 down-regulated DECs were obtained; they are shown in the heat map (Figure 2(c)). In the heat map, the asthma group (blue) and the healthy group (red) are almost separable, but some asthma samples are still mixed into the healthy group. Thus, the ability of the DECs to distinguish asthma from healthy samples still needs to be improved.

Figure 2. Methylation landscape of GSE85566. (a) Heat map of the top 1000 most divergent CpGs; the gradient from dark blue to yellow represents the change in expression level. (b) Volcano plot of the differential analysis (asthma vs. healthy). The X-axis is log(deltaBeta), and the Y-axis is -log10(adj.P.Val); DOWN (red): down-regulated DECs; UP (gray): up-regulated DECs; NOT (dark blue): not significant. (c) Heat map of DECs. Dark blue to light blue indicates high to low expression, green represents asthma samples, red represents healthy samples, and the clustering tree groups similar samples together.

3.2. GO and KEGG Analyses of DECs

GO and KEGG analyses were used to understand the biological function and regulation of the DECs. GO results indicated that regulation of actin cytoskeleton organization and cell-substrate adhesion were enriched (Figure 3(a)). KEGG analysis showed enrichment in shigellosis and serotonergic synapse (Figure 3(b)). These results support a role for methylation in the pathogenesis of asthma. Distinguishing asthmatic from normal individuals through a single CpG site or a multi-CpG model remains an urgent problem.

Figure 3. GO and KEGG analysis results. (a) GO analysis (including molecular function, cellular component, and biological process). (b) KEGG analysis.

3.3. Differential CpGs (DECs) in the Random Forest (RF)

The above results provided a preliminary understanding of the key role of methylated CpGs in asthma. Although the DECs helped to differentiate asthma from healthy samples, the separation was not satisfactory (Figure 2(c)). These DECs were therefore used as the input of the random forest classifier. To make the error rate as small as possible, we calculated the mean error rate (err.rate); the number of variables per split was set to 7, and the final random forest model used 600 trees to ensure that the error was stable (Figure 4(a)). Variable importance in the random forest model was obtained as MeanDecreaseAccuracy and MeanDecreaseGini (Figure 4(b)). The 10 most important DECs were identified (cg05075579, cg20434422, cg03907390, cg00712106, cg05696969, cg22862094, cg11733958, cg00328720, cg13570892, and cg03325522) and taken as candidates for the subsequent classification. Among these DECs, cg05075579 was the most important, with the highest mean decrease in the Gini index (Table 1). The heat map (Figure 4(c)) showed that these 10 DECs clustered asthma samples together better than the full DEC set in Figure 2(c).

Figure 4. (a) The effect of the number of decision trees on the error rate. The X-axis is the number of decision trees, and the Y-axis is the error rate. Adding more trees did not reduce the error rate further. (b) The top 10 DECs from the random forest, listed in order of importance according to MeanDecreaseAccuracy (left) and MeanDecreaseGini (right). (c) Hierarchical clustering of the 10 DECs in the GSE85566 dataset; dark colors represent high expression, light colors represent low expression, the red band above the heat map represents normal samples, and green represents asthma samples.

Table 1. MeanDecreaseGini of the 10 DECs from the random forest analysis.

CpG MeanDecreaseGini
cg05075579 1.427152626
cg20434422 1.421703772
cg03907390 1.395149582
cg00712106 1.355657539
cg05696969 1.099293192
cg22862094 0.981588017
cg11733958 0.933543882
cg00328720 0.897459032
cg13570892 0.880545548
cg03325522 0.879691331

DECs: differentially expressed CpGs.

3.4. The Construction of Artificial Neural Network Model

The random forest classifier identified the 10 most important DECs, which had a significant discriminative effect between asthma and healthy samples. The artificial neural network calculated the weights of these 10 DECs in the GSE85566 methylation profile, using an input layer of 10 neurons, a hidden layer of seven neurons, and an output layer of two neurons, to construct a new model (Figure 5(a)). To evaluate the neural network model, we used 10-fold cross-validation: the data were randomly divided into training and validation sets, and the results were visualized with the pROC package (Figure 5(b)). In addition, we used the confusion matrix of the caret package to evaluate the accuracy of the neural network model (accuracy: 0.9739). In this way, we developed a novel model based on methylation profiles to classify asthma and healthy samples.
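The random partitioning behind 10-fold cross-validation can be sketched in Python as follows. This is an illustrative partition only; the function name and the fixed seed are hypothetical, the sample count of 115 is taken from GSE85566 (74 asthma and 41 normal samples), and the sketch does not stratify by class.

```python
import random

def k_fold_indices(n_samples, k=10, seed=0):
    """Return k (training, validation) index pairs covering all samples."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # reproducible random order
    # Every k-th index goes to the same fold, so fold sizes differ by at most one
    folds = [indices[i::k] for i in range(k)]
    splits = []
    for i, validation in enumerate(folds):
        training = [j for m, fold in enumerate(folds) if m != i for j in fold]
        splits.append((training, validation))
    return splits

splits = k_fold_indices(115, k=10)
for training, validation in splits:
    print(len(training), len(validation))
```

Each sample is used for validation exactly once, so the pooled validation predictions cover the whole dataset without any sample being scored by a model that was trained on it.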

Figure 5. Neural network topology. (a) Visualization of the artificial neural network for the training dataset. (b) ROC curves for the model built from the 10 DECs (cg05075579, cg20434422, cg03907390, cg00712106, cg05696969, cg22862094, cg11733958, cg00328720, cg13570892, and cg03325522) under 10-fold cross-validation.

3.5. ROC Identification of the Dataset

Having built the neural network classifier for asthma and normal samples, we used two external methylation datasets (GSE40576 and GSE137716) to evaluate its classification performance with receiver operating characteristic (ROC) curves (Figures 6(a) and 6(b)). For GSE137716, the AUC was 1.000, with sensitivity and specificity of 100% at the best threshold. For GSE40576, the AUC was 0.950, with sensitivity and specificity of 0.959 and 0.969, respectively. For comparison, the SVM, CART, and XGBoost algorithms (Table 2) gave AUCs of 0.825, 0.773, and 0.619 for GSE40576 and 0.938, 0.818, and 0.881 for GSE137716, respectively. These results indicate that our neural network model has high classification performance and outperformed the compared algorithms on both external datasets.
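The AUC and the best threshold can be illustrated with a small Python sketch. It computes the AUC as the fraction of (asthma, normal) pairs in which the asthma sample receives the higher score, and it picks the threshold that maximizes Youden's J (sensitivity + specificity - 1). The text does not state which criterion defined the best threshold, so the use of Youden's J is an assumption; the function names and the toy scores are hypothetical, and this is not the pROC implementation.

```python
def auc_rank(scores, labels):
    """AUC as the probability that a positive outscores a negative (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def best_threshold(scores, labels):
    """Return (J, threshold, sensitivity, specificity) maximizing Youden's J."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best = None
    for t in sorted(set(scores)):
        # A sample is called positive when its score is at least t
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= t)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < t)
        sens, spec = tp / n_pos, tn / n_neg
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, t, sens, spec)
    return best

# Hypothetical model scores; label 1 = asthma, 0 = normal
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

auc = auc_rank(scores, labels)
best = best_threshold(scores, labels)
print(auc, best)
```

An AUC of 1.000 corresponds to every asthma sample scoring above every normal sample, which is the situation reported for GSE137716.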

Figure 6. Classification performance of the neural network on two datasets. (a) ROC curve for the GSE137716 dataset. (b) ROC curve for the GSE40576 dataset. The X-axis is the specificity, and the Y-axis is the sensitivity; AUC is the area under the ROC curve. The point marked on each curve is the optimal threshold, and the values in parentheses are the corresponding sensitivity and specificity.

Table 2. ROC validation results of three machine learning models (SVM, CART, and XGBoost).

Methods    GSE40576 AUC    GSE40576 Specificity    GSE40576 Sensitivity    GSE137716 AUC    GSE137716 Specificity    GSE137716 Sensitivity
SVM        0.825           0.845                   0.804                   0.938            1.000                    0.875
CART       0.773           0.856                   0.691                   0.818            0.824                    0.812
XGBoost    0.619           0.619                   0.619                   0.881            0.824                    0.938

CART: Classification and Regression Trees; SVM: support vector machines; XGBoost: eXtreme Gradient Boosting; AUC: Area Under Curve.

4. Discussion

This was the first study to utilize DNA methylation-based machine learning to identify a series of asthma-related methylation loci (DECs). Interestingly, the selected methylation signatures were associated with actin cytoskeleton organization, cell-substrate adhesion, shigellosis, and serotonergic synapse, supporting the hypothesis that airway structural reorganization in asthma results from changes in DNA methylation in the epigenome [28, 29]. Ten specific DECs were then identified by RF, and the ANN model was built from the weights that the network assigned to them. The model had high accuracy and stability (the AUCs for the two external validation datasets were 1.000 and 0.950).

Recently, owing to the rapid advancement of computing power, artificial intelligence methods such as machine learning have been widely employed in medicine, including disease diagnosis and prognosis, thereby accelerating our understanding of various diseases and helping clinicians with patient management. Multiple studies have developed novel models to predict clinical outcomes of asthma [30–32]. In this study, we focused on the key role of epigenetics (methylation) in asthma. The asthma-related DECs were obtained through differential analysis, 10 crucial candidate DECs were identified with the random forest classifier, and asthma-related classification scores were generated by the artificial neural network. We also compared the classification efficiency of individual CpGs with that of the model.

We characterized the methylation landscape of the GSE85566 dataset and obtained 142 differentially expressed CpGs. GO analysis suggested enrichment in regulation of actin cytoskeleton organization [33], cell-substrate adhesion [34], and response to nutrient levels, and KEGG analysis identified the potential signaling pathways shigellosis, serotonergic synapse, and Yersinia infection. In addition, the 10 DECs obtained through MeanDecreaseGini importance screening of the random forest model provided the basis for the construction of the neural network model. The model was highly accurate (accuracy: 0.9739), and the results were validated with two other datasets, which confirmed its high classification performance (AUC: 1.000 and 0.950, respectively). We compared our model with other currently available machine learning algorithms (SVM, CART, and XGBoost) [35, 36] and found that the diagnostic ability of the ANN-based methylation model was higher than that of the other models.

There are several limitations to this study. First, our analysis was based on public online datasets, and differences between datasets introduce additional influencing factors that may bias the results. Second, the model could not be validated in clinical patient samples. Third, owing to the paucity of available methylation data, our data include peripheral blood mononuclear cells from children, which may have affected the results. In future work, we will verify our results in prospective studies, with the aim of bringing them into clinical practice and giving clinicians a basis for treatment planning.

5. Conclusion

In general, our neural network model based on methylation epigenetics has a significant clinical value for the prediction of asthma, which is beneficial for early diagnosis of asthma.

Contributor Information

YongAn Jiang, Email: 1296918592@qq.com.

Qiu-Gen Li, Email: tchlqg2021@163.com.

Data Availability

The data of this study were downloaded and compiled from the GEO database (https://www.ncbi.nlm.nih.gov/gds/?term=); data used to support the results of this study were obtained from the corresponding author.

Conflicts of Interest

This research does not include any research conducted by any author on human participants or animals. The authors declare no competing interests.

Authors' Contributions

All authors have contributed significantly, and all authors are in agreement with the content of the manuscript. Qiu-Gen Li and Yong'An Jiang contributed to the conceptualization. Dong-Dong Li contributed to the methodology. Dong-Dong Li was responsible for the software. Qiu-Gen Li contributed to the validation. Ting Chen contributed to the formal analysis. You-Liang Ling contributed to the data curation. Dong-Dong Li contributed to the writing – original draft. All authors contributed to the writing – review and editing. Dong-Dong Li contributed to the visualization. Qiu-Gen Li and Yong'An Jiang were responsible for the supervision.

References

1. Mims J. W. Asthma: definitions and pathophysiology. International Forum of Allergy & Rhinology. 2015;5(Supplement 1):S2–S6. doi: 10.1002/alr.21609.
2. Ntontsi P., Photiades A., Zervas E., Xanthou G., Samitas K. Genetics and epigenetics in asthma. International Journal of Molecular Sciences. 2021;22(5):p. 2412. doi: 10.3390/ijms22052412.
3. Miller R. L., Grayson M. H., Strothman K. Advances in asthma: new understandings of asthma’s natural history, risk factors, underlying mechanisms, and clinical management. The Journal of Allergy and Clinical Immunology. 2021;148(6):1430–1441. doi: 10.1016/j.jaci.2021.10.001.
4. Pelaia C., Crimi C., Vatrella A., Tinello C., Terracciano R., Pelaia G. Molecular targets for biological therapies of severe asthma. Frontiers in Immunology. 2020;11, article 603312. doi: 10.3389/fimmu.2020.603312.
5. Weiss S. T. Emerging mechanisms and novel targets in allergic inflammation and asthma. Genome Medicine. 2017;9(1):p. 107. doi: 10.1186/s13073-017-0501-6.
6. Legaki E., Arsenis C., Taka S., Papadopoulos N. G. DNA methylation biomarkers in asthma and rhinitis: are we there yet? Clinical and Translational Allergy. 2022;12(3), article e12131. doi: 10.1002/clt2.12131.
7. Hawe J. S., Wilson R., Schmid K. T., et al. Genetic variation influencing DNA methylation provides insights into molecular mechanisms regulating genomic function. Nature Genetics. 2022;54(1):18–29. doi: 10.1038/s41588-021-00969-x.
8. Gaffin J. M., Raby B. A., Petty C. R., et al. β-2 adrenergic receptor gene methylation is associated with decreased asthma severity in inner-city school children. Clinical & Experimental Allergy. 2014;44(5):681–689. doi: 10.1111/cea.12219.
9. Jiang Y., Xun Q., Wan R., et al. GLCCI1 gene body methylation in peripheral blood is associated with asthma and asthma severity. Clinica Chimica Acta. 2021;523:97–105. doi: 10.1016/j.cca.2021.09.006.
10. Renz H. DNA methylation and a biomarker panel to predict asthma development. The Journal of Allergy and Clinical Immunology. 2019;144(1):49–50. doi: 10.1016/j.jaci.2019.04.002.
11. Yu X., Yang Q., Wang D., Li Z., Chen N., Kong D. X. Predicting lung adenocarcinoma disease progression using methylation-correlated blocks and ensemble machine learning classifiers. PeerJ. 2021;9, article e10884. doi: 10.7717/peerj.10884.
12. Handelman G. S., Kok H. K., Chandra R. V., Razavi A. H., Lee M. J., Asadi H. eDoctor: machine learning and the future of medicine. Journal of Internal Medicine. 2018;284(6):603–619. doi: 10.1111/joim.12822.
13. Lo Vercio L., Amador K., Bannister J. J., et al. Supervised machine learning tools: a tutorial for clinicians. Journal of Neural Engineering. 2020;17(6):p. 062001. doi: 10.1088/1741-2552/abbff2.
14. Rauschert S., Raubenheimer K., Melton P. E., Huang R. C. Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification. Clinical Epigenetics. 2020;12(1):p. 51. doi: 10.1186/s13148-020-00842-4.
15. Zampieri G., Vijayakumar S., Yaneske E., Angione C. Machine and deep learning meet genome-scale metabolic modeling. PLoS Computational Biology. 2019;15(7), article e1007084. doi: 10.1371/journal.pcbi.1007084.
16. Chen J., Siu S. W. I. Machine learning approaches for quality assessment of protein structures. Biomolecules. 2020;10(4):p. 626. doi: 10.3390/biom10040626.
17. Cao Y., Wu Y., Lin L., Yang L., Peng X., Chen L. Identifying key genes and functionally enriched pathways in Th2-high asthma by weighted gene co-expression network analysis. BMC Medical Genomics. 2022;15(1):p. 110. doi: 10.1186/s12920-022-01241-9.
18. Gu W., Ming T., Xie Z. Developing a genetic biomarker-based diagnostic model for major depressive disorder using random forests and artificial neural networks. Combinatorial Chemistry & High Throughput Screening. 2022;25. doi: 10.2174/1386207325666220404123433.
19. Kawakami E., Tabata J., Yanaihara N., et al. Application of artificial intelligence for preoperative diagnostic and prognostic prediction in epithelial ovarian cancer based on blood biomarkers. Clinical Cancer Research. 2019;25(10):3006–3015. doi: 10.1158/1078-0432.Ccr-18-3378.
20. Li H., Lai L., Shen J. Development of a susceptibility gene based novel predictive model for the diagnosis of ulcerative colitis using random forest and artificial neural network. Aging (Albany NY). 2020;12(20):20471–20482. doi: 10.18632/aging.103861.
21. Zhao D., Zhang Z., Wang Z., et al. Diagnosis and prediction of endometrial carcinoma using machine learning and artificial neural networks based on public databases. Genes. 2022;13(6):p. 935. doi: 10.3390/genes13060935.
22. Nicodemus-Johnson J., Myers R. A., Sakabe N. J., et al. DNA methylation in lung cells is associated with asthma endotypes and genetic risk. JCI Insight. 2016;1(20), article e90151. doi: 10.1172/jci.insight.90151.
23. Yang I. V., Pedersen B. S., Liu A., et al. DNA methylation and childhood asthma in the inner city. The Journal of Allergy and Clinical Immunology. 2015;136(1):69–80. doi: 10.1016/j.jaci.2015.01.025.
24. Ruzzin J., Petersen R., Meugnier E., et al. Persistent organic pollutant exposure leads to insulin resistance syndrome. Environmental Health Perspectives. 2010;118(4):465–471. doi: 10.1289/ehp.0901321.
25. Zafon C., Gil J., Pérez-González B., Jordà M. DNA methylation in thyroid cancer. Endocrine-Related Cancer. 2019;26:R415–R439. doi: 10.1530/erc-19-0093.
26. Pan Y., Liu G., Zhou F., Su B., Li Y. DNA methylation profiles in cancer diagnosis and therapeutics. Clinical and Experimental Medicine. 2018;18(1):1–14. doi: 10.1007/s10238-017-0467-0.
27. Morgan A. E., Davies T. J., Mc Auley M. T. The role of DNA methylation in ageing and cancer. The Proceedings of the Nutrition Society. 2018;77(4):412–422. doi: 10.1017/s0029665118000150.
28. Clifford R. L., Yang C. X., Fishbane N., et al. TWIST1 DNA methylation is a cell marker of airway and parenchymal lung fibroblasts that are differentially methylated in asthma. Clinical Epigenetics. 2020;12(1):p. 145. doi: 10.1186/s13148-020-00931-4.
29. Guo Y., Yuan X., Hong L., et al. Promotor hypomethylation mediated upregulation of miR-23b-3p targets PTEN to promote bronchial epithelial-mesenchymal transition in chronic asthma. Frontiers in Immunology. 2021;12, article 771216. doi: 10.3389/fimmu.2021.771216.
30. Kothalawala D. M., Murray C. S., Simpson A., et al. Development of childhood asthma prediction models using machine learning approaches. Clinical and Translational Allergy. 2021;11(9), article e12076. doi: 10.1002/clt2.12076.
31. Zein J. G., Wu C. P., Attaway A. H., Zhang P., Nazha A. Novel machine learning can predict acute asthma exacerbation. Chest. 2021;159(5):1747–1757. doi: 10.1016/j.chest.2020.12.051.
32. Kaplan A., Cao H., FitzGerald J., et al. Artificial intelligence/machine learning in respiratory medicine and potential role in asthma and COPD diagnosis. The Journal of Allergy and Clinical Immunology: In Practice. 2021;9(6):2255–2261. doi: 10.1016/j.jaip.2021.02.014.
33. Svitkina T. M. Ultrastructure of the actin cytoskeleton. Current Opinion in Cell Biology. 2018;54:1–8. doi: 10.1016/j.ceb.2018.02.007.
34. Zhao J., Manuchehrfar F., Liang J. Cell-substrate mechanics guide collective cell migration through intercellular adhesion: a dynamic finite element cellular model. Biomechanics and Modeling in Mechanobiology. 2020;19(5):1781–1796. doi: 10.1007/s10237-020-01308-5.
35. Wu Z., Zhu M., Kang Y., et al. Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets. Briefings in Bioinformatics. 2021;22(4). doi: 10.1093/bib/bbaa321.
36. Wang J. Prediction of postoperative recovery in patients with acoustic neuroma using machine learning and SMOTE-ENN techniques. Mathematical Biosciences and Engineering: MBE. 2022;19(10):10407–10423. doi: 10.3934/mbe.2022487.



Articles from Computational and Mathematical Methods in Medicine are provided here courtesy of Wiley
