Optimised deep neural network model to predict asthma exacerbation based on personalised weather triggers

Radiah Haque; Sin-Ban Ho; Ian Chai; Adina Abdullah

doi:10.12688/f1000research.73026.1

F1000Res. 2021 Sep 10;10:911. [Version 1] doi: 10.12688/f1000research.73026.1

PMCID: PMC8543171 PMID: 34745565

Abstract

Background – Recently, there have been attempts to develop mHealth applications for asthma self-management. However, there is a lack of applications that can offer accurate predictions of asthma exacerbation using weather triggers and demographic characteristics to give tailored responses to users. This paper proposes an optimised Deep Neural Network Regression (DNNR) model to predict asthma exacerbation based on personalised weather triggers.

Methods – With the aim of integrating weather, demography, and asthma tracking, an mHealth application was developed where users conduct the Asthma Control Test (ACT) to identify the chances of their asthma exacerbation. The asthma dataset consists of panel data from 10 users that includes 1010 ACT scores as the target output. Moreover, the dataset contains 10 input features which include five weather features (temperature, humidity, air-pressure, UV-index, wind-speed) and five demography features (age, gender, outdoor-job, outdoor-activities, location).

Results – Using the DNNR model on the asthma dataset, a score of 0.83 was achieved with Mean Absolute Error (MAE)=1.44 and Mean Squared Error (MSE)=3.62. It was recognised that, for effective asthma self-management, the prediction errors must be in the acceptable loss range (error<0.5). Therefore, an optimisation process was proposed to reduce the error rates and increase the accuracy by applying standardisation and fragmented-grid-search. Consequently, the optimised-DNNR model (with 2 hidden-layers and 50 hidden-nodes) using the Adam optimiser achieved a 94% accuracy with MAE=0.20 and MSE=0.09.

Conclusions – This study is the first of its kind to recognise the potential of DNNR to identify the correlation patterns among asthma, weather, and demographic variables. The optimised-DNNR model provides predictions with a significantly higher accuracy rate than the existing predictive models while using less computing time. Thus, the optimisation process is useful for building an enhanced model that can be integrated into mHealth applications for asthma self-management.

Keywords: Machine learning, deep neural network, personalisation, asthma self-management

Introduction

Asthma is a chronic lung disease that affects people of all age groups around the world. ¹ Asthma exacerbation causes asthma attacks, and the frequency of asthma attacks depends on the exposure to asthma triggers. ² Weather is a common triggering factor of asthma exacerbation. ³ Studies show that weather triggers, such as temperature, humidity, air pressure, and wind, cause asthma attacks. ⁴⁻⁶ Weather impact is specific to individual asthmatic patients due to their lung performance, which varies among patients. This depends on their demographic characteristics, such as age and gender. ⁷ Geographical location is also a factor because the association between weather triggers and asthma is inconsistent in different climate regions. ⁴

Although asthma cannot be cured, avoiding exposure to weather triggers through asthma self-management can minimise the risk of asthma exacerbation. ⁸ Recently, there have been attempts to develop mHealth applications to assist asthma self-management. However, until now, no application for effective asthma self-management exists that has been widely adopted by users or integrated into primary asthma care records. ² This is because there is a lack of solutions that can offer accurate predictions of asthma exacerbation based on personalised weather triggers and provide tailored feedback to users.

Deep Neural Network (DNN) is a type of neural network algorithm with multiple hidden layers and several nodes. ⁹ In recent years, DNN has been significantly utilised in the health informatics research domain for forecasting and pattern recognition. ¹⁰⁻¹³ This is because DNN models tend to learn more effectively and have better performance in providing accurate predictions (especially through optimisation) than traditional Machine Learning (ML) algorithms. ¹⁴ Nevertheless, the application of ML and DNN in weather-based healthcare is still in its infancy. In fact, to the best of the authors’ knowledge, none of the existing research has applied DNN to predict asthma exacerbation based on demography and weather. Therefore, the main contribution of the work in this paper is to apply DNN and propose an optimisation process to predict asthma exacerbation based on personalised weather triggers with low error and high accuracy. The findings will be helpful for developing mHealth solutions with personalisation for effective asthma self-management.

Methods

Data collection

With the aim of integrating weather, demography, and asthma tracking, an mHealth application, namely Weather Asthma (WEA), was developed for this study. ¹⁵ WEA is an Android-based application that collects user demography and monitors daily weather forecasts at each user’s location to identify the potential weather triggers. Consequently, both demography and weather data are selected as input features in the asthma dataset.

The WEA application also allows users to conduct the Asthma Control Test (ACT). ¹⁶ The ACT is a self-administered survey which is considered the standard assessment for monitoring chronic asthma and recommended by the Global Initiative for Asthma. ¹⁷ The ACT score is selected as the target output for prediction because it helps identify the severity and chances of asthma exacerbation. ¹⁶

Data was collected through the WEA application from ten participants with asthma over a period of one year. Participants conducted ACTs by regularly answering five multiple-choice questions, which include four asthma symptom-related questions and one asthma self-evaluation question. Each question is scored from 1 to 5. Once the ACT was submitted, a timestamp was formed with the participant’s demography and the weather information of that day and time at their location. This timestamp, along with the total ACT score, is stored in the database, as seen in Table 1. All participants consented to the data collection, and ethical approval was obtained from the Multimedia University Research Ethics Committee (EA1532021).

Table 1. Data collection example.

Data name	Example
User ID	5QBEe959GPOxv3rDNYXZX
Timestamp	1607926290609
ACT score	20
Age	Above 50
Gender	Male
Outdoor job	Occasionally
Outdoor activities	Extremely likely
Smoking habit	No
Temperature	29.8°C
Humidity	70%
Air pressure	1009.0 hectopascals
UV index	Low (i.e. 0 to 2)
Wind speed	2.1 meters per second

Data pre-processing

The first step of data pre-processing is identifying the missing data in the dataset through a heatmap, illustrated in Figure 1, which visualises the locations of missing values. Fortunately, the selected dataset does not contain any missing or NaN values.

The second step is dropping irrelevant features including “User ID” and “Timestamp”. “Smoking habit” is also dropped because its correlation coefficient value with the target variable “ACT score” in the heatmap is close to zero. Table 2 represents the final dataset, which consists of 1010 records with ten input features and one output variable. Figure 2 shows the ACT scores’ distribution, which ranges from 12 to 21. Figure 3 illustrates a scatterplot and countplot for weather features, where a strong correlation can be observed between the weather features and “ACT score”.

Table 2. Input/output variables.

Name	Type	Description
Age	Categorical	Input demography
Gender	Categorical	Input demography
Location	Categorical	Input demography
Outdoor activities	Categorical	Input demography
Outdoor job	Categorical	Input demography
Temperature	Numerical	Input weather
Humidity	Numerical	Input weather
Air pressure	Numerical	Input weather
UV index	Categorical	Input weather
Wind speed	Numerical	Input weather
ACT score	Numerical	Target output

The third step is converting the categorical variables in the dataset to numeric representations using the label encoder. The fourth step is splitting the dataset into training (707 samples) and testing (303 samples) datasets.
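
The third and fourth steps can be illustrated with a minimal standard-library sketch. It is not the authors' code: the helper names `label_encode` and `split_indices`, the example category values, and the random seed are assumptions made for illustration. Only the record counts (1010 records, 707 for training, 303 for testing) come from the text above.

```python
import random

def label_encode(values):
    """Map each distinct category to an integer code, in sorted label order."""
    classes = sorted(set(values))
    code_of = {label: code for code, label in enumerate(classes)}
    return [code_of[v] for v in values], classes

def split_indices(n_records, n_test, seed=0):
    """Shuffle the record indices and reserve n_test of them for testing."""
    indices = list(range(n_records))
    random.Random(seed).shuffle(indices)
    return indices[n_test:], indices[:n_test]

# Hypothetical values for one categorical column (not taken from the dataset).
uv_index = ["Low", "Moderate", "High", "Low", "High"]
codes, classes = label_encode(uv_index)
print(classes)  # ['High', 'Low', 'Moderate']
print(codes)    # [1, 2, 0, 1, 0]

# 1010 records split into 707 training and 303 testing samples.
train_idx, test_idx = split_indices(n_records=1010, n_test=303)
print(len(train_idx), len(test_idx))  # 707 303
```

Sorted-order codes are one common convention for label encoding. They do not preserve a natural ordering such as Low < Moderate < High, which may matter for ordinal features.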

Regression with DNN

DNN can be modelled with various ML techniques, such as regression and classification. ¹⁸ Regression is responsible for modelling and characterising the relationship between the input features and the target output. Regression is applied to predict numerical values. ¹⁹ Hence, regression is used in this study to predict the ACT score, which is a numerical value.

Consequently, a DNN Regression (DNNR) model is applied on the dataset. In DNNR, the hidden layers are located between the input layer and the output layer, as seen in Figure 4. The hidden layers apply weights to input values and direct them via an activation function for the output values. ²⁰ The activation function assists in deriving distinguishing features that are required for the prediction. ¹⁰ This is particularly helpful to model the asthma dataset which contains multiple types of input features. The Rectified Linear Unit (ReLU) activation function is used because it provides nonlinear transformations for deep modelling. ⁹

The following are the main equations used for prediction using the DNNR model:

$$x = [x_{1}, x_{2}, \ldots, x_{m}]$$

$$w = [w_{1}, w_{2}, \ldots, w_{m}]$$

$$f(e) = \begin{cases} 0 & \text{for } e < 0 \\ e & \text{for } e \geq 0 \end{cases}$$

$$e = \sum_{j=1}^{m} x_{j} w_{j} + b$$

$$\hat{y} = f(e)$$

where x is the vector of input features, $\hat{y}$ is the predicted value, w is the vector of input weights, b is the bias (a constant used for adjustment), e is the weighted sum of the inputs plus the bias computed in the hidden layers, f(e) is the activation function, m is the number of input features, and j is the index over the input features (j = 1, …, m).
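
As a worked illustration of the equations above, the following sketch evaluates a single node: a weighted sum of the inputs plus the bias, passed through ReLU. The function names and the numeric values are hypothetical and chosen only so that the arithmetic is easy to follow.

```python
def relu(e):
    """f(e): 0 for e < 0, e otherwise."""
    return e if e >= 0 else 0.0

def node_output(x, w, b):
    """Compute e = sum_j x_j * w_j + b, then apply the activation f(e)."""
    e = sum(x_j * w_j for x_j, w_j in zip(x, w)) + b
    return relu(e)

x = [1.0, 2.0]  # two hypothetical input features

# e = 1.0*0.5 + 2.0*1.0 + 0.25 = 2.75, so f(e) = 2.75
print(node_output(x, w=[0.5, 1.0], b=0.25))   # 2.75

# e = 1.0*0.5 + 2.0*(-1.0) + 0.25 = -1.25, so f(e) = 0
print(node_output(x, w=[0.5, -1.0], b=0.25))  # 0.0
```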

Evaluation metrics

Evaluating the DNNR model is essential to determine its prediction error and accuracy, which can be achieved through Mean Absolute Error (MAE), Mean Squared Error (MSE), and Explained Variance Score (EVS). The MAE averages the absolute differences between the actual and the predicted values. The MSE averages the squared differences between the actual and the predicted values. The EVS computes the variance score which determines the accuracy of nonlinear regression models. ⁹ The following equations calculate the MAE, MSE and EVS of the DNNR model:

$$MAE = \frac{1}{n} \sum_{i=1}^{n} |\hat{y}_{i} - y_{i}|$$

$$MSE = \frac{1}{n} \sum_{i=1}^{n} (\hat{y}_{i} - y_{i})^{2}$$

$$EVS = 1 - \frac{v(\hat{y} - y)}{v(y)}$$

where y is the actual output values, $\hat{y}$ is the predicted values, n is the number of records in the dataset, v is the biased variance, and i is the index over the records (i = 1, …, n).
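
The three metrics can be checked by hand on a small example. The sketch below implements them directly from the definitions above using the biased variance (division by n). The four actual and predicted values are hypothetical and are not taken from the asthma dataset.

```python
def mean(values):
    return sum(values) / len(values)

def biased_variance(values):
    """Variance with division by n (not n - 1)."""
    m = mean(values)
    return sum((v - m) ** 2 for v in values) / len(values)

def mae(y, y_hat):
    return mean([abs(p - a) for a, p in zip(y, y_hat)])

def mse(y, y_hat):
    return mean([(p - a) ** 2 for a, p in zip(y, y_hat)])

def evs(y, y_hat):
    residuals = [p - a for a, p in zip(y, y_hat)]
    return 1 - biased_variance(residuals) / biased_variance(y)

y = [3.0, 5.0, 7.0, 9.0]       # hypothetical actual values
y_hat = [2.5, 5.0, 7.5, 9.0]   # hypothetical predictions

print(mae(y, y_hat))            # 0.25
print(mse(y, y_hat))            # 0.125
print(round(evs(y, y_hat), 3))  # 0.975
```

Here the residuals are -0.5, 0, 0.5 and 0, so their variance is 0.125, while the variance of y is 5, giving EVS = 1 - 0.125/5 = 0.975.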

Optimisation methods

Optimising the DNNR model is crucial for prediction with low error, high accuracy, and less computing time. This can be achieved by applying essential optimisation methods which include data scaling and parameter tuning. For data scaling, standardisation is used because it is beneficial for enhancing the performance of the DNNR model and its optimisation. ⁹ This happens by rescaling the input and the output values using the following equations:

$$s' = \frac{s - μ}{σ}$$

$$μ = \frac{1}{n} \sum_{i=1}^{n} s_{i}$$

$$σ = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (s_{i} - μ)^{2}}$$

where s is the input/output variables, s′ is the standardised input/output values, $μ$ is the mean of the input/output values, $σ$ is the standard deviation of the input/output values, n is the number of records in the dataset, and i is the index over the records (i = 1, …, n).
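
A minimal sketch of the standardisation equations is given below, using hypothetical humidity readings. The inverse mapping `unstandardise` is an assumption added for illustration: because the output values are also rescaled, predictions would need to be mapped back before they can be compared on the original ACT scale, but the text does not describe how this was done.

```python
import math

def standardise(values):
    """Return (s - mu) / sigma for each value, plus mu and sigma (population form)."""
    n = len(values)
    mu = sum(values) / n
    sigma = math.sqrt(sum((s - mu) ** 2 for s in values) / n)
    # A constant column would give sigma == 0 and needs separate handling.
    return [(s - mu) / sigma for s in values], mu, sigma

def unstandardise(scaled, mu, sigma):
    """Map standardised values back to the original scale (assumed step)."""
    return [z * sigma + mu for z in scaled]

# Hypothetical humidity readings in percent (not taken from the dataset).
humidity = [62.0, 64.0, 64.0, 64.0, 65.0, 65.0, 67.0, 69.0]
scaled, mu, sigma = standardise(humidity)

print(mu, sigma)  # 65.0 2.0
print(scaled)     # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
print(unstandardise(scaled, mu, sigma) == humidity)  # True
```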

The DNNR parameters include hidden layers, nodes at each hidden layer, batch size, epochs, weight initialiser, loss function, and optimiser. Grid-search is an optimisation algorithm which automates the trial procedure of tuning these parameters and selecting their best values. ²¹ Nevertheless, tuning a large number of parameters and their search values using grid-search leads to excessive computational time and power. In this study, the fragmented-grid-search method is used where parameters are tuned independently in parallel, hence taking less computing time for optimisation. Figure 5 demonstrates the optimisation algorithm and Figure 6 illustrates the overall optimisation process.
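
The saving in computing time can be illustrated by counting model evaluations. The sketch below is one possible reading of fragmented grid search, not the algorithm of Figure 5: each parameter is tuned on its own while the others stay at default values. The search space, the defaults, and the `toy_loss` function (a stand-in for training the model and returning its validation loss) are all hypothetical. The fragments are evaluated sequentially here, although they are independent of each other and could be run in parallel.

```python
from math import prod

def full_grid_size(grid):
    """Number of evaluations a standard grid search would need."""
    return prod(len(values) for values in grid.values())

def fragmented_grid_search(evaluate, grid, defaults):
    """Tune each parameter independently; the others keep their defaults."""
    best = {}
    n_evaluations = 0
    for name, candidates in grid.items():
        scores = {}
        for value in candidates:
            params = {**defaults, name: value}
            scores[value] = evaluate(params)
            n_evaluations += 1
        best[name] = min(scores, key=scores.get)  # lowest loss wins
    return best, n_evaluations

# Hypothetical search space and defaults.
grid = {
    "hidden_layers": [1, 2, 3],
    "nodes": [10, 50, 100],
    "batch_size": [10, 20, 40],
    "epochs": [50, 100, 200],
}
defaults = {"hidden_layers": 1, "nodes": 10, "batch_size": 20, "epochs": 50}

def toy_loss(params):
    """Stand-in for 'train the model and return the validation loss'."""
    target = {"hidden_layers": 2, "nodes": 50, "batch_size": 10, "epochs": 100}
    return sum(abs(params[k] - target[k]) for k in target)

best, n_evaluations = fragmented_grid_search(toy_loss, grid, defaults)
print(best)
print(n_evaluations, "evaluations instead of", full_grid_size(grid))  # 12 instead of 81
```

The number of evaluations grows with the sum of the candidate counts rather than their product. The trade-off is that independent tuning cannot detect interactions between parameters, so it recovers the best combination only when the parameters act roughly independently, as they do in this toy loss.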

Results

Regression results

Using the DNNR model on the dataset, a score of 0.83 is achieved with MAE = 1.44 and MSE = 3.62. Table 3 shows 5 predicted values against their actual values and Figure 7 contains the residual visualisation. It can be seen that the differences between the predicted and the actual values vary up to ±15. While this might seem an acceptable prediction error for some datasets, in the case of the asthma dataset, this amount of loss is unacceptable. This is because the range of the ACT score can only be from 5 to 25, where scores of 5 to 15 are categorised as “poorly-controlled asthma”, 16 to 19 as “not well-controlled asthma”, and 20 to 25 as “well-controlled asthma”. ¹⁷

Table 3. Actual vs. predicted values.

Actual values	Group	Predicted values	Group
12	Poorly-controlled	16	Not well-controlled
25	Well-controlled	19	Not well-controlled
17	Not well-controlled	22	Well-controlled
21	Well-controlled	18	Not well-controlled
19	Not well-controlled	20	Well-controlled

In the last row of Table 3, with the actual value of 19 (not well-controlled), the predicted value is 20 (well-controlled), which gives a contradictory prediction result. This can be a serious problem while providing tailored feedback to asthmatic patients, resulting in an insufficiently effective asthma self-management solution. For an optimised model, the acceptable loss range needs to be less than ±0.5. For example, with the actual value of 19, the prediction value can be at most 19.4≃19 (with maximum +0.4 loss) or at least 18.6≃19 (with maximum −0.4 loss). Therefore, an optimised-DNNR model is built to reduce the prediction error and increase the overall accuracy.
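
The grouping rule and the ±0.5 criterion can be expressed in a few lines. In the sketch below, the group boundaries and the five value pairs come from the text and Table 3, while the function names `act_group` and `is_acceptable` are illustrative.

```python
def act_group(score):
    """Map an ACT score (5 to 25) to its asthma control group."""
    if not 5 <= score <= 25:
        raise ValueError("ACT scores range from 5 to 25")
    if score <= 15:
        return "Poorly-controlled"
    if score <= 19:
        return "Not well-controlled"
    return "Well-controlled"

def is_acceptable(actual, predicted, tolerance=0.5):
    """True when the absolute prediction error is below the tolerance."""
    return abs(predicted - actual) < tolerance

# (actual, predicted) pairs from Table 3.
table_3 = [(12, 16), (25, 19), (17, 22), (21, 18), (19, 20)]
for actual, predicted in table_3:
    print(actual, act_group(actual), "|", predicted, act_group(predicted),
          "|", is_acceptable(actual, predicted))

# The example from the text: 19.4 and 18.6 both round to the actual value 19.
print(is_acceptable(19, 19.4), is_acceptable(19, 18.6))  # True True
```

None of the five pairs in Table 3 meets the criterion, and in each pair the predicted group differs from the actual group.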

Optimisation results

For the optimised-DNNR model, two hidden layers are used with 50 nodes at each hidden layer. Adaptive Moment Estimation (Adam) is used as the optimiser, which is helpful for optimising the learning and convergence rates during model training. ¹³ Table 4 summarises the optimum parameter values obtained using fragmented-grid-search and the total tuning time. Figure 8 shows that the loss on the training and the testing datasets decreased swiftly using the ReLU activation function.

Table 4. Optimum parameters.

No. of hidden layers	2
No. of nodes	50
Batch size	10
No. of epochs	100
Loss function	MSE
Optimiser	Adam
Weight initialiser	Normal
Total tuning time	26 minutes
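
To give a sense of the model size implied by Table 4, the following sketch counts the trainable parameters and the weight updates per training run. It rests on assumptions not stated in the text: fully connected layers, one input unit per feature (10 inputs), a single output node, training on the 707 training samples, and a final partial batch that is kept rather than dropped.

```python
import math

def dense_parameter_count(layer_sizes):
    """Weights plus biases of a fully connected network."""
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))

# 10 inputs -> 50 nodes -> 50 nodes -> 1 output (assumed layout).
layer_sizes = [10, 50, 50, 1]
n_parameters = dense_parameter_count(layer_sizes)
print(n_parameters)  # 3151 = 550 + 2550 + 51

# Batch size 10 and 100 epochs from Table 4; 707 training samples.
batches_per_epoch = math.ceil(707 / 10)
n_updates = batches_per_epoch * 100
print(batches_per_epoch, n_updates)  # 71 7100
```

Under these assumptions the network has about three thousand parameters, which is small by deep learning standards.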

Using the optimised-DNNR model on the dataset, a score of 0.91 was achieved with a total accuracy of around 94%. The MAE and the MSE rates are 0.20 and 0.09 respectively, which are in the acceptable loss range (error < 0.5). Figure 9 illustrates the residual plot of the optimised-DNNR model which shows a strong correlation between the predicted and the actual values. Figure 10 confirms that the optimised-DNNR model provides predictions within the loss range ±0.5.

Discussion and conclusion

The recent popularity of mHealth and DNN has enabled the development of solutions that collect data from asthmatic patients and provide accurate predictive alerts. Although several studies support the association between weather and asthma, there is a lack of solutions for effective asthma self-management that can predict asthma exacerbation based on personalised weather triggers. This is due to three problems:

1. Limited availability of real-time weather data that can link weather triggers with demography and asthma severity for individual asthmatic patients. This study obtained the dataset from the WEA application which comprises relevant input features (weather and demography) and target output (asthma severity).
2. Existence of nonlinear relationships in the asthma dataset due to multiple types of input features and interconnected correlations. This study applied DNN for modelling the dataset, which effectively handles nonlinearity by using the ReLU activation function.
3. Lack of accurate predictive models and precautionary frameworks for effective asthma self-management. This study built an optimised model that provides accurate predictions of asthma exacerbation with errors in the acceptable loss range (error < 0.5).

The experimental results reveal that the standardisation technique improves the stability of the DNNR model, which enhances the performance of the optimisation algorithm and the optimiser. Furthermore, the fragmented-grid-search method is able to tune several parameters with much less computing time (≈26 minutes) than the standard grid-search used in previous studies (e.g. ≈4.3 hours for tuning 2 parameters ²²). Moreover, model training takes less than one minute due to the Adam optimiser, which helps the model converge efficiently. Overall, the optimised-DNNR model provides predictions with a significantly higher accuracy rate (94%) than the existing ML models in the literature for predicting asthma exacerbation (e.g. 87% with naïve Bayes, ² 85% through logistic regression, ⁸ and 84% using random forest ²³).

Consequently, the optimisation process helps build an enhanced model for effective asthma self-management. Subsequently, the optimised model will be integrated into the WEA application for predicting asthma exacerbation based on personalised weather triggers and providing tailored feedback to users. The main limitation of this study is that the data was collected from a limited number of users and in one climate region. In future, more users from different climate regions will be considered for testing the generalisation capability of the proposed model.

Author contribution

RH and SBH conducted the research, analysed the data, and wrote the paper. IC and AA improved and edited the paper. All authors have approved the final version.

Acknowledgements

The authors would like to thank all the participants of this study, who dedicated their time and effort to use the Weather Asthma application and conduct the Asthma Control Tests.

Funding Statement

The author(s) declared that no grants were involved in supporting this work.

[version 1; peer review: 3 approved]

Software and data availability

Zenodo. Dataset and source code for the research paper titled: “Optimised deep neural network model to predict asthma exacerbation based on personalised weather triggers”. DOI: https://doi.org/10.5281/zenodo.5271780. ²⁴

Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC BY 4.0 Public domain dedication).

References

1. Seo S, Kho H, Kim K: Predicting asthma attacks: Effects of indoor pm concentrations on peak expiratory flow rates of asthmatic children. Chest. 2020;157(6):A8. 10.1016/j.chest.2020.05.009
2. Tsang KCH, Pinnock H, Wilson AM, et al.: Application of machine learning to support self-management of asthma with mHealth. Proc. 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). Montreal, QC, Canada; 2020, pp.5673–5677.
3. Bodaghkhani E, Mahdavian M, MacLellan C, et al.: Effects of meteorological factors on hospitalizations in adult patients with asthma: A systematic review. Canadian Resp J. 2019;9:1–11. 10.1155/2019/3435103
4. Razavi-Termeh S, Sadeghi-Niaraki A, Choi S: Asthma-prone areas modeling using a machine learning model. Sci Rep. 2021;11(1). 10.1038/s41598-021-81147-1
5. Sharma A, Saini S, Chhabra P, et al.: Air pollution and weather as the determinants of acute attacks of asthma: Spatiotemporal approach. Indian J Public Health. 2020;64(2):124–129. 10.4103/ijph.IJPH_135_19
6. Poole J, Barnes CS, Demain JG, et al.: Impact of weather and climate change with indoor and outdoor air quality in asthma: A work group report of the AAAAI environmental exposure and respiratory health committee. J Allergy Clin Immunol. 2019;143(5):1702–1710. 10.1016/j.jaci.2019.02.018
7. Do Q, Doig A, Son T, et al.: Predicting lung healthiness risk scores to identify probability of an asthma attack. Procedia Computer Sci. 2019;160:424–431. 10.1016/j.procs.2019.11.071
8. Zhang H, Liu S, Chen Z, et al.: Predicting asthma attacks: Effects of variations in meteorological factors on daily hospital visits for asthma: A time-series study. Environ Res. 2020;182:109115. 10.1371/journal.pone.0102475
9. Zhang A, Lipton Z, Li M, et al.: Deep neural networks. Dive into Deep Learning, Chapter. 2021;3:87–126.
10. Mukherjee H, Ghosh S, Dhar A, et al.: Deep neural network to detect COVID-19: one architecture for both CT Scans and Chest X-rays. Appl. Intel. 2021. 10.1007/s10489-020-01943-6
11. Gadekallu T, Khare N, Bhattacharya S, et al.: Deep neural networks to predict diabetic retinopathy. J Ambient Intelligence Humanized Computing. 2020. 10.1007/s12652-020-01963-7
12. Koshimizu H, Kojima R, Kario K, et al.: Prediction of blood pressure variability using deep neural networks. Int J Med Informatics. 2020;136:104067. 10.1016/j.ijmedinf.2019.104067
13. Suresha H, Parthasarathy S: Alzheimer Disease Detection Based on Deep Neural Network with Rectified Adam Optimization Technique using MRI Analysis. Third International Conference on Advances in Electronics, Computers and Communications (ICAECC). 2020;1–6. 10.1109/ICAECC50550.2020.9339504
14. Pandey S, Janghel R: Recent deep learning techniques, challenges and its applications for medical healthcare system: A review. Neural Processing Letters. 2019;50(2):1907–1935. 10.1007/s11063-018-09976-2
15. Ho SB, Haque R, Chai I, et al.: Integrating mobile devices with cohort analysis into personalised weather-based healthcare. Lecture Notes Computer Sci. 2020;12496:606–618. 10.1007/978-3-030-63007-2_47
16. Haque R, Ho SB, Chai I, et al.: Intelligent health informatics with personalisation in weather-based healthcare using machine learning. Lecture Notes Data Engineering Communications Technologies. 2021;72:29–40. 10.1007/978-3-030-70713-2_4
17. Global Initiative for Asthma: Global strategy for asthma management and prevention. Accessed May 23, 2021.
18. Subasi A: Practical Machine Learning for Data Analysis using Python. Academic Press; 2020; pp.91–202.
19. Manasa J, Gupta R, Narahari NS: Machine learning based predicting house prices using regression techniques. Proc. 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA), Bangalore, India. 2020, pp.624–630. 10.1109/ICIMIA48430.2020.9074952
20. DeepAI: Hidden Layer. Accessed May 23, 2021.
21. Erten G, Keser S, Yavuz M: Grid search optimised artificial neural network for open stope stability prediction. Int J Mining, Reclamation Environment. 2021. 10.1080/17480930.2021.1899404
22. Liashchynskyi P, Liashchynskyi P: Grid search, random search, genetic algorithm: A big comparison for NAS. arXiv:1912.06059. 2019.
23. Kocsis O, Arvanitis G, Lalos A, et al.: Assessing machine learning algorithms for self-management of asthma. E-Health Bioengineering Conf (EHB). 2017;571–574. 10.1109/EHB.2017.7995488
24. 1191402606: 1191402606/Optimised-deep-neural-network-model: Dataset and source code (v1.0.0). Zenodo. 2021. 10.5281/zenodo.5271780

F1000Res. 2021 Nov 2. doi: 10.5256/f1000research.76644.r95805

Reviewer response for version 1

Kwan-Yong Sim ¹

The main contribution of the paper is to identify the correlation patterns among asthma, weather, and demographic variables. Optimization of the DNNR model by applying standardization and fragmented grid search successfully reduced the error rate below 0.5 required for effective asthma self-management.

Overall the paper is technically sound and made clear contributions to asthma self-management and improving the stability of the DNNR model proposed.

The authors should include the following discussion to make the paper more cohesive:

As the study was conducted based on limited data collected from 10 users, the author should discuss the potential impact if a larger dataset is being used, including from different climate regions or different demographics.
"Smoking habit" was mentioned as one of the features in data pre-processing. The author should explain what other features were included in the data set (if any) and why they were included in the first place.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Data science, software testing

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2021 Oct 22. doi: 10.5256/f1000research.76644.r94416

Reviewer response for version 1

Meei Hao Hoo ¹

The novelty of this study is to improve the accuracy of asthma prediction by using an optimised deep neural network, which contributes to enhancing the current model in terms of accuracy and computational performance. Most decisions in the research work are justifiable with the support of references to previous related works. The main results are explained with regard to the use of the optimizer in building the model.

Overall, the paper has a well-structured layout. However, the content of the methods section can be improved to give a better flow of understanding, either by adding an overall methodology that describes the steps taken in the methods, or by arranging and explaining the methods so that they follow the optimization process in Figure 6.

I also suggest including relevant works to support the choice of the Adam optimizer over other optimizers, as well as the use of the ReLU activation function over Mish (for example) and others, in order to build strong arguments.

Is the work clearly and accurately presented and does it cite the current literature?

Partly

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Usability engineering, persuasive design and data mining.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2021 Oct 7. doi: 10.5256/f1000research.76644.r95806

Reviewer response for version 1

Mark Elshaw ¹

This is an interesting paper, which is timely. There is suitable use of deep learning; however, there is a need to consider whether the problem is big data. It is good that there is some consideration of preprocessing. The paper follows a clear scientific process. There is a need to clearly outline why this paper is novel and to offer greater discussion of the future direction of the project.

There is some indication of the performance achieved, but I would have liked to see greater statistical analysis. There is a need for consideration of the state of the art and how the approach in this paper compares with the state of the art.

Overall a very well written and well presented paper that would benefit a little from greater analysis of the achievements. The analysis that you might consider using is the f-score, false positive and false negatives, and the use of ROC curves. The developed approach would be of clear benefit to asthma patients.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Partly

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Partly

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

Machine learning and robotics.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC BY 4.0 Public domain dedication).

[ref1] 1. Seo S, Kho H, Kim K: Predicting asthma attacks: Effects of indoor pm concentrations on peak expiratory flow rates of asthmatic children. Chest. 2020;157(6):A8. 10.1016/j.chest.2020.05.009 [DOI] [Google Scholar]

[ref2] 2. Tsang KCH, Pinnock H, Wilson AM, et al. : Application of machine learning to support self-management of asthma with mHealth. Proc. 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). Montreal, QC, Canada;2020, pp.5673–5677. [DOI] [PubMed] [Google Scholar]

[ref3] 3. Bodaghkhani E, Mahdavian M, MacLellan C, et al. : Effects of meteorological factors on hospitalizations in adult patients with asthma: A systematic review. Canadian Resp J. 2019;9:1–11. 10.1155/2019/3435103 [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref4] 4. Razavi-Termeh S, Sadeghi-Niaraki A, Choi S: Asthma-prone areas modeling using a machine learning model. Sci Rep. 2021;11(1). 10.1038/s41598-021-81147-1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref5] 5. Sharma A, Saini S, Chhabra P, et al. : Air pollution and weather as the determinants of acute attacks of asthma: Spatiotemporal approach. Indian J Public Health .2020;64(2):124–129. 10.4103/ijph.IJPH_135_19 [DOI] [PubMed] [Google Scholar]

[ref6] 6. Poole J, Barnes CS, Demain JG, et al. : Impact of weather and climate change with indoor and outdoor air quality in asthma: A work group report of the AAAAI environmental exposure and respiratory health committee. J Allergy Clin Immunol .2019;143(5):1702–1710. 10.1016/j.jaci.2019.02.018 [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref7] 7. Do Q, Doig A, Son T, et al. : Predicting lung healthiness risk scores to identify probability of an asthma attack. Procedia Computer Sci .2019;160:424–431. 10.1016/j.procs.2019.11.071 [DOI] [Google Scholar]

[ref8] 8. Zhang H, Liu S, Chen Z, et al.: Effects of variations in meteorological factors on daily hospital visits for asthma: A time-series study. Environ Res. 2020;182:109115. doi:10.1016/j.envres.2020.109115

[ref9] 9. Zhang A, Lipton Z, Li M, et al.: Deep neural networks. Dive into Deep Learning. 2021; Chapter 3: 87–126.

[ref10] 10. Mukherjee H, Ghosh S, Dhar A, et al.: Deep neural network to detect COVID-19: one architecture for both CT Scans and Chest X-rays. Appl. Intel. 2021. doi:10.1007/s10489-020-01943-6

[ref11] 11. Gadekallu T, Khare N, Bhattacharya S, et al.: Deep neural networks to predict diabetic retinopathy. J Ambient Intelligence Humanized Computing. 2020. doi:10.1007/s12652-020-01963-7

[ref12] 12. Koshimizu H, Kojima R, Kario K, et al.: Prediction of blood pressure variability using deep neural networks. Int J Med Informatics. 2020;136:104067. doi:10.1016/j.ijmedinf.2019.104067

[ref13] 13. Suresha H, Parthasarathy S: Alzheimer Disease Detection Based on Deep Neural Network with Rectified Adam Optimization Technique using MRI Analysis. Third International Conference on Advances in Electronics, Computers and Communications (ICAECC). 2020;1–6. doi:10.1109/ICAECC50550.2020.9339504

[ref14] 14. Pandey S, Janghel R: Recent deep learning techniques, challenges and its applications for medical healthcare system: A review. Neural Processing Letters. 2019;50(2):1907–1935. doi:10.1007/s11063-018-09976-2

[ref15] 15. Ho SB, Haque R, Chai I, et al.: Integrating mobile devices with cohort analysis into personalised weather-based healthcare. Lecture Notes Computer Sci. 2020;12496:606–618. doi:10.1007/978-3-030-63007-2_47

[ref16] 16. Haque R, Ho SB, Chai I, et al.: Intelligent health informatics with personalisation in weather-based healthcare using machine learning. Lecture Notes Data Engineering Communications Technologies. 2021;72:29–40. doi:10.1007/978-3-030-70713-2_4

[ref17] 17. Global Initiative for Asthma: Global strategy for asthma management and prevention. Accessed May 23, 2021.

[ref18] 18. Subasi A: Practical Machine Learning for Data Analysis using Python. Academic Press; 2020; pp.91–202.

[ref19] 19. Manasa J, Gupta R, Narahari NS: Machine learning based predicting house prices using regression techniques. Proc. 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA), Bangalore, India. 2020, pp.624–630. doi:10.1109/ICIMIA48430.2020.9074952

[ref20] 20. DeepAI: Hidden Layer. Accessed May 23, 2021.

[ref21] 21. Erten G, Keser S, Yavuz M: Grid search optimised artificial neural network for open stope stability prediction. Int J Mining, Reclamation Environment. 2021. doi:10.1080/17480930.2021.1899404

[ref22] 22. Liashchynskyi P, Liashchynskyi P: Grid search, random search, genetic algorithm: A big comparison for NAS. arXiv:1912.06059. 2019.

[ref23] 23. Kocsis O, Arvanitis G, Lalos A, et al.: Assessing machine learning algorithms for self-management of asthma. E-Health Bioengineering Conf (EHB). 2017;571–574. doi:10.1109/EHB.2017.7995488

[ref24] 24. 1191402606: 1191402606/Optimised-deep-neural-network-model: Dataset and source code (v1.0.0). Zenodo. 2021. doi:10.5281/zenodo.5271780
