Artificial intelligence applied to bed regulation in Rio Grande do Norte: Data analysis and application of machine learning on the “RegulaRN Leitos Gerais” platform

Tiago de Oliveira Barreto; Fernando Lucas de Oliveira Farias; Nicolas Vinícius Rodrigues Veras; Pablo Holanda Cardoso; Gleyson José Pinheiro Caldeira Silva; Chander de Oliveira Pinheiro; Maria Valéria Bezerra Medina; Felipe Ricardo dos Santos Fernandes; Ingridy Marina Pierre Barbalho; Lyane Ramalho Cortez; João Paulo Queiroz dos Santos; Antonio Higor Freire de Morais; Gustavo Fontoura de Souza; Guilherme Medeiros Machado; Márcia Jacyntha Nunes Rodrigues Lucena; Ricardo Alexsandro de Medeiros Valentim

doi:10.1371/journal.pone.0315379

. 2024 Dec 30;19(12):e0315379. doi: 10.1371/journal.pone.0315379

Artificial intelligence applied to bed regulation in Rio Grande do Norte: Data analysis and application of machine learning on the “RegulaRN Leitos Gerais” platform

Tiago de Oliveira Barreto ^1,^*, Fernando Lucas de Oliveira Farias ¹, Nicolas Vinícius Rodrigues Veras ^1,², Pablo Holanda Cardoso ^1,², Gleyson José Pinheiro Caldeira Silva ¹, Chander de Oliveira Pinheiro ³, Maria Valéria Bezerra Medina ³, Felipe Ricardo dos Santos Fernandes ¹, Ingridy Marina Pierre Barbalho ¹, Lyane Ramalho Cortez ^1,³, João Paulo Queiroz dos Santos ^1,², Antonio Higor Freire de Morais ^1,², Gustavo Fontoura de Souza ^1,², Guilherme Medeiros Machado ⁴, Márcia Jacyntha Nunes Rodrigues Lucena ⁵, Ricardo Alexsandro de Medeiros Valentim ¹

Editor: Luísa da Matta Machado Fernandes⁶

¹Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

²Advanced Nucleus of Technological Innovation (NAVI), Federal Institute of Rio Grande do Norte (IFRN), Natal, Rio Grande do Norte, Brazil

³Secretary of Public Health of Rio Grande do Norte, Natal, Rio Grande do Norte, Brazil

⁴LyRIDS, ECE-Engineering School, Paris, France

⁵Department of Informatics and Applied Mathematics, Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

⁶Fundacao Oswaldo Cruz Instituto Rene Rachou, BRAZIL

Competing Interests: The authors have declared that no competing interests exist.

^✉

* E-mail: tiago.barreto@lais.huol.ufrn.br

Roles

Tiago de Oliveira Barreto: Conceptualization, Data curation, Formal analysis, Methodology, Resources, Writing – original draft, Writing – review & editing

Fernando Lucas de Oliveira Farias: Conceptualization, Data curation, Investigation, Writing – original draft, Writing – review & editing

Nicolas Vinícius Rodrigues Veras: Data curation, Methodology, Resources, Writing – original draft, Writing – review & editing

Pablo Holanda Cardoso: Data curation, Methodology, Resources, Writing – original draft

Gleyson José Pinheiro Caldeira Silva: Investigation, Methodology, Visualization, Writing – original draft

Chander de Oliveira Pinheiro: Funding acquisition, Methodology, Writing – review & editing

Maria Valéria Bezerra Medina: Methodology, Visualization, Writing – review & editing

Felipe Ricardo dos Santos Fernandes: Conceptualization, Investigation, Methodology, Writing – review & editing

Ingridy Marina Pierre Barbalho: Investigation, Methodology, Visualization, Writing – review & editing

Lyane Ramalho Cortez: Funding acquisition, Methodology, Validation, Writing – review & editing

João Paulo Queiroz dos Santos: Methodology, Project administration, Writing – review & editing

Antonio Higor Freire de Morais: Conceptualization, Methodology, Resources, Writing – review & editing

Gustavo Fontoura de Souza: Data curation, Methodology, Writing – review & editing

Guilherme Medeiros Machado: Methodology, Validation, Writing – review & editing

Márcia Jacyntha Nunes Rodrigues Lucena: Investigation, Methodology, Writing – review & editing

Ricardo Alexsandro de Medeiros Valentim: Conceptualization, Investigation, Methodology, Writing – original draft, Writing – review & editing

Luísa da Matta Machado Fernandes: Editor

PMCID: PMC11684685 PMID: 39775276

Abstract

Bed regulation within Brazil’s National Health System (SUS) plays a crucial role in managing care for patients in need of hospitalization. In Rio Grande do Norte, Brazil, the RegulaRN Leitos Gerais platform was the information system developed to register requests for bed regulation for COVID-19 cases. However, the platform was expanded to cover a range of diseases that require hospitalization. This study explored different machine learning models in the RegulaRN database, from October 2021 to January 2024, totaling 47,056 regulations. From the data obtained, 12 features were selected from the 24 available. After that, blank and inconclusive data were removed, as well as the outcomes that had values other than discharge and death, rendering a binary classification. Data was also correlated, balanced, and divided into training and test portions for application in machine learning models. The results showed better accuracy (87.77%) and recall (87.77%) for the XGBoost model, and higher precision (87.85%) and F1-Score (87.56%) for the Random Forest and Gradient Boosting models, respectively. As for Specificity (82.94%) and ROC-AUC (82.13%), the Multilayer Perceptron with SGD optimizer obtained the highest scores. The results evidenced which models could adequately assist medical regulators during the decision-making process for bed regulation, enabling even more effective regulation and, consequently, greater availability of beds and a decrease in waiting time for patients.

Introduction

In Brazil, the hospital bed regulation process of the Brazilian Health System (SUS) plays a fundamental role in the management and distribution of care for patients requiring hospitalization [1, 2]. However, although the National Regulation Policy was instituted more than 15 years ago by Ordinance No. 1,559 of August 2008 [3] and consolidated by Ordinance No. 02 of September 2017 (Brazil, 2017) [4], many regions have difficulties in ensuring correct regulatory conduct.

In this context, in addition to organizational issues, the regulation system in Brazil still faces recurring problems such as the precariousness of hospital infrastructure, overcrowding in health units, an insufficient number of beds, difficulties in integration and communication between the entities involved in the regulatory process, greater transparency in processes and allocation of resources, in addition to not having efficient systems to help the regulation process [1, 5]. In Brazil, due to a non-mandatory recommendation from the Ministry of Health (MoH), the Regulation System—SISREG—is still used in many Brazilian states. This system was created in 2001 and is made available by the Brazilian Health System Informatics Department (DATASUS) [6]. Currently, this system is considered obsolete and inadequate, especially due to the lack of interoperability with the SUS technological ecosystem itself and the lack of transparency [7]. This is a legacy health information system which, although it is still used, is no longer able to play an effective role in the National Policy for the Regulation of Assistance in Access to Health Services in Brazil.

Until April 2020, the center for regulating access to health services in the state of Rio Grande do Norte did not have a platform for regulating hospital beds to systematically organize regulatory conduct within the scope of the SUS in the state. The regulatory flow control measures used were based on spreadsheets, e-mails and telephone communication, and messaging systems [8, 9].

Faced with the serious public health crisis caused by the COVID-19 pandemic, the government of the state of Rio Grande do Norte has set up technical-scientific cooperation between researchers in the field of digital health and the managers and formulators of public health policies at the State Secretariat of Public Health of RN (SESAP/RN). The aim of this technical-scientific cooperation was to formulate and implement a digital health solution that would make it possible to control and monitor the entire process of regulating hospital beds in all the state’s public hospitals online, on time and transparently, totaling 24 public hospitals, with more than 900 beds available.

Based on this technical-scientific cooperation, the RegulaRN Platform for COVID-19 was developed and implemented throughout the state of Rio Grande do Norte, whose initial objective was to monitor and control access to hospital beds in wards and intensive care units (ICUs) for the disease during the pandemic [2, 8, 9]. The state of Rio Grande Norte, which is located in the northeastern region of Brazil, currently has an area of 52,797 km² and a population of approximately 3.5 million inhabitants.

After the implementation of the RegulaRN Platform, it became necessary to expand the digital health solution to the other regulatory specialties. The system is currently responsible for regulating access to beds, vascular surgery, outpatient care, exams, and consultations. In this way, the RegulaRN Platform has become a unique digital health solution for the management of health regulation services in the state of Rio Grande do Norte, an important aspect because it has centralized and integrated, through international interoperability standards, the Health Data Network (RDS) with all the other technologies in the state’s public health ecosystem that are necessary for the process of regulating access to health services.

The health regulation process needs to be carried out in a rigorous, agile and transparent manner, as the incorrect conduct of a regulatory process in public health has intrinsic impacts on waiting times for access to hospital beds, as well as on hospitalization times, which can have negative impacts on the availability of hospital beds and increase the potential for existing problems [5, 10]. In this way, the inefficiency and ineffectiveness of this process can aggravate public health crisis situations, such as the COVID-19 pandemic, as it requires more rational use of health resources [8, 11–16]. Therefore, due to its complexity and the pressures that exist in all segments of the regulatory process, investment in intelligent computer systems can maximize the correct direction and assertive decision-making in healthcare systems [17–22].

Intelligent computer models have demonstrated significant potential in healthcare systems by reducing uncertainties and ambiguities in complex decision-making processes. For example, prior studies in similar healthcare contexts have shown that machine learning models can enhance hospital management by optimizing resource allocation and reducing patient waiting times [23–30]. This study aims to build on these findings to demonstrate the effectiveness of AI-based models specifically in bed regulation in Rio Grande do Norte.

In this context, the aim of this work is to analyze data from the RegulaRN Leitos Gerais Platform and use it to train and validate different machine learning models. Subsequently, to choose the most significant classification model capable of predicting the outcome of patients regulated by the RegulaRN Leitos Gerais platform with greater accuracy, precision, recall, specificity, F1 Score, and ROC-AUC. Furthermore, discuss the main impacts and potential of a digital health solution on the decision-making process of regulatory professionals.

Materials and methods

The methodological bias of this paper consists in two main steps: exploratory data analysis and applying the data to computer models. In the evaluation process, the data was extracted, evaluated, characterized, pre-processed, and correlated. For the application stage, concerning the computational models, four phases were taken into account: 1) definition of evaluation metrics; 2) data balancing and division into training and validation groups; 3) selecting the models for classification, and 4) hyperparameters to choose the best performing model; in line with Barreto et al [2].

Extraction, evaluation, characterization and pre-processing

This study used the database from the RegulaRN Leitos Gerais platform, a system adopted to manage the regulation flow of SUS beds in the state of Rio Grande do Norte. The database covers the period of October 2021 to January 2024, with 47.056 regulations in the two-state centers (Metropolitan and West). From this total, 1,868 regulations were removed because they were linked to newborn regulations, and these have different clinical assessment protocols when compared to adult and pediatric patients. The initial analysis therefore included 45,188 regulation requests. A more detailed descriptive analysis of the data is presented in the results section.

The initial data extraction included 24 features: a) date of request; b) occupancy type; c) case type; d) unified prioritization score (EUP); e) Sequential Organ Failure Assessment (SOFA) scale; f) type of hospital bed requested; g) admission date; h) type of input bed; i) discharge date; j) discharge bed type; k) national health card number; l) gender; m) patient’s municipality; n) patient’s federative unit; o) pregnant woman (yes or no); p) gestational period; q) age; r) regulator identification; s) outcome; t) requesting health unit; u) municipality of the requesting health unit; v) providing health unit; w) municipality of the providing health unit and x) ICD.

Thus, the features that were not associated with the patient’s clinical condition and do not show any impact in the final result, or that relate to the locality record, such as: date of request, national health card number, patient’s federative unit, patient’s municipality, regulator identification, requesting health unit, municipality of the requesting health unit, and municipality of the providing health unit. In addition, features with only one possible record or insufficient information were also removed: type of occupation, type of case, pregnant woman (yes or no), and gestational age.

Consequently, only 12 characteristics were selected, namely: EUP score, SOFA scale, type of hospital bed requested, admission date, admission bed type, discharge date, discharge bed type, gender, age, outcome, providing health unit, and ICD. Using the entry date and exit date features, it was possible to create the patient’s hospitalization time feature. As a result, 11 features were used in the classification process. Table 1 shows the description of all the data types extracted from RegulaRN Leitos Gerais.

Table 1. Description of database.

Data description
Field	Description
request date	Represents the date a bed request was registered.
type of occupation	Represents the type of occupation of the regulation request.
type of occupation	Represents the type of occupation of the regulation request.
case of type	Represents the results of tests for patients who were suspected of having Covid-19. The results could be: positive, negative, inconclusive, or null.
EUP	Represents the EUP value. The numerical scale ranges from 2 to 8 and is associated with the Charlson Comorbidity Index, Clinical Frailty Scale and simplified SOFA.
SOFA scale	Represents the patient’s prioritization value according to the values of this scale.
requested bed type	Represents the type of bed selected by the regulation center for a patient. The results could be: ward and uci.
entry date	Represents the date the patient was allocated to the health unit (hospital).
entry bed type	Represents the type of bed that the patient was allocated in the health unit (hospital). The results could be: ward and uci.
output date	Represents the date that the patient left the bed after the outcome.
output bed type	Represents the type of bed the patient was in before the outcome.
sus card number	Represents each patient’s SUS card number. Given by a 15-digit sequence.
sex	Represents the patient’s sex.
patient’s municipality	Indicates the patient’s city of residence.
patient’s federal unit	Is the acronym for the patient’s federal unit.
pregnant	Represents whether the patient is pregnant or not.
gestational age	Describe how far along the pregnancy is, measured in weeks.
age	Represents patient age.
regulator	Represents the regulator identification responsible for regulation.
outcome	Represents the outcome of the patient in bed. Possible values for this field are: discharge and death.
requesting health unit	Represents the unit health that solicits a bed for the patient.
municipality of the requesting health unit	Represents the municipality of the health unit that solicits the bed.
provider health unit	Represents the health unit that admits and accommodates the patient in the bed.
municipality of the provider health unit	Represents the municipality of the health unit that receives and accomodate the patient in the bed.
ICD	Represents the International Classification of Diseases for bed regulation.

Open in a new tab

After extracting the data, we evaluated the values contained in all the features and in order to guarantee the integrity of the analysis, the lines with blank data or inconclusive information were removed. In addition, the target column “outcome” contained six different values, namely: by discharge, by death, for other reasons, by stay, by delivery procedure, by transfer, etc. As these last four outcomes do not properly indicate a positive or negative closure of the regulation, as well as having a lower number of recurrences, around 7.151 regulations were removed. This maintains a binary classification (by discharge—positive, or by death—negative) for the computer models. Finally, 38.023 effective regulations were selected for application in the artificial intelligence models. Fig 1 shows the design used to process and select the data. Furthermore, in order to enable the reproducibility of this experiment, the final database used is available on the zenodo platform (https://zenodo.org/records/11387710).

Correlation between dataset features

The first task was to perform a pairwise correlation of the features. The objective is to identify features with greater or lesser correspondence with others. As many of these are categorical data, the phik correlation model was implemented in this analysis. Phik is abble to consistently correlate variables from several backgrounds, being categorical, ordinals and intervals a like, turning into a refinement of Pearson [31] hypothesis test.

Definition of evaluation metrics

The overall aim of the study is to classify hospital bed regulation data to predict a patient’s positive or negative outcome. Furthermore, it is important to investigate the models’ performance in situations where predictions are wrong, either due to a high number of false positives or false negatives. Thus, it is necessary to include not only accuracy, but also precision, recall, specificity, F1-Score, and ROC-AUC in a similar way to those found in the works of Iwendi et al [32], Aljameel et al [33] and Endo et al [34].

The accuracy consists in the set of data with correct predictions (true positive and true negative) divided by the sum of all predictions made by the model (true positive, true negative, false positive, false negative) (Eq 1):

\begin{matrix} A c c u r a c y = (T P + T N) / (T P + F P + F N + T N) \end{matrix}

(1)

Precision consists of dividing the true positive rate by the sum of the true positive and false positive rates (Eq 2).

\begin{matrix} P r e c i s i o n = T P / (T P + F P) \end{matrix}

(2)

Recall involves the rate of true positives divided by the rate of true positives plus false negatives (Eq 3).

\begin{matrix} R e c a l l = T P / (T P + F N) \end{matrix}

(3)

Specificity refers to the prediction of true negatives divided by the sum of true negatives and false positives (Eq 4).

\begin{matrix} S p e c i f i c i t y = T N / (T N + F P) \end{matrix}

(4)

The F1-score is the harmonic mean between the precision and recall. The formula involves the product of precision and recall divided by the sum of these metrics, multiplied by 2 (Eq 5).

\begin{matrix} F 1 S c o r e = 2 * (P r e c i s i o n * R e c a l l) / (P r e c i s i o n + R e c a l l) \end{matrix}

(5)

ROC-AUC can be obtained by recall divided by the complementary value of specificity (Eq 6).

\begin{matrix} R O C A U C = R e c a l l / (1 - S p e c i f i c i t y) \end{matrix}

(6)

Data balancing and splitting into training and validation data

The RegulaRN Leitos Gerais Platform database refers to real-world bed regulation data, in this sense, there is an unbalanced distribution of data when classified by outcome, 82.6% are discharges and 17.4% deaths. The use of an unbalanced database biases the machine learning classifiers, making the algorithms able to identify patterns from the predominant class much better than patterns from the minority class. To mitigate this problem, one of the most common techniques is SMOTE (Synthetic Minority Over-sampling), which works by increasing the number of data points in the minority class [35]. The SMOTE algorithm first identifies the minority class, then in the feature vector space identifies the k nearest neighbors of that class (k is usually equal to 5). Finally, a new instance of the minority class is generated by randomly selecting values in the vector space between an instance of the minority class and the nearest neighbors identified. This process is repeated until the database is completely balanced.

In addition, as for the division of training and validation data, the same segmentation was used as in other studies applying machine learning techniques that use a large volume of data [36–38]. Therefore, 80% of the data was directed to training and the others 20% for validation.

Definition of models for data classification

The selection of classification models was based on their proven ability to handle large volumes of imbalanced healthcare data [37, 39–41]. Decision tree was selected because it is one of the classic models that handles high volumes of data well and has wide application in problems in the health area [42]. Random Forest, on the other hand, was selected for its ability to manage complex decision trees and its resistance to overfitting, particularly in high-dimensional datasets [43]. Gradient Boosting and Adaboost due to their adaptability and efficient ability to capture non-linear relationships [44, 45]. XGBoost, for example, has been shown to perform well in healthcare settings due to its gradient boosting framework, which effectively handles missing data and provides robust performance on tabular datasets [46] and Multi-Layer Perceptron (MLP) has an architecture capable of modeling non-linear relationships and performing gradient learning, adjusting weights efficiently for larger volumes of data [47]. For the MLP models, two different paths were taken, the Stochastic Gradient Descent (SGD) was selected due to its performance and Adam because of its consistency in treating gradient explosion and fading problems [48]. These models were chosen for their complementarity in addressing the specific challenges of bed regulation data in this study.

Hyperparameters to define the best model

After the model selection, it was necessary to define the best combination of hyperparameters to enhance the evaluation metrics of each model. Thus, this section presents which hyperparameters were adopted and which methods were elaborated in the training and validation steps. It is worth mentioning that all computational model development in this research used python’s sckit-learn library [49].

For each selected model, hyperparameters were set aiming to boost the performance metrics. In this regard, the following hyperparameters were selected for Decision Tree: criterion, which measures the quality of node splitting; max depth of tree, which determines the maximum depth of the tree; min samples leaf, which represents the minimum number of samples needed in a leaf; and max features, which considers the maximum number of features analyzed to perform a split. For the Random Forest and Gradient Boosting models, the criterion, max depth of the tree and max features were also used, including the number estimators, which considers the number of trees in the forest. In the Adaboost model, the parameters number estimators, learning rate and algorithm were chosen. The learning rate refers to the learning weight at each iteration, while the algorithm relates to how the model can speed up the convergence of the classifier with the least possible error. For XGBoost: learning rate, number estimators, max depth and colsample by tree. This last hyperparameter is associated with the randomly selected fraction of resources that will be used to train each tree. Finally, for the MLP Adam and MLP SGD models, the following were used: hidden layer size, which represents the number of layers in the model; activation, which represents the model’s activation function and batch size, which represents the size of the minibatches that will be used to help the optimizers.

The grid GridSearchCV functionality, which allows all possible combinations of hyperparameters to be iterated, was applied during the training to find which parameters showed better results in the evaluated metrics [50, 51]. A proportional division of the training and test data was also carried out randomly using the cross validation attribute with a value of 10-folds in the GridSearchCV functionality, as a way of enhancing the model’s learning. In addition, the models were trained five times, similar to that developed by Ahsan et al [52], in order to determine the best set of hyperparameters more accurately. The details of the hyperparameters used and the respective values chosen for each model are shown in Table 2.

Table 2. Selection of hyperparameters and values for each model.

Models	Hyperparameter	Range and best values
Decision Tree	criterion	gini or entropy;
	max depth of the tree	[10, 50, 100];
	min samples leaf	range [1, 2, 3, 4];
	max features	[sqrt, log2].
Random Forest	criterion	gini or entropy;
	max depth of the tree	[10, 50, 100];
	number estimators	[100, 200, 400];
	max features	[sqrt, log2].
Gradient Boosting	criterion	friedman_mse or squared_error;
	max depth	[10, 50, 100];
	number estimators	[10, 50, 100];
	max features	[sqrt, log2].
Adaboost	learning rate	[0.1, 0.5, 1.0]
	number estimators	[100, 200, 400]
	algorithm	[samme, samme.r]
XGBoost	learning rate	[0.1, 0.5, 1.0]
	number estimators	[100, 200, 400]
	max depth	[10, 50, 100]
	colsample by tree	[0.1, 0.5, 1.0]
MLP SGD	hidden_layer_sizes	[5, 25, 70]
	activation	tanh or relu
	batch_size	[16, 32, 64]
MLP ADAM	hidden_layer_sizes	[5, 25, 70]
	activation	tanh or relu
	batch_size	[16, 32, 64]

Open in a new tab

Results

General data analysis

Considering the data profile from RegulaRN Leitos Gerais, between October 2021 and January 2024, it was possible to identify that most hospitalizations involve male adults, young people and children, in hospital beds and with lower EUP score and SOFA scale. The details of the values extracted from the database are presented in Table 3, classifying each of the characteristics based on their respective outcome.

Table 3. Data profile from the RegulaRN Leitos Gerais.

Features		Values	Outcomes
Age	≥ 60	18.641	Discharge: 13.549
	≥ 60	18.641	Death: 5.092
	< 60	19.382	Discharge: 17.860
	< 60	19.382	Death: 1.522
Sex	Masculine	20.235	Discharge: 16.941
	Masculine	20.235	Death: 3.384
	Feminine	17.698	Discharge: 14.468
	Feminine	17.698	Death: 3.230
EUP Score	2	17.572	Discharge:16.668
	2	17.572	Death: 904
	3	7.279	Discharge: 6.176
	3	7.279	Death: 1.103
	4	5.468	Discharge:4.113
	4	5.468	Death: 1.355
	5	5.485	Discharge: 3.543
	5	5.485	Death:1.942
	6	1.765	Discharge: 770
	6	1.765	Death: 995
	7	367	Discharge: 120
	7	367	Death: 247
	8	87	Discharge:19
	8	87	Death: 68
SOFA	1	29.699	Discharge: 26.070
	1	29.699	Death: 3.629
	2	6.816	Discharge: 4.652
	2	6.816	Death: 2.164
	3	1.179	Discharge: 598
	3	1.179	Death: 581
	4	329	Discharge: 89
	4	329	Death: 240
requested bed type	Ward	23.863	Discharge: 21.878
	Ward	23.863	Death: 1.985
	ICU	14.160	Discharge: 9.531
	ICU	14.160	Death: 4.629
entry bed type	Ward	23.858	Discharge: 21.851
	Ward	23.858	Death: 2.007
	ICU	14.165	Discharge: 9.558
	ICU	14.165	Death:4.607
output bed type	Ward	28.137	Discharge: 26.233
	Ward	28.137	Death: 1.904
	ICU	9.886	Discharge: 5.176
	ICU	9.886	Death: 4.710
Provider health unit	Health unit 1	1.234	Discharge: 1.061
	Health unit 1	1.234	Death: 173
	Health unit 2	794	Discharge: 554
	Health unit 2	794	Death: 240
	Health unit 3	299	Discharge: 213
	Health unit 3	299	Death:86
	Health unit 4	225	Discharge: 154
	Health unit 4	225	Death: 71
	Health unit 5	367	Discharge: 227
	Health unit 5	367	Death:140
	Health unit 6	2.104	Discharge: 1.722
	Health unit 6	2.104	Death: 382
	Health unit 7	2.865	Discharge: 2.341
	Health unit 7	2.865	Death: 524
	Health unit 8	2.331	Discharge: 1.971
	Health unit 8	2.331	Death: 360
	Health unit 9	112	Discharge: 104
	Health unit 9	112	Death: 8
	Health unit 10	8	Discharge: 7
	Health unit 10	8	Death: 1
	Health unit 11	2.138	Discharge: 2.094
	Health unit 11	2.138	Death: 44
	Health unit 12	703	Discharge: 625
	Health unit 12	703	Death: 78
	Health unit 13	101	Discharge: 56
	Health unit 13	101	Death: 45
	Health unit 14	11	Discharge: 11
	Health unit 14	11	Death: 0
	Health unit 15	422	Discharge: 225
	Health unit 15	422	Death:197
	Health unit 16	94	Discharge: 86
	Health unit 16	94	Death: 8
	Health unit 17	5	Discharge: 3
	Health unit 17	5	Death: 2
	Health unit 18	385	Discharge:233
	Health unit 18	385	Death:152
	Health unit 19	703	Discharge:703
	Health unit 19	703	Death:0
	Health unit 20	253	Discharge:211
	Health unit 20	253	Death:42
	Health unit 21	2.825	Discharge:2.646
	Health unit 21	2.825	Death:179
	Health unit 22	1.000	Discharge:778
	Health unit 22	1.000	Death:222
	Health unit 23	586	Discharge:373
	Health unit 23	586	Death:213
	Health unit 24	739	Discharge:608
	Health unit 24	739	Death:131
	Health unit 25	737	Discharge:631
	Health unit 25	737	Death:106
	Health unit 26	34	Discharge:23
	Health unit 26	34	Death:11
	Health unit 27	396	Discharge:173
	Health unit 27	396	Death:223
	Health unit 28	5.192	Discharge:4.320
	Health unit 28	5.192	Death:872
	Health unit 29	740	Discharge:662
	Health unit 29	740	Death:78
	Health unit 30	1.000	Discharge:850
	Health unit 30	1.000	Death:150
	Health unit 31	598	Discharge:438
	Health unit 31	598	Death:160
	Health unit 32	2.183	Discharge:1.590
	Health unit 32	2.183	Death:593
Health unit 33	395	Discharge:240
Health unit 33	395	Death:155
Health unit 34	443	Discharge:233
Health unit 34	443	Death:210
Health unit 35	193	Discharge:193
Health unit 35	193	Death:0
Health unit 36	4.181	Discharge:3.774
Health unit 36	4.181	Death:407
Health unit 37	422	Discharge:422
Health unit 37	422	Death:0
Health unit 38	12	Discharge:12
Health unit 38	12	Death:0
Health unit 39	522	Discharge:467
Health unit 39	522	Death:55
Health unit 40	161	Discharge:119
Health unit 40	161	Death:42
Health unit 41	510	Discharge:256
Health unit 41	510	Death:254
Length of stay	< 7	16.693	Discharge: 13.658
	< 7	16.693	Death:3.035
	7≤ LoS ≤ 14	11.499	Discharge: 9.824
	7≤ LoS ≤ 14	11.499	Death: 1.675
	>14	9.831	Discharge: 7.927
	>14	9.831	Death: 1.904
Outcomes	Discharge	31.409
Outcomes	Death	6.614

Open in a new tab

In addition, the database contains outcomes by each provider hospital (41), to address which health units had the highest number of requests and their respective outcomes, given that his feature showed a high correlation with several other dataset variables. Each hospital has a different treatment specialty, and thus some receive requests of greater complexity and mortality than others, culminating in different proportions of discharges and deaths. Finally, the data includes around 2055 different diseases classified by the International Classification of Diseases (ICD-10), which were also examined for recurrence.

As for the statistical profile, the average age is 53.38 years, with a standard deviation of 26.82 years and a median of 59 years. The average hospitalization time was 12.96 days, with a standard deviation of 17.67 days and a median of 7 days. The mean EUP score was 3.15 and the median was 3. The mean SOFA scale was 1.2 and the median 1.

Regarding the ICD, Table 4 shows the ten most recurrent ICDs, followed by the municipalities with the highest incidence and the hospitals that treat the most. The state capital, the city of Natal, has the highest number of inhabitants and has the highest incidence of ICDs 6 and 10. In contrast, Mossoró, the second largest municipality in terms of inhabitants, has a higher incidence in four of the ten. The noteworthy point is that there is no significant number of requests for these diseases among Parnamirim, São Gonçalo do Amarante, and Macaíba municipalities, which are the 3rd, 4th and 5th most populous municipalities.

Table 4. Distribution of the most frequent ICDs by municipality and hospital.

Code	Name	Frequency	Municipality with the highest incidence	Hospitals that treat the most
J18.9	Unspecified Pneumonia	4284	Natal: 1437	Provider Hospital 28: 504
			Mossoró: 763	Provider Hospital 6: 344
			Santo Antônio: 278	Provider Hospital 11: 333
I21.9	Unspecified Acute Myocardial Infarction	1738	Mossoró 781	Provider Hospital 36: 910
			Natal: 150	Provider Hospital 6: 158
			Currais Novos:97	Provider Hospital 7: 119
I64	Stroke, Not Specified as Hemorrhagic or Ischemic	1540	Mossoró: 1010	Provider Hospital 28: 969
			Caicó: 150	Provider Hospital 32: 164
			Natal: 148	Provider Hospital 6: 73
N39.0	Urinary Tract Infection of Unspecified Localization	965	Natal: 408	Provider Hospital 21: 125
			Mossoró: 83	Provider Hospital 41: 102
			Parnamirim: 79	Provider Hospital 30: 67
I50.0	Congestive Heart Failure	905	Natal: 243	Provider Hospital 6: 134
			Mossoró: 196	Provider Hospital 28: 73
			Currais Novos: 97	Provider Hospital 21: 68
I20.0	Unstable Angina	677	Mossoró: 531	Provider Hospital 36: 529
			Natal: 53	Provider Hospital 6: 30
			Currais Novos: 8	Provider Hospital 21: 13
F20.8	Other Schizophrenias	587	Natal: 292	Provider Hospital 21: 293
			Parnamirim: 50	Provider Hospital 7: 272
			Ceará-Mirim: 34	Provider Hospital 6: 9
A46	Erysipelas	546	Natal: 180	Provider Hospital 21: 85
			Mossoró: 70	Provider Hospital 32: 64
			Caicó: 62	Provider Hospital 39: 33
A41.9	Unspecified Septicemia	542	Natal: 151	Provider Hospital 32: 81
			Mossoró: 115	Provider Hospital 28: 72
			Caicó: 67	Provider Hospital 6: 38
I25.2	Old Myocardial Infarction	510	Mossoró: 373	Provider Hospital 36: 374
			Natal: 31	Provider Hospital 7: 19
			João Câmara: 19	Provider Hospital 25: 17

Open in a new tab

SOFA and EUP are two tools used to evaluate the hospital’s bed priority for each patient, considering that EUP revolves around SOFA, The Charlson Comorbidity Index (CCI) and the Clinical Frailty Scale (CFS). According to the data, it is possible to identify that EUP, has a more normalized aggregation to the outcomes classification. Meanwhile, SOFA = 1, was responsible for characterizing 78% of the data, a similar percentage is presented in the sum of requests with EUP 2 (46.2%), 3 (19.1%), and 4 (14.3%). In other words, while SOFA indicates that 78% of referrals had the same degree of priority, EUP structures the same percentage into three different categories. Given the health sector’s peculiarities, the EUP is an indicator that minimizes the generalization of different clinical conditions.

Another important point to evaluate is the ICD that most frequently resulted in death. Naturally, each ICD has its own intrinsic lethality level, meaning that some diseases kill more than others. However, it is necessary to analyze the frequency of certain occurrences and whether the incidence is local and already expected by public health institutions. Hence, with the data in hand, public health authorities can evaluate and orchestrate future intervention proposals. As shown in Table 5, Unspecified Pneumonia (J18.9) was the disease with the highest frequency (see Table 4) and resulted in the most deaths. Around 24.5% of the patients classified with this disease died, resulting in 15.9% of the total number of deaths. However, Unspecified Septicemia is one of the most lethal diseases and is responsible for the death of 50.5% of patients diagnosed with this disease.

Table 5. Distribution of the ICDs with the highest number of deaths.

Code	Name	Frequency	Deaths/Total incidence
J18.9	Unspecified Pneumonia	1052	24.5%
A41.9	Unspecified Septicemia	279	50.5%
I50.0	Congestive Heart Failure	247	27.3%
J18.0	Unspecified Bronchopneumonia	110	24.8%
I21.9	Unspecified Acute Myocardial Infarction	94	12.6%

Open in a new tab

Regarding the data correlation, shown in Fig 2, Phik’s correlation revealed that the features that relate most closely to the outcome are the output bed type, requested bed type, entry bed type, SOFA scale, ICD, age, EUP score, and provider unit. Length of stay and gender did not present any relevant correlation for this topic.

Machine learning model results

Table 2 shows the selection of hyperparameters that indicated the best results for the selected models. The Decision Tree model obtained the best criterion results when the entropy node division strategy was selected, max depth of the tree with a value of 50, min samples leaf with a value of 1 and max feature, square root. Random Forest obtained the best results with entropy (criterion), 50 (max depth of tree), 400 (number estimators) and sqrt (max features). For Gradient Boosting, squared error (criterion), 10 (max depth of tree), 50 (number estimators) and sqrt (max features). In the Adabost model, the best results were: 1.0 (learning rate), 400 (number estimators), and samme.r (algorithm). In XGBoost: 0.1 (learning rate), 200 (number estimators), 50 (max depth) and 1.0 (colsample by tree). The MLP models used the same hyperparameters in SGD and ADAM (hidden layer sizes, activation, batch size) which resulted in the same values: 70, relu and 32.

As for the results obtained by the selected metrics, XGBoost scored highest in accuracy (87.77%) and recall (87.77%). On the other hand, the Random Forest model (87.85%) was the most accurate, i.e. being the model that best classifies the positive outcome. As for the F1-Score value, the Gradient Boosting model had the highest value (87.56%). As for specificity, a parameter that assesses the classification performance of the negative outcome, it can be seen that the multilayer perceptron models outperform the others. The highest score was obtained by the SGD (82.94%). Table 6 presents the performance metrics for each machine learning model, including accuracy, precision, recall, F1-Score, and specificity. Notably, XGBoost outperformed the other models in accuracy and recall, making it a robust choice for predicting patient outcomes in bed regulation. However, the high specificity observed in the MLP models indicates that these models may be more suitable when the goal is to minimize false positives, particularly in critical care cases. For a better comparison of the performance of the models used, Fig 3 presents the values of each metric per computational model.

Table 6. Metrics obtained by the computer models.

Models	Accuracy	Precision	Recall	F1-Score	Specificity
Decision Tree	82.97(+0.13)	84.26(+0.19)	82.96(+0.13)	83.51(+0.14)	64.36(+0.42)
Random Forest	87.20(+0.01)	87.85(+0.03)	87.20(+0.01)	87.47(+0.01)	72.98(+0.17)
Gradient Boosting	87.14(+0.05)	88.21(+0.03)	87.14(+0.05)	87.56(+0.03)	75.47(+0.24)
Adaboost	86.69(+0.05)	87.76(+0.05)	86.69(+0.05)	87.12(+0.05)	74.25(+0.06)
XGBoost	87.77(+0.07)	87.46(0.04)	87.77(+0.07)	87.60(+0.10)	66.96(+0.04)
MLP SGD	83.36(+0.17)	88.10(+0.07)	83.36(+0.17)	84.73(0.13)	82.94(+0.50
MLP ADAM	82.88(+0.76)	87.87(+0.13)	82.88(+0.76)	84.33(+0.61)	82.58(+0.62)

Open in a new tab

Based on the results of the models, we performed a chi-square statistical validation to analyze whether the behavior of the models has statistical significance. For this, a contingency table was created with the distribution of real and predicted values of all models and for all cases a p value < 0.01 was obtained.

As for the features that were important for training the models, the most relevant features for classifying the Decision Tree were bed type, age, provider health unit and icd. The non-relevant elements were requested bed type, entry bed type and SOFA. In the Random Forest model, output bed type, age, EUP, provider health unit and icd scored the highest, while sex and SOFA were the least relevant characteristics. For the Gradient Boosting classifier output bed type, age, EUP and provider health unit were the most relevant, while sex, requested bed type, entry bed type were the lowest scorers. Adaboost considered the best characteristics to be length of stay, provider health unit, EUP and age, while the least relevant were sex, requested bed type and entry bed type. XGBoost considered output bed type, EUP and entry bed type as the most important characteristics and sex, SOFA and requested bed type as the least important. For the models that used MLP, Adam considered output bed type, requested bed type, provider health unit and age to be the most relevant, while SGD considered output bed type, provider health unit, age and ICD to be the most significant. The least important features were entry bed type and sex for Adam; and sex and requested bed type for SGD. Figs 4 and 5 show the important features of the machine learning models.

Compared to Phik’s correlation, except for Adabost, all the other classifiers included output bed type as the most important feature in the classification process, which corroborates Phik’s correlation (output bed being the feature with the highest correlation with the outcome) and the weaker correlation, sex was identified as the least relevant feature for Random Forest, Grandient Boosting, Adaboost, XGBoost and SGD, while length of stay, which is another feature that has been shown to have a low correlation with the outcome, was not identified as worse in any of the classifiers, however, for Adaboost this feature was the most significant.

The ROC curve (receiver operating characteristic curve) helps to visualize the performance of classifiers to select an appropriate operating point or decision threshold [53]. The discriminative capacity is usually quantified by the area under the AUC curve when considering the prediction of a binary event. It relates the variation in the rate of true positives and false positives predicted by the models, with results on a scale of 0 to 1. Although there is no definitive consensus in the literature, most studies using this tool consider an AUC between 0.7 and 0.8 to be good and acceptable, and between 0.8 and 0.9 to be very good [54, 55].

Thus, the Decision Tree (AUC = 0.738), XGBoost (AUC = 0.766) and Random Forest (AUC = 0.785) models performed well, while the Adaboost (AUC = 0.804), Adam (AUC = 0.814) and SGD (AUC = 0.821) models performed better, falling into the very good category. Fig 6 shows the results obtained.

Discussion

The use of artificial intelligence and computational methods to solve and predict problems in the health field has been going on for some years now, and although there is a considerable range of solutions in various segments, from predicting diseases by diagnosing medical images [56–60] to the classification of important markers for the prediction of cardiological [25, 61], and ophthalmological diseases [62, 63] or the analysis of data to predict early-stage cancer [64–66]; as well as robotic mechanisms for surgery, for example [67–70]. There are still some sectors that have not been explored or that have made negligible contributions [57, 71–73].

According to Yu, Beam and Kohane [57], the association of artificial intelligence will be able to contribute even more effectively to clinical practices and health management. In this way, healthcare professionals will be able to reduce the time spent on repetitive tasks in order to explore better treatments and clinical solutions aimed at patient care, something that machines cannot do and which require more humanized treatment. According to Valentim et al [8], the use of digital health solutions based on artificial intelligence are already considered relevant tools by healthcare managers, as they help to make decision-making more timely, effective and based on robust scientific evidence.

In the process of regulating hospital beds, the use of artificial intelligence helps to reduce medical subjectivity in the face of the repetitive process of countless daily regulations, tasks that can often become a tiring activity throughout the day. This certainly contributes to minimizing errors in the indication of hospital beds, especially when it comes to public health, since the daily volume of care is extremely high, as is the case in the state of Rio Grande Norte in Brazil, which has a population of approximately 3.5 million inhabitants. This could result in better resolutions for patients, as well as better equity in access to the resources available in the public health system. All of this will lead to a more timely hospitalization process for patients, and consequently to better performance in terms of hospital bed turnover—better average occupancy time for hospital beds across the entire public health network [2]. In general, the use of machine learning tools can optimize the care process, increasing efficacy, efficiency and effectiveness, which induces better resilience of the health system, especially in times of crisis, as was the case during COVID-19 [8, 74, 75].

At the management level, adopting AI-driven systems for bed regulation could lead to significant improvements in resource allocation, reducing patient wait times and optimizing bed occupancy rates. However, implementing these systems at scale presents challenges, such as ensuring adequate training for health professionals and integrating AI tools with existing hospital infrastructure. Addressing these challenges will be critical for maximizing the potential benefits of AI in the public healthcare system [5, 8].

In this study, machine learning techniques were used in different tree and ensemble models, as well as artificial neural network models on hospital bed regulation data, and the aim was to classify the outcome of patients regardless of their ICD, to help the regulating doctor and reduce subjectivity during the hospital bed regulation process.

As for the results of the computer models, XGBoost showed the best accuracy (87.77%) and recall (87.77%) values, i.e. of all the models used, it classifies the data better in general, regardless of the outcome (discharge or death), as well as, given the positive outcome, the proportion that was correctly classified. As for the accuracy indicator, which identifies which proportion of positive outcomes was correct, Random Forest performed best (87.05%). As for the F1-Score, Gradient Boosting has a better harmonic mean between precision and recall, i.e. it has a better balance in the metrics that assess the positive outcome. Regarding specificity, a metric that assesses the classification of the negative outcome, the neural network models showed the best results when compared to the tree and ensemble models, achieving scores of 82.58% (ADAM) and 82.94% (SGD). For the ROC-AUC, the SGD and ADAM models also performed better, because, as they had a more balanced classification of positive and negative outcomes, the ROC-AUC value was in the range of 82.13% and 81.42%, respectively.

Considering these results, the models used in this experiment are not only able to predict which patients are more likely to be discharged or die, but also allow us to understand which samples are being better classified concerning the outcome and the best type of hospital bed according to the clinical conditions of each patient. And so, the main metric analyzed should not only be accuracy; the other metrics that point to a positive outcome (precision, recall and ROC-AUC) should also be maximized [2]. Furthermore, it also has a positive impact on the pace of work of the regulatory professional, given that in situations of high demand and overload of requests, the assertiveness of the regulatory process can be compromised, and so the models contribute to better regulatory conduct [76, 77].

Conclusion

This study used the regulation database of the RegulaRN Leitos Gerais platform between 2021 and 2024 in machine learning models to predict the outcome of discharge and death in different diseases that require hospitalization. The results of this article show that there is no single model that obtains the best accuracy, precision, recall, F1-Score, specificity, and ROC-AUC metrics. Thus, depending on the objectives of the regulatory professionals, it should be observed which model can provide the best result based on the desired metric, i.e. for example, if the regulator’s objective is to observe the best classifications for the positive outcome, it should use XGBoost and Random Forest; If the objective is to evaluate the best classification for the negative outcome, the multilayer perceptron models should be evaluated.

It should be noted that artificial intelligence computer models enhance the activities carried out in the healthcare and management sectors. Research in this area should therefore be increasingly explored in order to minimize the precariousness and weaknesses that exist in the different health segments. In this way, this research also aims to make a positive contribution to the health system such as the SUS, which aims to guarantee universal and comprehensive access to health with equity.

A significant limitation of this study is the incomplete dataset, particularly the absence of detailed information such as pregnancy status and gestational age. This missing information could introduce bias in model predictions, particularly for patient subgroups with different clinical needs. Future work should focus on improving data collection protocols to ensure that such critical variables are recorded, allowing for more nuanced and accurate model predictions across diverse patient groups. During the evaluation of the database, some gaps were found in the data, which is why it was not included for training the models. However, for some diseases, knowing whether the patient is pregnant or not and the appropriate length of pregnancy are essential. In addition, this study considered the same evaluation of hospital outcomes in different diseases with different morbidity scales. Furthermore, another limitation of this work was the non-inclusion of other models widely used in academic literature, such as k-Nearest Neighbors (kNN) and Support Vector Machines (SVM) [78–80], as they were not included within the initial scope of this research. However, it is considered that for future work the scope of computational models can be expanded and these models included. Furthermore, still addressing future work, creating a new feature that can categorize diseases by morbidity could contribute to a more appropriate classification of the models. Furthermore, trying to identify which treatment protocols were used to treat certain diseases can also be a relevant indicator for classifying models.

Acknowledgments

We would like to thank the Public Health Secretariat of Rio Grande do Norte (SESAP/RN), the Health Technological Innovation Laboratory (LAIS) of the Federal University of Rio Grande do Norte (UFRN), the Advanced Innovation Center (NAVI) of the Federal Institute do Rio Grande do Norte (IFRN), to LyRIDS, ECE-Engineering School and the Department of Informatics and Applied Mathematics for the support necessary for the development of this research.

Data Availability

The data used in this research can be accessed via the link: https://zenodo.org/records/11387710.

Funding Statement

The present study was funded through the Project Regula SESAP-RN/FUNCERN, grant number 69/2021, carried out by the Laboratory of Technological Innovation in Health (LAIS) of the Federal University of Rio Grande do Norte (UFRN) in cooperation with the Secretary of Public Health of Rio Grande do Norte.

References

1. Bastos LBR, Barbosa MA, Rosso CFW, Oliveira LMdAC, Ferreira IP, Bastos DAdS, et al. Practices and challenges on coordinating the Brazilian Unified Health System. Revista de Saúde Pública. 2020;54:25. doi: 10.11606/s1518-8787.2020054001512 [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Barreto TdO, Veras NVR, Cardoso PH, Fernandes FRdS, Medeiros LPdS, Bezerra MV, et al. Artificial intelligence applied to analyzes during the pandemic: COVID-19 beds occupancy in the state of Rio Grande do Norte, Brazil. Frontiers in Artificial Intelligence. 2023;6:1290022. doi: 10.3389/frai.2023.1290022 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Brasil. Portaria nº 1.559, de 1º de agosto de 2008. Institui a Política Nacional de Regulação do Sistema Único de Saúde-SUS. Diário Oficial da União. 2008;.
4.Brasil. Portaria nº 2, de 28º de setembro de 2017. Consolidação das normas sobre as políticas nacionais de Saú do Sistema Único de Saúde-SUS. Diário Oficial da União. 2017;.
5. Maldonado RN, Savio RO, Feijó VBER, Aroni P, Rossaneis MA, Haddad MdCFL. Hospital indicators after implementation of bed regulation strategies: an integrative review. Revista Brasileira de Enfermagem. 2021;74:e20200022. doi: 10.1590/0034-7167-2020-0022 [DOI] [PubMed] [Google Scholar]
6.Cordeiro MF. SISREG: uma ferramenta de desafios e avanços para a garantia do direito a saúde. 2015;.
7. Junior JRM, de Souza Junior AA, da Luz AEdJ. The impact of COVID-19 on the municipal Regulation System (SISREG) of Rio de Janeiro (RJ). Research, Society and Development. 2024;13(4):e5613445564–e5613445564. doi: 10.33448/rsd-v13i4.45564 [DOI] [Google Scholar]
8. Valentim RAdM, Lima TS, Cortez LR, Barros DMdS, Silva RDd, Paiva JCd, et al. The relevance a technology ecosystem in the Brazilian National Health Service’s Covid-19 response: the case of Rio Grande do Norte, Brazil. Ciência & Saúde Coletiva. 2021;26:2035–2052. [DOI] [PubMed] [Google Scholar]
9.Medina MVB. Análise da utilização da Escala Quick Sequential Organ Failure Assessment para tomada de decisão na regulação de leitos de UTI. Universidade Federal do Rio Grande do Norte; 2023.
10.Konder M, O’Dwyer G. Regulation of access to hospital beds in emergency care and the development of integrated health services. 2019;.
11. Kim M, Lee JY, Park JS, Kim HA, Hyun M, Suh YS, et al. Lessons from a COVID-19 hospital, Republic of Korea. Bulletin of the World Health Organization. 2020;98(12):842. doi: 10.2471/BLT.20.261016 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Lu Y, Guan Y, Zhong X, Fishe JN, Hogan T. Hospital beds planning and admission control policies for COVID-19 pandemic: A hybrid computer simulation approach. In: 2021 IEEE 17th International Conference on Automation SCience and Engineering (CASE). IEEE; 2021. p. 956–961.
13. Pecoraro F, Luzi D, Clemente F. The efficiency in the ordinary hospital bed management: A comparative analysis in four European countries before the COVID-19 outbreak. Plos one. 2021;16(3):e0248867. doi: 10.1371/journal.pone.0248867 [DOI] [PMC free article] [PubMed] [Google Scholar]
14. Shi F, Li H, Liu R, Liu Y, Liu X, Wen H, et al. Emergency preparedness and management of mobile cabin hospitals in China during the COVID-19 pandemic. Frontiers in Public Health. 2022;9:763723. doi: 10.3389/fpubh.2021.763723 [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Alavinejad M, Mellado B, Asgary A, Mbada M, Mathaha T, Lieberman B, et al. Management of hospital beds and ventilators in the Gauteng province, South Africa, during the COVID-19 pandemic. PLOS global public health. 2022;2(11):e0001113. doi: 10.1371/journal.pgph.0001113 [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Kuzior A, Kashcha M, Kuzmenko O, Lyeonov S, Brożek P. Public health system economic efficiency and COVID-19 resilience: Frontier DEA analysis. International Journal of Environmental Research and Public Health. 2022;19(22):14727. doi: 10.3390/ijerph192214727 [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Taylor CA, Draney MT, Ku JP, Parker D, Steele BN, Wang K, et al. Predictive medicine: computational techniques in therapeutic decision-making. Computer Aided Surgery: Official Journal of the International Society for Computer Aided Surgery (ISCAS). 1999;4(5):231–247. doi: [DOI] [PubMed] [Google Scholar]
18. Shahid N, Rappon T, Berta W. Applications of artificial neural networks in health care organizational decision-making: A scoping review. PloS one. 2019;14(2):e0212356. doi: 10.1371/journal.pone.0212356 [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Tian S, Yang W, Le Grange JM, Wang P, Huang W, Ye Z. Smart healthcare: making medical care more intelligent. Global Health Journal. 2019;3(3):62–65. doi: 10.1016/j.glohj.2019.07.001 [DOI] [Google Scholar]
20. Panagiotou OA, Högg LH, Hricak H, Khleif SN, Levy MA, Magnus D, et al. Clinical application of computational methods in precision oncology: a review. JAMA oncology. 2020;6(8):1282–1286. doi: 10.1001/jamaoncol.2020.1247 [DOI] [PubMed] [Google Scholar]
21.Bian J, Modave F. The rapid growth of intelligent systems in health and health care; 2020. [DOI] [PubMed]
22. Gupta PK, Ramachandran AT, Keerthi AM, Dave PS, Giridhar S, Kallapur SS, et al. An overview of clinical decision support system (CDSS) as a computational tool and its applications in public health. Applications in ubiquitous computing. 2021; p. 81–117. doi: 10.1007/978-3-030-35280-6_5 [DOI] [Google Scholar]
23. Moulaei K, Shanbehzadeh M, Mohammadi-Taghiabad Z, Kazemi-Arpanahi H. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC medical informatics and decision making. 2022;22(1):2. doi: 10.1186/s12911-021-01742-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Albuquerque G, Fernandes F, Barbalho IM, Barros DM, Morais PS, Morais AH, et al. Computational methods applied to syphilis: where are we, and where are we going? Frontiers in Public Health. 2023;11:1201725. doi: 10.3389/fpubh.2023.1201725 [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Carvalho DRd, Araújo BGd, Lacerda JMT, Dantas MdCR, Hékis HR, Valentim RAdM. An architecture for online transient detection in electrocardiogram signals on the MP-HA protocol. Revista Brasileira de Engenharia Biomédica. 2012;28:346–354. [Google Scholar]
26. Levantesi S, Pizzorusso V. Application of machine learning to mortality modeling and forecasting. Risks. 2019;7(1):26. doi: 10.3390/risks7010026 [DOI] [Google Scholar]
27. Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE reviews in Biomedical Engineering. 2020;14:116–126. doi: 10.1109/RBME.2020.3007816 [DOI] [PubMed] [Google Scholar]
28. Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC medical research methodology. 2021;21:1–14. doi: 10.1186/s12874-021-01284-z [DOI] [PMC free article] [PubMed] [Google Scholar]
29. Dixit RR. Risk Assessment for Hospital Readmissions: Insights from Machine Learning Algorithms. Sage Science Review of Applied Machine Learning. 2021;4(2):1–15. [Google Scholar]
30. Iwase S, Nakada Ta, Shimada T, Oami T, Shimazui T, Takahashi N, et al. Prediction algorithm for ICU mortality and length of stay using machine learning. Scientific reports. 2022;12(1):12912. doi: 10.1038/s41598-022-17091-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
31. Baak M, Koopman R, Snoek H, Klous S. A new correlation coefficient between categorical, ordinal and interval variables with Pearson characteristics. Computational Statistics & Data Analysis. 2020;152:107043. doi: 10.1016/j.csda.2020.107043 [DOI] [Google Scholar]
32. Iwendi C, Bashir AK, Peshkar A, Sujatha R, Chatterjee JM, Pasupuleti S, et al. COVID-19 patient health prediction using boosted random forest algorithm. Frontiers in public health. 2020;8:357. doi: 10.3389/fpubh.2020.00357 [DOI] [PMC free article] [PubMed] [Google Scholar]
33. Aljameel SS, Khan IU, Aslam N, Aljabri M, Alsulmi ES. Machine Learning-Based Model to Predict the Disease Severity and Outcome in COVID-19 Patients. Scientific programming. 2021;2021(1):5587188. [Google Scholar]
34. Endo PT, Santos GL, de Lima Xavier ME, Nascimento Campos GR, de Lima LC, Silva I, et al. Illusion of truth: analysing and classifying COVID-19 fake news in brazilian portuguese language. Big Data and Cognitive Computing. 2022;6(2):36. doi: 10.3390/bdcc6020036 [DOI] [Google Scholar]
35. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research. 2002;16:321–357. doi: 10.1613/jair.953 [DOI] [Google Scholar]
36.Maiga J, Hungilo GG, et al. Comparison of machine learning models in prediction of cardiovascular disease using health record data. In: 2019 international conference on informatics, multimedia, cyber and information system (ICIMCIS). IEEE; 2019. p. 45–48.
37.Hamida S, El Gannour O, Cherradi B, Ouajji H, Raihani A. Optimization of machine learning algorithms hyper-parameters for improving the prediction of patients infected with COVID-19. In: 2020 ieee 2nd international conference on electronics, control, optimization and computer science (icecocs). IEEE; 2020. p. 1–6.
38. Papaiz F, Dourado MET Jr, de Medeiros Valentim RA, Pinto R, de Morais AHF, Arrais JP. Ensemble-imbalance-based classification for amyotrophic lateral sclerosis prognostic prediction: identifying short-survival patients at diagnosis. BMC Medical Informatics and Decision Making. 2024;24(1):80. doi: 10.1186/s12911-024-02484-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Divya KS, Bhargavi P, Jyothi S. Machine learning algorithms in big data analytics. Int J Comput Sci Eng. 2018;6(1):63–70. [Google Scholar]
40. Liu J, Wang L, Zhang L, Zhang Z, Zhang S. Predictive analytics for blood glucose concentration: an empirical study using the tree-based ensemble approach. Library Hi Tech. 2020;38(4):835–858. doi: 10.1108/LHT-08-2019-0171 [DOI] [Google Scholar]
41.Rahul K, Banyal RK, Goswami P, Kumar V. Machine learning algorithms for big data analytics. In: Computational Methods and Data Engineering: Proceedings of ICMDE 2020, Volume 1. Springer; 2021. p. 359–367.
42. Charbuty B, Abdulazeez A. Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends. 2021;2(01):20–28. doi: 10.38094/jastt20165 [DOI] [Google Scholar]
43. Rigatti SJ. Random forest. Journal of Insurance Medicine. 2017;47(1):31–39. doi: 10.17849/insm-47-01-31-39.1 [DOI] [PubMed] [Google Scholar]
44. Fafalios S, Charonyktakis P, Tsamardinos I. Gradient boosting trees. Gnosis Data Analysis PC. 2020;1. [Google Scholar]
45. Schapire RE. Empirical inference. Berlin, Heidelberg. 2013; p. 37–52. [Google Scholar]
46.Sheng C, Yu H. An optimized prediction algorithm based on XGBoost. In: 2022 International Conference on Networking and Network Applications (NaNA). IEEE; 2022. p. 1–6.
47. Popescu MC, Balas VE, Perescu-Popescu L, Mastorakis N. Multilayer perceptron and neural networks. WSEAS Transactions on Circuits and Systems. 2009;8(7):579–588. [Google Scholar]
48.Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014;.
49. Pedregosa F. Scikit-learn: Machine learning in python Fabian. Journal of machine learning research. 2011;12:2825. [Google Scholar]
50. Ensor KB, Glynn PW. Stochastic optimization via grid search. Lectures in Applied Mathematics-American Mathematical Society. 1997;33:89–100. [Google Scholar]
51. Bergstra J, Bengio Y. Random search for hyper-parameter optimization. Journal of machine learning research. 2012;13(2). [Google Scholar]
52. Ahsan MM, E Alam T, Trafalis T, Huebner P. Deep MLP-CNN model using mixed-data to distinguish between COVID-19 and Non-COVID-19 patients. Symmetry. 2020;12(9):1526. doi: 10.3390/sym12091526 [DOI] [Google Scholar]
53. Bradley AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern recognition. 1997;30(7):1145–1159. doi: 10.1016/S0031-3203(96)00142-2 [DOI] [Google Scholar]
54. De Hond AA, Steyerberg EW, Van Calster B. Interpreting area under the receiver operating characteristic curve. The Lancet Digital Health. 2022;4(12):e853–e855. doi: 10.1016/S2589-7500(22)00188-1 [DOI] [PubMed] [Google Scholar]
55. Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean journal of anesthesiology. 2022;75(1):25–36. doi: 10.4097/kja.21209 [DOI] [PMC free article] [PubMed] [Google Scholar]
56. Mohammad-Rahimi H, Nadimi M, Ghalyanchi-Langeroudi A, Taheri M, Ghafouri-Fard S. Application of machine learning in diagnosis of COVID-19 through X-ray and CT images: a scoping review. Frontiers in cardiovascular medicine. 2021;8:638011. doi: 10.3389/fcvm.2021.638011 [DOI] [PMC free article] [PubMed] [Google Scholar]
57. Yu KH, Beam AL, Kohane IS. Artificial intelligence in healthcare. Nature biomedical engineering. 2018;2(10):719–731. doi: 10.1038/s41551-018-0305-z [DOI] [PubMed] [Google Scholar]
58. Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future healthcare journal. 2019;6(2):94–98. doi: 10.7861/futurehosp.6-2-94 [DOI] [PMC free article] [PubMed] [Google Scholar]
59. Owoyemi A, Owoyemi J, Osiyemi A, Boyd A. Artificial Intelligence for Healthcare in Africa. Frontiers in digital health 2: 6; 2020. doi: 10.3389/fdgth.2020.00006 [DOI] [PMC free article] [PubMed] [Google Scholar]
60. Yang CC. Explainable artificial intelligence for predictive modeling in healthcare. Journal of healthcare informatics research. 2022;6(2):228–239. doi: 10.1007/s41666-022-00114-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
61. Cuocolo R, Perillo T, De Rosa E, Ugga L, Petretta M. Current applications of big data and machine learning in cardiology. Journal of geriatric cardiology: JGC. 2019;16(8):601. doi: 10.11909/j.issn.1671-5411.2019.08.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
62. Barros DM, Moura JC, Freire CR, Taleb AC, Valentim RA, Morais PS. Machine learning applied to retinal image processing for glaucoma detection: review and perspective. Biomedical engineering online. 2020;19:1–21. doi: 10.1186/s12938-020-00767-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
63. Srivastava O, Tennant M, Grewal P, Rubin U, Seamone M. Artificial intelligence and machine learning in ophthalmology: A review. Indian Journal of Ophthalmology. 2023;71(1):11–17. doi: 10.4103/ijo.IJO_1569_22 [DOI] [PMC free article] [PubMed] [Google Scholar]
64. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
65. Firmino M, Angelo G, Morais H, Dantas MR, Valentim R. Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. Biomedical engineering online. 2016;15:1–17. doi: 10.1186/s12938-015-0120-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
66. Galvao-Lima L, Morais H, Valentim R, Barreto E. miRNAs as biomarkers for early cancer detection and their application in the development of new diagnostic tools. Biomedical engineering online. 2021;20:1–21. doi: 10.1186/s12938-021-00857-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
67. Zhao B, Waterman R, Urman R, Gabriel RA. A machine learning approach to predicting case duration for robot-assisted surgery. Journal of Medical Systems. 2019;43:1–32. doi: 10.1007/s10916-018-1151-y [DOI] [PubMed] [Google Scholar]
68. Panesar S, Cagle Y, Chander D, Morey J, Fernandez-Miranda J, Kliot M. Artificial intelligence and the future of surgical robotics. Annals of surgery. 2019;270(2):223–226. doi: 10.1097/SLA.0000000000003262 [DOI] [PubMed] [Google Scholar]
69. Zhou XY, Guo Y, Shen M, Yang GZ. Application of artificial intelligence in surgery. Frontiers of medicine. 2020;14:417–430. doi: 10.1007/s11684-020-0770-0 [DOI] [PubMed] [Google Scholar]
70. Moglia A, Georgiou K, Georgiou E, Satava RM, Cuschieri A. A systematic review on artificial intelligence in robot-assisted surgery. International Journal of Surgery. 2021;95:106151. doi: 10.1016/j.ijsu.2021.106151 [DOI] [PubMed] [Google Scholar]
71. Fernandes YYMP, Araújo GTd, Araújo BGd, Dantas MdCR, Carvalho DRd, Valentim RAdM. ILITIA: telehealth architecture for high-risk gestation classification. Research on Biomedical Engineering. 2017;33(3):237–246. doi: 10.1590/2446-4740.09416 [DOI] [Google Scholar]
72. Reddy S. Explainability and artificial intelligence in medicine. The Lancet Digital Health. 2022;4(4):e214–e215. doi: 10.1016/S2589-7500(22)00029-2 [DOI] [PubMed] [Google Scholar]
73. Schwalbe N, Wahl B. Artificial intelligence and the future of global health. The Lancet. 2020;395(10236):1579–1586. doi: 10.1016/S0140-6736(20)30226-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
74. Ammar W, Kdouh O, Hammoud R, Hamadeh R, Harb H, Ammar Z, et al. Health system resilience: Lebanon and the Syrian refugee crisis. Journal of global health. 2016;6(2). doi: 10.7189/jogh.06.020704 [DOI] [PMC free article] [PubMed] [Google Scholar]
75. Massuda A, Hone T, Leles FAG, De Castro MC, Atun R. The Brazilian health system at crossroads: progress, crisis and resilience. BMJ global health. 2018;3(4):e000829. doi: 10.1136/bmjgh-2018-000829 [DOI] [PMC free article] [PubMed] [Google Scholar]
76. Muhammad L, Algehyne EA, Usman SS, Ahmad A, Chakraborty C, Mohammed IA. Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset. SN computer science. 2021;2(1):1–13. doi: 10.1007/s42979-020-00394-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
77. Silva Junior CL, Guabiraba KPdL, Gomes GG, Andrade CLTd, Melo EA. Outpatient regulation in Primary Care in the municipality of Rio de Janeiro, Brazil, based on the local regulatory doctors. Ciência & Saúde Coletiva. 2022;27:2481–2493. [DOI] [PubMed] [Google Scholar]
78. Ali A, Khan Z, Khan DM, Aldahmani S. An Optimal Random Projection k Nearest Neighbours Ensemble via Extended Neighbourhood Rule for Binary Classification. IEEE Access. 2024;. [Google Scholar]
79. Ali A, Hamraz M, Gul N, Khan DM, Aldahmani S, Khan Z. A k nearest neighbour ensemble via extended neighbourhood rule and feature subsets. Pattern Recognition. 2023;142:109641. doi: 10.1016/j.patcog.2023.109641 [DOI] [Google Scholar]
80. Vijayarani S, Dhayanand S, Phil M. Kidney disease prediction using SVM and ANN algorithms. International Journal of Computing and Business Research (IJCBR). 2015;6(2):1–12. [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0315379.r001

Decision Letter 0

Luísa da Matta Machado Fernandes

24 Sep 2024

PONE-D-24-27498Artificial Intelligence Applied to Bed Regulation in Rio Grande do Norte: Data Analysis and Application of Machine Learning on the “RegulaRN Leitos Gerais” PlatformPLOS ONE

Dear Dr. Barreto,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

I hope this message finds you well. I have reviewed your study on improving technology for bed regulation within a universal health system and would like to offer some suggestions to enhance its international relevance.

To strengthen your findings, I recommend providing a more detailed explanation of the current limitations of the national system used for bed regulation. Additionally, contextualizing these regulations within the broader health system would provide valuable insights.

As per the reviewers' feedback, please update the discussion with current literature and enhance the comprehensibility of the methods and results section. Including a flowchart and improving the graphics and tables will also aid in conveying your findings more effectively.

I look forward to seeing the revised manuscript.

==============================

Please submit your revised manuscript by Nov 08 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Luísa da Matta Machado Fernandes, DrPH

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following financial disclosure:

[copy in funding statement].

Please state what role the funders took in the study. If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

If this statement is not correct you must amend it as needed.

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This study effectively applies machine learning to the RegulaRN Leitos Gerais platform to optimize hospital bed regulation in Rio Grande do Norte. Analyzing data from October 2021 to January 2024, it shows strong performance from models like XGBoost, Random Forest, and Gradient Boosting in accuracy, precision, and recall. However, the study would benefit from incorporating additional methods such as k-Nearest Neighbors (kNN) and Support Vector Machines (SVM), as well as referencing recent literature (e.g., doi.org/10.1109/ACCESS.2024.3392729 and doi.org/10.1016/j.patcog.2023.109641) to ensure a comprehensive evaluation and alignment with current advancements in healthcare machine learning.

Reviewer #2: • Suggested Improvement: The introduction clearly outlines the general problem of bed regulation but could be more detailed in discussing the relevance of artificial intelligence in this specific context by comparing it with similar studies.

o Lines 60-61: "Intelligent computer models can help reduce the impact of uncertainties and ambiguities in the regulatory process and improve decision-making support."

o Replace with: "Intelligent computer models have demonstrated significant potential in healthcare systems by reducing uncertainties and ambiguities in complex decision-making processes. For example, prior studies in similar healthcare contexts have shown that machine learning models can enhance hospital management by optimizing resource allocation and reducing patient waiting times (reference). This study aims to build on these findings to demonstrate the effectiveness of AI-based models specifically in bed regulation in Rio Grande do Norte."

More detailed discussion on model limitations The article mentions limitations but could provide a more robust discussion of how these limitations affect the results and suggest strategies to mitigate them.

• Line 398: "The study’s limitations include the fact that the health professionals did not provide more precise information on some of the data that could have been better analyzed in the models, such as whether the patient was pregnant."

• Replace with: "A significant limitation of this study is the incomplete dataset, particularly the absence of detailed information such as pregnancy status and gestational age. This missing information could introduce bias in model predictions, particularly for patient subgroups with different clinical needs. Future work should focus on improving data collection protocols to ensure that such critical variables are recorded, allowing for more nuanced and accurate model predictions across diverse patient groups."

More detailed explanation of the choice of machine learning models The choice of machine learning models is explained, but a more in-depth justification of why certain models (like XGBoost and Random Forest) were selected would be helpful.

• Line 165: "The definition of models for data classification involved algorithms that, according to the literature, perform well with high volumes of data."

• Replace with: "The selection of classification models was based on their proven ability to handle large volumes of imbalanced healthcare data. XGBoost, for example, has been shown to perform well in healthcare settings due to its gradient boosting framework, which effectively handles missing data and provides robust performance on tabular datasets (Chen & Guestrin, 2016). Random Forest, on the other hand, was selected for its ability to manage complex decision trees and its resistance to overfitting, particularly in high-dimensional datasets (Breiman, 2001). These models were chosen for their complementarity in addressing the specific challenges of bed regulation data in this study."

Better organization of tables and figures The presentation of tables and figures can be improved with more descriptive captions and the inclusion of an analysis immediately after presenting each figure/table to facilitate the interpretation of the results.

• Line 276: "Table 6 shows all the values obtained."

• Replace with: "Table 6 presents the performance metrics for each machine learning model, including accuracy, precision, recall, F1-Score, and specificity. Notably, XGBoost outperformed the other models in accuracy and recall, making it a robust choice for predicting patient outcomes in bed regulation. However, the high specificity observed in the MLP models indicates that these models may be more suitable when the goal is to minimize false positives, particularly in critical care cases."

More in-depth discussion of the practical impact of the results The discussion could delve deeper into the practical impact of adopting AI in the healthcare system and the challenges of implementing it on a large scale.

• Line 349: "At the management level, in response to a more efficient regulatory system, financial and human resources can be distributed in a way that is more coherent with the needs of the different health sectors."

• Replace with: "At the management level, adopting AI-driven systems for bed regulation could lead to significant improvements in resource allocation, reducing patient wait times and optimizing bed occupancy rates. However, implementing these systems at scale presents challenges, such as ensuring adequate training for health professionals and integrating AI tools with existing hospital infrastructure. Addressing these challenges will be critical for maximizing the potential benefits of AI in the public healthcare system."

I noticed the absence of graphs and visual figures that could significantly improve the clarity and understanding of the article, especially concerning the methodology and results. The inclusion of graphical elements such as flowcharts and result charts would be a valuable contribution to make the methodological process more accessible and the data easier to interpret.

Firstly, presenting a detailed methodology flowchart would be extremely helpful in guiding readers through all the stages described in the article. This would clarify the process from data extraction and processing to machine learning model selection, allowing for a clearer understanding of the methodological sequence.

Additionally, to facilitate the understanding of the results, it would be interesting to include graphs comparing the performance metrics of the different machine learning models. Bar graphs, for example, could visually show the accuracy, recall, specificity, and F1-Score of the tested models. The inclusion of ROC-AUC curves would also help visually present each model’s discriminative capabilities, making the information easier to interpret for readers.

Finally, the article mentions correlation analysis between variables, but including a correlation matrix, such as a heatmap, could visually highlight the most significant relationships between the variables. This type of graphical visualization would make data interpretation more immediate and intuitive for the reader.

These visual additions, such as flowcharts and result charts, would not only enhance the manuscript’s clarity but also help more effectively convey the complexities involved in data analysis and the results obtained.

Reviewer #3: Texto apresenta clareza de dados, necessidade de pequenos ajustes apontados no manuscrito enviado. A publicação duplicada, intencional ou não, pode prejudicar a credibilidade da pesquisa e comprometer os direitos de propriedade intelectual de ambos os periódicos.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Danielle Torres dos Santos Lopes

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PONE-D-24-27498_reviewer.docx

pone.0315379.s001.docx^{(146.9KB, docx)}

PLoS One. 2024 Dec 30;19(12):e0315379. doi: 10.1371/journal.pone.0315379.r002

Author response to Decision Letter 0

11 Oct 2024

A response to each suggestion was attached as a response document to the reviewers.

Attachment

Submitted filename: renamed_025d2.pdf

pone.0315379.s002.pdf^{(113.8KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0315379.r003

Decision Letter 1

Luísa da Matta Machado Fernandes

26 Nov 2024

Artificial Intelligence Applied to Bed Regulation in Rio Grande do Norte: Data Analysis and Application of Machine Learning on the “RegulaRN Leitos Gerais” Platform

PONE-D-24-27498R1

Dear Dr. Barreto,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Luísa da Matta Machado Fernandes, DrPH

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

PLoS One. doi: 10.1371/journal.pone.0315379.r004

Acceptance letter

Luísa da Matta Machado Fernandes

28 Nov 2024

PONE-D-24-27498R1

PLOS ONE

Dear Dr. Barreto,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Luísa da Matta Machado Fernandes

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Attachment

Submitted filename: PONE-D-24-27498_reviewer.docx

pone.0315379.s001.docx^{(146.9KB, docx)}

Attachment

Submitted filename: renamed_025d2.pdf

pone.0315379.s002.pdf^{(113.8KB, pdf)}

Data Availability Statement

The data used in this research can be accessed via the link: https://zenodo.org/records/11387710.

[pone.0315379.ref001] 1. Bastos LBR, Barbosa MA, Rosso CFW, Oliveira LMdAC, Ferreira IP, Bastos DAdS, et al. Practices and challenges on coordinating the Brazilian Unified Health System. Revista de Saúde Pública. 2020;54:25. doi: 10.11606/s1518-8787.2020054001512 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref002] 2. Barreto TdO, Veras NVR, Cardoso PH, Fernandes FRdS, Medeiros LPdS, Bezerra MV, et al. Artificial intelligence applied to analyzes during the pandemic: COVID-19 beds occupancy in the state of Rio Grande do Norte, Brazil. Frontiers in Artificial Intelligence. 2023;6:1290022. doi: 10.3389/frai.2023.1290022 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref003] 3.Brasil. Portaria nº 1.559, de 1º de agosto de 2008. Institui a Política Nacional de Regulação do Sistema Único de Saúde-SUS. Diário Oficial da União. 2008;.

[pone.0315379.ref004] 4.Brasil. Portaria nº 2, de 28º de setembro de 2017. Consolidação das normas sobre as políticas nacionais de Saú do Sistema Único de Saúde-SUS. Diário Oficial da União. 2017;.

[pone.0315379.ref005] 5. Maldonado RN, Savio RO, Feijó VBER, Aroni P, Rossaneis MA, Haddad MdCFL. Hospital indicators after implementation of bed regulation strategies: an integrative review. Revista Brasileira de Enfermagem. 2021;74:e20200022. doi: 10.1590/0034-7167-2020-0022 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref006] 6.Cordeiro MF. SISREG: uma ferramenta de desafios e avanços para a garantia do direito a saúde. 2015;.

[pone.0315379.ref007] 7. Junior JRM, de Souza Junior AA, da Luz AEdJ. The impact of COVID-19 on the municipal Regulation System (SISREG) of Rio de Janeiro (RJ). Research, Society and Development. 2024;13(4):e5613445564–e5613445564. doi: 10.33448/rsd-v13i4.45564 [DOI] [Google Scholar]

[pone.0315379.ref008] 8. Valentim RAdM, Lima TS, Cortez LR, Barros DMdS, Silva RDd, Paiva JCd, et al. The relevance a technology ecosystem in the Brazilian National Health Service’s Covid-19 response: the case of Rio Grande do Norte, Brazil. Ciência & Saúde Coletiva. 2021;26:2035–2052. [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref009] 9.Medina MVB. Análise da utilização da Escala Quick Sequential Organ Failure Assessment para tomada de decisão na regulação de leitos de UTI. Universidade Federal do Rio Grande do Norte; 2023.

[pone.0315379.ref010] 10.Konder M, O’Dwyer G. Regulation of access to hospital beds in emergency care and the development of integrated health services. 2019;.

[pone.0315379.ref011] 11. Kim M, Lee JY, Park JS, Kim HA, Hyun M, Suh YS, et al. Lessons from a COVID-19 hospital, Republic of Korea. Bulletin of the World Health Organization. 2020;98(12):842. doi: 10.2471/BLT.20.261016 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref012] 12.Lu Y, Guan Y, Zhong X, Fishe JN, Hogan T. Hospital beds planning and admission control policies for COVID-19 pandemic: A hybrid computer simulation approach. In: 2021 IEEE 17th International Conference on Automation SCience and Engineering (CASE). IEEE; 2021. p. 956–961.

[pone.0315379.ref013] 13. Pecoraro F, Luzi D, Clemente F. The efficiency in the ordinary hospital bed management: A comparative analysis in four European countries before the COVID-19 outbreak. Plos one. 2021;16(3):e0248867. doi: 10.1371/journal.pone.0248867 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref014] 14. Shi F, Li H, Liu R, Liu Y, Liu X, Wen H, et al. Emergency preparedness and management of mobile cabin hospitals in China during the COVID-19 pandemic. Frontiers in Public Health. 2022;9:763723. doi: 10.3389/fpubh.2021.763723 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref015] 15. Alavinejad M, Mellado B, Asgary A, Mbada M, Mathaha T, Lieberman B, et al. Management of hospital beds and ventilators in the Gauteng province, South Africa, during the COVID-19 pandemic. PLOS global public health. 2022;2(11):e0001113. doi: 10.1371/journal.pgph.0001113 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref016] 16. Kuzior A, Kashcha M, Kuzmenko O, Lyeonov S, Brożek P. Public health system economic efficiency and COVID-19 resilience: Frontier DEA analysis. International Journal of Environmental Research and Public Health. 2022;19(22):14727. doi: 10.3390/ijerph192214727 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref017] 17. Taylor CA, Draney MT, Ku JP, Parker D, Steele BN, Wang K, et al. Predictive medicine: computational techniques in therapeutic decision-making. Computer Aided Surgery: Official Journal of the International Society for Computer Aided Surgery (ISCAS). 1999;4(5):231–247. doi: [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref018] 18. Shahid N, Rappon T, Berta W. Applications of artificial neural networks in health care organizational decision-making: A scoping review. PloS one. 2019;14(2):e0212356. doi: 10.1371/journal.pone.0212356 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref019] 19. Tian S, Yang W, Le Grange JM, Wang P, Huang W, Ye Z. Smart healthcare: making medical care more intelligent. Global Health Journal. 2019;3(3):62–65. doi: 10.1016/j.glohj.2019.07.001 [DOI] [Google Scholar]

[pone.0315379.ref020] 20. Panagiotou OA, Högg LH, Hricak H, Khleif SN, Levy MA, Magnus D, et al. Clinical application of computational methods in precision oncology: a review. JAMA oncology. 2020;6(8):1282–1286. doi: 10.1001/jamaoncol.2020.1247 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref021] 21.Bian J, Modave F. The rapid growth of intelligent systems in health and health care; 2020. [DOI] [PubMed]

[pone.0315379.ref022] 22. Gupta PK, Ramachandran AT, Keerthi AM, Dave PS, Giridhar S, Kallapur SS, et al. An overview of clinical decision support system (CDSS) as a computational tool and its applications in public health. Applications in ubiquitous computing. 2021; p. 81–117. doi: 10.1007/978-3-030-35280-6_5 [DOI] [Google Scholar]

[pone.0315379.ref023] 23. Moulaei K, Shanbehzadeh M, Mohammadi-Taghiabad Z, Kazemi-Arpanahi H. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC medical informatics and decision making. 2022;22(1):2. doi: 10.1186/s12911-021-01742-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref024] 24. Albuquerque G, Fernandes F, Barbalho IM, Barros DM, Morais PS, Morais AH, et al. Computational methods applied to syphilis: where are we, and where are we going? Frontiers in Public Health. 2023;11:1201725. doi: 10.3389/fpubh.2023.1201725 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref025] 25. Carvalho DRd, Araújo BGd, Lacerda JMT, Dantas MdCR, Hékis HR, Valentim RAdM. An architecture for online transient detection in electrocardiogram signals on the MP-HA protocol. Revista Brasileira de Engenharia Biomédica. 2012;28:346–354. [Google Scholar]

[pone.0315379.ref026] 26. Levantesi S, Pizzorusso V. Application of machine learning to mortality modeling and forecasting. Risks. 2019;7(1):26. doi: 10.3390/risks7010026 [DOI] [Google Scholar]

[pone.0315379.ref027] 27. Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE reviews in Biomedical Engineering. 2020;14:116–126. doi: 10.1109/RBME.2020.3007816 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref028] 28. Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC medical research methodology. 2021;21:1–14. doi: 10.1186/s12874-021-01284-z [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref029] 29. Dixit RR. Risk Assessment for Hospital Readmissions: Insights from Machine Learning Algorithms. Sage Science Review of Applied Machine Learning. 2021;4(2):1–15. [Google Scholar]

[pone.0315379.ref030] 30. Iwase S, Nakada Ta, Shimada T, Oami T, Shimazui T, Takahashi N, et al. Prediction algorithm for ICU mortality and length of stay using machine learning. Scientific reports. 2022;12(1):12912. doi: 10.1038/s41598-022-17091-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref031] 31. Baak M, Koopman R, Snoek H, Klous S. A new correlation coefficient between categorical, ordinal and interval variables with Pearson characteristics. Computational Statistics & Data Analysis. 2020;152:107043. doi: 10.1016/j.csda.2020.107043 [DOI] [Google Scholar]

[pone.0315379.ref032] 32. Iwendi C, Bashir AK, Peshkar A, Sujatha R, Chatterjee JM, Pasupuleti S, et al. COVID-19 patient health prediction using boosted random forest algorithm. Frontiers in public health. 2020;8:357. doi: 10.3389/fpubh.2020.00357 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref033] 33. Aljameel SS, Khan IU, Aslam N, Aljabri M, Alsulmi ES. Machine Learning-Based Model to Predict the Disease Severity and Outcome in COVID-19 Patients. Scientific programming. 2021;2021(1):5587188. [Google Scholar]

[pone.0315379.ref034] 34. Endo PT, Santos GL, de Lima Xavier ME, Nascimento Campos GR, de Lima LC, Silva I, et al. Illusion of truth: analysing and classifying COVID-19 fake news in brazilian portuguese language. Big Data and Cognitive Computing. 2022;6(2):36. doi: 10.3390/bdcc6020036 [DOI] [Google Scholar]

[pone.0315379.ref035] 35. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research. 2002;16:321–357. doi: 10.1613/jair.953 [DOI] [Google Scholar]

[pone.0315379.ref036] 36.Maiga J, Hungilo GG, et al. Comparison of machine learning models in prediction of cardiovascular disease using health record data. In: 2019 international conference on informatics, multimedia, cyber and information system (ICIMCIS). IEEE; 2019. p. 45–48.

[pone.0315379.ref037] 37.Hamida S, El Gannour O, Cherradi B, Ouajji H, Raihani A. Optimization of machine learning algorithms hyper-parameters for improving the prediction of patients infected with COVID-19. In: 2020 ieee 2nd international conference on electronics, control, optimization and computer science (icecocs). IEEE; 2020. p. 1–6.

[pone.0315379.ref038] 38. Papaiz F, Dourado MET Jr, de Medeiros Valentim RA, Pinto R, de Morais AHF, Arrais JP. Ensemble-imbalance-based classification for amyotrophic lateral sclerosis prognostic prediction: identifying short-survival patients at diagnosis. BMC Medical Informatics and Decision Making. 2024;24(1):80. doi: 10.1186/s12911-024-02484-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref039] 39. Divya KS, Bhargavi P, Jyothi S. Machine learning algorithms in big data analytics. Int J Comput Sci Eng. 2018;6(1):63–70. [Google Scholar]

[pone.0315379.ref040] 40. Liu J, Wang L, Zhang L, Zhang Z, Zhang S. Predictive analytics for blood glucose concentration: an empirical study using the tree-based ensemble approach. Library Hi Tech. 2020;38(4):835–858. doi: 10.1108/LHT-08-2019-0171 [DOI] [Google Scholar]

[pone.0315379.ref041] 41.Rahul K, Banyal RK, Goswami P, Kumar V. Machine learning algorithms for big data analytics. In: Computational Methods and Data Engineering: Proceedings of ICMDE 2020, Volume 1. Springer; 2021. p. 359–367.

[pone.0315379.ref042] 42. Charbuty B, Abdulazeez A. Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends. 2021;2(01):20–28. doi: 10.38094/jastt20165 [DOI] [Google Scholar]

[pone.0315379.ref043] 43. Rigatti SJ. Random forest. Journal of Insurance Medicine. 2017;47(1):31–39. doi: 10.17849/insm-47-01-31-39.1 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref044] 44. Fafalios S, Charonyktakis P, Tsamardinos I. Gradient boosting trees. Gnosis Data Analysis PC. 2020;1. [Google Scholar]

[pone.0315379.ref045] 45. Schapire RE. Empirical inference. Berlin, Heidelberg. 2013; p. 37–52. [Google Scholar]

[pone.0315379.ref046] 46.Sheng C, Yu H. An optimized prediction algorithm based on XGBoost. In: 2022 International Conference on Networking and Network Applications (NaNA). IEEE; 2022. p. 1–6.

[pone.0315379.ref047] 47. Popescu MC, Balas VE, Perescu-Popescu L, Mastorakis N. Multilayer perceptron and neural networks. WSEAS Transactions on Circuits and Systems. 2009;8(7):579–588. [Google Scholar]

[pone.0315379.ref048] 48.Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014;.

[pone.0315379.ref049] 49. Pedregosa F. Scikit-learn: Machine learning in python Fabian. Journal of machine learning research. 2011;12:2825. [Google Scholar]

[pone.0315379.ref050] 50. Ensor KB, Glynn PW. Stochastic optimization via grid search. Lectures in Applied Mathematics-American Mathematical Society. 1997;33:89–100. [Google Scholar]

[pone.0315379.ref051] 51. Bergstra J, Bengio Y. Random search for hyper-parameter optimization. Journal of machine learning research. 2012;13(2). [Google Scholar]

[pone.0315379.ref052] 52. Ahsan MM, E Alam T, Trafalis T, Huebner P. Deep MLP-CNN model using mixed-data to distinguish between COVID-19 and Non-COVID-19 patients. Symmetry. 2020;12(9):1526. doi: 10.3390/sym12091526 [DOI] [Google Scholar]

[pone.0315379.ref053] 53. Bradley AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern recognition. 1997;30(7):1145–1159. doi: 10.1016/S0031-3203(96)00142-2 [DOI] [Google Scholar]

[pone.0315379.ref054] 54. De Hond AA, Steyerberg EW, Van Calster B. Interpreting area under the receiver operating characteristic curve. The Lancet Digital Health. 2022;4(12):e853–e855. doi: 10.1016/S2589-7500(22)00188-1 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref055] 55. Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean journal of anesthesiology. 2022;75(1):25–36. doi: 10.4097/kja.21209 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref056] 56. Mohammad-Rahimi H, Nadimi M, Ghalyanchi-Langeroudi A, Taheri M, Ghafouri-Fard S. Application of machine learning in diagnosis of COVID-19 through X-ray and CT images: a scoping review. Frontiers in cardiovascular medicine. 2021;8:638011. doi: 10.3389/fcvm.2021.638011 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref057] 57. Yu KH, Beam AL, Kohane IS. Artificial intelligence in healthcare. Nature biomedical engineering. 2018;2(10):719–731. doi: 10.1038/s41551-018-0305-z [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref058] 58. Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future healthcare journal. 2019;6(2):94–98. doi: 10.7861/futurehosp.6-2-94 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref059] 59. Owoyemi A, Owoyemi J, Osiyemi A, Boyd A. Artificial Intelligence for Healthcare in Africa. Frontiers in digital health 2: 6; 2020. doi: 10.3389/fdgth.2020.00006 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref060] 60. Yang CC. Explainable artificial intelligence for predictive modeling in healthcare. Journal of healthcare informatics research. 2022;6(2):228–239. doi: 10.1007/s41666-022-00114-1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref061] 61. Cuocolo R, Perillo T, De Rosa E, Ugga L, Petretta M. Current applications of big data and machine learning in cardiology. Journal of geriatric cardiology: JGC. 2019;16(8):601. doi: 10.11909/j.issn.1671-5411.2019.08.002 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref062] 62. Barros DM, Moura JC, Freire CR, Taleb AC, Valentim RA, Morais PS. Machine learning applied to retinal image processing for glaucoma detection: review and perspective. Biomedical engineering online. 2020;19:1–21. doi: 10.1186/s12938-020-00767-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref063] 63. Srivastava O, Tennant M, Grewal P, Rubin U, Seamone M. Artificial intelligence and machine learning in ophthalmology: A review. Indian Journal of Ophthalmology. 2023;71(1):11–17. doi: 10.4103/ijo.IJO_1569_22 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref064] 64. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref065] 65. Firmino M, Angelo G, Morais H, Dantas MR, Valentim R. Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. Biomedical engineering online. 2016;15:1–17. doi: 10.1186/s12938-015-0120-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref066] 66. Galvao-Lima L, Morais H, Valentim R, Barreto E. miRNAs as biomarkers for early cancer detection and their application in the development of new diagnostic tools. Biomedical engineering online. 2021;20:1–21. doi: 10.1186/s12938-021-00857-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref067] 67. Zhao B, Waterman R, Urman R, Gabriel RA. A machine learning approach to predicting case duration for robot-assisted surgery. Journal of Medical Systems. 2019;43:1–32. doi: 10.1007/s10916-018-1151-y [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref068] 68. Panesar S, Cagle Y, Chander D, Morey J, Fernandez-Miranda J, Kliot M. Artificial intelligence and the future of surgical robotics. Annals of surgery. 2019;270(2):223–226. doi: 10.1097/SLA.0000000000003262 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref069] 69. Zhou XY, Guo Y, Shen M, Yang GZ. Application of artificial intelligence in surgery. Frontiers of medicine. 2020;14:417–430. doi: 10.1007/s11684-020-0770-0 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref070] 70. Moglia A, Georgiou K, Georgiou E, Satava RM, Cuschieri A. A systematic review on artificial intelligence in robot-assisted surgery. International Journal of Surgery. 2021;95:106151. doi: 10.1016/j.ijsu.2021.106151 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref071] 71. Fernandes YYMP, Araújo GTd, Araújo BGd, Dantas MdCR, Carvalho DRd, Valentim RAdM. ILITIA: telehealth architecture for high-risk gestation classification. Research on Biomedical Engineering. 2017;33(3):237–246. doi: 10.1590/2446-4740.09416 [DOI] [Google Scholar]

[pone.0315379.ref072] 72. Reddy S. Explainability and artificial intelligence in medicine. The Lancet Digital Health. 2022;4(4):e214–e215. doi: 10.1016/S2589-7500(22)00029-2 [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref073] 73. Schwalbe N, Wahl B. Artificial intelligence and the future of global health. The Lancet. 2020;395(10236):1579–1586. doi: 10.1016/S0140-6736(20)30226-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref074] 74. Ammar W, Kdouh O, Hammoud R, Hamadeh R, Harb H, Ammar Z, et al. Health system resilience: Lebanon and the Syrian refugee crisis. Journal of global health. 2016;6(2). doi: 10.7189/jogh.06.020704 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref075] 75. Massuda A, Hone T, Leles FAG, De Castro MC, Atun R. The Brazilian health system at crossroads: progress, crisis and resilience. BMJ global health. 2018;3(4):e000829. doi: 10.1136/bmjgh-2018-000829 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref076] 76. Muhammad L, Algehyne EA, Usman SS, Ahmad A, Chakraborty C, Mohammed IA. Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset. SN computer science. 2021;2(1):1–13. doi: 10.1007/s42979-020-00394-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0315379.ref077] 77. Silva Junior CL, Guabiraba KPdL, Gomes GG, Andrade CLTd, Melo EA. Outpatient regulation in Primary Care in the municipality of Rio de Janeiro, Brazil, based on the local regulatory doctors. Ciência & Saúde Coletiva. 2022;27:2481–2493. [DOI] [PubMed] [Google Scholar]

[pone.0315379.ref078] 78. Ali A, Khan Z, Khan DM, Aldahmani S. An Optimal Random Projection k Nearest Neighbours Ensemble via Extended Neighbourhood Rule for Binary Classification. IEEE Access. 2024;. [Google Scholar]

[pone.0315379.ref079] 79. Ali A, Hamraz M, Gul N, Khan DM, Aldahmani S, Khan Z. A k nearest neighbour ensemble via extended neighbourhood rule and feature subsets. Pattern Recognition. 2023;142:109641. doi: 10.1016/j.patcog.2023.109641 [DOI] [Google Scholar]

[pone.0315379.ref080] 80. Vijayarani S, Dhayanand S, Phil M. Kidney disease prediction using SVM and ANN algorithms. International Journal of Computing and Business Research (IJCBR). 2015;6(2):1–12. [Google Scholar]

PERMALINK

Artificial intelligence applied to bed regulation in Rio Grande do Norte: Data analysis and application of machine learning on the “RegulaRN Leitos Gerais” platform

Tiago de Oliveira Barreto

Fernando Lucas de Oliveira Farias

Nicolas Vinícius Rodrigues Veras

Pablo Holanda Cardoso

Gleyson José Pinheiro Caldeira Silva

Chander de Oliveira Pinheiro

Maria Valéria Bezerra Medina

Felipe Ricardo dos Santos Fernandes

Ingridy Marina Pierre Barbalho

Lyane Ramalho Cortez

João Paulo Queiroz dos Santos

Antonio Higor Freire de Morais

Gustavo Fontoura de Souza

Guilherme Medeiros Machado

Márcia Jacyntha Nunes Rodrigues Lucena

Ricardo Alexsandro de Medeiros Valentim

Roles

Abstract

Introduction

Materials and methods

Extraction, evaluation, characterization and pre-processing

Table 1. Description of database.

Fig 1. Workflow defined for data processing and selection.

Correlation between dataset features

Definition of evaluation metrics

Data balancing and splitting into training and validation data

Definition of models for data classification

Hyperparameters to define the best model

Table 2. Selection of hyperparameters and values for each model.

Results

General data analysis

Table 3. Data profile from the RegulaRN Leitos Gerais.

Table 4. Distribution of the most frequent ICDs by municipality and hospital.

Table 5. Distribution of the ICDs with the highest number of deaths.

Fig 2. Presentation of the Phik correlation for RegulaRN Leitos Gerais data.

Machine learning model results

Table 6. Metrics obtained by the computer models.

Fig 3. Comparison of the performance of the models used.

Fig 4. Feature importance of the machine learning models.

Fig 5. Feature importance of MLP models.

Fig 6. ROC curve and AUC value of all models.

Discussion

Conclusion

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Luísa da Matta Machado Fernandes

Roles

Author response to Decision Letter 0

Decision Letter 1

Luísa da Matta Machado Fernandes

Roles

Acceptance letter

Luísa da Matta Machado Fernandes

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases