Skip to main content
PLOS One logoLink to PLOS One
. 2024 Dec 30;19(12):e0315379. doi: 10.1371/journal.pone.0315379

Artificial intelligence applied to bed regulation in Rio Grande do Norte: Data analysis and application of machine learning on the “RegulaRN Leitos Gerais” platform

Tiago de Oliveira Barreto 1,*, Fernando Lucas de Oliveira Farias 1, Nicolas Vinícius Rodrigues Veras 1,2, Pablo Holanda Cardoso 1,2, Gleyson José Pinheiro Caldeira Silva 1, Chander de Oliveira Pinheiro 3, Maria Valéria Bezerra Medina 3, Felipe Ricardo dos Santos Fernandes 1, Ingridy Marina Pierre Barbalho 1, Lyane Ramalho Cortez 1,3, João Paulo Queiroz dos Santos 1,2, Antonio Higor Freire de Morais 1,2, Gustavo Fontoura de Souza 1,2, Guilherme Medeiros Machado 4, Márcia Jacyntha Nunes Rodrigues Lucena 5, Ricardo Alexsandro de Medeiros Valentim 1
Editor: Luísa da Matta Machado Fernandes6
PMCID: PMC11684685  PMID: 39775276

Abstract

Bed regulation within Brazil’s National Health System (SUS) plays a crucial role in managing care for patients in need of hospitalization. In Rio Grande do Norte, Brazil, the RegulaRN Leitos Gerais platform was the information system developed to register requests for bed regulation for COVID-19 cases. However, the platform was expanded to cover a range of diseases that require hospitalization. This study explored different machine learning models in the RegulaRN database, from October 2021 to January 2024, totaling 47,056 regulations. From the data obtained, 12 features were selected from the 24 available. After that, blank and inconclusive data were removed, as well as the outcomes that had values other than discharge and death, rendering a binary classification. Data was also correlated, balanced, and divided into training and test portions for application in machine learning models. The results showed better accuracy (87.77%) and recall (87.77%) for the XGBoost model, and higher precision (87.85%) and F1-Score (87.56%) for the Random Forest and Gradient Boosting models, respectively. As for Specificity (82.94%) and ROC-AUC (82.13%), the Multilayer Perceptron with SGD optimizer obtained the highest scores. The results evidenced which models could adequately assist medical regulators during the decision-making process for bed regulation, enabling even more effective regulation and, consequently, greater availability of beds and a decrease in waiting time for patients.

Introduction

In Brazil, the hospital bed regulation process of the Brazilian Health System (SUS) plays a fundamental role in the management and distribution of care for patients requiring hospitalization [1, 2]. However, although the National Regulation Policy was instituted more than 15 years ago by Ordinance No. 1,559 of August 2008 [3] and consolidated by Ordinance No. 02 of September 2017 (Brazil, 2017) [4], many regions have difficulties in ensuring correct regulatory conduct.

In this context, in addition to organizational issues, the regulation system in Brazil still faces recurring problems such as the precariousness of hospital infrastructure, overcrowding in health units, an insufficient number of beds, difficulties in integration and communication between the entities involved in the regulatory process, greater transparency in processes and allocation of resources, in addition to not having efficient systems to help the regulation process [1, 5]. In Brazil, due to a non-mandatory recommendation from the Ministry of Health (MoH), the Regulation System—SISREG—is still used in many Brazilian states. This system was created in 2001 and is made available by the Brazilian Health System Informatics Department (DATASUS) [6]. Currently, this system is considered obsolete and inadequate, especially due to the lack of interoperability with the SUS technological ecosystem itself and the lack of transparency [7]. This is a legacy health information system which, although it is still used, is no longer able to play an effective role in the National Policy for the Regulation of Assistance in Access to Health Services in Brazil.

Until April 2020, the center for regulating access to health services in the state of Rio Grande do Norte did not have a platform for regulating hospital beds to systematically organize regulatory conduct within the scope of the SUS in the state. The regulatory flow control measures used were based on spreadsheets, e-mails and telephone communication, and messaging systems [8, 9].

Faced with the serious public health crisis caused by the COVID-19 pandemic, the government of the state of Rio Grande do Norte has set up technical-scientific cooperation between researchers in the field of digital health and the managers and formulators of public health policies at the State Secretariat of Public Health of RN (SESAP/RN). The aim of this technical-scientific cooperation was to formulate and implement a digital health solution that would make it possible to control and monitor the entire process of regulating hospital beds in all the state’s public hospitals online, on time and transparently, totaling 24 public hospitals, with more than 900 beds available.

Based on this technical-scientific cooperation, the RegulaRN Platform for COVID-19 was developed and implemented throughout the state of Rio Grande do Norte, whose initial objective was to monitor and control access to hospital beds in wards and intensive care units (ICUs) for the disease during the pandemic [2, 8, 9]. The state of Rio Grande Norte, which is located in the northeastern region of Brazil, currently has an area of 52,797 km² and a population of approximately 3.5 million inhabitants.

After the implementation of the RegulaRN Platform, it became necessary to expand the digital health solution to the other regulatory specialties. The system is currently responsible for regulating access to beds, vascular surgery, outpatient care, exams, and consultations. In this way, the RegulaRN Platform has become a unique digital health solution for the management of health regulation services in the state of Rio Grande do Norte, an important aspect because it has centralized and integrated, through international interoperability standards, the Health Data Network (RDS) with all the other technologies in the state’s public health ecosystem that are necessary for the process of regulating access to health services.

The health regulation process needs to be carried out in a rigorous, agile and transparent manner, as the incorrect conduct of a regulatory process in public health has intrinsic impacts on waiting times for access to hospital beds, as well as on hospitalization times, which can have negative impacts on the availability of hospital beds and increase the potential for existing problems [5, 10]. In this way, the inefficiency and ineffectiveness of this process can aggravate public health crisis situations, such as the COVID-19 pandemic, as it requires more rational use of health resources [8, 1116]. Therefore, due to its complexity and the pressures that exist in all segments of the regulatory process, investment in intelligent computer systems can maximize the correct direction and assertive decision-making in healthcare systems [1722].

Intelligent computer models have demonstrated significant potential in healthcare systems by reducing uncertainties and ambiguities in complex decision-making processes. For example, prior studies in similar healthcare contexts have shown that machine learning models can enhance hospital management by optimizing resource allocation and reducing patient waiting times [2330]. This study aims to build on these findings to demonstrate the effectiveness of AI-based models specifically in bed regulation in Rio Grande do Norte.

In this context, the aim of this work is to analyze data from the RegulaRN Leitos Gerais Platform and use it to train and validate different machine learning models. Subsequently, to choose the most significant classification model capable of predicting the outcome of patients regulated by the RegulaRN Leitos Gerais platform with greater accuracy, precision, recall, specificity, F1 Score, and ROC-AUC. Furthermore, discuss the main impacts and potential of a digital health solution on the decision-making process of regulatory professionals.

Materials and methods

The methodological bias of this paper consists in two main steps: exploratory data analysis and applying the data to computer models. In the evaluation process, the data was extracted, evaluated, characterized, pre-processed, and correlated. For the application stage, concerning the computational models, four phases were taken into account: 1) definition of evaluation metrics; 2) data balancing and division into training and validation groups; 3) selecting the models for classification, and 4) hyperparameters to choose the best performing model; in line with Barreto et al [2].

Extraction, evaluation, characterization and pre-processing

This study used the database from the RegulaRN Leitos Gerais platform, a system adopted to manage the regulation flow of SUS beds in the state of Rio Grande do Norte. The database covers the period of October 2021 to January 2024, with 47.056 regulations in the two-state centers (Metropolitan and West). From this total, 1,868 regulations were removed because they were linked to newborn regulations, and these have different clinical assessment protocols when compared to adult and pediatric patients. The initial analysis therefore included 45,188 regulation requests. A more detailed descriptive analysis of the data is presented in the results section.

The initial data extraction included 24 features: a) date of request; b) occupancy type; c) case type; d) unified prioritization score (EUP); e) Sequential Organ Failure Assessment (SOFA) scale; f) type of hospital bed requested; g) admission date; h) type of input bed; i) discharge date; j) discharge bed type; k) national health card number; l) gender; m) patient’s municipality; n) patient’s federative unit; o) pregnant woman (yes or no); p) gestational period; q) age; r) regulator identification; s) outcome; t) requesting health unit; u) municipality of the requesting health unit; v) providing health unit; w) municipality of the providing health unit and x) ICD.

Thus, the features that were not associated with the patient’s clinical condition and do not show any impact in the final result, or that relate to the locality record, such as: date of request, national health card number, patient’s federative unit, patient’s municipality, regulator identification, requesting health unit, municipality of the requesting health unit, and municipality of the providing health unit. In addition, features with only one possible record or insufficient information were also removed: type of occupation, type of case, pregnant woman (yes or no), and gestational age.

Consequently, only 12 characteristics were selected, namely: EUP score, SOFA scale, type of hospital bed requested, admission date, admission bed type, discharge date, discharge bed type, gender, age, outcome, providing health unit, and ICD. Using the entry date and exit date features, it was possible to create the patient’s hospitalization time feature. As a result, 11 features were used in the classification process. Table 1 shows the description of all the data types extracted from RegulaRN Leitos Gerais.

Table 1. Description of database.

Data description
Field Description
request date Represents the date a bed request was registered.
type of occupation Represents the type of occupation of the regulation request.
type of occupation Represents the type of occupation of the regulation request.
case of type Represents the results of tests for patients who were suspected of having Covid-19. The results could be: positive, negative, inconclusive, or null.
EUP Represents the EUP value. The numerical scale ranges from 2 to 8 and is associated with the Charlson Comorbidity Index, Clinical Frailty Scale and simplified SOFA.
SOFA scale Represents the patient’s prioritization value according to the values of this scale.
requested bed type Represents the type of bed selected by the regulation center for a patient. The results could be: ward and uci.
entry date Represents the date the patient was allocated to the health unit (hospital).
entry bed type Represents the type of bed that the patient was allocated in the health unit (hospital). The results could be: ward and uci.
output date Represents the date that the patient left the bed after the outcome.
output bed type Represents the type of bed the patient was in before the outcome.
sus card number Represents each patient’s SUS card number. Given by a 15-digit sequence.
sex Represents the patient’s sex.
patient’s municipality Indicates the patient’s city of residence.
patient’s federal unit Is the acronym for the patient’s federal unit.
pregnant Represents whether the patient is pregnant or not.
gestational age Describe how far along the pregnancy is, measured in weeks.
age Represents patient age.
regulator Represents the regulator identification responsible for regulation.
outcome Represents the outcome of the patient in bed. Possible values for this field are: discharge and death.
requesting health unit Represents the unit health that solicits a bed for the patient.
municipality of the requesting health unit Represents the municipality of the health unit that solicits the bed.
provider health unit Represents the health unit that admits and accommodates the patient in the bed.
municipality of the provider health unit Represents the municipality of the health unit that receives and accomodate the patient in the bed.
ICD Represents the International Classification of Diseases for bed regulation.

After extracting the data, we evaluated the values contained in all the features and in order to guarantee the integrity of the analysis, the lines with blank data or inconclusive information were removed. In addition, the target column “outcome” contained six different values, namely: by discharge, by death, for other reasons, by stay, by delivery procedure, by transfer, etc. As these last four outcomes do not properly indicate a positive or negative closure of the regulation, as well as having a lower number of recurrences, around 7.151 regulations were removed. This maintains a binary classification (by discharge—positive, or by death—negative) for the computer models. Finally, 38.023 effective regulations were selected for application in the artificial intelligence models. Fig 1 shows the design used to process and select the data. Furthermore, in order to enable the reproducibility of this experiment, the final database used is available on the zenodo platform (https://zenodo.org/records/11387710).

Fig 1. Workflow defined for data processing and selection.

Fig 1

Correlation between dataset features

The first task was to perform a pairwise correlation of the features. The objective is to identify features with greater or lesser correspondence with others. As many of these are categorical data, the phik correlation model was implemented in this analysis. Phik is abble to consistently correlate variables from several backgrounds, being categorical, ordinals and intervals a like, turning into a refinement of Pearson [31] hypothesis test.

Definition of evaluation metrics

The overall aim of the study is to classify hospital bed regulation data to predict a patient’s positive or negative outcome. Furthermore, it is important to investigate the models’ performance in situations where predictions are wrong, either due to a high number of false positives or false negatives. Thus, it is necessary to include not only accuracy, but also precision, recall, specificity, F1-Score, and ROC-AUC in a similar way to those found in the works of Iwendi et al [32], Aljameel et al [33] and Endo et al [34].

The accuracy consists in the set of data with correct predictions (true positive and true negative) divided by the sum of all predictions made by the model (true positive, true negative, false positive, false negative) (Eq 1):

Accuracy=(TP+TN)/(TP+FP+FN+TN) (1)

Precision consists of dividing the true positive rate by the sum of the true positive and false positive rates (Eq 2).

Precision=TP/(TP+FP) (2)

Recall involves the rate of true positives divided by the rate of true positives plus false negatives (Eq 3).

Recall=TP/(TP+FN) (3)

Specificity refers to the prediction of true negatives divided by the sum of true negatives and false positives (Eq 4).

Specificity=TN/(TN+FP) (4)

The F1-score is the harmonic mean between the precision and recall. The formula involves the product of precision and recall divided by the sum of these metrics, multiplied by 2 (Eq 5).

F1Score=2*(Precision*Recall)/(Precision+Recall) (5)

ROC-AUC can be obtained by recall divided by the complementary value of specificity (Eq 6).

ROCAUC=Recall/(1-Specificity) (6)

Data balancing and splitting into training and validation data

The RegulaRN Leitos Gerais Platform database refers to real-world bed regulation data, in this sense, there is an unbalanced distribution of data when classified by outcome, 82.6% are discharges and 17.4% deaths. The use of an unbalanced database biases the machine learning classifiers, making the algorithms able to identify patterns from the predominant class much better than patterns from the minority class. To mitigate this problem, one of the most common techniques is SMOTE (Synthetic Minority Over-sampling), which works by increasing the number of data points in the minority class [35]. The SMOTE algorithm first identifies the minority class, then in the feature vector space identifies the k nearest neighbors of that class (k is usually equal to 5). Finally, a new instance of the minority class is generated by randomly selecting values in the vector space between an instance of the minority class and the nearest neighbors identified. This process is repeated until the database is completely balanced.

In addition, as for the division of training and validation data, the same segmentation was used as in other studies applying machine learning techniques that use a large volume of data [3638]. Therefore, 80% of the data was directed to training and the others 20% for validation.

Definition of models for data classification

The selection of classification models was based on their proven ability to handle large volumes of imbalanced healthcare data [37, 3941]. Decision tree was selected because it is one of the classic models that handles high volumes of data well and has wide application in problems in the health area [42]. Random Forest, on the other hand, was selected for its ability to manage complex decision trees and its resistance to overfitting, particularly in high-dimensional datasets [43]. Gradient Boosting and Adaboost due to their adaptability and efficient ability to capture non-linear relationships [44, 45]. XGBoost, for example, has been shown to perform well in healthcare settings due to its gradient boosting framework, which effectively handles missing data and provides robust performance on tabular datasets [46] and Multi-Layer Perceptron (MLP) has an architecture capable of modeling non-linear relationships and performing gradient learning, adjusting weights efficiently for larger volumes of data [47]. For the MLP models, two different paths were taken, the Stochastic Gradient Descent (SGD) was selected due to its performance and Adam because of its consistency in treating gradient explosion and fading problems [48]. These models were chosen for their complementarity in addressing the specific challenges of bed regulation data in this study.

Hyperparameters to define the best model

After the model selection, it was necessary to define the best combination of hyperparameters to enhance the evaluation metrics of each model. Thus, this section presents which hyperparameters were adopted and which methods were elaborated in the training and validation steps. It is worth mentioning that all computational model development in this research used python’s sckit-learn library [49].

For each selected model, hyperparameters were set aiming to boost the performance metrics. In this regard, the following hyperparameters were selected for Decision Tree: criterion, which measures the quality of node splitting; max depth of tree, which determines the maximum depth of the tree; min samples leaf, which represents the minimum number of samples needed in a leaf; and max features, which considers the maximum number of features analyzed to perform a split. For the Random Forest and Gradient Boosting models, the criterion, max depth of the tree and max features were also used, including the number estimators, which considers the number of trees in the forest. In the Adaboost model, the parameters number estimators, learning rate and algorithm were chosen. The learning rate refers to the learning weight at each iteration, while the algorithm relates to how the model can speed up the convergence of the classifier with the least possible error. For XGBoost: learning rate, number estimators, max depth and colsample by tree. This last hyperparameter is associated with the randomly selected fraction of resources that will be used to train each tree. Finally, for the MLP Adam and MLP SGD models, the following were used: hidden layer size, which represents the number of layers in the model; activation, which represents the model’s activation function and batch size, which represents the size of the minibatches that will be used to help the optimizers.

The grid GridSearchCV functionality, which allows all possible combinations of hyperparameters to be iterated, was applied during the training to find which parameters showed better results in the evaluated metrics [50, 51]. A proportional division of the training and test data was also carried out randomly using the cross validation attribute with a value of 10-folds in the GridSearchCV functionality, as a way of enhancing the model’s learning. In addition, the models were trained five times, similar to that developed by Ahsan et al [52], in order to determine the best set of hyperparameters more accurately. The details of the hyperparameters used and the respective values chosen for each model are shown in Table 2.

Table 2. Selection of hyperparameters and values for each model.

Models Hyperparameter Range and best values
Decision Tree criterion gini or entropy;
max depth of the tree [10, 50, 100];
min samples leaf range [1, 2, 3, 4];
max features [sqrt, log2].
Random Forest criterion gini or entropy;
max depth of the tree [10, 50, 100];
number estimators [100, 200, 400];
max features [sqrt, log2].
Gradient Boosting criterion friedman_mse or squared_error;
max depth [10, 50, 100];
number estimators [10, 50, 100];
max features [sqrt, log2].
Adaboost learning rate [0.1, 0.5, 1.0]
number estimators [100, 200, 400]
algorithm [samme, samme.r]
XGBoost learning rate [0.1, 0.5, 1.0]
number estimators [100, 200, 400]
max depth [10, 50, 100]
colsample by tree [0.1, 0.5, 1.0]
MLP SGD hidden_layer_sizes [5, 25, 70]
activation tanh or relu
batch_size [16, 32, 64]
MLP ADAM hidden_layer_sizes [5, 25, 70]
activation tanh or relu
batch_size [16, 32, 64]

Results

General data analysis

Considering the data profile from RegulaRN Leitos Gerais, between October 2021 and January 2024, it was possible to identify that most hospitalizations involve male adults, young people and children, in hospital beds and with lower EUP score and SOFA scale. The details of the values extracted from the database are presented in Table 3, classifying each of the characteristics based on their respective outcome.

Table 3. Data profile from the RegulaRN Leitos Gerais.

Features Values Outcomes
Age ≥ 60 18.641 Discharge: 13.549
Death: 5.092
< 60 19.382 Discharge: 17.860
Death: 1.522
Sex Masculine 20.235 Discharge: 16.941
Death: 3.384
Feminine 17.698 Discharge: 14.468
Death: 3.230
EUP Score 2 17.572 Discharge:16.668
Death: 904
3 7.279 Discharge: 6.176
Death: 1.103
4 5.468 Discharge:4.113
Death: 1.355
5 5.485 Discharge: 3.543
Death:1.942
6 1.765 Discharge: 770
Death: 995
7 367 Discharge: 120
Death: 247
8 87 Discharge:19
Death: 68
SOFA 1 29.699 Discharge: 26.070
Death: 3.629
2 6.816 Discharge: 4.652
Death: 2.164
3 1.179 Discharge: 598
Death: 581
4 329 Discharge: 89
Death: 240
requested bed type Ward 23.863 Discharge: 21.878
Death: 1.985
ICU 14.160 Discharge: 9.531
Death: 4.629
entry bed type Ward 23.858 Discharge: 21.851
Death: 2.007
ICU 14.165 Discharge: 9.558
Death:4.607
output bed type Ward 28.137 Discharge: 26.233
Death: 1.904
ICU 9.886 Discharge: 5.176
Death: 4.710
Provider health unit Health unit 1 1.234 Discharge: 1.061
Death: 173
Health unit 2 794 Discharge: 554
Death: 240
Health unit 3 299 Discharge: 213
Death:86
Health unit 4 225 Discharge: 154
Death: 71
Health unit 5 367 Discharge: 227
Death:140
Health unit 6 2.104 Discharge: 1.722
Death: 382
Health unit 7 2.865 Discharge: 2.341
Death: 524
Health unit 8 2.331 Discharge: 1.971
Death: 360
Health unit 9 112 Discharge: 104
Death: 8
Health unit 10 8 Discharge: 7
Death: 1
Health unit 11 2.138 Discharge: 2.094
Death: 44
Health unit 12 703 Discharge: 625
Death: 78
Health unit 13 101 Discharge: 56
Death: 45
Health unit 14 11 Discharge: 11
Death: 0
Health unit 15 422 Discharge: 225
Death:197
Health unit 16 94 Discharge: 86
Death: 8
Health unit 17 5 Discharge: 3
Death: 2
Health unit 18 385 Discharge:233
Death:152
Health unit 19 703 Discharge:703
Death:0
Health unit 20 253 Discharge:211
Death:42
Health unit 21 2.825 Discharge:2.646
Death:179
Health unit 22 1.000 Discharge:778
Death:222
Health unit 23 586 Discharge:373
Death:213
Health unit 24 739 Discharge:608
Death:131
Health unit 25 737 Discharge:631
Death:106
Health unit 26 34 Discharge:23
Death:11
Health unit 27 396 Discharge:173
Death:223
Health unit 28 5.192 Discharge:4.320
Death:872
Health unit 29 740 Discharge:662
Death:78
Health unit 30 1.000 Discharge:850
Death:150
Health unit 31 598 Discharge:438
Death:160
Health unit 32 2.183 Discharge:1.590
Death:593
Health unit 33 395 Discharge:240
Death:155
Health unit 34 443 Discharge:233
Death:210
Health unit 35 193 Discharge:193
Death:0
Health unit 36 4.181 Discharge:3.774
Death:407
Health unit 37 422 Discharge:422
Death:0
Health unit 38 12 Discharge:12
Death:0
Health unit 39 522 Discharge:467
Death:55
Health unit 40 161 Discharge:119
Death:42
Health unit 41 510 Discharge:256
Death:254
Length of stay < 7 16.693 Discharge: 13.658
Death:3.035
7≤ LoS ≤ 14 11.499 Discharge: 9.824
Death: 1.675
>14 9.831 Discharge: 7.927
Death: 1.904
Outcomes Discharge 31.409
Death 6.614

In addition, the database contains outcomes by each provider hospital (41), to address which health units had the highest number of requests and their respective outcomes, given that his feature showed a high correlation with several other dataset variables. Each hospital has a different treatment specialty, and thus some receive requests of greater complexity and mortality than others, culminating in different proportions of discharges and deaths. Finally, the data includes around 2055 different diseases classified by the International Classification of Diseases (ICD-10), which were also examined for recurrence.

As for the statistical profile, the average age is 53.38 years, with a standard deviation of 26.82 years and a median of 59 years. The average hospitalization time was 12.96 days, with a standard deviation of 17.67 days and a median of 7 days. The mean EUP score was 3.15 and the median was 3. The mean SOFA scale was 1.2 and the median 1.

Regarding the ICD, Table 4 shows the ten most recurrent ICDs, followed by the municipalities with the highest incidence and the hospitals that treat the most. The state capital, the city of Natal, has the highest number of inhabitants and has the highest incidence of ICDs 6 and 10. In contrast, Mossoró, the second largest municipality in terms of inhabitants, has a higher incidence in four of the ten. The noteworthy point is that there is no significant number of requests for these diseases among Parnamirim, São Gonçalo do Amarante, and Macaíba municipalities, which are the 3rd, 4th and 5th most populous municipalities.

Table 4. Distribution of the most frequent ICDs by municipality and hospital.

Code Name Frequency Municipality with the highest incidence Hospitals that treat the most
J18.9 Unspecified Pneumonia 4284 Natal: 1437 Provider Hospital 28: 504
Mossoró: 763 Provider Hospital 6: 344
Santo Antônio: 278 Provider Hospital 11: 333
I21.9 Unspecified Acute Myocardial Infarction 1738 Mossoró 781 Provider Hospital 36: 910
Natal: 150 Provider Hospital 6: 158
Currais Novos:97 Provider Hospital 7: 119
I64 Stroke, Not Specified as Hemorrhagic or Ischemic 1540 Mossoró: 1010 Provider Hospital 28: 969
Caicó: 150 Provider Hospital 32: 164
Natal: 148 Provider Hospital 6: 73
N39.0 Urinary Tract Infection of Unspecified Localization 965 Natal: 408 Provider Hospital 21: 125
Mossoró: 83 Provider Hospital 41: 102
Parnamirim: 79 Provider Hospital 30: 67
I50.0 Congestive Heart Failure 905 Natal: 243 Provider Hospital 6: 134
Mossoró: 196 Provider Hospital 28: 73
Currais Novos: 97 Provider Hospital 21: 68
I20.0 Unstable Angina 677 Mossoró: 531 Provider Hospital 36: 529
Natal: 53 Provider Hospital 6: 30
Currais Novos: 8 Provider Hospital 21: 13
F20.8 Other Schizophrenias 587 Natal: 292 Provider Hospital 21: 293
Parnamirim: 50 Provider Hospital 7: 272
Ceará-Mirim: 34 Provider Hospital 6: 9
A46 Erysipelas 546 Natal: 180 Provider Hospital 21: 85
Mossoró: 70 Provider Hospital 32: 64
Caicó: 62 Provider Hospital 39: 33
A41.9 Unspecified Septicemia 542 Natal: 151 Provider Hospital 32: 81
Mossoró: 115 Provider Hospital 28: 72
Caicó: 67 Provider Hospital 6: 38
I25.2 Old Myocardial Infarction 510 Mossoró: 373 Provider Hospital 36: 374
Natal: 31 Provider Hospital 7: 19
João Câmara: 19 Provider Hospital 25: 17

SOFA and EUP are two tools used to evaluate the hospital’s bed priority for each patient, considering that EUP revolves around SOFA, The Charlson Comorbidity Index (CCI) and the Clinical Frailty Scale (CFS). According to the data, it is possible to identify that EUP, has a more normalized aggregation to the outcomes classification. Meanwhile, SOFA = 1, was responsible for characterizing 78% of the data, a similar percentage is presented in the sum of requests with EUP 2 (46.2%), 3 (19.1%), and 4 (14.3%). In other words, while SOFA indicates that 78% of referrals had the same degree of priority, EUP structures the same percentage into three different categories. Given the health sector’s peculiarities, the EUP is an indicator that minimizes the generalization of different clinical conditions.

Another important point to evaluate is the ICD that most frequently resulted in death. Naturally, each ICD has its own intrinsic lethality level, meaning that some diseases kill more than others. However, it is necessary to analyze the frequency of certain occurrences and whether the incidence is local and already expected by public health institutions. Hence, with the data in hand, public health authorities can evaluate and orchestrate future intervention proposals. As shown in Table 5, Unspecified Pneumonia (J18.9) was the disease with the highest frequency (see Table 4) and resulted in the most deaths. Around 24.5% of the patients classified with this disease died, resulting in 15.9% of the total number of deaths. However, Unspecified Septicemia is one of the most lethal diseases and is responsible for the death of 50.5% of patients diagnosed with this disease.

Table 5. Distribution of the ICDs with the highest number of deaths.

Code Name Frequency Deaths/Total incidence
J18.9 Unspecified Pneumonia 1052 24.5%
A41.9 Unspecified Septicemia 279 50.5%
I50.0 Congestive Heart Failure 247 27.3%
J18.0 Unspecified Bronchopneumonia 110 24.8%
I21.9 Unspecified Acute Myocardial Infarction 94 12.6%

Regarding the data correlation, shown in Fig 2, Phik’s correlation revealed that the features that relate most closely to the outcome are the output bed type, requested bed type, entry bed type, SOFA scale, ICD, age, EUP score, and provider unit. Length of stay and gender did not present any relevant correlation for this topic.

Fig 2. Presentation of the Phik correlation for RegulaRN Leitos Gerais data.

Fig 2

Machine learning model results

Table 2 shows the selection of hyperparameters that indicated the best results for the selected models. The Decision Tree model obtained the best criterion results when the entropy node division strategy was selected, max depth of the tree with a value of 50, min samples leaf with a value of 1 and max feature, square root. Random Forest obtained the best results with entropy (criterion), 50 (max depth of tree), 400 (number estimators) and sqrt (max features). For Gradient Boosting, squared error (criterion), 10 (max depth of tree), 50 (number estimators) and sqrt (max features). In the Adabost model, the best results were: 1.0 (learning rate), 400 (number estimators), and samme.r (algorithm). In XGBoost: 0.1 (learning rate), 200 (number estimators), 50 (max depth) and 1.0 (colsample by tree). The MLP models used the same hyperparameters in SGD and ADAM (hidden layer sizes, activation, batch size) which resulted in the same values: 70, relu and 32.

As for the results obtained by the selected metrics, XGBoost scored highest in accuracy (87.77%) and recall (87.77%). On the other hand, the Random Forest model (87.85%) was the most accurate, i.e. being the model that best classifies the positive outcome. As for the F1-Score value, the Gradient Boosting model had the highest value (87.56%). As for specificity, a parameter that assesses the classification performance of the negative outcome, it can be seen that the multilayer perceptron models outperform the others. The highest score was obtained by the SGD (82.94%). Table 6 presents the performance metrics for each machine learning model, including accuracy, precision, recall, F1-Score, and specificity. Notably, XGBoost outperformed the other models in accuracy and recall, making it a robust choice for predicting patient outcomes in bed regulation. However, the high specificity observed in the MLP models indicates that these models may be more suitable when the goal is to minimize false positives, particularly in critical care cases. For a better comparison of the performance of the models used, Fig 3 presents the values of each metric per computational model.

Table 6. Metrics obtained by the computer models.

Models Accuracy Precision Recall F1-Score Specificity
Decision Tree 82.97(+0.13) 84.26(+0.19) 82.96(+0.13) 83.51(+0.14) 64.36(+0.42)
Random Forest 87.20(+0.01) 87.85(+0.03) 87.20(+0.01) 87.47(+0.01) 72.98(+0.17)
Gradient Boosting 87.14(+0.05) 88.21(+0.03) 87.14(+0.05) 87.56(+0.03) 75.47(+0.24)
Adaboost 86.69(+0.05) 87.76(+0.05) 86.69(+0.05) 87.12(+0.05) 74.25(+0.06)
XGBoost 87.77(+0.07) 87.46(0.04) 87.77(+0.07) 87.60(+0.10) 66.96(+0.04)
MLP SGD 83.36(+0.17) 88.10(+0.07) 83.36(+0.17) 84.73(0.13) 82.94(+0.50
MLP ADAM 82.88(+0.76) 87.87(+0.13) 82.88(+0.76) 84.33(+0.61) 82.58(+0.62)

Fig 3. Comparison of the performance of the models used.

Fig 3

Based on the results of the models, we performed a chi-square statistical validation to analyze whether the behavior of the models has statistical significance. For this, a contingency table was created with the distribution of real and predicted values of all models and for all cases a p value < 0.01 was obtained.

As for the features that were important for training the models, the most relevant features for classifying the Decision Tree were bed type, age, provider health unit and icd. The non-relevant elements were requested bed type, entry bed type and SOFA. In the Random Forest model, output bed type, age, EUP, provider health unit and icd scored the highest, while sex and SOFA were the least relevant characteristics. For the Gradient Boosting classifier output bed type, age, EUP and provider health unit were the most relevant, while sex, requested bed type, entry bed type were the lowest scorers. Adaboost considered the best characteristics to be length of stay, provider health unit, EUP and age, while the least relevant were sex, requested bed type and entry bed type. XGBoost considered output bed type, EUP and entry bed type as the most important characteristics and sex, SOFA and requested bed type as the least important. For the models that used MLP, Adam considered output bed type, requested bed type, provider health unit and age to be the most relevant, while SGD considered output bed type, provider health unit, age and ICD to be the most significant. The least important features were entry bed type and sex for Adam; and sex and requested bed type for SGD. Figs 4 and 5 show the important features of the machine learning models.

Fig 4. Feature importance of the machine learning models.

Fig 4

Fig 5. Feature importance of MLP models.

Fig 5

Compared to Phik’s correlation, except for Adabost, all the other classifiers included output bed type as the most important feature in the classification process, which corroborates Phik’s correlation (output bed being the feature with the highest correlation with the outcome) and the weaker correlation, sex was identified as the least relevant feature for Random Forest, Grandient Boosting, Adaboost, XGBoost and SGD, while length of stay, which is another feature that has been shown to have a low correlation with the outcome, was not identified as worse in any of the classifiers, however, for Adaboost this feature was the most significant.

The ROC curve (receiver operating characteristic curve) helps to visualize the performance of classifiers to select an appropriate operating point or decision threshold [53]. The discriminative capacity is usually quantified by the area under the AUC curve when considering the prediction of a binary event. It relates the variation in the rate of true positives and false positives predicted by the models, with results on a scale of 0 to 1. Although there is no definitive consensus in the literature, most studies using this tool consider an AUC between 0.7 and 0.8 to be good and acceptable, and between 0.8 and 0.9 to be very good [54, 55].

Thus, the Decision Tree (AUC = 0.738), XGBoost (AUC = 0.766) and Random Forest (AUC = 0.785) models performed well, while the Adaboost (AUC = 0.804), Adam (AUC = 0.814) and SGD (AUC = 0.821) models performed better, falling into the very good category. Fig 6 shows the results obtained.

Fig 6. ROC curve and AUC value of all models.

Fig 6

Discussion

The use of artificial intelligence and computational methods to solve and predict problems in the health field has been going on for some years now, and although there is a considerable range of solutions in various segments, from predicting diseases by diagnosing medical images [5660] to the classification of important markers for the prediction of cardiological [25, 61], and ophthalmological diseases [62, 63] or the analysis of data to predict early-stage cancer [6466]; as well as robotic mechanisms for surgery, for example [6770]. There are still some sectors that have not been explored or that have made negligible contributions [57, 7173].

According to Yu, Beam and Kohane [57], the association of artificial intelligence will be able to contribute even more effectively to clinical practices and health management. In this way, healthcare professionals will be able to reduce the time spent on repetitive tasks in order to explore better treatments and clinical solutions aimed at patient care, something that machines cannot do and which require more humanized treatment. According to Valentim et al [8], the use of digital health solutions based on artificial intelligence are already considered relevant tools by healthcare managers, as they help to make decision-making more timely, effective and based on robust scientific evidence.

In the process of regulating hospital beds, the use of artificial intelligence helps to reduce medical subjectivity in the face of the repetitive process of countless daily regulations, tasks that can often become a tiring activity throughout the day. This certainly contributes to minimizing errors in the indication of hospital beds, especially when it comes to public health, since the daily volume of care is extremely high, as is the case in the state of Rio Grande Norte in Brazil, which has a population of approximately 3.5 million inhabitants. This could result in better resolutions for patients, as well as better equity in access to the resources available in the public health system. All of this will lead to a more timely hospitalization process for patients, and consequently to better performance in terms of hospital bed turnover—better average occupancy time for hospital beds across the entire public health network [2]. In general, the use of machine learning tools can optimize the care process, increasing efficacy, efficiency and effectiveness, which induces better resilience of the health system, especially in times of crisis, as was the case during COVID-19 [8, 74, 75].

At the management level, adopting AI-driven systems for bed regulation could lead to significant improvements in resource allocation, reducing patient wait times and optimizing bed occupancy rates. However, implementing these systems at scale presents challenges, such as ensuring adequate training for health professionals and integrating AI tools with existing hospital infrastructure. Addressing these challenges will be critical for maximizing the potential benefits of AI in the public healthcare system [5, 8].

In this study, machine learning techniques were used in different tree and ensemble models, as well as artificial neural network models on hospital bed regulation data, and the aim was to classify the outcome of patients regardless of their ICD, to help the regulating doctor and reduce subjectivity during the hospital bed regulation process.

As for the results of the computer models, XGBoost showed the best accuracy (87.77%) and recall (87.77%) values, i.e. of all the models used, it classifies the data better in general, regardless of the outcome (discharge or death), as well as, given the positive outcome, the proportion that was correctly classified. As for the accuracy indicator, which identifies which proportion of positive outcomes was correct, Random Forest performed best (87.05%). As for the F1-Score, Gradient Boosting has a better harmonic mean between precision and recall, i.e. it has a better balance in the metrics that assess the positive outcome. Regarding specificity, a metric that assesses the classification of the negative outcome, the neural network models showed the best results when compared to the tree and ensemble models, achieving scores of 82.58% (ADAM) and 82.94% (SGD). For the ROC-AUC, the SGD and ADAM models also performed better, because, as they had a more balanced classification of positive and negative outcomes, the ROC-AUC value was in the range of 82.13% and 81.42%, respectively.

Considering these results, the models used in this experiment are not only able to predict which patients are more likely to be discharged or die, but also allow us to understand which samples are being better classified concerning the outcome and the best type of hospital bed according to the clinical conditions of each patient. And so, the main metric analyzed should not only be accuracy; the other metrics that point to a positive outcome (precision, recall and ROC-AUC) should also be maximized [2]. Furthermore, it also has a positive impact on the pace of work of the regulatory professional, given that in situations of high demand and overload of requests, the assertiveness of the regulatory process can be compromised, and so the models contribute to better regulatory conduct [76, 77].

Conclusion

This study used the regulation database of the RegulaRN Leitos Gerais platform between 2021 and 2024 in machine learning models to predict the outcome of discharge and death in different diseases that require hospitalization. The results of this article show that there is no single model that obtains the best accuracy, precision, recall, F1-Score, specificity, and ROC-AUC metrics. Thus, depending on the objectives of the regulatory professionals, it should be observed which model can provide the best result based on the desired metric, i.e. for example, if the regulator’s objective is to observe the best classifications for the positive outcome, it should use XGBoost and Random Forest; If the objective is to evaluate the best classification for the negative outcome, the multilayer perceptron models should be evaluated.

It should be noted that artificial intelligence computer models enhance the activities carried out in the healthcare and management sectors. Research in this area should therefore be increasingly explored in order to minimize the precariousness and weaknesses that exist in the different health segments. In this way, this research also aims to make a positive contribution to the health system such as the SUS, which aims to guarantee universal and comprehensive access to health with equity.

A significant limitation of this study is the incomplete dataset, particularly the absence of detailed information such as pregnancy status and gestational age. This missing information could introduce bias in model predictions, particularly for patient subgroups with different clinical needs. Future work should focus on improving data collection protocols to ensure that such critical variables are recorded, allowing for more nuanced and accurate model predictions across diverse patient groups. During the evaluation of the database, some gaps were found in the data, which is why it was not included for training the models. However, for some diseases, knowing whether the patient is pregnant or not and the appropriate length of pregnancy are essential. In addition, this study considered the same evaluation of hospital outcomes in different diseases with different morbidity scales. Furthermore, another limitation of this work was the non-inclusion of other models widely used in academic literature, such as k-Nearest Neighbors (kNN) and Support Vector Machines (SVM) [7880], as they were not included within the initial scope of this research. However, it is considered that for future work the scope of computational models can be expanded and these models included. Furthermore, still addressing future work, creating a new feature that can categorize diseases by morbidity could contribute to a more appropriate classification of the models. Furthermore, trying to identify which treatment protocols were used to treat certain diseases can also be a relevant indicator for classifying models.

Acknowledgments

We would like to thank the Public Health Secretariat of Rio Grande do Norte (SESAP/RN), the Health Technological Innovation Laboratory (LAIS) of the Federal University of Rio Grande do Norte (UFRN), the Advanced Innovation Center (NAVI) of the Federal Institute do Rio Grande do Norte (IFRN), to LyRIDS, ECE-Engineering School and the Department of Informatics and Applied Mathematics for the support necessary for the development of this research.

Data Availability

The data used in this research can be accessed via the link: https://zenodo.org/records/11387710.

Funding Statement

The present study was funded through the Project Regula SESAP-RN/FUNCERN, grant number 69/2021, carried out by the Laboratory of Technological Innovation in Health (LAIS) of the Federal University of Rio Grande do Norte (UFRN) in cooperation with the Secretary of Public Health of Rio Grande do Norte.

References

  • 1. Bastos LBR, Barbosa MA, Rosso CFW, Oliveira LMdAC, Ferreira IP, Bastos DAdS, et al. Practices and challenges on coordinating the Brazilian Unified Health System. Revista de Saúde Pública. 2020;54:25. doi: 10.11606/s1518-8787.2020054001512 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Barreto TdO, Veras NVR, Cardoso PH, Fernandes FRdS, Medeiros LPdS, Bezerra MV, et al. Artificial intelligence applied to analyzes during the pandemic: COVID-19 beds occupancy in the state of Rio Grande do Norte, Brazil. Frontiers in Artificial Intelligence. 2023;6:1290022. doi: 10.3389/frai.2023.1290022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Brasil. Portaria nº 1.559, de 1º de agosto de 2008. Institui a Política Nacional de Regulação do Sistema Único de Saúde-SUS. Diário Oficial da União. 2008;.
  • 4.Brasil. Portaria nº 2, de 28º de setembro de 2017. Consolidação das normas sobre as políticas nacionais de Saú do Sistema Único de Saúde-SUS. Diário Oficial da União. 2017;.
  • 5. Maldonado RN, Savio RO, Feijó VBER, Aroni P, Rossaneis MA, Haddad MdCFL. Hospital indicators after implementation of bed regulation strategies: an integrative review. Revista Brasileira de Enfermagem. 2021;74:e20200022. doi: 10.1590/0034-7167-2020-0022 [DOI] [PubMed] [Google Scholar]
  • 6.Cordeiro MF. SISREG: uma ferramenta de desafios e avanços para a garantia do direito a saúde. 2015;.
  • 7. Junior JRM, de Souza Junior AA, da Luz AEdJ. The impact of COVID-19 on the municipal Regulation System (SISREG) of Rio de Janeiro (RJ). Research, Society and Development. 2024;13(4):e5613445564–e5613445564. doi: 10.33448/rsd-v13i4.45564 [DOI] [Google Scholar]
  • 8. Valentim RAdM, Lima TS, Cortez LR, Barros DMdS, Silva RDd, Paiva JCd, et al. The relevance a technology ecosystem in the Brazilian National Health Service’s Covid-19 response: the case of Rio Grande do Norte, Brazil. Ciência & Saúde Coletiva. 2021;26:2035–2052. [DOI] [PubMed] [Google Scholar]
  • 9.Medina MVB. Análise da utilização da Escala Quick Sequential Organ Failure Assessment para tomada de decisão na regulação de leitos de UTI. Universidade Federal do Rio Grande do Norte; 2023.
  • 10.Konder M, O’Dwyer G. Regulation of access to hospital beds in emergency care and the development of integrated health services. 2019;.
  • 11. Kim M, Lee JY, Park JS, Kim HA, Hyun M, Suh YS, et al. Lessons from a COVID-19 hospital, Republic of Korea. Bulletin of the World Health Organization. 2020;98(12):842. doi: 10.2471/BLT.20.261016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Lu Y, Guan Y, Zhong X, Fishe JN, Hogan T. Hospital beds planning and admission control policies for COVID-19 pandemic: A hybrid computer simulation approach. In: 2021 IEEE 17th International Conference on Automation SCience and Engineering (CASE). IEEE; 2021. p. 956–961.
  • 13. Pecoraro F, Luzi D, Clemente F. The efficiency in the ordinary hospital bed management: A comparative analysis in four European countries before the COVID-19 outbreak. Plos one. 2021;16(3):e0248867. doi: 10.1371/journal.pone.0248867 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Shi F, Li H, Liu R, Liu Y, Liu X, Wen H, et al. Emergency preparedness and management of mobile cabin hospitals in China during the COVID-19 pandemic. Frontiers in Public Health. 2022;9:763723. doi: 10.3389/fpubh.2021.763723 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Alavinejad M, Mellado B, Asgary A, Mbada M, Mathaha T, Lieberman B, et al. Management of hospital beds and ventilators in the Gauteng province, South Africa, during the COVID-19 pandemic. PLOS global public health. 2022;2(11):e0001113. doi: 10.1371/journal.pgph.0001113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Kuzior A, Kashcha M, Kuzmenko O, Lyeonov S, Brożek P. Public health system economic efficiency and COVID-19 resilience: Frontier DEA analysis. International Journal of Environmental Research and Public Health. 2022;19(22):14727. doi: 10.3390/ijerph192214727 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Taylor CA, Draney MT, Ku JP, Parker D, Steele BN, Wang K, et al. Predictive medicine: computational techniques in therapeutic decision-making. Computer Aided Surgery: Official Journal of the International Society for Computer Aided Surgery (ISCAS). 1999;4(5):231–247. doi: [DOI] [PubMed] [Google Scholar]
  • 18. Shahid N, Rappon T, Berta W. Applications of artificial neural networks in health care organizational decision-making: A scoping review. PloS one. 2019;14(2):e0212356. doi: 10.1371/journal.pone.0212356 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Tian S, Yang W, Le Grange JM, Wang P, Huang W, Ye Z. Smart healthcare: making medical care more intelligent. Global Health Journal. 2019;3(3):62–65. doi: 10.1016/j.glohj.2019.07.001 [DOI] [Google Scholar]
  • 20. Panagiotou OA, Högg LH, Hricak H, Khleif SN, Levy MA, Magnus D, et al. Clinical application of computational methods in precision oncology: a review. JAMA oncology. 2020;6(8):1282–1286. doi: 10.1001/jamaoncol.2020.1247 [DOI] [PubMed] [Google Scholar]
  • 21.Bian J, Modave F. The rapid growth of intelligent systems in health and health care; 2020. [DOI] [PubMed]
  • 22. Gupta PK, Ramachandran AT, Keerthi AM, Dave PS, Giridhar S, Kallapur SS, et al. An overview of clinical decision support system (CDSS) as a computational tool and its applications in public health. Applications in ubiquitous computing. 2021; p. 81–117. doi: 10.1007/978-3-030-35280-6_5 [DOI] [Google Scholar]
  • 23. Moulaei K, Shanbehzadeh M, Mohammadi-Taghiabad Z, Kazemi-Arpanahi H. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC medical informatics and decision making. 2022;22(1):2. doi: 10.1186/s12911-021-01742-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Albuquerque G, Fernandes F, Barbalho IM, Barros DM, Morais PS, Morais AH, et al. Computational methods applied to syphilis: where are we, and where are we going? Frontiers in Public Health. 2023;11:1201725. doi: 10.3389/fpubh.2023.1201725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Carvalho DRd, Araújo BGd, Lacerda JMT, Dantas MdCR, Hékis HR, Valentim RAdM. An architecture for online transient detection in electrocardiogram signals on the MP-HA protocol. Revista Brasileira de Engenharia Biomédica. 2012;28:346–354. [Google Scholar]
  • 26. Levantesi S, Pizzorusso V. Application of machine learning to mortality modeling and forecasting. Risks. 2019;7(1):26. doi: 10.3390/risks7010026 [DOI] [Google Scholar]
  • 27. Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE reviews in Biomedical Engineering. 2020;14:116–126. doi: 10.1109/RBME.2020.3007816 [DOI] [PubMed] [Google Scholar]
  • 28. Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC medical research methodology. 2021;21:1–14. doi: 10.1186/s12874-021-01284-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Dixit RR. Risk Assessment for Hospital Readmissions: Insights from Machine Learning Algorithms. Sage Science Review of Applied Machine Learning. 2021;4(2):1–15. [Google Scholar]
  • 30. Iwase S, Nakada Ta, Shimada T, Oami T, Shimazui T, Takahashi N, et al. Prediction algorithm for ICU mortality and length of stay using machine learning. Scientific reports. 2022;12(1):12912. doi: 10.1038/s41598-022-17091-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Baak M, Koopman R, Snoek H, Klous S. A new correlation coefficient between categorical, ordinal and interval variables with Pearson characteristics. Computational Statistics & Data Analysis. 2020;152:107043. doi: 10.1016/j.csda.2020.107043 [DOI] [Google Scholar]
  • 32. Iwendi C, Bashir AK, Peshkar A, Sujatha R, Chatterjee JM, Pasupuleti S, et al. COVID-19 patient health prediction using boosted random forest algorithm. Frontiers in public health. 2020;8:357. doi: 10.3389/fpubh.2020.00357 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Aljameel SS, Khan IU, Aslam N, Aljabri M, Alsulmi ES. Machine Learning-Based Model to Predict the Disease Severity and Outcome in COVID-19 Patients. Scientific programming. 2021;2021(1):5587188. [Google Scholar]
  • 34. Endo PT, Santos GL, de Lima Xavier ME, Nascimento Campos GR, de Lima LC, Silva I, et al. Illusion of truth: analysing and classifying COVID-19 fake news in brazilian portuguese language. Big Data and Cognitive Computing. 2022;6(2):36. doi: 10.3390/bdcc6020036 [DOI] [Google Scholar]
  • 35. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research. 2002;16:321–357. doi: 10.1613/jair.953 [DOI] [Google Scholar]
  • 36.Maiga J, Hungilo GG, et al. Comparison of machine learning models in prediction of cardiovascular disease using health record data. In: 2019 international conference on informatics, multimedia, cyber and information system (ICIMCIS). IEEE; 2019. p. 45–48.
  • 37.Hamida S, El Gannour O, Cherradi B, Ouajji H, Raihani A. Optimization of machine learning algorithms hyper-parameters for improving the prediction of patients infected with COVID-19. In: 2020 ieee 2nd international conference on electronics, control, optimization and computer science (icecocs). IEEE; 2020. p. 1–6.
  • 38. Papaiz F, Dourado MET Jr, de Medeiros Valentim RA, Pinto R, de Morais AHF, Arrais JP. Ensemble-imbalance-based classification for amyotrophic lateral sclerosis prognostic prediction: identifying short-survival patients at diagnosis. BMC Medical Informatics and Decision Making. 2024;24(1):80. doi: 10.1186/s12911-024-02484-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Divya KS, Bhargavi P, Jyothi S. Machine learning algorithms in big data analytics. Int J Comput Sci Eng. 2018;6(1):63–70. [Google Scholar]
  • 40. Liu J, Wang L, Zhang L, Zhang Z, Zhang S. Predictive analytics for blood glucose concentration: an empirical study using the tree-based ensemble approach. Library Hi Tech. 2020;38(4):835–858. doi: 10.1108/LHT-08-2019-0171 [DOI] [Google Scholar]
  • 41.Rahul K, Banyal RK, Goswami P, Kumar V. Machine learning algorithms for big data analytics. In: Computational Methods and Data Engineering: Proceedings of ICMDE 2020, Volume 1. Springer; 2021. p. 359–367.
  • 42. Charbuty B, Abdulazeez A. Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends. 2021;2(01):20–28. doi: 10.38094/jastt20165 [DOI] [Google Scholar]
  • 43. Rigatti SJ. Random forest. Journal of Insurance Medicine. 2017;47(1):31–39. doi: 10.17849/insm-47-01-31-39.1 [DOI] [PubMed] [Google Scholar]
  • 44. Fafalios S, Charonyktakis P, Tsamardinos I. Gradient boosting trees. Gnosis Data Analysis PC. 2020;1. [Google Scholar]
  • 45. Schapire RE. Empirical inference. Berlin, Heidelberg. 2013; p. 37–52. [Google Scholar]
  • 46.Sheng C, Yu H. An optimized prediction algorithm based on XGBoost. In: 2022 International Conference on Networking and Network Applications (NaNA). IEEE; 2022. p. 1–6.
  • 47. Popescu MC, Balas VE, Perescu-Popescu L, Mastorakis N. Multilayer perceptron and neural networks. WSEAS Transactions on Circuits and Systems. 2009;8(7):579–588. [Google Scholar]
  • 48.Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014;.
  • 49. Pedregosa F. Scikit-learn: Machine learning in python Fabian. Journal of machine learning research. 2011;12:2825. [Google Scholar]
  • 50. Ensor KB, Glynn PW. Stochastic optimization via grid search. Lectures in Applied Mathematics-American Mathematical Society. 1997;33:89–100. [Google Scholar]
  • 51. Bergstra J, Bengio Y. Random search for hyper-parameter optimization. Journal of machine learning research. 2012;13(2). [Google Scholar]
  • 52. Ahsan MM, E Alam T, Trafalis T, Huebner P. Deep MLP-CNN model using mixed-data to distinguish between COVID-19 and Non-COVID-19 patients. Symmetry. 2020;12(9):1526. doi: 10.3390/sym12091526 [DOI] [Google Scholar]
  • 53. Bradley AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern recognition. 1997;30(7):1145–1159. doi: 10.1016/S0031-3203(96)00142-2 [DOI] [Google Scholar]
  • 54. De Hond AA, Steyerberg EW, Van Calster B. Interpreting area under the receiver operating characteristic curve. The Lancet Digital Health. 2022;4(12):e853–e855. doi: 10.1016/S2589-7500(22)00188-1 [DOI] [PubMed] [Google Scholar]
  • 55. Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean journal of anesthesiology. 2022;75(1):25–36. doi: 10.4097/kja.21209 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Mohammad-Rahimi H, Nadimi M, Ghalyanchi-Langeroudi A, Taheri M, Ghafouri-Fard S. Application of machine learning in diagnosis of COVID-19 through X-ray and CT images: a scoping review. Frontiers in cardiovascular medicine. 2021;8:638011. doi: 10.3389/fcvm.2021.638011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Yu KH, Beam AL, Kohane IS. Artificial intelligence in healthcare. Nature biomedical engineering. 2018;2(10):719–731. doi: 10.1038/s41551-018-0305-z [DOI] [PubMed] [Google Scholar]
  • 58. Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future healthcare journal. 2019;6(2):94–98. doi: 10.7861/futurehosp.6-2-94 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Owoyemi A, Owoyemi J, Osiyemi A, Boyd A. Artificial Intelligence for Healthcare in Africa. Frontiers in digital health 2: 6; 2020. doi: 10.3389/fdgth.2020.00006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Yang CC. Explainable artificial intelligence for predictive modeling in healthcare. Journal of healthcare informatics research. 2022;6(2):228–239. doi: 10.1007/s41666-022-00114-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Cuocolo R, Perillo T, De Rosa E, Ugga L, Petretta M. Current applications of big data and machine learning in cardiology. Journal of geriatric cardiology: JGC. 2019;16(8):601. doi: 10.11909/j.issn.1671-5411.2019.08.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Barros DM, Moura JC, Freire CR, Taleb AC, Valentim RA, Morais PS. Machine learning applied to retinal image processing for glaucoma detection: review and perspective. Biomedical engineering online. 2020;19:1–21. doi: 10.1186/s12938-020-00767-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Srivastava O, Tennant M, Grewal P, Rubin U, Seamone M. Artificial intelligence and machine learning in ophthalmology: A review. Indian Journal of Ophthalmology. 2023;71(1):11–17. doi: 10.4103/ijo.IJO_1569_22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Firmino M, Angelo G, Morais H, Dantas MR, Valentim R. Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. Biomedical engineering online. 2016;15:1–17. doi: 10.1186/s12938-015-0120-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Galvao-Lima L, Morais H, Valentim R, Barreto E. miRNAs as biomarkers for early cancer detection and their application in the development of new diagnostic tools. Biomedical engineering online. 2021;20:1–21. doi: 10.1186/s12938-021-00857-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Zhao B, Waterman R, Urman R, Gabriel RA. A machine learning approach to predicting case duration for robot-assisted surgery. Journal of Medical Systems. 2019;43:1–32. doi: 10.1007/s10916-018-1151-y [DOI] [PubMed] [Google Scholar]
  • 68. Panesar S, Cagle Y, Chander D, Morey J, Fernandez-Miranda J, Kliot M. Artificial intelligence and the future of surgical robotics. Annals of surgery. 2019;270(2):223–226. doi: 10.1097/SLA.0000000000003262 [DOI] [PubMed] [Google Scholar]
  • 69. Zhou XY, Guo Y, Shen M, Yang GZ. Application of artificial intelligence in surgery. Frontiers of medicine. 2020;14:417–430. doi: 10.1007/s11684-020-0770-0 [DOI] [PubMed] [Google Scholar]
  • 70. Moglia A, Georgiou K, Georgiou E, Satava RM, Cuschieri A. A systematic review on artificial intelligence in robot-assisted surgery. International Journal of Surgery. 2021;95:106151. doi: 10.1016/j.ijsu.2021.106151 [DOI] [PubMed] [Google Scholar]
  • 71. Fernandes YYMP, Araújo GTd, Araújo BGd, Dantas MdCR, Carvalho DRd, Valentim RAdM. ILITIA: telehealth architecture for high-risk gestation classification. Research on Biomedical Engineering. 2017;33(3):237–246. doi: 10.1590/2446-4740.09416 [DOI] [Google Scholar]
  • 72. Reddy S. Explainability and artificial intelligence in medicine. The Lancet Digital Health. 2022;4(4):e214–e215. doi: 10.1016/S2589-7500(22)00029-2 [DOI] [PubMed] [Google Scholar]
  • 73. Schwalbe N, Wahl B. Artificial intelligence and the future of global health. The Lancet. 2020;395(10236):1579–1586. doi: 10.1016/S0140-6736(20)30226-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Ammar W, Kdouh O, Hammoud R, Hamadeh R, Harb H, Ammar Z, et al. Health system resilience: Lebanon and the Syrian refugee crisis. Journal of global health. 2016;6(2). doi: 10.7189/jogh.06.020704 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Massuda A, Hone T, Leles FAG, De Castro MC, Atun R. The Brazilian health system at crossroads: progress, crisis and resilience. BMJ global health. 2018;3(4):e000829. doi: 10.1136/bmjgh-2018-000829 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Muhammad L, Algehyne EA, Usman SS, Ahmad A, Chakraborty C, Mohammed IA. Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset. SN computer science. 2021;2(1):1–13. doi: 10.1007/s42979-020-00394-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Silva Junior CL, Guabiraba KPdL, Gomes GG, Andrade CLTd, Melo EA. Outpatient regulation in Primary Care in the municipality of Rio de Janeiro, Brazil, based on the local regulatory doctors. Ciência & Saúde Coletiva. 2022;27:2481–2493. [DOI] [PubMed] [Google Scholar]
  • 78. Ali A, Khan Z, Khan DM, Aldahmani S. An Optimal Random Projection k Nearest Neighbours Ensemble via Extended Neighbourhood Rule for Binary Classification. IEEE Access. 2024;. [Google Scholar]
  • 79. Ali A, Hamraz M, Gul N, Khan DM, Aldahmani S, Khan Z. A k nearest neighbour ensemble via extended neighbourhood rule and feature subsets. Pattern Recognition. 2023;142:109641. doi: 10.1016/j.patcog.2023.109641 [DOI] [Google Scholar]
  • 80. Vijayarani S, Dhayanand S, Phil M. Kidney disease prediction using SVM and ANN algorithms. International Journal of Computing and Business Research (IJCBR). 2015;6(2):1–12. [Google Scholar]

Decision Letter 0

Luísa da Matta Machado Fernandes

24 Sep 2024

PONE-D-24-27498Artificial Intelligence Applied to Bed Regulation in Rio Grande do Norte: Data Analysis and Application of Machine Learning on the “RegulaRN Leitos Gerais” PlatformPLOS ONE

Dear Dr. Barreto,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

I hope this message finds you well. I have reviewed your study on improving technology for bed regulation within a universal health system and would like to offer some suggestions to enhance its international relevance.

To strengthen your findings, I recommend providing a more detailed explanation of the current limitations of the national system used for bed regulation. Additionally, contextualizing these regulations within the broader health system would provide valuable insights.

As per the reviewers' feedback, please update the discussion with current literature and enhance the comprehensibility of the methods and results section. Including a flowchart and improving the graphics and tables will also aid in conveying your findings more effectively.

I look forward to seeing the revised manuscript.

==============================

Please submit your revised manuscript by Nov 08 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Luísa da Matta Machado Fernandes, DrPH

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following financial disclosure:

 [copy in funding statement]. 

Please state what role the funders took in the study.  If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

If this statement is not correct you must amend it as needed.

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This study effectively applies machine learning to the RegulaRN Leitos Gerais platform to optimize hospital bed regulation in Rio Grande do Norte. Analyzing data from October 2021 to January 2024, it shows strong performance from models like XGBoost, Random Forest, and Gradient Boosting in accuracy, precision, and recall. However, the study would benefit from incorporating additional methods such as k-Nearest Neighbors (kNN) and Support Vector Machines (SVM), as well as referencing recent literature (e.g., doi.org/10.1109/ACCESS.2024.3392729 and doi.org/10.1016/j.patcog.2023.109641) to ensure a comprehensive evaluation and alignment with current advancements in healthcare machine learning.

Reviewer #2: • Suggested Improvement: The introduction clearly outlines the general problem of bed regulation but could be more detailed in discussing the relevance of artificial intelligence in this specific context by comparing it with similar studies.

o Lines 60-61: "Intelligent computer models can help reduce the impact of uncertainties and ambiguities in the regulatory process and improve decision-making support."

o Replace with: "Intelligent computer models have demonstrated significant potential in healthcare systems by reducing uncertainties and ambiguities in complex decision-making processes. For example, prior studies in similar healthcare contexts have shown that machine learning models can enhance hospital management by optimizing resource allocation and reducing patient waiting times (reference). This study aims to build on these findings to demonstrate the effectiveness of AI-based models specifically in bed regulation in Rio Grande do Norte."

More detailed discussion on model limitations The article mentions limitations but could provide a more robust discussion of how these limitations affect the results and suggest strategies to mitigate them.

• Line 398: "The study’s limitations include the fact that the health professionals did not provide more precise information on some of the data that could have been better analyzed in the models, such as whether the patient was pregnant."

• Replace with: "A significant limitation of this study is the incomplete dataset, particularly the absence of detailed information such as pregnancy status and gestational age. This missing information could introduce bias in model predictions, particularly for patient subgroups with different clinical needs. Future work should focus on improving data collection protocols to ensure that such critical variables are recorded, allowing for more nuanced and accurate model predictions across diverse patient groups."

More detailed explanation of the choice of machine learning models The choice of machine learning models is explained, but a more in-depth justification of why certain models (like XGBoost and Random Forest) were selected would be helpful.

• Line 165: "The definition of models for data classification involved algorithms that, according to the literature, perform well with high volumes of data."

• Replace with: "The selection of classification models was based on their proven ability to handle large volumes of imbalanced healthcare data. XGBoost, for example, has been shown to perform well in healthcare settings due to its gradient boosting framework, which effectively handles missing data and provides robust performance on tabular datasets (Chen & Guestrin, 2016). Random Forest, on the other hand, was selected for its ability to manage complex decision trees and its resistance to overfitting, particularly in high-dimensional datasets (Breiman, 2001). These models were chosen for their complementarity in addressing the specific challenges of bed regulation data in this study."

Better organization of tables and figures The presentation of tables and figures can be improved with more descriptive captions and the inclusion of an analysis immediately after presenting each figure/table to facilitate the interpretation of the results.

• Line 276: "Table 6 shows all the values obtained."

• Replace with: "Table 6 presents the performance metrics for each machine learning model, including accuracy, precision, recall, F1-Score, and specificity. Notably, XGBoost outperformed the other models in accuracy and recall, making it a robust choice for predicting patient outcomes in bed regulation. However, the high specificity observed in the MLP models indicates that these models may be more suitable when the goal is to minimize false positives, particularly in critical care cases."

More in-depth discussion of the practical impact of the results The discussion could delve deeper into the practical impact of adopting AI in the healthcare system and the challenges of implementing it on a large scale.

• Line 349: "At the management level, in response to a more efficient regulatory system, financial and human resources can be distributed in a way that is more coherent with the needs of the different health sectors."

• Replace with: "At the management level, adopting AI-driven systems for bed regulation could lead to significant improvements in resource allocation, reducing patient wait times and optimizing bed occupancy rates. However, implementing these systems at scale presents challenges, such as ensuring adequate training for health professionals and integrating AI tools with existing hospital infrastructure. Addressing these challenges will be critical for maximizing the potential benefits of AI in the public healthcare system."

I noticed the absence of graphs and visual figures that could significantly improve the clarity and understanding of the article, especially concerning the methodology and results. The inclusion of graphical elements such as flowcharts and result charts would be a valuable contribution to make the methodological process more accessible and the data easier to interpret.

Firstly, presenting a detailed methodology flowchart would be extremely helpful in guiding readers through all the stages described in the article. This would clarify the process from data extraction and processing to machine learning model selection, allowing for a clearer understanding of the methodological sequence.

Additionally, to facilitate the understanding of the results, it would be interesting to include graphs comparing the performance metrics of the different machine learning models. Bar graphs, for example, could visually show the accuracy, recall, specificity, and F1-Score of the tested models. The inclusion of ROC-AUC curves would also help visually present each model’s discriminative capabilities, making the information easier to interpret for readers.

Finally, the article mentions correlation analysis between variables, but including a correlation matrix, such as a heatmap, could visually highlight the most significant relationships between the variables. This type of graphical visualization would make data interpretation more immediate and intuitive for the reader.

These visual additions, such as flowcharts and result charts, would not only enhance the manuscript’s clarity but also help more effectively convey the complexities involved in data analysis and the results obtained.

Reviewer #3: Texto apresenta clareza de dados, necessidade de pequenos ajustes apontados no manuscrito enviado. A publicação duplicada, intencional ou não, pode prejudicar a credibilidade da pesquisa e comprometer os direitos de propriedade intelectual de ambos os periódicos.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Danielle Torres dos Santos Lopes

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PONE-D-24-27498_reviewer.docx

pone.0315379.s001.docx (146.9KB, docx)
PLoS One. 2024 Dec 30;19(12):e0315379. doi: 10.1371/journal.pone.0315379.r002

Author response to Decision Letter 0


11 Oct 2024

A response to each suggestion was attached as a response document to the reviewers.

Attachment

Submitted filename: renamed_025d2.pdf

pone.0315379.s002.pdf (113.8KB, pdf)

Decision Letter 1

Luísa da Matta Machado Fernandes

26 Nov 2024

Artificial Intelligence Applied to Bed Regulation in Rio Grande do Norte: Data Analysis and Application of Machine Learning on the “RegulaRN Leitos Gerais” Platform

PONE-D-24-27498R1

Dear Dr. Barreto,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Luísa da Matta Machado Fernandes, DrPH

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Luísa da Matta Machado Fernandes

28 Nov 2024

PONE-D-24-27498R1

PLOS ONE

Dear Dr. Barreto,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Luísa da Matta Machado Fernandes

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: PONE-D-24-27498_reviewer.docx

    pone.0315379.s001.docx (146.9KB, docx)
    Attachment

    Submitted filename: renamed_025d2.pdf

    pone.0315379.s002.pdf (113.8KB, pdf)

    Data Availability Statement

    The data used in this research can be accessed via the link: https://zenodo.org/records/11387710.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES