Table 3.
Comparison of models.
| Model type | Model name | Description | Difference from the paired model |
|---|---|---|---|
| Linear regression | RML | Multiple linear regression model with multiple independent variables, assuming a linear relationship between the dependent variable and the independent variables | The inclusion of nonlinear terms in RMLNE allows a more complex relationship between the dependent variable and the independent variables, which can improve the model's fitting capability, especially for nonlinear data |
| | RMLNE | Multiple linear regression model with nonlinear terms, allowing a more complex relationship between the dependent variable and the independent variables; differs from RML only in the inclusion of nonlinear terms | |
| Neural network | NNBP[X] | An artificial neural network trained with the backpropagation algorithm, with as many neurons in the hidden layer(s) as input variables (1x); differs from NNBP[2X] in the number of hidden-layer neurons | The two models differ in complexity and potential learning capability. NNBP[2X] has more neurons, which increases its capacity to learn complex relationships between the input and output variables and can yield better fits and more accurate predictions. However, a higher number of neurons also raises the risk of overfitting, as the model may become too complex and fit the noise in the training data |
| | NNBP[2X] | An artificial neural network similar to NNBP[X] but with twice as many neurons in the hidden layer(s) | |
| Recurrent neural network | NNR[Y] | A neural network that processes sequential data, using the first 30 time steps to make predictions; differs from NNR[0.5Y] in the number of time steps used | The two models differ in how much historical information they consider when making predictions. NNR[Y] takes into account a longer sequence of past data, which may provide more context and improve its ability to capture temporal patterns and trends. Using more time steps also increases the computational complexity of the model and may require more data to train effectively |
| | NNR[0.5Y] | A neural network similar to NNR[Y] but using the first 15 time steps for prediction | |
| Random forest regression | RFR100 | An ensemble learning model that combines multiple decision trees for regression prediction, using 100 decision trees; differs from RFR200 in the number of trees | The larger number of decision trees in RFR200 generally yields a more complex model that can capture subtler patterns in the data and potentially produce more accurate predictions. However, this comes at the cost of increased computational complexity and a higher risk of overfitting, particularly if the dataset is small |
| | RFR200 | An ensemble learning model similar to RFR100 but using 200 decision trees for regression prediction |
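The model pairs in Table 3 can be sketched with off-the-shelf estimators. The snippet below is a minimal illustration, not the paper's implementation: the synthetic data, the degree-2 polynomial expansion used for RMLNE's nonlinear terms, and all hyperparameters other than those stated in the table (1x vs. 2x hidden neurons, 100 vs. 200 trees) are assumptions. The NNR models are omitted here because sequential 30- and 15-step inputs require a recurrent framework beyond scikit-learn.

```python
# Hedged sketch of the Table 3 model pairs using scikit-learn.
# Data and most hyperparameters are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures
from sklearn.pipeline import make_pipeline
from sklearn.neural_network import MLPRegressor
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))  # 4 independent variables (assumed)
# Linear signal plus one quadratic term, so RMLNE has something nonlinear to gain
y = X @ np.array([1.0, -2.0, 0.5, 3.0]) + 0.3 * X[:, 0] ** 2 \
    + rng.normal(scale=0.1, size=200)

n_inputs = X.shape[1]

models = {
    # RML: plain multiple linear regression
    "RML": LinearRegression(),
    # RMLNE: linear regression on an expanded basis with nonlinear (degree-2) terms
    "RMLNE": make_pipeline(PolynomialFeatures(degree=2), LinearRegression()),
    # NNBP[X] / NNBP[2X]: backpropagation networks whose hidden layer has
    # 1x and 2x the number of input variables, as in the table
    "NNBP[X]": MLPRegressor(hidden_layer_sizes=(n_inputs,),
                            max_iter=2000, random_state=0),
    "NNBP[2X]": MLPRegressor(hidden_layer_sizes=(2 * n_inputs,),
                             max_iter=2000, random_state=0),
    # RFR100 / RFR200: random forests with 100 vs. 200 trees
    "RFR100": RandomForestRegressor(n_estimators=100, random_state=0),
    "RFR200": RandomForestRegressor(n_estimators=200, random_state=0),
}

# In-sample R^2 for each model (a proper study would use held-out data)
scores = {name: m.fit(X, y).score(X, y) for name, m in models.items()}
```

Because the degree-2 expansion contains the quadratic term in the generating process, RMLNE should fit this toy data nearly perfectly while RML cannot, mirroring the advantage the table describes.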