Performance metrics of the different models used. For the original data a linear multiple regression model was used (Anegagrie et al., 2021), and then the baseline (which included all the variables except the number of people per house) and final XGBoost model (including the top five variables and the number of people per household) for the current study. In adition: R2 train refers to R-squared of the training data set, MSE train refers to the Mean Square Error of the test data set, R2 test refers to R-squared of the training data set, MSE test refers to the Mean Square Error of the test data set.