Skip to main content
. 2024 May 2;14:10074. doi: 10.1038/s41598-024-59643-x

Table 2.

Optimization algorithm for single pollutant DLNM.

Step Description
1 Y is a daily count data that originates from quasi-Poisson distribution. Depending on the type of outcome, it can be assumed as one of the exponential families of distributions
2 Consider 2k of DLNMs by best subset selection, where k is the number of covariates (e.g. meteorological factors) except the terms to describe seasonality, trend, holiday effects, etc. In other words, comparing the model performance by considering everything from the non-covariate single-exposure model to the full-covariates single-exposure model
3

Tune hyper-parameters of each 2k of DLNM based on QAIC (the AIC for quasi-Poisson)

• Maximum lag days: [m1,m2,]

• Degrees of freedom in predictor space(dfp): [v1,v2,]

• Degrees of freedom in additional lag dimension(dfl): [l1,l2,]

Knots are equally spaced, and a natural cubic spline is selected as a basis function. To tune and optimize each 2k of DLNM, use one of the search methods (e.g. grid search, random search)

4

Among the 2k of DLNMs optimized in step 3, The model with the smallest QAIC value is selected as the best model. However, if there is a model with a QAIC difference of less than 2 from the optimal model with the smallest QAIC as follows:

Δi=QAICi-QAICmin<2

The simplest model is the best model by comparing the models, including the optimal one

DLNM distributed lag non-linear models.