Table 2. Engineered features (total count, 165).
Feature type* | Description | No. of features |
[Disease category]Δ | Likelihood defect (see Materials and Methods) | 17 |
[Disease category]0 | Likelihood of control model (see Materials and Methods) | 17 |
[Disease category]proportion | Occurrences in the encoded sequence/length of the sequence | 17 |
[Disease category]streak | Maximum length of adjacent occurrences of [disease category] | 51 |
[Disease category]prevalence | Maximum, mean, and variance of occurrences in the encoded sequence/total number of diagnostic codes in the mapped sequence | 51 |
Feature mean, feature variance, and feature maximum for difference of control and case models | Mean, variance, and maximum of the [disease category]Δ values | 3 |
Feature mean, feature variance, and feature maximum for control models | Mean, variance, and maximum of the [disease category]0 values | 3 |
Streak | Maximum, mean, and variance of the length of adjacent occurrences of [disease category] | 3 |
Intermission | Maximum, mean, and variance of the length of adjacent empty weeks | 3 |
*Disease categories are described in table S1.
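
The sequence-based rows of Table 2 (proportion, streak, and intermission) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes, hypothetically, that each patient has one weekly encoded sequence per disease category in which `1` marks a week with a code from that category, `0` marks an empty week, and any other symbol marks a week with a code from a different category. The function names, the symbol choices, and the use of population variance are all assumptions made for illustration. The likelihood-based and prevalence features are not covered, since they depend on details given in Materials and Methods.

```python
from itertools import groupby
from statistics import mean, pvariance


def run_lengths(seq, symbol):
    """Return the lengths of all maximal runs of `symbol` in `seq`."""
    return [sum(1 for _ in grp) for key, grp in groupby(seq) if key == symbol]


def summarize(lengths):
    """Maximum, mean, and population variance of run lengths (zeros if none)."""
    if not lengths:
        return {"max": 0, "mean": 0.0, "var": 0.0}
    return {
        "max": max(lengths),
        "mean": mean(lengths),
        "var": pvariance(lengths),  # assumption: population, not sample, variance
    }


def sequence_features(seq, occurrence=1, empty=0):
    """Compute proportion, streak, and intermission features for one sequence.

    `occurrence` and `empty` are hypothetical symbol choices for this sketch.
    """
    # Proportion: occurrences in the encoded sequence / length of the sequence.
    proportion = seq.count(occurrence) / len(seq) if seq else 0.0
    return {
        "proportion": proportion,
        # Streak: statistics over lengths of adjacent occurrences.
        "streak": summarize(run_lengths(seq, occurrence)),
        # Intermission: statistics over lengths of adjacent empty weeks.
        "intermission": summarize(run_lengths(seq, empty)),
    }


# Example: 13 weeks for one disease category (made-up data).
weeks = [0, 1, 1, 0, 0, 2, 1, 0, 0, 0, 1, 1, 1]
features = sequence_features(weeks)
print(features)
```

In this made-up example, the category occurs in 6 of 13 weeks, the runs of adjacent occurrences have lengths 2, 1, and 3, and the runs of empty weeks have lengths 1, 2, and 3.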