Table 2. Engineered features (total count, 165).
| Feature type* | Description | No. of features |
| [Disease category]Δ | Likelihood defect (see Materials and Methods) |
17 |
| [Disease category]0 | Likelihood of control model (see Materials and Methods) |
17 |
| [Disease category]proportion | Occurrences in the encoded sequence/length of the sequence |
17 |
| [Disease category]streak | Maximum length of adjacent occurrences of [disease category] |
51 |
| [Disease category]prevalence |
Maximum, mean, and variance of occurrences in the encoded sequence/total number of diagnostic codes in the mapped sequence |
51 |
| Feature mean, feature variance, and feature maximum for difference of control and case models |
Mean, variance, and maximum of the [disease category]Δ values |
3 |
| Feature mean, feature variance, and feature maximum for control models |
Mean, variance, and maximum of the [disease category]0 values |
3 |
| Streak | Maximum, mean, and variance of the length of adjacent occurrences of [disease category] |
3 |
| Intermission | Maximum, mean, and variance of the length of adjacent empty weeks |
3 |
*Disease categories are described in table S1.