Skip to main content
. 2021 Oct 6;7(41):eabf0354. doi: 10.1126/sciadv.abf0354

Table 2. Engineered features (total count, 165).

Feature type* Description No. of features
[Disease category]Δ Likelihood defect (see
Materials and
Methods)
17
[Disease category]0 Likelihood of control
model (see
Materials and
Methods)
17
[Disease category]proportion Occurrences in the
encoded
sequence/length
of the sequence
17
[Disease category]streak Maximum length of
adjacent
occurrences of
[disease category]
51
[Disease
category]prevalence
Maximum, mean, and
variance of
occurrences in the
encoded
sequence/total
number of
diagnostic codes in
the mapped
sequence
51
Feature mean, feature
variance, and
feature maximum
for difference of
control and case
models
Mean, variance, and
maximum of the
[disease category]Δ values
3
Feature mean, feature
variance, and
feature maximum
for control models
Mean, variance, and
maximum of the
[disease category]0
values
3
Streak Maximum, mean, and
variance of the
length of adjacent
occurrences of
[disease category]
3
Intermission Maximum, mean, and
variance of the
length of adjacent
empty weeks
3

*Disease categories are described in table S1.