eLife. 2018 Oct 15;7:e35856. doi: 10.7554/eLife.35856

Figure 1. Non-parametric mixture-model-based clustering of the CAS dataset, based on 174 features.

SPT = skin prick test. White spaces within the heatmap indicate missing data. Rows represent individuals; columns represent clustering features, with general categories labelled on a grey background. Variables with a grey background are clustering features, ordered first by category or type of variable (e.g. all HDM IgE-related variables grouped together), then by timepoint (earlier to later, from left to right). Variables with a lilac background indicate the resultant cluster membership and the outcome variable (age-5 wheeze). Heatmap values are scaled relative to the range and median of each feature; the median is coloured beige-yellow, median + range red, and median − range blue. For sex, −1/blue = female, 0/yellow (median) = male.
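The colour scaling described above can be sketched as follows. This is a minimal illustration of one plausible reading of the caption, in which each value is mapped to (value − median) / range so that the median lands at 0 and the extremes at ±1; the function name `scale_feature`, the use of `None` for missing data, and the 0/1 coding of sex are assumptions made for the example, not details taken from the paper.

```python
from statistics import median

def scale_feature(values):
    """Scale one feature as (x - median) / range; missing values (None) stay None."""
    observed = [v for v in values if v is not None]
    if not observed:
        return list(values)
    med = median(observed)
    rng = max(observed) - min(observed)
    if rng == 0:
        # A constant feature has no spread; place every observed value at the median.
        return [None if v is None else 0.0 for v in values]
    return [None if v is None else (v - med) / rng for v in values]

# Hypothetical coding: 0 = female, 1 = male, with males in the majority,
# so the median is 1 and the range is 1.
sex = [1, 1, 0, 1, None, 0, 1]
scaled_sex = scale_feature(sex)
print(scaled_sex)
```

Under these assumptions, males fall at 0 (the median colour) and females at −1 (blue), which matches the coding given in the caption.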

Figure 1—figure supplement 1. Scatterplot of principal components analysis (PCA) of the complete-case CAS dataset (N = 186), with points coloured by npEM clusters. Each point represents an individual.

The first two PCs (shown) account for 16.7% of the total variance.
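As a reminder of how such a figure is usually obtained, the proportion of variance attributed to the first two PCs is conventionally the ratio of the two largest eigenvalues to the sum of all eigenvalues. The symbols $\lambda_k$ (eigenvalues of the covariance or correlation matrix, in decreasing order) and $p$ (number of components) are notation introduced here, not taken from the paper:

$$
\frac{\lambda_1 + \lambda_2}{\sum_{k=1}^{p} \lambda_k} \approx 0.167
$$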

Figure 1—figure supplement 2. Silhouette widths of clusters generated by npEM.

$j$ = cluster number; $n_j$ = cluster size; $\mathrm{ave}_{i \in C_j}\, s_i$ = average silhouette width among members $i$ of cluster $C_j$. The overall average silhouette width across all clusters is also given.
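For readers unfamiliar with the quantities in this legend, the sketch below computes silhouette widths using the standard definition, s_i = (b_i − a_i) / max(a_i, b_i), where a_i is the mean distance from point i to the other members of its own cluster and b_i is the smallest mean distance to any other cluster. Euclidean distance, the toy data, and the function names are assumptions for illustration; the caption does not state which dissimilarity measure was used.

```python
from math import dist

def silhouette_widths(points, labels):
    """Return the silhouette width s_i of every point (needs at least two clusters)."""
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    widths = []
    for i, lab in enumerate(labels):
        own = [j for j in clusters[lab] if j != i]
        if not own:
            widths.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the same cluster
        a = sum(dist(points[i], points[j]) for j in own) / len(own)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        widths.append((b - a) / denom if denom > 0 else 0.0)
    return widths

def cluster_summary(points, labels):
    """Map each cluster j to (n_j, average silhouette width of its members)."""
    widths = silhouette_widths(points, labels)
    summary = {}
    for lab in sorted(set(labels)):
        members = [w for w, l in zip(widths, labels) if l == lab]
        summary[lab] = (len(members), sum(members) / len(members))
    return summary

# Toy data: two well-separated clusters of two points each.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
labels = [1, 1, 2, 2]
summary = cluster_summary(points, labels)
overall = sum(silhouette_widths(points, labels)) / len(points)
print(summary, overall)
```

Values near 1 indicate that a point sits well inside its cluster, values near 0 that it lies between clusters, and negative values that it may be closer to another cluster.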

Figure 1—figure supplement 3. Overview of study methodology.

Dashed arrows indicate non-critical elements of our method.