(a, b) K-means clustering with cosine distance, (c, d) spectral clustering with cosine distance, and (e, f) hierarchical clustering with Euclidean distance and Ward linkage across 1,000 training set replicates (N = 284). In (g, h) we show the original analysis using hierarchical clustering with cosine distance and average linkage (see Methods for more details). Boxplots show distribution of clinical symptom z-scores (superimposed bar graphs depict the median) for social affect, repetitive, restrictive behaviors and interests (RRB), verbal IQ, and total severity (color indicates subgroup). Plots include 284 subjects x 1,000 training sets to indicate distribution of clinical behaviors across all 1,000 training set cluster assignments. Box bounds: [25th,75th percentile]; center: median; whiskers: 99.3% data in + /−2.7 σ; outliers: circles). Heatmaps show patterns of mean atypical connectivity across replicates in each subgroup across brain regions (rows) and functional networks (columns), and were thresholded for significant atypical connectivity (two-sided Welch’s t-test, mean FDR < 0.05), evaluated relative to N = 907 neurotypical controls. See additional comparisons in Supplementary Fig. 9.