Fig. 1.
Methodology outline. Schematic of the methodology, from variant call format (VCF) files to the 2D t-SNE plot. a Genomic Data Structure (GDS) file, converted from VCF, for each gene. The number of variants may vary between genes, but the number of individuals is constant (N = 765). b Individual-by-individual distance matrix calculated for each gene (765 × 765; 292,230 unique pairwise distances). c Table containing the linearized distance matrix values for 11,318 genes (11,318 × 292,230 values). d 2D t-SNE plot generated from the table of linearized distances for all genes.
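
The step from panel b to panel c can be illustrated with a minimal sketch. It assumes that "linearized" means flattening the strict upper triangle of each symmetric distance matrix, which is consistent with the 292,230 unique distances reported for 765 individuals (765 × 764 / 2). The function names, gene labels, and toy distance values are hypothetical and are not taken from the study's code; the t-SNE step itself is omitted.

```python
from itertools import combinations


def n_unique_pairs(n_individuals: int) -> int:
    """Number of unordered pairs of distinct individuals: n(n-1)/2."""
    return n_individuals * (n_individuals - 1) // 2


def linearize(distance_matrix):
    """Flatten the strict upper triangle of a symmetric matrix, row by row."""
    n = len(distance_matrix)
    return [distance_matrix[i][j] for i, j in combinations(range(n), 2)]


# Toy data: 4 individuals and 2 hypothetical genes (values are made up).
gene_matrices = {
    "geneA": [[0, 1, 2, 3], [1, 0, 4, 5], [2, 4, 0, 6], [3, 5, 6, 0]],
    "geneB": [[0, 2, 2, 1], [2, 0, 3, 3], [2, 3, 0, 4], [1, 3, 4, 0]],
}

# One row per gene, one column per pair of individuals (as in panel c).
table = {gene: linearize(m) for gene, m in gene_matrices.items()}

pairs_in_study = n_unique_pairs(765)  # matches the count given in panel b
```

With the dimensions in the caption, the same construction would yield one row of 292,230 values for each of the 11,318 genes.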