Author manuscript; available in PMC: 2014 Sep 1.

Published in final edited form as: Nat Genet. 2014 Feb 2;46(3):310–315. doi: 10.1038/ng.2892

Relationship of scaled C-scores and categorical variant consequences. The upper panel shows the proportion of substitutions with a specific consequence for each scaled C-score bin, while the middle panel shows the proportion of substitutions with a specific consequence after first normalizing by the total number of variants observed in that category. The legend indicates the median and range of scaled C-score values for each category. Consequences are obtained from the Ensembl Variant Effect Predictor¹⁶ (Supplementary Note); for example, “noncoding change” refers to changes in annotated non-coding transcripts. Detailed counts of functional assignments in each C-score bin are in Supplementary Table 8. The lower panel shows violin plots of the median C-scores of potential nonsense (stop-gained) variants for genes that: harbor at least 5 known pathogenic mutations⁴⁸ (“disease”); are predicted to be “essential”²³; harbor variants associated with complex traits⁴¹ (“GWAS”); harbor at least 2 loss-of-function mutations in 1000 Genomes⁴⁹ (“LoF”); encode olfactory receptor proteins; or are in a random selection of 500 genes (“Other”; see Supplementary Note).
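
The two proportions described for the upper and middle panels can be illustrated with a minimal sketch. It reflects one reading of the caption, not the authors' code: the middle panel is assumed to divide each count by its category total and then renormalize within each bin. The function names, bin labels, and toy counts are all hypothetical.

```python
from collections import defaultdict


def bin_proportions(counts):
    """Share of each consequence within each bin (upper-panel style).

    counts maps bin label -> {consequence: count or weight}.
    """
    result = {}
    for bin_label, by_consequence in counts.items():
        total = sum(by_consequence.values())
        result[bin_label] = (
            {c: n / total for c, n in by_consequence.items()} if total else {}
        )
    return result


def category_normalized_proportions(counts):
    """Same shares after scaling each count by its category total.

    Assumed reading of the middle panel: divide every count by the total
    number of variants in that consequence category, then renormalize
    within each bin so the shares sum to 1.
    """
    category_totals = defaultdict(int)
    for by_consequence in counts.values():
        for c, n in by_consequence.items():
            category_totals[c] += n
    scaled = {
        bin_label: {
            c: n / category_totals[c]
            for c, n in by_consequence.items()
            if category_totals[c]
        }
        for bin_label, by_consequence in counts.items()
    }
    return bin_proportions(scaled)


# Toy counts, invented for illustration only (not taken from the paper).
toy_counts = {
    "0-10": {"intergenic": 90, "missense": 10},
    "10-20": {"intergenic": 10, "missense": 30},
}

upper = bin_proportions(toy_counts)
middle = category_normalized_proportions(toy_counts)
```

In the toy data, missense variants make up only 10% of the lowest bin by raw share, but their share rises once the much larger intergenic category is scaled down. This is the kind of effect that normalizing by category size is meant to expose.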