Skip to main content
[Preprint]. 2024 Apr 10:2023.05.19.541520. Originally published 2023 May 21. [Version 2] doi: 10.1101/2023.05.19.541520

Figure 3: GeneBayes estimates of shet perform well at identifying constrained and unconstrained genes.

Figure 3:

A) Precision-recall curves comparing the performance of shet against other methods in classifying essential genes (left: all genes, right: quartile of genes with the fewest expected unique LOFs). B) Precision-recall curves comparing the performance of shet against LOEUF in classifying developmental disorder genes. C) Scatterplots showing the enrichment (respectively depletion) of the top 10% most (respectively least) constrained genes in HPO terms, with genes ranked by shet (y-axis) or LOEUF (x-axis). D) Enrichment of de novo mutations in patients with developmental disorders, calculated as the observed number of mutations over the expected number under a null mutational model. We plot the enrichment of synonymous, missense, splice, and nonsense variants in the 10% most constrained genes, ranked by shet (blue) or LOEUF (orange); or enrichment in the remaining genes, ranked by shet (green) or LOEUF (brown). Bars represent 95% confidence intervals. E) Left: LOESS curve showing the relationship between constraint (gene rank, x-axis) and absolute log fold change in expression between chimp and human cortical cells (y-axis). Genes are ranked by shet (blue) or LOEUF (orange). Right: LOESS curve showing the relationship between constraint (gene rank, x-axis) and gene expression variation in GTEx samples after controlling for mean expression levels.