
[Preprint]. 2024 Jul 8:2023.07.03.23292162. Originally published 2023 Jul 6. [Version 2] doi: 10.1101/2023.07.03.23292162

PMC10350132.1; 2023 Jul 6
PMC10350132.2; 2024 Jul 8


This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.


Figure 6. HTT repeat structures show varied prevalence across genetic ancestries and are associated with CAG repeat size. A) Allele structures observed within exon 1 of HTT. The CAG repeat is denoted as “Q1” and marked in gold. The CAACAG unit is referred to as “Q2” and is marked in green. The first proline-encoding “CCGCCA” repeat element is referred to as “P1” and is marked in purple. B) Bar plots show the prevalence of each allele structure (x-axis) within each of the studied genetic ancestries (y-axis). The number of alleles in each genetic ancestry is denoted as “N=...” at the corresponding y-axis tick. C) Boxplots display the distribution of CAG repeat sizes across the different repeat structures. Each boxplot shows the median (horizontal line in the centre of the box) and the interquartile range (box bounds); black dots mark values outside 1.5 times the interquartile range. The repeat structures are separated on the x-axis and the repeat size is shown on the y-axis. The number of alleles with each repeat structure is denoted as “N=...” on the x-axis. A linear model was used to compare the repeat size distribution of the canonical alleles with that of all atypical structures. Pairwise comparisons used Kruskal-Wallis tests with Dunn’s correction for multiple comparisons; the resulting p-values are displayed above each structure (*** < 0.001; * < 0.05). Q2 versus canonical (p-value = 6.4×10⁻³²); Q2 versus partial Q2 loss (p-value = 3.5×10⁻²); Q2 duplication versus P1 loss (p-value = 5.9×10⁻⁹⁸); Q2 duplication versus Q2 loss (p-value = 8.5×10⁻¹⁶); Q2 duplication versus Q2-P1 loss (p-value = 6.2×10⁻²⁰); canonical versus P1 loss (p-value = 2.4×10⁻⁸⁰); canonical versus Q2 loss (p-value = 2.8×10⁻⁸); canonical versus Q2-P1 loss (p-value = 1.2×10⁻¹²); P1 loss versus Q2 loss (p-value = 2.8×10⁻²); P1 loss versus Q2-P1 loss (p-value = 5.6×10⁻⁶).
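
The boxplot convention described for panel C (median, interquartile range, and points beyond 1.5 times the interquartile range drawn as dots) can be illustrated with a minimal Python sketch. The function name `boxplot_summary` and the example values are hypothetical and are not taken from the study. The caption does not say which quartile convention the plotting software used, so linear interpolation between data points is assumed here.

```python
from statistics import quantiles


def boxplot_summary(repeat_sizes):
    """Return quartiles, median, and values lying beyond 1.5 x IQR from the box."""
    # Assumption: quartiles by linear interpolation over the observed range.
    q1, median, q3 = quantiles(repeat_sizes, n=4, method="inclusive")
    iqr = q3 - q1
    lower = q1 - 1.5 * iqr
    upper = q3 + 1.5 * iqr
    # Values outside [lower, upper] are the ones a boxplot draws as dots.
    outliers = [x for x in repeat_sizes if x < lower or x > upper]
    return {"q1": q1, "median": median, "q3": q3, "outliers": outliers}


# Hypothetical CAG repeat sizes for one repeat structure (illustration only).
example_sizes = [15, 17, 17, 18, 19, 20, 21, 36]
summary = boxplot_summary(example_sizes)
print(summary)
```

With these made-up values, only the allele with 36 repeats falls outside the 1.5 times interquartile range fence and would appear as a black dot.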