Abstract
The ubiquitin fold modifier 1 (UFM1) cascade is a recently identified evolutionarily conserved ubiquitin-like modification system whose function and link to human disease have remained largely uncharacterized. By using exome sequencing in Finnish individuals with severe epileptic syndromes, we identified pathogenic compound heterozygous variants in UBA5, encoding an activating enzyme for UFM1, in two unrelated families. Two additional individuals with biallelic UBA5 variants were identified from the UK-based Deciphering Developmental Disorders study and one from the Northern Finland Intellectual Disability cohort. The affected individuals (n = 9) presented in early infancy with severe irritability, followed by dystonia and stagnation of development. Furthermore, the majority of individuals display postnatal microcephaly and epilepsy and develop spasticity. The affected individuals were compound heterozygous for a missense substitution, c.1111G>A (p.Ala371Thr; allele frequency of 0.28% in Europeans), and a nonsense variant or c.164G>A that encodes an amino acid substitution p.Arg55His, but also affects splicing by facilitating exon 2 skipping, thus also being in effect a loss-of-function allele. Using an in vitro thioester formation assay and cellular analyses, we show that the p.Ala371Thr variant is hypomorphic with attenuated ability to transfer the activated UFM1 to UFC1. Finally, we show that the CNS-specific knockout of Ufm1 in mice causes neonatal death accompanied by microcephaly and apoptosis in specific neurons, further suggesting that the UFM1 system is essential for CNS development and function. Taken together, our data imply that the combination of a hypomorphic p.Ala371Thr variant in trans with a loss-of-function allele in UBA5 underlies a severe infantile-onset encephalopathy.
Main Text
Post-translational modifications through attachment of ubiquitin or ubiquitin-like proteins (UBLs) are involved in various biological processes.1 There are eight human UBL-conjugating systems, in which each UBL is attached to specific proteins or lipids usually through three-step cascades involving E1-, E2-, and E3-like enzymes.2 UBA5 is an E1-like (activating) enzyme for the most recently identified UBL, UFM1 (Figure 1A).4 All molecules (i.e., UFM1, UBA5, UFC1, UFL1, and UFSP2) involved in conjugation of UFM1 to intracellular proteins (ufmylation) are conserved in metazoa and plants, but not in yeast, suggesting important roles in multicellular organisms. In fact, the ubiquitously expressed UFM1 system has an essential role in erythroid differentiation in mice,7, 8 plays a crucial role in breast cancer development,5 and is implicated in cellular stress response8, 9, 10, 11 (UFM1 system is reviewed in Daniel and Liebau12). However, the role of the UFM1 system in the central nervous system (CNS) has not been studied, and the mechanism by which UFM1 system executes its functions is largely unknown.
In this study, we identified pathogenic compound heterozygous UBA5 (MIM: 610552) variants in nine affected individuals who are from five unrelated families and show early infantile-onset encephalopathy. Functional analysis of the mutants suggests that reduced UBA5 activity leading to impaired UFM1 system underlies this syndrome.
As part of a study that aimed to identify genetic causes underlying severe infantile-onset epileptic syndromes in 30 Finnish individuals (A.L., unpublished data), we exome sequenced an index case (A-4) and his parents in family A (Figure 1B). Informed consent was obtained from the parents and the study was approved by an institutional review board at the Helsinki University Central Hospital. Whole-exome capture (Agilent SureSelect Human All Exon 50Mb V3), sequencing (Illumina HiSeq 2000; performed at the Wellcome Trust Sanger Institute), sequence read alignment to hs37d5 reference genome (based on GRCh37), and variant calling (Genome Analysis Tool Kit [GATK] HaplotypeCaller v. 3.3)13, 14, 15 was done as described previously with minor modifications.16 Given that the index case subject A-4 has an affected sister (A-6) in family A (Figure 1B), we analyzed the exome data of A-4 (see Table S1 for sequencing metrics) primarily for rare (<1% allele frequency) potentially deleterious autosomal-recessive variants including missense, nonsense, splice site, in-frame insertion and deletion, and frameshift variants based on Variant Effect Predictor17 annotations in CCDS genes (Ensembl release 78). We also assessed the possibility of parental mosaicism by calling de novo variants using DeNovoGear,18 and additionally, we analyzed heterozygous, potentially deleterious variants absent from population variant databases separately without using the de novo variant caller. The following databases were used to determine population allele frequencies: the ∼60,000 exomes of the Exome Aggregation Consortium (ExAC; v.0.3),19 phase 3 release of the 1000 Genomes project20 (2,535 individuals), and Exome Variant Server (EVS) of the NHLBI GO Exome Sequencing Project (v.0.0.25; 6,503 individuals). All candidate variants were confirmed by bidirectional Sanger sequencing, and segregation analysis was performed on available family members.
The only variants passing the filtering under recessive model in affected individual A-4 were compound heterozygous missense variants c.164G>A (p.Arg55His; affects also splicing, see below) and c.1111G>A (p.Ala371Thr) in UBA5 (GenBank: NM_024818.3, Ensembl: ENST00000356232; Figures 1B–1E and S1A, Tables S2 and S3). Sanger sequencing of additional family members revealed that these variants were compound heterozygous also in the affected sister but not in two unaffected sisters (Figure 1B). Analysis for variants causing the disease in the two siblings due to parental mosaicism did not yield any candidate variants (data not shown).
In addition to the above-mentioned cohort of 30 affected individuals sequenced at the Wellcome Trust Sanger Institute, we have exome sequenced and analyzed in-house six additional families (five of which are Finnish) with severe epileptic syndromes. One of these (family B; Figure 1B), ascertained by clinicians (H.R., M.P., R.K.-F., and R.H.) at the Oulu University Hospital Finland, has four affected individuals with clinical features similar to the affected individuals in family A. After informed consent was given by parents of family B, exome capture (Nimblegen SeqCap EZ Human Exome Library v.2.0), sequencing (Illumina HiSeq 1500; performed at the Institute for Molecular Medicine Finland), sequence read alignment to hg19 (GRCh37), and variant calling (samtools)21 was done as described previously with minor modifications.22 Exome sequencing of two affected siblings (B-3 and B-8), their parents, and one unaffected sibling (B-4) revealed compound heterozygous UBA5 variants c.855C>A (p.Tyr285Ter) and c.1111G>A (p.Ala371Thr) in the affected siblings (Figures 1B–1E and S1B, Tables S1–S3). No other variants passed filtering under the recessive model (Table S2). No candidate variants were identified when assessing the possibility of parental mosaicism (data not shown). Sanger sequencing of one additional affected individual and two additional unaffected siblings confirmed autosomal-recessive segregation of the UBA5 variants with the disease in the family (Figure 1B).
To attempt to identify additional individuals with biallelic UBA5 variants, we used GeneMatcher website23 and accessed data on 178 exomes or whole genomes of epileptic encephalopathy cases generated in EuroEPINOMICS Rare Epilepsy Syndromes consortium, 455 exomes from the Northern Finland Intellectual Disability cohort, as well as the exomes from the UK-based Deciphering Developmental Disorders (DDD) study.24, 25 In the DDD study with more than 4,000 exome-sequenced families with developmental disorders, of which 3,072 were without genetic diagnosis after initial analysis,24 we identified two unrelated individuals (C-4 of British and D-3 of Northern Irish and Romanian ancestry; Figure 1B) who are similarly affected to siblings in families A and B and are compound heterozygous for rare UBA5 variants. Exome data of C-4 and D-3 did not reveal other plausible candidate variants (data not shown). Families C and D were included in the DDD study after informed consent, and the study has an UK Research Ethics Committee approval (10/H0305/83 granted by the Cambridge South REC and GEN/284/12 granted by the Republic of Ireland REC). Both C-4 and D-3 are compound heterozygous for the c.1111G>A (p.Ala371Thr) variant and a nonsense variant (C-4: c.562C>T [p.Arg188Ter]; D-3: c.181C>T [p.Arg61Ter]) (Figures 1B and S2, Table S3). Finally, within the Northern Finland Intellectual Disability cohort, we identified one affected individual (E-3) compound heterozygous for c.1111G>A (p.Ala371Thr) and c.562C>T (p.Arg188Ter), the same variant combination as in C-4 (Figures 1B and S2, Table S3).
The p.Ala371Thr substitution, present in heterozygous state in all five families, has an allele frequency of 0.19% in the total of approximately 60,000 individuals in the ExAC database, 0.28% in non-Finnish Europeans, and 0.46% in Finns, with no homozygotes identified. There is one heterozygous carrier for p.Arg55His in ExAC and two heterozygous carriers for p.Arg188Ter in EVS, whereas the p.Arg61Ter and p.Tyr285Ter changes are novel. The missense substitutions occur at residues conserved down to C. elegans (p.Ala371Thr) or A. thaliana (p.Arg55His; Figure 1E) and are predicted to be deleterious by all four in silico methods used (Table S3).
Because p.Ala371Thr is present at low frequency in the general population, we assessed the probability for observing rare (<1%) biallelic UBA5 variants in five unrelated families in the study populations. We employed a recently established method, recessiveStats, developed as part of the DDD study,24 in which for a given gene, the number of unrelated cases with one of three possible biallelic genotype classes is tallied after filtering exome variant data for rare (<1%) variants. The three biallelic genotype classes are loss-of-function/loss-of-function, loss-of-function/functional, and functional/functional, with functional variants defined as those altering amino acid sequence but not likely to cause complete loss of function, e.g., missense variants. Then the probability to observe n number of unrelated individuals with such biallelic genotypes in the study cohort is determined after calculating the expected frequency of the genotypes based on the summed allele frequencies of rare (<1%), functional, or loss-of-function variants in the ExAC database (see original publication24 for methodological details and Table S4 for analysis details in this study). The statistical analysis was performed separately for the three genotype classes. For observing four independent families (B–E) with compound heterozygous functional (p.Ala371Thr) and loss-of-function variants, we obtained a p value of 3.30 × 10−10, strongly indicating that our datasets are enriched for causal, compound heterozygous UBA5 variants (Table S4). The p value remains statistically significant after conservative correction for multiple testing for the three genotype classes and for the total number of annotatable protein coding genes based on GENCODE release 19 (0.05/3/17,370 = 9.60 × 10−7). Notably, even though affected siblings in family A were not tallied in the functional/loss-of-function genotype class, they have in effect a combination of a missense and a loss-of-function variant, because the second UBA5 variant in this family, c.164G>A (p.Arg55His), encodes UBA5 mutant with severely reduced enzymatic activity and also affects splicing (see below). Finally, to exclude the presence of p.Ala371Thr/loss-of-function variant genotype in the general population, we have queried genotype information in >75,000 exomes of control individuals with no severe pediatric diseases (60,706 exomes in ExAC, 10,490 Finnish exomes of which ∼7,000 are not in ExAC, and ∼8,000 parents from the DDD study). In these datasets, none of the few carriers of loss-of-function variants in UBA5 also have p.Ala371Thr.
All affected individuals in non-consanguineous families A–E presented in early infancy with irritability, and most had pronounced dystonic movements (e.g., axial hyperextension, head version, tonic upward gaze deviation, and pronation of arms and legs) and truncal hypotonia and they later developed spasticity (Table S5). In families A, B, C, and E, epileptic seizures, including myoclonic jerks (family A and B) and infantile spasms (A–C), started in infancy. Individual D-3, who is currently 5 years and 7 months of age, has not had seizures and EEG has been normal. The affected individuals did not reach any motor milestones and had severe intellectual disability, besides D-3 who is considered to have moderate intellectual deficit. The siblings in family B and individual E-3 also showed progressive growth failure. All individuals developed progressive microcephaly, besides E-3 who had occipito-frontal circumference of −2 SD at 9 years, which does not reach the applied diagnostic threshold for microcephaly of −3 SD. The affected individuals did not share any distinctive dysmorphic features, but individuals A-4 and C-4 presented with expressionless face, tented upper lip, long and deep philtrum, micrognathia, and puffy hands and feet with tapering fingers (Figure S3). Brain MRI (performed in seven individuals) findings were subtle and included mildly delayed myelination, slight T2-hyperintensity in thalami (Figure S4), thalamic volume reduction, and mild cerebral or cerebellar atrophy. Four out of nine affected individuals have died at 5, 12, 16, and 21 years of age, and five, aged 3–42 years, are alive.
All affected individuals had normal karyotype. Microarray-based analyses on individuals A-4, A-6, C-4, and D-3 had yielded inherited microdeletions or microduplications interpreted as benign. Sequencing of the SLC16A2 (MIM: 300095) and ARX (MIM: 300382) in case A-4 had not revealed pathogenic variants.
The neuropathological findings in A-4, B-5, B-8, and B-10 were mild and nonspecific (data not shown). White matter volume seemed slightly reduced and corpus callosum was somewhat thin, but no signs of an ongoing leukodystrophy could be found. Slight diffuse atrophy could be seen in thalami, brain stem, and cerebellum. The density of especially mediodorsal and reticular neurons was diminished in thalami, and some postnecrotic calcified neurons were seen. In the mesencephalon there was slight neuronal loss and vacuolization in the periaqueductal gray matter, pontine basis was slightly thinned, and cerebello-olivary fibers were thinned in medulla oblongata. Cerebellar cortex showed slightly narrowed molecular and granular cell layers and marginal drop-out of Purkinje cells. Dentate nucleus harbored several pyknotic neurons. Both the clinical features and neuropathological findings were non-specific, resembling the pattern seen in various progressive encephalopathies.
We studied the consequence of the UBA5 variants on RNA level in primary skin fibroblasts obtained from affected individuals A-4 and B-3, who are both heterozygotes for the missense substitution c.1111G>A (p.Ala371Thr). The other variant in B-3, the nonsense variant c.855C>A (p.Tyr285Ter) in the exon 9 (Figure 1C), is predicted to result in nonsense-mediated decay (NMD). As expected, we noted an approximately 50% reduction in UBA5 mRNA levels in B-3 fibroblasts (Figure 2A). Sequencing of UBA5 cDNA in B-3 fibroblasts revealed that the only allele present on RNA level is c.1111G>A (Figure S5A). We observed an approximately 30% reduction in mRNA levels also in fibroblasts of A-4 (Figure 2A), whose other heterozygous variant is c.164G>A. This change, which is predicted to cause the p.Arg55His substitution, occurs in the third nucleotide in the 5′ end of exon 2 (Figure S5B) and thus possibly affects splicing. Capillary sequencing of cDNA amplified using primers in exons 1 and 3 implied that the minority of sequences are from the c.164G>A mutant allele (Figure S5B). Suggesting that the variant facilitates exon 2 skipping, RT-PCR showed a weak band whose size and sequence corresponds to a UBA5 transcript where exon 2 is skipped (Figures S5C and S5D). This would lead to a frameshift and consequent NMD. Confirming further that the UBA5 allele with c.164G>A is expressed at lower levels, cDNA sequencing of exon 11 revealed that the majority of UBA5 mRNA has c.1111G>A (Figure S5B).
We observed that expression of not only the UBA5 mRNA but also the UBA5 protein was lower in fibroblasts of affected individuals than of control subjects (Figure 2B). By immunoblot analysis with non-reducing samples, we detected the intermediates of UFM1-UBA5 and of UFM1-UFC1. As expected, the level of UFM1-UBA5 intermediates was significantly lower in fibroblasts of affected individuals compared to control subjects (Figure 2B). Likewise, the formation of the UFM1-UFC1 intermediate declined (Figure 2B). Surprisingly, UFM1 conjugates were hardly detected in human fibroblasts regardless of the genotypes (Figure 2B). We confirmed a similar pattern in mouse embryonic fibroblasts (Figure 2C).
To test whether the missense variants affect the E1-like activity of UBA5 in cells, we used a UBA5 mutant in which the active site, cysteine (Cys250), was substituted with serine (termed UBA5Cys250Ser). When the cysteine residue at the active site of E1 and E2 enzymes is replaced with serine, an O-ester bond instead of a thioester bond is formed with its respective modifier proteins, and the intermediates become stable even under reducing conditions.4 In addition, to exclude the effect of endogenous UBA5, UBA5 in HEK293T cells was deleted by CRISPR/Cas9 technology (Figures 3A and S6). We expressed a FLAG-tagged UBA5Cys250Ser (FLAG-UBA5Cys250Ser) together with MYC-tagged UFM1ΔC2, a mature form of UFM1 with the glycine residue at the C terminus (Figure 1A), in UBA5-deficient HEK293T cells, and analyzed the cell lysates by immunoblot assay. An intermediate between FLAG-UBA5Cys250Ser and MYC-UFM1ΔC2 was clearly recognized (Figure 3A). We next analyzed UBA5 constructs carrying both p.Cys250Ser and one of the missense variants observed in affected individuals (FLAG-UBA5Arg55His/Cys250Ser or FLAG-UBA5Ala371Thr/Cys250Ser). FLAG-UBA5Arg55His/Cys250Ser had lower ability to form the intermediate with MYC-UFM1ΔC2 than UBA5Cys250Ser (Figure 3A). With FLAG-UBA5Ala371Thr/Cys250Ser, we did not observe a statistically significant decrease in the intermediate formation (Figure 3A). Next, we examined the effect of UBA5 mutants on transferring the activated UFM1 to the E2 enzyme UFC1 (see Figure 1A) because the Ala371 residue is located within the C-terminal transthiolation domain critical for the transfer of UFM1 to UFC1 (Figure 1D).26 We detected the intermediate between FLAG-UFC1Cys116Ser and MYC-UFM1ΔC2 when wild-type UBA5 was expressed in UBA5−/− HEK293T cells (Figure 3B). By contrast, in the case of expressing either UBA5Arg55His or UBA5Ala371Thr mutant, formation of such intermediates was suppressed (Figure 3B).
In good agreement with these results, in vitro thioester formation assay with recombinant proteins revealed the decreased E1 activity of UBA5Arg55His and UBA5Ala371Thr mutants. As shown in Figure 3C, the UBA5-UFM1ΔC2 intermediate was formed in an ATP-dependent manner in vitro. Although both mutants still had ability to form the intermediate, their activity was weaker compared to that of wild-type UBA5 (Figure 3C). In addition, the UBA5Arg55His and UBA5Ala371Thr mutants failed to transfer the activated UFM1 to UFC1 in the initial stage of reaction (Figure 3D). In vitro kinetics analysis of multiple time points up to 60 min of reaction revealed that the UFC1-UFM1ΔC2 intermediate was formed in the case of both mutants, but that, compared to wild-type UBA5, their reaction rates were slower (Figure S7). Whereas the activity of UBA5Arg55His was approximately half of that with wild-type UBA5, UBA5Ala371Thr had approximately 70%–80% activity (Figure S7).
Finally, we examined whether UBA5Arg55His and UBA5Ala371Thr mutants suppress the UFM1 conjugate formation in cells. HEK293T cells deficient for both UBA5 and UFSP2, of which latter is the unique de-conjugating enzyme for UFM1 conjugates (see Figure 1A), were generated by CRISPR/Cas9 technology (UBA5−/−;UFSP2−/−; Figure S8). As shown in Figure 3E, only a few UFM1 conjugates were detected when we expressed E3-ligating enzyme UFL1, its adaptor protein UFBP15 (see Figure 1A), and UFM1ΔC2 together with wild-type UBA5 in single knockout UBA5−/− HEK293T cells. Remarkably, concomitant loss of the de-conjugating enzyme UFSP2 dramatically increased both number and amount of UFM1 conjugates in the UBA5-deficient HEK293T cells, indicating that most UFM1 conjugates are de-conjugated by UFSP2. We next evaluated the effect of UBA5 mutants (p.Arg55His or p.Ala371Thr) in this experimental setting. The UFM1 conjugate formation in the UBA5−/−;UFSP2−/− double knock-out (DKO) cells expressing UBA5Arg55His was markedly suppressed (Figure 3F) (p = 0.002; ratio of actin-normalized quantity of UFM1 conjugates with UBA5Arg55His to that with wild-type: 0.77, with 95% CI 0.72–0.82), whereas UBA5Ala371Thr only showed suggestive decrease (p = 0.056; ratio of actin-normalized quantity of UFM1 conjugates with UBA5Ala371Thr to that with wild-type: 0.91, with 95% CI 0.79–1.02) (Figure 3F). Because ufmylation of UFBP1 is required for its tight binding to UFL1 and in turn for promotion of the E3 ligase activity,5 we further examined the level of the UFM1-UFBP1 conjugate in the DKO cells. We found that the conjugate formation was impaired in both DKO cells expressing either UBA5Arg55His (p = 0.0005; ratio of actin-normalized quantity of UFM1-UFBP1 conjugate with UBA5Arg55His to that with wild-type: 0.76, with 95% CI 0.70–0.82) or UBA5Ala371Thr (p = 0.013; ratio of actin-normalized quantity of UFM1-UFBP1 conjugate with UBA5Ala371Thr to that with wild-type: 0.86, with 95% CI 0.79–0.93) (Figure 3G). Taken together, we conclude that the UBA5 mutants exhibit decreased E1 activity with attenuated ability to transfer the activated UFM1 to UFC1, which might cause impaired UFM1 conjugate formation, with p.Arg55His exhibiting more pronounced defects.
Germline Uba57 or Ufm1 (M.K., unpublished data) knockout is embryonic lethal in mice. We generated conditional knockout mice for Ufm1 (Ufm1f/f) and crossed them with transgenic mice expressing Cre recombinase under the control of the nestin promoter27 (nestin-Cre) to create CNS-specific Ufm1 knockout (Ufm1f/f;nestin-Cre) mice (Figure S9). The Ufm1f/f;nestin-Cre mice allowed us to examine the neuronal pathology associated with deficiency of the UFM1 system in vivo. At embryonic day (E) 14.5, UFM1 protein was almost absent in brain of Ufm1f/f;nestin-Cre but not of control (Ufm1f/+;nestin-Cre) mice (Figure S10). Although a few UFM1 conjugates were detected in brains of control mice by immunoblotting with an anti-UFM1 antibody at E18.5, they were weak or undetectable in brains of mutant mice (Figure 4A). These results indicate impairment of the UFM1 conjugation system in the CNS of Ufm1f/f;nestin-Cre mice. Ufm1f/f;nestin-Cre mice were viable at birth and indistinguishable in appearance from their littermates. However, all Ufm1f/f;nestin-Cre mice died within 1 day of birth (Table S6). Macroscopic anatomical analysis of the brains of mice that were delivered by Caesarean section at E18.5 revealed that Ufm1f/f;nestin-Cre mice had microcephaly (Figure 4B). Both transverse and longitudinal distances in the mutant brains were significantly shorter than those of control brains (Figure 4B). Histological analysis using Karachi’s hematoxylin and eosin staining showed that some parts of the brain, such as the occipital region of neopallium, midbrain, and thalamus, of Ufm1f/f;nestin-Cre mice were consistently smaller than those of control mice (Figure 4C). However, no apparent abnormality in cellular organization was found in the mutant brain. To examine whether loss of UFM1 causes cell death, we carried out immunohistochemical analysis using an antibody against cleaved Caspase-3, a hallmark of apoptosis. A marked increase in the number of cleaved Caspase-3-positive cells was noted in the occipital region of neopallium in the Ufm1f/f;nestin-Cre mice at E18.5 (Figure 4D), compared with control mice. Double immunofluorescence analysis with antibodies against βIII Tubulin, a neuronal marker, and cleaved Caspase-3 revealed that neurons underwent apoptosis (Figure 4E). We hardly detected such apoptotic cells in other brain regions of Ufm1f/f;nestin-Cre mice at E18.5 (data not shown). Taken together, findings in the Ufm1f/f;nestin-Cre mice suggest that an intact UFM1 system is pivotal for neuronal development and survival.
Here, we combined genetic, statistical, phenotypic, and functional data to show that the hypomorphic p.Ala371Thr variant in trans with a loss-of-function change in UBA5 causes a severe, early-onset encephalopathy. The likelihood for observing biallelic missense/loss-of-function UBA5 variants only by chance in the assessed disease cohorts was determined to be negligible. The assessment of UBA5 in fibroblasts of the affected individuals and the biochemical analyses with mutant UBA5 proteins corresponding to the two missense variants indicated that the compound heterozygous status in the affected individuals is not accompanied by a complete loss of UBA5 function. The p.Ala371Thr variant, which is present in all five families with altogether nine affected individuals, encodes UBA5 with only mildly reduced enzymatic activity. In contrast, the impact of the c.164G>A (p.Arg55His) variant on UBA5 function in family A is more pronounced. This variant is causing aberrant splicing, but this effect does not appear to be complete in cells of the affected individuals. However, given the low enzymatic activity of p.Arg55His UBA5 mutant protein, the function of UBA5 can be predicted to be severely compromised from this allele also when the gene is normally spliced. The nonsense variant p.Tyr285Ter in family B seems to undergo NMD based on analysis of UBA5 mRNA expression and is thus likely to result in severely compromised enzymatic activity. It is likely that the gene product is degraded also in individuals C-4, D-3, and E-3 with nonsense variants. Thus, the affected individuals seem to have one variant with a major and one with a milder, hypomorphic effect on UBA5 function. It is plausible that biallelic, complete loss-of-function variants in UBA5 would not be compatible with life as observed in the Uba5−/− mice.7 Interestingly, the p.Ala371Thr variant with only mildly compromised UBA5 activity is present in considerable frequencies in Finns (0.46% allele frequency in ExAC) and other Europeans (0.28%). This implies that the syndrome associated with deficient UBA5 function should be encountered in the European population. Indeed, in addition to the five families identified in this study, individuals from two families of European ancestry with biallelic compound heterozygous variants in UBA5 and a disease manifestation comparable to the affected individuals in our study have the p.Ala371Thr variant in combination with a loss-of-function UBA5 variant (D. Bonneau, personal communication).
Notably, given its relatively mild effect on UBA5 function, the phenotypic consequences of p.Ala371Thr when occurring in homozygosity could be predicted to be mild. To search for individuals homozygous for p.Ala371Thr (no homozygotes were present in the databases used in exome data filtering, see above), we queried ∼10,490 exomes from various Finnish population-based and disease cohorts (see SISu in Web Resources). In the exome data from the FINRISK cohort,29 we identified one homozygous individual (V. Salomaa, personal communication) who is in his fifties and, based on data gathered from National Health registers, does not have any hospitalizations or chronic medication for any neurological or neuropsychiatric diseases. This observation suggests that, as described in this paper, p.Ala371Thr indeed contributes to a severe neurological phenotype only when it is in trans with a severe loss-of-function variant. This observation also has relevance for filtering strategies of exome variant data, because it exemplifies that homozygosity for a relatively rare missense variant in “control” individuals does not necessarily mean non-pathogenicity. During the review of this paper, Duan and colleagues reported a family with two siblings with compound heterozygous variants in UBA5 (p.Lys310Glu and p.Arg246Ter) and a childhood-onset neurological disease with ataxia as the primary symptom.30 Altogether, the existing data on different combinations of biallelic UBA5 variants both in humans and in mouse models suggest that the phenotypic consequences range from embryonic lethality (biallelic loss-of-function in knockout mice), severe infantile-onset encephalopathy (this study), childhood-onset neurological disease (Duan et al.30), to possibly no or very mild phenotype.
The symptoms in individuals with biallelic UBA5 variants are predominantly related to CNS. Previously, however, we and other groups have reported the function of the ubiquitously expressed UFM1 system outside the CNS.5, 7, 8, 9, 10, 11 For example, mice with a germline deletion of Uba5,7 Ufm1 (M.K., unpublished data), or Ddrgk1 (Ufbp1)8 are embryonic lethal and show a severe defect of erythroid differentiation. The affected individuals in our study, with partial loss of UBA5 function, do not present with anemia, suggesting that the remaining UBA5 activity is sufficient for normal hematopoiesis. Finally, to further support the role of UFM1 system beyond CNS, a suggestive association of a UFSP2 (MIM: 611482; UFM1-specific peptidase) variant with Beukes hip dysplasia was recently established.31
Consistent with imaging and neuropathological observations in affected individuals, the analysis of CNS-specific Ufm1 knockout mice revealed that dysfunction of the UFM1 system causes atrophy in several regions of the brain and results in neonatal death. Our data imply compromised UFM1 conjugate formation in the brains of the CNS-specific Ufm1 knockout mice with neuronal apoptosis in restricted regions (i.e., the occipital region of neopallium). Therefore it is possible that spatiotemporally regulated and cell-specific ufmylation is necessary not only for prevention of neuronal cell death but also for neuronal development. Further studies assessing the specific function of UFM1 conjugation to its target proteins in CNS and elsewhere are warranted.
Published: August 18, 2016
Footnotes
Supplemental Data include ten figures, seven tables, and Supplemental Acknowledgments and can be found with this article online at http://dx.doi.org/10.1016/j.ajhg.2016.06.020.
Contributor Information
Anna-Elina Lehesjoki, Email: anna-elina.lehesjoki@helsinki.fi.
Masaaki Komatsu, Email: komatsu-ms@med.niigata-u.ac.jp.
Accession Numbers
The raw aligned sequence reads of family A were submitted to the European Genome-phenome Archive by Wellcome Trust Sanger Institute under study accession IDs EGAS00001000190 and EGAS00001000386. Exome sequencing data from the DDD study are downloadable under study accession ID EGAS00001000775.
Web Resources
1000 Genomes, http://www.1000genomes.org
Clustal: Multiple Sequence Alignment, http://www.clustal.org/
European Genome-phenome Archive (EGA), https://www.ebi.ac.uk/ega
ExAC Browser, http://exac.broadinstitute.org/
GenBank, http://www.ncbi.nlm.nih.gov/genbank/
NHLBI Exome Sequencing Project (ESP) Exome Variant Server, http://evs.gs.washington.edu/EVS/
OMIM, http://www.omim.org/
SISu Project, www.sisuproject.fi
Variant Effect Predictor, http://useast.ensembl.org/Homo_sapiens/Tools/VEP
Supplemental Data
References
- 1.van der Veen A.G., Ploegh H.L. Ubiquitin-like proteins. Annu. Rev. Biochem. 2012;81:323–357. doi: 10.1146/annurev-biochem-093010-153308. [DOI] [PubMed] [Google Scholar]
- 2.Schulman B.A., Harper J.W. Ubiquitin-like protein activation by E1 enzymes: the apex for downstream signalling pathways. Nat. Rev. Mol. Cell Biol. 2009;10:319–331. doi: 10.1038/nrm2673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kang S.H., Kim G.R., Seong M., Baek S.H., Seol J.H., Bang O.S., Ovaa H., Tatsumi K., Komatsu M., Tanaka K., Chung C.H. Two novel ubiquitin-fold modifier 1 (Ufm1)-specific proteases, UfSP1 and UfSP2. J. Biol. Chem. 2007;282:5256–5262. doi: 10.1074/jbc.M610590200. [DOI] [PubMed] [Google Scholar]
- 4.Komatsu M., Chiba T., Tatsumi K., Iemura S., Tanida I., Okazaki N., Ueno T., Kominami E., Natsume T., Tanaka K. A novel protein-conjugating system for Ufm1, a ubiquitin-fold modifier. EMBO J. 2004;23:1977–1986. doi: 10.1038/sj.emboj.7600205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Yoo H.M., Kang S.H., Kim J.Y., Lee J.E., Seong M.W., Lee S.W., Ka S.H., Sou Y.S., Komatsu M., Tanaka K. Modification of ASC1 by UFM1 is crucial for ERα transactivation and breast cancer development. Mol. Cell. 2014;56:261–274. doi: 10.1016/j.molcel.2014.08.007. [DOI] [PubMed] [Google Scholar]
- 6.Tatsumi K., Sou Y.S., Tada N., Nakamura E., Iemura S., Natsume T., Kang S.H., Chung C.H., Kasahara M., Kominami E. A novel type of E3 ligase for the Ufm1 conjugation system. J. Biol. Chem. 2010;285:5417–5427. doi: 10.1074/jbc.M109.036814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tatsumi K., Yamamoto-Mukai H., Shimizu R., Waguri S., Sou Y.-S., Sakamoto A., Taya C., Shitara H., Hara T., Chung C.H. The Ufm1-activating enzyme Uba5 is indispensable for erythroid differentiation in mice. Nat. Commun. 2011;2:181. doi: 10.1038/ncomms1182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cai Y., Pi W., Sivaprakasam S., Zhu X., Zhang M., Chen J., Makala L., Lu C., Wu J., Teng Y. UFBP1, a key component of the Ufm1 conjugation system, is essential for ufmylation-mediated regulation of erythroid development. PLoS Genet. 2015;11:e1005643. doi: 10.1371/journal.pgen.1005643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hertel P., Daniel J., Stegehake D., Vaupel H., Kailayangiri S., Gruel C., Woltersdorf C., Liebau E. The ubiquitin-fold modifier 1 (Ufm1) cascade of Caenorhabditis elegans. J. Biol. Chem. 2013;288:10661–10671. doi: 10.1074/jbc.M113.458000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lemaire K., Moura R.F., Granvik M., Igoillo-Esteve M., Hohmeier H.E., Hendrickx N., Newgard C.B., Waelkens E., Cnop M., Schuit F. Ubiquitin fold modifier 1 (UFM1) and its target UFBP1 protect pancreatic beta cells from ER stress-induced apoptosis. PLoS ONE. 2011;6:e18517. doi: 10.1371/journal.pone.0018517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zhang Y., Zhang M., Wu J., Lei G., Li H. Transcriptional regulation of the Ufm1 conjugation system in response to disturbance of the endoplasmic reticulum homeostasis and inhibition of vesicle trafficking. PLoS ONE. 2012;7:e48587. doi: 10.1371/journal.pone.0048587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Daniel J., Liebau E. The ufm1 cascade. Cells. 2014;3:627–638. doi: 10.3390/cells3020627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.McKenna A., Hanna M., Banks E., Sivachenko A., Cibulskis K., Kernytsky A., Garimella K., Altshuler D., Gabriel S., Daly M., DePristo M.A. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Van der Auwera G.A., Carneiro M.O., Hartl C., Poplin R., Del Angel G., Levy-Moonshine A., Jordan T., Shakir K., Roazen D., Thibault J. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics. 2013;43:1–33. doi: 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.DePristo M.A., Banks E., Poplin R., Garimella K.V., Maguire J.R., Hartl C., Philippakis A.A., del Angel G., Rivas M.A., Hanna M. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 2011;43:491–498. doi: 10.1038/ng.806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Muona M., Berkovic S.F., Dibbens L.M., Oliver K.L., Maljevic S., Bayly M.A., Joensuu T., Canafoglia L., Franceschetti S., Michelucci R. A recurrent de novo mutation in KCNC1 causes progressive myoclonus epilepsy. Nat. Genet. 2015;47:39–46. doi: 10.1038/ng.3144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.McLaren W., Pritchard B., Rios D., Chen Y., Flicek P., Cunningham F. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics. 2010;26:2069–2070. doi: 10.1093/bioinformatics/btq330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ramu A., Noordam M.J., Schwartz R.S., Wuster A., Hurles M.E., Cartwright R.A., Conrad D.F. DeNovoGear: de novo indel and point mutation discovery and phasing. Nat. Methods. 2013;10:985–987. doi: 10.1038/nmeth.2611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lek M., Karczewski K., Minikel E., Samocha K., Banks E., Fennell T., O’Donnell-Luria A., Ware J., Hill A., Cummings B. Analysis of protein-coding genetic variation in 60,706 humans. bioRxiv. 2015 doi: 10.1038/nature19057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Abecasis G.R., Auton A., Brooks L.D., DePristo M.A., Durbin R.M., Handsaker R.E., Kang H.M., Marth G.T., McVean G.A., 1000 Genomes Project Consortium An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. doi: 10.1038/nature11632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., Subgroup G.P.D.P., 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Sulonen A.-M., Ellonen P., Almusa H., Lepistö M., Eldfors S., Hannula S., Miettinen T., Tyynismaa H., Salo P., Heckman C. Comparison of solution-based exome capture methods for next generation sequencing. Genome Biol. 2011;12:R94. doi: 10.1186/gb-2011-12-9-r94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Sobreira N., Schiettecatte F., Valle D., Hamosh A. GeneMatcher: a matching tool for connecting investigators with an interest in the same gene. Hum. Mutat. 2015;36:928–930. doi: 10.1002/humu.22844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Akawi N., McRae J., Ansari M., Balasubramanian M., Blyth M., Brady A.F., Clayton S., Cole T., Deshpande C., Fitzgerald T.W., DDD study Discovery of four recessive developmental disorders using probabilistic genotype and phenotype matching among 4,125 families. Nat. Genet. 2015;47:1363–1369. doi: 10.1038/ng.3410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Deciphering Developmental Disorders Study Large-scale discovery of novel genetic causes of developmental disorders. Nature. 2015;519:223–228. doi: 10.1038/nature14135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Xie S. Characterization, crystallization and preliminary X-ray crystallographic analysis of the human Uba5 C-terminus-Ufc1 complex. Acta Crystallograph. Sect. F Struct. Biol. Cryst. Commun. 2014;70:1093–1097. doi: 10.1107/S2053230X14014502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Tronche F., Kellendonk C., Kretz O., Gass P., Anlag K., Orban P.C., Bock R., Klein R., Schütz G. Disruption of the glucocorticoid receptor gene in the nervous system results in reduced anxiety. Nat. Genet. 1999;23:99–103. doi: 10.1038/12703. [DOI] [PubMed] [Google Scholar]
- 28.Horie M., Watanabe K., Bepari A.K., Nashimoto J., Araki K., Sano H., Chiken S., Nambu A., Ono K., Ikenaka K. Disruption of actin-binding domain-containing Dystonin protein causes dystonia musculorum in mice. Eur. J. Neurosci. 2014;40:3458–3471. doi: 10.1111/ejn.12711. [DOI] [PubMed] [Google Scholar]
- 29.Borodulin K., Vartiainen E., Peltonen M., Jousilahti P., Juolevi A., Laatikainen T., Männistö S., Salomaa V., Sundvall J., Puska P. Forty-year trends in cardiovascular risk factors in Finland. Eur. J. Public Health. 2015;25:539–546. doi: 10.1093/eurpub/cku174. [DOI] [PubMed] [Google Scholar]
- 30.Duan R., Shi Y., Yu L., Zhang G., Li J., Lin Y., Guo J., Wang J., Shen L., Jiang H. UBA5 mutations cause a new form of autosomal recessive cerebellar ataxia. PLoS ONE. 2016;11:e0149039. doi: 10.1371/journal.pone.0149039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Watson C.M., Crinnion L.A., Gleghorn L., Newman W.G., Ramesar R., Beighton P., Wallis G.A. Identification of a mutation in the ubiquitin-fold modifier 1-specific peptidase 2 gene, UFSP2, in an extended South African family with Beukes hip dysplasia. S. Afr. Med. J. 2015;105:558–563. doi: 10.7196/SAMJnew.7917. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.