Abstract
Immunoglobulin (Ig) diversification occurs via somatic hypermutation (SHM) and class switch recombination (CSR), and is initiated by activation-induced deaminase (AID), which converts cytosine to uracil. Variable (V) region genes undergo SHM to create amino acid substitutions that produce antibodies with higher affinity for antigen. The conversion of cytosine to uracil in DNA promotes mutagenesis. Two distinct DNA repair mechanisms regulate uracil processing in Ig genes. The first involves base removal by the uracil DNA glycosylase (UNG), and the second detects uracil via the mismatch repair (MMR) complex. Methyl binding domain protein 4 (MBD4) is a uracil glycosylase and an intriguing candidate for involvement in somatic hypermutation because of its interaction with the MMR protein MutL homolog 1 (MLH1). We found that the DNA uracil glycosylase domain of MBD4 is highly conserved among mammals, birds, sharks, and insects. Conservation of the human and chicken MBD4 uracil glycosylase domain structure is striking. Here we examined the function of MBD4 in chicken DT40 B cells, which undergo constitutive SHM. We constructed structural variants of MBD4 in DT40 cells using CRISPR/Cas9 genome editing. Disruption of the MBD4 uracil glycosylase catalytic region increased SHM frequency in IgM loss assays. We propose that MBD4 plays a role in SHM.
Keywords: Ig, somatic hypermutation, B cells, uracil DNA glycosylase, DT40, CRISPR
Introduction
Activation-induced deaminase (AID) is essential for both Ig somatic hypermutation (SHM) and class switch recombination (CSR) in mature B cells (1, 2). SHM increases diversification of V region genes and, when followed by selection, results in improved antibody:antigen binding affinities (1). CSR is responsible for diversification of Ig effector functions (3, 4). AID initiates SHM and CSR by deaminating deoxycytidine (dC) to deoxyuridine (dU), which is processed by base excision repair (BER) and mismatch repair (MMR) [reviewed in (4–6)]. The BER pathway facilitates the excision of dU bases by a uracil DNA glycosylase (UNG), leaving an abasic site that is cleaved by apurinic/apyrimidinic endonuclease (APE) to produce a single strand break. The MMR pathway also detects and excises mismatches generated by DNA replication and AID-mediated deamination, forming ssDNA nicks and gaps (6). During SHM the MMR pathway is co-opted to promote re-synthesis across the nicks and gaps using the error-prone polymerases Pol η and Pol ζ, resulting in nucleotide substitutions (7).
Four uracil DNA glycosylases are capable of recognizing and removing dU in U:G mismatches: UNG, SMUG1, TDG, and methyl binding domain protein 4 (MBD4) (8, 9). Genetic studies show that UNG deficiency in mice (10) and humans (11) reduces CSR by 90–95% and perturbs SHM mutation spectra but does not alter SHM frequency. SMUG1 plays little natural role in CSR since it is poorly expressed in activated B cells (12, 13). TDG cannot substitute for UNG during CSR in activated B cells (5). A recent study suggests that during SHM, TDG and SMUG1 provide uracil glycosylase activity in the absence of UNG (14).
We were intrigued by the association of MBD4 and AID in zebrafish (15) and its link to MMR through its interaction with MutL homolog 1 (MLH1) (16, 17). However, we and others found no role for MBD4 in CSR or SHM when Mbd4 5′ exons were deleted in mice (18, 19). In contrast, deletion of Mbd4 3′ exons has striking consequences for CSR in the CH12 lymphoma cell line (20). Our studies identified two isoforms of Mbd4 transcripts, the canonical long form and a new short isoform of Mbd4 that is retained in mice with 5′-Mbd4 deletions and may support uracil glycosylase activity (19, 20). The interaction between MBD4 and MLH1 has been postulated to play a role in the coordination of BER and MMR to rectify T:G and U:G mismatches (21). MBD4-MLH1 interaction has been confirmed in activated splenic B cells using co-immunoprecipitation studies (19). Interestingly, ~43% of primary human colorectal carcinomas, with and without microsatellite instability (MSI), also harbor inactivating mutations in Mbd4 (22–24). Examination of Mbd4 in these tumors revealed mutations that result in a predicted truncated protein lacking the C-terminal glycosylase domain and that are similar to the 3′ Mbd4 deletions we engineered into the CH12 clones. However, it was not possible to study SHM in the CH12 cell line.
Here, we explore the contribution of MBD4 to SHM in chicken B cells. To begin, MBD4 is highly conserved in mammals, birds and insects. The predicted three-dimensional structure of the chicken MBD4 uracil glycosylase domain is essentially identical to that of human, making it highly likely that it has functional activity. To test MBD4 in SHM we used the well-established chicken DT40 B cell line that has been engineered to allow only SHM to diversify V exons (25, 26). We constructed DT40 sub-lines in which segments of the Mbd4 3′ uracil glycosylase domain were deleted using CRISPR/Cas9 genome editing. Loss of the highly conserved amino acids in the uracil glycosylase domain led to an increase in SHM frequency in the DT40 deletion variants as compared to control cells. Our studies provide the first evidence of a role for MBD4 in SHM.
Methods
MBD4 Structural Analysis
Using the human MBD4 aa sequence as the reference, annotated MBD4 proteins from chicken, mouse, shark, platypus, coelacanth and aphid were identified using Uniprot. Multiple amino acid sequence alignments were performed, and homology to the human MBD4 protein was estimated, using Clustal Omega's multiple sequence alignment tool (https://www.ebi.ac.uk/Tools/msa/clustalo/). The human MBD4 glycosylase domain crystal structure was previously solved and was used here as a reference structure for comparison (27). The predicted three-dimensional (3D) structure of the chicken MBD4 glycosylase domain was derived using the SWISSMODEL workspace via the ExPASy web server (https://swissmodel.expasy.org/interactive) using the PDB template 4E9E human MBD4 glycosylase (27). The structural superimposition of human and chicken MBD4 glycosylase domains was performed using PyMOL. The predicted MBD4 structures in the CRISPR/Cas9 edited DT40 clones were derived using the same strategy and compared with chicken MBD4.
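The homology estimate described above rests on comparing residues column by column in an alignment. As a minimal sketch of that idea, percent identity for two pre-aligned sequences could be computed as follows. The function name `percent_identity` and the toy fragments are hypothetical illustrations (not MBD4 sequence), gaps are assumed to be written as "-", and Clustal Omega's own calculation may differ in detail.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # gapped columns are excluded from the comparison
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy, made-up aligned fragments used only to exercise the function.
toy_reference = "ACDE-GHIK"
toy_query = "ACDEFGH-R"
print(round(percent_identity(toy_reference, toy_query), 1))
```

Whether gapped columns are excluded or counted as mismatches changes the result, so the convention should be stated whenever a percentage is reported.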
DT40 Cells and Cell Culture
The DT40 control cells were AIDRψV− (28). All DT40 cells were maintained in culture at 39.5°C with 5% CO2 in RPMI 1640 (Corning) supplemented with 10% fetal bovine serum (Atlanta Biologicals), 2% Penicillin Streptomycin (Gibco), 1% L-glutamine (Gibco), 1% chicken serum (Sigma), and 0.1% β-mercaptoethanol (Sigma).
Generation of MBD4Δ/Δ Cells
The chicken Mbd4 gene in DT40 cells was disrupted in exon 5 at genomic coordinates chr12:19,878,301-19,881,713 (build GRCg6a). The target sequence (5′-CTGCACGGAATCGGAAAGTA-3′) for CRISPR/Cas9 editing was identified using www.crispr.mit.edu (no longer available). DT40 cells were CRISPR/Cas9 edited using two strategies. For plasmid-based delivery, sgRNA-CRISPR/Cas9 expression plasmids were constructed by cloning DNA oligonucleotides complementary to the Mbd4 gRNA (5′-CUGCACGGAAUCGGAAAGUA-3′) (IDT) into pX330 (29) (pX330-U6-Chimeric_BB-CBh-hSpCas9; Addgene plasmid #42230) as described (30). pX330 plasmids were transformed into DH5α (Thermo Fisher Scientific) and cloned inserts were validated by DNA sequencing using primer hU6-F: 5′-GAGGGCCTATTTCCCATGATT-3′. For ribonucleoprotein (RNP) based delivery, Alt-R S.p. Cas9 Nuclease V3 (1 μM) (IDT) was mixed with gRNA (2 μM) composed of Alt-R CRISPR-Cas9 crRNA (IDT) and Alt-R CRISPR-Cas9 tracrRNA conjugated to ATTO 550 dye (IDT) according to the manufacturer's instructions. Nucleofections were carried out when DT40 cells were 80–100% confluent using Amaxa Cell Line Nucleofector Kit T (Lonza) and program B-023 following the manufacturer's instructions. Cells were allowed to recover for 24 h, stained with anti-chicken IgM-PE (Southern Biotech) and then FACS (MoFlo Astrios) sorted for IgM+GFP+ cells. Cells nucleofected with RNP were sorted for IgM+GFP+Atto550+. Cells were subjected to limiting dilution to isolate subclones. Subclones were expanded for 10–14 days and genomic DNA was harvested using the alkaline lysis method. Indels in Mbd4 exon 5 were identified by size change of the PCR amplification product using primers F1: 5′-CAGTCCTGGTGGTTGGTTTT-3′ and R1: 5′-TGAGGCAGACTTGCAGAAGA-3′ and verified by DNA sequence analysis using the same primers. The MBD4Δ/Δ.14 clone was generated via the RNP method, and MBD4Δ/Δ.11 and MBD4Δ/Δ.12 were generated via the plasmid-based method.
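The DNA target and the gRNA quoted above should be the same 20-nucleotide sequence written in the DNA and RNA alphabets. A minimal sketch of that consistency check is given here; the helper name `dna_to_rna` is a hypothetical illustration, and the check says nothing about PAM placement or off-target scoring.

```python
def dna_to_rna(protospacer: str) -> str:
    """Write a DNA protospacer (5'->3') in the RNA alphabet by replacing T with U."""
    seq = protospacer.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("protospacer must contain only A, C, G, T")
    return seq.replace("T", "U")


target_dna = "CTGCACGGAATCGGAAAGTA"  # target sequence quoted in the Methods
grna_rna = "CUGCACGGAAUCGGAAAGUA"    # gRNA sequence quoted in the Methods

# Both sequences are 20 nt and differ only by T -> U.
assert len(target_dna) == 20
assert dna_to_rna(target_dna) == grna_rna
```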
RT-PCR
Total RNA was extracted from DT40 cells (3 × 10⁶) using Trizol as recommended by the manufacturer. RNA (1 μg) was pretreated with DNase I (Invitrogen) and then cDNA was synthesized using Superscript II (Invitrogen). Quantitative (q) RT-PCR was performed for 18S RNA using SYBR Green (Life Technologies), as described (31), with 18S F: 5′-TTGACGGAAGGGCACCACCAG-3′ and 18S R: 5′-GCACCACCACCCACGGAATCG-3′ primers. Semi-quantitative PCR amplification of Mbd4 exons 5–6 was performed using DreamTaq Polymerase (Thermo Fisher Scientific) and 26–32 PCR cycles with primers F2: 5′-TCTTCTGCGTCAATGAATGG-3′ and R2: 5′-GTCCACGCTCAGCTTCTCAT-3′. Thermal cycler conditions: initial denaturation at 95°C for 2 min 30 s, followed by cycles of 30 s denaturation at 95°C, 30 s annealing at 61°C, and 20 s extension at 72°C.
SHM of the DT40 IgM Locus Assessed in the IgM Loss Assay
DT40 cells from each genotype were FACS sorted (MoFlo Astrios) for IgM+GFP+ cells on day 0. IgM+GFP+ cells were subcloned by limiting dilution and 24 parental subclones and 12 subclones each from MBD4Δ/Δ.14, MBD4Δ/Δ.11 and MBD4Δ/Δ.12, were obtained. All subclones were maintained in culture for 28 days then re-analyzed (BD LSR Fortessa) for surface IgM and GFP.
Mutation Analysis of the IgL V Region in DT40c and Mbd4 Deletion Clones
Subclones (n = 12) from the control and MBD4Δ/Δ.14, MBD4Δ/Δ.11 and MBD4Δ/Δ.12 cell lines that were isolated in the IgM loss assay were taken for mutation analysis. The IgL V region was PCR amplified with IgL F: 5′-TTCTCCCCTCTCTCCTCTCC-3′ and IgL R: 5′-AGACGAGGTCAGCGACTCA-3′ primers using Q5 High-Fidelity DNA Polymerase (New England BioLabs, M0491S) and an amplification protocol of 35 cycles of 55 s at 98°C, 20 s at 62°C, and 15 s at 72°C, with a final extension of 2 min at 72°C. Amplicons were 390 bp and were gel purified and submitted to Sanger sequencing on both the forward and reverse strands. Mutations were scored when found on both the forward and reverse strands. The reference DNA sequence was previously described (28) and matched the DNA sequence from unmutated DT40c cells used in our studies. No insertions or deletions were detected.
Statistical Analyses
Statistical analyses were performed using non-parametric Kruskal-Wallis test and GraphPad Prism (GraphPad Software, La Jolla California USA).
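The Methods name the Kruskal-Wallis test without spelling out its arithmetic. As a minimal sketch, the H statistic can be computed from pooled ranks as shown here, with the standard correction for ties. The function name `kruskal_wallis_h` and the toy values are hypothetical illustrations, not data from this study, and GraphPad Prism's implementation may differ in detail. A p value would be obtained by comparing H with a chi-square distribution with k − 1 degrees of freedom, a step omitted from this sketch.

```python
from collections import defaultdict


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic with tie correction for a list of samples."""
    if len(groups) < 2 or any(len(g) == 0 for g in groups):
        raise ValueError("need at least two non-empty groups")
    pooled = sorted(v for g in groups for v in g)
    n_total = len(pooled)

    # Collect the 1-based rank positions of each distinct value, then average them
    # so that tied observations share the same rank.
    positions = defaultdict(list)
    for rank, value in enumerate(pooled, start=1):
        positions[value].append(rank)
    avg_rank = {v: sum(p) / len(p) for v, p in positions.items()}

    # H = 12 / (N (N + 1)) * sum(R_i^2 / n_i) - 3 (N + 1)
    rank_term = sum(sum(avg_rank[v] for v in g) ** 2 / len(g) for g in groups)
    h = 12.0 / (n_total * (n_total + 1)) * rank_term - 3.0 * (n_total + 1)

    # Tie correction: divide by 1 - sum(t^3 - t) / (N^3 - N).
    tie_term = sum(len(p) ** 3 - len(p) for p in positions.values())
    correction = 1.0 - tie_term / (n_total ** 3 - n_total)
    if correction == 0:
        raise ValueError("all observations are identical")
    return h / correction


# Toy values only (three made-up groups with no ties).
toy_groups = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(kruskal_wallis_h(toy_groups))
```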
Results
The MBD4 Glycosylase Domain Is Highly Conserved Amongst Animal Species
Human MBD4 has two distinct domains, the MBD (aa 82–147) and a uracil glycosylase domain (aa 426–580), that are separated by a linker (aa 401–425) (16). The linker domain contains an MLH1 binding motif (SLYFSS) that may link MBD4 function with the MMR pathway of DNA repair (16). To determine whether the MBD and the uracil glycosylase elements are conserved we interrogated the Uniprot protein database for annotated Mbd4 genes across multiple species and compared them to the human MBD4. The MBD4 uracil glycosylase domain (red rectangle) was detected in mammals (mouse, human, platypus), birds (chicken), fish (shark and coelacanth) and insects (aphid) (Figure 1A). However, the MBD (black rectangle) was only sporadically paired with the MBD4 uracil glycosylase domain (Figure 1A). The MLH1 binding motif (blue oval) was variably conserved in mammals as it was present in human and mouse but absent in platypus (Figure 1A). Across all annotated MBD4 uracil glycosylase domains, aa sequence homology ranged from 70 to 95% (Figure 1B). A value of 20% protein sequence homology is considered significant (32). Thus, the MBD4 uracil glycosylase domain is highly conserved in animals.
To further assess the conservation of the MBD4 uracil glycosylase domain we analyzed the preservation of the inferred catalytically active R468, Y540, D560, K562 residues found in human (27) across the annotated MBD4 sequences. To begin, R468 (identified as 1) is conserved between humans and mice and a functionally similar amino acid, lysine, is in the synonymous position for all other species (Figure 1B). Y540 (2) and D560 (3) are present in all species studied. K562 (4) is conserved in all species except aphid (insects), in which it is replaced by the chemically distinct leucine (Figure 1B). Thus, the catalytically active residues within the MBD4 uracil glycosylase domain are very highly conserved in chicken as well as other species.
Next, it was important to determine whether other aa differences in the chicken MBD4 affect the overall structure in a way that could in turn affect function. The DNA binding site of human MBD4 uracil glycosylase is in a cleft formed by the inferred catalytic residues orientated toward the active site (27, 33). The DNA helix is bound to the enzyme via hydrogen bonds formed between Y376 and the mismatched uracil or thymine, which is “flipped” into the active site by bending at a 55° angle, with K398 acting as a dock (27, 33). K304 fills the space left by the flipped-out base and D396 is thought to catalyze the removal from the DNA helix by directly breaking the N-glycosidic bond between the sugar and the mismatched base (27, 33). We aligned the predicted three-dimensional (3D) structure of the chicken MBD4 uracil glycosylase domain with the previously defined human MBD4 crystal structure that had been solved at 1.8 Å resolution (27). Strikingly, the predicted chicken MBD4 uracil glycosylase structure (dark salmon) is almost identical to the human MBD4 crystal structure (green) (Figure 1C). All four catalytically active amino acids, K304 (1), Y376 (2), D396 (3), and K398 (4) in chicken (marked in blue) are similarly aligned with those in human (red), strongly suggesting that MBD4 glycosylase function is preserved in chicken (Figure 1C). This comparison of the human MBD4 crystal structure with the predicted chicken MBD4 structure indicates that the glycosylase domains fold in a similar fashion and identifies critical catalytically active amino acid residues as potential targets in genome editing.
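The residue numbering used above can be cross-checked with simple arithmetic. The sketch below pairs each human catalytic residue named in the text with its chicken counterpart and computes the difference in position; the pairing of human R468 with chicken K304 follows the statement that lysine occupies the equivalent position in species other than human and mouse. The variable and function names are hypothetical illustrations, and the constant offset is shown only for these four residues, not claimed for the whole domain.

```python
# Human -> chicken catalytic residue pairs as named in the text (labels 1-4).
human_to_chicken = {
    "R468": "K304",
    "Y540": "Y376",
    "D560": "D396",
    "K562": "K398",
}


def position(residue: str) -> int:
    """Return the numeric position from a label such as 'Y540'."""
    return int(residue[1:])


# Difference between human and chicken numbering for each pair.
offsets = {
    human: position(human) - position(chicken)
    for human, chicken in human_to_chicken.items()
}
print(offsets)

# All four pairs are separated by the same 164-residue offset.
assert set(offsets.values()) == {164}
```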
Deletions in Chicken MBD4 Are Predicted to Disrupt MBD4 Structure and Function
To assess the involvement of MBD4 in SHM in chicken DT40 cells we generated deletions in the Mbd4 gene that affect the catalytically active amino acid Y376 using CRISPR/Cas9 genome editing (Figure 2A). The Mbd4 exon structure in human (exons 4–8) and chicken (exons 2–6) is essentially identical (Figure 2A). Chicken MBD4 Y376 in exon 5 is synonymous with the human Y540 in exon 7 and deletion of this or surrounding residues leads to perturbation of the catalytic center and its ability to interact with a mismatched base (27) (Figure 2A). To this end, we designed a guide (g) RNA targeting exon 5 in chicken DT40 cells (Figures 2A,B). DT40 AIDRψV− B cells used here were previously deleted for endogenous AID and AID expression was complemented by Tg AID linked to GFP (28). Consequently, AID expression can be followed by GFP expression. DT40 AIDRψV− cells, hereafter referred to as control DT40 (DT40c), were also deleted for pseudo V (ψV) donor genes, which abolished gene conversion and enforced SHM at the rearranged light chain VJ segment in IgM (28). Introduction of AID lesions into the VJ exon of DT40c cells may cause frameshifts or missense mutations and lead to loss of Ig expression. Hence, SHM can be tracked in DT40c cells by IgM expression loss over time (28).
In this study genome editing was performed using DT40c cells that were transfected with gRNA and Cas9; sub-clones were then isolated by limiting dilution and screened for Mbd4 exon 5 deletions using PCR. Deletion clones were validated by DNA sequencing (Figure 2B). Three independent CRISPR/Cas9 edited subclones, MBD4Δ/Δ.14, MBD4Δ/Δ.11 and MBD4Δ/Δ.12, were isolated and each contained a unique deletion in exon 5 located adjacent to the gRNA position (Figure 2B). We compared the DNA sequence of the Mbd4 DT40 deletion clones to the intact chicken Mbd4 gene sequence (Figure 2B). The 12 bp deletion in MBD4Δ/Δ.14 led to H371Q and loss of four residues, G372-K375, which form part of a helix hairpin helix (HhH) motif adjacent to Y376 (Figure 2B). The 17 bp deletion in MBD4Δ/Δ.11 caused loss of six aa, H371-Y376, followed by a frameshift and a premature stop codon (Figure 2B). The large 62 bp deletion in MBD4Δ/Δ.12 begins in intron 4 and caused a frameshift with altered expression of three aa residues and a premature stop codon at the 3′ end of exon 5 (Figure 2B). In all three cases, part of the HhH motif (aa 367–384), which is important for uracil glycosylase activity, was lost (27, 33). MBD4Δ/Δ.11 and MBD4Δ/Δ.12 subclones have also lost the three critical aa Y376, D396, and K398.
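The reading-frame consequences reported above follow from the length of each exonic deletion modulo three. A minimal sketch of that arithmetic is shown here; the function name `deletion_effect` is a hypothetical illustration. Only the 12 bp and 17 bp deletions are included, because the 62 bp deletion begins in intron 4 and the text does not state how many of its bases are exonic.

```python
def deletion_effect(deleted_bp: int) -> str:
    """Classify a purely exonic deletion by its effect on the reading frame."""
    if deleted_bp <= 0:
        raise ValueError("deletion length must be positive")
    return "in-frame" if deleted_bp % 3 == 0 else "frameshift"


# Deletion lengths reported in the text for the two fully exonic deletions.
exonic_deletions = {"MBD4Δ/Δ.14": 12, "MBD4Δ/Δ.11": 17}

for clone, length in exonic_deletions.items():
    # Whole codons removed, bases left over, and the resulting frame status.
    print(clone, length // 3, length % 3, deletion_effect(length))

assert deletion_effect(12) == "in-frame"    # 12 bp = 4 codons, matching the four lost residues
assert deletion_effect(17) == "frameshift"  # 17 bp leaves 2 extra bases
```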
Using the predicted chicken MBD4 3D structure as a reference we analyzed MBD4 structure in MBD4Δ/Δ.14, MBD4Δ/Δ.11, and MBD4Δ/Δ.12 subclones. The chicken aa residues K304, Y376, D396 and K398 are highlighted (blue) and numbered 1–4 (Figure 2C, left panel). MBD4Δ/Δ.14 lost aa G372-K375, which likely causes Y376 (black box) to swing out of the active site and face away from the other catalytic amino acids, which remain intact. This change in Y376 orientation could lead to diminished uracil glycosylase activity as the mismatched uracil or thymine would not adequately bind in the active site of the enzyme. In MBD4Δ/Δ.11, five aa including Y376 in exon 5 and D396 and K398 in the C-terminus are deleted. The loss of Y376 and K398 is predicted to impair DNA binding and to reduce catalysis on any bound DNA. In MBD4Δ/Δ.12, nineteen aa were deleted including the G372-K375 loop and Y376, as well as the C-terminus. MBD4Δ/Δ.12 is the most severely disrupted of the edited subclones and uracil glycosylase function is predicted to be disrupted.
Chicken MBD4 Uracil Glycosylase Domain Deletions Lead to Increased SHM
Mbd4 gene expression was compared in DT40c and Mbd4 deletion clones, MBD4Δ/Δ.14, MBD4Δ/Δ.11 and MBD4Δ/Δ.12, in semi-quantitative RT-PCR assays using primers spanning exons 5-6 (Figure 3A, upper panel). The 18S RNA loading control assessed in qRT-PCR analyses indicates that equal concentrations of cDNA were analyzed for each sample (Figure 3A, lower panel). Expression of Mbd4 transcripts was essentially identical in DT40c as compared to the Mbd4 variant clones indicating that the DNA deletions did not impair Mbd4 steady state transcript levels. A direct assessment of MBD4 protein levels was not possible due to the lack of appropriate anti-MBD4 reagents.
SHM requires cell proliferation (34). To determine whether deletions in the Mbd4 gene influence cell proliferation, DT40c cells and the MBD4Δ/Δ.14, MBD4Δ/Δ.11 and MBD4Δ/Δ.12 subclones were assessed. No differences were found in cell numbers over 96 h of cell growth between the DT40c cells and the deletion subclones indicating that Mbd4 deletions have no impact on proliferation or viability (Figure 3B). Together, these studies indicate that DT40 cell growth and Mbd4 transcription are not affected by Mbd4 gene deletions.
We next asked whether deletions in the Mbd4 glycosylase domain impact SHM frequencies using the well-established criterion of IgM loss as a measure of SHM in DT40 cells (28). In this assay, loss of surface IgM is correlated with the frequency of deleterious mutations in the V(D)J exon of the IgH and the VJ exon of IgL. Thus, increased loss of IgM is indicative of greater SHM. Populations of GFP+IgM+ cells from DT40c, MBD4Δ/Δ.14, MBD4Δ/Δ.11 and MBD4Δ/Δ.12 lines were isolated by FACS at day 0 and subclones from each genotype were further isolated by limiting dilution. DT40c (n = 24), MBD4Δ/Δ.14 (n = 12), MBD4Δ/Δ.11 (n = 12) and MBD4Δ/Δ.12 (n = 12) GFP+IgM+ subclones were cultured for 28 days and then analyzed for IgM expression levels by flow cytometry. Only cells expressing GFP, an indicator of AID expression, were included in the IgM analyses. A representative histogram of IgM and GFP fluorescence for each genotype indicates that the percentage of IgM-GFP+ cells is greater for the Mbd4 deletion clones than for DT40c cells when the same number of cells is analyzed (Figure 3C). Quantitative analyses indicate that MBD4Δ/Δ.14, MBD4Δ/Δ.11 and MBD4Δ/Δ.12 subclones were subject to significantly (p < 0.0005) greater IgM loss as compared to DT40c cells (Figure 3D). The greatest IgM loss was found in MBD4Δ/Δ.12 which also had the greatest predicted disruption to the MBD4 uracil glycosylase domain (Figures 2D, 3D). Previously, disrupting the UNG gene in DT40 was shown to lead to a very high rate of mutations in the VJ region at deaminated cytosine residues and the resulting uracils were not processed by uracil glycosylase activity (35). Further, UNG deficiency in mice and humans has been shown to lead to hypermutation at C/G bases (10, 36). This is consistent with our findings in chicken DT40 cells, in which disrupting MBD4 uracil glycosylase activity leads to increased mutation frequency.
Complementation studies designed to rescue MBD4 function are difficult to perform since overexpression of full length MBD4 leads to cell death in murine B cells (20). Consequently, we opted not to perform rescue studies here. We conclude that disruption of the catalytic amino acids in the MBD4 uracil glycosylase domain leads to a reduction in glycosylase activity and an increase in somatic hypermutation. This is shown by MBD4Δ/Δ.12 having the greatest predicted disruption to the uracil glycosylase domain structure and far higher levels of mutation. However, both MBD4Δ/Δ.11 and MBD4Δ/Δ.14 may retain some uracil glycosylase function as mutation frequency was lower than that found for MBD4Δ/Δ.12. Nevertheless, care must be taken interpreting the relationship between MBD4 structure and the magnitude of mutation frequency as clone to clone variation cannot be completely eliminated.
Mutations in the IgL of the Mbd4 Deletion Clones Are Predominantly at G:C Nucleotides
To analyze the impact of Mbd4 deletion on SHM the rearranged VJ regions of IgL were PCR amplified and sequenced from DT40c and Mbd4 deletion subclones isolated in the IgM loss study (Figures 3C,D). This approach to mutation analysis will detect substitutions, deletions and insertions that are in the majority of cells of a subclone and not rare mutations that may have occurred late in the culture period. Nucleotide changes were found in the 0.39 kb amplicon between the V leader and the 5′ end of the J-C intron and focused to AID hotspots (RGYW, WRCY) (Figure 4). The number of mutations ranged from 0-2 in DT40c, 1–2 in MBD4Δ/Δ.14 and MBD4Δ/Δ.11 and 1–4 in MBD4Δ/Δ.12 subclones. Many of the mutations are located in the same positions as would be expected after clonal expansion. Nevertheless, a number of clones have acquired unique mutations in addition to the dominant mutation, indicating that SHM was operational in those clones (Supplementary Figures 1–4). No insertions or deletions were detected. All mutations were sequenced on both the forward and reverse strands, were unambiguous and were present in the great majority of cells as no background was evident. Two mutated nucleotides noted by asterisks were present in equal quantity with control sequence indicating that the PCR products were mixed at that position and that the mutations occurred in a subset of cells in the subclone (Figure 4). Mutations were predominantly G:C transitions and transversions for all subclones tested as previously noted for DT40 B cells (Figure 4) (28). The clonal nature of mutations in the VJ exon precludes calculation of mutation frequency. Therefore, we infer that SHM frequency is increased in the Mbd4 deletion subclones based on the increase of IgM loss.
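The AID hotspot motifs named above, RGYW and WRCY, use IUPAC ambiguity codes (R = A or G, Y = C or T, W = A or T). As a minimal sketch, such motifs could be located in a sequence as follows. The function names and the toy sequence are hypothetical illustrations, not the DT40 IgL sequence, and this is not presented as the procedure used to annotate Figure 4.

```python
import re

# IUPAC codes needed for the two hotspot motifs, expressed as regex character classes.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "W": "[AT]"}


def motif_to_regex(motif: str):
    """Compile a motif into a lookahead regex so overlapping sites are all reported."""
    pattern = "".join(IUPAC[base] for base in motif.upper())
    return re.compile("(?=(" + pattern + "))")


def find_hotspots(sequence: str, motifs=("RGYW", "WRCY")):
    """Return sorted (0-based start, motif, matched bases) tuples for each hotspot."""
    seq = sequence.upper()
    hits = []
    for motif in motifs:
        for match in motif_to_regex(motif).finditer(seq):
            hits.append((match.start(), motif, match.group(1)))
    return sorted(hits)


# Toy sequence only: AGCT at position 2 satisfies both RGYW and WRCY.
toy_sequence = "TTAGCTCC"
print(find_hotspots(toy_sequence))
```

Because WRCY is the reverse complement of RGYW, scanning the top strand for both motifs covers hotspots on either strand.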
These results indicate that both MBD4 and UNG glycosylases are involved in recognizing and removing AID-induced uracils. Earlier analyses demonstrated that MBD4 interacts with MLH1 in mice and therefore might be involved in regulating the rectification of T:G and U:G mismatches via MMR (21). Although earlier genetic studies suggested that MLH1 does not function in SHM (37, 38) more recent work has revealed the importance of MLH1 for SHM in mouse (14). However, the vast majority of mutations in DT40 cells are G:C transitions and transversions (28). The diminished frequency of A:T mutations implies that error-prone synthesis by Pol η in the MMR pathway is negligible in these cells. Therefore, the exact mechanism by which MBD4 influences SHM in mice and humans remains to be determined. It should be noted that the DT40 IgM loss assay relies on complementation with overexpressed AID which may in turn influence the efficiency of uracil recognition by MBD4. Confirmation of the role of MBD4 in SHM requires analysis in a setting with endogenous levels of AID.
Data Availability Statement
All datasets generated for this study are included in the article/Supplementary Material.
Author Contributions
The protein comparison of annotated Mbd4 species (Figure 1A) and multiple protein sequence alignments (Figure 1B), the DT40 culture and CRISPR/Cas9 editing experiments and analyses (Figures 2A,B) were designed by AK and RC and performed by RC. JC and AK designed and JC performed the 3D structure analysis and alignment (Figures 1C, 2C). All cell proliferation and cDNA/DNA analyses (Figures 3A,B) as well as the flow cytometry for the IgM loss assay (Figures 3C,D) were designed by AK and RC and carried out by RC. Mutation analysis (Figure 4) was designed by AK and generated by JC. The manuscript was written by RC and AK. AK critically revised the paper and all authors approved the final version of the manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank the UIC flow cytometry core, the DNA core and Dr. R. Maul (NIH) for the DT40 AIDRψV− cells, Dr. D. Ucker for use of a CO2 incubator and Dr. H.M. Shen and A. Levi for helpful advice (University of Illinois College of Medicine). We thank Dr. S. Naiyer for help with sequencing the Mbd4 deletion subclones for mutations. The pX330-U6-Chimeric_BB-CBh-hSpCas9 was a gift from F. Zhang (Addgene plasmid # 42230; http://n2t.net/addgene:42230; RRID:Addgene_42230).
Glossary
Abbreviations
- AID
Activation induced deaminase
- AP
Apurinic/apyrimidinic
- CSR
Class switch recombination
- BER
Base excision repair
- FACS
Fluorescence-activated cell sorting
- gRNA
guide RNA
- HhH
Helix hairpin helix
- MBD4
Methyl binding domain protein 4
- MLH1
MutL homolog 1
- MMR
Mismatch repair
- MSI
Microsatellite instability
- RNP
Ribonucleoprotein
- SHM
Somatic hypermutation
- SMUG1
Single-strand selective monofunctional uracil DNA glycosylase
- TDG
Thymine DNA glycosylase
- UNG
Uracil DNA glycosylase
- WT
Wild type
Footnotes
Funding. This work was supported by grants to AK from the National Institutes of Health (R21AI133050, R01AI121286).
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2019.02540/full#supplementary-material
References
- 1. Peled JU, Kuang FL, Iglesias-Ussel MD, Roa S, Kalis SL, Goodman MF, et al. The biochemistry of somatic hypermutation. Annu Rev Immunol. (2008) 26:481–511. 10.1146/annurev.immunol.26.021607.090236
- 2. Revy P, Muto T, Levy Y, Geissmann F, Plebani A, Sanal O, et al. Activation-induced cytidine deaminase (AID) deficiency causes the autosomal recessive form of the Hyper-IgM syndrome (HIGM2). Cell. (2000) 102:565–75. 10.1016/S0092-8674(00)00079-9
- 3. Chaudhuri J, Basu U, Zarrin A, Yan C, Franco S, Perlot T, et al. Evolution of the immunoglobulin heavy chain class switch recombination mechanism. Adv Immunol. (2007) 94:157–214. 10.1016/S0065-2776(06)94006-1
- 4. Stavnezer J, Guikema JE, Schrader CE. Mechanism and regulation of class switch recombination. Annu Rev Immunol. (2008) 26:261–92. 10.1146/annurev.immunol.26.021607.090248
- 5. Di Noia JM, Neuberger MS. Molecular mechanisms of antibody somatic hypermutation. Annu Rev Biochem. (2007) 76:1–22. 10.1146/annurev.biochem.76.061705.090740
- 6. Zanotti KJ, Gearhart PJ. Antibody diversification caused by disrupted mismatch repair and promiscuous DNA polymerases. DNA Repair. (2016) 38:110–6. 10.1016/j.dnarep.2015.11.011
- 7. Saribasak H, Rajagopal D, Maul RW, Gearhart PJ. Hijacked DNA repair proteins and unchained DNA polymerases. Philos Trans R Soc Lond Ser B Biol Sci. (2009) 364:605–11. 10.1098/rstb.2008.0188
- 8. Krokan HE, Drablos F, Slupphaug G. Uracil in DNA–occurrence, consequences and repair. Oncogene. (2002) 21:8935–48. 10.1038/sj.onc.1205996
- 9. Visnes T, Doseth B, Pettersen HS, Hagen L, Sousa MM, Akbari M, et al. Uracil in DNA and its processing by different DNA glycosylases. Philos Trans R Soc Lond B Biol Sci. (2009) 364:563–8. 10.1098/rstb.2008.0186
- 10. Rada C, Williams GT, Nilsen H, Barnes DE, Lindahl T, Neuberger MS. Immunoglobulin isotype switching is inhibited and somatic hypermutation perturbed in UNG-deficient mice. Curr Biol. (2002) 12:1748–55. 10.1016/S0960-9822(02)01215-0
- 11. Durandy A, Peron S, Fischer A. Hyper-IgM syndromes. Curr Opin Rheumatol. (2006) 18:369–76. 10.1097/01.bor.0000231905.12172.b5
- 12. Di Noia JM, Rada C, Neuberger MS. SMUG1 is able to excise uracil from immunoglobulin genes: insight into mutation versus repair. EMBO J. (2006) 25:585–95. 10.1038/sj.emboj.7600939
- 13. Dingler FA, Kemmerich K, Neuberger MS, Rada C. Uracil excision by endogenous SMUG1 glycosylase promotes efficient Ig class switching and impacts on A:T substitutions during somatic mutation. Eur J Immunol. (2014) 44:1925–35. 10.1002/eji.201444482
- 14. Girelli Zubani G, Zivojnovic M, De Smet A, Albagli-Curiel O, Huetz F, Weill JC, et al. Pms2 and uracil-DNA glycosylases act jointly in the mismatch repair pathway to generate Ig gene mutations at A-T base pairs. J Exp Med. (2017) 214:1169–80. 10.1084/jem.20161576
- 15. Rai K, Huggins IJ, James SR, Karpf AR, Jones DA, Cairns BR. DNA demethylation in zebrafish involves the coupling of a deaminase, a glycosylase, and gadd45. Cell. (2008) 135:1201–12. 10.1016/j.cell.2008.11.042
- 16. Bellacosa A, Cicchillitti L, Schepis F, Riccio A, Yeung AT, Matsumoto Y, et al. MED1, a novel human methyl-CpG-binding endonuclease, interacts with DNA mismatch repair protein MLH1. Proc Natl Acad Sci USA. (1999) 96:3969–74. 10.1073/pnas.96.7.3969
- 17. Cortellino S, Turner D, Masciullo V, Schepis F, Albino D, Daniel R, et al. The base excision repair enzyme MED1 mediates DNA damage response to antitumor drugs and is associated with mismatch repair system integrity. Proc Natl Acad Sci USA. (2003) 100:15071–6. 10.1073/pnas.2334585100
- 18. Bardwell PD, Martin A, Wong E, Li Z, Edelmann W, Scharff MD. Cutting edge: the G-U mismatch glycosylase methyl-CpG binding domain 4 is dispensable for somatic hypermutation and class switch recombination. J Immunol. (2003) 170:1620–4. 10.4049/jimmunol.170.4.1620
- 19. Grigera F, Bellacosa A, Kenter AL. Complex relationship between mismatch repair proteins and MBD4 during immunoglobulin class switch recombination. PLoS ONE. (2013) 8:e78370. 10.1371/journal.pone.0078370
- 20. Grigera F, Wuerffel R, Kenter AL. MBD4 facilitates immunoglobulin class switch recombination. Mol Cell Biol. (2017) 37:e00316–16. 10.1128/MCB.00316-16
- 21. Bellacosa A. Functional interactions and signaling properties of mammalian DNA mismatch repair proteins. Cell Death Differ. (2001) 8:1076–92. 10.1038/sj.cdd.4400948
- 22. Bader S, Walker M, Heindrich B, Bird A, Bird C, Hooper M, et al. Somatic frameshift mutations in the MBD4 gene of sporadic colon cancers with mismatch repair deficiency. Oncogene. (1999) 18:8044–7. 10.1038/sj.onc.1203229
- 23. Miyaki M, Iijima T, Shiba K, Aki T, Kita Y, Yasuno M, et al. Alterations of repeated sequences in 5' upstream and coding regions in colorectal tumors from patients with hereditary nonpolyposis colorectal cancer and Turcot syndrome. Oncogene. (2001) 20:5215–8. 10.1038/sj.onc.1204578
- 24. Riccio A, Aaltonen LA, Godwin AK, Loukola A, Percesepe A, Salovaara R, et al. The DNA repair gene MBD4 (MED1) is mutated in human carcinomas with microsatellite instability. Nat Genet. (1999) 23:266–8. 10.1038/15443
- 25. Arakawa H, Hauschild J, Buerstedde JM. Requirement of the activation-induced deaminase (AID) gene for immunoglobulin gene conversion. Science. (2002) 295:1301–6. 10.1126/science.1067308
- 26. Buerstedde JM, Reynaud CA, Humphries EH, Olson W, Ewert DL, Weill JC. Light chain gene conversion continues at high rate in an ALV-induced cell line. EMBO J. (1990) 9:921–7. 10.1002/j.1460-2075.1990.tb08190.x
- 27. Morera S, Grin I, Vigouroux A, Couve S, Henriot V, Saparbaev M, et al. Biochemical and structural characterization of the glycosylase domain of MBD4 bound to thymine and 5-hydroxymethyuracil-containing DNA. Nucleic Acids Res. (2012) 40:9917–26. 10.1093/nar/gks714
- 28.Arakawa H, Saribasak H, Buerstedde JM. Activation-induced cytidine deaminase initiates immunoglobulin gene conversion and hypermutation by a common intermediate. PLoS Biol. (2004) 2:E179. 10.1371/journal.pbio.0020179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. (2013) 339:819–23. 10.1126/science.1231143 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ran FA, Hsu PD, Wright J, Agarwala V, Scott DA, Zhang F. Genome engineering using the CRISPR-Cas9 system. Nat Protoc. (2013) 8:2281–308. 10.1038/nprot.2013.143 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kumar S, Wuerffel R, Achour I, Lajoie B, Sen R, Dekker J, et al. Flexible ordering of antibody class switch and V(D)J joining during B-cell ontogeny. Genes Dev. (2013) 27:2439–44. 10.1101/gad.227165.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Pearson WR. An introduction to sequence similarity (homology) searching. Curr Protoc Bioinformatics. (2013) Chapter 3:Unit3 1. 10.1002/0471250953.bi0301s42 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wu PY, Qiu C, Sohail A, Zhang X, Bhagwat AS, Cheng XD. Mismatch repair in methylated DNA - Structure and activity of the mismatch-specific thymine glycosylase domain of methyl-CpG-binding protein MBD4. J Biol Chem. (2003) 278:5285–91. 10.1074/jbc.M210884200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Neuberger MS, Milstein C. Somatic hypermutation. Curr Opin Immunol. (1995) 7:248–54. 10.1016/0952-7915(95)80010-7 [DOI] [PubMed] [Google Scholar]
- 35.Saribasak H, Saribasak NN, Ipek FM, Ellwart JW, Arakawa H, Buerstedde JM. Uracil DNA glycosylase disruption blocks Ig gene conversion and induces transition mutations. J Immunol. (2006) 176:365–71. 10.4049/jimmunol.176.1.365 [DOI] [PubMed] [Google Scholar]
- 36.Imai K, Slupphaug G, Lee WI, Revy P, Nonoyama S, Catalan N, et al. Human uracil-DNA glycosylase deficiency associated with profoundly impaired immunoglobulin class-switch recombination. Nat Immunol. (2003) 4:1023–8. 10.1038/ni974 [DOI] [PubMed] [Google Scholar]
- 37.Frey S, Bertocci B, Delbos F, Quint L, Weill JC, Reynaud CA. Mismatch repair deficiency interferes with the accumulation of mutations in chronically stimulated B cells and not with the hypermutation process. Immunity. (1998) 9:127–34. 10.1016/S1074-7613(00)80594-4 [DOI] [PubMed] [Google Scholar]
- 38.Phung QH, Winter DB, Alrefai R, Gearhart PJ. Hypermutation in Ig V genes from mice deficient in the MLH1 mismatch repair protein. J Immunol. (1999) 162:3121–4. [PubMed] [Google Scholar]
Data Availability Statement
All datasets generated for this study are included in the article/Supplementary Material.