Skip to main content
eLife logoLink to eLife
. 2018 Apr 17;7:e28477. doi: 10.7554/eLife.28477

Somatic hypermutation of T cell receptor α chain contributes to selection in nurse shark thymus

Jeannine A Ott 1, Caitlin D Castro 2, Thaddeus C Deiss 1, Yuko Ohta 2, Martin F Flajnik 2, Michael F Criscitiello 1,3,
Editor: David G Schatz4
PMCID: PMC5931798  PMID: 29664399

Abstract

Since the discovery of the T cell receptor (TcR), immunologists have assigned somatic hypermutation (SHM) as a mechanism employed solely by B cells to diversify their antigen receptors. Remarkably, we found SHM acting in the thymus on α chain locus of shark TcR. SHM in developing shark T cells likely is catalyzed by activation-induced cytidine deaminase (AID) and results in both point and tandem mutations that accumulate non-conservative amino acid replacements within complementarity-determining regions (CDRs). Mutation frequency at TcRα was as high as that seen at B cell receptor loci (BcR) in sharks and mammals, and the mechanism of SHM shares unique characteristics first detected at shark BcR loci. Additionally, fluorescence in situ hybridization showed the strongest AID expression in thymic corticomedullary junction and medulla. We suggest that TcRα utilizes SHM to broaden diversification of the primary αβ T cell repertoire in sharks, the first reported use in vertebrates.

Research organism: Other

Introduction

All jawed vertebrates share fundamental components of the adaptive immune system. The cartilaginous fish (including sharks) are the most divergent jawed vertebrate group relative to mammals and use a polymorphic major histocompatibility complex (MHC) (Kasahara et al., 1992), multiple isotypes of immunoglobulin (Ig) heavy and light chains (Flajnik, 2002; Criscitiello and Flajnik, 2007), and the typical four T-cell receptor (TcR) chains (Rast et al., 1997). Shark lymphocyte antigen receptors are diversified by RAG-mediated V(D)J somatic rearrangement (Bernstein et al., 1994). After antigen exposure, B cells also use the enzyme activation-induced cytidine deaminase (AID) for receptor modification via somatic hypermutation (SHM) (Conticello et al., 2005), allowing activated B cells to extensively alter their rearranged Ig variable region genes (Muramatsu et al., 2000). Some of the variants produced by this process bind antigen with higher affinity, enhancing humoral immunity through affinity maturation. In addition to SHM in all jawed vertebrate Igs, AID catalyzes the processes of heavy chain class switch recombination (CSR) in tetrapods (and is implicated in shark CSR [Zhu et al., 2012]) and Ig gene conversion (in birds and some mammals) (Barreto and Magor, 2011). AID is a member of the APOBEC family of nucleic acid mutators, two of which likely diversify the variable lymphocyte receptor (VLR) system in the more ancient vertebrate lineages of lamprey and hagfish (Alder et al., 2005; Guo et al., 2009).

Although in general immunologists think TcR loci do not undergo somatic hypermutation, a few reports do exist of AID-mediated SHM in T cells. However, non-productive TcRα rearrangements in hybridomas (Marshall et al., 1999), TcRβ sequences from HIV-positive individuals (Cheynier et al., 1998), and reports of SHM in TcRα murine germinal center T cells (Zheng et al., 1994) are not thought to describe any normal physiology (Bachl and Wabl, 1995). More recent studies indicate that TcR δ and γ in the dromedary camel and TcR γ in the sandbar shark somatically hypermutate (see below) (Antonacci et al., 2011; Chen et al., 2012). Despite these findings, the general consensus has remained that AID does not target TcR loci (Pavri and Nussenzweig, 2011; Choudhary et al., 2018). In over 30 years of studies of TcR repertoires, it has been clear that SHM is not functioning to either generate or further enhance the TcR repertoire of mouse and human.

Recent studies in the sandbar shark (Carcharhinus plumbeus) revived the notion of SHM at TcR loci. Sequencing of the entire TcRγ translocon in C. plumbeus showed definitively that SHM is occurring at that locus (Chen et al., 2009). Shark TcRγ SHM occurs in two distinct patterns: point mutations and tandem mutations characteristic of B cell SHM in cartilaginous fish (Anderson et al., 1995; Lee et al., 2002; Rumfelt et al., 2002; Zhu et al., 2012), possibly suggesting two different cellular mechanisms for generating mutations (Chen et al., 2012). The sandbar shark analysis found targeted nucleotide motifs of AID activity at the TcRγ locus. Chen et al. (2012) examined ratios of replacement (R) and silent (S) mutations between CDR and framework regions to determine if mutation altered affinity of receptors, a method commonly used to study B cell affinity maturation by SHM. Finding no difference between R/S ratios in CDR versus framework regions, they concluded that TcRγ uses SHM to generate a more diverse repertoire rather than for affinity maturation. SHM-induced changes to TcRδ in camels showed similar results.

Early work in our lab also suggested that SHM occurs in the less restricted γδ T cells in nurse shark (Ginglymostoma cirratum) and perhaps in the alpha chain of MHC-restricted αβ T cells (Criscitiello et al., 2010), encouraging us to examine this phenomenon further. Thus, we performed a systematic analysis of this process in shark primary and secondary lymphoid tissues using thymocyte clones containing the same unique third complementarity-determining region (CDR3). Our data suggest that SHM of TcRα is involved in primary T cell repertoire diversification and the enhancement of positive or negative selection in the thymic cortex. This finding is consistent with a model put forward by Niels Jerne over 40 years ago to explain antigen receptor positive selection (Jerne, 1971).

Results

Somatic hypermutation in TcR γV and TcR δV

We assessed the presence of possible SHM within TcR V segments from γ, δ, and β chains. Using neighbor-joining consensus trees, we grouped sequences from each chain into V families based on 85% nucleotide identity and into V subfamilies based on 90% nucleotide identity. We then examined each V subfamily for the possibility of SHM. However, since none of the sequences from these chains contained CDR3 regions, we did not rigorously analyze mutations within these chains.

We first corroborated the original finding of SHM in TcRγ variable regions (V) in sandbar shark spleen (Chen et al., 2012; Chen et al., 2009) using peripheral lymphoid tissue from the spiral valve (intestine) of nurse shark. We also examined mutation in clones from the thymus. We conservatively assigned 69 sequences to 9 V genes from three TcRγ families (Figure 1). Even with a conservative assignment of clones to predicted germline V sequences, we found nearly twice as many V genes as in the sandbar shark locus (which only contains five Vs). Only TcR γ4 did not display mutations in the nurse shark, but since we found only two γ4 sequences, it is possible that it occurs but our sample was too small to observe it.

Figure 1. Alignment of Gamma V clones suggests minimal somatic hypermutation.

Figure 1.

Thymocyte clones for nine γV groups from three different predicted V genes. CDR regions are marked above the scale for each γV alignment. Amino acids are shown under the nucleotide consensus sequence, and dots represent identity to this sequence. We highlighted nonsynonymous changes in black; synonymous changes are underlined. Gaps are used for alignment purposes and indicate a shortened sequence (at the beginning or end). Sequences are identified by a single clone number or a group of identical clones condensed to a single line (the number of clones are indicated). Clone numbers that contain ‘THY’ are from thymus, ‘PBL’ are from peripheral blood leukocytes, and ‘SPV’ are from spiral valve (intestine). We deposited all 69 sequences into GenBank under accession numbers KY351639 – KY351707.

We then investigated the possibility of mutation occurring at the TcRδ locus. We analyzed mutation in clones from nurse shark thymus, peripheral blood leukocytes, and spiral valve, conservatively grouping 111 clones into 12 V genes from 7 TcRδ families (Figure 2). Only one of these families (TcR δ10) lacked mutation. We found that in five of the seven δV families, the same V-δ segments used to generate δ cDNA sequences also generated α cDNA sequences. The sequence diversity at TcRγ and TcRδ was in contrast to TcRβ, where we found no such evidence for mutation in 56 sequences representing six V segments from six different V families (Figure 3). Limited existing data also do not support mutation at NAR-TcR, a distinct TcR containing a NAR V domain supported by a more canonical V δ domain, each resulting from independent V(D)J rearrangements (Criscitiello et al., 2006) (data not shown).

Figure 2. Alignment of Delta V clones suggests somatic hypermutation.

Figure 2.

Thymocyte clones for 12 δV groups from seven different predicted V genes. CDR regions are marked above the scale for each δV alignment. Amino acids are shown under the nucleotide consensus sequence, and dots represent identity to this sequence. We highlighted nonsynonymous changes in black; synonymous changes are underlined. Gaps are used for alignment purposes and indicate a shortened sequence (at the beginning or end). Sequences are identified by a single clone number or a group of identical clones condensed to a single line (the number of clones are indicated). Clone numbers that contain ‘THY’ are from thymus, ‘PBL’ are from peripheral blood leukocytes, and ‘SPV’ are from spiral valve (intestine). We deposited all 112 sequences into GenBank under accession numbers KY346705 – KY346816.

Figure 3. Alignment of Beta V clones illustrates a lack of somatic hypermutation.

Figure 3.

Thymocyte clones for βV groups from six different predicted V genes. CDR regions are marked above the scale for each βV alignment. Amino acids are shown under the nucleotide consensus sequence, and dots represent identity to this sequence. We observed three nonsynonymous changes within a single sequence (highlighted in black). Sequences are identified by a single clone number or a group of identical clones condensed to a single line (the number of clones are indicated). Clone numbers that contain ‘PBL’ are from peripheral blood leukocytes and ‘THY’ are from thymus. We deposited all 57 sequences into GenBank under accession numbers KY351708 – KY351764.

Identification of TcR Vα genes in the nurse shark genome

We identified 17 germline α/δ V gene sequences corresponding to 12 unique V segments. Unfortunately, these segments matched only two groups of sequences in our TcRα dataset (TcRα/δ V4 and V9), likely due to inter-individual polymorphisms. In the absence of a complete germline sequence for this locus, we limited our database for TcRα to thymocyte clones with the same unique CDR3 signature. The CDR3 region results from the somatic recombination and assembly of variable (V) and joining (J) gene segments during lymphocyte development in the thymus (Kuklina, 2006; Lantelme et al., 2008). The recombination process cleaves DNA and initiates repair mechanisms that result in the random insertion of non-template (N) nucleotides within the join, forming a unique binding sequence that contributes to the diversity and specificity of a TcR (Gellert, 2002; Kuklina, 2006). Once recombination ceases and the thymocyte proliferates, this distinctive CDR3 sequence is perpetuated in all daughter cells (Murphy and Weaver, 2017). Since it is unlikely that two thymocytes would generate identical CDR3 sequences during VJ recombination, we predict that amplicons containing identical CDR3 sequences derived from the same progenitor and thus must contain the same germline V and J segments. Alpha CDR3s exhibited substantial variation within our shark sequences, despite the absence of diversity (D) segments. For example, TcRα V1 sequences using the same V and J segments had CDR3 lengths that differed by as many as six amino acids (18 nucleotides), and few CDR3s shared more than one amino acid within this V-J join (Figure 4). Further, the majority of sequences in our overall dataset do contain N and palindromic (P) nucleotides within this join. Using clones containing the same V segment from different sharks, we determined the putative end of each V segment. We first aligned sequences containing the same V segment. Then, assuming that any nucleotide present in the same position within more than one shark must be germline, we determined which nucleotides within a join belong to the V segment. We then repeated this process for each J segment. Of 290 clones, we found 197 (68%) unique sequences within this join (0–34 nucleotides in length), suggesting that most sequences contain N/P nucleotides (see Figure 4—source data 1). Finally, we never observed the same CDR3 sequence in more than one shark, suggesting both exonuclease activity and addition of N and P nucleotides help diversify alpha CDR3s in nurse shark. Therefore, even in the absence of an assembled locus, we were able to evaluate mutation to germline αV segments by considering changes within only those thymocyte clones containing identical CDR3s. Our extremely conservative approach of using only clones containing the same CDR3 encoding rearrangement and N and P nucleotide sequences provided us the assurance that we had distinct αVs descendant from clonal T cells, since it would be extremely unlikely that two T cells created receptors that contained the exact same nucleotide sequence by chance.

Figure 4. CDR3s of TcR Alpha chain are diverse.

Figure 4.

Amino acid (aa) alignment of TcR αV1 thymocyte clones illustrating diversity of the third complementarity-determining region (CDR3). All clones contain identical variable (V) region sequence (aa 1–61). We grouped clones by shared, identical joining (J) regions (purple boxes) and highlight the differences in the V-J join (CDR3 region) in red boxes.

Figure 4—source data 1. CDR3 regions diversified by exonuclease activity and addition of N and P nucleotides.
Alignment of nucleotides belonging to the join between variable (V) and joining (J) segments within TcRα thymocyte clones. We determined the putative ends of each V segment and putative beginning of each J segment by comparing alignments between different sharks, assuming that identical nucleotides between sharks were germline. The last number of each sequence name indicates the number of clones containing that nucleotide sequence between the V and J segments.
DOI: 10.7554/eLife.28477.006

Somatic hypermutation in nurse shark TcR αV

With SHM confirmed in γ and δ TcR chains but apparently not the TcR beta chain of nurse shark, we checked for mutation of the TcRα locus. One might expect mutation in γδ T cells since antigen binding more closely mirrors that of B cells. However, mutations to receptors of MHC-restricted αβ T cells would be surprising given that even minor modifications to these receptors could risk incompatibility with MHC.

Our preliminary V α dataset contained 539 TcRα clones (encoding 286 unique amino acid sequences representing nine V α families) from three tissues (PBL, spleen, thymus) of two sharks (Joanie, Mary Junior). Of this total, only 447 sequences contained complete CDR3-J junctions (including all bases after the last predicted nucleotide of the V segment to the last predicted nucleotide of the J segment; see Table 1). We observed 239 (53.5%) sequences with unique V regions (from the first predicted nucleotide of the V segment to the last predicted nucleotide of the J segment) and 179 (40%) with unique V segments (from the first to the last predicted nucleotide of the V segment only). We found 226 sequences (50%) containing unique CDR3-J regions (all bases after the last predicted nucleotide of the V to the last predicted nucleotide of the J). However, we found 48 groups containing identical CDR3-J sequences across all nine V α families (suggesting they bear the V-J rearrangement from a single founder thymocyte), each V α family containing anywhere from one to ten clonal groups (Table 1). The majority of these groups contained no mutation within V, J or C regions. For example, one α V3 sequence group occurred 131 times, the most numerous sequence in the dataset, yet contained no mutation in any sequence. We did observe mutation in 12 of these 48 groups belonging to seven different V α families. Each family contained between one and four clonal groups, and each group contained two to nine sequences with identical CDR3-J regions (45 total sequences; see Table 1). We include these 45 sequences in our TcR V α dataset (see Table 1 – source data for sequence data).

Table 1. Summary of sequence data used in this paper.

Putative subfamilies within each TcR alpha V family share at least 85% nucleotide identity using nearest-neighbor consensus trees of V segments. Number of TcR alpha nucleotide (NUC), amino acid (AA) sequences or sequence groups within each category. Highlighted columns specifically refer to data used in this study. (See results for detailed descriptions of sequences included within each column.)

Table 1–source data 1. Tab-delimited text file containing sequences used in this paper for analysis of somatic hypermutation of TcR alpha chain.
DOI: 10.7554/eLife.28477.008
Alpha V segment Putative # Sub-families All cloned sequences Complete CDR3-J junction* Unique V Region Unique V segment Unique CDR3-J§ Groups with identical CDR3-J# CDR3-J groups in Study** Sequences in each dataset††
NUC AA NUC AA NUC AA
TRA V1 1 40 40 35 34 24 16 34 32 7 4 2, 3, 3, 5
TRA V2 3 18 18 13 13 13 10 12 12 3 1 5
TRA V3 3 217 194 55 52 35 34 51 50 5 1 4
TRA V4 3 60 28 22 22 21 21 21 21 6 2 2, 2
TRA V5 4 35 34 15 13 9 8 13 13 3 1 2
TRA V6 2 9 9 7 7 5 6 7 7 1 0 0
TRA V7 5 96 60 49 48 39 38 48 48 9 2 2, 2
TRA V9 3 19 19 14 14 12 11 13 13 4 0 0
TRA V10 2 45 45 29 26 21 19 27 26 10 2 4, 9
539 447 239 229 179 163 226 224 48 13 45

*A full list of these sequences can be found in Table 1—source data 1.

V Region includes all bases between the 1 st predicted nucleotide of the V segment to the last predicted nucleotide of the J segment (V and J).

‡V Segment includes all bases between the 1 st predicted nucleotide of the V segment to the last predicted nucleotide of the V segment (V only).

§CDR3-J includes all bases after the last predicted nucleotide of the V segment to the last predicted nucleotide of the J segment.

#Number of groups with identical CDR3-J sequences, which we used to determine sequence relatedness (see text for details).

**Number of groups with identical CDR3-J sequences used in this study. (Those not used contained no mutation with V segments.).

††Total number of sequences for each alpha V used to assess somatic hypermutation within this study (e.g., for aV1, 4 different clonal groups contained 2, 3, 3, and 5 identical CDR3-J regions, respectively).

Using these 45 sequences from both thymus and peripheral immune tissues, we found evidence for SHM acting on the TcRα genes. We divided our 45 clones into 13 CDR3-sharing groups from seven different α V families and then analyzed sequences within groups for potential mutation (Figure 5). We excluded two groups (four sequences) from analyses (αV1.3 and αV7.1) that contained no mutations within FR or CDR regions (leaving 41 clones for analysis). Two sequences (aV7.4m_THY09_051410 and aV5J_SPV17_102810) contained one 3-base insertion and one sequence (aV1.4J_SPV07_090110) contained an 18-base insertion; although SHM can result in insertions and deletions (Diaz et al., 2002), we did not include insertion nucleotides in mutation counts. All sequences were in-frame and contained no internal stop codons, suggesting functionality of cells. Average lengths of CDRs were as follows: CDR1: 7.0 amino acids (range: 5–8); CDR2: 5.7 amino acids (range: 5–7); and CDR3: 6.0 amino acids (range: 1–10). Naming of families and subfamilies followed Criscitiello et al. (2010). However, with the accumulation of sequence data over our previous analysis of nurse shark TcRα (Criscitiello et al., 2010), we expanded our nomenclature considerably. Additional annotation followed the IMGT guidelines for TcRs (Lefranc et al., 2003).

Figure 5. Alignment of Alpha V cDNA clones suggest somatic hypermutation at shark TCRα.

Figure 5.

Thymocyte clones for all 11 αV groups with the same CDR3 from six different predicted V genes. Locations of framework regions (FR), complementarity determining regions (CDR), joining regions (J), and constant (C) regions are marked above the scale for each αV. In absence of germline sequence information, we used a Geneious-derived nucleotide consensus sequence for analysis of nucleotide changes in thymocyte clones. Amino acids are shown under the consensus sequence, and dots represent identity to this sequence. Nonsynonymous changes are highlighted in black; synonymous changes are underlined. Gaps are used for alignment purposes and indicate either a shortened sequence (at the beginning or end) or insertions or deletions within the sequence. Sequences are identified by a single clone number or a group of identical clones condensed to a single line (the number of clones are indicated). Clone numbers that contain ‘THY’ are from thymus, ‘PBL’ are from peripheral blood leukocytes, and ‘SPV’ are from spiral valve (intestine). We did not use clones from αV1.3 and αV7.1 because they did not contain mutations in FR or CDR regions. We deposited all 42 sequences into GenBank under accession numbers KY189332 – KY189354 or KY366469 – KY366487.

Figure 5 shows all 12 αV CDR3 groups exhibiting mutation. The overall TcRα mutation frequency was 0.0226 substitutions per nucleotide (S/N), with 66% of all substitutions (187 of 283) resulting in amino acid replacements. The CDRs accumulated significantly more mutations than FRs (CDR: 0.0352 S/N; FR: 0.0188 S/N; p=0.0373), and substitutions in CDRs were twice as likely to be nonsynonymous changes (NSYN) than those in FRs (CDR: 0.0235 S/N; FR: 0.0122 S/N; p=0.0312; Table 2). There was no difference in frequency of synonymous (SYN) mutations between regions (CDR: 0.0117 S/N, FR: 0.0066 S/N; p=0.0705). Finally, although we found more tandemly mutated bases in CDRs (41 of 81, or 50.6% of all CDR mutations) than in FRs (73 of 192, or 38.0% of all FR mutations), this difference was not significant; p=0.721; Table 3). Tandem mutations ranged from two to four bases in length (mean = 2.78 bases). That this feature of SHM, specific of cartilaginous fish Ig (Rumfelt et al., 2001; Lee et al., 2002; Diaz et al., 1999), also occurs in the TcR strongly supports the validity of our analyses.

Table 2. Frequencies of somatic hypermutation in nurse shark alpha V groups (αV G) containing the same CDR3.

Mutation frequency is the total number of nucleotide changes to a Geneious-derived consensus sequence divided by the total number of nucleotides. We counted synonymous (S) and nonsynymous (N) mutations separately for each FR and CDR for 11 different CDR3 groups in seven predicted alpha V genes. [FR: framework region; CDR: complementarity-determining region; Seqs: sequences; Nuc: nucleotides; Freq: frequency]

FR1 FR2 FR3 FR Means
αV Group # Seqs # Codons Total Mutations Total Mutation # Codons Total Mutations Total Mutation # Codons Total Mutations Total Mutation
Nuc S N Mutation Freq (%) Nuc S N Mutation Freq (%) Nuc S N Mutation Freq (%) S N ALL
αV1.1 3 25 225 2 12 14 6.222 17 153 0 2 2 1.307 41 369 4 4 8 2.168 6 18 24
αV1.2 5 25 375 10 8 18 4.800 17 255 1 8 9 3.529 41 615 5 5 10 1.626 16 21 37
αV1.4 3 25 225 1 0 1 0.444 17 153 1 3 4 2.614 41 369 4 5 9 2.439 6 8 14
αV2.1 5 25 375 0 0 0 0.000 17 255 1 0 1 0.392 42 630 0 0 0 0.000 1 0 1
αV3.1 4 25 300 0 4 4 1.333 17 204 0 0 0 0.000 40 480 0 1 1 0.208 0 5 5
αV4.1 2 25 150 1 2 3 2.000 17 102 0 2 2 1.961 43 258 11 7 18 6.977 12 11 23
αV4.2 2 25 150 0 0 0 0.000 17 102 1 1 2 1.961 43 258 2 7 9 3.488 3 8 11
αV5.1 2 25 150 0 6 6 4.000 17 102 2 3 5 4.902 42 252 1 2 3 1.190 3 11 14
αV7.2 2 25 150 4 7 11 7.333 17 102 1 3 4 3.922 41 246 1 6 7 2.846 6 16 22
αV10.1 4 25 300 0 0 0 0.000 17 204 1 1 2 0.980 41 492 4 0 4 0.813 5 1 6
αV10.2 9 25 675 2 11 13 1.926 17 459 4 4 8 1.743 40 1080 3 11 14 1.296 9 26 35
Sum 41 275 3075 20 50 70 187 2091 12 27 39 455 5049 35 48 83 67 125 192
Mean Mutation Freq (%) 0.65 1.63 2.28 0.57 1.29 1.87 0.69 0.95 1.64 0.66 1.22 1.88
Standard Deviation 2.994 4.547 6.531 1.136 2.252 2.841 3.125 3.414 5.429 4.742 8.201 11.861
CDR1 CDR2 CDR3 CDR Means
AlphaV Group # Seqs # Codons Total Mutations Total Mutation # Codons Total Mutations Total Mutation # Codons Total Mutations Total Mutation
Nuc S N Mutation Freq (%) Nuc S N Mutation Freq (%) Nuc S N Mutation Freq (%) S N ALL
αV1.1 3 8 72 2 2 4 5.556 5 45 1 0 1 2.222 10 90 1 6 7 7.778 4 8 12
αV1.2 5 8 120 5 4 9 7.500 5 75 1 2 3 4.000 8 120 1 4 5 4.167 7 10 17
αV1.4 3 8 72 0 1 1 1.389 5 45 0 0 0 0.000 5 45 0 2 2 4.444 0 3 3
αV2.1 5 7 105 0 1 1 0.952 5 75 0 0 0 0.000 3 45 0 0 0 0.000 0 1 1
αV3.1 4 6 72 0 3 3 4.167 6 72 0 0 0 0.000 1 12 0 0 0 0.000 0 3 3
αV4.1 2 7 42 3 1 4 9.524 6 36 0 1 1 2.778 6 36 2 0 2 5.556 5 2 7
αV4.2 2 7 42 1 1 2 4.762 6 36 0 0 0 0.000 6 36 0 0 0 0.000 1 1 2
αV5.1 2 7 42 0 3 3 7.143 6 36 0 7 7 19.444 7 42 0 0 0 0.000 0 10 10
αV7.2 2 5 30 0 0 0 0.000 7 42 1 1 2 4.762 6 36 0 2 2 5.556 1 3 4
αV10.1 4 7 84 3 0 3 3.571 6 72 0 0 0 0.000 7 84 0 3 3 3.571 3 3 6
αV10.2 9 7 189 1 8 9 4.762 6 162 2 2 4 2.469 7 189 3 0 3 1.587 6 10 16
Sum 41 77 870 15 24 39 63 696 5 13 18 66 735 7 17 24 27 54 81
Mean Mutation Freq (%) 1.72 2.76 4.48 0.72 1.87 2.59 0.95 2.31 3.27 1.17 2.35 3.52
Standard Deviation 1.690 2.316 2.979 0.688 2.089 2.248 1.027 2.067 2.272 2.659 3.754 5.626

Table 3. Number and frequency of DNA mutations that occur in tandem within framework regions (FR) and complementarity determining regions (CDR) in nurse shark alpha V (αV) groups.

All mutations include both tandem and point mutations within a region. [Seqs: sequences]

FR CDR
# Nucleotides Tandemly Mutated All mutation Frequency of tandem mutation # Nucleotides Tandemly Mutated All mutation Frequency of tandem mutation
αV Group # Seqs 2 3 4 Sum 2 3 4 Sum
αV1.1 3 1 0 1 6 24 25.0 3 0 0 6 12 50.0
αV1.2 5 4 1 0 11 37 29.7 5 1 0 13 17 76.5
αV1.4 3 3 0 0 6 14 42.9 0 0 0 0 3 0.0
αV2 5 0 0 0 0 1 0.0 0 0 0 0 1 0.0
αV3 4 2 0 0 4 5 80.0 1 0 0 2 3 66.7
αV4.1 2 4 1 0 11 23 47.8 0 1 0 3 7 42.9
αV4.2 2 1 1 0 5 11 45.5 0 1 0 3 2 150.0
αV5 2 1 0 0 2 14 14.3 1 1 0 5 10 50.0
αV7.2 2 4 1 0 11 22 50.0 1 0 0 2 4 50.0
αV10.1 4 3 0 0 6 6 100.0 0 0 0 0 6 0.0
αV10.2 9 4 1 0 11 35 31.4 2 1 0 7 16 43.8
Total 41 27 5 1 73 192 38.0 13 5 0 41 81 50.6

Mutation frequency also varied by region (CDR or FR). The highest mutation frequency occurred in CDR1 and accumulated significantly more mutations overall than other regions (X¯=4.48%; see Figures 6 and 7a, Table 2). CDR3 displayed an unusually low mean mutation frequency (2.97%). However, these frequencies may be artificially low since our groups were based only on clones containing the same CDR3 sequence, and clones whose CDR3s deviated markedly from the consensus would have been excluded by our conservative grouping approach. We observed the lowest mean mutation frequency in FR3 (1.64%). These results are consistent with what is known about human TcR binding to MHC:Ag structures. CDR1 and CDR3 make more contacts with Ag while CDR2 interacts primarily with non-polymorphic regions of MHC (Buslepp et al., 2003; Garcia and Adams, 2005), making mutation in CDR2 less favorable; further, FR regions are important for the structural stability of the domain and mutations to these regions may affect the ability of the Ig superfamily domain to fold properly (Mantovani et al., 2002; Reinherz et al., 1999).

Figure 6. Mutation frequencies differed within TcR V regions.

Figure 6.

Mutability of complementarity determining regions (CDR), especially CDR1, exceeded that of framework regions (FR) for all mutations together (black bars) and for nonsynonymous mutations alone (NSYN, hatched bars). We found no statistical difference in synonymous mutations (SYN, white bars) between CDRs and FRs.

Figure 6—source data 1. Frequency of mutation in TcR alpha V framework and complementarity determining regions for all mutation, nonsynonymous (NSYN) mutation only, or synonymous (SYN) mutation only.
DOI: 10.7554/eLife.28477.013

Figure 7. Mutation and motif locations within individual domains of TcR αV sequences.

Figure 7.

(A) Number of mutations (both single and tandem, 283 total) to A:T nucleotides (black bars, 122 mutations) or G:C nucleotides (blue bars, 161 mutations) observed at each position along the sequence length of TcR αV sequences. We counted mutations to Geneious-derived consensus sequences within framework and complementarity-determining region (CDR) domains (41 sequences within 11 αV groups). (B) Number of WA/TW motifs (black bars, indicating possible polymerase η action) or DGYW/WRCH motifs (blue bars, indicating hotspots for AID activity, and thus possible mutation) at each position along the sequence length. Position indicates the forward (3’ to 5’) location of the mutable base of each motif within a Geneious-derived consensus sequence for each TcR αV group (eleven groups). Red boxes indicate the location of CDR1 (positions 76–99), CDR2 (positions 151–171), and CDR3 (positions 300–328) within each panel. [WA/TW: A:T is the mutable position; DGYW/WRCH: G:C is the mutable position; W = A/T, D = A/G/T, Y = C/T, R = A/G, and H = T/C/A].

Hotspots

We found 1327 G/C base pairs within DGYW/WRCH hotspot motifs (4,402 G/C base pairs occurred outside these motifs) and 2931 A/T base pairs within WA/WT hotspot motifs (3690 A/T base pairs occurred outside these motifs; see Table 4). Mutations of G:C nucleotides were strongly associated with DGYW/WRCH hotspots (Figure 7b,c). Overall, G:C mutations occurred 4.3x as often within hotspots as those outside hotspots (p=0.0267). However, G:C mutations were 1.4x more likely within CDRs than within FRs (p<0.0001). A:T nucleotides did not appear to prefer WA/TW hotspots (p=0.2248), although they were 2.3x as likely to mutate within hotspots than outside hotspots. Further, while A:T mutations occurred more often than expected in both FRs and CDRs (p=0.0015), the frequency of A:T mutations were 1.9x more likely in CDRs than FRs. Chen et al. (2012) observed similar results in TcR γV sequences of sandbar shark. The authors suggested that A:T mutations still are likely the result of DNA polymerase η use during mismatch repair mechanisms due to the lack of A:T point mutations in that study (the majority occurring in tandem with other mutations) (Rogozin et al., 2001; Chen et al., 2012; Wei et al., 2015). In nurse shark Ig light chain genes, the majority of mutation to A:T nucleotides (55%) also occurred in tandem (Alder et al., 2005). However, 63% (76 of 121) of A:T mutations occurred as point mutations in our study, which is inconsistent with DNA polymerase η use during mismatch repair (Rogozin et al., 2001; Chen et al., 2012; Wei et al., 2015). This result suggests that an alternate A:T motif is targeted or that shark TcRs employ a different mechanism to alter A:T nucleotides.

Table 4. Target nucleotide mutation frequency in DGYW/WRCH or WA/TW mutation hotspots within framework regions (FR) and complementarity determining regions (CDR).

[DGYW/WRCH (G:C is the mutable position; D = A/G/T, Y = C/T, W = A/T, R = A/G, and H = T/C/A); WA/TW (A:T is the mutable position; W = A/T); ‘ALL’ refers to nucleotides within a hotspot motif; ‘All other’ refers to nucleotides outside a hotspot motif]

Mutated base Hotspot motif Region Total nucleotides Observed # Mutations Expected # Mutations Mutation freq (%) χ2 p T-test P
G/C DGYW/WRCH FR 927 56 18.16 6.04 0.0000 0.0267
CDR 400 35 26.22 8.75
ALL 1327 91 37.52 6.86 0.0000
Outside Motif 4402 71 124.48 1.61
A/T WA/TW FR 2531 54 36.07 2.13 0.0015 0.2248
CDR 400 26 21.12 4.03
ALL 2931 80 55.97 2.52 0.0000
Outside Motif 3690 41 65.03 1.11

*T-test analysis was used to compare mutations within hotspot motifs to those outside hotspot motifs. Mutations to G and C nucleotides occurred significantly more often within DGYW/WRCH motifs than outside these motifs, while mutations to A and T nucleotides showed no preference for WA/TW motifs.

†χ2 analysis was used to compare observed and expected numbers of mutations between FR and CDR regions and between mutations inside and outside hotspot motifs. More mutations to all nucleotides occurred within hotspots than outside hotspots, and significantly more mutations occurred to nucleotides within CDRs than FRs.

Base substitution indices

We identified 283 mutations within the eleven CDR3 groups. There was a bias toward mutations of G and C nucleotides (p=0.0022) and G:C changes comprised 57% of all mutations (Table 5a). More mutations to both G and C nucleotides occurred in FRs (p=0.0037) while only C nucleotides showed greater mutation than expected in CDRs (p=0.1136). There were fewer mutations of A and T nucleotides in FRs (p=0.0082; Table 5b), but mutations of A and T nucleotides in CDRs did not differ from random (p=0.3620; Table 5c). The frequency of mutated nucleotides also varied by region: In FR1, more A and T nucleotides mutated while fewer C nucleotides mutated. Mutations of G and C nucleotides were lower in FR2 and FR3, respectively (Table 5b,c). In both CDR1 and CDR2, there were more A mutations and fewer G mutations than expected. However, more G nucleotides mutated in CDR3. We saw a bias toward G:A and C:T transitions (42.8%) among the TcR α mutations. Transition mutations appeared only slightly more often in CDRs (45.1%) than in FRs (41.7%). Transversions of C:A or G:T mutations occurred only 25.1% of the time (Table 5).

Table 5. Bias in base substitution during somatic hypermutation of TcR alpha V genes within all sequence regions (ALL), framework regions (FR), or complementarity determining regions (CDR).

Probability of occurrence is the proportion of that base out of the total nucleotides. [Nuc: nucleotides; OBS: Observed; EXP: expected; MI: mutability index; ChiSq: Chi squared]

a ALL Base Occurrence Probability of occurrence OBS EXP MI* ChiSq
G 2895 0.230 77 65.05 1.18 0.0022
C 2834 0.225 85 63.68 1.33
A 3498 0.278 64 78.60 0.81 0.0068
T 3368 0.267 57 75.68 0.75
Total 12595 1.00 283 283
GC Mutation: 57.0%; Transitions: 42.8%; Transversions: 25.1%
b FR Base Occurrence Probability of Occurrence OBS EXP MI* ChiSq
G 2378 0.23 58 44.50 1.30 0.0037
C 2268 0.22 56 42.44 1.32
A 2779 0.27 38 52.00 0.73 0.0082
T 2835 0.28 40 53.05 0.75
Total 10260 1.00 192 192
GC Mutation: 59.0%; Transitions: 41.7%; Transversions: 24.0%
c CDR Base Occurrence Probability of Occurrence OBS EXP MI ChiSq
G 517 0.22 19 20.15 0.94 0.1336
C 566 0.24 29 22.06 1.31
A 719 0.31 26 28.02 0.93 0.3620
T 533 0.23 17 20.77 0.82
Total 2335 1.00 91 91
GC Mutation: 52.9%; Transitions: 45.1%; Transversions: 27.5%

*Mutability Index, as first defined in Chen et al., 2012. χ2 analysis was used to compare observed and expected numbers of mutations. G and C mutated significantly more often than expected, while A and T mutated significantly less often than expected. Base composition: 23.0% G, 22.5% C 27.8% A, 26.7% T.

Mutations, in situ hybridization, and AID expression in thymus

Since we cloned TcR αV sequences in both central (thymus) and peripheral (blood, spiral valve) lymphoid tissues, we analyzed mutation frequencies by tissue type. Although data were limited to only six sequences in two CDR3 groups (αV2, αV7.2), we found more mutations to FRs of peripheral sequences, though this was not significant (p=0.0541; Table 6). We found no differences between tissues within CDRs (p=0.2) from this limited data set. It is possible that positive and negative selection pressures remove more clonal sequences within the thymus, but with so few sequences to compare it is difficult to determine.

Table 6. Frequencies of somatic hypermutation in nurse shark thymus and peripheral lymphoid tissue (blood and spiral valve).

Mutations were analyzed only in alpha V groups containing the same third complementarity-determining region (CDR). Mutation frequency was measured as the total number of nucleotide changes to a Geneious-derived consensus sequence divided by the total number of nucleotides in all sequences. Nonsynymous (N) and synonymous (S) mutations (mut) were counted separately for each framework (FR) and CDR for two predicted alpha V genes. [FR1, FR2, FR3, CDR1, CDR2, and CDR3 refer to the first, second, or third FR or CDR region, respectively.]

Tissue type Mut type FR Mutations (#) CDR Mutations (#) FR mutation frequency CDR mutation frequency
 FR1 FR2 FR3 All FR CDR1 CDR2 CDR3 All CDR FR1 FR2 FR3 CDR1 CDR2 CDR3
Thymus
 (6 sequences)
N 8 8 7 23 1 3 3 7 0.570 0.871 0.317 0.071 0.327 0.136
S 8 2 6 16 1 0 0 1 0.570 0.218 0.272 0.071 0.000 0.000
ALL 16 10 13 39 2 3 3 8 1.140 1.089 0.590 0.529 1.075 1.235
Total Nucleotides 1404 918 2205 4527 378 279 243 900
Periphery
 (2 sequences)
N 7 4 6 17 0 2 2 4 1.496 1.307 0.833 0.000 1.852 1.587
S 4 0 1 5 0 0 0 0 0.855 0.000 0.139 0.000 0.000 0.000
ALL 11 4 7 22 0 2 2 4 2.350 1.307 0.972 0.000 1.852 1.587
Total Nucleotides 468 306 720 1494 126 108 126 360

The mutated sequences in the primary T lymphoid tissue suggests the activity of activation-induced cytidine deaminase (AID) in the thymus. We confirmed the expression of AID in the thymus through real-time RT-qPCR, where thymus tissue expressed AID at more than half (0.7x) the levels found in spleen (positive control, where B cell SHM is known to occur) and nearly 6x the levels observed in forebrain (negative control; Figure 8a). Further, colorimetric in situ hybridization (CISH) of nurse shark thymus revealed a diffuse signal of TcR β (Figure 8b, panels 1,2) and AID (Figure 8b, panels 3,4) mRNA expression throughout the thymic cortex. However, AID expression was greatest in the central cortex and cortico-medullary junction (CMJ), while TcR β expression was highest in the outer cortex. The thymic sequence data combined with the tissue localization of AID message strongly suggest that TcRα loci undergo AID-dependent SHM in the shark thymus.

Figure 8. Shark thymus expresses AID.

Figure 8.

(A) Expression of AID in shark spleen (black bar), thymus (hashed bar), and forebrain (gray bar) using real-time quantitative PCR. Expression was measured using the delta delta Cq method and are normalized against shark muscle tissue using β2M as a reference gene. Data represent expression fold-change differences at four cDNA concentrations. (B) In situ hybridization of mRNA using ribo-probes on adult shark thymus sections. 1,2: Two different fields probing for TcRβ antisense. 3,4: AID antisense. 5,6: TcRβ sense. 7,8: AID sense. All micrographs at 10X magnification; black scale bar in lower right of each panel is 100 uM. Anatomical structures are designated on the top panels [c: cortex; m: medulla; sc: subcapsular region; cmj: corticomedullary junction].

We refined our CISH results with RNA fluorescence in situ hybridization (FISH) using probes against the TcR alpha constant region (αC) and exons 2 and 3 of AID. Since Stellaris RNA FISH uses a mixture of shorter probes (~20 bp in length) that hybridize along the length of the target RNA, the resulting signal is detectable only when tens of probes hybridize to the target. This makes the technology very specific -- only those transcripts bound to numerous probes are visible -- and limits the potential for false-positive or false-negative results (Orjalo et al., 2011; Raj and Tyagi, 2010). Our FISH results indicated a more specific TcR αC signal within the inner cortex and medulla adjacent to the CMJ (Figure 9; Figure 9—figure supplement 1), regions where, in mammals, developing cells actively rearrange their alpha chain genes and where mature αβ T cells are found. AID expression occurred in ‘rings’ around areas of expressed TcR αC messages within the inner cortex and CMJ (where positive selection occurs in mammals) and the medulla (where negative selection occurs in mammals). Further, AID always co-localized with TcR αC (Figure 9). Thus, we observed a consistent pattern of expression where a ‘ring’ of cells expressing both TcR αC and AID surround a central cell expressing only TcR αC. The more specific signal generated by FISH may suggest that, once a T cell completes RAG-mediated somatic rearrangement of its alpha chain locus, it clonally expands to form a ring of daughter cells around it. These daughter T cells then express AID (and TcR αC), promoting somatic hypermutation within their TcR alpha sequences during times when cells also undergo positive and negative selection.

Figure 9. AID expression localized to inner cortex and cortico-medullary junction.

(a) H and E staining of fixed shark thymus tissue illustrating thymic architecture (10x). The densely packed cells at the margins of the image comprise the cortex (cor), while the less densely packed cells in the center constitute the medulla (med). The region at the junction between cortex and medulla incorporates the corticomedullary junction (CMJ), delineated generally by a hashed white circle. [b-z] Single molecule RNA fluorescence in situ hybridization (FISH) probing fixed thymus sections simultaneously for AID (probes labeled with Quasar 670; pseudo colored red) and TcRα (probes labeled with CalFluor Red 610; pseudo colored green) and counterstained with DAPI (blue). (b) Composite of seven Z-stacked images (10x) depicting overall thymic architecture and the localization of AID expression to the inner cortex and cortico-medullary junction regions of shark thymus. We superimposed (and minimally adjusted) the outlined CMJ boundaries from (a) onto (b) to elucidate the junction between cortex and medulla. [c-z] We obtained images of each fluorophore using 10x, 20x, and 63x magnification and merged Z-stacked images together. Individual (10x) fluorophore images of DAPI [c-f], TcRα [g-j], and AID [k-n] and Z-stacked merged images [o-r] illustrate AID and TcRα expression in four locations of shark thymus. White boxes indicate the magnified regions of the 10x and 20x images shown in the 20x [s-v] and 63x [w-z] images, respectively. Scale bars [a,b,c,g,k,o] 150 μm, [s] 75 μm, and [w] 30 μm. [cor: cortex; med: medulla; CMJ: corticomedullary junction].

Figure 9.

Figure 9—figure supplement 1. Localization of AID and TcRα probes is independent.

Figure 9—figure supplement 1.

Single molecule RNA fluorescence in situ hybridization (FISH) showing three separate regions of fixed thymus sections demonstrating the independent localization of our probes. We probed individually for TcRα (a-c) (probes labeled with CalFluor Red 610; pseudo colored green), simultaneously for both TcRα and AID (d-f), or individually for AID (probes labeled with Quasar 670; pseudo colored red) and counterstained with DAPI (blue). There were two tissue sections in between imaged regions of TcRα section (a-c) and combined AID/TcRα section (d-f). The combined AID/TcRα section (d-f) was consecutive with the AID only section (g-h). We obtained images of each fluorophore using 10x magnification and merged Z-stacked images together. Scale bar 100 μm.
Figure 9—figure supplement 2. Lack of AID and TcRα probe hybridization in shark brain.

Figure 9—figure supplement 2.

Single molecule RNA fluorescence in situ hybridization (FISH) showing fixed brain sections on four separate slides demonstrating the lack of probe localization in non-immune tissue. We probed simultaneously for both TcRα and AID (a-c), individually for AID (probes labeled with Quasar 670 and imaged with Cy5 filter; pseudo colored red) (d-f), individually for TcRα (probes labeled with CalFluor Red 610 and imaged with Cy3 filter; pseudo colored green) (g-i), or for neither probe (negative control) (j-l). We show probe staining in shark thymus tissue as a positive control [m-o]. We counterstained all slides with DAPI (blue). We imaged fluorescence detected using each filter (DAPI, Cy3/Cy5) regardless of the probe used to illustrate that background fluorescence is not dependent on probe application. We obtained images of each fluorophore using 10x magnification and merged Z-stacked images together (DAPI fluorescence [a,d,g,j,m]; Cy3 (green) and Cy5 (red) fluorescence [b,e,h,k,n]; merged [c,f,I,l,o]). Scale bar 150 μm.

Discussion

The role and diversifying mechanisms of SHM in B cells are well known (Li et al., 2004), as are the consequences of off-target AID activity (Álvarez-Prado et al., 2018). In B cells, AID mediates SHM within germinal centers of lymph nodes and spleen in mammals (Crouch et al., 2007), and we predict this is similarly occurring in the B cells zones identified in the shark splenic white pulp (Rumfelt et al., 2002). Somatic mutations occur in rearranged variable regions of B cells responding to antigen at rates of 10−3 mutations per base pair per cell division (Odegard and Schatz, 2006). These changes are dominated by point mutations (and in shark, tandem mutations), biased toward transitions (G:A and C:T), and preferentially targeted to the AID motifs DGYW/WRCH (and less to WA/TW) (Li et al., 2004; Malecek et al., 2005; Odegard and Schatz, 2006; Rogozin and Diaz, 2004). The sequences of B cell V genes have evolved to maximize mutational effects, targeting the accumulation of replacement mutation within the antigen-binding CDRs and limiting mutation within the more structural FRs. In humans, this focused mutation correlates with the long-term survival of B cell receptor repertoires (Saini and Hershberg, 2015). The ability of B cells to use SHM for receptor diversification and improved antigen affinity is the basis of adaptive immunity (Saini and Hershberg, 2015). Despite having similar developmental machinery as B cells (Gellert, 2002), the assumption has long been held that αβ T cells do not undergo SHM because mutation could have deleterious effects on the binding of TcRs to MHC:Ag complexes (MHC:Ag) (Wagner et al., 1995; Mantovani et al., 2002). Despite some suggestion that SHM was occurring in T cells, studies designed to either quantify or characterize mutation in mouse or human TcRs did not gain traction and the textbook definition of SHM still defines it as an exclusively B cell mechanism (Murphy and Weaver, 2017).

Recent studies reported the incidence of SHM in the γ chain of γδ T cells of sandbar shark (Chen et al., 2012; Chen et al., 2009) and in both γ and δ chains of dromedary camel (Antonacci et al., 2011; Ciccarese et al., 2014; Vaccarelli et al., 2012). In each study, SHM mirrored the mutational patterns observed in B cells during affinity maturation. However, in both sandbar shark γ and dromedary camel γ and δ chains, the authors hypothesized that T cells employ mutation as a means to generate a more diverse receptor repertoire rather than to improve receptor affinity to Ag (Chen et al., 2012; Antonacci et al., 2011; Vaccarelli et al., 2012). In contrast to αβ T cells, γδ T cells that interact with non-classical MHC often recombine tissue-specific, restricted sets of genes that have limited junctional diversity (Allison et al., 2001; Adams et al., 2005). Thus, it is reasonable to consider that SHM could be used as a receptor-diversifying mechanism to fine-tune ligand recognition within a prospective tissue or to allow changes within the loci that allow receptors to evolve more rapidly to changing ligand environments (Adams et al., 2005; Kazen and Adams, 2011). Further, many γδ T cells typically bind Ag in a manner more similar to that of Ig than to αβ T cells, recognizing and directly binding to small molecules and intact proteins without presentation by classical MHC:Ag complexes (Adams et al., 2005; Allison and Garboczi, 2002; Allison et al., 2001). Inflammation stimulates activation of γδ T cells earlier in an immune response, releasing pro-inflammatory cytokines and killing infected macrophages. Thus, γδ T cells combine an innate-like response with an adaptive recognition strategy, providing both an immediate response to pathogen invasion and an ongoing, adaptive response to inflammation (Adams et al., 2005; Allison and Garboczi, 2002; Allison et al., 2001). It is evident that how SHM presents a useful solution for accomplishing these tasks by creating a more diverse repertoire of these antibody-like γδ TcRs. Taken together, these studies clearly demonstrate that we can no longer regard SHM as a uniquely B cell mechanism. Considering the diversity of TcRs and TcR diversification mechanisms being found even in mammals (Hansen and Miller, 2015; Miller, 2010), perhaps we should prepare for more surprises in TCR antigen recognition.

In the present study, we verified SHM occurring within the γ and δ chains of γδ T cells in both thymic and peripheral immune tissue of nurse shark. Remarkably, we also detected SHM occurring in the α chain of αβ T cells. We observed mutational characteristics within α chain of nurse sharks similar to those found in B cell SHM. We observed an overall mutation frequency of 0.0226 substitutions per nucleotide (S/N) and a bias toward transition mutations. Further, we detected both single and tandem mutations, a pattern unique to sharks that also occurs in shark B cells. Changes to G and C nucleotides comprise 66.1% of all mutations. Mutation was twice as frequent in CDRs as in FRs (0.0352 versus 0.0188 S/N, respectively), and substitutions in CDRs were significantly more likely to result in amino acid changes. Further, mutations were strongly associated with AID hotspots, and substitutions to G and C nucleotides occurred nearly 1.4x as often within CDR hotspots than FR hotspots. Out of curiosity, we compared counts of AID hotspot motifs within CDR and FR regions between our 11 nurse shark TcR αV consensus sequences and 6 human TcR αV segments (V1.1, V1.2, V2, V3, V4 and V5). We found that shark TcR V segments exhibit far more WRCH/DGYW motifs per sequence than do human V segments (p=0.02). Further, motifs in CDRs of shark occurred 2-3x as often as in humans [human: average of 2.27 motifs per FR (range 1.97–2.65), 2.28 per CDR (range 1.97–2.59); shark: average of 3.25 motifs per FR (range 2.85–3.88), 5.09 per CDR (range 4.02–6.16); data not shown]. The bias we found for nonsynonymous and non-conservative mutations in TcRα CDRs in the shark thymus are consistent with more than simple repertoire diversification; it suggests selection for changes in paratope.

We identified SHM from identical cDNA clones originating from both thymus and spiral valve tissues (see Table 6), suggesting that T cells with SHM-modified receptors must have originated within the thymus and then traveled to peripheral gut-associated lymphoid tissue. Unsurprisingly, we detected the most AID expression within the inner cortex, medulla, and CMJ of shark thymus, where rearrangement and testing of TcRα takes place in mammals. Positive selection on self-MHC/self peptide for mature thymocytes begins with the CD4/CD8 double positive (DP) stage of development while differentiation into CD4/CD8 single positive (SP) cells requires that the TcR interact with MHC (Huesmann et al., 1991). If there is no TcR: MHC/peptide match found, T cell differentiation stalls with failure to be positively selected (Reinherz et al., 1999). However, the unusual nature of the TcRα locus, with up to 100 J segments depending on species, permits multiple successive rearrangements within a single cell, rescuing non-productive or self-selectable receptors with further gene rearrangements, a process called receptor editing (Bedel et al., 2012; Borgulya et al., 1992; Guo et al., 2002; Petrie et al., 1993). In mice, unlike the situation in developing B cells, receptor editing does not seem to rescue T cells from negative selection (Kreslavsky et al., 2013) and thus provides several opportunities for positive selection of DP thymocytes. Thymic nurse cells may help optimize these opportunities for selection by providing microenvironments favorable to secondary alpha chain rearrangement (Nakagawa et al., 2012).

In developing shark thymocytes, SHM in TcRα loci in conjunction with receptor editing (note that sharks, like all other gnathostomes, have a large number of TcRα J segments) could be involved in salvaging cells for positive selection or rescuing cells from death by negative selection. If AID-induced SHM occurs in conjunction with receptor editing and positive selection, AID should be upregulated in cells undergoing RAG-mediated alpha rearrangement (and thus in cells also expressing RAG). However, if SHM occurs after rearrangement of TcR α and thus used for rescuing cells during negative selection, the same T cell would not express both AID and RAG. While we cannot determine conclusively without RAG expression data, the patterns of AID and TcRα expression (Figure 9) suggest that AID is upregulated after cells proliferate and diversify following alpha rearrangement (within the ‘ring’ of cells). Thus, it is likely that AID is used primarily to rescue cells from negative selection, providing a ‘mini-expanded self-referential repertoire’ (Figure 10) and reducing the ‘profligate waste of thymocytes’ (Murphy and Weaver, 2017). However, based on the works above by Kreslavsky et al. and Nakagawa et al. in mice, we cannot discount the possibility that TcRs use SHM in conjunction with receptor editing for positive selection since developing shark T cells could still undergo negative selection after SHM (Figure 10). We cannot completely rule out AID use in mature shark T cells, although our sequence data show no greater mutation frequency in the periphery, and abatement of AID expression in the thymic medulla are consistent with AID being a mechanism used only in T cell development. Further studies examining expression data from single cells could elucidate the timing of AID-catalyzed SHM in relation to T cell development.

Figure 10. Model predicting how AID acts on T cells in the thymus.

Figure 10.

CD4/CD8 double negative (DN) thymocytes in the subcapsular region (SC) and cortex rearrange the β chain, using a surrogate pTα receptor to test for expression signaling. Cells with productive β arrangements then proliferate and express both CD4 and CD8, becoming double positive thymocytes (DPs). As DPs move toward the inner cortex and cortico-medullary junction (CMJ) where α chain rearranges, cells may begin to express AID. Non-productive rearrangements can be rescued from apoptosis by receptor editing or by receptor salvaging, in which AID catalyzes SHM to produce cells with improved affinity to MHC:Ag complexes (to pass positive selection). Salvaged thymocytes then proliferate and express either CD4 or CD8 on their surface as single-positive (SP) cells. AID-mediated receptor salvaging may also reduce recognition of self-peptide, rescuing self-reactive thymocytes from apoptosis (to pass negative selection). [sc: subcapsular region; cmj: corticomedullary junction; green shading indicates region of AID expression].

These results are not without precedent. Qin et al. (2011) reported endogenous AID expression by peripheral CD4+ T cells and immature B cells in mice. T cells that expressed AID also produce a distinctive cytokine profile, are associated with cell activation, and increase in abundance with age, suggesting these cells have distinctive long-term functions in aging cells. In immature B cells, AID expression may help negatively select autoreactive B cells and contribute to primary repertoire diversity. Together these results may indicate that AID expression in mice is a relic of a more extended expression in earlier species (Qin et al., 2011). Some mammals (e.g. rabbit, sheep, and cattle) use SHM in B cell primary repertoire diversification in the gut associated lymphoid tissues (Alitheen et al., 2010; Archer et al., 1963; Becker and Knight, 1990; Butler et al., 2011; Lanning et al., 2000; Reynaud et al., 1995; Reynaud et al., 1991; Reynolds and Morris, 1983). Even more recently, research implicated AID in central B cell tolerance in mammalian pre-B cells (Cantaert et al., 2015; Kuraoka et al., 2011), and there are examples of AID-mediated SHM being used alongside RAG-mediated V(D)J recombination in primary B cell repertoire generation in many mammals and in AID-mediated primary B cell diversification during gene conversion in the bird bursa of Fabricius (Reynaud et al., 1987; Thompson and Neiman, 1987). We suggest that these B-cell-specific features are much later mechanisms of AID-driven primary lymphocyte repertoire diversification and honing, perhaps mechanisms convergent with the thymic process we describe here. However, we predict that studies of this type in other vertebrate species will reveal the use of SHM as a T cell diversifying mechanism in a much broader collection of species.

Although recent studies report B cells present in the thymus of mice (Perera and Huang, 2015; Perera et al., 2013), Miracle et al. examined B cell expression levels (IgM and IgX) in various tissues (including thymus) at different ages in the clearnose skate. While skate thymus expressed both IgM and IgX in early life stages, adult skates no longer express Ig in the thymus (Miracle et al., 2001). Criscitiello and Flajnik, 2007 also found little evidence that Ig light chain is expressed in adult nurse shark thymus (see Fig S3 of paper), corroborating older northern blot studies with heavy chain in the species (Rumfelt et al., 2001; Rumfelt et al., 2004). A concurrent study within our lab used RT-qPCR to analyze IgM expression within shark thymus and found that, while young sharks do express IgM in the thymus, adult nurse sharks (which we used in the current study) do not express IgM (data not shown). Further, preliminary results using FISH probes to IgM constant region also indicate that adult nurse sharks do not express IgM in the thymus. What little Ig expression we found in the thymus did not co-localize with AID expression.

Shark TcRs are capable of a wide variety of diversification mechanisms. In addition to RAG-mediated combinatorial and junctional diversity from complex loci, TcRδ (and possibly α) rearrange Ig-TcR chimera using IgM and IgW V exons with TcR D-J-C (Criscitiello et al., 2010). The TcR δ locus also encodes the doubly rearranging NAR-TcR, a δ chain with two diverse V domains (Criscitiello et al., 2006). Interestingly, our existing data do not suggest that SHM targets the IgHV or the TcR δV of the NAR-TcR δV chains for diversification. This suggests great control over the (nearly synchronous if not concomitant) expression of the potentially genotoxic AID and RAG. Thus, SHM at the γ, δ, and α loci adds to the battery of extraordinary diversification mechanisms used by shark lymphocytes in antigen recognition, although we do not yet understand the full effects of SHM on the animal’s immunity to infectious disease. As for the dangers of any aberrant mutational activity genome-wide, sharks could be more resilient than other taxa due to their inherent slow rate of mutation (Martin, 1999), possibly linked to the exceptional longevity of some individuals (Nielsen et al., 2016).

In the broader scope of lymphocyte evolution, we must consider whether the ancestral vertebrate lymphocyte employed APOBEC-family-mediated diversification before the ‘big bang’ of RAG (reviewed in [Hirano, 2015]). Lamprey lymphocytes express at least two lymphocyte-specific cytidine deaminases (CDA1/CDA2) in the AID/APOBEC family. These deaminases emerged phylogenetically as the closest sister group in the AID/APOBEC family to the AID used by gnathostomes for Ig class-switch recombination, somatic hypermutation, and Ig gene conversion (Rogozin et al., 2007). CDA-mediated gene rearrangement in lampreys occurs in a manner similar to AID-induced immunoglobulin gene conversion in some birds and mammals (Rogozin et al., 2007; Zheng et al., 1994). One study suggested that VLRA (the analog to αβ TcR in jawed vertebrates) also might use CDA to affinity mature its receptors, indicating that CDA contributes both to repertoire generation and to somatic mutation after antigen exposure (Deng et al., 2010; Flajnik, 2014). If this is true, it may be possible that the ancestor to modern vertebrates also used an AID-like enzyme to assist with lymphocyte receptor development in a thymus-like organ. The expression of AID in the thymus of primitive sharks may be a remnant of this ancestral process, a mechanism lost in later vertebrates because of its potential for breaking down self-tolerance in mature lymphocytes. Perhaps, agnathans evolved specific APOBEC molecules for diversification of their B and T like VLRs, while gnathostomes evolved AID for T cell primary repertoire diversification (Neils Jerne’s ‘mutant breeding organ’ [Jerne, 1971]) and B cell affinity maturation, eventually co-opting AID for use in class switch recombination at IGH translocons, and later still, gene conversion and SHM for primary B cell repertoires.

From this trend of comparative TcR studies, we conclude with two hypotheses that we will test with further immunogenetic and functional studies in shark and other vertebrate models. First, the division between B and T cell repertoire diversification components and mechanisms was not as clear-cut in ancestral lymphocytes as in modern humans and mice. The second is that different vertebrate groups have not only evolved myriad diversifications for Ig repertoires and function, but TcR biology may be just as varied. This premise is already accumulating ample supporting evidence as IgHV domains (Parra et al., 2010; Parra et al., 2012), high allelic polymorphism (Criscitiello et al., 2004a; Criscitiello et al., 2004b), germline joined V exons (Wang and Miller, 2012; Wang et al., 2011), and now mechanisms such as SHM, all once considered the immune privilege of Ig or MHC genes, are also employed for TcRs.

Materials and methods

Study animals

TCR sequence data used in this study came from two adult female nurse sharks (‘Joanie’ and ‘Mary Junior’; Ginglymostoma cirratum) delivered by caesarian section off the Florida Keys and matured in the aquatic vivarium of the University of Maryland’s Center of Marine Biotechnology. We used published T cell αV sequences (Criscitiello et al., 2010) as a baseline for αV locus numbering (sharks Yellow and 1299), though we did not analyze any of these sequences for mutation.

Total RNA isolation and cDNA synthesis

We harvested tissues from animals after MS-222 (Argent, Redmond, WA) overdose, and immediately purified RNA with TRIzol reagent (Life Technologies, Carlsbad, CA). Nurse shark thymi are located dorsomedial to the gills (Luer et al., 1995) in the crevasse between the epaxial and brachial constrictor muscle groups. We used 5 ug total RNA from spiral valve, spleen, thymus, and peripheral blood leukocytes (PBL) for oligo-dT primed cDNA generation with Superscript III First Strand Synthesis System (Thermo Fisher Scientific, Inc., Waltham, MA). (Criscitiello et al., 2010) We estimated cDNA concentration using a Nanodrop 2000 Spectrophotometer (Thermo Fisher Scientific, Inc.).

RACE PCR, Cloning, and Sanger Sequencing

We generated a 5’ RACE (Rapid Amplification of cDNA Ends) library using the GeneRacer Kit (Life Technologies) and reverse primers designed to the end of the shark TcRβ, TcRγ, or TcRδ variable (V) region or to the middle of the shark TcRα constant (C) region. We amplified RACE products using Phusion High-Fidelity DNA polymerase (New England Bio Labs, Inc., Ipswich, MA) to minimize PCR errors under these specific PCR conditions: primary denaturing at 94°C for 2 min; 30 cycles at 94°C for 30 s and 78°C for 1 min; and a final extension at 72°C for 10 min. Using this RACE library, we then amplified a specific α V region using a gene-specific primer to its leader region and the following PCR conditions: 98°C for 1 min; 25 cycles of 98°C for 5 s, 49–60°C for 30 s, 72°C for 150 s; 72°C for 10 min. Annealing temperatures varied for each amplified αV (see Table 7). We visualized PCR products with agarose gel (8%) electrophoresis and then excised bands of correct size. We then isolated amplified bands from agarose gels using the PureLink Quick Gel Extraction Kit (Life Technologies) or RICO chips (TaKaRa Bio USA, Mountain View, CA).

Table 7. List of forward (F) and reverse (R) primers used to generate T cell receptor (TcR) sequences and expression data.

[AID: Activation induced cytidine deaminase; B2M: beta-2 microglobulin; α: alpha; β: beta; γ: gamma; δ: delta; V: variable region; C: constant region].

Primer F/R ID Location Nucleotide sequence (5' to 3') Amino acid Tm
TcR αV1 F MFC370 leader region of αV1 ATG TTG CCT GAA GCT C MLPEA 55
R MFC191 alpha C region CAT TGG TGG ATA GCA AGC CCT TCG AT SKGLLSTN 76
TcR αV4 F MFC122 beginning of αV4 GTC TCC TCA GTT GTT CGT AC VSSVVR 58
R MFC123 end of αV4 CAG TAA TAC ACA GCA GCG TC DAAVYY 58
F MFC374 leader region of αV4 TGG ATT GTG TGG GCA GTA WIVWAV 54
R MFC191 alpha C region CAT TGG TGG ATA GCA AGC CCT TCG AT SKGLLSTN 76
TcR αV5 F MFC124 beginning of αV5 CTC AGG AAG GAG AGA TTA TCA C QEGEII 60
R MFC125 end of αV5 CAA TGA TAC ACG GCG GAG TC DSAVYH 60
F MFC124 beginning of αV5 CTC AGG AAG GAG AGA TTA TCA C QEGEII 60
R MFC191 alpha C region CAT TGG TGG ATA GCA AGC CCT TCG AT SKGLLSTN 76
TcR αV7 F MFC376 end of leader αV7 AGC GAT GGA GTT TCT GTG ATT SDGVSVI 58
R MFC191 alpha C region CAT TGG TGG ATA GCA AGC CCT TCG AT SKGLLSTN 76
TcR αV10 F MFC378 leader region of αV10 CTA TTT CTT CAC TAC CGC AG YFFTTA 56
R MFC191 alpha C region CAT TGG TGG ATA GCA AGC CCT TCG AT SKGLLSTN 76
TcR α 5' F GeneRacer 5' Nested homologous to RNA oligo GGA CAC TGA CAT GGA CTG AAG GAG TA -- 78
TcR α 3' R MFC191 alpha C region CAT TGG TGG ATA GCA AGC CCT TCG AT SKGLLSTN 76
TcR βV1 F MFC126 beginning of bV1 CTC CGT ACA TCG TCT CTA TTG PYIVSI 60
R MFC127 end of βV1 CAC GCA CAG AAA TAG ACA GC AVYFCA 58
TcR βV2 F MFC128 beginning of βV2 CTA CGT GGA GCA GTC TCC ATC YVEQSP 63
R MFC129 end of βV2 GCA CGC ACA ATA ATA GAC AGC C AVYYCAC 62
TcR βV3 F MFC130 beginning of βV3 CTA CGT GGA ACA GTC TCC TTC YVEQSP 61
R MFC131 end of βV3 CAC GCG CAG AAA TAG ACA G VYFCA 57
TcR βV5 F MFCb50 beginning of βV5 GTT CGG TGC TCT TTC TCT GC MFGALSLH 60
R MFCb54 end of βV5 GAC TGC AGT ATC AGT CGG CAC C LVPTDTAV 66
TcR γV1 F MFCg56 beginning of γV1 GTC GCT GTA TTA CTG GCT CAT TG MSLYYWL 63
R MFCg59 end of γV1 GAG CGC ACA GTA ATA GGT GGC AG TATYYCAL 67
TcR γV3 F MFCg58 beginning of γV3 GAA GGG TCA CGT CCT TGC G MKGHVLA 62
R MFCg61 end of γV3 GAT CCC AGA GTC ATC CTC EDDSGI 56
F MFC170 beginning of γV3 CAA TAA CCA GAG CAC CGG G ITRAP 56
R MFC171 end of γV3 AGA TCC CAG AGT CGT CCT C EDDSGI 56
TcR δV3 F MFCd62 beginning of δV3 GAT TCC CCG TCC CTG GTG TC DSPSLVS 65
R MFCd66 end of δV3 CAG TGC ACA GTG ATA CAC AGC AVYHCAL 61
TcR δV5 F MFCd63 beginning of δV5 GCA GCT ACT CAG TAT CTG G MQLLSIW 57
R MFCd67 end of δV5 GAA AGC ACA GTA ATA CAG AG ALYYCAF 54
TcR δV7 F MFC172 beginning of δV7 CTG TCA CTC AGT TAT TCT CCT C VTQLFS 60
R MFC173 end of δV7 GCA GCC CAG TTA TAG TCA AAC LTITGL 60
TcR δV12 F MFC174 beginning of δV12 CAG AGC CCA CCT CAG TTA C QSPPQ; 60
R MFC175 end of δV12 GAG CGC AGT AAT AGA TGG C AIYYCA 57
TcR δV16 R MFC176 end of δV16 GCA GCT CCG AGA TAG ACA AC LSISEL 60
F MFC177 beginning of δV16 GAG TCC TGG CTC ACG CAA TC ESWLTQ 63
TcR δV17 F MFC178 beginning of δV17 CAG TCT TGG TCA GAA ATA ACC QSWSEIT 57
R MFC179 end of δV17 CAA CTG AAG ATA AGT GAT CG ITYLQL 54
AID F MFC342 beginning of AID exon 1 AGG CAC GAG ACC TAC ATG TTG RHETYML 61
R MFC347 end of AID exon 2 TGA ACC AGG TGA GGC GGT A YRLTWF 60
B2M F MFC211 first cysteine AAC GTG TTG CTC TGT CAT GC NVLLCHA 58
R MFC212 before second cysteine GGG GTG AAC TCC ACA TAA CG RYVEFTP 60

We transformed PCR amplicons into One-Shot Top10-competent cells (Thermo Fisher) using a pCR4-TOPO TA blunt end vector and cloning kit (Thermo Fisher) followed by a Zyppy plasmid miniprep kit (Zymo Research, Irvine, CA) for plasmid purification of individual clones (Criscitiello et al., 2012). We checked insert size using an Eco RI restriction enzyme (Promega Corp, Madison, WI), then amplified and purified the sequencing reaction using BigDye xTerminator Sequencing and Purification Kit (Thermo Fisher Scientific, Inc.). We submitted samples for sequencing to the DNA Technologies Core Lab on the Texas A and M University campus (College Station, TX). We deposited sequences in GenBank with the following accession numbers: Alpha KY189332-KY189354 and KY366469-KY355487; Beta KY351708-KY366487; Gamma KY351639-KY351707; Delta KY346705-KY346816.

Sequence alignment and tree building

We used Geneious and BioEdit (v7.2.5, Ibis BioSciences, Carlsbad, CA) software to manage DNA sequence data. We aligned nucleotide and amino acid sequences using the ClustalW Multiple Alignment tool in Geneious with a gap penalty of 15, a gap extension penalty of 6.66, and free end gaps. We manually adjusted the alignments as necessary. We determined sequence relationships phylogenetically using the Geneious tree builder with default settings. We grouped sequences into unique V families based on 70% nucleotide sequence identity and 75% amino acid sequence identity (Brodeur and Riblet, 1984; Rumfelt et al., 2004) using the same α V numbering scheme as in Criscitiello et al. (2010). We created graphical alignments in BioEdit and imported these files into Microsoft Word to generate figures. Our preliminary dataset contained 564 TcRα clones (encoding 286 unique amino acid sequences representing nine Vα families) from three tissues (PBL, spleen, thymus) of two sharks (Joanie, Mary Junior). Using this dataset, we separated sequences containing identical CDR3 rearrangements and counted mutations within each TcR ‘clone family’ bearing the V-J rearrangement from single founder thymocytes.

Identification of TcR Vα genes in the nurse shark genome

We probed the filter sets for the G. cirratum BAC library (Arizona Genomics Institute, Tucson, AZ) of shark ‘Yellow’ and screened with variable segment and constant region probes for TcRα and TcRδ. We cultured several positive clones and isolated BAC DNA according to manufacturer’s protocol with the Qiagen Large Construct Kit (Qiagen, Valencia, CA). We sent purified BAC DNA to the Duke University Center for Genomic and Computational Biology (Durham, NC) for PacBio SMRT (Menlo Park, CA) large insert (15–20 kb) library preparation, sequenced on the PacBio RSII platform with P6-C4 chemistry. Read correction and contig assembly were performed with the PBcR software (Koren et al., 2012), using the BLASR error correction method and the Celera Assembler 8.2. We annotated the resulting sequencing within the Geneious software suite (v9.1.5, Biomatters Ltd., Auckland, NZ) using a custom BLAST database of all TcR and IgH sequences for G. cirratum in the IMGT database (Montpellier, France).

Our search yielded 17 α/δ V germline segments, significantly fewer than expected based on TcR αV segment numbers in other species (Murphy and Weaver, 2017). Of these 17 segments, only 13 contained unique nucleotide sequences, and all V segments were highly similar to each other (69–100% nucleotide and 52–100% amino acid identity). Twelve germline V segments shared >93% nucleotide identity (>85% amino acid identity), with three segments differing by only a single nucleotide (Figure 11). Based on the variability we observed in our sequence data, these 17 germline α/δ V segments must represent only a small portion of the available Vs in the nurse shark genome.

Figure 11. Observed TcR Alpha/Delta germline Vs exhibit high sequence identity.

Figure 11.

Nucleotide (A) and amino acid (B) alignments of 17 germline variable (V) region gene segments. Two V groups contained three identical germline gene segments each (highlighted in gray), leaving only 13 unique V gene segments. Boxes surround conserved amino acids. Numbers at the ends of sequences indicate percent identity to the first germline sequence (α/δ V5) within the alignment.

We compared these 17 germline α/δ V segments to our TcR αV database containing all nine potential V families from two different sharks. All 17 germline α/δ V segments aligned to our αV4 data with >75% nucleotide identity, while 15 segments shared >93% nucleotide identity to at least one sequence in our αV4 dataset. Of the 60 sequences in our αV4 dataset, 37 sequences aligned specifically to eight germline α/δ V segments, with alignments containing one to 17 aV4 sequences per germline segment (nucleotide alignments shared >97% identity; Figure 12.). While we did observe nucleotide differences within alignments, most differed by fewer than four nucleotides from the germline α/δ V segments. Because several germline segments differed by only a single nucleotide and we are certain that we have not found all α/δ V segments in the genome, these differences could represent variation in alleles or individuals rather than mutation. Thus, we chose not to rely on these data for mutational analysis.

Figure 12. Observed germline sequences align only to TcR αV4 clones.

Figure 12.

Nucleotide alignments of TcR αV4 thymocyte clones to known germline V segments. Highlighted and underlined bases denote nonsynonymous and synonymous differences (respectively) to the germline V segment. Boxed regions represent nucleotides of the third complementarity-determining region (CDR3) according to IMGT guidelines, accounting for differences between clone and germline sequences. We highlight the single nucleotide/amino acid change between α/δ V7 and α/δ V10 germline segments in red.

Mutation frequency

We defined mutation frequency as the number of nucleotide changes divided by the total number of nucleotides within a particular region (e.g. FR, CDR, J, C) based on differences to a consensus sequence. We classified all nucleotide changes as either synonymous (SYN) or non-synonymous (NSYN) mutations based on whether or not the codon was unaltered or altered, respectively. For tandem base changes, we assessed the effect of each nucleotide change independent of its neighboring mutation(s). We then compared mutation frequencies between CDR and FR regions for all clone families that contained mutations using a Student’s 1-tailed t-test unless otherwise noted.

Determination of hotspots

We searched for the ProSite motifs DGYW/WRCH (G:C mutable target) and WA/TW (A:T mutable target) using the motif search function in Geneious. These motifs serve as common ‘hotspots’ for SHM within Ig variable regions, where AID favors the G/C bases within DGYW/WRCH motifs during the first phase of SHM while low-fidelity polymerases (i.e., polymerase η) preferentially target A/T bases within WA/TW motifs during the second phase of SHM (Wei et al., 2015; Rogozin and Diaz, 2004; Chen et al., 2012). We counted only motifs present in the consensus sequence (rather than those created by the mutation) as hotspots. For each domain, we first counted the number of target nucleotides within DGYW/WRCH or WA/TW hotspots. We then examined each mutation to determine if it occurred inside or outside a hotspot. We counted changes from the consensus sequence to a target nucleotide within its respective motif as a hotspot mutation. We defined the frequency of hotspot mutation as the number of mutations (of target nucleotides) occurring within hotspots divided by the total number of mutations for each region. From these data, we compared the mutability of bases between FR and CDR regions using χ2 Analysis.

Base substitution indices

We calculated a mutability index for each nucleotide using methods similar to Chen et al. (2012). In our case, we derived the expected number of mutations by multiplying the frequency of a particular nucleotide within a family of sequences (e.g. α V2) by the total number of observed mutations within that family. We then defined the mutability index as in Chen et al. (2012) [the observed number of mutations of a specific nucleotide divided by the expected number of mutations of that nucleotide, with a value of 1.00 indicating random mutation]. We used χ2 analysis to compare mutability indices between FR and CDR regions.

In situ hybridization

We used thymus tissue from an adult nurse shark for in situ hybridization as previously described (Criscitiello et al., 2010). We generated a probe for G. cirratum AID mRNA with primers NSAIDEH2 and NSAIDEH1 (Table 7) designed to amplify 211 base pairs of cDNA sequence of the first two AID exons (sequence and primers kindly shared by Ellen Hsu). We acquired images on an Axioscop2 microscope with AxioCam MRc5 (Zeiss, Thornwood CT) using Zeiss Axio Vision software.

Additionally, we performed fluorescence in situ hybridization (FISH) on adult nurse shark (‘Black’) thymus tissue. Slides contained two 8 μm thick sections of flash frozen thymus tissue preserved in OCT. We designed custom Stellaris FISH Probes against the TcR alpha constant region (αC) for T cell identification and exons 1 and 2 of AID by utilizing the Stellaris RNA FISH Probe Designer (Biosearch Technologies Inc., Petaluma, CA) available online at www.biosearchtech.com/stellarisdesigner (Version 2). We hybridized TcR αC with the CalFluor Red 610 fluorophore and the AID sequence with the Quasar 670 fluorophore for the Stellaris RNA FISH Probe set (Biosearch Technologies, Inc.). We followed all manufacturer’s instructions for frozen tissue (available online at www.biosearchtech.com/stellarisprotocols), allowing hybridization probes to incubate for 16 hr. We counterstained slides with wash buffer containing 5 ng/mL of DAPI (Sigma-Aldrich, St. Louis, MO). We obtained 10x, 20x, and 63x images using a Zeiss Stallion Digital Imaging Workstation including a 2x CoolSnap HQ Camera and Zeiss Stallion software. We merged Z-stacked images of each fluorophore together and edited and processed images using ImageJ software, version 1.47 (Schneider et al., 2012).

Real-time qPCR for AID expression

We synthesized cDNA from nurse shark spleen, thymus, muscle, and forebrain RNA (see RNA purification and extraction methods above) using SuperScript III First-strand Synthesis System (Thermo Fisher Scientific, Inc.) and a 1:1 mixture of oligo-dT and random hexamer primers. We then amplified cDNA using touchdown PCR on an MJ mini thermal cycler (Bio-Rad, Hercules, CA) and GoTaq colorless DNA polymerase (Promega Corp) using the following conditions: primary denaturing at 94°C for 2 min, five cycles at 94°C for 30 s and 56°C for 4 min; five cycles at 94°C for 30 s and 54°C for 4 min; 20 cycles at 94°C for 30 s, and 52°C for 30 s, and 72°C for 4 min; with a final extension at 72°C for 10 min. We visualized PCR products using agarose gel electrophoresis (as described above) to verify presence of AID in each tissue. We then cloned and sequenced the resulting PCR products to confirm the sequence was AID.

We looked for relative AID expression in shark spleen (positive control, where B cell AID-mediated SHM is known to occur), thymus, and forebrain (negative control) at four tissue concentrations (50 ng, 25 ng, 12.5 ng, and 6.25 ng) using the SYBR-green RT-PCR reagents kit (Thermo Fisher Scientific, Inc.) on a LightCycler 480 System (Roche Diagnostics Corp, Indianapolis, IN). We analyzed relative quantification using the LightCycler480 software and quantified relative AID expression using the ΔΔCq method. We normalized results against shark muscle tissue using beta2-microglobulin (β2M) as a reference gene. We present data as expression-fold changes of AID to β2M.

Acknowledgements

We acknowledge expert technical assistance from Pat Chen and Ferenc Livak for helpful advice and comments on the manuscript. Ellen Hsu kindly shared a partial nurse shark AID sequence.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Michael F Criscitiello, Email: mcriscitiello@cvm.tamu.edu.

David G Schatz, Yale University School of Medicine, United States.

Funding Information

This paper was supported by the following grants:

  • National Science Foundation IOS 1257829 to Michael F Criscitiello.

  • National Institute of Allergy and Infectious Diseases R01OD0549 to Martin F Flajnik.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—original draft.

Formal analysis, Investigation, Methodology.

Software, Formal analysis, Investigation, Methodology, Writing—review and editing.

Formal analysis, Investigation, Methodology.

Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Writing—review and editing.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Investigation, Project administration, Writing—review and editing.

Ethics

Animal experimentation: These studies were carried out in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. The protocol was approved by the Animal Care and Use Committees at Texas A&M University and University of Maryland School of Medicine. Experiments in the Criscitiello lab were performed under Texas A&M University Institutional Biosafety Committee permit IBC 2014-293 and Animal Use Protocol 2015-0374.

Additional files

Transparent reporting form
DOI: 10.7554/eLife.28477.026

Data availability

T cell receptor sequences have been deposited in Genbank of NCBI. Alpha KY189332-KY189354 and KY366469-KY355487; Beta KY351708-KY366487; Gamma KY351639-KY351707; Delta KY346705-KY346816.

References

  1. Adams EJ, Chien YH, Garcia KC. Structure of a gammadelta T cell receptor in complex with the nonclassical MHC T22. Science. 2005;308:227–231. doi: 10.1126/science.1106885. [DOI] [PubMed] [Google Scholar]
  2. Alder MN, Rogozin IB, Iyer LM, Glazko GV, Cooper MD, Pancer Z. Diversity and function of adaptive immune receptors in a jawless vertebrate. Science. 2005;310:1970–1973. doi: 10.1126/science.1119420. [DOI] [PubMed] [Google Scholar]
  3. Alitheen NB, McClure S, McCullagh P. B-cell development: one problem, multiple solutions. Immunology and Cell Biology. 2010;88:445–450. doi: 10.1038/icb.2009.119. [DOI] [PubMed] [Google Scholar]
  4. Allison TJ, Garboczi DN. Structure of gammadelta T cell receptors and their recognition of non-peptide antigens. Molecular Immunology. 2002;38:1051–1061. doi: 10.1016/S0161-5890(02)00034-2. [DOI] [PubMed] [Google Scholar]
  5. Allison TJ, Winter CC, Fournié JJ, Bonneville M, Garboczi DN. Structure of a human gammadelta T-cell antigen receptor. Nature. 2001;411:820–824. doi: 10.1038/35081115. [DOI] [PubMed] [Google Scholar]
  6. Anderson MK, Shamblott MJ, Litman RT, Litman GW. Generation of immunoglobulin light chain gene diversity in Raja erinacea is not associated with somatic rearrangement, an exception to a central paradigm of B cell immunity. Journal of Experimental Medicine. 1995;182:109–119. doi: 10.1084/jem.182.1.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Antonacci R, Mineccia M, Lefranc MP, Ashmaoui HM, Lanave C, Piccinni B, Pesole G, Hassanane MS, Massari S, Ciccarese S. Expression and genomic analyses of Camelus dromedarius T cell receptor delta (TRD) genes reveal a variable domain repertoire enlargement due to CDR3 diversification and somatic mutation. Molecular Immunology. 2011;48:1384–1396. doi: 10.1016/j.molimm.2011.03.011. [DOI] [PubMed] [Google Scholar]
  8. Archer OK, Sutherland DER, Good RA. Appendix of the RABBIT: A homologue of the bursa in the chicken? Nature. 1963;200:337–339. doi: 10.1038/200337a0. [DOI] [PubMed] [Google Scholar]
  9. Álvarez-Prado ÁF, Pérez-Durán P, Pérez-García A, Benguria A, Torroja C, de Yébenes VG, Ramiro AR. A broad atlas of somatic hypermutation allows prediction of activation-induced deaminase targets. The Journal of Experimental Medicine. 2018;215:761–771. doi: 10.1084/jem.20171738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bachl J, Wabl M. Hypermutation in T cells questioned. Nature. 1995;375:285–286. doi: 10.1038/375285c0. [DOI] [PubMed] [Google Scholar]
  11. Barreto VM, Magor BG. Activation-induced cytidine deaminase structure and functions: a species comparative view. Developmental & Comparative Immunology. 2011;35:991–1007. doi: 10.1016/j.dci.2011.02.005. [DOI] [PubMed] [Google Scholar]
  12. Becker RS, Knight KL. Somatic diversification of immunoglobulin heavy chain VDJ genes: evidence for somatic gene conversion in rabbits. Cell. 1990;63:987–997. doi: 10.1016/0092-8674(90)90502-6. [DOI] [PubMed] [Google Scholar]
  13. Bedel R, Matsuda JL, Brigl M, White J, Kappler J, Marrack P, Gapin L. Lower TCR repertoire diversity in Traj18-deficient mice. Nature Immunology. 2012;13:705–706. doi: 10.1038/ni.2347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bernstein RM, Schluter SF, Lake DF, Marchalonis JJ. Evolutionary conservation and molecular cloning of the recombinase activating gene 1. Biochemical and Biophysical Research Communications. 1994;205:687–692. doi: 10.1006/bbrc.1994.2720. [DOI] [PubMed] [Google Scholar]
  15. Borgulya P, Kishi H, Uematsu Y, von Boehmer H. Exclusion and inclusion of alpha and beta T cell receptor alleles. Cell. 1992;69:529–537. doi: 10.1016/0092-8674(92)90453-J. [DOI] [PubMed] [Google Scholar]
  16. Brodeur PH, Riblet R. The immunoglobulin heavy chain variable region (Igh-V) locus in the mouse. I. One hundred Igh-V genes comprise seven families of homologous genes. European Journal of Immunology. 1984;14:922–930. doi: 10.1002/eji.1830141012. [DOI] [PubMed] [Google Scholar]
  17. Buslepp J, Wang H, Biddison WE, Appella E, Collins EJ. A correlation between TCR Valpha docking on MHC and CD8 dependence: implications for T cell selection. Immunity. 2003;19:595–606. doi: 10.1016/S1074-7613(03)00269-3. [DOI] [PubMed] [Google Scholar]
  18. Butler JE, Santiago-Mateo K, Sun XZ, Wertz N, Sinkora M, Francis DH. Antibody repertoire development in fetal and neonatal piglets. XX. B cell lymphogenesis is absent in the ileal Peyer's patches, their repertoire development is antigen dependent, and they are not required for B cell maintenance. The Journal of Immunology. 2011;187:5141–5149. doi: 10.4049/jimmunol.1101871. [DOI] [PubMed] [Google Scholar]
  19. Cantaert T, Schickel JN, Bannock JM, Ng YS, Massad C, Oe T, Wu R, Lavoie A, Walter JE, Notarangelo LD, Al-Herz W, Kilic SS, Ochs HD, Nonoyama S, Durandy A, Meffre E. Activation-Induced cytidine deaminase expression in human b cell precursors is essential for central B cell tolerance. Immunity. 2015;43:884–895. doi: 10.1016/j.immuni.2015.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Chen H, Bernstein H, Ranganathan P, Schluter SF. Somatic hypermutation of TCR γ V genes in the sandbar shark. Developmental & Comparative Immunology. 2012;37:176–183. doi: 10.1016/j.dci.2011.08.018. [DOI] [PubMed] [Google Scholar]
  21. Chen H, Kshirsagar S, Jensen I, Lau K, Covarrubias R, Schluter SF, Marchalonis JJ. Characterization of arrangement and expression of the T cell receptor gamma locus in the sandbar shark. PNAS. 2009;106:8591–8596. doi: 10.1073/pnas.0811283106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Cheynier R, Henrichwark S, Wain-Hobson S. Somatic hypermutation of the T cell receptor V beta gene in microdissected splenic white pulps from HIV-1-positive patients. European Journal of Immunology. 1998;28:1604–1610. doi: 10.1002/(SICI)1521-4141(199805)28:05&#x0003c;1604::AID-IMMU1604&#x0003e;3.0.CO;2-R. [DOI] [PubMed] [Google Scholar]
  23. Choudhary M, Tamrakar A, Singh AK, Jain M, Jaiswal A, Kodgire P. AID Biology: A pathological and clinical perspective. International Reviews of Immunology. 2018;37:37–56. doi: 10.1080/08830185.2017.1369980. [DOI] [PubMed] [Google Scholar]
  24. Ciccarese S, Vaccarelli G, Lefranc MP, Tasco G, Consiglio A, Casadio R, Linguiti G, Antonacci R. Characteristics of the somatic hypermutation in the Camelus dromedarius T cell receptor gamma (TRG) and delta (TRD) variable domains. Developmental & Comparative Immunology. 2014;46:300–313. doi: 10.1016/j.dci.2014.05.001. [DOI] [PubMed] [Google Scholar]
  25. Conticello SG, Thomas CJ, Petersen-Mahrt SK, Neuberger MS. Evolution of the AID/APOBEC family of polynucleotide (deoxy)cytidine deaminases. Molecular Biology and Evolution. 2005;22:367–377. doi: 10.1093/molbev/msi026. [DOI] [PubMed] [Google Scholar]
  26. Criscitiello MF, Flajnik MF. Four primordial immunoglobulin light chain isotypes, including lambda and kappa, identified in the most primitive living jawed vertebrates. European Journal of Immunology. 2007;37:2683–2694. doi: 10.1002/eji.200737263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Criscitiello MF, Kamper SM, McKinney EC. Allelic polymorphism of TCRalpha chain constant domain genes in the bicolor damselfish. Developmental & Comparative Immunology. 2004a;28:781–792. doi: 10.1016/j.dci.2003.12.004. [DOI] [PubMed] [Google Scholar]
  28. Criscitiello MF, Ohta Y, Graham MD, Eubanks JO, Chen PL, Flajnik MF. Shark class II invariant chain reveals ancient conserved relationships with cathepsins and MHC class II. Developmental & Comparative Immunology. 2012;36:521–533. doi: 10.1016/j.dci.2011.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Criscitiello MF, Ohta Y, Saltis M, McKinney EC, Flajnik MF. Evolutionarily conserved TCR binding sites, identification of T cells in primary lymphoid tissues, and surprising trans-rearrangements in nurse shark. The Journal of Immunology. 2010;184:6950–6960. doi: 10.4049/jimmunol.0902774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Criscitiello MF, Saltis M, Flajnik MF. An evolutionarily mobile antigen receptor variable region gene: doubly rearranging NAR-TcR genes in sharks. Proceedings of the National Academy of Sciences. 2006;103:5036–5041. doi: 10.1073/pnas.0507074103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Criscitiello MF, Wermenstam NE, Pilström L, McKinney EC. Allelic polymorphism of T-cell receptor constant domains is widespread in fishes. Immunogenetics. 2004b;55:818–824. doi: 10.1007/s00251-004-0652-7. [DOI] [PubMed] [Google Scholar]
  32. Crouch EE, Li Z, Takizawa M, Fichtner-Feigl S, Gourzi P, Montaño C, Feigenbaum L, Wilson P, Janz S, Papavasiliou FN, Casellas R. Regulation of AID expression in the immune response. The Journal of Experimental Medicine. 2007;204:1145–1156. doi: 10.1084/jem.20061952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Deng L, Velikovsky CA, Xu G, Iyer LM, Tasumi S, Kerzic MC, Flajnik MF, Aravind L, Pancer Z, Mariuzza RA. A structural basis for antigen recognition by the T cell-like lymphocytes of sea lamprey. PNAS. 2010;107:13408–13413. doi: 10.1073/pnas.1005475107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Diaz M, Stanfield RL, Greenberg AS, Flajnik MF. Structural analysis, selection, and ontogeny of the shark new antigen receptor (IgNAR): identification of a new locus preferentially expressed in early development. Immunogenetics. 2002;54:501–512. doi: 10.1007/s00251-002-0479-z. [DOI] [PubMed] [Google Scholar]
  35. Diaz M, Velez J, Singh M, Cerny J, Flajnik MF. Mutational pattern of the nurse shark antigen receptor gene (NAR) is similar to that of mammalian Ig genes and to spontaneous mutations in evolution: the translesion synthesis model of somatic hypermutation. International Immunology. 1999;11:825–833. doi: 10.1093/intimm/11.5.825. [DOI] [PubMed] [Google Scholar]
  36. Flajnik MF. Comparative analyses of immunoglobulin genes: surprises and portents. Nature Reviews Immunology. 2002;2:688–698. doi: 10.1038/nri889. [DOI] [PubMed] [Google Scholar]
  37. Flajnik MF. Re-evaluation of the immunological Big Bang. Current Biology. 2014;24:R1060–R1065. doi: 10.1016/j.cub.2014.09.070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Garcia KC, Adams EJ. How the T cell receptor sees antigen--a structural view. Cell. 2005;122:333–336. doi: 10.1016/j.cell.2005.07.015. [DOI] [PubMed] [Google Scholar]
  39. Gellert M. V(D)J recombination: RAG proteins, repair factors, and regulation. Annual Review of Biochemistry. 2002;71:101–132. doi: 10.1146/annurev.biochem.71.090501.150203. [DOI] [PubMed] [Google Scholar]
  40. Guo J, Hawwari A, Li H, Sun Z, Mahanta SK, Littman DR, Krangel MS, He YW. Regulation of the TCRalpha repertoire by the survival window of CD4(+)CD8(+) thymocytes. Nature Immunology. 2002;3:469–476. doi: 10.1038/ni791. [DOI] [PubMed] [Google Scholar]
  41. Guo P, Hirano M, Herrin BR, Li J, Yu C, Sadlonova A, Cooper MD. Dual nature of the adaptive immune system in lampreys. Nature. 2009;459:796–801. doi: 10.1038/nature08068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Hansen VL, Miller RD. Pathogen-Host Interactions: Antigenic Variation v. Somatic Adaptations. Results and Problems in Cell Differentiation. Cham: Springer; 2015. The Evolution and Structure of Atypical T Cell Receptors; pp. 265–278. [DOI] [PubMed] [Google Scholar]
  43. Hirano M. Evolution of vertebrate adaptive immunity: immune cells and tissues, and AID/APOBEC cytidine deaminases. BioEssays. 2015;37:877–887. doi: 10.1002/bies.201400178. [DOI] [PubMed] [Google Scholar]
  44. Huesmann M, Scott B, Kisielow P, von Boehmer H. Kinetics and efficacy of positive selection in the thymus of normal and T cell receptor transgenic mice. Cell. 1991;66:533–540. doi: 10.1016/0092-8674(81)90016-7. [DOI] [PubMed] [Google Scholar]
  45. Jerne NK. The somatic generation of immune recognition. European Journal of Immunology. 1971;1:1–9. doi: 10.1002/eji.1830010102. [DOI] [PubMed] [Google Scholar]
  46. Kasahara M, Vazquez M, Sato K, McKinney EC, Flajnik MF. Evolution of the major histocompatibility complex: isolation of class II A cDNA clones from the cartilaginous fish. PNAS. 1992;89:6688–6692. doi: 10.1073/pnas.89.15.6688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kazen AR, Adams EJ. Evolution of the V, D, and J gene segments used in the primate gammadelta T-cell receptor reveals a dichotomy of conservation and diversity. PNAS. 2011;108:E332–E340. doi: 10.1073/pnas.1105105108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Koren S, Schatz MC, Walenz BP, Martin J, Howard JT, Ganapathy G, Wang Z, Rasko DA, McCombie WR, Jarvis ED. Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nature Biotechnology. 2012;30:693–700. doi: 10.1038/nbt.2280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kreslavsky T, Kim HJ, Koralov SB, Ghitza D, Buch T, Cantor H, Rajewsky K, von Boehmer H. Negative selection, not receptor editing, is a physiological response of autoreactive thymocytes. The Journal of Experimental Medicine. 2013;210:1911–1918. doi: 10.1084/jem.20130876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kuklina EM. Revision of the antigen receptor of T-lymphocytes. Biochemistry. 2006;71:827–837. doi: 10.1134/S0006297906080025. [DOI] [PubMed] [Google Scholar]
  51. Kuraoka M, Holl TM, Liao D, Womble M, Cain DW, Reynolds AE, Kelsoe G. Activation-induced cytidine deaminase mediates central tolerance in B cells. PNAS. 2011;108:11560–11565. doi: 10.1073/pnas.1102571108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Lanning D, Sethupathi P, Rhee KJ, Zhai SK, Knight KL. Intestinal microflora and diversification of the rabbit antibody repertoire. The Journal of Immunology. 2000;165:2012–2019. doi: 10.4049/jimmunol.165.4.2012. [DOI] [PubMed] [Google Scholar]
  53. Lantelme E, Orlando L, Porcedda P, Turinetto V, De Marchi M, Amoroso A, Mantovani S, Giachino C. An in vitro model of T cell receptor revision in mature human CD8+ T cells. Molecular Immunology. 2008;45:328–337. doi: 10.1016/j.molimm.2007.06.153. [DOI] [PubMed] [Google Scholar]
  54. Lee SS, Tranchina D, Ohta Y, Flajnik MF, Hsu E. Hypermutation in shark immunoglobulin light chain genes results in contiguous substitutions. Immunity. 2002;16:571–582. doi: 10.1016/S1074-7613(02)00300-X. [DOI] [PubMed] [Google Scholar]
  55. Lefranc MP, Pommié C, Ruiz M, Giudicelli V, Foulquier E, Truong L, Thouvenin-Contet V, Lefranc G. IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. Developmental & Comparative Immunology. 2003;27:55–77. doi: 10.1016/S0145-305X(02)00039-3. [DOI] [PubMed] [Google Scholar]
  56. Li Z, Woo CJ, Iglesias-Ussel MD, Ronai D, Scharff MD. The generation of antibody diversity through somatic hypermutation and class switch recombination. Genes & Development. 2004;18:1–11. doi: 10.1101/gad.1161904. [DOI] [PubMed] [Google Scholar]
  57. Luer CA, Walsh CJ, Bodine AB, Wyffels JT, Scott TR. The elasmobranch thymus: Anatomical, histological, and preliminary functional characterization. Journal of Experimental Zoology. 1995;273:342–354. doi: 10.1002/jez.1402730408. [DOI] [Google Scholar]
  58. Malecek K, Brandman J, Brodsky JE, Ohta Y, Flajnik MF, Hsu E. Somatic hypermutation and junctional diversification at Ig heavy chain loci in the nurse shark. The Journal of Immunology. 2005;175:8105–8115. doi: 10.4049/jimmunol.175.12.8105. [DOI] [PubMed] [Google Scholar]
  59. Mantovani S, Palermo B, Garbelli S, Campanelli R, Robustelli Della Cuna G, Gennari R, Benvenuto F, Lantelme E, Giachino C. Dominant TCR-alpha requirements for a self antigen recognition in humans. The Journal of Immunology. 2002;169:6253–6260. doi: 10.4049/jimmunol.169.11.6253. [DOI] [PubMed] [Google Scholar]
  60. Marshall B, Schulz R, Zhou M, Mellor A. Alternative splicing and hypermutation of a nonproductively rearranged TCR alpha-chain in a T cell hybridoma. Journal of Immunology. 1999;162:871–877. [PubMed] [Google Scholar]
  61. Martin AP. Substitution rates of organelle and nuclear genes in sharks: implicating metabolic rate (again) Molecular Biology and Evolution. 1999;16:996–1002. doi: 10.1093/oxfordjournals.molbev.a026189. [DOI] [PubMed] [Google Scholar]
  62. Miller RD. Those other mammals: the immunoglobulins and T cell receptors of marsupials and monotremes. Seminars in Immunology. 2010;22:3–9. doi: 10.1016/j.smim.2009.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Miracle AL, Anderson MK, Litman RT, Walsh CJ, Luer CA, Rothenberg EV, Litman GW. Complex expression patterns of lymphocyte-specific genes during the development of cartilaginous fish implicate unique lymphoid tissues in generating an immune repertoire. International Immunology. 2001;13:567–580. doi: 10.1093/intimm/13.4.567. [DOI] [PubMed] [Google Scholar]
  64. Muramatsu M, Kinoshita K, Fagarasan S, Yamada S, Shinkai Y, Honjo T. Class switch recombination and hypermutation require activation-induced cytidine deaminase (AID), a potential RNA editing enzyme. Cell. 2000;102:553–563. doi: 10.1016/S0092-8674(00)00078-7. [DOI] [PubMed] [Google Scholar]
  65. Murphy K. Weaver C. Janeway's Immunology. 9th ed. New York: Garland Science; 2017. 9780815345053 [Google Scholar]
  66. Nakagawa Y, Ohigashi I, Nitta T, Sakata M, Tanaka K, Murata S, Kanagawa O, Takahama Y. Thymic nurse cells provide microenvironment for secondary T cell receptor α rearrangement in cortical thymocytes. PNAS. 2012;109:20572–20577. doi: 10.1073/pnas.1213069109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Nielsen J, Hedeholm RB, Heinemeier J, Bushnell PG, Christiansen JS, Olsen J, Ramsey CB, Brill RW, Simon M, Steffensen KF, Steffensen JF. Eye lens radiocarbon reveals centuries of longevity in the Greenland shark (Somniosus microcephalus) Science. 2016;353:702–704. doi: 10.1126/science.aaf1703. [DOI] [PubMed] [Google Scholar]
  68. Odegard VH, Schatz DG. Targeting of somatic hypermutation. Nature Reviews Immunology. 2006;6:573–583. doi: 10.1038/nri1896. [DOI] [PubMed] [Google Scholar]
  69. Orjalo A, Johansson HE, Ruth JL. Stellaris™ fluorescence in situ hybridization (FISH) probes: a powerful tool for mRNA detection. Nature Methods. 2011;8:i–0. doi: 10.1038/nmeth.f.349. [DOI] [Google Scholar]
  70. Parra ZE, Mitchell K, Dalloul RA, Miller RD. A second TCRδ locus in Galliformes uses antibody-like V domains: insight into the evolution of TCRδ and TCRμ genes in tetrapods. The Journal of Immunology. 2012;188:3912–3919. doi: 10.4049/jimmunol.1103521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Parra ZE, Ohta Y, Criscitiello MF, Flajnik MF, Miller RD. The dynamic TCRδ: TCRδ chains in the amphibian Xenopus tropicalis utilize antibody-like V genes. European Journal of Immunology. 2010;40:2319–2329. doi: 10.1002/eji.201040515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Pavri R, Nussenzweig MC. AID targeting in antibody diversity. Advances in Immunology. 2011;110:1–26. doi: 10.1016/B978-0-12-387663-8.00005-3. [DOI] [PubMed] [Google Scholar]
  73. Perera J, Huang H. The development and function of thymic B cells. Cellular and Molecular Life Sciences. 2015;72:2657–2663. doi: 10.1007/s00018-015-1895-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Perera J, Meng L, Meng F, Huang H. Autoreactive thymic B cells are efficient antigen-presenting cells of cognate self-antigens for T cell negative selection. PNAS. 2013;110:17011–17016. doi: 10.1073/pnas.1313001110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Petrie HT, Livak F, Schatz DG, Strasser A, Crispe IN, Shortman K. Multiple rearrangements in T cell receptor alpha chain genes maximize the production of useful thymocytes. Journal of Experimental Medicine. 1993;178:615–622. doi: 10.1084/jem.178.2.615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Qin H, Suzuki K, Nakata M, Chikuma S, Izumi N, Huong leT, Maruya M, Fagarasan S, Busslinger M, Honjo T, Nagaoka H. Activation-induced cytidine deaminase expression in CD4+ T cells is associated with a unique IL-10-producing subset that increases with age. PLoS ONE. 2011;6:e29141. doi: 10.1371/journal.pone.0029141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Raj A, Tyagi S. Chapter 17 - Detection of Individual Endogenous RNA Transcripts In Situ Using Multiple Singly Labeled Probes. In: WALTER N. G, editor. Methods in Enzymology. Academic Press; 2010. pp. 365–386. [DOI] [PubMed] [Google Scholar]
  78. Rast JP, Anderson MK, Strong SJ, Luer C, Litman RT, Litman GW. α, β, γ, and δ T Cell Antigen Receptor Genes Arose Early in Vertebrate Phylogeny. Immunity. 1997;6:1–11. doi: 10.1016/S1074-7613(00)80237-X. [DOI] [PubMed] [Google Scholar]
  79. Reinherz EL, Tan K, Tang L, Kern P, Liu J, Xiong Y, Hussey RE, Smolyar A, Hare B, Zhang R, Joachimiak A, Chang HC, Wagner G, Wang J. The crystal structure of a T cell receptor in complex with peptide and MHC class II. Science. 1999;286:1913–1921. doi: 10.1126/science.286.5446.1913. [DOI] [PubMed] [Google Scholar]
  80. Reynaud CA, Anquez V, Grimal H, Weill JC. A hyperconversion mechanism generates the chicken light chain preimmune repertoire. Cell. 1987;48:379–388. doi: 10.1016/0092-8674(87)90189-9. [DOI] [PubMed] [Google Scholar]
  81. Reynaud CA, Garcia C, Hein WR, Weill JC. Hypermutation generating the sheep immunoglobulin repertoire is an antigen-independent process. Cell. 1995;80:115–125. doi: 10.1016/0092-8674(95)90456-5. [DOI] [PubMed] [Google Scholar]
  82. Reynaud CA, Mackay CR, Müller RG, Weill JC. Somatic generation of diversity in a mammalian primary lymphoid organ: the sheep ileal Peyer's patches. Cell. 1991;64:995–1005. doi: 10.1016/0092-8674(91)90323-Q. [DOI] [PubMed] [Google Scholar]
  83. Reynolds JD, Morris B. The evolution and involution of Peyer's patches in fetal and postnatal sheep. European Journal of Immunology. 1983;13:627–635. doi: 10.1002/eji.1830130805. [DOI] [PubMed] [Google Scholar]
  84. Rogozin IB, Diaz M. Cutting edge: DGYW/WRCH is a better predictor of mutability at G:C bases in Ig hypermutation than the widely accepted RGYW/WRCY motif and probably reflects a two-step activation-induced cytidine deaminase-triggered process. The Journal of Immunology. 2004;172:3382–3384. doi: 10.4049/jimmunol.172.6.3382. [DOI] [PubMed] [Google Scholar]
  85. Rogozin IB, Iyer LM, Liang L, Glazko GV, Liston VG, Pavlov YI, Aravind L, Pancer Z. Evolution and diversification of lamprey antigen receptors: evidence for involvement of an AID-APOBEC family cytosine deaminase. Nature Immunology. 2007;8:647–656. doi: 10.1038/ni1463. [DOI] [PubMed] [Google Scholar]
  86. Rogozin IB, Pavlov YI, Bebenek K, Matsuda T, Kunkel TA. Somatic mutation hotspots correlate with DNA polymerase eta error spectrum. Nature Immunology. 2001;2:530–536. doi: 10.1038/88732. [DOI] [PubMed] [Google Scholar]
  87. Rumfelt LL, Avila D, Diaz M, Bartl S, McKinney EC, Flajnik MF. A shark antibody heavy chain encoded by a nonsomatically rearranged VDJ is preferentially expressed in early development and is convergent with mammalian IgG. PNAS. 2001;98:1775–1780. doi: 10.1073/pnas.98.4.1775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Rumfelt LL, Lohr RL, Dooley H, Flajnik MF. Diversity and repertoire of IgW and IgM VH families in the newborn nurse shark. BMC Immunology. 2004;5:8–15. doi: 10.1186/1471-2172-5-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Rumfelt LL, McKinney EC, Taylor E, Flajnik MF. The development of primary and secondary lymphoid tissues in the nurse shark Ginglymostoma cirratum: B-cell zones precede dendritic cell immigration and T-cell zone formation during ontogeny of the spleen. Scandinavian Journal of Immunology. 2002;56:130–148. doi: 10.1046/j.1365-3083.2002.01116.x. [DOI] [PubMed] [Google Scholar]
  90. Saini J, Hershberg U. B cell variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions. Molecular Immunology. 2015;65:157–167. doi: 10.1016/j.molimm.2015.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Schneider CA, Rasband WS, Eliceiri KW. NIH Image to ImageJ: 25 years of image analysis. Nature Methods. 2012;9:671–675. doi: 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Thompson CB, Neiman PE. Somatic diversification of the chicken immunoglobulin light chain gene is limited to the rearranged variable gene segment. Cell. 1987;48:369–378. doi: 10.1016/0092-8674(87)90188-7. [DOI] [PubMed] [Google Scholar]
  93. Vaccarelli G, Antonacci R, Tasco G, Yang F, Giordano L, El Ashmaoui HM, Hassanane MS, Massari S, Casadio R, Ciccarese S. Generation of diversity by somatic mutation in the Camelus dromedarius T-cell receptor gamma variable domains. European Journal of Immunology. 2012;42:3416–3428. doi: 10.1002/eji.201142176. [DOI] [PubMed] [Google Scholar]
  94. Wagner SD, Milstein C, Neuberger MS. Codon bias targets mutation. Nature. 1995;376:732. doi: 10.1038/376732a0. [DOI] [PubMed] [Google Scholar]
  95. Wang X, Miller RD. Recombination, transcription, and diversity of a partially germline-joined VH in a mammal. Immunogenetics. 2012;64:713–717. doi: 10.1007/s00251-012-0627-z. [DOI] [PubMed] [Google Scholar]
  96. Wang X, Parra ZE, Miller RD. Platypus TCRμ provides insight into the origins and evolution of a uniquely mammalian TCR locus. The Journal of Immunology. 2011;187:5246–5254. doi: 10.4049/jimmunol.1101113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Wei L, Chahwan R, Wang S, Wang X, Pham PT, Goodman MF, Bergman A, Scharff MD, MacCarthy T. Overlapping hotspots in CDRs are critical sites for V region diversification. PNAS. 2015;112:E728–E737. doi: 10.1073/pnas.1500788112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Zheng B, Xue W, Kelsoe G. Locus-specific somatic hypermutation in germinal centre T cells. Nature. 1994;372:556–559. doi: 10.1038/372556a0. [DOI] [PubMed] [Google Scholar]
  99. Zhu C, Lee V, Finn A, Senger K, Zarrin AA, Du Pasquier L, Hsu E. Origin of Immunoglobulin Isotype Switching. Current Biology. 2012;22:872–880. doi: 10.1016/j.cub.2012.03.060. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: David G Schatz1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "Somatic hypermutation of T cell receptor α chain contributes to thymic positive selection in nurse sharks thymus" for consideration by eLife. Your article has been favorably evaluated by Michel Nussenzweig (Senior Editor) and three reviewers, one of whom, David Schatz (Reviewer #1), served as a Guest Reviewing Editor. The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

The authors demonstrate and characterize somatic hypermutation (SHM) in TcRα, γ, and δ chains in the nurse shark. TcRα hypermutation is described for the first time in any TcRαβ system, although it was previously observed in sandbar shark TcRγδ chains. No mutation was observed for nurse shark TcRβ. SHM had been predicted to be disallowed in TcRαβ systems as it was never observed in mammals and has the potential to disrupt TcR-MHC I/II interactions. The findings here challenge that view and suggest that at least in shark, SHM of TcRα in the thymus contributes to repertoire diversification and potentially to α-β T cell positive selection. Consistent with the findings from TcR chain sequencing, the authors provide evidence in favor of AID expression in the thymic cortex and corticomedullary junction. The authors are experts in the field of comparative VDJ immunity and have published widely on this topic. The presentation of the findings and the discussion of their implications are clear and interesting. The finding highlights an important and unexpected aspect of TcR biology with implications for TcRαβ function in jawed vertebrates. This study should be of significant general interest to immunologists and evolutionary biologists.

Essential revisions:

1) The major challenge facing the authors is that they do not have the germline sequence of most nurse shark Vαα gene segments, and hence they have to infer a germline sequence from the cDNA sequences they obtain. To do this, the authors assume that if the same CDR3 sequence arises more than once, it must derive from the same original V-to-J rearrangement, and hence that sequence differences between two such clones must represent somatic mutations. Unfortunately, while this argument is strong for IgH, TcRδ, and TcRβ sequences that include D segments, it is potentially weak for TcRα rearrangements where a single joint is formed and junctional diversity can be low. To have confidence that duplicate CDR3 sequences represent unique rearrangements, one would want to examine the extent of nucleotide deletion and addition; only in joints with significant addition/deletion would one be confident that the junction is likely to be unique. The problem is compounded by the fact that VDJ recombination has a tendency to use microhomologies (if they exist) at the ends of the two gene segments to mediate joining, and this can lead to the same junction repeatedly. Was the same CDR3 sequence ever observed in different sharks? The authors need to justify their underlying assumption and provide an explicit discussion of the specificity of CDR3 sequence for rearrangements in each chain.

2) In addition, the data contain a significant puzzle: a very high level of shared CDR3 among a relatively small number of unique sequences. Specifically, 72 unique amino acid sequences, representing 12 V-α families, were obtained, which would correspond to an average of 6 unique sequences per V family, and an average of only 3 unique sequences per V family per shark. Given that the starting 5 microgram aliquots of RNA should derive from hundreds of thousands of cells, each starting RNA sample should contain a large number of independent V-J rearrangements for any given V. Given this and that there appear to be multiple Jα gene segments in the nurse shark, the chance of finding the same CDR3 sequence in a handful of sequences for a given V would be predicted to be very low. The chance of finding 42 sequences out of the 72 (more than half!) that share CDR3 with another sequence seems exceedingly unlikely. The authors need to carefully address this issue. The authors should provide a table that lists the number of unique aa sequences obtained for each V family within the 72 sequences, and of those, the number that have an identical CDR3.

The authors did in fact derive the germline sequence of two V family members (V4 and V9). Why not focus the analysis on these two families, where such ambiguities would be less of an issue? Furthermore, one of the reported set of sequences does arise from the V4 family, but it does not appear that the germline sequence information was taken advantage of. Why? And, were other sequences from V4 and V9 obtained in the dataset, and did they show mutations compared to the germline sequences?

It is noted that all reviewers agreed that the nature of the sequence differences observed (focus on hotspots, abundance of tandem mutations, etc.) are consistent with the mutations being real SHM events.

3) Expression of Aicda is not particularly clear in the ISH analysis. The data show a generally darker appearance with the antisense compared to the sense probe, but these differences are not completely convincing. Furthermore, it was difficult to distinguish the cortex vs. the medulla and the appearance is so uniform overall that claims for region specificity for Aicda are unconvincing. More annotation highlighting thymic structure and the probes used would be helpful. Scale bars should be included instead of objective magnification. Overall, it was felt that the Aicda ISH data need to be improved. Possibly other images or higher magnification images would help.

4) There were some concerns about the Aicda RT-PCR data. The results are expressed in delta Ct, which is a logarithmic measure, and it would be better to calculate a fold-difference in the expression of AID in spleen and thymus. How much of a difference in expression is there between the negative control (forebrain) and thymus? Was the resulting PCR product cloned and sequenced to confirm its identity? Are there B cells in the nurse shark thymus as there are in mouse and perhaps other cartilaginous fish such as the skate? If so, are the authors sure that AID expression is in T cells in the ISH and RT-PCR analyses? The authors should consider discussing this in the manuscript.

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "Somatic hypermutation of T cell receptor α chain contributes to positive selection in nurse shark thymus" for further consideration at eLife. Your article has been favorably evaluated by Michel Nussenzweig (Senior Editor) and three reviewers, one of whom served as Guest Reviewing Editor.

The manuscript has been improved but there are some remaining issues that need to be addressed before acceptance, as outlined below:

The authors have added new RNA fluorescence in situ hybridization data that strengthen the claim that AID is expressed in the shark thymus, and have adequately addressed some of the points previously raised. There remain a number of significant issues that need to be addressed satisfactorily before publication can be considered.

Major Points:

1) Based on the information provided by the authors in their response to the review, it is now clear that the analysis of mutations in TcRβ, γ, and δ (no CDR3 sequence obtained) was less rigorous and less definitive than in TcRαα (and, the TcRβ analysis is weaker than the rest because of the small number of sequences obtained (56)). Hence, all claims that mutations were not found in TcRβ must either be removed or carefully qualified. For example, the second sentence of the Abstract contrasts the findings at TcRβ (no mutation) and TcRα (mutation found) with no qualification, leaving the reader with the mistaken impression that the two analyses were equivalent and the two conclusions equally strong. The same problem was found in the last paragraph of the Introduction, subsection “Somatic hypermutation in TcR γV and TcR δV but not TcR βV”, and in the third paragraph of the Discussion.

2) "Since it is highly improbable that two thymocytes would generate identical CDR3 sequences during VJ recombination…" As was pointed out in the original review, this claim is not obvious; it should not be stated in this fashion. Indeed, there are a priori reasons to think that the same CDR3 sequence could readily arise from independent V-J rearrangements. The authors should present this claim as a hypothesis that they then carefully validate with data (which they begin to do in the first paragraph of the subsection “Somatic hypermutation in nurse shark TcR αV”). Furthermore, the conclusion of this sentence that "amplicons containing identical CDR3 sequences must have derived from the same progenitor" is overstated. At best, the conclusion could be that they are "likely to have derived from the same progenitor". A statistical analysis of the V-J sequences might allow the authors to make a quantitative claim of just how likely this is, but in the absence of such an analysis, the claims should be stated cautiously. The manuscript needs to be rewritten to accommodate these points. Same issue in the aforementioned paragraph.

3) Closely related to this: The authors have clarified that a larger number of sequences were obtained than was apparent in the initial submission, but this information is not provided in the Results section. Since most readers will not look at the Materials and methods section, it is suggested that the authors add information to the second paragraph of the subsection “Somatic hypermutation in nurse shark TcR αV” to inform readers of the total number of underlying sequences from the two relevant sharks. In addition, the Materials and methods states that sequences were obtained from four sharks; it is confusing that data from only two of the sharks was actually used. Why are all 4 sharks mentioned if data from only two of them were used? How many total sequences were obtained from the two relevant sharks? This is highly relevant given that these are (apparently) the only data from which the analyzed TcRα sequences are derived. This is directly relevant to the concern, expressed in the original review, that the identification of so many repeated CDR3 regions from a small sequence dataset is extremely unlikely. This issue has not been satisfactorily resolved in the resubmission. The requested table that shows how many unique aa sequences were obtained for each V family, and of those, the number that have an identical CDR3, was not provided. Nor have the authors provided a detailed description of the proportion of TcRα sequences that have N/P nucleotides (which would provide higher confidence of assigning SHM) versus those that are directly joined (less confidence). The reviewers remain concerned by the possibility that the number of repeat CDR3 sequences is exceedingly unlikely given the number of sequences obtained. The authors should carefully address this concern.

4) The new RNA fluorescence FISH analysis is a significant improvement over the ISH analysis, but despite the claimed specificity of the method used, experiments to document specificity are needed. Ideally, a tissue not expressing AID would be analyzed (or do the authors feel that the negative portions of the thymus provide sufficient specificity controls?). Given that the authors now present a detailed model for AID expression/SHM in shark thymus (Figure 10) based on the RNA fluorescence FISH data, it is particularly important to address specificity.

5) The claim made in the last paragraph of the subsection “Mutations, in situ hybridization, and AID expression in thymus “that TcRα expression is more diffuse than that of AID is unconvincing. TcRαA and AID expression appear to coincide in all areas visible in Figure 9. The authors should not push their interpretation just because it fits with the proposed role of AID in selection. Similarly, the organization of AID in rings around TcRα+ regions is intriguing and while the authors' interpretation in terms of relative timing of expression is plausible, it is quite speculative and should be clearly stated as such.

6) The in situ hybridization in Figure 8B (panels 3,4) for AID is still hard to interpret as the signal is weak and diffuse. The apparent expression pattern suggested by these images does not seem to correspond to the localized patterns seen in the fluorescent in situs in Figure 9 and Figure 9—figure supplement 1. Why is this? Is this a matter of scale? It is a concern that one set of data seems to contradict the other.

eLife. 2018 Apr 17;7:e28477. doi: 10.7554/eLife.28477.033

Author response


Essential revisions:

1) The major challenge facing the authors is that they do not have the germline sequence of most nurse shark Vα gene segments, and hence they have to infer a germline sequence from the cDNA sequences they obtain. To do this, the authors assume that if the same CDR3 sequence arises more than once, it must derive from the same original V-to-J rearrangement, and hence that sequence differences between two such clones must represent somatic mutations. Unfortunately, while this argument is strong for IgH, TcRδ, and TcRβ sequences that include D segments, it is potentially weak for TcRα rearrangements where a single joint is formed and junctional diversity can be low. To have confidence that duplicate CDR3 sequences represent unique rearrangements, one would want to examine the extent of nucleotide deletion and addition; only in joints with significant addition/deletion would one be confident that the junction is likely to be unique. The problem is compounded by the fact that VDJ recombination has a tendency to use microhomologies (if they exist) at the ends of the two gene segments to mediate joining, and this can lead to the same junction repeatedly. Was the same CDR3 sequence ever observed in different sharks? The authors need to justify their underlying assumption and provide an explicit discussion of the specificity of CDR3 sequence for rearrangements in each chain.

In our shark sequences, we found substantial diversity within CDR3 regions. We present data to illustrate CDR3 junctional diversity and include a new figure (Figure 4) depicting this diversity. (We observed similar results in other α V segments as well.) We never observed the same CDR3 sequence in more than one shark.

We included a paragraph under the “Somatic hypermutation in nurse shark TcR αV” section of the results to discuss these data: “Despite the absence of diversifying segments, α CDR3s exhibited substantial variation within our shark sequences. For example, TcRα V1 sequences using the same V and J segments had CDR3 lengths that differed by as many as 6 amino acids (12 nucleotides), and few CDR3s shared more than one amino acid within this V-J join (Figure 4). Further, we never observed the same CDR3 sequence in more than one shark, suggesting both exonuclease activity and addition of N and P nucleotides help diversify α CDR3s in nurse shark.”

2) In addition, the data contain a significant puzzle: a very high level of shared CDR3 among a relatively small number of unique sequences. Specifically, 72 unique amino acid sequences, representing 12 V-α families, were obtained, which would correspond to an average of 6 unique sequences per V family, and an average of only 3 unique sequences per V family per shark. Given that the starting 5 microgram aliquots of RNA should derive from hundreds of thousands of cells, each starting RNA sample should contain a large number of independent V-J rearrangements for any given V. Given this and that there appear to be multiple Jα gene segments in the nurse shark, the chance of finding the same CDR3 sequence in a handful of sequences for a given V would be predicted to be very low. The chance of finding 42 sequences out of the 72 (more than half!) that share CDR3 with another sequence seems exceedingly unlikely. The authors need to carefully address this issue. The authors should provide a table that lists the number of unique aa sequences obtained for each V family within the 72 sequences, and of those, the number that have an identical CDR3.

As written, the paper does suggest an incredible coincidence of shared CDR3s among our sequences. However, we actually acquired significantly more sequences than the 72 we describe in the paper. We have revised this section to include a discussion of the total number of sequences (564) from which we found those with shared CDR3s.

Within the Materials and methods section titled “Sequence Alignment and Tree Building,” we added: “Our preliminary dataset contained 564 TcRα clones (encoding 286 unique amino acid sequences representing nine Vα families) from three tissues (PBL, spleen, thymus) of four sharks (Joanie, Mary Junior, Yellow, and 1299). Using this dataset, we separated sequences containing identical CDR3 rearrangements and counted mutations within each TcR “clone family” bearing the V-J rearrangement from single founder thymocytes.”

The authors did in fact derive the germline sequence of two V family members (V4 and V9). Why not focus the analysis on these two families, where such ambiguities would be less of an issue? Furthermore, one of the reported set of sequences does arise from the V4 family, but it does not appear that the germline sequence information was taken advantage of. Why? And, were other sequences from V4 and V9 obtained in the dataset, and did they show mutations compared to the germline sequences?

We added two figures and a couple of paragraphs in the Materials and methods section “Identification of TcR Genes in the Nurse Shark Genome” explaining our reasons for not using germline alignments for SHM analysis:

“Our search yielded 17 α/δ V germline segments, significantly fewer than expected based on TcR αV segment numbers in other species (Murphy and Weaver, 2017). […] Thus, we chose not to rely on these data for mutational analysis.”

It is noted that all reviewers agreed that the nature of the sequence differences observed (focus on hotspots, abundance of tandem mutations, etc.) are consistent with the mutations being real SHM events.

3) Expression of Aicda is not particularly clear in the ISH analysis. The data show a generally darker appearance with the antisense compared to the sense probe, but these differences are not completely convincing. Furthermore, it was difficult to distinguish the cortex vs. the medulla and the appearance is so uniform overall that claims for region specificity for Aicda are unconvincing. More annotation highlighting thymic structure and the probes used would be helpful. Scale bars should be included instead of objective magnification. Overall, it was felt that the Aicda ISH data need to be improved. Possibly other images or higher magnification images would help.

We present new fluorescence in situ hybridization (FISH) data to illustrate more clearly the thymic architecture and regional specificity of AID expression within the shark thymus. The development of this technique delayed our resubmission. In a new figure (Figure 9), we provide a composite of seven Z-stacked images (10x) depicting the localization of AID expression to the cortico-medullary junction and medullary regions of shark thymus. We also present individual fluorophore images of DAPI, TcRα, and AID and the Z-stacked merged images (10x, 20x, and 63x) to illustrate the pattern of AID and TcRα expression within four locations of shark thymus. To ensure our probes were hybridizing independently of one another, we probed AID and TRA individually on adjacent slides and looked for co-localization of the probes (see Figure 9—figure supplement 1).

We also included a paragraph in the Results section describing these data: “We confirmed our colorimetric ISH results with RNA fluorescence in situ hybridization (FISH) using probes against the TcR α constant region (αC) and exons 2 and 3 of AID. […] These daughter T cells then express AID, promoting somatic hypermutation within their TcR α sequences during a time when cells also undergo positive and negative selection.”

4) There were some concerns about the Aicda RT-PCR data. The results are expressed in delta Ct, which is a logarithmic measure, and it would be better to calculate a fold-difference in the expression of AID in spleen and thymus. How much of a difference in expression is there between the negative control (forebrain) and thymus?

We present expression-fold changes in the different tissues (see Figure 8A). We also included the expression-fold difference between forebrain and thymus: “We confirmed the expression of AID in the thymus through real-time RT-qPCR, where thymus tissue expressed AID at roughly half (0.7x) the levels found in spleen (where B cell SHM is known to occur) and nearly 6x the levels observed in forebrain (negative control; Figure 8A).”

Was the resulting PCR product cloned and sequenced to confirm its identity?

Yes. We included a sentence in the qPCR Materials and methods section to inform readers: “We then cloned and sequenced the resulting PCR products to confirm the sequence was AID.”

Are there B cells in the nurse shark thymus as there are in mouse and perhaps other cartilaginous fish such as the skate? If so, are the authors sure that AID expression is in T cells in the ISH and RT-PCR analyses? The authors should consider discussing this in the manuscript.

We addressed thymic B cells in the Discussion: “Although recent studies report B cells present in the thymus of mice (Perera and Huang, 2015, Perera et al., 2013), Miracle et al. (2001) examined B cell expression levels (IgM and IgX) in various tissues (including thymus) at different ages in the clearnose skate. […] Further, preliminary results using FISH probes to IgM constant region also indicate that adult nurse sharks do not express IgM in the thymus."

These preliminary data (both RT-qPCR and FISH) give us confidence that the phenomenon we are seeing is not due to the presence of B cells in the thymus. However, these data are part of a larger study with a different aim, and we prefer not to report the data in this paper. However, we provide them here for you to assess their potential importance. The bar graph (Author response image 1) illustrates RT-qPCR expression of various immune cell markers in thymus tissue from sharks of varying ages. The red bars indicate IgM/IgW expression, which is clearly absent in the older sharks (red box).

Author response image 1.

Author response image 1.

Using FISH (Author response image 2), we found many IgM transcripts expressed in spleen (right) but few expressed in thymus (right). With the lack of IgM expression indicated by RT-qPCR, this may suggest that cells transcribe the IgM message when the DNA is open and the cell also is transcribing TcR genes, but translation of the IgM message does not occur. We need more evidence to make further conclusions.

Author response image 2.

Author response image 2.

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Major Points:

1) Based on the information provided by the authors in their response to the review, it is now clear that the analysis of mutations in TcRβ, γ, and δ (no CDR3 sequence obtained) was less rigorous and less definitive than in TcRα (and, the TcRβ analysis is weaker than the rest because of the small number of sequences obtained (56)). Hence, all claims that mutations were not found in TcRβ must either be removed or carefully qualified. For example, the second sentence of the Abstract contrasts the findings at TcRβ (no mutation) and TcRα (mutation found) with no qualification, leaving the reader with the mistaken impression that the two analyses were equivalent and the two conclusions equally strong. The same problem was found in the last paragraph of the Introduction, subsection “Somatic hypermutation in TcR γV and TcR δV but not TcR βV”, and in the third paragraph of the Discussion.

Agreed. We are in the process of repeating the mutation analysis of the other chains to the same rigor as α. Collecting enough CDR3-sharing sequences is taking some time. However, for this manuscript, we removed language that implies the analyses or conclusions are equivalent. In the above order, these changes are as follows:

“Remarkably, we found SHM acting in the thymus on α chain of shark T cell receptors (TcR).”

“Our data suggest that SHM of TcRα is involved in primary T cell repertoire diversification and the enhancement of positive selection in the thymic cortex.”

“Somatic hypermutation in TcR γV and TcR δV”.

“Remarkably we also detected SHM occurring in the α chain of αβ T cells.”

2) "Since it is highly improbable that two thymocytes would generate identical CDR3 sequences during VJ recombination…" As was pointed out in the original review, this claim is not obvious; it should not be stated in this fashion. Indeed, there are a priori reasons to think that the same CDR3 sequence could readily arise from independent V-J rearrangements. The authors should present this claim as a hypothesis that they then carefully validate with data (which they begin to do in the first paragraph of the subsection “Somatic hypermutation in nurse shark TcR αV”). Furthermore, the conclusion of this sentence that "amplicons containing identical CDR3 sequences must have derived from the same progenitor" is overstated. At best, the conclusion could be that they are "likely to have derived from the same progenitor". A statistical analysis of the V-J sequences might allow the authors to make a quantitative claim of just how likely this is, but in the absence of such an analysis, the claims should be stated cautiously. The manuscript needs to be rewritten to accommodate these points. Same issue in the aforementioned paragraph.

We modified this section to be less speculative. “Since it is highly unlikely that two thymocytes would generate identical CDR3 sequences during VJ recombination, we predict that amplicons containing identical CDR3 sequences derived from the same progenitor and thus must contain the same germline V and J segments. […] Therefore, even in the absence of an assembled locus, we were able to evaluate mutation to germline αV segments by considering changes within only those thymocyte clones containing identical CDR3s.”

3) Closely related to this: The authors have clarified that a larger number of sequences were obtained than was apparent in the initial submission, but this information is not provided in the Results section. Since most readers will not look at the Materials and methods section, it is suggested that the authors add information to the second paragraph of the subsection “Somatic hypermutation in nurse shark TcR αV” to inform readers of the total number of underlying sequences from the two relevant sharks. In addition, the Materials and methods states that sequences were obtained from four sharks; it is confusing that data from only two of the sharks was actually used. Why are all 4 sharks mentioned if data from only two of them were used? How many total sequences were obtained from the two relevant sharks? This is highly relevant given that these are (apparently) the only data from which the analyzed TcRα sequences are derived. This is directly relevant to the concern, expressed in the original review, that the identification of so many repeated CDR3 regions from a small sequence dataset is extremely unlikely. This issue has not been satisfactorily resolved in the resubmission. The requested table that shows how many unique aa sequences were obtained for each V family, and of those, the number that have an identical CDR3, was not provided. Nor have the authors provided a detailed description of the proportion of TcRα sequences that have N/P nucleotides (which would provide higher confidence of assigning SHM) versus those that are directly joined (less confidence). The reviewers remain concerned by the possibility that the number of repeat CDR3 sequences is exceedingly unlikely given the number of sequences obtained. The authors should carefully address this concern.

We apologize for not including the requested table. We misunderstood the initial request. We include this table now (Table 1) and discuss the table in the Results section titled “Somatic hypermutation in nurse shark TcR αV”:

“Our preliminary V α dataset contained 539 TcRα clones (encoding 286 unique amino acid sequences representing nine V α families) from three tissues (PBL, spleen, thymus) of two sharks (Joanie, Mary Junior). […] We include these 45 sequences in our TcR Vα dataset (see Table 1 – source data for sequence data).”

During the construction of this new table, we realized we had omitted sequences in our initial mutation analysis. Further, we decided to divide our aV4 group so that we estimated mutation even more conservatively. Thus, we reanalyzed our entire mutation data set. While the overall conclusions of this study did not change, some of the results did change. Most notably, we found that A:T mutations did not prefer WA/TW hotspots. This is consistent with data reported in sandbar sharks for TcR γ (Chen et al., 2012). However, our data suggest a different A:T motif may be targeted or that sharks employ a different mechanism than DNA pol eta use during mismatch repair to alter A:T nucleotides.

“A:T nucleotides did not appear to prefer WA/TW hotspots (p=0.2248), though they were 2.3x as likely to mutate within hotspots than outside hotspots. […] This result suggests that an alternate A:T motif is targeted or that sharks employ another mechanism altogether to alter A:T nucleotides.”

Further, we no longer found a significant difference in tandemly mutated bases between CDRs and FRs, which is consistent with results observed for TcR-γ in sandbar sharks. “Finally, though we found more tandemly mutated bases in CDRs (41 of 81, or 50.6% of all CDR mutations) than in FRs (73 of 192, or 38.0% of all FR mutations), this difference was not significant; p=0.721; Table 2).”

We also added a section at the beginning of the Materials and methods to clarify the confusion over sharks. We used sequences from only two sharks in our mutation analyses. However, we used published data from two different sharks to aid in loci identification to ensure consistency. This paragraph reads as follows: “Study Animals: TCR sequence data used in this study came from two adult female nurse sharks (“Joanie” and “Mary Junior”; Ginglymostoma cirratum) delivered by caesarian section off the Florida Keys and matured in the aquatic vivarium of the University of Maryland’s Center of Marine Biotechnology. We used published T cell αV sequences (Criscitiello et al., 2010) as a baseline for αV locus numbering (sharks Yellow and 1299), though we did not analyze any of these sequences for mutation.”

As for the presence of N/P nucleotides, please see our response to #2, above.

4) The new RNA fluorescence FISH analysis is a significant improvement over the ISH analysis, but despite the claimed specificity of the method used, experiments to document specificity are needed. Ideally, a tissue not expressing AID would be analyzed (or do the authors feel that the negative portions of the thymus provide sufficient specificity controls?). Given that the authors now present a detailed model for AID expression/SHM in shark thymus (Figure 10) based on the RNA fluorescence FISH data, it is particularly important to address specificity.

We analyzed a non-immune tissue (shark brain) as a negative FISH control for AID and TcRα fluorescence. We created a new supplementary figure with images of these slides (Figure 9—figure supplement 2). This figure depicts some background fluorescence regardless of probe application. However, we observed no AID or TcRα expression in brain tissue like the patterns we saw in thymus (immune tissue).

5) The claim made in the last paragraph of the subsection “Mutations, in situ hybridization, and AID expression in thymus “that TcRα expression is more diffuse than that of AID is unconvincing. TcRαA and AID expression appear to coincide in all areas visible in Figure 9. The authors should not push their interpretation just because it fits with the proposed role of AID in selection. Similarly, the organization of AID in rings around TcRα+ regions is intriguing and while the authors' interpretation in terms of relative timing of expression is plausible, it is quite speculative and should be clearly stated as such.

We modified the text to better reflect what our images depict (see also #7, below). “Our FISH results indicated a more specific TcR αC signal within the inner cortex and medulla adjacent to the CMJ (Figure 9; Figure 9—figure supplement 1), regions where, in mammals, cells actively rearrange their α chain and where mature αβ T cells are found. […] Thus, we observed a consistent pattern of expression where a “ring” of cells expressing both TcR αC and AID surround a central cell expressing only TcR αC.”

6) The in situ hybridization in Figure 8B (panels 3,4) for AID is still hard to interpret as the signal is weak and diffuse. The apparent expression pattern suggested by these images does not seem to correspond to the localized patterns seen in the fluorescent in situs in Figure 9 and Figure 9—figure supplement 1. Why is this? Is this a matter of scale? It is a concern that one set of data seems to contradict the other.

We auto-adjusted the colors on the CISH image in ImageJ to darken the signal expression [old image (L) and darkened image (R), Author response image 3]. We adjusted images in all eight panels uniformly, ensuring we did not alter expression differences between panels. This should make it easier to see the signal itself, with AID expression localized to the cortex and CMJ and minimal expression within the medulla.

Author response image 3.

Author response image 3.

As you point out, it is a much more diffuse signal. Because the Stellaris FISH method uses a mixture of short probes for binding the transcript, the pattern we see in the is much more specific – the image will only show those areas with many bound probes rather than the presence of a transcript in general. For example, the CISH image of TcR αC from Criscitiello et al.,2010 (Author response image 4) shows a much more diffuse pattern of expression than what we see with FISH as well.

Author response image 4.

Author response image 4.

We believe the expression patterns revealed with the two different methods are both correct, but they tell us different things. AID (and TcR αC) expression occurs throughout the cortex (and medulla for TcR αC; based on the diffuse signal from CISH), but the greatest expression of both messages occurs near the CMJ (based on the much more precise and higher intensity signal from FISH). We currently are working with another method that incorporates both a colorimetric and fluorescent signal to verify this explanation, but we have not been successful with that approach yet.

We modified the text to explain this difference in method.

CISH section: “Further, colorimetric in situ hybridization (CISH) of nurse shark thymus revealed a diffuse signal of TcR β (Figure 8B, panels 1,2) and AID (Figure 8B, panels 3,4) mRNA expression throughout the thymic cortex. However, AID expression was greatest in the central cortex and cortico-medullary junction (CMJ), while TcR β expression was highest in the outer cortex.”

FISH section: “We refined our CISH results with RNA fluorescence in situ hybridization (FISH) using probes against the TcR α constant region (αC) and exons 2 and 3 of AID. […] These daughter T cells then express AID (and TcR αC), promoting somatic hypermutation within their TcR α sequences during a time when cells also undergo positive and negative selection.”

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 4—source data 1. CDR3 regions diversified by exonuclease activity and addition of N and P nucleotides.

    Alignment of nucleotides belonging to the join between variable (V) and joining (J) segments within TcRα thymocyte clones. We determined the putative ends of each V segment and putative beginning of each J segment by comparing alignments between different sharks, assuming that identical nucleotides between sharks were germline. The last number of each sequence name indicates the number of clones containing that nucleotide sequence between the V and J segments.

    DOI: 10.7554/eLife.28477.006
    Table 1–source data 1. Tab-delimited text file containing sequences used in this paper for analysis of somatic hypermutation of TcR alpha chain.
    DOI: 10.7554/eLife.28477.008
    Figure 6—source data 1. Frequency of mutation in TcR alpha V framework and complementarity determining regions for all mutation, nonsynonymous (NSYN) mutation only, or synonymous (SYN) mutation only.
    DOI: 10.7554/eLife.28477.013
    Transparent reporting form
    DOI: 10.7554/eLife.28477.026

    Data Availability Statement

    T cell receptor sequences have been deposited in Genbank of NCBI. Alpha KY189332-KY189354 and KY366469-KY355487; Beta KY351708-KY366487; Gamma KY351639-KY351707; Delta KY346705-KY346816.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES