Abstract
The fold of the murine Sox-5 (mSox-5) HMG box in free solution has been determined by multidimensional NMR using 15N-labeled protein and has been found to adopt the characteristic twisted L-shape made up of two wings: the major wing comprising helix 1 (F10–F25) and helix 2 (N32–A43), the minor wing comprising helix 3 (P51–Y67) in weak antiparallel association with the N-terminal extended segment. 15N relaxation measurements show considerable mobility (reduced order parameter, S2) in the minor wing that increases toward the amino and carboxy termini of the chain. The mobility of residues C-terminal to Q62 is significantly greater than the equivalent residues of non-sequence-specific boxes, and these residues show a weaker association with the extended N-terminal segment than in non-sequence boxes. Comparison with previously determined structures of HMG boxes both in free solution and complexed with DNA shows close similarity in the packing of the hydrophobic cores and the relative disposition of the three helices. Only in hSRY/DNA does the arrangement of aromatic sidechains differ significantly from that of mSox-5, and only in rHMG1 box 1 bound to cisplatinated DNA does helix 1 have no kink. Helix 3 in mSox-5 is terminated by P68, a conserved residue in DNA sequence-specific HMG boxes, which results in the chain turning through ∼90°.
Keywords: HMG box, LEF-1, TCF1α SRY, DNA bending
In mammalian embryogenesis the male determining factor is the SRY protein encoded on the Y chromosome, which switches development from the default (female) pathway to male development (Sinclair et al. 1990). The characteristic feature of the SRY protein is the presence of a sequence-specific DNA-binding HMG box, and a number of other proteins have been described containing HMG boxes with sequences closely homologous to that in SRY (called Sox proteins, where Sox stands for SRY-related HMG box). Alternative splicing of the mouse Sox-5 (mSox-5) gene product results in a short form of the protein, found in the nuclei of postmeiotic spermatids (Denny et al. 1992; Connor et al. 1994) and a long form (L-mSox-5) expressed in chondrocytes (Lefebvre et al. 1998). Both forms of the protein contain the single HMG box. This HMG box recognizes the 6-bp sequence AACAAT (Denny et al. 1992) with an association constant Ka of 6.2 × 107 M−1 (Privalov et al. 1999). Several other Sox proteins have been studied in some detail: for example, defective human Sox9 has been shown to be responsible for the condition campomelic dysplasia and can be a cause of autosomal XY sex reversal (Wagner et al. 1994), whereas the mouse transcription factor Sox-4, which contains an HMG box closely related to that of mSox-5, plays an important role in early lymphoid and heart development (van de Wetering et al. 1993). Drosophila Sox70D (dichaete, fish-hook) encodes a protein required for embryonic segmentation (Nambu and Nambu 1996; Russell et al. 1996). Sequence-specific DNA-binding HMG boxes not classed as Sox proteins by virtue of somewhat divergent amino acid sequences have been recognized in other mammalian proteins, such as human and mouse LEF-1 (Giese et al. 1991; Waterman et al. 1991), a transcription factor that binds to the T-cell α-chain enhancer. It is clear that the sequence-specific HMG box is a DNA-binding domain found in many transcription factors playing key developmental roles (for review, see Wegner 1999).
A second class of HMG boxes that bind DNA are those that show no obvious DNA sequence-specific recognition but prefer to bind to preformed specific DNA structures such as the 4-way junction (Wright and Dixon 1988; Bianchi et al. 1989, 1992; Ferrari et al. 1992; Webb and Thomas 1999), DNA bulges (Payet et al. 1999), and cisplatin-modified DNA (Pil and Lippard 1992; Ohndorf et al. 1999). These non-sequence-specific HMG boxes frequently occur as multiple boxes and include those from the mammalian chromosomal proteins HMG1 and HMG2 (Johns 1982), Upstream Binding Factor UBF (Jantzen et al. 1990; Bazett-Jones et al. 1994), and mtTF1 (Parisi and Clayton 1991). The term ``architectural transcription factors'' has been applied to these HMG proteins (Groschedl et al. 1994; Wolffe 1994), to indicate their ability to manipulate the structure of the DNA to which they bind.
The structures of several HMG boxes have been determined for both the non-sequence-specific (Read et al. 1993; Weir et al. 1993; Jones et al. 1994; Hardman et al. 1995; Allain et al. 1999) and sequence-specific (van Houte et al. 1995) classes. Additionally, the structure of HMG boxes in complex with DNA is known, both non-sequence-specific (Allain et al. 1999; Murphy et al. 1999; Ohndorf et al. 1999) and sequence-specific (Love et al. 1995; Werner et al. 1995). In all cases the protein fold is observed to be a 3-helix bundle in the form of a somewhat asymmetric L-shape with helices 1 and 2 in one arm (the major wing) and most of helix 3 together with the most N-terminal residues in the other arm (the minor wing). The structures of the sequence-specific HMG box of hSRY bound to 8-bp DNA (Werner et al. 1995) and of mLEF-1 bound to 15-bp DNA (Love et al. 1995) showed that the L-shaped protein takes the minor groove of the DNA in a pincer-like grip and causes the DNA to bend away from the protein, toward the major groove, with considerable unwinding. DNA bending is brought about by a series of contacts on the inside of the L-shaped HMG box fold and in part by partial intercalation of a single hydrophobic sidechain between two adjacent adenines.
The mLEF-1/DNA complex additionally revealed that residues C-terminal to the minimal HMG box crossed over the major groove on the inside of the bend to make contact with the far end of the DNA (Love et al. 1995). This feature may explain the exceptionally large bend angle (130°) generated by mLEF-1 (Giese et al. 1992), shown to be dependent on the presence of a C-terminal extension (Lnenicek-Allen et al. 1996). In the model of the non-sequence-specific HMG box of NHP6A bound to DNA, a basic N-terminal extension of the folded domain wraps around the major groove (Allain et al. 1999).
Absent from our understanding of sequence-specific HMG box structures and their interactions with DNA is knowledge of the structure of the same HMG box both free in solution and bound to DNA, since the free-solution structures of neither hSRY nor mLEF-1 have been determined. This is a matter of particular interest since calorimetric measurements of the mSox-5 HMG box have shown that significant levels of protein refolding occur on association, in addition to the DNA bending (Privalov et al. 1999). The present study of mSox-5 represents the first part of a project to make a detailed comparison of the structure of a free and DNA-bound sequence-specific HMG box. We also take the opportunity to compare the structure of the mSox-5 HMG box with other HMG box structures both free and bound to DNA.
Results
NMR resonance assignments of the mSox-5 HMG box
Sequential assignment of the amino acid spin systems was achieved by a combination of 2d 1H–1H NOESY and 1H–1H HOHAHA NMR spectra obtained at pH 5.4 and 6.2 in 90% H2O and 100% 2H2O, plus 1H NOESY 15N–1H HMQC and 1H HOHAHA 15N–1H HMQC 3d spectra at pH 6.2 in 90% H2O. All spectra were recorded at 25°C. By comparing spectra, all the backbone connectivities were completed, and 82.5% of all protons were assigned. Owing to chemical shift overlap in the 2.8–3.3-ppm region of the proton spectrum it was not possible to assign the arginine CδH protons of residues 5, 19, 58, 75, and 78 nor the lysine CɛH protons of residues 4, 15, 35, 42, 49, 61, 66, 71, 73, and 77. The assignment of the backbone 15N resonances was complete except for N30 and S31, for both of which the amide proton is in fast exchange with the solvent. The sidechain 15N resonances of all the asparagine, glutamine, and tryptophan residues were completely assigned, plus the 15Nε resonances of residues R18, R19, and R40. The remaining sidechain nitrogen resonances could not be identified.
The NOE restraints
From the 2d and 3d NOESY spectra, a total of 1424 nonredundant distance restraints were established, of which 277 were intraresidue and 1147 were interresidue—452 sequential (i, i + 1), 145 (i, i + 2), 159 (i, i + 3), 70 (i, i + 4) and 321 long range (i, i > 4). Figure 1 ▶ shows the distribution of NOE contacts in the mSox-5 HMG box. The (i, i + 3) and (i, i + 4) contacts run parallel to the diagonal and indicate the three α-helical regions F10 to F25, N32 to A43, and P51 to Y67. The location of helices can also be seen from Figure 2B ▶, which documents the runs of dαN (i, i + 3) and dαN (i, i + 4) cross-peaks that define these regions as being α-helical. This assignment of helices is supported by the Chemical Shift Index of the CαH protons (Fig. 2C ▶). Together these represent 45 out of 81 residues (55%) in an α-helical conformation, in good agreement with CD data (Connor et al. 1994). The position and length of the three α helices in mSox-5 are similar to those observed in other sequence-specific HMG box structures (Love et al. 1995; Werner et al. 1995).
For mSox-5, no backbone-to-backbone NOE contacts could be observed between N-terminal residues 1–12 and helix 3. There were also fewer and weaker backbone-to-sidechain and sidechain-to-sidechain contacts between these two segments of mSox-5 than were observed for hamster HMG1 box 2 (Read et al. 1993). This indicates a weaker association between the N-terminal segment and helix 3 in the Sox-5 HMG box than in HMG1 box 2.
A plot of the number of NOE restraints against position in the sequence (Fig. 2A ▶) shows that the most ill-defined regions correspond to P1–H2, R5–M7, N30–S31, T45–L47, and P74–T79. Lack of definition in the R5–M7 region was partly caused by chemical shift overlap, which also prevented observation of sequential amide NOEs between residues Y70–K71 and Y72–K73. The Chemical Shift Index (Fig. 2C ▶) suggests that residues 1–8 and 73–79 are in an irregular conformation (CSI = 0).
The volume of peaks arising from each amide proton in the domain was measured in a 15N–1H HSQC spectrum. A plot of the relative peak volumes (Fig. 2D ▶) shows that these are smaller for residues I3–M7, H29–N32, T45–L47, and H63, in comparison with the remainder. This is probably because of fast solvent exchange of these amide protons and could account for the fewer NOEs observed in these regions. Measurement of NH proton cross-peak volumes at the NH and water chemical shift positions in 3D NOESY spectra confirmed this rapid exchange (see Fig. 2E,F ▶). Helix 3 (P51–Y67) was found to contain slowly exchanging amide protons up to residue L59, beyond which more rapid exchange occurs.
3 JHNα coupling data
Coupling constants were measured at two separate temperatures (12° and 25°C, Fig. 3A,B ▶). A series of values between 2.5 and 5.0 Hz was observed in the helical regions, as expected. However, in helix 1, 3JHNα values of 6.5 Hz (at 12°C) and 7.5 Hz (at 25°C) were obtained for residue D16, which suggests a break or kink in the helix at this position. In helix 3 it is notable that the 3JHNα values steadily increase from 3.0 to 7.0 Hz between residues Q56 and Y67, suggesting a gradual loosening or stretching toward the C-terminal end of the helix. Regions with 3JHNα values generally greater than 6.0 Hz at 25°C are P1–N8, A24–N32, A43–K49, and K66–T79. A lower temperature of 12°C did not significantly affect these values in regions A43–K49 and Y67–T79. However, 3JHNα coupling constants for the C-terminal end of helix 3 (S60–K66) and for the N-terminal residues K4, R5, and M7 are reduced at 12°C by 0.5 to 2.0 Hz (Fig. 3C ▶). This reduction represents an increase in structural order in this region as the temperature is lowered. Increased structural order at lower temperatures has previously been observed for the mSox-5 HMG box using CD and DSC (Crane-Robinson et al. 1998).
Relaxation measurements
Figure 4A–C ▶ shows the experimentally determined 15N longitudinal (T1) and transverse (T2) relaxation times, with the heteronuclear {1H}–15N NOE values, obtained at 25°C, for 64 out of a possible 72 backbone amides. The combination of 15N T1 and T2 relaxation times, with the {1H}–15N NOE values were then analyzed within a framework of isotropic rotational diffusion, using the Lipari–Szabo model-free formalism (Lipari and Szabo 1982a,b). In this procedure the internal dynamic properties of the protein are described in terms of residue-specific measures of the generalized order parameter (S2; see Fig. 4D ▶), a time constant (τi) for fast (picosecond) internal motion, and where necessary a factor Rex introduced for a good fit. The Rex term may be taken to indicate a contribution to the transverse relaxation rate arising from slow (milli- to microsecond) interconversion processes (Clore et al. 1990a) or from specific self-association phenomena (Pfuhl et al. 1999). Where necessary for a good fit an additional order parameter (Sf2) was introduced, which may be taken to indicate motion on an intermediate time-scale (Clore et al. 1990b; Mandel et al. 1995). For the most important parameter to assess fast internal motion, a limiting value of S2 = 1 indicates total restriction and S2 = 0 would indicate no restriction of the internal motion of the amide N—H bond.
The relaxation data (Fig. 4 ▶) for residues between F10 and Y67 show rather uniform values in both the relaxation parameters and extracted S2 values (with a mean S2 = 0.83 and mean {1H}–15N NOE = 0.63) as expected for a folded domain. Outside of this region, from residue A9 to the N terminus and from residue P68 to the C terminus, the relaxation parameters and S2 values indicate a gradually increasing flexibility of the polypeptide chain. In addition, derived Sf2 values (data not shown) were observed in the N- and C-terminal regions of mSox-5, as expected for dynamic disorder having both fast and intermediate time-scale components.
Helices 1 (F10–F25) and 2 (N32–A43) have mean S2 values of 0.85 and 0.86, with mean {1H}–15N NOE values of 0.67 and 0.68, respectively. However helix 3 (residues P51–Y67) exhibits mean S2 and {1H}–15N NOE values of only 0.78 and 0.52, respectively. These lower values, as compared to the rest of the domain, arise from a series of low S2 and {1H}–15N NOE values for residues Q62 to Y67, in the C-terminal end of helix 3. Calculation of the mean S2 and {1H}–15N NOE for just the N-terminal end of helix 3 (residues P51–K61) yielded values of 0.85 and 0.64, values similar to that found for helices 1 and 2. This indicates that helices 1 and 2 and the N-terminal segment of helix 3 do not display fast internal motion, whereas the six residues at the C-terminal end of helix 3 show a degree of dynamic disorder.
In addition, helix 1 exhibits a trend toward lower S2 values along the helix, suggesting the N-terminal end of the helix has a slightly greater degree of dynamic order than the C-terminal end. Isolated low values occurred near the ends of helices 1 and 2 at residues F25, I33, and W41. For loop 1 slightly low values for the mean S2 and {1H}–15N NOE were observed (0.81 and 0.61, respectively), although this was the not case for loop 2 (0.85 and 0.67, respectively). This could suggest that the degree of dynamic order in the loops is as great as in the helices.
The derived Rex terms (data not shown) were widely dispersed throughout mSox-5 and were all small in value (<3.30 sec−1) with the exception of residues L22 (Rex = 4.64) and K49 (Rex = 8.79). Since no concentration dependence on chemical shifts in the 15N–1H HSQC spectra was observed, this would argue against self-association and implies that slow conformational interconversion processes occur at residues L22 and K49, in loops 1 and 2, respectively.
Determination of structures
Structures were initially determined using a restraint file for the complete 79-residue mSox-5 HMG box. Families of consistent structures were clearly observed, but these structures showed that beyond residue 70 there was a high degree of conformational variability. Further sets of structures were therefore obtained using a restraint file for residues 1–70. From 50 calculated structures, the 30 of lowest energy (<1300 kcal mole−1) were selected and further refined in the presence of selected hydrogen bond and dihedral angle restraints, which were based on hard-to-exchange amide proton data (Fig. 2E,F ▶) and 3JHNα coupling-constant data (Fig. 3 ▶), respectively. After the final minimization, all selected structures contained no distance restraint violations greater than 0.5 Å for both backbone and sidechain distance restraints and no dihedral angle violations greater than 10°.
Deviation from a standard α helix was found within helix 1. The 3JHNα coupling constant for D16 is 7.5 ± 0.5 Hz (Fig. 3A ▶), a value compatible with a phi angle of −90 ± 10°. Furthermore, the dαN(i, i + 4) NOE between residues V12 and D16 was absent from the NOESY spectra. Taken altogether, this indicates that the hydrogen bond between the carbonyl O of V12 and the NH of D16 is very extended and nonlinear, leading to distortion of helix 1 at this point. This hydrogen bond restraint was therefore not included in the final modeling.
In the C-terminal part of helix 3 (H63–Y67), the 3JHNα values for K66 and Y67 are 7.2 and 8.2 Hz, respectively, and all the dαN(i, i + 3) and dαN(i, i + 4) NOE cross-peaks were very weak. Furthermore, an increase in exchange volumes of the amide protons of residues 60–67 at the water chemical shift position was observed (Fig. 2E ▶). The more mobile C-terminal end of helix 3 is also manifested in the longer T2 and lower S2 values for residues 62–67 (Fig. 4B,D ▶). Hydrogen bond restraints were therefore applied only to residues E54–Q62 of helix 3.
Residues 1–70 were further examined for multiconformational states using the Xplor v3.851 ensemble program for cross-validation of structures (Bovin and Brunger 1995,1996). Twenty structures were collected, and the averages were determined for the existence of 1, 2, or 3 conformers. The number of violations greater than 0.2 Å were 38.1, 2.3, and 1.3 (with standard deviations of 3.1, 2.0, and 1.3, respectively) for nonrefined 1-, 2-, and 3-conformer models, respectively. The considerable reduction in violations with increasing number of possible conformers, in particular from 1 to 2, could result either from the fact that there are genuinely two conformers or because in a fold having significant flexibility in parts of the structure, an increase in the number of allowed conformers inevitably leads to fewer violations. We therefore compared the two averaged conformers in the 2-conformer model for clear signs of differences at particular points. In loop 2 there were no significant conformational differences, but in loop 1 the two conformers were different. To decide if this was caused by flexibility or the presence of two genuine conformers, we compared the 3 structures of loop 1 in the 3-conformer model: The third average conformer was intermediate between the first two, and we conclude that loop 1 is indeed flexible. This conclusion accords with the finding that the amide NHs of N30 and S31 are in very rapid solvent exchange (Fig. 2F ▶).
Description of the mSox-5 HMG box structure
Figure 5A ▶ shows a stereo backbone view of the 30 final structures comprising residues 1–70 of the mSox-5 HMG box. The structures were overlaid by best-fit superposition of the backbone heavy atoms (amide N, Cα, carbonyl C and O) of residues F10–F25 and N32–K42. These two regions, comprising helices 1 and 2, were chosen for overlay since they were the best defined (number of NOEs per residue >21, with an average of 58) and consistent in position relative to each other, that is, low average pairwise RMSDs. The average pairwise RMSD for the heavy backbone atoms in this set was 0.27 ± 0.13 Å. Superposition of residues F10–F25, N32–K42, and P51–Y67 gave a pairwise RMSD of 1.56 ± 0.90 Å, whereas superposition of all heavy backbone atoms for residues 1–67 gave a pairwise RMSD of 2.3 ± 1.7 Å.
The overall L-shaped conformation of the mSox-5 HMG box is similar to that found for other HMG boxes. The major wing (containing the majority of the amino acids) is the short arm of the fold and contains A9, helix 1 (F10–F25), loop 1 (P26–S31), helix 2 (N32–A43), loop 2 (M44–Q50), and the N-terminal part of helix 3 (P51–Y53) that projects sidechains into the hydrophobic core between helices 1 and 2. Helices 1 and 2 make an angle of 56° to each other. In loop 1, P26–H29 makes a type III β turn, although the distance between the NHs of D27 and M28 is ∼1.0 Å larger than expected for a canonical turn. The following two residues, N30 and S31, are in an extended conformation. In loop 2, M44–L47 are in a partially extended conformation, and the following 4 residues, E48–P51, have an approximately α-helical conformation with a similar set of NOE contacts to the equivalent residues in HMG1 box 2 (Read et al. 1993; Weir et al. 1993; Read et al. 1995). The minor wing is the long arm of the fold and consists of the extended N-terminal segment P1–N8 running alongside and antiparallel to most of helix 3 (E54–Y67).
Figure 5B ▶ shows a view of the major wing, highlighting selected sidechains in all of the 30 final structures. A consistent hydrophobic core (F10, W13, L37, W41, M44, and Y52) is seen with the main apolar contacts having a well-defined geometry. In contrast, the sidechains of M11, N30, and S31 point out into the solvent and show no fixed conformation.
The residues involved in forming the apolar core maintain the orientation of the surrounding three α helices (Fig. 5B ▶). The aromatic rings of F10, W13, and W41 stack onto one another and orient helices 1 and 2. Residues L22 and L37 form hydrophobic contacts between helix 1 and the β turn of P26–H29 and between helices 1 and 2, respectively. The other aromatic residue, F25, makes close contact with the methyl groups of M28 and I21: These serve to maintain the angle between helix 1 and helix 2, as well as stabilizing the β turn between helices 1 and 2. The orientation of helix 1 to helix 3 depends on apolar contact between the methyl group of A9 and the sidechain of Y52 in helix 3.
In the minor wing, NOEs were observed between the sidechains of L59, H63, and L64 and the backbone residues of M7/N8, R5, and K4, respectively. There were also weak NOEs between the sidechains of H63 and Y67 and the sidechain of I3. These fix the position of the N-terminal segment of the box alongside helix 3. At the C-terminal end of helix 3, residues Y67–Y70 form a type III β turn similar to that found in loop 1. In order to better assess the degree of local order in the minor wing, superpositions were made of residues 51–67 (helix 3) and residues 1–9 in the N-terminal segment. Figure 5C ▶ shows that helix 3 exhibits considerable regularity, whereas the nine N-terminal residues are significantly less ordered but more ordered than appears to be the case from the helix 1/2 superpositions of Fig. 5A ▶. The disorder in the central part of the N-terminal segment, R5–M7, is owing to a lack of NOEs to residue P6, which in turn is caused by chemical shift overlap. However, the apparent decrease in structural order from residues 9 to 1 (Fig. 5C ▶) is in good accord with the reduction in S2 shown in Figure 4D ▶.
Conformation of the C-terminal segment
Since a regular NOE intensity pattern was observed for residues lying beyond helix 3, P68–R75, that is, medium/strong dNN, dαN(i, i), very strong dαN(i, i + 1), and very weak dαN(i, i + 2), a new set of structures were calculated that included restraints observed for residues 71–79. The conformation of the C-terminal region, including and beyond helix 3, was then examined using the following superpositions: 51–67, 51–77, and 68–77. This procedure is similar to that carried out for rHMG1 box 2 (Weir et al. 1993) and for mSox-4 (van Houte et al. 1995). From 50 calculated structures, 30 low-energy structures were selected and refined. The final structures when overlaid by superposition of all backbone heavy atoms for residues 51–77 (Fig. 6A ▶) gave a pairwise RMSD of 3.78 ± 1.0 Å, whereas superposition utilizing only helix 3 (P51–Y67, Fig. 6B ▶) gave an RMSD of 0.99 ± 0.30 Å, a value typical for an ordered structure with some variability. The large difference between these two RMSD values indicates that residues beyond the C-terminal end of helix 3 do not have a particularly fixed position. In order to see if there is any regular structure at all in the region beyond helix 3, residues 68–77 were superimposed (Fig. 6C ▶). The backbone heavy atom RMSD for this region was 2.9 ± 1.1 Å. Figure 6C ▶ shows that the overall conformational envelope of these C-terminal residues (68–79) is a hooked shape with a poorly defined turn at P68 (the exit to helix 3) and a second turn centred around P74.
Discussion
Comparison with other HMG box structures
A comparison of the lowest-energy mSox-5 HMG box structure was made with other HMG box structures whose coordinates are available. These are: (1) box 2 of HMG1, of which there are two examples: our own derived from the hamster sequence (Read et al. 1993, 1995) and that derived from the rat sequence (Weir et al. 1993); (2) box 1 from rat HMG1 (Hardman et al. 1995) and the same box complexed with cisplatinated 16-bp DNA (Ohndorf et al. 1999); (3) Drosophila HMG-D (Jones et al. 1994) and the same box complexed with 10 bp of DNA (Murphy et al. 1999); (4) the HMG box of human SRY complexed with 8 bp of DNA (Werner et al. 1995); (5) the mouse LEF-1 HMG box complexed with 15 bp of DNA (Love et al. 1995); and (6) the yeast HMG protein NHP6A, both alone and complexed with 15 bp of DNA (Allain et al. 1999). In the comparison that follows, the residue numbering of mSox-5 is used for the equivalent residues in all structures.
It is striking that superposition of the various HMG box folds using the heavy backbone atoms of helices 1 and 2 (residues 10–25 and 32–42) yields little variation, with the RMSD values ranging only between 1.00 and 1.72 Å (Table 1, column 2). When helix 3 (residues 52–67 in mSox-5) is included in the superposition, the RMSD values rise to between 1.30 and 3.12 Å (Table 1, column 4), indicating that the position of helix 3 in the minor wing is not well defined relative to the major wing (which includes helices 1 and 2). Removal of the last 5 residues of helix 3 from the superpositions, that is, using only residues 52–62 for helix 3, significantly reduced the RMSD values for some of the boxes (Table 1, column 3) and reflects the increased uncertainties in the fold between residues 62 and 67.
Table 1.
RMSD values for the three helices (in Å)2 | Angles in and between helices3 | Pairwise RMSD values of aromatic residues (in Å)4 | |||||||
HMG box | For residues 10–25, 32–42 | For residues 10–25, 32–42, 52–62 | For residues 10–25, 32–42, 52–67 | Bend angle in helix 1 | Angle of Helix 1 to Helix 2 | Angle of helix 2 to Helix 3 | Positions in mSox-5 HMG box | RMSD for Cβ & Cγ atoms | RMSD for all sidechain heavy atoms |
mSox–5 | 35° | 56° | 60° | ||||||
rHMG1 box 1 | 1.32 | 1.25 | 1.30 | 30° | 55° | 58° | 10, 13, 41, 52 | 0.83 | 1.38 |
chHMG1 box 2 | 1.15 | 2.48 | 2.69 | 35° | 52° | 71° | 10, 13, 41, 52 | 0.76 | 1.31 |
rHMG1 box 2 | 1.02 | 2.17 | 2.59 | 40° | 55° | 80° | 10, 13, 41, 52 | 0.86 | 1.60 |
dHMG–D | 1.23 | 1.61 | 1.66 | 55° | 65° | 70° | 10, 13, 41, 52 | 0.84 | 1.40 |
yNHP6A | 1.00 | 1.87 | 2.22 | 35° | 55° | 70° | 10, 13, 41, 52, 53 | 0.90 | 3.39 |
hSRY/DNA | 1.70 | 2.59 | 3.12 | 33° | 60° | 87° | 10, 13, 41, 52 | 2.33 | 3.30 |
hSRY*/DNA | 10, 13, 41, 52 | 2.43 | 1.59 | ||||||
mLEF–1/DNA | 1.22 | 1.54 | 1.49 | 40° | 72° | 55° | 10, 13, 41, 52, 53 | 1.36 | 1.88 |
yNHP6A/DNA | 1.13 | 2.08 | 2.38 | 35° | 55° | 70° | 10, 13, 41, 52 | 0.86 | 0.84 |
dHMG–D/DNA | 1.07 | 1.51 | 1.62 | 50° | 65° | 65° | 10, 13, 41, 52 | 0.48 | 1.47 |
rHMG1 box 1/DNA | 1.72 | 1.65 | 1.63 | 0°–5° | 62° | 64° | 10, 13, 41, 52 | 1.78 | 1.28 |
1 Protein Databank accession numbers laab model 22 (rat HMG1 box 1), 1hsm (hamster HMG1 box 2), 1hme (rat HMG1 box 2), 1hma model 1 (D HMG–D), 1hry (human SRY/DNA), 1qrv (Drosophila HMG–D/DNA), 1 ckt (rat HMG1 box 1/cisplatin DNA). Mouse LEF–1/DNA coordinates were provided by Dr. Peter Wright. Yeast NHP6A and yNHP6A/DNA coordinates were provided by Dr. Julie Feigon.
2 RMSD values were obtained by superimposing the heavy backbone atoms of the above residues with respect to the lowest-energy mSox–5 HMG box structure (residue numbering as in mSox–5).
3 The bend angles in helix 1 are measured in the plane containing both parts of the helix. The included angle between helices 2 and 3 was measured by projecting helix 2 onto the plane through the long axis of helix 3 that lies parallel to the long axis of helix 2. In several cases the long axes of both helices lie approximately in the same plane. The angle between helices 1 and 2 was measured in the same way as that between helices 2 and 3.
4 Sidechain superpositions were carried out as pairwise alignments with the lowest energy structure of the mSox–5 HMG box. hSRY/DNA and mLEF–1/DNA HMG boxes have an aromatic residue at position 53. hSRY*/DNA differs from hSRY/DNA by the removal of residue F53 (to make it more readily comparable with the other HMG boxes).
Most structures show a considerable kink in helix 1 at residue 16 (Table 1, column 5), which does not vary greatly in magnitude, averaging 39.0 ± 8.0°. However, in the case of box 1 of rHMG1 bound to 16 bp of cisplatinated DNA, the bend in helix 1 is absent. This change could be caused by the presence of the cross-linked cisplatinated DNA, changed solvation, or crystal packing, since this is a structure obtained by X-ray crystallography (Ohndorf et al. 1999). Helix 3 of mSox-5 is defined as starting at P51, but the preceding 4 residues are in an approximately helical conformation and have been defined as included in helix 3 in some HMG box structures. We do not include residues L47–Q50 in helix 3 since that turn is inclined at an angle of about 20° to the axis of helix 3 in mSox-5, and is similarly inclined in the other HMG box structures. The angle between helices 1 and 2 is fairly constant (Table 1, column 6), although there is a small difference between the average angle for free and for bound HMG boxes. The angle between helices 2 and 3 (Table 1, column 7) represents the angle between the two wings of the HMG box fold and likewise shows little variation between free and DNA bound boxes. The suggestion that the angle between the two wings of the fold might alter on binding DNA, that is, that there might be a considerable degree of induced fit in the protein (Ferrari et al. 1992) is therefore not supported.
Figure 7A ▶ shows the hydrophobic core for all 6 HMG box folds determined in the absence of DNA, and it is seen that the relative positions of the 4 principal aromatic sidechains (residues 10, 13, 41, and 52) are the same in all 6 folds. Figure 7B ▶ shows the hydrophobic core for the five HMG box/DNA structures (together with the free-solution mSox-5 structure). For these structures, again, the 4 principal aromatic sidechains are in similar orientations, except for hSRY/DNA, for which the positions of the W41 and W13 sidechains are inverted such that it is the sidechain of W13 that contacts residues F52 and F53, rather than the sidechain of W41. A detailed comparison of the conserved aromatic sidechains in the hydrophobic core was made by superimposing them onto those of mSox-5. The pairwise RMSD values obtained for the Cβ and Cγ atoms (Table 1, column 9) and for all sidechain heavy atoms (Table 1, column 10) in the 10 HMG box structures do not differ greatly, with the notable exception of hSRY/DNA. This divergence of hSRY is unexpected because the amino acid sequence of mSox-5 is closer to hSRY than to any of the other 9 structures, and there are considerable differences between the sequence of mSox-5 and the non-sequence-specific HMG boxes.
DNA binding and bending by HMG boxes
HMG boxes all have very similar L-shaped structures, so what distinguishes the sequence-specific and structure-specific HMG boxes and their binding to DNA? Recently, a binding mode has been shown for non-sequence-specific HMG boxes bound to DNA (Allain et al. 1999; Murphy et al. 1999; Ohndorf et al. 1999) in which the residue at position 30 is hydrophobic and partially intercalates into the DNA. For box 1 of rHMG1 and yNHP6A, residue 30 is phenylalanine, and in dHMG-D it is valine. In other non-sequence-specific boxes, for example, mammalian HMG1 box 2, residue 30 is isoleucine. In sequence-specific HMG boxes this residue is polar (asparagine in hSRY and Sox proteins, serine in LEF-1 and TCF1). Partial intercalation of residue 30 is thus not expected for sequence-specific HMG boxes, but could be the norm for non-sequence-specific boxes.
The primacy of the minor wing in establishing DNA sequence specificity was shown by a subdomain swap experiment using hLEF-1 and HMG1 box 2 (Read et al. 1994), and this was borne out by the structures of the two sequence-specific HMG box/DNA complexes (Love et al. 1995; Werner et al. 1995). Several amino acids in the minor wing have been singled out as critical for sequence-specific DNA binding: N8 is involved in hydrogen bonding and electrostatic interactions with three bases and forms the stem of an amino acid wedge that forces intercalation of the sidechain of residue 11 between two adenine rings (Werner et al. 1996), and it is likely that this mechanism operates for mSox-5/DNA binding and involves N8–W13, with M11 as the intercalating residue. In non-sequence-specific HMG boxes, residue 8 is normally serine and never asparagine. At position 11 in sequence-specific boxes the residue is either methionine or isoleucine (occasionally phenylalanine), and in non-sequence-specific boxes it is one of several large hydrophobics, with the notable exception of box 1 of mammalian HMG1 and HMG2, in which it is alanine. Indeed, this residue was found not to be intercalated into the cisplatinated DNA of the complex with rHMG1 (Ohndorf et al. 1999). It seems therefore that for most non-sequence-specific HMG boxes there are two DNA-intercalating residues but only one for sequence-specific boxes.
Residues V3, Y67, and Y70 in hSRY pack together so as to present a precise surface of the N-terminal region to the DNA, which appears critical for DNA sequence-specific recognition (Werner et al. 1995, 1996): In DNA non-sequence-specific boxes, residue 3 is normally proline (Read et al. 1995) and never valine or isoleucine (as in mLEF-1). For this hydrophobic cluster of 3 sidechains to form it appears necessary for the backbone to turn through an approximate right angle at the end of helix 3, a conformation very evident in the structures of hSRY and mLEF-1 bound to DNA (Love et al. 1995; Werner et al. 1995). A proline residue (P68 in mSox-5) seems essential for this change in backbone direction, and proline is conserved at this position in all sequence-specific HMG boxes but absent from all non-sequence-specific boxes (Ner 1992). In free-solution mSox-5, P68 is part of a turn (Y67–Y70) that also results in a change of chain direction by ∼90° (Fig. 6 ▶).
The minor wing of the mSox-5 fold
The structural independence of the minor wing of the HMG box fold from its major wing is indicated by several criteria. In our early studies of the domain structure of HMG1 using limited trypsin digestion (Cary et al. 1983) it was found that box 1 was preferentially cut at the C-terminal sides of residues 6 and 60, positions now seen to correspond to the boundary between the two wings. Importantly, the remaining major wing fragment was shown to be fully folded, demonstrating that the major wing does not require the minor wing in order to fold (Cary et al. 1983). In later experiments we showed that it was possible to construct a folded chimeric HMG box from the major wing of HMG1 box 2 and the minor wing of hLEF-1 that preserved the sequence-specific DNA-binding characteristics of hLEF-1 (Read et al. 1994).
A difference between mSox-5 and non-sequence-specific boxes in free solution is the increased flexibility in helix 3 of mSox-5. The relaxation data in Figure 4 ▶ show a steady decrease in the order parameter S2 beyond residue Q62, 6 residues before the end of helix 3. In marked contrast, the equivalent residues in helix 3 of non-sequence-specific boxes (both boxes from HMG1 [Broadhurst et al. 1995] and dHMG-D [Jones et al. 1994]) showed no increase in dynamic disorder. This might be related to a requirement for a higher level of induced fit in the binding of sequence-specific HMG boxes to DNA. The importance of protein refolding in DNA sequence recognition has recently been discussed (Wright and Dyson 1999).
A DSC and CD study demonstrated that the mSox-5 HMG box denatures as two separate subdomains. The lower melting subdomain was assigned to the minor wing on the basis of changes in the intrinsic fluorescence and NMR spectrum (Crane-Robinson et al. 1998). DSC and CD melting studies of HMG1 box 2 also show the presence of two subdomains, but the stability of the lower melting minor wing is substantially greater than for mSox-5 (P.D. Cary, C.M. Read, C. Crane-Robinson and P.L. Privalov, unpubl.). Deconvolution of the calorimetric Cp/T function observed for mSox-5 suggested that the melting of the minor wing is a cooperative transition with a Tm of 34°C that partially overlaps the melting of the major wing (Tm = 46°C). The relaxation measurements obtained at 25°C (Fig. 4 ▶) are revealing in this respect. If a subdomain denatures cooperatively in a two-state process, then at some defined intermediate temperature one would expect the mobility of all residues in the subdomain to be the same and, in a fast exchange situation, to reflect the relative proportions of the native and denatured states at that temperature. But that is not what is seen in Figure 4 ▶ for the minor wing (residues 54 to the C-terminus and 1–8): The degree of dynamic order of residues 54–61 is high, but that for the succeeding C-terminal residues gradually decreases. The degree of dynamic order of the first 8 N-terminal residues also varies with its position in the chain. This is what would be expected if temperature increase resulted in a gradual unfolding of the minor wing (from the N- and C-terminal ends) rather than a cooperative melting of the whole wing. Relaxation measurements at several temperatures would be required to absolutely prove this point, but such considerable differences in mobility along the minor wing measured at a single temperature leave little doubt that its melting is a continuous process.
Materials and methods
Protein expression, purification, and characterization
The HMG box of mSox-5 (residues 182–260, Denny et al. 1992) was expressed using the pGEX-2T system in Escherichia coli BL21 (DE3) plysS cells, essentially as described (Read et al. 1993, 1994). Following thrombin cleavage of the box from the glutathione–agarose beads, the protein was purified by reverse phase HPLC on a C4 column. The domain was refolded by addition of 20 mM potassium phosphate (pH 6.2), and an NMR spectrum was obtained to confirm that the box folded correctly (as compared with a sample that had never been denatured). The 15N uniformly labeled mSox-5 HMG box was obtained by growth of the cells in an ampicillin-containing M9 minimal medium, with 10 mM 15N ammonium chloride as the sole nitrogen source. When the cell concentration reached 1.0 OD600, IPTG induction was followed by a further 4 hr of growth. Labeled protein was purified by the same means as used for the unlabeled.
The domain studied represents an 81-residue peptide having two additional amino acids (Gly–Ser) at the N terminus of the 79-residue HMG box of mSox-5 (for amino acid sequence, see Fig. 1 ▶). The purified recombinant mSox-5 HMG box showed a single band on both SDS and acetic acid/urea polyacrylamide gels. Electrospray mass spectrometry gave the expected mass of 9804.6 Da. CD spectroscopy of the refolded mSox-5 HMG box indicated an α-helical content of ∼55%. Gel retardation studies showed a band of reduced electrophoretic mobility on binding the protein to a 12-bp DNA duplex containing the recognition sequence 5′-AACAAT-3′, and a circular permutation assay gave a bend angle of 70° (Privalov et al. 1999).
NMR spectroscopy
Two- and three-dimensional NMR spectra were obtained on a home-built General Electrics Omega 600-MHz spectrometer (OCMS, University of Oxford) in 90% H2O/10% 2H2O at 25°C, pH 6.2. Further 2d NMR spectra were obtained at 25°C and 32°C at pH values of 6.2 and 5.4 in 90% H2O/10% 2H2O and in 100% 2H2O in order to resolve problems of cross-peak overlap. The spectra obtained at pH 5.4 were recorded on a Bruker AM600 at Oxford University. Protein concentrations of ∼3 mM and 2 mM were used for the 2d and 3d spectra, respectively, in 75 mM potassium phosphate, 0.5 mM DTT. 1H–1H NOESY spectra (Jeener et al. 1979; Kumar et al. 1980; Kieffer et al. 1994) were collected in the phase-sensitive manner by the time proportional increments method, with mixing times of 120 msec (pH 6.2) and 130 msec (25°C, pH 5.4)/160 msec (32°C, pH 5.4). 1H–1H HOHAHA spectra (Braunschweiler et al. 1983; Davis and Bax 1985; Bax et al. 1987; Kieffer et al. 1994) were collected with a mixing time of 32 msec (pH 6.2) and 35 or 40 msec (pH 5.4), as previously described. Care was taken to optimize baseline flatness in the spectra by appropriate choice of the initial t1 and preacquisition delays (Marion and Bax 1988; Bax et al. 1991). The solvent signal was removed in the F2 dimension by time domain deconvolution (Marion et al. 1989b). Three-dimensional 1H NOESY 15N–1H HMQC and 1H HOHAHA 15N–1H HMQC experiments (Marion et al. 1989a; Messerle et al. 1989; Driscoll et al. 1990) were recorded with 120-msec and 40-msec mixing times, respectively, using 128 × 32 × 512 complex points with spectral widths of 6 kHz (F1), 2 kHz (F2), and 12 kHz (F3). NMR data were processed using the program Felix v2.30 (MSI/Biosym Technologies) and displayed using NMRview (Johnson and Blevins 1994).
Further NMR data were acquired on a Varian UnityPlus 500-MHz spectrometer at University College London. 3JHNα coupling constants, and their temperature dependence, were determined from 15N–1H HMQC-J spectra sequentially recorded at 12°C and 25°C. 3JHNα coupling constants were evaluated with a nonlinear least-squares curve fit to the observed peaks using an analytical expression for the cross-peak lineshape, including the effects of antiphase dispersive character, cross-correlation, and apodization, according to previously described procedures (Norwood et al. 1992). The 15N–1H HMQC-J experiments were also used to define the solvent exchange rate of backbone amide NH protons.
The 15N T1, T2, and {1H}–15N heteronuclear NOE data (Kay et al. 1989) were recorded with a 1 mM mSox-5 sample at 25°C, at a 15N frequency of 50.6 MHz. The data were analyzed using the Lipari–Szabo model-free formalism (Lipari and Szabo 1982a,b) according to the procedures detailed in Pfuhl et al. (1999) and Kristensen et al. (2000) and are essentially identical to those of Palmer and co-workers (Mandel et al. 1995). The 15N T1 and T2 values were estimated by fitting the heights from any given NH peak to a two-parameter exponential decay function. Steady-state {1H}–15N NOE values were calculated as NOE = Īsat/Īunsat, where Īsat and Īunsat are the average signal heights in the presence and absence of 1H presaturation, respectively. All spectra were processed with Felix v2.30, and signals were integrated with Xeasy (Bartels et al. 1995). The uncertainties of the signal heights were estimated from the rms noise level in the spectra. Uncertainties in T1 and T2 times were estimated from Monte Carlo simulation, and uncertainties of the steady-state {1H}–15N NOE were calculated by error propagation.
Structure calculations
Calibration of NOE-derived distances was based on known interatomic spacings within aromatic rings (F10, W13, F25, W41, and Y52) and checked using the well-defined α helices (averaged distances for i to i + 1, i + 2, i + 3, and i + 4). All assigned NOE cross-peaks were then classified as either strong, medium, or weak and for the 3d spectra the distance restraints for the 3 categories were set at 1.8–2.7, 1.8–3.5, and 1.8–5.0 Å, respectively. Since a number of 2d spectra involved mixing times of 120–160 msec, the possibility of spin diffusion effects meant that these spectra had to be recalibrated. It was found, for example, in the case of 2d spectra recorded in 2H2O, that interactions of up to 7.0 Å could be observed; accordingly, distance categories of 1.8–2.7, 1.8–3.5, 1.8–5.5, and 1.8–7.0 Å were applied to these spectra. Stereospecific assignment (Gronenborn et al. 1991) of the methyl protons of V12, L22, and L39 was achieved from 2d NOESY data in which clear chemical shift differences were observed in conjunction with differential intensities.
Xplor v3.851 (Brunger 1993) was used to calculate structures from an initial pseudorandom coordinate template file. Distance NOE restraint files were written as described by Wuthrich et al. (1983), and redundant NOE restraints were eliminated with Aqua v2.0 (Laskowski et al. 1996). Hydrogen bonds and dihedral angles within expected helices were initially set on the basis of: (1) hard-to-exchange backbone NH protons [NOE volumes measured at the NH (15N–1H HSQC) and NH and H2O (3d NOESY) chemical shift positions]; (2) continuous stretches of dαN(i, i + 3) and dαN(i, i + 4) NOE cross-peaks; (3) CαH proton chemical shifts (Wishart et al. 1992); and (4) 3JHNα coupling constants. Restraints for phi dihedral angles were determined using the equation 3JHNα = 6.98 cos2 θ − 1.38 cos θ + 1.72. Psi dihedral angles were not restrained. This led to the use of 1424 (1383) distant NOE restraints, 68 (61) dihedral restraints, and 24 (24) pairs of hydrogen bond restraints, on structures modeled for residues 1–79 and 1–70, respectively.
Simulated annealing was carried out using center averaging for pseudo atom positions using the soft potential for all initial calculations, and these were refined until all NOE restraints, including hydrogen bond violations, were less than 0.7 Å for all structures. Low-energy structures selected from these were further refined to produce a consistent family of structures with distance NOE violations less than 0.5 Å. Structures with energies less than 1300 kcal mole−1 were then refined, from which 30 were selected having energies less than 500 kcal mole−1. Multiconformer analysis of the 1––70-residue domain was carried out using Xplor v3.851 with the lowest energy refined structure as initial template and the same restraint files as used previously. Ensembles of 20 structures were examined for each conformer using a violation setting of 0.2 Å, for a maximum of 3 conformer ensembles.
Acknowledgments
The financial support of the Wellcome Trust is very gratefully acknowledged. We thank Peter Wright and Julie Feigon for provision of coordinates and Iain Campbell for his continuing support. P.C. Driscoll is a Royal Society University Research Fellow.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked ``advertisement'' in accordance with 18 USC section 1734 solely to indicate this fact.
Abbreviations
NOESY, nuclear Overhauser enhancement and exchange spectroscopy
HOHAHA, homonuclear Hartmann Hahn
HMQC, heteronuclear multiple quantum coherence
HSQC, heteronuclear single quantum coherence
CD, circular dichroism
DSC, differential scanning calorimetry.
Article and publication are at www.proteinscience.org/cgi/doi/10.1110/ps.32801.
References
- Allain, F.H.T., Yen, Yi-.M., Masse, J.E., Schultze, P., Dieckmann, T., Johnson, R.C., and Feigon, J. 1999. Solution structure of the HMG protein NHP6A and its interaction with DNA reveals the structural determinants for non-sequence-specific binding. EMBO J. 18 2563–2579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartels, C., Xia, T.H., Billeter, M., Guntert, P., and Wuthrich, K. 1995. The program XEASY for computer-supported NMR spectral analysis of biological macromolecules. J. Biomol. NMR 5 1–10. [DOI] [PubMed] [Google Scholar]
- Bax, A., Sklenar, V., Clore, G.M., and Gronenborn, A.M. 1987. Water suppression in two-dimensional spin-locked nuclear resonance experiments using a novel phase-cycling procedure. J. Amer. Chem. Soc. 109 6511–6513. [Google Scholar]
- Bax, A., Ikura, M., Kay, L.E., and Zhu, G. 1991. Removal of F1-base-line distortion and optimisation of folding in multidimensional NMR-spectra. J. Magn. Reson. 91 174–178. [Google Scholar]
- Bazett-Jones, D.P., Leblanc, B., Hertfort, M., and Moss, T. 1994. Short-range DNA looping by the Xenopus HMG-box transcription factor, xUBF. Science 264 1134–1137. [DOI] [PubMed] [Google Scholar]
- Bianchi, M.E., Beltrame, M., and Paonessa, G. 1989. Specific recognition of cruciform DNA by nuclear protein HMG1. Science 243 1056–1059. [DOI] [PubMed] [Google Scholar]
- Bianchi, M.E., Falciola, L., Ferrari, S., and Lilley, D.M.J. 1992. The DNA binding site of HMG1 protein is composed of two similar segments (HMG boxes), both of which have counterparts in other eukaryotic regulatory proteins. EMBO J. 11 1055–1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bovin, A.M.J.J. and Brunger, A.T. 1995. Variablity of solution nuclear magnetic resonance structures. J. Mol. Biol. 250 80–93. [DOI] [PubMed] [Google Scholar]
- ———. 1996. Do NOE distances contain enough information to assess the relative populations of multi-conformer structures? J. Biomol. NMR 7 72–76. [DOI] [PubMed] [Google Scholar]
- Braunschweiler, L. and Ernst, R.R. 1983. Coherence transfer by isotropic mixing: Application to protein correlation spectroscopy. J. Magn. Reson. 53 521–528. [Google Scholar]
- Broadhurst, R.W., Hardman, C.H., Thomas, J.O., and Laue, E.D. 1995. Backbone dynamics of the A-domain of HMG1 as studied by 15N NMR spectroscopy. Biochemistry 34 16608–16617. [DOI] [PubMed] [Google Scholar]
- Brunger, A.T. 1993. XPLOR: A system for X-ray crystallography and NMR. Yale University, New Haven, CT.
- Cary, P.D., Turner, C.H., Mayes, E., and Crane-Robinson, C. 1983. Conformation and domain structure of the non-histone chromosomal proteins, HMG 1 and 2. Isolation of two folded fragments from HMG 1 and 2. Eur. J. Biochem. 131 367–374. [DOI] [PubMed] [Google Scholar]
- Clore, G.M., Driscoll, P.C., Wingfield, P.T., and Gronenborn, A.M. 1990a. Analysis of the backbone dynamics of interleukin-1β using 2-dimensional inverse detected heteronuclear 15N–1H NMR spectroscopy. Biochemistry 29 7387–7401. [DOI] [PubMed] [Google Scholar]
- Clore, G.M., Szabo, A., Bax, A., Kay, L.E., Driscoll, P.C., and Gronenborn, A.M. 1990b. Deviations from the simple 2-parameter model-free approach to the interpretation of 15N nuclear magnetic-relaxation of proteins. J. Amer. Chem. Soc. 112 4989–4991. [Google Scholar]
- Connor, F., Cary, P.D., Read, C.M., Preston, N.S., Driscoll, P.C., Denny, P., Crane-Robinson, C., and Ashworth, A. 1994. DNA binding and bending properties of the post-meiotically expressed Sry-related protein Sox-5. Nucleic Acids Res. 22 3339–3346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crane-Robinson, C., Read, C.M., Cary, P.D., Driscoll, P.C., Dragan, A.I., and Privalov, P.L. 1998. The energetics of HMG box interactions with DNA. Thermodynamic description of the box from mouse Sox-5. J. Mol. Biol. 281 705–717. [DOI] [PubMed] [Google Scholar]
- Davis, D.G. and Bax, A. 1985. Assignment of complex 1H NMR spectra via two-dimensional homonuclear Hartmann–Hahn spectroscopy. J. Amer. Chem. Soc. 107 2820–2821. [Google Scholar]
- Denny, P., Swift, S., Connor, F., and Ashworth, A. 1992. An Sry-related gene expressed during spermatogenesis in the mouse encodes a sequence-specific DNA-binding protein. EMBO J. 11 3705–3712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Driscoll, P.C., Clore, G.M., Marion, D., Wingfield, P.T., and Gronenborn, A.M. 1990. Complete resonance assignment for the polypeptide backbone of interleukin 1β using three-dimensional heteronuclear NMR spectroscopy. Biochemistry 29 3542–3556. [DOI] [PubMed] [Google Scholar]
- Ferrari, S., Harley, V.R., Pontiggia, A., Goodfellow, P.N., Lovell-Badge, R., and Bianchi, M.E. 1992. SRY, like HMG1, recognizes sharp angles in DNA. EMBO J. 11 4497–4506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giese, K., Amsterdam, A,. and Grosschedl, R. 1991. DNA-binding properties of the HMG domain of the lymphoid-specific transcriptional regulator LEF-1. Genes & Dev. 5 2567–2578. [DOI] [PubMed] [Google Scholar]
- Giese, K., Cox, J., and Grosschedl, R. 1992. The HMG domain of lymphoid enhancer factor 1 bends DNA and facilitates assembly of functional nucleoprotein structures. Cell 69 185–195. [DOI] [PubMed] [Google Scholar]
- Gronenborn, A.M., Filpula, D.R., Essig, N.Z., Achari, A., Whitlow, M., Wingfield, P.T., and Clore, G.M. 1991. A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein-G. Science 253 657–661. [DOI] [PubMed] [Google Scholar]
- Groschedl, R., Giese, K., and Pagel, J. 1994. HMG domain proteins—architectural elements in the assembly of nucleoprotein structures. Trends Genet. 10 94–100. [DOI] [PubMed] [Google Scholar]
- Hardman, C.H., Broadhurst, R.W., Raine, A.R.C., Grasser, K.D., Thomas, J.O., and Laue, E.D. 1995. Structure of the A-domain of HMG1 and its interaction with DNA as studied by heteronuclear three- and four-dimensional NMR spectroscopy. Biochemistry 34 16596–16607. [DOI] [PubMed] [Google Scholar]
- Jantzen, H.M., Adam, A., Bell, S.P., and Tjian, R. 1990. Nucleolar transcription factor hUBF contains a DNA-binding motif with homology to HMG proteins. Nature 344: 830–836. [DOI] [PubMed] [Google Scholar]
- Jeener, J., Meier, B.H., Bachmann, P., and Ernst, R.R. 1979. Investigation of exchange processes by two-dimensional NMR spectroscopy. J. Phys. Chem. 71 4546–4553. [Google Scholar]
- Johns, E.W. 1982. The HMG chromosomal proteins. Academic Press, New York, NY.
- Johnson, B.A. and Blevins, RA. 1994. NMRView: A computer programme for the visualization and analysis of NMR data. J. Biomol. NMR 4 603–614. [DOI] [PubMed] [Google Scholar]
- Jones, D.N.M., Searles, M.A., Shaw, G.L., Churchill, M.E.A., Ner, S.S., Keeler, J., Travers, A.A., and Neuhaus, D. 1994. The solution structure and dynamics of the DNA-binding domain of HMG-D from Drosophila melanogaster. Structure 2 609–627. [DOI] [PubMed] [Google Scholar]
- Kay, L.E., Torchia, D.A., and Bax, A. 1989. Backbone dynamics of proteins as studied by 15N inverse detected heteronuclear NMR-spectroscopy—Application to Staphylococcal nuclease. Biochemistry 28 8972–8979. [DOI] [PubMed] [Google Scholar]
- Kieffer, B., Driscoll, P.C., Campbell, I.D., Willis, A.C., van der Merwe, P.A., and Davis, S.J. 1994. 3-dimensional solution structure of the extracellular region of the complement regulatory protein CD59, a new cell-surface protein domain related to snake-venom neurotoxins. Biochemistry 15 4471–4482. [PubMed] [Google Scholar]
- Kristensen, S.M., Siegal, G., Sankar, A., and Driscoll, P.C. 2000. Backbone dynamics of the C-terminal SH2 domain of the p85α subunit of phosphoinositide 3-kinase: Effect of phosphotyrosine peptide binding and characterization of slow conformational exchange processes. J. Mol. Biol. 299 771–788. [DOI] [PubMed] [Google Scholar]
- Kumar, A., Ernst, R.R., and Wüthrich, K. 1980. A two-dimensional nuclear overhauser enhancement (2d NOE) experiment for the elucidation of complete proton–proton cross-relaxation networks in biological macromolecules. Biochem. Biophys. Res. Commun. 95 1–6. [DOI] [PubMed] [Google Scholar]
- Laskowski, R.A., Rullmann, J.A.C., MacArthur, M.W., Kaptein, R., and Thornton, J.M. 1996. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR 8 477–486. [DOI] [PubMed] [Google Scholar]
- Lefebvre, V., Li, P., and de Crombrugghe B. 1998. A new long form of Sox5 (L-Sox5), Sox6 and Sox9 are coexpressed in chondrogenesis and cooperatively activate the type II collagen gene. EMBO J. 17 5718–5733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lipari, G. and Szabo, A. 1982a. Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 1. Theory and range of validity. J. Amer. Chem. Soc. 104 4546–4559. [Google Scholar]
- Lipari, G. and Szabo, A. 1982b. Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 2. Analysis of experimental results. J. Amer. Chem. Soc. 104 4559–4570. [Google Scholar]
- Lnenicek-Allen, M., Read, C.M., and Crane-Robinson, C. 1996. The DNA bend angle and binding affinity of an HMG box increased by the presence of short terminal arms. Nucleic Acids Res. 24 1047–1051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Love, J.J., Li, X., Case, D.A., Giese, K., Grosschedl, R., and Wright, P.E. 1995. Structural basis for DNA bending by the architectural transcription factor LEF-1. Nature 376 791–795. [DOI] [PubMed] [Google Scholar]
- Mandel, A.M., Akke, M., and Palmer, A.G. 1995. Backbone dynamics of Escherichia coli ribonuclease H1—Correlations with structure and function in an active enzyme. J. Mol. Biol. 246 144–163. [DOI] [PubMed] [Google Scholar]
- Marion, D. and Bax, A. 1988. Baseline distortion in real-fourier-transform NMR-spectra. J. Magn. Reson. 79 352–356. [Google Scholar]
- Marion, D., Driscoll, P.C., Kay, L.E., Wingfield, P.T., Bax, A., Gronenborn, A.M., and Clore, G.M. 1989a. Overcoming the overlap problem in the assignment of 1H NMR spectra of larger proteins by use of three-dimensional heteronuclear 1H–15N Hartmann–Hahn–multiple quantum coherence and nuclear overhauser–multiple quantum coherence spectroscopy: Application to interleukin 1β. Biochemistry 28 6150–6156. [DOI] [PubMed] [Google Scholar]
- Marion, D., Ikura, M., and Bax, A. 1989b. Improved solvent suppression in one-dimensional and two-dimensional NMR-spectra by convolution of time-domain data. J. Magn. Reson. 79 352–356. [Google Scholar]
- Messerle, B.A., Wider, G., Otting, G., Weber, C., and Wuthrich, K. 1989. Solvent suppression using a spin lock in 2d NMR and 3d NMR-spectroscopy with H2O solutions. J. Magn. Reson. 85 608–613. [Google Scholar]
- Murphy, F.V., Sweet, R.M., and Churchill, M.E.A. 1999. The structure of a chromosomal high mobility group protein–DNA complex reveals sequence-neutral mechanisms important for no-sequence-specific DNA recognition. EMBO J. 18 6610–6618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nambu, P.A. and Nambu, J.R. 1996. The Drosophila fish-hook gene encodes a HMG domain protein essential for segmentation and CNS development. Development 122 3467–3475. [DOI] [PubMed] [Google Scholar]
- Ner, S.S. 1992. HMGs everywhere. Curr. Biol. 2 208–210. [DOI] [PubMed] [Google Scholar]
- Norwood, T.J., Crawford, D.A., Steventon, M.E., Driscoll, P.C., and Campbell, I.D. 1992. Heteronuclear 1H–15N nuclear magnetic resonance studies of the C subunit of the E. coli F1F0 ATP synthase: Assignment and secondary structure. Biochemistry 31 6285–6290. [DOI] [PubMed] [Google Scholar]
- Ohndorf, U-.M., Rould, M.A., He, Q., Pabo, C.O., and Lippard, S.J. 1999. Basis for recognition of cisplatin-modified DNA by high-mobility-group proteins. Nature 399 708–712. [DOI] [PubMed] [Google Scholar]
- Parisi, A. and Clayton, D.A. 1991. Similarity of human mitochondrial transcription factor 1 to high mobility group proteins. Science 252 965–969. [DOI] [PubMed] [Google Scholar]
- Payet, D., Hillisch, A., Lowe, N., Diekmann, S., and Travers, A. 1999. The recognition of distorted DNA structures by HMG-D: A footprinting and molecular modelling study. J. Mol. Biol. 294 79–91. [DOI] [PubMed] [Google Scholar]
- Pfuhl, M., Chen, H.A., Kristensen, S.M., and Driscoll, P.C. 1999. NMR exchange broadening arising from specific low affinity protein self-association: Analysis of nitrogen-15 nuclear relaxation for rat CD2 domain 1. J. Biomol. NMR 14 307–320. [DOI] [PubMed] [Google Scholar]
- Pil, P.M. and Lippard, S.J. 1992. Specific binding of chromosomal protein HMG1 to DNA damaged by the anticancer drug cisplatin. Science 256 234–237. [DOI] [PubMed] [Google Scholar]
- Privalov, P.L., Jelesarov, I., Read, C.M., Dragan, A.I., and Crane-Robinson, C. 1999. The energetics of HMG box interactions with DNA: Thermodynamics of the DNA binding of the HMG box from mouse Sox-5. J. Mol. Biol. 294 997–1013. [DOI] [PubMed] [Google Scholar]
- Read, C.M., Cary, P.D., Crane-Robinson, C., Driscoll, P.C., and Norman, D.G. 1993. Solution structure of a DNA-binding domain from HMG1. Nucleic Acids Res. 21 3427–3436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Read, C.M., Cary, P.D., Preston, N.S., Lnenicek-Allen, M., and Crane-Robinson, C. 1994. The DNA sequence specificity of HMG boxes lies in the minor wing of the structure. EMBO J. 13 5639–5646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Read, C.M., Cary, P.D., Crane-Robinson, C., Driscoll, P.C., Carrillo, M.O.M., and Norman, D.G. 1995. The structure of the HMG box and its interaction with DNA. In Nucleic acids and molecular biology (eds. F. Eckstein and D.M.J. Lilley), Volume 9, pp. 222–250. Springer-Verlag, Berlin.
- Russell, S.R.H., Sanchez-Soriano, N., Wright, C.R., and Ashburner, M. 1996. The Dichaete gene of Drosophila melanogaster encodes a SOX-domain protein required for embryonic segmentation. Development 122 3669–3676. [DOI] [PubMed] [Google Scholar]
- Sinclair, A.H., Berta, P., Palmer, M.S., Hawkin, J.R., Griffiths, B.L., Smith, M.I., Foster, J.W., Fischauf, A.M., Lovell-Badge, R., and Goodfellow, P.N. 1990. A gene from the human sex-determining region encodes a protein with homology to a conserved DNA-binding motif. Nature 354 240–244. [DOI] [PubMed] [Google Scholar]
- Van de Wetering, M., Oosterwegel, M., van Norren, K., and Clevers, H. 1993. Sox 4, an Sry like HMG box protein, is a transcriptional activator in lymphocytes. EMBO J. 12 3847–3854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Houte, L.P.A., Chuprina, V.P., van der Wetering, M., Boelens, R., Kaptein, R., and Clevers, H. 1995. Solution structure of the sequence specific HMG box of the lymphocyte transcription activator Sox-4. J. Biol. Chem. 270 30516–30524. [DOI] [PubMed] [Google Scholar]
- Wagner, T., Wirth, J., Meyer, J., Zabel, B., Held, M., Zimmer, J., Pasantes, J., Bricarelli, F.D., Keutel, J., Hustert, E., et al. 1994. Autosomal sex reversal and campomelic dysplasia are caused by mutations in and around the SRY-related gene SOX9. Cell 79 1111–1120. [DOI] [PubMed] [Google Scholar]
- Waterman, M.L., Fischer, W.H., and Jones, K.A. 1991. A thymus-specific member of the HMG protein family regulates the human T cell receptor Cα enhancer. Genes & Dev. 5 656–669. [DOI] [PubMed] [Google Scholar]
- Webb, M. and Thomas, J.O. 1999. Structure-specific binding of the two tandem HMG boxes of HMG1 to four-way junction DNA is mediated by the A domain. J. Mol. Biol. 294 373–387. [DOI] [PubMed] [Google Scholar]
- Wegner, M. 1999. From head to toes: The multiple facets of Sox proteins. Nucleic Acids Res. 27 1409–1420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weir, H.M., Kraulis, P.J., Hill, C.S., Raine, A.R.C., Laue, E.D., and Thomas, J.O. 1993. Structure of the HMG box motif in the B-domain of HMG1. EMBO J. 12 1311–1319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Werner, M.H., Huth, J.R., Gronenborn, A.M., and Clore, G.M. 1995. Molecular basis of human 46X,Y sex reversal revealed from the three-dimensional solution structure of the human SRY-DNA complex. Cell 81 705–714. [DOI] [PubMed] [Google Scholar]
- Werner, M.H., Huth, J.R., Gronenborn, A.M., and Clore, G.M. 1996. Molecular determinants of mammalian sex. TIBS 21 302–308. [PubMed] [Google Scholar]
- Wishart, D.S., Sykes, B.D., and Richards, F.M. 1992. The chemical shift index: Fast and simple method for the assignment of protein secondary structure through NMR spectroscopy. Biochemistry 31 1647–1651. [DOI] [PubMed] [Google Scholar]
- Wolffe, A.P. 1994. Architectural transcription factors. Science 264 1100–1101. [DOI] [PubMed] [Google Scholar]
- Wright, J.M. and Dixon, G.H. 1988. Induction by torsional stress of an altered DNA conformation 5′ upstream of the gene for a high mobility group protein from trout and specific binding to flanking sequences by the gene product HMG-T. Biochemistry 27 576–581. [DOI] [PubMed] [Google Scholar]
- Wright, P.E. and Dyson, H.J. 1999. Intrinsically unstructured proteins: Re-assessing the protein structure–function paradigm. J. Mol. Biol. 293 321–331. [DOI] [PubMed] [Google Scholar]
- Wuthrich, W., Billeter, M., and Braun, W. 1983. Pseudo-structures for the 20 amino acids for use in studies of protein conformations by measurements of intramolcular proton–proton distance constraints with nuclear magnetic resonance. J. Mol. Biol. 169 949–961. [DOI] [PubMed] [Google Scholar]