Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2006 Nov 3;103(46):17225–17230. doi: 10.1073/pnas.0607171103

Insights into transcription enhancer factor 1 (TEF-1) activity from the solution structure of the TEA domain

Asokan Anbanandam *, Diana C Albarado *, Catherine T Nguyen *, Georg Halder , Xiaolian Gao , Sudha Veeraraghavan *,§
PMCID: PMC1859914  PMID: 17085591

Abstract

Transcription enhancer factor 1 is essential for cardiac, skeletal, and smooth muscle development and uses its N-terminal TEA domain (TEAD) to bind M-CAT elements. Here, we present the first structure of TEAD and show that it is a three-helix bundle with a homeodomain fold. Structural data reveal how TEAD binds DNA. Using structure-function correlations, we find that the L1 loop is essential for cooperative loading of TEAD molecules on to tandemly duplicated M-CAT sites. Furthermore, using a microarray chip-based assay, we establish that known binding sites of the full-length protein are only a subset of DNA elements recognized by TEAD. Our results provide a model for understanding the regulation of genome-wide gene expression during development by TEA/ATTS family of transcription factors.

Keywords: DNA-binding protein, gene regulation, NMR structure, development, Scalloped


Transcription enhancer factors (TEFs) and their homologs make a highly conserved family of eukaryotic DNA-binding proteins whose expression patterns have been correlated with the transcription of viral genes (1, 2) and development (37) (Fig. 1A). Four TEFs have been identified in humans, namely, TEF-1, TEF-3, TEF-4, and TEF-5. TEF-1 binds M-CAT elements and regulates the development of cardiac, skeletal, and smooth muscles (3, 8, 9). Consistent with this, TEF-1 knockout results in defective heart development and embryonic lethal phenotypes in mouse (10). TEF-3 is found to be expressed largely in lungs and liver, but its expression also is induced by mitogenic stimulation in quiescent fibroblasts or during differentiation of myoblasts to myotubes (8). Expressed from the two-cell stage onwards, mTEAD2, hTEF-4 homolog, is sufficient for TEAD-dependent transcriptional activity during early embryonic development (7, 11, 12). TEF-5 is expressed in placenta and regulates human somatomammotropin gene enhancer (13, 14) and influences fetal development. Widespread early expression of mouse TEAD4 becomes restricted at later stages to neural and mesenchymal tissues as well as nephronic regions of kidneys. TEF orthologs of other eukaryotes also regulate developmental processes. Scalloped (SD) is required for wing development (15). The only Caenorhabditis elegans TEF homolog, egl-44, regulates differentiation of touch-sensitive cells and egg-laying motor neurons (1618). The less conserved orthologs of Saccharomyces cerevisiae and Aspergillus nidulans, namely TEC1, regulate hyphal development (filamentous growth) (19, 20). Development of virulence in systemic candidiasis in mouse models also depends on Candida albicans TEC1 activity (21). Thus, the requirement for TEA/ATTS family transcription factors is conserved throughout eukaryotic development.

Fig. 1.

Fig. 1.

TEAD transcription factors. (A) TEAD proteins regulate tissue-development in eukaryotes. TEF-1 also regulates viral gene activation and TEC1 homolog is essential for Candida virulence. (B) TEAD sequence alignment shows strict evolutionary conservation of most residues (white letters on black background). TEAD used here contains five extraneous residues (in italics) resulting from cloning procedures. Residue 2 of our TEAD corresponds to residue 28 of the TEF-1. Positions of secondary structural elements, based on our NMR-derived structure, are shown over the sequence. Mutations discussed in the article are identified by ▾.

The most conserved region of TEA/ATTS transcription factors is their N-terminal DNA-binding TEA domain (TEAD) (22, 23) (Fig. 1B), which is unique to this family of proteins. TEF-1 and SD are 68% identical overall and 99% identical in the TEAD. This sequence conservation is sufficient for functional conservation because TEF-1 substitutes for SD in wing development (24). The gene regulatory activities of TEF-1 are governed by interactions with protein cofactors (2527). For instance, interactions of TEF-1 with Max and serum response factor (SRF) regulate the α-MHC gene and normal cardiac and smooth muscle development (28, 29). Vestigial (VG), a protein cofactor of SD in Drosophila, is required for transactivation of wing-specific genes (30, 31). Notably, the VG–TEAD fusion protein is sufficient for wing development (32), indicating that TEAD is necessary and sufficient for the in vivo DNA-binding activity of TEA/ATTS transcription factors.

Two decades since the discovery of TEF-1, the three-dimensional structure of TEAD remains unknown, largely because of difficulties in generating sufficient quantities of protein. Here, we report the first three-dimensional structure of TEAD determined by using solution NMR spectroscopy. This structure contains three α-helices resembling the homeodomain fold and clarifies previously reported mutational effects. The structure provides insights regarding the DNA-binding activity of TEAD, which we have tested by using mutational analyses. We also establish the consensus DNA-binding sequence for TEAD. Based on these findings, we propose a model for TEF activity.

Results and Discussion

Structure Validation.

The accuracy of the final set of 25 structures (Fig. 8, which is published as supporting information on the PNAS web site) is presented in terms of violations of the input distance and dihedral restraints (Table 1). Errors in rmsd reflect precision of structure. Procheck NMR (33) and Errat (34) were used for validation (, which are published as supporting information on the PNAS web site). For the lowest-energy structure shown in Fig. 2, the backbone dihedral angles of the 82 residues exist in most favored (77.5%) and additional allowed (22.5%) regions of the Ramachandran space (Fig. 9).

Table 1.

Structural statistics for 25 final NMR structures of TEAD (1–82)

Statistic Value
Distance and dihedral constraints
    Distance restraints
        Total NOE 1,121
        Intraresidue 585
        Sequential (∣ij∣ = 1) 291
        Medium-range (1 < ∣iJ∣ ≤ 4) 184
        Long-range (∣ij∣ ≥ 4) 60
    Dihedral restraints
        φ 69 (38 HNHA; 31 TALOS)
        ψ 69 (TALOS)
Structure statistics
    Violations, mean ± SD
        Distance constraints, Å 0.031 ± 0.001
        Dihedral angle constraints 0.192 ± 0.063
Deviations from idealized geometry
    Bonds, Å 0.004 ± 0.0001
    Angles, ° 0.363 ± 0.006
    Impropers, ° 0.190 ± 0.009
    Total energy 186.3 ± 6.4
    Bond energy 19.8 ± 1.1
    Angle energy 49.8 ± 1.8
    NOE energy 80.6 ± 4.6
    van der Waals energy 31.9 ± 3.7
    Impropers 3.9 ± 0.4
rms distance from mean structure
    (H1, H2, and H3: residues 11–23, 43–54,     and 62–75)
        Backbone 0.76 ± 0.35
        Heavy 1.31 ± 0.28
    (H2 and H3: residues 43–54 and 62–75)
        Backbone 0.67 ± 0.35
        Heavy 1.31 ± 0.27

Fig. 2.

Fig. 2.

Solution NMR structure of TEAD. (A) Ribbon diagram shows front view of the TEAD fold in which helices H1 and H2 pack against H3. N and C termini are labeled. (B) View down the H3 helix illustrates hydrophobic residues (sticks) contributing to core packing and the surface hydrophobic patch. (C) TEAD unfolds with a midpoint of 2.5 M urea. Fluorescence intensity is relative to that of an equimolar solution of N-acetyl tryptophanamide (NATA).

TEAD Structure.

TEAD is a folded globular protein made of three α-helices, H1, H2, and H3 (Fig. 2A). H1 and H2 are nearly antiparallel with an interhelical angle of 36° (H1-H2). H1 and H2 pack on either side of the H3 with interhelix angles of 109.8° (H1-H3) and 101.2° (H2-H3). Residues 32–34 and 59–60 exist in β space, which allows loops to reverse direction. Residues 1–11, 24–31, 35–42, 55–58, and 61 are unstructured. Of 28 hydrophobic residues in TEAD, the core consists of only 12 residues (Ile-13, Ile-47, Ile-51, Ile-68; Val-64; Leu-21, Leu-71; Ala-48, Ala-20; Phe-17; and Thr-55, Thr-58; Fig. 2B). Of these, L71 makes weak contact to L21 alone. T58 interacts with I13 and T55 to stabilize the L2 loop. The relatively few hydrophobic contacts defining the protein core predicts the thermodynamic stability of TEAD to be low. Consistent with this observation, TEAD unfolds irreversibly with a midpoint for urea denaturation of 2.5 M (Fig. 2C). The TEAD surface contains a hydrophobic patch that consists of I23, Y24, L46, Y50, and L53 and is created by the H1- H2 contact (Fig. 2B). This surface is likely to be a protein docking site (see below).

TEAD Binds DNA with Nanomolar Affinity.

On binding double-stranded 1xGT, the electrophoretic movement of the TEA/DNA complex is retarded relative to the free probe (Fig. 3A). The dissociation constant, KD, is 4–8 nM (Fig. 3B). In comparison, KD for full-length TEAD proteins is 16–45 nM (7). Similarity in these KD values affirms that TEAD alone confers DNA-binding activity to full-length TEA/ATTS proteins. Like the full-length proteins, isolated TEAD also binds tandem sites (2xGT) cooperatively and with nanomolar affinity (35).

Fig. 3.

Fig. 3.

DNA-binding activity of TEAD. (A) EMSA. (B) Nonlinear least-squares fit (solid line) of EMSA data (circles) and 95% confidence limits (dotted lines). (C) Design of DNA sequences used in D-LIA. DNA is attached to the chip by using a linker containing thymidines (T) and poly(ethylene glycol) (P). N5 = GCTATG or GGGGG. (D) Fluorescent spots extracted from the inverted fluorescence image of a microfluidic chip show TEAD binding to various synthetic double-stranded DNA elements. Dark and light spots indicate strong and weak binding, respectively. ∗, DNA sequence is given in Supporting Text. (E) Identification of DNA length suitable for NMR investigations of protein–DNA complex.

TEAD Binds Numerous M-CAT-Like DNA Sequences.

TEFs, SD, and TEC1 share the highly conserved TEAD. The high degree of sequence conservation of the DNA-binding TEAD suggests strongly that their tertiary structure and DNA-binding specificity should also be conserved. Differences in their observed binding selectivities could arise from amino acid differences in their C-terminal domains or be attributable to differences in their cofactor preferences. If so, DNA elements recognized by different full-length TEAD-containing proteins will be a subset of the sequences recognized by the isolated TEAD. To test this in a systematic manner using a large array of synthetic elements, we designed a chip-based assay for direct measurement of protein–DNA interaction named “DNA–ligand interaction assay,” or D-LIA (Fig. 3C and Supporting Text, which is published as supporting information on the PNAS web site). D-LIA results (Fig. 3D) show that TEAD binds strongly to 1xGT DNA with the core sequence 5′-TGGAATGT-3′. Further, TEAD binds all previously reported TEF-1 and TEC1 binding sites, some with higher affinities than others (Fig. 11, which is published as supporting information on the PNAS web site). D-LIA data also yielded the consensus DNA sequence recognized by the isolated TEAD with high affinity, namely: N, not C, G, not G, A, T, N, T; where, N = A, T, C, or G. Thus, six of eight nucleotides in the M-CAT sequence confer binding selectivity. Our finding is somewhat surprising given that known TEF-1 binding sequences contain the central ATG. Also surprising is the finding that T is favored in the eighth position because many TEF-1 binding sequences end with C. These differences suggest that although TEF-1 binds to sequences with G7, C8, its intrinsic affinity for such a sequence is lower than for strongly selected sequences such as TGGAATTT. It also might reflect alterations in DNA-binding specificity induced in the full-length proteins by conformational changes in TEAD (35), either because of the intact C-terminal domain or an associating protein factor.

We also used D-LIA to determine the length of DNA suitable for NMR experiments to map the protein–DNA interactions. D-LIA results show that TEAD binds 1xGT DNA of different lengths, namely, 8-mer, 12-mer, and 16-mer, equally well (Fig. 3E). We chose the 12-mer because it is small enough for NMR studies and expected to be more double-stranded relative to the 8-mer when devoid of 5′ and 3′ flanking nucleotides.

H3 Is the DNA-Recognition Helix.

Two-dimensional 1H,15N-heteronuclear single quantum coherence (HSQC) NMR experiments reveal that ≈30% of TEAD resonances show chemical shift perturbations (Δδ1H ≥ 0.02 ppm and/or Δδ15N ≥ 0.1 ppm) upon binding DNA (Fig. 4A). Of these, 15 backbone and 3 side-chain resonances are shifted unambiguously (labeled). The majority of these residues are located on the back surface of TEAD and in helix H3 and in the L2 loop immediately preceding H3 (Fig. 4B), identifying this as the DNA-binding surface. This result is in agreement with a lack of negative charges and the accessibility of H3 on the DNA-binding surface (Fig. 4C). Negatively charged groups on the front surface would preclude a favorable interaction between TEAD and DNA. Thus, H3 is the DNA-recognition helix of TEAD.

Fig. 4.

Fig. 4.

NMR-mapping of DNA-binding surface. (A) 1H,15N-HSQC spectra of DNA-free (black) and DNA-bound (red) TEAD. Labeled residues show unambiguous and substantial chemical shift changes upon binding DNA. Amino acids in the L1 loop are shown in italics. Asterisks denote side-chain resonances. (B) Backbone (orange) and side-chain (red) resonances that respond to DNA binding identify the DNA-binding surface. (Left) Front view. (Right) Back view. (C) Surface electrostatic potential of TEAD. The front surface (Left) comprises some negatively charged (red) residues, whereas back surface (Right) lacks negative charges.

We find three serines (Ser-65, Ser-66, and Ser-76) on the DNA-binding surface. Phosphorylation of one or more of these could interfere with DNA-binding activity, by introducing electrostatic repulsion and/or steric hindrance, and help regulate TEA/ATTS transcription factor activity. Biochemical data concur with our structural insights; phosphorylation of Ser-76 (Ser-102 of TEF-1) by protein kinase A (36) or of Ser-65 by protein kinase C (37) diminishes DNA-binding activity.

Structural Homologs.

Search for structural homologs of TEAD used DALI (3840). TEAD most closely resembles the homeodomain protein MatA1 (41) (PDB ID 1AKH; Fig. 5A) with an overall backbone rmsd of 1.8 Å (49 residues superimposed, Q score = 0.31, Z score = 3.1). Although large, this root mean square distance (rmsd) is appropriate when comparing the more dynamic DNA-free solution structure of TEAD with the highly stabilized DNA-bound MatA1 in the ordered crystalline form. Structural similarity is even stronger when only residues in helices are compared. The rmsd is 1.4 Å for superimposing residues in the three helices (11–23, 43–54, and 62–75 of TEAD on 79–91, 97–107, and 111–124 of MatA1) or 0.90 Å for residues in H2 and H3 (HTH motif) (Fig. 5B). This is in agreement with our finding that H3 of TEAD is the DNA-recognition helix. The structural homology with homeodomains allows modeling of TEAD/DNA interaction. The results presented here indicate that TEAD is a HTH protein and that it belongs to the homeodomain structural family, proving correct the two-decade-old hypothesis of Davidson et al. (1).

Fig. 5.

Fig. 5.

Models of TEAD structure when bound to DNA. (A) The MatA1 homeodomain (gold) in the MatA1/Matα2/DNA complex (PDB ID 1AKH; ref. 62) is the closest structural homolog of TEAD. (B) Model of TEAD/DNA complex generated by superimposition of H2 and H3 helices of TEAD on MatA1.

A hydrophobic surface, similar to that of TEAD (Fig. 2B), is present in the Matα2/Mat A1 structure and the homeodomain of the Prospero protein. It is used to convert the three-helix homeodomain structure into a four-helix bundle (42, 43). Further, in the Prospero protein, this structure masks the nuclear export signal. Structural homology suggests that the hydrophobic patch of TEAD also might be involved in the formation of a four-helix bundle. The formation of such a four-helix bundle could increase stability of TEAD, provide additional constraints for DNA-selectivity, or regulate nuclear export of TEAD transcription factors.

Like some other homeodomain proteins, TEF-1 is regulated by and interacts with SRF, a protein that is essential for cardiac development (4446). Furthermore, the binding of TEAD and SRF to adjacent elements enhances transcription (29). TEAD/SRF interactions, like in the SAP-1/SRF complex (47, 48), may be mediated by contacts between the two DNA-binding domains (29).

Despite its requirement for normal development, nanomolar affinity for DNA, homeodomain structure, and cooperation with SRF, TEAD is distinct from known homeodomains. First, the amino acid sequence of TEAD is distinct enough from the homeodomain consensus that primary structure analyses do not identify TEAD as a homeodomain family member. Second, the activities of TEAD transcription factors cannot be substituted by other homeodomain proteins, or else, TEF-1 and SD knockout would not produce the reported cardiac or wing phenotypes. Third, the DNA-selectivity of TEAD is different from that of homeodomains. Fourth, unlike homeodomain proteins, both the isolated TEAD and the full-length proteins bind cooperatively to tandem, nonpalindromic, and enhancer elements (1, 35, 49). Thus, TEAD is a divergent member of the homeodomain structural family. The fold conservation suggests an evolutionary advantage in using the same architecture to produce divergent functions.

Structural Correlations to TEAD Activity.

Some residues in the L1 loop respond to DNA binding (Fig. 4A), suggesting that the L1 loop becomes structured and/or contacts DNA. To test whether the L1 loop is a determinant of the DNA-binding activity, we deleted a portion of the L1 loop (residues 26–37) to create the ΔL1 TEAD mutant. The resultant protein is structured (Fig. 12, which is published as supporting information on the PNAS web site) and is competent for DNA binding (Fig. 6A, 1xGT data). However, unlike wild-type TEAD, ΔL1 TEAD does not bind cooperatively to 2xGT DNA. Thus, the L1 loop is essential for cooperative loading of TEAD to tandem sites. Earlier work by DePamphilis and coworkers (7, 50) found a TEF-3 mutation (Gly-38-Asp) that renders the protein DNA-binding-deficient. In light of our findings about the L1 loop, we posit that the Asp-38 TEF-3 mutant does not bind DNA either because of (i) modified L1 structure and/or (ii) unfavorable charge–charge interactions with DNA.

Fig. 6.

Fig. 6.

Correlating TEAD structure and function. (A) EMSA of wild-type and mutant TEAD. Wild-type TEAD molecules bind cooperatively to tandem M-CAT-like elements (2xGT). The L1 deletion mutant of TEAD binds M-CAT DNA stochastically, and cooperativity is lost. (B) Stereoview of TEAD structure is shown. Amino acid side chains in the three helices are shown as lines or sticks (mutants are also labeled). Loops are not shown for clarity of illustration. Gly-28 resides in loop L1. DNA-recognition helix, H3, is shown in red. Side-chain colors: pink, acidic; blue, basic; gold, hydrophobic; gray, other.

In an earlier report, proline substitutions were used to establish identity of the DNA-recognition helix (51). Our TEAD structure indicates that helix I mutant (F17P, L21P) is DNA-binding-deficient because both F17 and L21 are part of the hydrophobic core (Fig. 6B). The mutations likely disrupt secondary and tertiary structures of TEAD. Helix III mutant (Q63P, S66P) weakened but did not abolish DNA binding presumably because these side chains point away from the DNA-binding surface, and their replacement does not disrupt tertiary structure or other key determinants of DNA binding. Helix II mutant L46 and Y50 did not affect DNA-binding activity of TEAD in vitro (51), because these residues are part of the surface hydrophobic patch (Fig. 2B) and are not involved in core formation or DNA interactions. Disruption of SRF-binding in the A48, K52 double-mutant (29) might have resulted from destabilization of tertiary structure because A48 also is part of the hydrophobic core and packs against V64 of the recognition helix.

Two other mutations within TEAD have physiological consequences in Drosophila and C. elegans. First, Sd31H mutant (R59K in TEAD) of Drosophila is wing-formation-deficient (52). R59 is located in the invariant and structurally ordered L2 loop immediately preceding H3 helix. The guanidino group of R59 is likely to hydrogen-bond to DNA backbone. This is supported by interactions with neighboring residues as well as chemical shift changes exhibited by T60 and R61 on binding DNA (Fig. 4A). Therefore, although conserved, the R→K mutation likely reduces hydrogen-bonding interactions and weakens DNA binding. Second, the egl-44 mutation in C. elegans (R54Q in TEAD) converts FSL neuron cells to touch-cell fate (16). R54Q, located at the beginning of the L2 loop, also might affect the DNA-binding function. Deficiency in DNA binding by the R54Q mutant (egl-44) would be consistent with lower expression of egl-46 corepressor and observed derepression of touch-cell fate.

Regulation of TEAD Activity.

TEF-1 and SRF, a MADS box protein like MEF-2, induce transcription in a cooperative manner, and SRF can bind directly to TEAD (29). To determine whether functional cooperativity requires cooperativity in DNA-binding activity of these proteins, we investigated the ability of TEAD and SRF MADS box to bind to the CArG–M-CAT synthetic DNA. We find that although each of the proteins is DNA-binding-competent, the two proteins do not load cooperatively (Fig. 7). It suggests that cooperativity in enhancing transcription may be independent of cooperativity in DNA binding. It is, however, possible that functional cooperativity requires direct or indirect (cofactor-mediated) interactions between the two full-length proteins (TEF-1 and SRF).

Fig. 7.

Fig. 7.

SRF and TEF-1 DNA-binding domains do not load on to adjacent elements cooperatively.

Using the microarray-based D-LIA, we have shown that the isolated TEAD can bind a large number of M-CAT-like sequences. Our finding supports the idea that TEFs may act as repressors, either because of increased occupancy on promoters or because of a shortage of its protein cofactors (4). For instance, in Drosophila, the ectopic expression of VG in the head is sufficient to induce sprouting of wing-like structures (31). Similarly, in mammalian cells, the VG-like-2 protein promotes skeletal muscle differentiation and activates muscle gene expression by switching DNA-binding properties of TEF-1 factors (27, 53). However, in C. elegans, the only TEAD protein, egl-44, requires egl-46 to repress touch-cell fate (16). Thus, the ability of TEAD proteins to bind many M-CAT-like DNA elements makes it a robust transcription factor, whereas its interactions with protein cofactors provide the fine-tuning necessary for tissue-specific action. It is tantalizing to speculate that TEFs, with their promiscuous DNA-binding TEAD, early zygotic expression (12), and ability to interact with p160/SRC (26), might represent a master switch in eukaryotic development.

Materials and Methods

Protein Construct and Sample Preparation.

TEF-1 TEAD (A49S mutant, S49 is found in Sd TEAD) was subcloned into pET21d plasmid (Invitrogen, Carlsbad, CA). A thrombin cut site was introduced between the C terminus of the TEAD and the His6 tag. The L1 mutant was generated by using the QuikChange method (Stratagene, La Jolla, CA) in which residues Pro-26-Arg-37 were deleted. TEAD proteins were overexpressed in CodonPlus Escherichia coli cells (Novagen, San Diego, CA). 15N- and 13C-labeled proteins were grown in minimal media with 15NH4Cl as the only nitrogen source as described previously (54). Up to 1.5 mg of purified TEAD was obtained per liter of cell culture (see Supporting Text for protein purification details). Molecular weight of the 82-aa thrombin cut A49S TEAD (Fig. 1B) is 9,513. TEAD was labeled with Cy-dyes according to the manufacturer's protocol (Amersham Biosciences, Piscataway, NJ). We used an SRF construct provided by Robert Schwartz (Institute of Biosciences and Technology, Texas A&M University, Houston, TX) to similarly clone the MADS box (residues 132–223) into pET21a plasmid and purified the E. coli-expressed protein by using nickel-affinity chromatography.

Urea Unfolding of the TEAD.

Equilibrium denaturation of TEAD was carried out by using ultra-pure urea in 1× binding buffer [15 mM Hepes (pH 7.9) and 150 mM KCl, 1 mM EDTA, 8% glycerol, and 10 mM β-mercaptoethanol]. Individual solutions of 3 μM TEAD were prepared at different urea concentrations and equilibrated overnight at 4°C, and intrinsic fluorescence intensities were recorded by using an SLM Aminco SPF-500 spectrofluorometer (final temperature was ≈12°C). All other procedures were as described elsewhere (55).

Electrophoretic Mobility-Shift Assay (EMSA).

DNA-binding activity of TEAD was established with EMSA. Synthetic 1xGT, 2xGT, and SRE-M-CAT DNA probes used for EMSA contained double-stranded DNA made of two complementary strands. One strand for each is as follows: 1xGT (35): 5′-TTCGATACACTTGTGGAATGTGTTTGATTTGTTAGCCCCG-3′; 2xGT (35): 5′-TTCGATACACTTGTGGAATGTGTGGAATGTGTTAGCCCCG-3′; and SREM-CAT (29): 5′-TGCCTGCTGCCTAAATTTGGAATGTTCTGCTGGGACAA-3′.

DNA was labeled with [32P]ATP (30). Each reaction contained 0, 0.1, 0.3, 1, 3, 10, 30, 100, or 256 ng of protein and 2–4 fmol of labeled DNA. Additional details can be found in the supporting information. SRE–M-CAT element (29) uses the M-CAT sequence TGGAATGT instead of TGGAGTCA. Cooperativity between SRF MADS box and TEAD were carried out by first preparing a complex of SRF (30 ng) with labeled SRE–M-CAT DNA (4 fmol). Because SRF is reported to bind directly to TEAD, we added an equimolar amount of unlabeled SRE to remove excess SRF that was not bound to the SRE–M-CAT probe. This established an equilibrium in which SRF was distributed between the two probes, which yields free radioactively labeled SRF–M-CAT probe. TEAD was then added to this mixture.

D-LIA.

D-LIA chips were designed as shown in Fig. 3C. Each chip contained a total of 3,968 (4 K) wells and tested binding of TEAD to ≈1,800 unique DNA sequences including 1xGT sequence. All steps, except optical scanning, were performed at 4°C. Images were scanned by using the GenePix 4000B DNA chip reader and GenePix Pro 4.1 software (Molecular Devices, Sunnyvale, CA). Fluorescence intensities were determined by using ArrayPro Analyzer (Media Cybernetics, Silver Spring, MD). The intensities were sorted and background subtracted with Microsoft Excel. Mean fluorescence intensity in the blank wells was designated as background. Wells containing linker alone showed background (blank) level signal. The fluorescence intensity in each of the 3,968 wells of a given chip was normalized relative to the average intensity at the 1xGT DNA (5′-TGGAATGT-3′) containing wells in that chip. The normalized fluorescence intensities were used to compile the binding preferences of the TEAD to each DNA sequence on the chips. The results were derived by using the data from three different chips, each containing at least two copies of a given sequence (n ≥ 6).

NMR Spectroscopy and Structure Calculation.

Bruker (Billerica, MA) Avance 600- and 800-MHz spectrometers, equipped with cryoprobes, were used. Center of the proton dimension was referenced relative to water; those of carbon and nitrogen dimensions were indirectly referenced (56). Nitrogen and carbon dimensions were centered at 118 ppm and 75 ppm in the 1H,15N-HSQC and 1H,13C-HSQC spectra. Data were processed and analyzed using FELIX (Accelrys, San Diego, CA). Resonance assignments (1H, 13C, and 15N) used standard triple-resonance methods (57) (see Supporting Text). Valine and leucine methyls were stereo-specifically assigned by using 10% 13C-enriched sample (58). Distance restraints for structure calculations were obtained from proton–proton homonuclear NOESY and three-dimensional 15N- and 13C-separated NOESY spectra. Φ angle restraints were derived from HNHA data (38) or TALOS (59) (Table 1). Structure calculations used crystallography and NMR software (60). Of the final set of 50 structures, 25 lowest-energy structures were selected for further analyses (Fig. 8) and analyzed by using AQUA and PROCHECK-NMR (61).

Chemical-shift mapping used TEAD-DNA complex at 1 mM TEAD and 1.1 mM DNA. Samples were prepared as described elsewhere (60). The double-stranded DNA contained 5′-AGTGGAATGTGC-3′ and its complementary strand. Sample conditions: 100 mM sodium phosphate and acetate-d4 (pH 5.8), 100 mM NaCl, 10 mM βME-d6, and 0.01% NaN3 at 25°C.

Supplementary Material

Supporting Information

Acknowledgments

S.V. thanks C. S. Raman for insightful discussions during manuscript preparation. S.V. thanks the late Fred Rudolf for providing laboratory space immediately after Hurricane Allison (2001) and colleagues at Rice University (Houston, TX) for generously providing space for our NMR laboratory during the recovery period. Youlin Xia is acknowledged for valuable assistance in using the 800-MHz NMR spectrometer located at the University of Houston. Xiaochuan Zhou generously shared his microarray equipment. We thank Pierre Nioche for valuable suggestions regarding cloning and Dong-Sun Lee and Minjun Li for generating the MADS box construct and purifying the corresponding protein, respectively. We thank Eugene Krissinel and Alexey Murzin for discussions on DALI. S.V. gratefully acknowledges the Cold Spring Harbor Laboratories for providing a fellowship to attend their first “Making and Using of DNA Microarrays” course, which spurred on parts of this work. This work was supported in part by American Heart Association Beginning-Grant-in-Aid AHA 0365134Y (to S.V.).

Abbreviations

TEF

transcription enhancer factor

TEAD

TEA domain

SD

Scalloped

SRF

serum response factor

VG

Vestigial

D-LIA

DNA–ligand interaction assay

HSQC

heteronuclear single quantum coherence.

Footnotes

The authors declare no conflict of interest.

Data deposition: The atomic coordinates for TEAD have been deposited in the Protein Data Bank, www.pdb.org (PDB ID 2HZD).

This article is a PNAS direct submission.

References

  • 1.Davidson I, Xiao JH, Rosales R, Staub A, Chambon P. Cell. 1988;54:931–942. doi: 10.1016/0092-8674(88)90108-0. [DOI] [PubMed] [Google Scholar]
  • 2.Ishiji T, Lace MJ, Parkkinen S, Anderson RD, Haugen TH, Cripe TP, Xiao JH, Davidson I, Chambon P, Turek LP. EMBO J. 1992;11:2271–2281. doi: 10.1002/j.1460-2075.1992.tb05286.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Mar JH, Ordahl CP. Mol Cell Biol. 1990;10:4271–4283. doi: 10.1128/mcb.10.8.4271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Xiao JH, Davidson I, Matthes H, Garnier JM, Chambon P. Cell. 1991;65:551–568. doi: 10.1016/0092-8674(91)90088-g. [DOI] [PubMed] [Google Scholar]
  • 5.Campbell S, Inamdar M, Rodrigues V, Raghavan V, Palazzolo M, Chovnick A. Genes Dev. 1992;6:367–379. doi: 10.1101/gad.6.3.367. [DOI] [PubMed] [Google Scholar]
  • 6.Jacquemin P, Hwang JJ, Martial JA, Dolle P, Davidson I. J Biol Chem. 1996;271:21775–21785. doi: 10.1074/jbc.271.36.21775. [DOI] [PubMed] [Google Scholar]
  • 7.Kaneko KJ, DePamphilis ML. Dev Genet. 1998;22:43–55. doi: 10.1002/(SICI)1520-6408(1998)22:1<43::AID-DVG5>3.0.CO;2-7. [DOI] [PubMed] [Google Scholar]
  • 8.Hsu DK, Guo Y, Alberts GF, Copeland NG, Gilbert DJ, Jenkins NA, Peifley KA, Winkles JA. J Biol Chem. 1996;271:13786–13795. doi: 10.1074/jbc.271.23.13786. [DOI] [PubMed] [Google Scholar]
  • 9.Farrance I, Ordahl CP. J Biol Chem. 1996;271:8226–8274. doi: 10.1074/jbc.271.14.8266. [DOI] [PubMed] [Google Scholar]
  • 10.Chen Z, Friedrich GA, Soriano P. Genes Dev. 1994;8:2293–2301. doi: 10.1101/gad.8.19.2293. [DOI] [PubMed] [Google Scholar]
  • 11.Yockey CE, Smith G, Izumo S, Shimizu N. J Biol Chem. 1996;271:3727–3736. doi: 10.1074/jbc.271.7.3727. [DOI] [PubMed] [Google Scholar]
  • 12.Kaneko KJ, Cullinan EB, Latham KE, DePamphilis ML. Development (Cambridge, UK) 1997;124:1963–1973. doi: 10.1242/dev.124.10.1963. [DOI] [PubMed] [Google Scholar]
  • 13.Jacquemin P, Martial JA, Davidson I. J Biol Chem. 1997;272:12928–12937. doi: 10.1074/jbc.272.20.12928. [DOI] [PubMed] [Google Scholar]
  • 14.Jiang SW, Wu K, Eberhardt NL. Mol Endocrinol. 1999;13:879–889. doi: 10.1210/mend.13.6.0288. [DOI] [PubMed] [Google Scholar]
  • 15.Bray S. Curr Biol. 1999;9:R245–R247. doi: 10.1016/s0960-9822(99)80154-7. [DOI] [PubMed] [Google Scholar]
  • 16.Wu J, Duggan A, Chalfie M. Genes Dev. 2001;15:789–802. doi: 10.1101/gad.857401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Desai C, Horvitz HR. Genetics. 1989;121:703–721. doi: 10.1093/genetics/121.4.703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Desai C, Garriga G, McIntire SL, Horvitz HR. Nature. 1988;336:638–643. doi: 10.1038/336638a0. [DOI] [PubMed] [Google Scholar]
  • 19.Gavrias V, Andrianopoulos A, Gimeno CJ, Timberlake WE. Mol Microbiol. 1996;19:1255–1263. doi: 10.1111/j.1365-2958.1996.tb02470.x. [DOI] [PubMed] [Google Scholar]
  • 20.Andrianopoulos A, Timberlake WE. Mol Cell Biol. 1994;14:2503–2515. doi: 10.1128/mcb.14.4.2503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Schweizer A, Rupp S, Taylor BN, Rollinghoff M, Schroppel K. Mol Microbiol. 2000;38:435–445. doi: 10.1046/j.1365-2958.2000.02132.x. [DOI] [PubMed] [Google Scholar]
  • 22.Burglin TR. Cell. 1991;66:11–12. doi: 10.1016/0092-8674(91)90132-i. [DOI] [PubMed] [Google Scholar]
  • 23.Andrianopoulos A, Timberlake WE. Plant Cell. 1991;3:747–748. doi: 10.1105/tpc.3.8.747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Deshpande N, Chopra A, Rangarajan A, Sashidhara LS, Rogdrigues V, Krishna S. J Biol Chem. 1997;272:10664–10668. doi: 10.1074/jbc.272.16.10664. [DOI] [PubMed] [Google Scholar]
  • 25.Mahoney WM, Jr, Hong JH, Yaffe MB, Farrance IK. Biochem J. 2005;388:217–225. doi: 10.1042/BJ20041434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Belandia B, Parker MG. J Biol Chem. 2000;275:30801–30805. doi: 10.1074/jbc.C000484200. [DOI] [PubMed] [Google Scholar]
  • 27.Maeda T, Chapman DL, Stewart AF. J Biol Chem. 2002;277:48889–48898. doi: 10.1074/jbc.M206858200. [DOI] [PubMed] [Google Scholar]
  • 28.Gupta MP, Amin CS, Gupta M, Hay N, Zak R. Mol Cell Biol. 1997;17:3924–3936. doi: 10.1128/mcb.17.7.3924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gupta M, Kogut P, Davis FJ, Belaguli NS, Schwartz RJ, Gupta MP. J Biol Chem. 2001;276:10413–10422. doi: 10.1074/jbc.M008625200. [DOI] [PubMed] [Google Scholar]
  • 30.Halder G, Polaczyk P, Kraus ME, Hudson A, Kim J, Laughon A, Carroll S. Genes Dev. 1998;12:3900–3909. doi: 10.1101/gad.12.24.3900. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Simmonds AJ, Liu X, Soanes KH, Krause HM, Irvine KD, Bell JB. Genes Dev. 1998;12:3815–3820. doi: 10.1101/gad.12.24.3815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Srivastava A, MacKay JO, Bell JB. Genesis. 2002;33:40–47. doi: 10.1002/gene.10086. [DOI] [PubMed] [Google Scholar]
  • 33.Laskowski RA, Rullmann JA, MacArthur MW, Kaptein R, Thornton JM. J Biomol NMR. 1996;8:477–486. doi: 10.1007/BF00228148. [DOI] [PubMed] [Google Scholar]
  • 34.Colovos C, Yeates TO. Protein Sci. 1993;2:1511–1519. doi: 10.1002/pro.5560020916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Halder G, Carroll SB. Development (Cambridge, UK) 2001;128:3295–3305. doi: 10.1242/dev.128.17.3295. [DOI] [PubMed] [Google Scholar]
  • 36.Gupta MP, Kogut P, Gupta M. Nucleic Acids Res. 2000;28:3168–3177. doi: 10.1093/nar/28.16.3168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Jiang SW, Dong M, Trujillo MA, Miller LJ, Eberhardt NL. J Biol Chem. 2001;276:23464–23470. doi: 10.1074/jbc.M010934200. [DOI] [PubMed] [Google Scholar]
  • 38.Holm L, Sander C. Trends Biochem Sci. 1995;20:478–480. doi: 10.1016/s0968-0004(00)89105-7. [DOI] [PubMed] [Google Scholar]
  • 39.Holm L, Sander C. Nucleic Acids Res. 1999;27:244–247. doi: 10.1093/nar/27.1.244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Holm L, Sander C. J Mol Biol. 1993;233:123–138. doi: 10.1006/jmbi.1993.1489. [DOI] [PubMed] [Google Scholar]
  • 41.Li T, Stark MR, Johnson AD, Wolberger C. Science. 1995;270:262–269. doi: 10.1126/science.270.5234.262. [DOI] [PubMed] [Google Scholar]
  • 42.Ryter JM, Doe CQ, Matthews BW. Structure (London) 2002;10:1541–1549. doi: 10.1016/s0969-2126(02)00883-3. [DOI] [PubMed] [Google Scholar]
  • 43.Yousef MS, Matthews BW. Structure (London) 2005;13:601–607. doi: 10.1016/j.str.2005.01.023. [DOI] [PubMed] [Google Scholar]
  • 44.MacLellan WR, Lee TC, Schwartz RJ, Schneider MD. J Biol Chem. 1994;269:16754–16760. [PubMed] [Google Scholar]
  • 45.Carson JA, Schwartz RJ, Booth FW. Am J Physiol. 1996;270:C1624–C1633. doi: 10.1152/ajpcell.1996.270.6.C1624. [DOI] [PubMed] [Google Scholar]
  • 46.Maeda T, Gupta MP, Stewart AF. Biochem Biophys Res Commun. 2002;294:791–797. doi: 10.1016/S0006-291X(02)00556-9. [DOI] [PubMed] [Google Scholar]
  • 47.Mo Y, Ho W, Johnston K, Marmorstein R. J Mol Biol. 2001;314:495–506. doi: 10.1006/jmbi.2001.5138. [DOI] [PubMed] [Google Scholar]
  • 48.Hassler M, Richmond TJ. EMBO J. 2001;20:3018–3028. doi: 10.1093/emboj/20.12.3018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Jiang SW, Desai D, Khan S, Eberhardt NL. DNA Cell Biol. 2000;19:507–514. doi: 10.1089/10445490050128430. [DOI] [PubMed] [Google Scholar]
  • 50.Vassilev A, Kaneko KJ, Shu H, Zhao Y, DePamphilis ML. Genes Dev. 2001;15:1229–1241. doi: 10.1101/gad.888601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Hwang JJ, Chambon P, Davidson I. EMBO J. 1993;12:2337–2348. doi: 10.1002/j.1460-2075.1993.tb05888.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Srivastava A, Simmonds AJ, Garg A, Fossheim L, Campbell SD, Bell JB. Genetics. 2004;166:1833–1843. doi: 10.1534/genetics.166.4.1833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Chen HH, Maeda T, Mullett SJ, Stewart AF. Genesis. 2004;39:273–279. doi: 10.1002/gene.20055. [DOI] [PubMed] [Google Scholar]
  • 54.Veeraraghavan S, Mello CC, Lee KM, Androphy EJ, Baleja JD. J Biomol NMR. 1998;11:457–458. doi: 10.1023/a:1008237029912. [DOI] [PubMed] [Google Scholar]
  • 55.Veeraraghavan S, Holzman TF, Nall BT. Biochemistry. 1996;35:10601–10607. doi: 10.1021/bi960329q. [DOI] [PubMed] [Google Scholar]
  • 56.Wishart DS, Bigam CG, Yao J, Abildgaard F, Dyson HJ, Oldfield E, Markley JL, Sykes BD. J Biomol NMR. 1995;6:135–140. doi: 10.1007/BF00211777. [DOI] [PubMed] [Google Scholar]
  • 57.Clore GM, Gronenborn AM. Prog NMR Spect. 1991;23:43–92. [Google Scholar]
  • 58.Neri D, Szyperski T, Otting G, Senn H, Wuthrich K. Biochemistry. 1989;28:7510–7516. doi: 10.1021/bi00445a003. [DOI] [PubMed] [Google Scholar]
  • 59.Cornilescu G, Delaglio F, Bax A. J Biomol NMR. 1999;13:289–302. doi: 10.1023/a:1008392405740. [DOI] [PubMed] [Google Scholar]
  • 60.Veeraraghavan S, Mello CC, Androphy EJ, Baleja JD. Biochemistry. 1999;38:16115–16124. doi: 10.1021/bi991633x. [DOI] [PubMed] [Google Scholar]
  • 61.Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM. J Biomol NMR. 1996;8:477–486. doi: 10.1007/BF00228148. [DOI] [PubMed] [Google Scholar]
  • 62.Li T, Jin Y, Vershon AK, Wolberger C. Nucleic Acids Res. 1998;26:5707–5718. doi: 10.1093/nar/26.24.5707. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information
pnas_0607171103_1.pdf (84.3KB, pdf)
pnas_0607171103_2.pdf (56.6KB, pdf)
pnas_0607171103_3.pdf (39.7KB, pdf)
pnas_0607171103_4.pdf (74.7KB, pdf)
pnas_0607171103_5.pdf (26.9KB, pdf)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES