Abstract
Intrinsic disorder (ID) is a widespread phenomenon found especially in signaling and regulation-related eukaryotic proteins. The functional importance of flexible disordered regions often resides in their ability to allow proteins to bind different partners. The incidence and location of intrinsic disorder in 369 human single-pass transmembrane receptors with the type I topology was assessed based on both disorder predictions and amino acid physico-chemical properties. We provide evidence that ID concentrates in the receptors’ cytoplasmic region. As a benchmark for this analysis, we present a structural study on the previously uncharacterized intracellular region of human Delta-4 (DLL4_IC), a single-pass transmembrane protein and a ligand of Notch receptors. DLL4_IC is required for receptor/ligand endocytosis, undergoes regulated intra-membrane proteolysis, and mediates protein-protein interactions through its C-terminal PDZ binding motif. Using a recombinant purified protein, we demonstrate using various biophysical methods that DLL4_IC is mainly disordered in solution but can form inter-convertible local secondary structures in response to variations in the physico-chemical milieu. Most of these conformational changes occur in the highly conserved C-terminal segment that includes the PDZ-binding motif. Based on our results, we propose that global disorder, in concert with local pre-organization, may play a role in Notch signaling mediated by Delta-4.
Keywords: Intrinsic disorder, single-pass transmembrane receptors, Notch signaling, PDZ-binding motif, SDS micelles, TFE
INTRODUCTION
Intrinsic disorder (ID) in proteins has been shown to be a widespread phenomenon by both computational and experimental methods and it is now recognized that a certain protein not only can be functional without having a defined three-dimensional structure, but that its functionality can lie indeed in its being disordered.1,2 Extensive disorder predictions in genomes of increasing complexity revealed that ID is the highest in eukaryotes3 and that long ID regions are mostly found in proteins involved in regulation and signaling.4 These two observations were interpreted as the consequence of the increased complexity of the regulatory networks in multicellular organisms, where the plasticity of ID regions in highly connected proteins is exploited to bind different signaling partners through disorder-to-order transitions.5
Systematic analyses carried out on human transcription factors revealed that they have, in general, a high fraction of residues predicted to be disordered.6,7 A recent report computed the distribution of ID in plasma membrane proteins of different transmembrane classes.8 Intriguingly, the authors showed that, in general, ID is more prominent in the cytoplasmic segments as compared to the extracellular ones and proposed that the phenomenon reflects the different cellular niches of proteins (inside versus outside). However, the function of this skewed ID distribution remains unclear. In the report, ID regions in single-spanning membrane proteins appear to be evenly distributed on both sides of the membrane, in contrast to other transmembrane classes.
Here, we focus on the ID incidence and location (intra- or extracellular) in the subclass of human single-pass transmembrane proteins with the type I topology (i.e. proteins with a single transmembrane helix and the N-terminus located in in the extracellular space) and annotated by the Swiss-Prot database as "receptor". Our disorder predictions, mean net charge versus hydropathy plots, and amino acid compositional analysis indicate that ID has a relevant incidence in this class of proteins, and that it is mostly concentrated in the intracellular region. As a benchmark for this analysis, we present an experimental study on a relevant target, Delta-4, a single-pass transmembrane protein and a ligand of Notch receptors. The Notch pathway is a highly conserved, intercellular and bi-directional mechanism that regulates cell differentiation.9,10,11 Five different ligands of Notch receptors have been identified in mammals, three Drosophila Delta orthologs (Delta-like−1, −3, and −4) and two Drosophila Serrate orthologs (Jagged−1 and −2). In recent studies, it was shown that inhibition of the Notch pathway mediated by Delta-4 can promote tumor regression by deregulating angiogenesis, making Delta-4 a potential pharmacological target for the treatment of solid tumors.12,13 Delta-4 (Figure 1) is composed of an extracellular portion, consisting mainly of globular domains (i.e., a Delta/Serrate/Lag-2 (DSL) domain and eight epidermal growth factor-like repeats14), which mediates the binding to the receptor, a trans-membrane segment, and a cytoplasmic tail of 133 amino acids which is likely to be involved in "receptor shedding" through ubiquitination and endocytosis.15 Also, the cytoplasmic tail of Notch ligands couples the Notch signaling network to PDZ-bearing proteins localized at the membrane/cytoplasm interface and is involved in the organization of cell-cell junctions. Delta-4 interacts through its conserved C-terminal motif (ATEV) with the PDZ domains of Dlg-1, the human homologue of Drosophila Discs Large protein.16 Finally, there is compelling evidence that, as a consequence of the proteolytic cleavage of the intracellular region of the ligand and its release from the cell membrane,17–19 Notch-related signal transduction is active not only in the signal-receiving cell, but also in the signal-sending one.17,20,21 The intracellular region of Delta-4 does not display any sequence homology with proteins of known fold. Therefore, our main concern here was to assess wheteher it might encode a new globular fold or, inversely, if it is partly or entirely disordered. Our studies show that a recombinant protein representing the cytoplasmic tail of Delta-4 is intrinsically disordered in solution and that its C-terminal segment is highly plastic, as it can switch between three conformational states (coil, strand, helix) in response to changes in the chemical milieu (pH, dielectric permittivity, hydrophobic-hydrophilic interface). We discuss these results in light of our findings on the ID distribution in single-pass transmembrane proteins and propose that global disorder, in concert with local pre-organization, may play a role in Notch signaling mediated by Delta-4.
Figure 1.
Domain architecture of Delta-4. MNLL, N-terminal domain; DSL, Delta/Serrate/Lag-2 domain; EGF, Epidermal Growth Factor repeat; the transmembrane segment is shown as a blue bar.
MATERIALS AND METHODS
Dataset Preparation and analysis
A set of human membrane proteins was generated by a search of the Swiss-Prot database through the Sequence Retrieval System using “receptor” and “transmembrane” as keywords and “single-pass” in the comment field. Entries having type II topology were manually discarded from the dataset. The total number of sequences thereby collected was 369. This dataset was then divided into two subsets containing intracellular and extracellular domains named “intracellular subset” and “extracellular subset”, respectively. The boundaries of the domains were selected according to the position of the transmembrane helix in the sequences as provided in Swiss-Prot. The DisProt22 dataset (release 3.6) was downloaded from the Database of Protein Disorder (www.disprot.com) and contained 469 entries, while the reduced SCOP dataset was created from the SCOP23 database 1.69 (http://astral.berkeley.edu) by discarding all entries with >40% identity, and contained 1357 sequences. Disorder predictions were carried out using DisEMBL,24 IUPred,25 charge/hydropathy plots,26 and amino acid compositional analysis.27 DisEMBL (v. 1.5; http://dis.embl.de) was run using the three definitions of protein disorder, based on assignments of secondary structure (loops/coils), high values of Cα B-factors (hot loops), and missing coordinates in X-Ray structures (Remark465). IUPred (http://iupred.enzim.hu/) was run calculating pairwise energies within a window of 100 or 25 residues ("long" and "short" disorder definitions, respectively). Charge/hydropathy for each sequence was obtained from the absolute value of the mean net charge versus the mean residue hydropathy calculated using the normalized Kyte-Doolittle scale. Amino acid compositional analysis was carried out using Composition Profiler28 (http://www.cprofiler.org) using the PDB Select 2529 or the DisProt22 datasets as reference for ordered and disordered proteins, respectively. Enrichment or depletion in each amino acid type was expressed as (Cs1-Cs2)/Cs2, i.e., the normalized excess of a given residue's “concentration” in a dataset (Cs1) relative to the corresponding value in the other dataset (Cs2). Amino acid types were ranked according to increasing flexibility.30
DLL4_IC sequence analysis
The DLL4_IC protein sequence was submitted to the PONDR server (http://www.pondr.com) using the default predictor VL-XT.31 Secondary structure predictions (PSIPRED,32 JNet,33 SSpro34) were run from the PHYRE web server (http://www.sbg.bio.ic.ac.uk).
Gene synthesis
The oligonucleotides for the gene assembly were designed with DNAWorks v2.3.35 The amino acid sequence of human Delta-like protein 4 cytoplasmic region (DLL4_IC, corresponding to residues 553–685 of DLL4_HUMAN) was backtranslated using the E. coli Class II codon usage,36 and the generated DNA sequence (dll4_ic) was divided into 18 partially overlapping nucleotides with a maximum length of 40 bases, a calculated annealing temperature (Tm) of 60°C, a Tm range of 2.9°C and a minimal overlap of 13 bases. The oligonucleotide sequences were designed to have the lowest propensity to form hairpins within each oligonucleotide, and to contain no repeats that might lead to mispriming in the polymerase chain reaction (PCR). Synthetic oligonucleotides were purchased from Sigma-Genosys (0.05 µmol scale) and, after being dissolved in equimolar concentration (200 nM), assembled by PCR using Pfu polymerase (Promega) with the following forward and reverse primers (MWG-biotech, 0.05 µmol scale) containing the wanted restriction sites: 5’-TAA TAG TAG CAT ATG AAA CAC CAT CAC CAT CAC CAT CGC CAG CTG CGT CTG CGT-3’ (the underlined sequence encodes the start methionine, followed by a lysine residue and a six-histidine tag) and 5’-TAG TAG GGA TCC TCA TTA AAC TTC AGT TGC GAT CAC GCA CTC ATT ACG TTC-3’, respectively. PCR conditions were: 5 min at 95°C (hot start), 25 cycles of amplification (30 s denaturation at 95°C, 30 s annealing at 58°C, 90 sec elongation at 72°C), 10 min at 72 °C for the final elongation. The assembled and amplified synthetic gene, resulting in a sharp band of the correct size in the agarose gel, was digested by Nde I/BamH I, and ligated into a pET11a vector using standard procedures. DH5α E. coli cells were transformed with the dll4_ic-pET11a construct and selected on LB plates with 100 µg/mL ampicillin. The positive clones were sequenced by automatic DNA sequencing in both forward and reverse directions and the correct one (1/12) used for protein expression.
The nucleotide sequence of the truncated form of DLL4_IC lacking the first 23 amino acids (ΔN-DLL4_IC) was amplified by PCR from the dll4_ic-pET11a construct with the reverse primer used in the DLL4_IC gene synthesis and the following forward primer: 5’-TAG TAG TAG CAT ATG AAA GAT AAC CTG ATT CCG-3’. PCR conditions were the same as above except for the annealing temperature which was set at 58°C. The PCR product was digested by Nde I/BamH I, ligated into a pET11a vector, and the construct used to transform DH5α E. coli cells. The positive clones were selected as above, sequenced in both directions, and the correct one (1/2) used for protein expression.
Protein expression and purification
Purification in denaturing conditions
1 L of LB containing 100 µg/mL ampicillin and 25 µg/mL chloramphenicol was inoculated with a clone of BL21(DE3)pLysS cells transformed with the dll4_ic-pET11a construct. Cells were grown at 37°C to an OD of ~0.8 and protein expression was induced with IPTG 1 mM for 3 h at room temperature. Cells were harvested, washed, and resuspended in the lysis buffer (20 mM phosphate buffer, 0.5 M NaCl, 50 mM CHAPS, 2% TWEEN, 5 mM TCEP, protein inhibitor cocktail tablet (Roche), 10 mM imidazole, 6 M GuHCl, pH 7.4) and sonicated on ice. After centrifugation and filtration through a 0.22 µm filter, the supernatant was loaded on a Ni2+ Sepharose His-Trap HP column (1 mL, Amersham Biosciences), the column washed with 20 mM phosphate buffer, 0.5 M NaCl, 10 mM imidazole, 5 mM TCEP, 6 M GuHCl, pH 7.4, and the protein eluted with a 0.01–0.5 M imidazole gradient. In order to remove fragments derived from partial proteolytic degradation, the eluted material was purified by RP-HPLC with a Zorbax 300SB-CN column (9.4 × 250 mm, 5 µm, Agilent) using a 0–40% gradient of 0.1% TFA in H2O and 0.1% TFA in CH3CN and freeze-dried. The N-terminal His6-tag was removed using a recombinant dipeptidyl aminopeptidase I (DAPase) containing a C-terminal His-tag (TAGzyme, Qiagen) for 2 h at 37°C according to the manufacturer’s protocol. An additional IMAC step on a His-Trap HP column (1 mL) using a 0–0.5 M imidazole gradient removed the peptidase, the partially digested protein and the cleaved His2 dipeptides. The protein was subjected to a final RP-HPLC step and analyzed by LC-MS on a Gilson HPLC system coupled to an ESI-MS single quadrupole mass spectrometer (Applied Biosystems API-150EX), using a Zorbax 300SB-CN column (2.1 × 150 mm, 5 µm, Agilent) and a 0–50% gradient of 0.1% TFA in H2O and 0.1% TFA in CH3CN. Deconvolution of the multicharge ion spectrum was carried out using the BioMultiView software (Applied Biosystems). and confirmed the correct molecular size of the purified product (Mr calculated: 14894 Da; Mr observed: 14893 Da). The purified protein was freeze-dried and used for spectroscopic studies. The yield was 8 mg protein per 1 L of culture.
Purification in native conditions
In order to test the possibility that the above harsh purification conditions (i.e., 6 M GuHCl in lysis and IMAC buffers, RP-HPLC acidic buffers [pH~2], freeze-drying process) could irreversibly denature the protein, DLL4_IC was also purified in native conditions. Cells were grown and protein expression was induced as described above. Cells were then harvested, washed and resuspended in the lysis buffer (20 mM phosphate buffer, 0.5 M NaCl, 50 mM CHAPS, 2% TWEEN, 5 mM TCEP, protein inhibitor cocktail tablet, 10 mM imidazole, pH 7.4). After the cells had been sonicated and spun as described, the supernatant was loaded onto a His-Trap HP column, which was washed with 20 mM phosphate buffer, 0.5 M NaCl, 10 mM imidazole, 5 mM TCEP, pH 7.4, and the protein eluted with a 0.01–0.5 M imidazole gradient. The eluted fractions were pooled and diluted with buffer A (20 mM phosphate buffer, 5 mM DTT, pH 7.4). The His6-DLL4_IC protein was purifed by ionexchange chromatography on an 8 × 75 mm SP column (SP-825, Shodex) using a 0–50% gradient from buffer A to buffer B (20 mM sodium phosphate, 1 M NaCl, 5 mM DTT, pH 7.4). The eluate was concentrated by ultrafiltration on a Centricon 3000 (Amicon), followed by dilution with the final buffer (5 mM phosphate, 1 mM TCEP) for the subsequent spectroscopic analysis. Although the protein yield was quite low, the achieved protein purity was satisfactory (>95%) and the correct molecular size was confirmed by LC-MS.
15N isotopic enrichment
1 L of 15N-M9 minimal medium (6 g/L Na2HPO4, 3 g/L KH2PO4, 0.5 g/L NaCl, 1.3 g/L glycerol, 0.5 g/L 15NH4Cl, 0.12 g/L MgSO4, 0.01 g/L CaCl2) containing nutrients supplemented as yeast nitrogen base w/o amino acids or NH4SO4 (1.7 g/L), pH 7, with 100 µg/mL ampicillin and 25 µg/mL chloramphenicol was inoculated with a clone of BL21(DE3)pLysS cells transformed with the dll4_ic-pET11a construct. Cells were grown overnight at room temperature to an OD of ~0.6 and protein expression was induced with IPTG (1 mM) for 4.5 h at 37°C. Protein purification was carried out in denaturing conditions as described above, with a final yield of ~8 mg of pure product per 1 L of culture
Expression of ΔN-DLL4_IC and purification from inclusion bodies
1 L of LB containing 100 µg/mL ampicillin and 25 µg/mL chloramphenicol was inoculated with a clone of BL21(DE3)pLysS cells transformed with the ΔN-dll4_IC construct. Cells were grown at 37°C to an OD of ~0.9 and protein expression induced with IPTG (1 mM) for 3 h at 37°C. Cells were harvested, washed and resuspended in the lysis buffer (50 mM Tris-HCl buffer, 5 mM EDTA, 0.5% Triton-X100, 0.1 mM PMSF, 1 mM DTT and protein inhibitor cocktail tablet (Roche)) and sonicated on ice. After sonication, MgSO4 (10 mM) was added to chelate EDTA and the inclusion bodies collected by centrifugation at 6000 rpm for 15 min. The pellet was washed twice with lysis buffer and an additional wash was carried out without Triton-X100. The final inclusion body pellet was resuspended in 100 mM Tris-HCl, 50 mM Glycine, pH 8.0, dispersed by sonication and dissolved dropwise with the same buffer containing urea to a final concentration of 6 M urea. The urea was eliminated from the solution with a HiPrep 26/10 Desalting column and a final purification step was performed using RP-HPLC with a Zorbax 300SB-CN column (9.4 × 250 mm, 5 µm, Agilent) using a 0–60% gradient of 0.1% TFA in H2O and 0.1% TFA in CH3CN. The freeze-dried product was analyzed by LC-MS which confirmed the correct molecular size (Mr calculated: 1267.9 Da; Mr observed: 1265.0 Da).The yield was ~4 mg protein (~95% purity) per 1 L of culture.
Peptide synthesis
All peptides (P1, res. 582–618; P2, res. 619–661; P3, res. 662–685) were prepared by standard solid-phase Fmoc methods using a home-built automatic synthesizer based on a Gilson Aspec XL SPE. After cleavage/deprotection of the peptide-resin (preloaded NovaSyn TGT, Novabiochem) in TFA/1,2-ethanedithiol/triisopropylsilane/H2O 90/5/2.5/2.5 v/v/v for 2 h, deprotected, reduced peptides were purified by semi-preparative RP-HPLC on a Zorbax 300SB-C18 column (9.4 × 250 mm, 5 µm, Agilent) and freeze-dried. P1 was purified using a 0–60% gradient of 0.1% TFA in H2O and 0.1% TFA in CH3CN. P2 and P3 were purified using a 0–40% gradient of triethylammonium acetate (TEAA) (10 mM, pH 7 in H2O) and TEAA in 80% CH3CN, followed by further purification and desalting using a 0–60% gradient of 0.1% TFA in H2O and 0.1% TFA in CH3CN. The identity of the peptides was checked by LC-MS and the yield and purity estimated from RP-HPLC. Final yields were in the range 40–60% and purity > 95%.
Size exclusion chromatography
The freeze-dried protein powder was dissolved in the elution buffer (50 mM Tris-HCl, 100 mM KCl, pH 7.4), loaded onto a Sephacryl S-200 column (Pharmacia) and eluted in the same elution buffer. The apparent molecular mass of DLL4_IC was deduced from a calibration carried out with the following molecular standards: lactate dehydrogenase (147 kDa), bovine serum albumin (67 kDa), carbonic anhydrase (29 kDa) and horse myoglobin (17 kDa). Stokes radii of native (RSN) and fully unfolded (RSU) proteins of known molecular weight (MW) were determined according to the equations37: log(RSN) = −(0.254 ± 0.002) + (0.369 ± 0.001) log(MW), and log(RSU) = −(0.543 ± 0.004 )+ (0.502 ± 0.001) log(MW).
Circular dichroism (CD)
The freeze-dried protein powder was dissolved either in 5 mM MES buffer, 1 mM TCEP, pH 6.3, or in 5 mM Tris buffer, 1 mM TCEP, pH 7.5. Protein concentration was determined by UV absorbance at 280 nm using the calculated ε value of 2560 M−1cm−1. CD spectra of solutions of DLL4_IC, ΔN-DLL4_IC or the synthetic peptides, were recorded on a Jasco-810 spectropolarimeter in the 190–250 nm range, using quartz cuvettes (path length 0.1 cm). Spectra were averaged from 5 scans of 0.1 nm steps at 20–50 nm/min. The secondary structure analysis was performed using the Dichroweb38 tools CDSSTR, CONTIN and SELCON. The helical content was also determined from the mean residue ellipticity (MRE, deg·cm2/dmol) at 222 nm ([θ]222) according to the equation: [α] = 100·[θ]222/θf and θf = −40000·(1–2.57/n) where [α] is the amount of helix, n is the number of residues, and θf is the maximum MRE of an α-helix of n residues.39
NMR spectroscopy
The sample for NMR spectroscopy was prepared by dissolving the freeze-dried material in H2O/D2O (90/10, v/v) containing 4 mM TCEP, 2 mM EDTA-d16, 15 µM DSS and adjusting the pH to 5.6 with small aliquots of 0.1 N NaOH, for a final protein concentration of ~0.5 mM. The sample containing SDS was prepared dissolving solid SDS sodium salt in the NMR sample, for a final SDS concentration of 50 mM. After flushing the NMR tube with argon, spectra were recorded at 303 K on a Bruker spectrometer operating at a 1H frequency of 600.13 MHz and equipped with a 1H/13C/15N triple resonance Z-axis gradient probe. Transmitter frequencies in the 1H and 15N dimensions were set on the water line and at 118.0 ppm, respectively. HSQC and HSQC-TOCSY experiments were carried out in phase-sensitive mode using echo/antiecho-TPPI gradient selection and 15N decoupling during acquisition. HSQC spectra were acquired with 1K complex points, 256 t1 experiments, 32 scans per increment, over a spectral width of 13 and 28 ppm in the 1H and 15N dimensions, respectively. HSQC-TOCSY spectra were acquired with the same parameters, but with 128 scans per t1 increment and a 40 ms DIPSI mixing time. Data were transformed using X-WinNMR (Bruker) and analyzed using CARA (http://www.nmr.ch). 1H chemical shifts were referenced to internal DSS.
RESULTS
Intrinsic disorder is prevalent in the cytoplasmic region of human single-pass transmembrane receptors
A dataset of 369 sequences that included all human single-pass receptors was generated and subsequently divided into two subsets: the first consisted of the receptors' intracellular regions, while the second included the extracellular regions. The location of the transmembrane helices for identifying the sequences' boundaries within the subsets was assigned according to the Swiss-Prot database. DisProt,22 the database of protein disorder (469 sequences), and a reduced set (1357 sequences) extracted from SCOP,23 a database of domains of known structure, were used as control datasets. ID regions in each of these datasets were predicted by subjecting the sequences to DisEMBL.24 Figure 2 shows the fraction of sequences in the datasets versus the fraction (≥ 20%) of residues predicted to be disordered. First, a surprisingly high incidence of ID is observed in the intracellular subset, as it contains a significantly higher fraction of disordered residues relatively to the DisProt dataset. Also, the intracellular subset contains on average a higher amount of disordered residues as compared to the extracellular subset, with a relevant fraction of mainly (>50% of disordered residues) disordered sequences (13% and 38% according to the Remark465 and Hot Loop definitions, respectively24), whereas in the latter the incidence of ID is only slightly higher than that computed for the SCOP domain database. IUPred predictions are consistent with DisEMBL results. The fraction of disordered residues is higher in intracellular regions, as calculated from pairwise energies (see supplementary data). Overall, the % of residues with a IUPred score above the threshold of 0.5 is 29 and 10 for intra- and extracellular regions, respectively. Very similar results were obtained using either the "long" or "short" disorder definition.
Figure 2.
Disorder predictions. Percentage of sequences in datasets (reduced SCOP, DisProt, extracellular and intracellular regions of type I single-pass receptors) containing ≥ 20% predicted disordered residues. Intrinsically disordered regions were computed by DisEMBL using the Remark465 definition.
A combination of low mean hydropathy and relatively high net charge was shown to represent an important prerequisite for the absence of compact structure in proteins under native conditions.26 Consequently, intrinsically disordered proteins generally localize within a unique region of the charge/hydropathy phase space, whereas ordered proteins cluster in a separate region (grey and green shaded areas in Figure 3, respectively).40 In Figure 3, we plotted the absolute value of the mean net charge versus the mean hydropathy values for the sequences in both the intracellular and extracellular subsets. Clearly, the mean net charge/hydropathy of the intracellular regions is broadly distributed in the phase space, as compared to that of their extracellular counterparts, with an averagely higher mean net charge and a relatively lower mean hydropathy, suggesting that the intracellular regions are more prone to intrinsic disorder. This is consistent with the abovementioned disorder predictions.
Figure 3.
Intrinsic disorder. The absolute value of the mean net charge is plotted versus the mean scaled Kyte-Doolittle hydropathy for the Intracellular Subset (circles) and Extracellular Subset (diamonds). The shaded areas contain 90% of the points from the dataset of disordered regions (DisProt, grey) and the dataset of domains of known structure (SCOP, green). The border between structured and natively disordered proteins is drawn as a line.
The amino acid compositional analysis confirms these observations. Figure 4 shows a comparison of the amino acid compositions of the intra- and extracellular subsets along with the comparison between the two control datasets (ordered and disordered proteins). With few exceptions (see Discussion), the intracellular set is depleted in the order-promoting residues41 (W, C, F, I, Y, V, L, N) and enriched in the disorder-promoting residues (A, R, G, Q, S, P, E, K) compared to the extracellular set. The same trend is observed in the comparison between the datasets of ordered and disordered proteins.
Figure 4.
Amino acid compositional analysis. Enrichment or depletion in each amino acid type appears as a positive or negative bar, respectively. Amino acids are indicated by the single-letter code and ordered according to increasing flexibility. Order-promoting residues: W, C, F, I, Y, V, L, N; disorder-promoting residues: A, R, G, Q, S, P, E, K; undefined: H, M, T, D. Error bars are also shown.
Taken together, our disorder predictions, charge/hydropathy and compositional analyses strongly suggest a major incidence of ID in human single-pass transmembrane receptors, and that ID is highly concentrated in the cytoplasmic region.
A Case Study: the cytoplasmic region of the Notch ligand Delta-4
Sequence analysis
The amino acid sequence (Figure 5a) of the cytoplasmic region of Notch ligand Delta-4 (DLL4_IC) was retrieved from the intracellular subset (blue circle in the mean net charge/hydropathy plot in Figure 3) and subjected to disorder and secondary structure predictions. PONDR,31 a predictor-based neural network trained with sequences of intrinsically disordered regions, predicted four disordered stretches that account for 37% of the entire sequence of DLL4_IC (Figure 5b), whereas DisEMBL predicted 10% (Remark465) and 55% (Hot Loop) of disordered residues. DLL4_IC is expected to adopt some secondary structure, as suggested by different secondary structure predictors (PSIPRED,32 JNet,33 SSpro34). All tested methods predicted the presence of an α-helix in the N-terminal region, starting at R13, and four stretches of β conformation, two located after the α-helix and two located at the C-terminus and partially including the PDZ binding motif (Figure 5a).
Figure 5.
Sequence analysis and peptide design. (a) Amino acid sequence of DLL4_IC and secondary structure predictions (h, helix; e, β-strand; c, coil) obtained by running PSIPRED, JNET and SSpro from the PHYRE web server (http://www.sbg.bio.ic.ac.uk/); the consensus secondary structure prediction and the score are also shown. The solid line above the sequence (_) indicates residues included in ΔN-DLL4_IC (res. 24–133). Segments covered by peptides are marked with (--) (P1, res. 30–67), (**) (P2, res. 68– 109), (++) (P3, res. 110–133). (b) PONDR prediction of unstructured regions in DLL4_IC with use of the VL-XT predictor. The prediction score is plotted against the residue number. Regions with a score higher than 0.5 are considered to be disordered. (c) Predicted secondary structure and sequence alignment of Delta-4 C-terminal region.
Protein expression and purification
The recombinant protein corresponding to the intracellular region of human Delta-like 4 (DLL4_IC, residues 553–685 of DLL4_HUMAN, 133 amino acids) (Figure 5a) was expressed in E. coli from a synthetic gene designed to optimize the codon usage for heterologous expression (see supplementary data). Despite an extensive proteolytic degradation, the final material was highly pure (>95%, as determined by RP-HPLC), and could be recovered with a good yield (8 mg/L), thus allowing its characterization by circular dichroism, NMR and size exclusion chromatography. The truncated protein ΔN-DLL4_IC was expressed in a similar system, and recovery from inclusion bodies allowed for a single-step purification by RP-HPLC.
DLL4_IC is mainly disordered in solution
The presence of secondary structure in DLL4_IC was investigated by circular dichroism spectroscopy (CD). The far-UV CD spectrum of DLL4_IC (Figure 6a) in Tris buffer shows a strong minimum at 198 nm, which is typical of disordered proteins. The deconvolution results using CDSSTR showed a high content of unordered structure (77%) and a poor residual presence of secondary structure (3% helix, 10% strand, and 8% turns). Very similar results were obtained from the CD spectrum of DLL4_IC purified in native conditions, confirming that the purification process did not affect the intrinsic conformation of DLL4_IC (see supplementary data).
Figure 6.
Circular dichroism. Far-UV CD spectra of DLL4_IC (7.6 µM) in 5 mM Tris-HCl buffer, 1 mM TCEP, pH 7.5, and in the presence of increasing concentrations of TFE (5, 10, 15, 20 %, v/v) (a) or 10 mM SDS at pH 7.5 or 6 (b).
The conformation of DLL4_IC was further analyzed by NMR spectroscopy. From the 1H-15N HSQC spectrum of the 15N-labelled protein, ~100 HN backbone resonances could be identified, which corresponds to ~80% of the peaks expected, the greater majority being clustered in a narrow region comprised between 8.0 and 8.5 ppm (Figure 7a). The average value of 1HN chemical shifts (8.27 ppm) is nearly identical and the dispersion only slightly larger (σ = 0.18) compared to the values expected for a protein of the same amino acid composition and assuming random coil values42 for all residues (8.22 ppm and σ = 0.14, respectively) (Figure 7c). The lack of chemical shift dispersion in the HN region as well as in the methyl region (data not shown) indicates the absence of globular structure, and the presence of little, if any, secondary structure. The presence of strong and sharp resonances accompanied by much weaker peaks in the 1H-15N HSQC spectrum, and the few peaks that could be identified in the HN-Hα region of the 1H-15N HSQC-TOCSY spectrum (see supplementary data) are also suggestive of the presence of conformational exchange processes.
Figure 7.
NMR. 1H-15N HSQC spectra of DLL4_IC (0.5 mM) in H2O/D2O (90/10, v/v), 4 mM TCEP, pH 5.6, recorded at 303 K in the absence (a) or presence (b) of SDS (50 mM); (c) distribution of 1HN chemical shifts of DLL4_IC in H2O/D2O (black bars), in the presence of SDS (grey bars) and for random coil values for a protein of the same amino acid composition (white bars).
In order to better characterize the conformation of DLL4_IC in solution, we studied its hydrodynamic properties through size exclusion chromatography. DLL4_IC (15 kDa) was eluted from the size exclusion column as a peak corresponding to a 31 kDa globular protein (Figure 8 and supplementary data). The sharpness and symmetry of the peak indicates the presence of a single, well-defined species. The elution volume of a protein from a size exclusion column correlates with its hydrodynamic properties. The hydrodynamic radius (Stokes radius, RS) of a protein can be deduced from its apparent molecular weight (MW) as determined by size exclusion. The calculated RS for an apparent MW of 31 kDa is 25.3 ± 0.4 Å. The theoretical radius of a monomeric protein in either a native or unfolded state (RSN or RSU) can be derived from its known molecular size.37 In the case of DLL4_IC, RSN = 19.3 ± 0.3 Å and RSU = 35.6 ± 0.7 Å. Therefore, the large value of the Stokes radius for DLL4_IC, experimentally determined by size exclusion (25.3 Å), intermediate between RSN and RSU, is consistent either with a folded, stable dimer, or with a monomeric, disordered but partially compact state. Since both CD and NMR data appear to rule out a folded globular state, we interpret size exclusion chromatography results as indicative of the presence of a monomeric, disordered but partially compact state.
Figure 8.
Size exclusion chromatography. Calibration standards are shown as open circles (1, lactate dehydrogenase [147 kDa]; 2, bovine serum albumin [67 kDa]; 3, carbonic anhydrase [29 kDa]; 4, horse myoglobin [17 kDa]), DLL4_IC as a filled square (apparent MW = 31 kDa). The calibration curve (R = 0.99) is also shown. Ve = elution volume.
Taken together, CD, size exclusion chromatography, and NMR data are consistent with a mainly disordered state of the protein in solution, and the presence of little or no secondary structure.
DLL4_IC displays propensity to form secondary structures
Prediction results lead us to speculate that the secondary structure of DLL4_IC might be stabilized by agents promoting native structure or by an artificial hydrophilic/hydrophobic interface (SDS micelles). In order to test this possibility, we first analyzed the secondary structure of DLL4_IC in the presence of different concentrations of TFE, which promotes secondary structure formation by reducing the protein backbone exposure to the aqueous solvent and favoring the formation of intramolecular hydrogen bonds.43 Starting from a disordered conformation in aqueous solution, a significant change in the secondary structure was observed upon the addition of increasing amounts of TFE. The CD spectra developed a strong ellipticity at 206 nm and a shoulder at 222 nm, characteristic of an α-helical structure, at the expense of the minimum at 198 nm (Figure 6a). The helical content increased from 3% to 17% upon TFE addition (0–20%, v/v), with a drastic change in ellipticity already observed between 10% and 15% TFE. Also, a significant increase in β-strand and turns structure was observed in the presence of TFE. These results confirm that DLL4_IC has the intrinsic propensity to form secondary structures, and the measured content of these is consistent with the predictions.
Similarly, at increasing concentrations of SDS, DLL4_IC undergoes a conformational change towards the α-helical structure, reaching a maximum of ~8% of α-helix at saturation (10 mM SDS, see supplementary Material). Interestingly, at the same saturating SDS concentration (10 mM), the α-helical content experiences a significant increase as the pH decreases (8% of α-helix at pH 7.5 versus 23% at pH 6.3, Figure 6b), while the same shift in pH only slightly increases the β-strand content of the protein in buffer.
The conformation of DLL4_IC in the presence of SDS micelles was further analyzed by NMR. The 1H-15N HSQC spectrum of DLL4_IC obtained at saturating concentrations of SDS is somewhat different from that of the protein alone (Figure 7a and b). Although several resonances are still missing, probably due to overlap, HN cross-peaks appear to be of similar intensity and slightly better dispersed. Most of HN backbone resonances are still clustered in a relatively narrow region (7.7–8.4 ppm), but the average value of 1HN chemical shifts (8.11 ppm) is smaller and the dispersion slightly larger (σ = 0.20) compared to the values obtained for the protein alone (Figure 7c). Moreover, ~90 cross-peaks could be counted in the HN-Hα region of the 1H-15N HSQC-TOCSY spectrum (see supplementary data), most of them in the 4.2–4.6 ppm region. Significantly, a discrete number of non-glycine HN-Hα cross-peaks displayed a high field shift (< 4.2 ppm). The lack of a significant chemical shift dispersion in the HN and Hα chemical shifts, even in the presence of SDS micelles, is evidence for the lack of tertiary structure. Also, NMR spectra suggest that DLL4_IC is conformationally restrained in the presence of SDS micelles. Determination of secondary structure, if any, from NMR data is less straightforward. Deviations from random coil values in the chemical shifts of 1Hα, 13Cα, and 13C' have been widely used to map regions with well-defined secondary structure, but require residue-specific sequential assignments of the backbone resonances. Recently, it has been shown that backbone 1HN and 15NH chemical shifts are also somewhat sensitive to secondary structure.44 The small upfield shift of these and of selected Hα resonances in the presence of SDS might then be explained in terms of partial α-helical formation, consistent with CD results. It cannot be ruled out, however, that the negatively charged head group of SDS can also contribute to the upfield shift. It must be noted that all NMR spectra in the presence or absence of SDS were acquired at pH 5.6, a value at which the pH-dependent secondary structures in the C-terminal region of DLL4_IC are fully formed (see below).
The C-terminal region of DLL4_IC is plastic
In order to locate the region(s) of DLL4_IC that adopt the observed secondary structures based on the environment, we broke down the DLL4_IC sequence into four fragments (ΔN-DLL4_IC and peptides P1, P2, and P3), as indicated in Figure 5a, and studied their conformation alone or in the presence of TFE or SDS using far-UV CD spectroscopy. The DLL4_IC sequence was split at proline sites, because of its helix-breaking properties, and boundaries were selected based on the predictions by PONDR (Figure 5b). The procedure for generating the fragments is described in the Material and Methods section. Our results show that the central region covered by P1 and P2 is disordered and does not adopt any secondary structure in all tested conditions (supplementary material). However, the region encompassing the last 24 residues is highly plastic (P3, Figure 9a, b, and supplementary material) and, starting from a prevalent coil conformation in buffer at neutral pH, it can adopt (i) a mainly strand conformation in the presence of TFE or when the pH is slightly acidic (appearance of the negative band at 219 nm, typical of β-strand structures) or (ii) a α-helical conformation in the presence of SDS micelles when the pH is dropped from neutrality to 6. All these conformational changes are fully reversible (data not shown). Overall, these data indicate that the last 24 C-terminal residues account for most of the plasticity displayed by DLL4_IC and that the pH-dependent conformational switch in the full-length DLL4_IC, in the presence of SDS micelles, is due to the formation of a ~10 residue α-helix in the C-terminal region, as assessed from the changes in the mean residue ellipticity at 222 nm. The possible implications of these findings are discussed below.
Figure 9.
Circular dichroism. Far-UV CD spectra of P3 (17.9 µM) in (a) 5 mM Tris-HCl buffer, 1 mM TCEP, in the presence of 10 mM SDS at pH 7.5 or 6.0, and in 20% TFE or in (b) 5 mM Tris-HCl buffer (pH 7.5 or 6) or 20% TFE.
DISCUSSION
Single-pass transmembrane receptors play an important role in cell communication and signal transduction. In the simplest functional model, upon binding of a signaling molecule to the extracellular region of a receptor, a response is initiated on the inner side of the membrane. This response is in fact accomplished by the receptor's intracellular tail. An analysis of the modular domain architecture of the human single-pass transmembrane receptors dataset (369) by the SMART tool14 revealed that the majority of the ligand-binding extracellular regions are composed of known globular domains. On the other hand, SMART failed to identify any known domains in 63% of the intracellular regions, with the notable exception of receptors containing a kinase domain in their cytoplasmic region. This result can be due to either the presence of yet unidentified globular domains, or the prevalence of disorder in the intracellular tails of receptors. Our findings indicate that ID in the human single-pass transmembrane receptors with the type I topology is indeed predominant within the cytoplasmic regions, whereas the extracellular regions behave more like ordered proteins. These results are supported by the DisEMBL and IUPred predictions, by the plots of mean net charge versus hydropathy, and by the amino acid compositional analysis. When considering the mean net charge versus hydropathy plot (Figure 3), it should be kept in mind that kinase domains appear with a certain frequency in the intracellular region of single-pass receptors, which justifies the occurrence of several entries in the right-hand part of the plot, corresponding to ordered proteins. Incidentally, we remarked that there is a significant difference in the mean net charge of the extracellular and intracellular regions of the human transmembrane proteins analyzed in this work. While the majority (81%) of the extracellular regions have a negative mean net charge, the intracellular regions are nearly equally distributed between positively (47%) and negatively (53%) charged. In fact, most type I transmembrane proteins bear a short stretch of positively charged residues in their cytoplasmic tail, in the region close to the inner side of the membrane and protruding from it. As the inner leaflet of the membrane in eukaryotic cells is negatively charged, the presence of a positively charged segment is supposed to be a signal that drives the protein into the correct orientation.45
With respect to the amino acid composition, the interpretation of the results is more complex. While the intracellular regions are depleted of several order-promoting residue types, such as W, C, F, and V, and enriched in some of the disorder-promoting ones such as E and K, the trend observed in the DisProt versus Ordered comparison is not always respected. For example, methionine, which belongs to the order-promoting residues, displays a high frequency in intracellular regions. It is likely that methionine, as it is susceptible to oxidation, prefers the reducing environment of the cytosol, compared to the extracellular environment. The different redox potential in the cytoplasm and in the extracellular space may also be associated with the different distribution of other amino acid types, like cysteine, which is most frequently found in its oxidized half-cystine, structure stabilizing form in the extracellular space, and almost exclusively found in its reduced form, often coordinated to metal ions, in the intracellular space. Tyrosine, which is also an order-promoting amino acid type, is over-represented rather than depleted in intracellular regions. We think that a possible explanation is given by the fact that Y is a target for phosphorylation, known to be one of the main signaling mechanisms. The amino acids E, Q, P, and S are largely over-represented in disordered regions, but not quite so in the intracellular regions of receptors. This is due to the fact that these residues are often found in low complexity tracts (poly-glutamic acid, poly-glutamine, polyproline and domain linkers, respectively). The intracellular regions appear to be disordered, but not compositionally biased in this sense.
Several MIRR (Multichain immune recognition receptors) cytoplasmic domains belong to this uncharacterized subset and were experimentally proven to be intrinsically disordered.46 Previously, we showed that the cytoplasmic tails of the five different human Notch ligands display little sequence similarity between one another, display no homology with sequences of known fold, and are predicted to be disordered.47 We have also shown that a recombinant protein corresponding to the intracellular region of the Notch ligand Jagged-1 is actually disordered in solution.48 Here, for the first time, we prove, through a combination of predictions and experimental biophysical methods, that a fragment spanning the intracellular region of human Delta-4 is intrinsically disordered in solution, yet at the same time, in response to changes in the physico-chemical environment, can form inter-convertible secondary structures through its plastic C-terminal region (P3). Delta-4 was shown to interact through its evolutionarily conserved C-terminal motif (ATEV) (Figure 5c) with the PDZ domains of Dlg-1, the human homologue of Drosophila Discs Large protein.16 In the available crystal structures, peptides bearing a PDZ recognition motif bind in a groove between βB and αB on the surface of the PDZ domain and adopt a β-strand conformation, stitched as an additional strand into the antiparallel β-sheet on the surface of the PDZ domain.49 The mainly β structure seen in P3 in the presence of TFE or at slightly acidic pH, may be representative of the conformation that the C-terminal region of Delta-4 adopts when bound to Dlg-1. We are currently exploring the possibility that the C-terminal region of Delta-4 is pre-organized in a β-hairpin conformation, ready to dock the target PDZ domain. A stable β-hairpin at the C-terminus, or a significant population in the same conformation would drastically reduce the entropic cost of binding to the target PDZ domain. As the C-terminal region is very well conserved through species (Figure 5c), it is likely that this pre-organization is not limited to human Delta-4, but applies to all the other homologues. The co-existence of global disorder (i.e., lack of a well-defined globular structure) and local pre-organization (i.e., the propensity to form certain types of secondary structures locally, either in a stable or transient way) may indeed represent a general mechanism to increase binding specificity while maintaining structural flexibility. Such regions might be identified through a combination of secondary structure and disorder prediction tools and assayed experimentally.
The mainly helical conformation of P3 in the presence of SDS may, on the other hand, be representative of the membrane-bound, uncomplexed form of Delta-4. The possible biological implications of the pH-dependent conformational change are not known yet. Whereas different cell compartments can be associated with different pH values, little is known of the biophysical properties of the membrane-cytoplasm interface.50 In an early study, fluorescein was used to map the pH distribution in yeast cells, and it was proposed that the intracellular pH is not homogenous, but decreases to ~6.0 in proximity of the membrane.51 The pH gradient between the membrane interface and the cytosol would be generated by the negatively charged head groups of phospholipids present in the membrane of eukaryotic cells.
Alternatively, the observed secondary structures might be induced through the binding of Delta-4's cytoplasmic tail to other still unknown cytoplasmic or nuclear partners, before or after it is cleaved and released as a signaling fragment. In this context, the intracellular domain of Delta-1 was recently shown to act as a transcription-cofactor in the signal-sending cell, as it mediates TGF-β/Activin signaling through binding to Smad proteins in the nucleus.20
Overall, there is convergent evidence that the asymmetric disorder distribution observed in Delta-4 and its homologues reflects a more general phenomenon in transmembrane receptors of the same class, where the ordered extracellular domains appear to act as rigid scaffolds for ligand binding, while the flexible intracellular tails transduce the signal within the cell and activate the complex cellular response. Consequently, we speculate that the intracellular tails carry out their function by binding possibly multiple partners, either remaining unstructured, by exploiting pre-organized local secondary structures, or through disorder-to-order transitions. Further experimental data on the structural characterization of the interactions between the cytoplasmic tails of transmembrane receptors and their partners are needed in order to explore this intriguing hypothesis.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Doriano Lamba (CNR-ELETTRA, Trieste, Italy) for using of the CD spectropolarimeter. We are grateful to Fabio Calogiuri (CERM, Sesto Fiorentino, Italy) for technical assistance with the acquisition of NMR spectra. We acknowledge the support of the EU (European Network of Research Infrastructures for Providing Access and Technological Advancements in Bio-NMR) for access to the CERM NMR facility. We also thank Zsuzsanna Dosztányi (Institute of Enzymology, Budapest, Hungary) for the IUPred code, Mircea Pacurar (ICGEB) and Somdutta Dhir (ICGEB) for writing the scripts used in disorder analysis, Ruth W. Craig (Dartmouth Medical School) for critical reading of the manuscript, and Maristella Coglievina (ICGEB) for her useful suggestions. This work is part of A. De Biasio’s PhD thesis.
Abbreviations used
- ID
Intrinsic Disorder
- DLL4_IC
human Delta-like 4 cytoplasmic region
- DSS
2,2-Dimethyl-2-silapentane-5-sulfonate-d6 sodium salt
- DTT
DL-Dithiothreitol
- GuHCl
guanidine hydrochloride
- HSQC
heteronuclear single quantum correlation
- IMAC
immobilized metal ion affinity chromatography
- IPTG
Isopropyl β-D-1-thiogalactopyranoside
- LC-MS
liquid chromatography coupled to mass spectrometry
- PDZ
domain present in PSD-95, Dlg, and ZO-1/2
- RP-HPLC
reverse phase high performance liquid chromatography
- TCEP
Tris(2-carboxyethyl)phosphine hydrochloride
- TFA
Trifluoroacetic acid
- TFE
2,2,2-Trifluoroethanol
Footnotes
Sequences of the extra- and intracellular regions of human single-pass transmembrane receptors with the type I topology used in this study are available on request.
This information is available free of charge via the Internet at http://pubs.acs.org.
REFERENCES
- 1.Tompa P, Szasz C, Buday L. Structural disorder throws new light on moonlighting. Trends Biochem. Sci. 2005;30:484–489. doi: 10.1016/j.tibs.2005.07.008. [DOI] [PubMed] [Google Scholar]
- 2.Dyson HJ, Wright PE. Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell. Biol. 2005;6:197–208. doi: 10.1038/nrm1589. [DOI] [PubMed] [Google Scholar]
- 3.Ward JJ, Sodhi JS, McGuffin LJ, Buxton BF, Jones DT. Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J. Mol. Biol. 2004;337:635–645. doi: 10.1016/j.jmb.2004.02.002. [DOI] [PubMed] [Google Scholar]
- 4.Iakoucheva LM, Brown CJ, Lawson JD, Obradovic Z, Dunker AK. Intrinsic disorder in cell-signaling and cancer-associated proteins. J. Mol. Biol. 2002;323:573–584. doi: 10.1016/s0022-2836(02)00969-5. [DOI] [PubMed] [Google Scholar]
- 5.Mohan A, Oldfield CJ, Radivojac P, Vacic V, Cortese MS, Dunker AK, Uversky VN. Analysis of molecular recognition features (MoRFs) J. Mol. Biol. 2006;362:1043–1059. doi: 10.1016/j.jmb.2006.07.087. [DOI] [PubMed] [Google Scholar]
- 6.Minezaki Y, Homma K, Kinjo AR, Nishikawa K. Human transcription factors contain a high fraction of intrinsically disordered regions essential for transcriptional regulation. J. Mol. Biol. 2006;359:1137–1149. doi: 10.1016/j.jmb.2006.04.016. [DOI] [PubMed] [Google Scholar]
- 7.Liu J, Perumal NB, Oldfield CJ, Su EW, Uversky VN, Dunker AK. Intrinsic disorder in transcription factors. Biochemistry. 2006;45:6873–6888. doi: 10.1021/bi0602718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Minezaki Y, Homma K, Nishikawa K. Intrinsically disordered regions of human plasma membrane proteins preferentially occur in the cytoplasmic segment. J. Mol. Biol. 2007;368:902–913. doi: 10.1016/j.jmb.2007.02.033. [DOI] [PubMed] [Google Scholar]
- 9.Hurlbut GD, Kankel MW, Lake RJ, Artavanis-Tsakonas S. Crossing paths with Notch in the hyper-network. Curr. Opin. Cell Biol. 2007;19:166–175. doi: 10.1016/j.ceb.2007.02.012. [DOI] [PubMed] [Google Scholar]
- 10.Bray SJ. Notch signalling: a simple pathway becomes complex. Nat. Rev. Mol. Cell. Biol. 2006;7:678–689. doi: 10.1038/nrm2009. [DOI] [PubMed] [Google Scholar]
- 11.Artavanis-Tsakonas S, Rand MD, Lake RJ. Notch signaling: cell fate control and signal integration in development. Science. 1999;284:770–776. doi: 10.1126/science.284.5415.770. [DOI] [PubMed] [Google Scholar]
- 12.Noguera-Troise I, Daly C, Papadopoulos NJ, Coetzee S, Boland P, Gale NW, Lin HC, Yancopoulos GD, Thurston G. Blockade of Dll4 inhibits tumour growth by promoting non-productive angiogenesis. Nature. 2006;444:1032–1037. doi: 10.1038/nature05355. [DOI] [PubMed] [Google Scholar]
- 13.Ridgway J, Zhang G, Wu Y, Stawicki S, Liang WC, Chanthery Y, Kowalski J, Watts RJ, Callahan C, Kasman I, Singh M, Chien M, Tan C, Hongo JA, de Sauvage F, Plowman G, Yan M. Inhibition of Dll4 signalling inhibits tumour growth by deregulating angiogenesis. Nature. 2006;444:1083–1087. doi: 10.1038/nature05313. [DOI] [PubMed] [Google Scholar]
- 14.Letunic I, Copley RR, Pils B, Pinkert S, Schultz J, Bork P. SMART 5: domains in the context of genomes and networks. Nucleic Acids Res. 2006;34:D257–D260. doi: 10.1093/nar/gkj079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Le Borgne R, Bardin A, Schweisguth F. The roles of receptor and ligand endocytosis in regulating Notch signaling. Development. 2005;132:1751–1762. doi: 10.1242/dev.01789. [DOI] [PubMed] [Google Scholar]
- 16.Six EM, Ndiaye D, Sauer G, Laabi Y, Athman R, Cumano A, Brou C, Israel A, Logeat F. The notch ligand Delta1 recruits Dlg1 at cell-cell contacts and regulates cell migration. J. Biol. Chem. 2004;279:55818–55826. doi: 10.1074/jbc.M408022200. [DOI] [PubMed] [Google Scholar]
- 17.LaVoie MJ, Selkoe DJ. The Notch ligands, Jagged and Delta, are sequentially processed by alpha-secretase and presenilin/gamma-secretase and release signaling fragments. J. Biol. Chem. 2003;278:34427–34437. doi: 10.1074/jbc.M302659200. [DOI] [PubMed] [Google Scholar]
- 18.Six E, Ndiaye D, Laabi Y, Brou C, Gupta-Rossi N, Israel A, Logeat F. The Notch ligand Delta1 is sequentially cleaved by an ADAM protease and gamma-secretase. Proc. Natl. Acad. Sci. U. S. A. 2003;100:7638–7643. doi: 10.1073/pnas.1230693100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ikeuchi T, Sisodia SS. The Notch ligands, Delta1 and Jagged2, are substrates for presenilin-dependent "gamma-secretase" cleavage. J. Biol. Chem. 2003;278:7751–7754. doi: 10.1074/jbc.C200711200. [DOI] [PubMed] [Google Scholar]
- 20.Hiratochi M, Nagase H, Kuramochi Y, Koh CS, Ohkawara T, Nakayama K. The Delta intracellular domain mediates TGF-beta/Activin signaling through binding to Smads and has an important bi-directional function in the Notch-Delta signaling pathway. Nucleic Acids Res. 2007;35:912–922. doi: 10.1093/nar/gkl1128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ascano JM, Beverly LJ, Capobianco AJ. The C-terminal PDZ-ligand of JAGGED1 is essential for cellular transformation. J. Biol. Chem. 2003;278:8771–8779. doi: 10.1074/jbc.M211427200. [DOI] [PubMed] [Google Scholar]
- 22.Sickmeier M, Hamilton JA, LeGall T, Vacic V, Cortese MS, Tantos A, Szabo B, Tompa P, Chen J, Uversky VN, Obradovic Z, Dunker AK. DisProt: the Database of Disordered Proteins. Nucleic Acids Res. 2007;35:D786–D793. doi: 10.1093/nar/gkl893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Andreeva A, Howorth D, Chandonia JM, Brenner SE, Hubbard TJ, Chothia C, Murzin AG. Data growth and its impact on the SCOP database: new developments. Nucleic Acids Res. 2007 doi: 10.1093/nar/gkm993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Linding R, Jensen LJ, Diella F, Bork P, Gibson TJ, Russell RB. Protein disorder prediction: implications for structural proteomics. Structure. 2003;11:1453–1459. doi: 10.1016/j.str.2003.10.002. [DOI] [PubMed] [Google Scholar]
- 25.Dosztanyi Z, Csizmok V, Tompa P, Simon I. The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. J. Mol. Biol. 2005;347:827–839. doi: 10.1016/j.jmb.2005.01.071. [DOI] [PubMed] [Google Scholar]
- 26.Uversky VN, Gillespie JR, Fink AL. Why are "natively unfolded" proteins unstructured under physiologic conditions? Proteins. 2000;41:415–427. doi: 10.1002/1097-0134(20001115)41:3<415::aid-prot130>3.0.co;2-7. [DOI] [PubMed] [Google Scholar]
- 27.Romero P, Obradovic Z, Li X, Garner EC, Brown CJ, Dunker AK. Sequence complexity of disordered protein. Proteins. 2001;42:38–48. doi: 10.1002/1097-0134(20010101)42:1<38::aid-prot50>3.0.co;2-3. [DOI] [PubMed] [Google Scholar]
- 28.Vacic V, Uversky VN, Dunker AK, Lonardi S. Composition Profiler: a tool for discovery and visualization of amino acid composition differences. BMC Bioinformatics. 2007;8:211. doi: 10.1186/1471-2105-8-211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Vihinen M, Torkkila E, Riikonen P. Accuracy of protein flexibility predictions. Proteins. 1994;19:141–149. doi: 10.1002/prot.340190207. [DOI] [PubMed] [Google Scholar]
- 31.Obradovic Z, Peng K, Vucetic S, Radivojac P, Brown CJ, Dunker AK. Predicting intrinsic disorder from amino acid sequence. Proteins. 2003;53 Suppl 6:566–572. doi: 10.1002/prot.10532. [DOI] [PubMed] [Google Scholar]
- 32.Jones DT. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 1999;292:195–202. doi: 10.1006/jmbi.1999.3091. [DOI] [PubMed] [Google Scholar]
- 33.Cuff JA, Barton GJ. Evaluation and improvement of multiple sequence methods for protein secondary structure prediction. Proteins. 1999;34:508–519. doi: 10.1002/(sici)1097-0134(19990301)34:4<508::aid-prot10>3.0.co;2-4. [DOI] [PubMed] [Google Scholar]
- 34.Pollastri G, Przybylski D, Rost B, Baldi P. Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles. Proteins. 2002;47:228–235. doi: 10.1002/prot.10082. [DOI] [PubMed] [Google Scholar]
- 35.Hoover DM, Lubkowski J. DNAWorks: an automated method for designing oligonucleotides for PCR-based gene synthesis. Nucleic Acids Res. 2002;30:e43. doi: 10.1093/nar/30.10.e43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Medigue C, Rouxel T, Vigier P, Henaut A, Danchin A. Evidence for horizontal gene transfer in Escherichia coli speciation. J. Mol. Biol. 1991;222:851–856. doi: 10.1016/0022-2836(91)90575-q. [DOI] [PubMed] [Google Scholar]
- 37.Uversky VN. Use of fast protein size-exclusion liquid chromatography to study the unfolding of proteins which denature through the molten globule. Biochemistry. 1993;32:13288–13298. doi: 10.1021/bi00211a042. [DOI] [PubMed] [Google Scholar]
- 38.Lobley A, Whitmore L, Wallace BA. DICHROWEB: an interactive website for the analysis of protein secondary structure from circular dichroism spectra. Bioinformatics. 2002;18:211–212. doi: 10.1093/bioinformatics/18.1.211. [DOI] [PubMed] [Google Scholar]
- 39.Chen YH, Yang JT, Chau KH. Determination of the helix and beta form of proteins in aqueous solution by circular dichroism. Biochemistry. 1974;13:3350–3359. doi: 10.1021/bi00713a027. [DOI] [PubMed] [Google Scholar]
- 40.Uversky VN. What does it mean to be natively unfolded? Eur. J. Biochem. 2002;269:2–12. doi: 10.1046/j.0014-2956.2001.02649.x. [DOI] [PubMed] [Google Scholar]
- 41.Dunker AK, Lawson JD, Brown CJ, Williams RM, Romero P, Oh JS, Oldfield CJ, Campen AM, Ratliff CM, Hipps KW, Ausio J, Nissen MS, Reeves R, Kang C, Kissinger CR, Bailey RW, Griswold MD, Chiu W, Garner EC, Obradovic Z. Intrinsically disordered protein. J. Mol. Graph. Model. 2001;19:26–59. doi: 10.1016/s1093-3263(00)00138-8. [DOI] [PubMed] [Google Scholar]
- 42.Wishart DS, Sykes BD, Richards FM. Relationship between nuclear magnetic resonance chemical shift and protein secondary structure. J. Mol. Biol. 1991;222:311–333. doi: 10.1016/0022-2836(91)90214-q. [DOI] [PubMed] [Google Scholar]
- 43.Roccatano D, Colombo G, Fioroni M, Mark AE. Mechanism by which 2,2,2-trifluoroethanol/water mixtures stabilize secondary-structure formation in peptides: a molecular dynamics study. Proc. Natl. Acad. Sci. U. S. A. 2002;99:12179–12184. doi: 10.1073/pnas.182199699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wang Y, Jardetzky O. Probability-based protein secondary structure identification using combined NMR chemical-shift data. Protein Sci. 2002;11:852–861. doi: 10.1110/ps.3180102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.von Heijne G. Control of topology and mode of assembly of a polytopic membrane protein by positively charged residues. Nature. 1989;341:456–458. doi: 10.1038/341456a0. [DOI] [PubMed] [Google Scholar]
- 46.Sigalov AB, Aivazian DA, Uversky VN, Stern LJ. Lipid-binding activity of intrinsically unstructured cytoplasmic domains of multichain immune recognition receptor signaling subunits. Biochemistry. 2006;45:15731–15739. doi: 10.1021/bi061108f. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Pintar A, De Biasio A, Popovic M, Ivanova N, Pongor S. The intracellular region of Notch ligands: does the tail make the difference? Biol. Direct. 2007;2:19. doi: 10.1186/1745-6150-2-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Popovic M, De Biasio A, Pintar A, Pongor S. The intracellular region of the Notch ligand Jagged-1 gains partial structure upon binding to synthetic membranes. Febs J. 2007;274:5325–5336. doi: 10.1111/j.1742-4658.2007.06053.x. [DOI] [PubMed] [Google Scholar]
- 49.Jelen F, Oleksy A, Smietana K, Otlewski J. PDZ domains - common players in the cell signaling. Acta Biochim. Pol. 2003;50:985–1017. [PubMed] [Google Scholar]
- 50.Olivotto M, Arcangeli A, Carla M, Wanke E. Electric fields at the plasma membrane level: a neglected element in the mechanisms of cell signalling. Bioessays. 1996;18:495–504. doi: 10.1002/bies.950180612. [DOI] [PubMed] [Google Scholar]
- 51.Slavik J. Intracellular pH topography: determination by a fluorescent probe. FEBS Lett. 1983;156:227–230. doi: 10.1016/0014-5793(83)80501-8. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.









