Abstract
Basement membranes are thin extracellular protein layers, which separate endothelial and epithelial cells from the underlying connecting tissue. The main non-collagenous components of basement membranes are laminins, trimeric glycoproteins, which form polymeric networks by interactions of their N-terminal (LN) domains; however, no high-resolution structure of laminin LN domains exists so far. To construct models for laminin β1 and γ1 LN domains 14 potentially suited template structures were determined using fold recognition methods. For each target/template-combination comparative models were created with Rosetta. Final models were selected based on their agreement with experimentally obtained distance constraints from natural cross-links, i.e., disulfide bonds as well as chemical cross-links obtained from reactions with two amine-reactive cross-linkers. We predict that laminin β1 and γ1 LN domains share the galactose-binding domain-like fold.
Keywords: laminin, N-terminal domains, disulfide bonds, chemical cross-linking, mass spectrometry, computational modeling
INTRODUCTION
Laminins are the major non-collagenous proteins of basement membranes and are known to form networks due to crucial non-covalent self-interactions (1). Each member of the laminin protein family consists of three polypeptide chains, with one copy of the α, β and γ polypeptide chains being present. At the N-termini of all three chains globular (LN) domains are followed by a series of laminin-type EGF-like (LE) modules (for nomenclature see (2)) that embed one or two additional domains with a total length of 40–60 nm each. Towards the C-termini the three polypeptide chains intertwine into a long coiled-coil region of 77 nm that ends in five LG domains connected to the α chain. This arrangement gives laminins their typical cross-shaped structure carrying with three short and one long arm. The laminin N-terminal (LN) globular domains require the subsequent four LE-domains for efficient expression and proper folding (3). This region of the molecule is involved in Ca2+ dependent laminin self-assembly and binds to the corresponding N-terminal regions of other laminin chains (4). The LN domains of all three chains are required for efficient polymerization as deletion mutants with two or fewer LN domains fail to form networks (5).
The N-terminal portions of laminin chains share sequence homology and domain structure with the netrins, a family of extracellular proteins that were originally identified as neural guidance molecules. Netrin-4 was recently shown to interact with the N-terminal portions of laminin γ1 and γ3 chains in a manner that may regulate basement membrane assembly (6).
No high-resolution three-dimensional structures for laminin LN-domains have been obtained so far. An alternative approach that provides structural insight into proteins is based on chemical cross-linking and subsequent mass spectrometric analysis of the created products. Structural information can be obtained by the insertion of a chemical cross-linker between two functional groups within a protein. The cross-linker has a defined length and is connected via covalent bonds to functional groups of amino acid side chains. The cross-linked amino acids can be identified after enzymatic digestion. This chemical cross-linking approach is also applied to study protein-protein interfaces. The sequence separation of cross-linked amino acids, combined with the cross-linker length, impose a distance constraint on the structure of protein fold or protein complex (7) (8) (9) (10). Analysis of cross-linked peptides by mass spectrometry (MS) uses several advantages associated with MS analysis: (I) The mass of the protein or the protein complex under investigation is theoretically unlimited as the proteolytic peptides of the cross-linked proteins after an enzymatic digest are analyzed (in case a standard “bottom-up” strategy for mass spectrometric protein analysis is employed), (II) the analysis is rapid, (III) it requires very small (10™14 – 10™15 mol) amounts of protein, and (IV) as the cross-linking reaction can be executed in a native-like environment protein structure and flexibility are accurately reflected. It is possible to study membrane proteins, post-translational modifications, or splice variants. The broad range of cross-linking reagents with different specificities (primary amines, sulfhydryls, or carboxylic acids) and the wide range of distances (0 Å – >20 Å) allow setup of fine-tuned experimental strategies (7) (8) (9) (10) (11).
However, despite the straightforwardness of the cross-linking approach, the identification of the cross-linked products can be cumbersome due to the complexity of the reaction mixtures. Several strategies have been employed to enrich cross-linker-containing species by affinity chromatography or to facilitate the identification of the cross-linked products; e.g., by using isotope-labeled cross-linkers or proteins, fluorogenic cross-linkers, or cleavable cross-linkers (10).
We recently reported the identification of the disulfide bond pattern of laminin β1 LN-domain using offline nano-high performance liquid chromatography (nano-HPLC)/matrix-assisted laser desorption/ionization time-of-flight/time-of-flight (MALDI-TOF/TOF) mass spectrometry (12). The aim of the present study was to obtain structural models of laminin β1 and γ1 LN-domains based on sparse distance constraints imposed by natural cross-links, i.e., disulfide bonds, as well as by chemical cross-links obtained from reactions with two amine-reactive cross-linkers. For chemical cross-linking of laminin LN domains, we used isotope-labeled, i.e. deuterated, cross-linkers to facilitate the identification of cross-linker-containing species in the mass spectra based on their characteristic isotope patterns (13) (14) (15).
Sparse experimental distance restraints can restrict the conformational space for a protein substantially and therefore enable determination of tertiary structure via computational modeling. For example, the Rosetta de-novo protein structure prediction algorithm (16) (17) allows the prediction of protein structures with medium to high resolution (1.5–3.0 Å) with less than one restraint per amino acid (distance and orientation restraints from NMR spectroscopy (18) (19)). Even with only approximately 0.1 constraints per amino acid the correct fold was determined de novo for T4-lysozyme and αA-crystallin using EPR distance restraints (20). The present project is challenging in that even fewer restraints are available for modeling. On the other hand, the protein models are not folded de novo, but template structures define the fold of the protein. Nevertheless, we underline that the structural models presented here should be treated as working hypotheses that need to be further validated and refined. We are confident that the availability of these models as supplementary information will facilitate this process.
MATERIALS AND METHODS
Materials
The cross-linking reagents BS3-D0/D4 bissulfosuccinimidyl)suberate–D0/D4), and BS2G-D0/D4 bissulfosuccinimidyl)glutarate–D0/D4) were obtained from Pierce Inc. (Rockford, IL, USA). The proteases trypsin, chymotrypsin, LysC, endoproteinase AspN, and GluC (all sequencing grade) were obtained from Roche Diagnostics (Mannheim, Germany). MALDI matrices were obtained from Bruker Daltonik (Bremen, Germany), all other chemicals were purchased from Sigma (Taufkirchen, Germany). Nano-HPLC solvents were spectroscopic grade (Uvasol, VWR, Darmstadt, Germany). Water was purified with a Direct-Q5 water purification system (Millipore, Eschborn, Germany).
Expression and Purification of Laminin β1 and γ1 N-terminal Constructs
Laminin β1 und γ1 constructs (mouse) comprising one LN domain plus LE1-4 domains were expressed in 293-EBNA human embryonic kidney cells (3). Amino acid sequences were confirmed by peptide mass fingerprint analysis using trypsin, endoproteinase AspN, chymotrypsin, endoproteinase GluC or a mixture of trypsin and AspN (enzyme:substrate ratio 1:50) as digestion enzymes.
Cross-Linking Reactions
For chemical cross-linking of laminin β1 and γ1 N-terminal constructs, the homobifunctional amine-reactive cross-linkers BS2G and BS3 were employed as 1:1 mixtures of non-deuterated and four-times deuterated derivatives (D0/D4). Cross-linking reactions were conducted with 2 μM protein solutions in 20 mM HEPES buffer, 100 mM NaCl, 5 mM CaCl2, pH 7.4. Freshly prepared stock solutions of the cross-linkers (10 mg/ml in DMSO) were added in 100- and 200-fold molar excess (final concentrations 200 μM and 400 μM) to the protein solution. The reactions were conducted at room temperature under gentle shaking of the reaction mixtures and were quenched after 45 and 90 min, respectively, by adding NH4HCO3 to a final concentration of 20 mM.
In-Solution Digestion
For in-solution digestion, the cross-linking reaction mixtures were denatured, reduced, alkylated, and digested with a mixture of AspN and trypsin (enzyme: substrate ratio 1:50) according to an existing protocol (21).
Gel Electrophoresis and In-Gel Digestion
A part of the cross-linking reaction mixtures was desalted with Microcon YM-10 filters (Millipore, Eschborn, Germany) and separated by one-dimensional SDS-PAGE (5% stacking gel / 5%, 8% or 12% resolving gel) according to Laemmli (22). The bands of monomeric laminin β1 und γ1 were excised, reduced, alkylated, and digested at 37°C for 16 hrs with a mixture of AspN and trypsin (enzyme: substrate ratio 1:30) as described previously (21). Peptides were extracted by adding three times 50 μl of 5% TFA (for MALDI-MS analysis) or 5% (V/V) FA (for ESI-MS analysis); samples were concentrated in a vacuum concentrator to a volume of 5–10 μl.
Nano-HPLC / MALDI TOF/TOF-Mass Spectrometry
Proteolytic peptide mixtures were analyzed by offline coupling of a nano-HPLC system (Ultimate 3000, Dionex, Idstein, Germany) to a MALDI-TOF/TOF mass spectrometer (Ultraflex III, Bruker Daltonik, Bremen, Germany). Samples were injected via an autosampler with a 200-μl sample loop onto a precolumn (PepMap, C18, 300 μm × 5 mm, 3 μm, 100 Å, Dionex) and desalted by washing the precolumn for 15 min with 0.1% TFA before the peptides were eluted onto the separation column (PepMap, C18, 75 μm × 150 mm, 3μm, 100 Å, Dionex), which had been equilibrated with 95% solvent A (A: 5% ACN, 0.05% TFA). Peptides were separated with a 30 min-gradient (0–30 min: 5–50% B, 30–31 min: 50–95% B, 31–35 min: 95% B (solvent B: 80% ACN, 0.04% TFA) at a flow rate of 300 nl/min with UV detection at 214 nm and 280 nm. Eluates were fractionated into 15-sec fractions with the fraction collector Proteineer fc (Bruker Daltonik), mixed with 1.1 μl of matrix solution (0.7 μg/μl HCCA in 90 % ACN/0.1 % TFA, 1 mM NH4H2PO4) and directly prepared onto a 384 MTP 800 μm AnchorChip target (Bruker Daltonik). Hystar software 3.2 controlled data collection with the nano-HPLC system, UV data acquisition, and fraction collector sampling.
MALDI-TOF-MS analyses were conducted in the positive ionization and reflectron mode by adding 2,000 laser shots in the range m/z 800–4,000 to one mass spectrum. Mass spectra were processed (Savitzky-Golay smoothing and baseline correction) and externally calibrated using Peptide Calibration Standard II (Bruker Daltonik). Monoisotopic mass signals with a signal-to-noise ratio (S/N) > 2 were automatically labeled using the SNAP (Sophisticated Numerical Annotation Procedure) algorithm. Afterwards, peak lists of labeled monoisotopic signals were created and signals with S/N >10 were selected for laser-induced fragmentation. In the MS/MS mode, up to 2,000 laser shots were accumulated for measurement of the intact precursor ion (termination at S/N of 30); the optimum laser energy was determined for each precursor ion by fuzzy logic. Additional 2,000 laser shots were accumulated at 50% higher laser energy for acquisition of fragment ion mass spectra. Spectra were processed, calibrated based on the exact mass of the precursor ion, annotated (SNAP algorithm, S/N>2) and combined with MS data to one data file. Data acquisition was done automatically by the WarpLC 1.1 software (Bruker Daltonik) coordinating MS data acquisition (FlexControl 1.3) and data processing (FlexAnalysis 3.0) softwares.
Nano-HPLC / Nano-ESI-FTICR Mass Spectrometry
Proteolytic peptide mixtures were additionally analyzed by nano-HPLC/nano-ESI mass spectrometry using the two FTICR (Fourier transform ion cyclotron resonance) mass spectrometers Apex II (Bruker Daltonics, Billerica, MA) and LTQ-FT (ThermoFisher Scientific, Bremen, Germany). The Apex II mass spectrometer was online coupled to the nano-HPLC system (Ultimate II with Switchos II and autosampler). Samples were injected via the autosampler with a 20-μl sample loop, and were desalted and concentrated by washing the precolumn (PepMap, C18, 300 μm × 5 mm, 5 μm, 100 Å, Dionex) with 0.1% FA at a flow rate of 2 μl/min for 10 min, before peptides were eluted onto the separation column (PepMap, C18, 75 μm × 150 mm, 3 μm, 100 Å, Dionex). The peptides were separated using a 30-min gradient (0–30min: 5–50% B, 30–31 min: 50–95% B, 31–35 min: 95% B, with solvent A: 5% ACN, 0.1% FA und B: 80% ACN, 0.1% FA) at a flow rate of 200 nl/min monitoring the elution of peptides by their UV absorption at 214 nm und 280 nm. Data acquisition was controlled by the Hystar (version 2.3).
The Apex II FTICR mass spectrometer was equipped with a 7 T supra-conducting magnet and a nano-ESI source (Agilent Technologies, Waldbronn, Germany). For nano-ESI-MS measurements, fused-silica-nano-ESI needles (PicoTips, ID 8 μm, New Objective, Woburn, MA, USA) were employed. The instrument was tuned using the doubly charged signal of the LHRH peptide at m/z 592.2358. Calibration was performed using the LHRH peptide fragments, which were obtained by capillary skimmer dissociation. Data were acquired in broadband mode (m/z range 400–2,000) with 256k data points per spectrum. 10 scans were accumulated to a single spectrum (ca. 12 sec per spectrum), the XMASS software (versions 7.0.2, 7.0.3, 7.0.8, Bruker Daltonik) was used for data acquisition and data.
Nano-HPLC/nano-ESI-MS measurements with the LTQ-FT hybrid mass spectrometer were conducted in online coupling with the Ultimate 3000 Nano-HPLC-System (Dionex). The LTQ-FT combines a linear ion trap (LTQ) and an ion cyclotron resonance (ICR) analyzer and is equipped with a 7 T magnet. Samples were injected via the autosampler with a 100-μl sample loop, desalted and concentrated on a precolumn (PepMap, C18, 300 μm × 5 mm, 3 μm, 100 Å). Separation of peptides was performed on a C18 column (PepMap, C18, 75 μm × 150 mm, 3 μm, 100 Å) using a 90-min gradient (0 to 60 % solvent B in 90 min, followed by isocratic elution at 90% B for 3 min; solvent A: 5% ACN, 0.1% FA, B: 80% ACN, 0.08% FA) at 300 nl/min. The Chromeleon software (version 2.3, Dionex) was used to control the HPLC system, UV data (214 nm und 280 nm) and MS data acquisition. Data acquisition was performed over 100 min. One duty cycle comprised one high-resolution full scan spectrum (m/z 300–2000, resolution 100,000 at m/z 400) in the ICR cell and 10 fragment ion mass spectra of the most intense signals in the linear ion trap. Dynamic exclusion (exclusion time 20 sec, exclusion window ±5 ppm) was used to enhance acquisition of signals with low intensity.
Identification of Cross-Linked Products
Analysis of MS data and identification of cross-linked products was performed with the programs IsoFind, GPMAW, Biotools und MS2Assign. The in-house software tool Isofind determines signals with a defined mass difference, i.e., 4.025 u for the identification of reaction products with D0/D4 labeled cross-linkers, yielding peak lists with the respective mass differences and peak intensity ratios. Only signals exhibiting the characteristic D0/D4 isotope patterns with similar peak intensities (0.25:1 to 1:4) and similar LC retention times (±1 min) were considered as potential cross-linked products. The GPMAW (General Protein Mass Analysis for Windows, version 8.0, Lighthouse Data, Odense, Denmark) (23) software was used for assignment of cross-linked products. Maximum mass deviations between theoretical and experimental masses of 5 ppm (LTQ-FT), 10 ppm (Apex II), and 50 ppm (Ultraflex III) were allowed. MS/MS data were manually compared to the predicted fragment masses for a cross-linked product. Additionally, the freely accessible program MS2Assign (http://roswell.ca.sandia.gov/~mm-young/ms2assign.html; part of the Collaboratory for MS3D, http://ms3d.org/home.php) was used for calculation of MS/MS data and comparison to experimental data. The software package Biotools 3.1 (Bruker Daltonik) was used to identify peptides that are modified by a hydrolyzed cross-linker as well as intrapeptidal cross-linked products based on exact mass and MS/MS data. Lysines, serines, tyrosines, and threonines were considered as potential cross-linking sites (21) (24). Additionally, oxidation (Met), carboxamidomethylation (Cys), amidation (at cross-linker and at C-terminus), deamidation (Gln, Asn), cyclization (Cys) were taken into account as potential modifications as well as incomplete cleavage (up to 8 missed cleavage sites) (21).
Fold Recognition
The general workflow of computational modeling is summarized in Figure 1. Laminin β1 and γ1 sequences were split into separate domains as defined by the ExPASy Proteomic Server (www.expasy.ch) and modeled independently. The amino acid sequences of murine LN domains of laminin β1 (ExPASy entry P02469, 31-270) and laminin γ1 (ExPASy entry P02469, 44-283) were analyzed by the 3D-Jury metaserver (www.bioinfo.pl, (25), Figure 1A). Potential templates (PDB entries: 1CZT, 1D7P, 1KEX, 1XPW) were identified with a 3D-Jury score of 45–55, which roughly correlates with the number of amino acids that can be superimposed between a template and a target structure with an RMSD <3.5 Å. A score larger 50 has a 90% chance of being a correctly identified fold. Sequence identities between laminin β1 and laminin γ1 and the templates were calculated with the software SIM (http://www.expasy.ch/tools/sim-prot.html, (26)) and were all smaller than 18%.
Figure 1.
General workflow of the computational modeling procedure, which includes as main steps (A) detection and identification of templates, (B) sequence-sequence-alignment between target and template sequences, (C) modeling of the LN domains of laminin β1 and γ1, (D) experimentally guided scoring and selection of the best models, and (E) validation.
Rectangles: intermediate results; trapeze: verifications and controls; ovals: calculation and predictions steps; parallelograms: input data.
The quality of final comparative models is highly dependent on the template that is used for modeling. Given the low sequence similarity of the initial templates two alternative approaches were employed to more comprehensively search the PDB for additional potential template structures. In the first approach, homologous sequences of laminin β1 and γ1 LN domains were searched with PSI-BLAST (27) (www.ncbi.nlm.nih.gov/blast/Blast.cgi; E-value threshold 0.005; ten iterative rounds). In this experiment, the sequences of netrin I, II, IV as well as these of LN domains of laminin chains α1, α2, α3B, α5, β2, β3, γ3 were identified to be related to the β1 and γ1 LN domains. These homologues as well as the sequences of the original 3D Jury fits 1CZT, 1D7P, 1KEX, 1XPW were submitted to the 3D-Jury metaserver for a second iteration. Two additional template candidates (1SDD and 1GOF) were identified by that procedure. To test orthogonal fold recognition approaches the threading servers Phyre [www.sbg.bio.ic.ac.uk/phyre/ (28)], Libellula [www.pdg.cnb.uam.es/servers/libellula, (29)], Wurst [www.zbh.uni-hamburg.de/wurst/, (30)], HHPred [http://toolkit.tue-bingen.mpg.de/hhpred, (31)], and Loopp [http://loopp.org/, (32)] were applied. 35 putative templates were identified in addition to the previously detected structures yielding a total of 40 candidate templates (Figure 1A).
In an unbiased approach the structures of all 40 potential templates were compared pairwise with the program Mammoth (http://ub.cbm.uam.es/mammoth/ (33)) and clustered based on their structural homologies. 14 structures including 1CZT, 1D7P, 1GOF, 1KEX, 1SDD and 1XPW were grouped in one cluster of which all members share the galactose-binding domain-like fold (SCOP: 49784, CATH: 2.60.120.260, compare Table 4). These 14 structures were considered as templates in the subsequent comparative modeling efforts (Figure 1B). Other structures have been dismissed as potential templates as in structure clustering (Figure 2) only one cluster was observed containing the 14 members with a galactose-binding domain-like fold, whereas further clusters contained only two or less members. These 26 additional structures included immunoglobulin-like β-sandwiches and carbonic anhydrase among others.
Table 4.
Results of the template search; 14 structures were found using the programs 3D Jury, Superfamily –“Superfam“, Phyre, Libellula “Lib“, HHPred; the programs Wurst and Loop yielded only false positive template structures. The six templates found with 3D-Jury are printed in bold.
No | Protein Name | PDB entry |
Method | Template Score for Laminin β1; γ1 |
SCOP Fold name |
SCOP Fold classification code |
CATH Fold name |
CATH Fold classification code |
Protein Function Gene Ontology Annotation |
Reference |
---|---|---|---|---|---|---|---|---|---|---|
1 | factor V | 1CZT |
3DJury HHPred* |
52; 58 95; 87 |
Galactose-binding domain-like |
49784 |
Galactose-binding domain-like |
2.60.120.260 | Blood Clotting | (47) |
2 | factor VIII | 1D7P |
3D Jury HHPred |
42; 57 88; 82 |
Galactose-binding domain-like |
49784 |
Galactose-binding domain-like |
2.60.120.260 | Blood Clotting | (48) |
3 | Sialidase | 1EUT | Libellula, Phyre* |
25%; 80% | Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | Hydrolase | (49) |
4 | Galactose oxidase | 1GOF | 3DJury | 49; 50 |
Galactose-binding domain-like |
49784 |
Galactose-binding domain-like |
2.60.120.260 | Oxidoreductase(Oxygen(A)) | (50) |
5 | APC10/DOC1 subunit of the anaphase- promoting complex |
1GQP | Phyre HHPred* |
-; 30% -; 74 |
Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | Cell Cycle | (51) |
6 | Carbohydrate binding module from laminarinase 16A |
1GUI | Libellula | Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | Carbohydrate Binding Module |
(52) | |
7 | APC10/DOC1 subunit of the anaphase- promoting complex |
1JHJ | Phyre HHPred* |
-; 25% -; 48 |
Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | Cell Cycle | (53) |
8 | Fucose binding lectin |
1K12 | HHPred* | 94; 95 | Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | Sugar Binding Protein | (54) |
9 | Galactose oxidase | 1K3I | Phyre HHPred |
30%; 80% 96; 95 |
Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | Oxidoreductase | (55) |
10 |
B1 domain of neuropilin-1 |
1KEX |
3D-Jury HHPred |
37; 53 96; 93 |
Galactose-binding domain-like |
49784 |
Galactose-binding domain-like |
2.60.120.260 | Protein Binding | (56) |
11 |
Coagulation factor V |
1SDD |
3DJury HHPred Phyre |
49; 39 67; 41 94%; 94% |
Galactose-binding domain-like |
49784 |
Galactose-binding domain-like |
2.60.120.260 | Blood Clotting | (57) |
12 | Placental protein 25, pp25 |
1TVG | HHPred Phyre |
54; 53 98%; 97% |
Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | Cell Cycle | (58) |
13 | xrcc1 | 1XNA | Libellula | Galactose-binding domain-like |
49784 | Galactose-binding domain-like |
2.60.120.260 | DNA Binding Protein | (59) | |
14 |
Placental protein 25, pp25 |
1XPW | 3DJury | 47; 50 |
Galactose-binding domain-like |
49784 |
Galactose-binding domain-like |
2.60.120.260 |
Structural Genomics, Unknown Function |
(58) |
15 | chitobiase (N- acetyl-beta- glucoseaminidase) |
1QBA | Phyre | 45%; - | Immunoglobulin- like beta-sandwich |
[48725] | Chitobiase, domain 2 |
3.30.379.10 | Glycosyl Hydrolase | (60) |
16 | Hypothetical protein TM1631 |
1VPQ | Wurst | 9.4; - | TIM beta/alpha- barrel |
[51350] | Hypothetical protein tm1631. |
3.20.20.410 | Structural Genomics, Unknown Function |
(61) |
17 | Class I MHC | 1HSB | Wurst | 9.3; - | Immunoglobulin- like beta-sandwich |
[48725] | Immunoglobulins | 2.60.40.10 | Histocompatibility Antigen | (62) |
18 | Cu,Zn superoxide dismutase |
1TO4 | Wurst | 9.1; - | Immunoglobulin- like beta-sandwich |
[48725] | Immunoglobulin- like |
2.60.40.200 | Oxidoreductase | (63) |
19 | Putative regulator protein YcfX |
1AP1 | Wurst | 9.1; - | Ribonuclease H- like motif |
[53066] | Not yet assigned |
Transferase | (64) | |
20 | Alginate-binding periplasmic protein AlgQ1 |
1Y3N | Wurst | 9.1; - | Periplasmic binding protein-like II |
[53849] | Periplasmic binding protein- like II |
3.40.190.10 | Sugar Binding Protein | (65) |
21 | Myo-inositol 1- phosphate synthase |
1VKO | Wurst | -; 11.0 | AD(P)-binding Rossmann-fold domains |
[51734] | Not yet assigned |
Isomerase | (66) | |
22 | c-KIT receptor | 1T46 | Wurst | -; 10.0 | Protein kinase-like (PK-like) |
[56111] | Phosphorylase Kinase; domain 1 |
3.30.200.20 | Transferase Activator | (67) |
23 | Interleukin-8 | 1ILQ | Wurst | -; 10.0 | IL8-like | [54116] | OB fold (Dihydrolipoamide Acetyltransferase, E2P) |
2.40.50.40 | Cytokine | (68) |
24 | FYVE, RhoGEF and PH domain containing protein 3, FGD3 |
2COC | Wurst | -; 10.0 | PH domain-like barrel |
[50728] | Not yet assigned |
Signaling Protein | (69) | |
25 | Class delta GST | 1V2A | Wurst | -; 10.0 | GST C-terminal domain-like |
[47615] | Glutaredoxin | 3.40.30.10 | Transferase | (70) |
26 | PI-Pfui intein | 1DQ3 | Superfam | Hedgehog/intein (Hint) domain |
[51293] | Endonuclease - Pi- scei; Chain A, domain 1 |
2.170.16.10 | Hydrolase | (71) | |
27 | Galectin-1 | 1W6O | Superfam | Concanavalin A- like lectins/glucanases |
[49898] | Jelly Rolls | 2.60.120.200 | Lectin | (72) | |
28 | Isoamylase | 1BF2 | Libellula | 2.6; - | Immunoglobulin- like beta-sandwich |
[48725] | Immunoglobulins | 2.60.40.10 | Hydrolase | (73) |
29 | N-cadherin (neural) |
1OP4 | Libellula | 9.0; - | Immunoglobulin- like beta-sandwich |
[48725] | Not yet assigned | Cell Adhesion | (74) | |
30 | Plasminogen | 1KI0 | Libellula | 4.3; - | Kringle-like | [57439] | Plasminogen Kringle 4 |
2.40.20.10 | Hydrolase | (75) |
31 | CD69 | 1E8I | Libellula | -; 8.6 | C-type lectin-like | [56435] | Mannose-Binding Protein A, subunit A |
2.60.120.40 | Hematopoietic Cell Receptor | (76) |
32 | Tumor necrosis factor (TNF) |
2TNF | Libellula | -; 10.5 | TNF-like | [49841] | Jelly Rolls | 2.30.27.10 | Cytokine | (77) |
33 | minor coat protein g3p |
1G3P | Libellula | -; 11.7 | N-terminal domains of the minor coat protein g3p |
[50175] | Phage FD Coat Protein,Membrane penetration domain |
2.60.40.10 | Viral Protein | (78) |
34 | Immunoglobulin heavy chain |
1QFW | Libellula | -; 14.4 | Immunoglobulin- like beta-sandwich |
[48725] | Immunoglobulins | 2.10.60.10 | Immune System | (79) |
35 | TGF-beta type II receptor |
1KTZ | Libellula | -; 12.2 | Snake toxin-like | [57301] | CD59 | 3.10.200.10 | Cytokine/Cytokine Receptor | (80) |
36 | Carbonic anhydrase |
2ZNC | Loopp | 1.1; - | Carbonic anhydrase | [51068] | Carbonic Anhydrase II |
2.60.40.200 | Lyase | (81) |
37 | Copper chaperone for superoxide dismutase |
1DO5 | Loopp | 0.9; - | Immunoglobulin- like beta-sandwich |
[48725] | Immunoglobulin- like |
3.10.200.10 | Chaperone | (82) |
38 | Carbonic anhydrase |
1RJ6 | Loopp | 0.9; - | Carbonic anhydrase | [51068] | Carbonic Anhydrase II |
2.80.10.50 | Lyase | (83) |
39 | Winged bean albumin 1 |
1WBA | Loopp | 0.9; - | beta-Trefoil | [50352] | Trefoil (Acidic Fibroblast Growth Factor, subunit A) |
2.60.120.260 | Seed Storage Protein | (84) |
40 | Micromonospora sialidase |
1W8O | HHPred | 94; 97 | 6-bladed beta- propeller |
[50938] | Galactose-binding domain-like |
Hydrolase | (85) |
Figure 2.
Dendrogram of the selected and discarded template structures. The clustering was performed based on the structural homology (normalized Z-Score calculated with Mammoth). The templates exhibiting a galactose-binding domain-like fold are highlighted in red.
Parametric Sequence Alignment
BCL::Align ((34), http://www.meilerlab.org) was used to create a pairwise sequence alignment between the sequences of laminin β1 and γ1 LN domains and the 14 template proteins. Given the low sequence similarity predicted secondary structure was used as component of the scoring function in sequence alignment. Secondary structure prediction of laminin LN domains was performed with the program JUFO ((19), (35), http://www.meilerlab.org/), Psipred ((36); http://bioinf.cs.ucl.ac.uk/psipred/) and SAM (37) yielding large content of random coil and β-strands. In order to sample the space of possible alignments densely a grid search of alignment parameters was performed (38) (Supplementary Table 1, Figure 1B) for each of the 28 combinations of a template with one LN domain creating 52,200 pairwise sequence-sequence alignments. After removal of redundant alignments 5,371 and 9,830 unique alignments were obtained for laminin β1 and laminin γ1, respectively. Moreover, alignments were excluded that prevent construction of all loops assuming that each amino acid can bridge approximately 3Å. 4,333 (laminin β1) and 7,306 (laminin γ1) target/template-alignments fulfilling this rule were carried forward (Figure 1B).
Construction of Comparative Models
For each of the 11,639 alignments the backbone coordinates of the aligned regions were copied from the template structure into the model of the target. Loops were constructed in ten independent runs of Rosetta (39). Always the best of the 10 models with correctly constructed loop regions enter construction of side chain coordinates using the feature “-find_disulf“, which favors the formation of disulfide bonds. A gradient-based relaxation of the protein backbone structure was used to enable changes of the protein backbone conformation. A total of 3,550 and 5,104 models of laminin β1 and γ1 LN domains were constructed, respectively, each representing one of the original alignments (Figure 1C).
Validation and Refinement of LN Domain Structures
Models, in which not all loops could be closed, were removed (criterion 1). Additionally, models were filtered by energy (Rosetta total energy – bk_tot – larger than −100, criterion 2), and models with a solvent accessible surface area larger than 20,000 Å2 for lack of compactness were removed (criterion 3) (Figure 1C). Further models were removed that displayed CA-CA distances for cross-links larger than 30 Å (criterion 4) or CA-CA distances of disulfide bridges larger than 13 Å (criterion 5) (Figure 1D & Table 5). A total of seven constraints were available for the laminin β1 LN domain, five for the laminin γ1 LN domain. Only three models of the laminin β1 LN domain and twelve models of the γ1 LN domain fulfilled all experimental constraints (Figure 1D). This highlights the power of the chemical cross-linking approach enabling an enormous reduction of potential structures, even if only a small number of distance constraints are obtained. In the present study, less than 1% of the initial models fulfill all distance constraints (Table 5). The models were ranked according to an empirical composite score derived from these four criteria (Table 6). The composite score minimizes the deviation between distances observed experimentally and in the model. At the same time, it minimizes Rosetta energy and solvent accessible surface area (see Supplementary Material).
Table 5.
Remaining structures based on the number of fulfilled distance constraints (disulfide bonds and cross-links, see Table 3). Compactness (according to Table 6) was used as an additional criterion to validate the obtained structures, which fulfill all cross-linking constraints.
Protein | Modeling Parameters criterion I-III |
Number of fulfilled distance constraints criterion IV and V |
||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
sequence-sequence alignments |
Non-redundant alignments |
Criterion I Modeled loops |
Criterion II Similarity to Template and energy cut-off |
Criterion III Compactness |
1 | 2 | 3 | 4 | 5 | 6 | 7 | |
Ges. β1 | 730,800 | 5,371 | 4,333 | 506 | 506 | 506 | 501 | 472 | 288 | 54 | 3 | |
1CZT | 52,200 | 499 | 296 | 43 | 41 | 43 | 43 | 43 | 43 | 28 | 8 | 0 |
1D7P | 52,200 | 364 | 313 | 84 | 83 | 84 | 84 | 84 | 81 | 52 | 10 | 3 |
1EUT | 52,200 | 317 | 294 | 14 | 7 | 14 | 14 | 14 | 13 | 8 | 1 | 0 |
1GOF | 52,200 | 1017 | 799 | 48 | 10 | 48 | 44 | 48 | 47 | 27 | 7 | 0 |
1GQP | 52,200 | 241 | 241 | 84 | 45 | 84 | 84 | 84 | 79 | 43 | 0 | 0 |
1GUI | 52,200 | 344 | 344 | 44 | 44 | 44 | 44 | 44 | 44 | 42 | 8 | 0 |
1JHJ | 52,200 | 300 | 283 | 25 | 6 | 25 | 25 | 25 | 24 | 13 | 2 | 0 |
1K12 | 52,200 | 261 | 201 | 25 | 24 | 25 | 25 | 25 | 24 | 15 | 3 | 0 |
1K3I | 52,200 | 727 | 591 | 35 | 10 | 35 | 35 | 35 | 34 | 17 | 5 | 0 |
1KEX | 52,200 | 177 | 141 | 16 | 16 | 16 | 16 | 16 | 16 | 11 | 0 | 0 |
1SDD | 52,200 | 614 | 372 | 59 | 3 | 59 | 59 | 55 | 40 | 12 | 2 | 0 |
1TVG | 52,200 | 193 | 184 | 16 | 16 | 16 | 16 | 16 | 16 | 3 | 0 | 0 |
1XPW | 52,200 | 193 | 192 | 12 | 10 | 12 | 12 | 12 | 12 | 12 | 6 | 0 |
Ges. γ1 | 730,800 | 9,830 | 7,306 | 1,349 | 1,349 | 1,334 | 1,159 | 387 | 12 | |||
1CZT | 52,200 | 695 | 370 | 150 | 145 | 150 | 150 | 144 | 53 | 3 | ||
1D7P | 52,200 | 616 | 507 | 146 | 142 | 146 | 146 | 132 | 31 | 1 | ||
1EUT | 52,200 | 509 | 432 | 54 | 34 | 54 | 54 | 31 | 14 | 0 | ||
1GOF | 52,200 | 811 | 657 | 57 | 30 | 57 | 56 | 37 | 11 | 0 | ||
1GQP | 52,200 | 354 | 285 | 180 | 170 | 180 | 80 | 142 | 71 | 1 | ||
1GUI | 52,200 | 409 | 319 | 103 | 102 | 103 | 103 | 96 | 26 | 1 | ||
1JHJ | 52,200 | 797 | 776 | 48 | 18 | 48 | 48 | 47 | 8 | 0 | ||
1K12 | 52,200 | 572 | 385 | 171 | 171 | 171 | 171 | 168 | 12 | 1 | ||
1K3I | 52,200 | 704 | 438 | 52 | 27 | 52 | 52 | 39 | 10 | 0 | ||
1KEX | 52,200 | 180 | 118 | 57 | 56 | 57 | 57 | 56 | 38 | 5 | ||
1SDD | 52,200 | 2896 | 1947 | 208 | 57 | 208 | 208 | 147 | 42 | (1)* | ||
1TVG | 52,200 | 485 | 374 | 30 | 29 | 30 | 30 | 28 | 20 | 0 | ||
1XNA | 52,200 | 348 | 263 | 8 | 8 | 8 | 8 | 8 | 5 | 0 | ||
1XPW | 52,200 | 454 | 435 | 30 | 22 | 30 | 30 | 29 | 17 | 0 |
Structure did not fulfill the compactness criterion.
Table 6.
Composite scoring function which integrates the deviation of the experimental constraints from the theoretical maximum distance as well as the Rosetta energy and compactness of the structure
| |||
---|---|---|---|
With | |||
Optimal Xopt | Maximal allowed Xmax | ||
Chemical Cross-Link | <19 Å | <24 Å | |
Disulfide | <8 Å | <13 Å | |
Energy (bk_tot) | <−250 | <−180 | |
Compactness (SASA) | <15.000 | <20.000 |
If Xi<Xopt than was set to 1, where else in the case Xi>Xmax the term was set to 0.05.
Leave-One-Out Cross-Validation
To test the robustness of the model selection protocol a leave-one-out (LOO) cross-validation was performed by selecting the final models using only six out of seven (for laminin β1) and four out of five (for laminin γ1) distance constraints. Agreement of these models with the final constraint was taken as a measure of accuracy. Moreover, these models were compared to the models selected using all experimental data using the structure-structure alignment method (33). A Z-score larger than 8 was used as criterion for high structural homology (Table 7 & Supplementary Table 2). The final assessment of the models was based on precision as measured by the composite score (Table 6 & Supplementary Table 2).
Table 7.
Summary of the filter function and the leave-one-out (LOO) experiments. “filter total“, “filter (6/7)“, and “filter (4/5)“ indicate how many models in total and how many models that fulfill 6 out of 7, respectively, 4 out of 5 distance constraints pass the filter; LOO distance and LOO fold indicate the results of LOO in respect to distance fulfillment and structural homology with a Z score of larger than 8; information content measure (sequence separation [aa] / Euclidian separation [Å]) of the distance constraints. XL: Cross-link.
Laminin β1 | Disulfide 1 |
Disulfide 2 |
Disulfide 3 |
Disulfide 4 |
XL 1 | XL 2 | XL 3 | Mean |
---|---|---|---|---|---|---|---|---|
Filter (total) | 484 95.7% |
117 23.1% |
467 92.3% |
47 9.3% |
493 97.4% |
288 56.9% |
435 86.0% |
65% |
Filter (6/7) | 33 100% |
27 81.8% |
33 100% |
9 27.3% |
33 100% |
33 100% |
33 100% |
84% |
LOO distance |
3/3 100% |
3/9 33% |
3/3 100% |
3/27 11.1% |
3/3 100% |
3/3 100% |
3/3 100% |
78% |
LOO fold | 3/3 100% |
6/9 66% |
3/3 100% |
14/27 52% |
3/3 100% |
3/3 100% |
3/3 100% |
88% |
Information content [Å−1] |
0.38 | 0.69 | 0.23 | 1.54 | 0.37 | 1.90 | 0.47 |
Laminin γ1 | Disulfide 1 | Disulfide 2 | Disulfide 3 | Disulfide 4 | XL 1 | Mean |
---|---|---|---|---|---|---|
Filter (total) | 1226 90.8% |
260 19.3% |
1348 99.9% |
218 16.1% |
1193 88.4% |
63% |
Filter (4/5) | 291 99.3% |
161 54.9% |
293 100.0 |
148 50.5% |
291 99.3% |
81% |
LOO distance | 12/14 86% |
12/133 9% |
12/12 100% |
12/145 8% |
12/12 100% |
61% |
LOO fold | 9/14 64% |
68/133 51% |
9/12 75% |
82/145 57% |
9/12 75% |
64% |
Information content [Å−1] |
0.77 | 0.62 | 0.23 | 1.31 | 0.43 |
Evaluation of Structure Quality
The structure quality of these models fulfilling all experimental distance constraints was evaluated using the programs VADAR ((40), http://redpoll.pharmacy.ualberta.ca/vadar/) and MolProbity ((41), http://molprobity.biochem.duke.edu/). The evaluated structures are shown in the Supplementary Material.
Estimation of the Influence of Templates and Distance Constraints
For each of the 14 templates the most likely structure for laminin β1 and γ1 was determined based on the composite score discussed above (Table 6). These structures were evaluated based on the structure homology to give the overall best model. To assess the importance of each distance constraints in model selection the fraction of models that fulfill each restraint before filtering was computed. Furthermore, we determined how often a single constraint is exclusively responsible for the rejection of models, i.e., in these cases for which all except one constraint are fulfilled.
RESULTS AND DISCUSSION
The aim of the present study was to obtain structural models of laminin β1 and γ1 LN domains based on sparse distance constraints imposed by natural cross-links, i.e., disulfide bonds, as well as by chemical cross-links obtained from reactions with the amine-reactive cross-linkers BS2G and BS3. The amino acid sequences of recombinant laminin β1 and laminin γ1 fragments comprised of one LN domain and LE domains 1-4 used in this study are shown in Figure 3.
Figure 3.
Amino acid sequences of recombinant (A) laminin β1 and (B) laminin γ1 fragments comprised of one LN domain and LE domains 1-4. Amino acids that were detected during peptide mass fingerprint and peptide fragment fingerprint analyses are shown in bold. Putative glycosylation sites are shown in italics and underlined. The tags that were added to the laminin N-terminal constructs are printed in italics. LN domains are shaded.
Intramolecular Cross-Linked Products
The homobifunctional NHS esters BS3-D0/D4 and BS2G-D0/D4 bridging distances of ca. 7.7 Å and 11.4 Å were used for chemical cross-linking as described in the Materials and Methods. A sulfonate group at the NHS moiety improves water solubility of the reagent. NHS esters are highly reactive towards primary amines, i.e., ε-amine groups of lysines and the free N-terminus of a protein, but as a side reaction they are also susceptible to hydrolysis. The amino acids that are modified by a partially hydrolyzed cross-linkers (so-called “dead-end” cross-links) do not yield direct distance information, but give valuable insights into the solvent accessibility of a specific amino acid (9). In this work, we did not consider these ”dead-end” cross-links for structural modeling. In addition to reacting with amine groups, NHS esters have also been found to react with hydroxyl groups of tyrosine, threonine and serine residues (21) (24). Upon cross-linking, NHS esters create an amide bond with mass increases of 138.068 u (BS3-D0) and 96.021 u (BS2G-D0), respectively. Peptides, which are modified by a partially hydrolyzed cross-linker exhibit mass increases of 156.079 u (BS3-D0) and 114.032 u (BS2G-D0), respectively. BS3 and BS2G were employed as 1:1 mixtures of their non-deuterated (D0) and deuterated (D4) species in order to facilitate the identification of cross-linked products by means of their distinct doublet isotope patterns with mass differences of 4.025 u (D0/D4) in the deconvoluted mass spectra (13).
To gain insight into the three-dimensional structures of laminin β1 and γ1 LN domains the cross-linking reaction had to be optimized. In contrast to determining protein-protein interaction sites, intramolecular cross-linking within the laminin LN domain monomers had to be favored. Under the present conditions no formation of laminin homodimers was observed in MALDI-TOF-MS or 1D-SDS-PAGE. Gel bands of laminin LN domain monomers from cross-linking reaction mixtures, in which both laminin β1 and γ1 constructs were contained as well as the isolated laminin fragments in solution were digested either in-gel or in-solution and analyzed by nano-HPLC/MALDI-TOF/TOF-MS(/MS).
When isolated laminins were subjected to the cross-linking reaction with BS3 and digestion was performed in-solution and in-gel, five and two, intramolecular cross-linked products were obtained for laminin β1 and γ1 LN domain, respectively. Table 1 summarizes three unique cross-links for laminin β1 and two cross-links for laminin γ1. In addition, intramolecular cross-linking products were identified from in-gel digestions of monomeric bands and in-solution digestions of cross-linking reaction mixtures with laminin β1 and γ1 LN domains using the cross-linkers BS3 and BS2G (Table 2). The cross-linking products that had already been found when performing in-solution digestion of separated laminin β1 and γ1 LN domain were confirmed. In total, eleven cross-linked products for laminin β1 LN domain and four cross-linked products for laminin γ1 LN domain were observed.
Table 1.
Intramolecular cross-linked products of isolated laminin β1 and γ1 LN domains using BS3 as cross-linker; all cross-linked products were verified by MS/MS data. The cross-linked amino acids are printed in italics and are underlined. The cross-linked product presented in Figure 4 is highlighted.
Protein | MH+exp | MH+theo | Sequence | Amino acids | Δm [ppm] |
MS/MS | D0/D4 | Digestion |
---|---|---|---|---|---|---|---|---|
β1 | 1293.794 1297.821 |
1293.793 1297.818 |
218–227 | R/IKFVKLHTLG/D | 1 2 |
b5–9 y4–5 |
1.9 | In-solution |
β1 | 1293.786 1297.810 |
1293.793 1297.818 |
218–227 | R/IKFVKLHTLG/D | 5 6 |
b5–9 y3–5 |
1.8 | In-gel |
β1 | 1672.824 1676.846 |
1672.822 1676.847 |
194–206 | L.DPAFKIEDPYSPR.I | 1 1 |
y1,y12 b3 |
1.4 | In-solution |
β1 | 1672.804 1676.830 |
1672.822 1676.847 |
194–206 | L.DPAFKIEDPYSPR.I | 11 10 |
y1–3,y12; b2,b4 |
1.4 | In-gel |
β1 | 1856.951 1860.976 |
1856.943 1860.968 |
192–206 | R.ALDPAFKIEDPYSPR.I | 4 4 |
y1–3;y12, b2 | 1.4 | In-solution |
β1 | 2282.310 2286.329 |
2282.334 2286.359 |
194–200 + 207–217 |
L.DPAFKIE.D + R.IQNLLKITNLR.I |
11 13 |
1: b3,y6 2: y1,2;b2,3 |
1.6 | In-solution |
β1 | 2466.415 2470.444 |
2466.433 2470.458 |
192–200 +207–217 |
R.ALDPAFKIE.D +R.IQNLLKITNLR.I |
7 6 |
2: b1,3;y1–4 | 1.5 | In-gel |
β1 | 2466.455 2470.479 |
2466.433 2470.458 |
192–200 +207–217 |
R.ALDPAFKIE.D +R.IQNLLKITNLR. |
9 9 |
1:b2,3 2:b2 |
1.6 | In-solution |
γ1 | 2118.135 2122.156 |
2118.149 2122.174 |
128–143 | R.LKFHTSRPESFAIYKR.T | 7 8 |
y1,2,5–7,9 b7,14,15 |
1.8 | In-solution |
γ1 | 2118.145 2122.166 |
2118.149 2122.174 |
128–143 | R.LKFHTSRPESFAIYKR.T | 2 4 |
y1, b15 | 1.7 | In-gel |
γ1 | 2651.284 2655.307 |
2651.299 2655.324 |
189 – 212 | S.DISPLTGGNVAFSTLEGRPSAYNF.D | 6 6 |
y1,2,6– 12,14,18,21,23 b2–4,6,9–11,18 |
1.3 | In-gel |
γ1 | 2651.286 2655.301 |
2651.299 2655.324 |
189 – 212 | S.DISPLTGGNVAFSTLEGRPSAYNF.D | 5 9 |
y2,6,23; b2,3,6 | 1.5 | In-solution |
Table 2.
Intramolecular cross-linked products of cross-linking reaction mixtures containing both laminin β1 and γ1 LN domains; the cross-linker (BS2G or BS3) is indicated. All cross-linked products were verified by MS/MS data. The cross-linked amino acids are printed in italics and are underlined.
Laminin β1 | |||||||
---|---|---|---|---|---|---|---|
MH+exp | MH+theo | Sequence | Amino acids and cross-linker | Δm [ppm] |
MS/MS | D0/D4 | Digestion |
1293.787 1297.812 |
1293.793 1297.818 |
218–227 | R.IKFVKLHTLG.D + BS3 | 5 5 |
y4–5; b5–9 |
1.3 | In-solution |
1293.821 1297.841 |
1293.793 1297.818 |
218–227 | R.IKFVKLHTLG.D + BS3 | 22 18 |
y3-5; b5–9 | 1.8 | In-gel |
1672.823 1676.847 |
1672.822 1676.847 |
194–206 | L.DPAFKIEDPYSPR.I + BS3 | 1 0 |
b3, y1–3,12 | 1.4 | In-solution |
1672.833 1676.860 |
1672.822 1676.847 |
194–206 | L.DPAFKIEDPYSPR.I + BS3 | 7 8 |
b3; y1–3,9,12 | 3.0 | In-gel |
2282.322 2286.342 |
2282.312 2286.337 |
194-200+ 207-217 |
L.DPAFKIE.D +R.IQNLLKITNLR.I + BS3 |
4 2 |
1 :b3 2 :b2–4; y1–5, |
1.8 | In-gel |
2466.430 2470.454 |
2466.433 2470.458 |
192-200+ 207–217 |
R.IQNLLKITNLR.I + R.ALDPAFKIE.D + BS3 |
1 2 |
- | 1.6 | In-gel |
1421.842 1425.867 |
1421.847 1425.869 |
207–217 | R.IQNLLKITNLR.I + BS2G | 4 1 |
y1–5, b1–5 | 0.8 | In-solution |
1630.800 1634.793 |
1630.775 1634.800 |
194–206 | L.DPAFKIEDPYSPR.I + BS2G | 15 4 |
y1,2,12 | 0.6 | In-gel |
1836.901 1840.924 |
1836.865 1840.890 |
176–191 | R.YSDIEPSTEGEVIFR.A amidated(C-term);+ BS2G | 20 18 |
y1–8 | 0.7 | In-solution |
1836.921 1840.944 |
1836.865 1840.890 |
176–191 | R.YSDIEPSTEGEVIFR.A amidated (C-term); + BS2G | 30 29 |
y1–4 | 0.6 | In-gel |
1935.877 1939.900 |
1935.883 1939.908 |
152–168 | Y.DCESSFPGISTGPMKKV.D + BS2G | 3 4 |
y2–3,5–8,10–13, b4, b9, b6, b4 |
0.7 | In-gel |
2240.265 2244.286 |
2240.265 2244.290 |
194–200+ 207–217 |
L.DPAFKIE.D + R.IQNLLKITNLR.I + BS2G |
0 2 |
b3, y1,4,17 |
0.7 | In-solution |
2240.260 2244.285 |
2240.265 2244.290 |
194–200+ 207–217 |
L.DPAFKIE.D + R.IQNLLKITNLR.I + BS2G |
2 2 |
1: y1–5 ; b2–5 2 : b2–4 |
0.7 | In-gel |
2501.238 2505.257 |
2501.235 2505.260 |
136–147+ 192–200 |
R.SSDFGKTWGVYR.Y+R.ALDPAFKIE.D + BS2G | 1 1 |
1 : b2, y1,2,5 | 0.5 | In-solution |
2501.216 2505.239 |
2501.235 2505.260 |
136–147+ 192–200 |
R.SSDFGKTWGVYR.Y+R.ALDPAFKIE.D + BS2G | 8 8 |
A: b2,3,5; y1,4–6 b : b3 |
0.6 | In-gel |
2552.132 2556.149 |
2552.135 2556.160 |
152–173 | Y.DCESSFPGISTGPMKKVDDIIC.D + BS2G | 1 4 |
b6,9,10,17,21 y10,11,13,17 |
0.5 | In-solution |
2552.021 2556.044 |
2552.135 2556.160 |
152–173 | Y.DCESSFPGISTGPMKKVDDIIC.D + BS2G | 45 45 |
- | 0.6 | In-gel |
2751.278 2755.315 |
2751.312 2755.337 |
33–55 | K.LSVTSTCGLHKPEPYCIVSHLQE.D + BS2G | 12 8 |
y3–y6,10–14 b2,4,10,11 |
0.5 | In-solution |
2751.272 2755.312 |
2751.312 2755.337 |
33–55 | K.LSVTSTCGLHKPEPYCIVSHLQE.D + BS2G | 15 9 |
b2,4,10 y5–6,10,12 |
0.6 | In-gel |
3078.499 3082.529 |
3078.502 3082.527 |
30–55 | R.AQKLSVTSTCGLHKPEPYCIVSHLQE.D + BS2G | 1 1 |
b15,17 y4,5,10,12 |
0.3 | In-gel |
Laminin γ1 | |||||||
---|---|---|---|---|---|---|---|
MH+exp | MH+theo | Sequence | Amino acids and cross-linker | Δm [ppm] |
D0/D4 | MS/MS | Digestion |
2118.151 2122.176 |
2118.149 2122.174 |
128–143 | R.LKFHTSRPESFAIYKR.T + BS3 | 1 1 |
1.3 | b7,9,15 y1,7,9 |
In-solution |
2118.172 2122.170 |
2118.149 2122.174 |
128–143 | R.LKFHTSRPESFAIYKR.T + BS3 | 11 2 |
0.8 | (y1,b15) | In-gel |
2076.080 2080.125 |
2076.103 2080.128 |
128–143 | R.LKFHTSRPESFAIYKR.T + BS2G | 11 1 |
0.8 | b1,15 y1,6,7,10 |
In-solution |
2852.190 2856.198 |
2852.226 2856.251 |
147–169 | E.DGPWIPYQYYSGSCENTYSKANR.G + BS2G | 13 19 |
0.4 | y1-3; b3– b5, b8, b16 |
In-solution |
2949.432 2953.455 |
2949.431 2953.456 |
239–263 | G.DDVFNEPKVLKSYYYAISDFAVGGR.C + BS2G | 0 0 |
0.7 | b18,b20, y1,3,19 |
In-solution |
In Figure 4, the identification of a cross-link between lysines at positions 198 and 212 of laminin β1 is presented exemplarily. MS/MS data unambiguously point to both lysines as reaction sites.
Figure 4.
MALDI-TOF/TOF-MS/MS analysis of the cross-linking product with BS3 of laminin β1 LN domain (amino acis194-200 connected with amino acids 207-217; [M+H]+exp 2282.322), lysines at positions 198 and 212 were found to be connected (marked with asterisks). Upper panel: the identified fragment ions are shown in bold; lower panel: tandem MS (MS/MS); the insets show the y6 fragment ion and the precursor ion, both exhibiting the characteristic D0/D4 isotope pattern.
Disulfide Bond Analysis
The analysis of disulfide bond patterns of laminin β1 LN domain by offline nano-HPLC/MALDI-TOF/TOF mass spectrometry has been described recently (12). Briefly, after recording the mass spectra the putatively disulfide-linked peptides are subjected to LIFT (laser-induced fragmentation technique)-TOF/TOF-MS/MS to confirm the disulfide bond. Screening the fragment ion mass spectra of disulfide-linked peptides for characteristic 66-u patterns (34 u + 32 u), arising from symmetric and asymmetric cleavage of disulfide bonds, greatly facilitates their identification (42). In addition, MALDI in-source decay created the reduced ‘halves’ of the disulfide-linked peptides and subsequent LIFT-TOF/TOF-MS/MS of the reduced peptides confirmed their amino acid sequences. Using different enzymes for proteolytic digestion of the laminin γ1 chain N-terminal protein fragment, a linear bonding pattern of the eight cysteine residues in the LN domain of the laminin γ1 chain was observed with a strict sequential 1–2, 3–4, 5–6, 7–8 connectivity of the eight cysteines. This disulfide bonding pattern had already been reported for the recombinant laminin β1 chain fragment and a laminin fragment derived by elastase digestion of mouse tumor laminin-111, confirming that this pattern also occurs in native laminin (12). Table 3 summarizes the cysteines that are connected by disulfide bonds.
Table 3.
Distance constraints in laminin β1 and γ1 LN domains serving as basis for computational modeling (filtering criteria for CA-CA distances: disulfide bonds: 13 Å (favored distance 8 Å), cross-links: 30 Å (favored distance 24 Å); shaded areas: these constraints could not be employed as a direct structural filter because of the spatial proximity in the primary structure of laminin LN domains.
Laminin β1 | Laminin γ1 | ||
---|---|---|---|
Connected amino acids | Connection Type | Connected amino acids | Connection Type |
C13-C18 | Disulfide | C18-C28 | Disulfide |
K32-K43 | BS2G | C47 -C55 | Disulfide |
C39-C48 | Disulfide | C67 -C70 | Disulfide |
C59-C62 | Disulfide | K129-K142 | BS2G, BS3 |
K141-K198 | BS2G | C160-C183 | Disulfide |
C153-C173 | Disulfide | S208-Y210 | BS3 |
K166-K167 | BS2G | K246-K249 | BS3 |
K198-Y203/S204 | BS2G, BS3 | ||
K198-K212 | BS2G, BS3 | ||
K212-T214 | BS2G | ||
K219-K222 | BS3 | ||
K222-T225 | BS2G |
Structures of Laminin β1 and γ1 LN Domains
The secondary structure of LN domains was predicted using three different programs as described in the Materials and Methods resulting in a β-sheet content of ca. 30% for laminin β1 and ca. 26% for laminin γ1. Fold recognition using the 3D-Jury method (25) was performed using the sequences of all murine laminin LN domains and the homologous sequences of netrin I, II, and IV as input. A total of six template structures (1CZT, 1D7P, 1GOF, 1KEX, 1SDD, 1XPW) were identified with 3D-Jury scores ranging between 39 and 58 (Table 4). All six structures share a common galactose-binding domain-like fold. In a complementary approach, 35 additional templates were determined by sequence-structure alignment threading servers Phyre, Libellula, Wurst, HHPred, and Loop (28), (29), (30), (31), (32) (Table 4). In order to eliminate false positives, a pairwise structure-structure comparison of all putative templates was executed. 14 out of 40 templates displayed the galactose-binding domain-like fold (Table 4). The 26 remaining templates were removed as false positives as they exhibited similarity with one other template at most (Figure 2). Sequence identities between the templates and laminin β1 and γ1 LN domains ranged between 10 to 18%.
Sequence Assignment and Modeling of Loops
If sequence identities are lower than 50%, selection of the best possible template and accurate alignment of target and template sequences are crucial for success in comparative modeling (43). To comprehensively test all possible templates and alignments a parametric ensemble alignment approach was used for all 14 template proteins (38). Briefly, for each template laminin β1 and γ1 LN domain pair sequence-sequence alignments were created using a scoring function that includes sequence similarity (identity, Blosum45, Pam250, Blast), secondary structure (programs JUFO (19) (35), Psipred (36), SAM (37)), and sequence gaps (gap opening and gap extension) with variable weights. Out of the originally 52,200 alignments redundant alignments were removed as well as alignments that contained gaps too large to be closed by the respective loop. A total of 4,333 and 7,306 sequence-sequence alignments were input to comparative modeling for β1 and γ1 LN domains, respectively.
Construction of Comparative Models
The coordinates of the backbone of aligned amino acids were copied from the template structure into an initial model. Missing regions were reconstructed using Rosetta (39). Altogether, 4,333 alignments (for laminin β1) and 7,306 alignments (for laminin γ1) resulted in complete comparative models. Side chains were added using Rosetta rotamer libraries (44) assuming that all cysteines are involved in disulfide bonds. Models were removed if not all loop regions could be closed, the compactness of the model did not match native protein domains of the same size, not all cysteines were involved in disulfide bridges (or being at least in close distance to other cysteines <13Å), the fold deviated significantly from the starting template, or the model was energetically clearly disfavored, i.e. had substantial clashes (Table 5).
Evaluation of the Models Using Experimental Data
The models obtained by using different templates, sequence-sequence alignments, modeling of loops, and relaxation of structures were evaluated using the distance constraints imposed on the laminin LN domains by “natural” cross-links (disulfide bonds) and “chemically introduced” cross-links. The distance constraints used for verifying the models are summarized in Table 3. The identification of a disulfide bond implies that the CA atoms of the cysteines involved are within a distance of ca. 7 Å. Every CA-CA distance of a disulfide bond, in which the cysteines are separated by at least two amino acids in sequence can be used as structural constraint. In case the cysteines are separated by less than two amino acids, the distance is below 9 Å and the distance constraint is always fulfilled. A similar approximation can be made for the CA-CA distances of amino acids that are cross-linked with BS3: If the CA atoms are at least seven positions apart from each other, these cross-links can be used as useful distance constraints. Thus, seven out of twelve and five out of seven constraints have been applied as restraint for laminin β1 and γ1, respectively. Nevertheless, as already described by Alexander et al. 41) the information content of the distance constraints depend on the sequence separation [aa] / Euclidian separation [Å] ratio.
The structures of laminin β1 and γ1 LN domains calculated by Rosetta were compared with the experimentally obtained distance constraints (Table 3, Supplementary Table 3). All structures were grouped based on the number of fulfilled distance constraints. A distance constraint was considered to be fulfilled if the CA-CA distance of a cross-link was below 30 Å and the CA-CA distance of a disulfide bridge below 13 Å. Both filter criteria are somewhat relaxed to the distances listed above in order to account for inaccuracies in the comparative models (Table 5). Only approximately 1% of all models fulfilled all distance constraints highlighting the discriminative power of even few experimental distance measurements (Table 5).
All models fulfilling all distance constraints exhibited the same topology (more than 100 out of 240 amino acids could be superimposed with an RMSD < 6 Å), however, shifts in the sequence structure alignment of up to 40 amino acids occur. In a second step, a composite scoring function described in Materials and Methods integrating the deviation of the experimental constraints from the theoretical maximum distance as well as the Rosetta energy and compactness of the structure were used to evaluate the quality of the models and create a ranking (Supplementary Table 4). In the best scoring models of the LN domains of laminin β1 and γ1 all of the distance constraints deviate from the theoretical distance by less than 5 Å (Figure 5).
Figure 5.
The lowest scoring Rosetta structures for (A) laminin β1 and (B) γ1 LN domains that fulfill all experimental distance constraints. The amino acids involved in the constraints are indicated, structures are visualized by PyMOL version 0.99rc6). Cysteines at positions 13 (laminin β1) and 18 (laminin γ1), which are not part of laminin N-terminal domains, are marked with asterisks.
For laminin β1, three structures (Table 5) fulfilled all seven distance constraints and exhibited structural similarity, i.e., more than 110 CA atoms are superimposed with an RMSD below 4 Å. For laminin γ1, five distance constraints were fulfilled by 12 models (Table 5). Nine structures could be superimposed in more than 120 CA atoms with an RMSD below 4 Å. For the remaining three models more than 50 CA atoms could be superimposed with an RMSD below 4 Å.
Leave-One-Out Cross-Validation and Evaluation with Structure Quality
To assess the quality of laminin β1 and γ1, all models fulfilling 6 out of 7 or 4 out of 5 distance constraints, respectively, were pairwise compared to the best solution structures for laminin β1 and γ1 LN domains (Figure 5). A test of the precision of the modeling efforts yielded 17 out of 33 (laminin β1) and 141 out of 293 models (laminin γ1) with a high structural similarity, i.e. more than 80 CA atom are superimposed with a RMSD below 4 Å. Accuracy was assessed by testing whether or not the left out distance constrained is fulfilled by the models. On the basis of the LOO cross-validation, the uncertainty of the predicted structures was estimated. For the models of laminin β1 and γ1 LN domains, which were obtained with 6 out of 7 (4 out of 5) distance constraints, 78% (61%) of all models fulfilled the missing distance constraint. If the same models were validated by their structure homology to the best model, 88% (64%) of the models exhibited the correct fold (Table 7). In conclusion, 78% and 61% of the leave-one-out structures fulfill the left-out distance constraint.
In addition, the programs VADAR and MolProbity were used for a Rosetta-independent measurement of structure quality (Supplementary Table 4). The quality criteria defined by the two programs are all in a reasonable range for the models that fulfill all distance constraints. As only minor differences in quality scores were observed between the models, these criteria were not used for further ranking the models.
Influence of Templates
To analyze the influence of the template on the final model best structures were compared for the 14 templates (Table 4). For laminin γ1, for all 14 templates structures were obtained, which fulfilled at least 4 out of 5 distance constraints and were energetically favored as well as structurally compact. From these, five structures displayed a fold identical to the best overall model. For laminin β1, only nine templates yielded models, which fulfilled six out of seven distance constraints. All displayed the identical fold as in the overall best structure (Supplementary Table 2).
Influence of Experimental Distance Constraints
A statistics on the effectiveness of each constraint revealed that all distance constraints reduced conformational space. However, the fraction of models excluded ranged from 0.1% to 80.7% (Table 7). As expected, the effectiveness of an experimental constraint increases with increasing sequence separation and decreasing limit for the Euclidean distance.
Implications of Our Models for Laminin Function
Laminins containing the γ1 chain are essential for mammalian development, as shown by the fact that mice lacking expression of this chain die at day 5.5 of embryonal development (45). Laminins assemble into a macromolecular network by interactions of their LN domain containing N-terminal short arms. This assembly renders them immobile in the basement membrane and allow their C-terminal domains to attach cells onto this structure by interactions with cell surface integrin or dystroglycan receptors. The crucial role of laminin LN domains in laminin network formation has led to intensive attempts to determine their structure by crystallographic techniques. However, it turned out to be extremely difficult to obtain well organized crystals for LN domains from laminins or the homologous netrins and for this reason no LN domain crystal structures have been published so far. The lack of detailed structural information has hampered the study of the crucial LN domain interactions.
The modeled structures that we present herein open new possibilities to identify binding surfaces on LN domains as well as ligand structures. Based on the models site-directed mutagenesis can be performed on surface exposed amino acid residues and the consequences of these mutations can be studied in binding assays for laminin short arm self-interactions (3). Further chemical cross-linking experiments can be performed on interacting laminin short arms to identify the domains within laminin, to which the LN domains bind. Such studies will provide a structural model of self-interacting laminin molecules, which will significantly add to our understanding of basement membrane structure. As mutations within LN domains that affect their interactions can cause severe retinal phenotypes (46) novel knowledge on pathogenic mechanisms may come about.
CONCLUSIONS
With our approach integrating chemical cross-linking, mass spectrometry, and computational modeling we were able to compute the first experimentally validated low-resolution structures of laminin N-terminal domains. After the identification of 14 potentially suited template structures for each laminin β1 and γ1 LN domain, several hundreds of models were created with the modeling program Rosetta. The models were filtered and ranked by both the correlation with experimentally derived distance constraints from natural cross-links, i.e., disulfide bonds, and chemical cross-links created by reaction with two amine-reactive cross-linkers and parameter for model quality i.e., energy and solvent accessible surface area.
The questions how well-defined the models are, how many, and which kind of distance constraints are needed for creating a good model, and how experimental data and template structure influence the model were addressed by leave-one-out cross-validation combined with a structure-structure alignment as well as by Rosetta-independent structure evaluation programs.
Although none of the techniques employed herein is completely new, we argue that the presented protocol is novel as it combines ten computational methods with a complex experimental setup integrating chemical cross-linking, isotope labeling, and HPLC/MS analysis. As such, we consider our approach to be a valid alternative for deriving structural models of proteins, which are not amenable to the high-resolution methods for protein 3D structure analysis, such as NMR spectroscopy or X-ray crystallography.
Supplementary Material
Acknowledgments
This work was supported by grants SI 867/7-1 and SM 65/1-3 from the DFG (Deutsche Forschungsgemeinschaft) and by the Köln Fortune program of the Medical Faculty of the University of Cologne. The authors thank Dr. Tibor Kohajda for programming the software tool IsoFind and Dr. Christian Ihling for discussions and valuable advice on MALDI-TOF/TOF-MS and ESI-FTICR-MS (Apex II) measurements. Karl Mechtler and Christoph Stingl, IMP Vienna, are acknowledged for LTQ-FT-MS measurements. SK acknowledges support from the DFG-funded Graduiertenkolleg 1026 “Conformational Transitions in Macromolecular Interactions” at the Martin Luther University Halle-Wittenberg.
ABBREVIATIONS
- BS2G
Bis(sulfosuccinimidyl)glutarate
- BS3
Bis(sulfosuccinimidyl)suberate
- DHB
2,5-Dihydroxybenzoic acid
- DTT
Dithiothreitol
- ESI-FTICR-MS
Electrospray ionization Fourier transform ion cyclotron resonance mass spectrometry
- FA
Formic acid
- HCCA
α-Cyano-4-hydroxy cinnamic acid
- HPLC
High-performance liquid chromatography
- LHRH
Luteinizing hormone releasing hormone
- LIFT
laser-induced fragmentation technique
- LN
laminin N-terminal
- LOO
Leave-one-out
- MALDI-TOF-MS
Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
- NHS
N-hydroxysuccinimide
- PDB
Protein data bank
- RMSD
Root mean square deviation (calculated for the backbone coordinates)
- SASA
Solvent accessible surface area
- S/N
Signal-to-noise ratio
- TFA
Trifluoroacetic acid
- TIC
Total ion current
REFERENCES
- 1.Tunggal P, Smyth N, Paulsson M, Ott M-C. Microscopy Research and Technique. 2000;51:214–227. doi: 10.1002/1097-0029(20001101)51:3<214::AID-JEMT2>3.0.CO;2-J. [DOI] [PubMed] [Google Scholar]
- 2.Aumailley M, Bruckner-Tuderman L, Carter WG, Deutzmann R, Edgar D, Ekblom P, Engel J, Engvall E, Hohenester E, Jones JCR, Kleinman HK, Marinkovich MP, Martin GR, Mayer U, Meneguzzi G, Miner JH, Miyazaki K, Patarroyo M, Paulsson M, Quaranta V, Sanes JR, Sasaki T, Sekiguchi K, Sorokin LM, Talts JF, Tryggvason K, Uitto J, Virtanen I, von der Mark K, Wewer UM, Yamada Y, Yurchenco PD. Matrix Biology. 2005;24:326–332. doi: 10.1016/j.matbio.2005.05.006. [DOI] [PubMed] [Google Scholar]
- 3.Odenthal U, Haehn S, Tunggal P, Merkl B, Schomburg D, Frie C, Paulsson M, Smyth N. Journal of Biological Chemistry. 2004;279:44504–44512. doi: 10.1074/jbc.M402455200. [DOI] [PubMed] [Google Scholar]
- 4.Yurchenco PD, Tsilibary EC, Charonis AS, Furthmayr H. Journal of Biological Chemistry. 1985;260:7636–7644. [PubMed] [Google Scholar]
- 5.McKee KK, Harrison D, Capizzi S, Yurchenco PD. Journal of Biological Chemistry. 2007;282:21437–21447. doi: 10.1074/jbc.M702963200. [DOI] [PubMed] [Google Scholar]
- 6.Schneiders FI, Maertens B, Bose K, Li Y, Brunken WJ, Paulsson M, Smyth N, Koch M. Journal of Biological Chemistry. 2007;282:23750–23758. doi: 10.1074/jbc.M703137200. [DOI] [PubMed] [Google Scholar]
- 7.Young MM, Tang N, Hempel JC, Oshiro CM, Taylor EW, Kuntz ID, Gibson BW, Dollinger G. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:5802–5806. doi: 10.1073/pnas.090099097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Back JW, Jong Ld, Muijsers AO, Koster CGd. Journal of Molecular Biology. 2003;331:303–313. doi: 10.1016/s0022-2836(03)00721-6. [DOI] [PubMed] [Google Scholar]
- 9.Sinz A. Journal of Mass Spectrometry. 2003;38:1225–1237. doi: 10.1002/jms.559. [DOI] [PubMed] [Google Scholar]
- 10.Sinz A. Mass Spectrometry Reviews. 2006;25:663–682. doi: 10.1002/mas.20082. [DOI] [PubMed] [Google Scholar]
- 11.Hermanson G, editor. Bioconjugate Techniques. 2nd. Amsterdam, NL: Elsevier; 2008. [Google Scholar]
- 12.Kalkhof S, Haehn S, Ihling C, Paulsson M, Smyth N, Sinz A. Rapid Communications in Mass Spectrometry. 2008;22:1933–1940. doi: 10.1002/rcm.3576. [DOI] [PubMed] [Google Scholar]
- 13.Kalkhof S, Ihling C, Mechtler K, Sinz A. Analytical Chemistry. 2004;77:495–503. doi: 10.1021/ac0487294. [DOI] [PubMed] [Google Scholar]
- 14.Schmidt A, Kalkhof S, Ihling C, Cooper D, Sinz A. Eur J Mass Spectrom (Chichester, Eng) 2005;11:525–534. doi: 10.1255/ejms.748. [DOI] [PubMed] [Google Scholar]
- 15.Ihling C, Schmidt A, Kalkhof S, Schulz DM, Stingl C, Mechtler K, Haack M, Beck-Sickinger AG, Cooper DMF, Sinz A. Journal of the American Society for Mass Spectrometry. 2006;17:1100–1113. doi: 10.1016/j.jasms.2006.04.020. [DOI] [PubMed] [Google Scholar]
- 16.Simons KT, Kooperberg C, Huang E, Baker D. Journal of Molecular Biology. 1997;268:209–225. doi: 10.1006/jmbi.1997.0959. [DOI] [PubMed] [Google Scholar]
- 17.Bradley P, Malmström L, Qian B, Schonbrun J, Chivian D, Kim DE, Meiler J, Misura KMS, Baker D. Proteins: Structure, Function, and Bioinformatics. 2005;61:128–134. doi: 10.1002/prot.20729. [DOI] [PubMed] [Google Scholar]
- 18.Bowers PM, Strauss CEM, Baker D. Journal of Biomolecular NMR. 2000;18:311–318. doi: 10.1023/a:1026744431105. [DOI] [PubMed] [Google Scholar]
- 19.Meiler J, Baker D. Journal of Magnetic Resonance. 2005;173:310–316. doi: 10.1016/j.jmr.2004.11.031. [DOI] [PubMed] [Google Scholar]
- 20.Alexander N, Al-Mestarihi A, Bortolus M, McHaourab H, Meiler J. 2008;16:181–195. doi: 10.1016/j.str.2007.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kalkhof S, Sinz A. Anal Bioanal Chem. 2008;392:305–312. doi: 10.1007/s00216-008-2231-5. [DOI] [PubMed] [Google Scholar]
- 22.Laemli UK. Nature. 1970;227:680–685. doi: 10.1038/227680a0. [DOI] [PubMed] [Google Scholar]
- 23.Schulz DM, Kalkhof S, Schmidt A, Ihling C, Stingl C, Mechtler K, Zschörnig O, Sinz A. Proteins: Structure, Function, and Bioinformatics. 2007;69:254–269. doi: 10.1002/prot.21445. [DOI] [PubMed] [Google Scholar]
- 24.Mädler S, Bich C, Touboul D, Zenobi R. Journal of Mass Spectrometry. 2009;44:694–706. doi: 10.1002/jms.1544. [DOI] [PubMed] [Google Scholar]
- 25.Ginalski K, Elofsson A, Fischer D, Rychlewski L. Bioinformatics. 2003;19:1015–1018. doi: 10.1093/bioinformatics/btg124. [DOI] [PubMed] [Google Scholar]
- 26.Huang X, Miller W. Adv. Appl. Math. 1991;12:337–357. [Google Scholar]
- 27.Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Nucl. Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kelley LA, Sternberg MJE. Nat. Protocols. 2009;4:363–371. doi: 10.1038/nprot.2009.2. [DOI] [PubMed] [Google Scholar]
- 29.Juan D, Graña O, Pazos F, Fariselli P, Casadio R, Valencia A. Proteins: Structure, Function, and Genetics. 2003;50:600–608. doi: 10.1002/prot.10322. [DOI] [PubMed] [Google Scholar]
- 30.Torda AE, Procter JB, Huber T. Nucl. Acids Res. 2004;32:W532–W535. doi: 10.1093/nar/gkh357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Soding J. Bioinformatics. 2005;21:951–960. doi: 10.1093/bioinformatics/bti125. [DOI] [PubMed] [Google Scholar]
- 32.Teodorescu O, Galor T, Pillardy J, Elber R. Proteins: Structure, Function, and Bioinformatics. 2004;54:41–48. doi: 10.1002/prot.10474. [DOI] [PubMed] [Google Scholar]
- 33.Ortiz AR, Strauss CEM, Olmea O. Protein Science. 2002;11:2606–2621. doi: 10.1110/ps.0215902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Dong E, Smith J, Heinze S, Alexander N, Meiler J. Gene. 2008;422:41–46. doi: 10.1016/j.gene.2008.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Meiler J, Prompers JJ, Peti W, Griesinger C, Bruschweiler R. Journal of the American Chemical Society. 2001;123:6098–6107. doi: 10.1021/ja010002z. [DOI] [PubMed] [Google Scholar]
- 36.Jones DT. Journal of Molecular Biology. 1999;292:195–202. doi: 10.1006/jmbi.1999.3091. [DOI] [PubMed] [Google Scholar]
- 37.Karplus K, Barrett C, Cline M, Diekhans M, Grate L, Hughey R. Proteins: Structure, Function, and Genetics. 1999;37:121–125. doi: 10.1002/(sici)1097-0134(1999)37:3+<121::aid-prot16>3.3.co;2-h. [DOI] [PubMed] [Google Scholar]
- 38.Chivian D, Baker D. Nucl. Acids Res. 2006;34:e112. doi: 10.1093/nar/gkl480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Rohl CA, Strauss CEM, Chivian D, Baker D. Proteins: Structure, Function, and Bioinformatics. 2004;55:656–677. doi: 10.1002/prot.10629. [DOI] [PubMed] [Google Scholar]
- 40.Willard L, Ranjan A, Zhang H, Monzavi H, Boyko RF, Sykes BD, Wishart DS. Nucl. Acids Res. 2003;31:3316–3319. doi: 10.1093/nar/gkg565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW, Arendall WB, III, Snoeyink J, Richardson JS, Richardson DC. Nucl. Acids Res. 2007;35:W375–W383. doi: 10.1093/nar/gkm216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Schnaible V, Wefing S, Resemann A, Suckau D, Buecker A, Wolf-Kuemmeth S, Hoffmann D. Analytical Chemistry. 2002;74:4980–4988. doi: 10.1021/ac025807j. [DOI] [PubMed] [Google Scholar]
- 43.Rost B, O'Donoghue S. Comput. Appl. Biosci. 1997;13:345–356. doi: 10.1093/bioinformatics/13.4.345. [DOI] [PubMed] [Google Scholar]
- 44.Kuhlman B, Baker D. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:10383–10388. doi: 10.1073/pnas.97.19.10383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Edwards MM, Mammadova-Bach E, Alpy F, Klein A, Hicks WL, Roux M, Simon-Assmann P, Smith RS, Orend G, Wu J, Peachey NS, Naggert JK, Lefevbre O, Nishina PM. J. Biol. Chem. 2010;285:7697–7711. doi: 10.1074/jbc.M109.069575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Smyth NR, Vatansever HS, Murray P, Meyer M, Frie C, Paulsson M, Edgar D. J. Cell Biol. 1999;144:151–160. doi: 10.1083/jcb.144.1.151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Macedo-Ribeiro S, Bode W, Huber R, Quinn-Allen MA, Kim SW, Ortel TL, Bourenkov GP, Bartunik HD, Stubbs MT, Kane WH, Fuentes-Prior P. Nature. 1999;402:434–439. doi: 10.1038/46594. [DOI] [PubMed] [Google Scholar]
- 48.Pratt KP, Shen BW, Takeshima K, Davie EW, Fujikawa K, Stoddard BL. Nature. 1999;402:439–442. doi: 10.1038/46601. [DOI] [PubMed] [Google Scholar]
- 49.Gaskell A, Crennell S, Taylor G. Structure. 1995;3:1197–1205. doi: 10.1016/s0969-2126(01)00255-6. [DOI] [PubMed] [Google Scholar]
- 50.Ito N, Phillips SEV, Stevens C, Ogel ZB, McPherson MJ, Keen JN, Yadav KDS, Knowles PF. Nature. 1991;350:87–90. doi: 10.1038/350087a0. [DOI] [PubMed] [Google Scholar]
- 51.Au SWN, Leng X, Harper JW, Barford D. Journal of Molecular Biology. 2002;316:955–968. doi: 10.1006/jmbi.2002.5399. [DOI] [PubMed] [Google Scholar]
- 52.Boraston AB, Nurizzo D, Notenboom V, Ducros V, Rose DR, Kilburn DG, Davies GJ. Journal of Molecular Biology. 2002;319:1143–1156. doi: 10.1016/S0022-2836(02)00374-1. [DOI] [PubMed] [Google Scholar]
- 53.Wendt KS, Vodermaier HC, Jacob U, Gieffers C, Gmachl M, Peters J-M, Huber R, Sondermann P. Nature Structural Biology. 2001;8:784–788. doi: 10.1038/nsb0901-784. [DOI] [PubMed] [Google Scholar]
- 54.Bianchet MA, Odom EW, Vasta GR, Amzel LM. Nature Structural Biology. 2002;9:628–634. doi: 10.1038/nsb817. [DOI] [PubMed] [Google Scholar]
- 55.Firbank SJ, Rogers MS, Wilmot CM, Dooley DM, Halcrow MA, Knowles PF, McPherson MJ, Phillips SEV. Proceedings of the National Academy of Sciences of the United States of America. 2001;98:12932–12937. doi: 10.1073/pnas.231463798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Lee CC, Kreusch A, McMullan D, Ng K, Spraggon G. Structure. 2003;11:99–108. doi: 10.1016/s0969-2126(02)00941-3. [DOI] [PubMed] [Google Scholar]
- 57.Adams TE, Hockin MF, Mann KG, Everse SJ. Proceedings of the National Academy of Sciences of the United States of America. 2004;101:8918–8923. doi: 10.1073/pnas.0403072101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Ramelot TA, Raman S, Kuzin AP, Xiao R, Ma L-C, Acton TB, Hunt JF, Montelione GT, Baker D, Kennedy MA. Proteins: Structure, Function, and Bioinformatics. 2009;75:147–167. doi: 10.1002/prot.22229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Marintchev A, Mullen MA, Maciejewski MW, Pan B, Gryk MR, Mullen GP. Nat Struct Mol Biol. 1999;6:884–893. doi: 10.1038/12347. [DOI] [PubMed] [Google Scholar]
- 60.Tews I, Perrakis A, Oppenheim A, Dauter Z, Wilson KS, Vorgias CE. Nat Struct Mol Biol. 1996;3:638–648. doi: 10.1038/nsb0796-638. [DOI] [PubMed] [Google Scholar]
- 61.JointCenterForStructuralGenomics. PDB entry 1VPQ To be published. [Google Scholar]
- 62.Guo H-C, Jardetzky TS, Garrettt TPJ, Lane WS, Strominger JL, Wiley DC. Nature. 1992;360:364–366. doi: 10.1038/360364a0. [DOI] [PubMed] [Google Scholar]
- 63.Cardoso RMF, Silva CHTP, Ulian de Araujo AP, Tanaka T, Tanaka M, Garratt RC. Acta Crystallographica Section D. 2004;60:1569–1578. doi: 10.1107/S0907444904016798. [DOI] [PubMed] [Google Scholar]
- 64.Setayesh FR, DeCorte BL, Horton P, Harris CM, Harris TM, Stone MP. Chemical Research in Toxicology. 1998;11:766–777. doi: 10.1021/tx9800147. [DOI] [PubMed] [Google Scholar]
- 65.Momma K, Mishima Y, Hashimoto W, Mikami B, Murata K. Biochemistry. 2005;44:5053–5064. doi: 10.1021/bi047781r. [DOI] [PubMed] [Google Scholar]
- 66.JointCenterForStructuralGenomics. PDB entry 1VKO To be published. [Google Scholar]
- 67.Mol CD, Dougan DR, Schneider TR, Skene RJ, Kraus ML, Scheibe DN, Snell GP, Zou H, Sang B-C, Wilson KP. Journal of Biological Chemistry. 2004;279:31655–31663. doi: 10.1074/jbc.M403319200. [DOI] [PubMed] [Google Scholar]
- 68.Skelton NJ, Quan C, Reilly D, Lowman H. Structure. 1999;7:157–168. doi: 10.1016/S0969-2126(99)80022-7. [DOI] [PubMed] [Google Scholar]
- 69.Li H, Tomizawa T, Tochio N, Muto Y, Koshiba S, Inoue M, Kigawa T, Yokoyama S. PDB entry 2COC To be published. [Google Scholar]
- 70.Udomsinprasert R, Pongjaroenkit S, Wongsantichon J, Oakley AJ, Prapanthadara L-a, Wilce MCJ, Ketterman AJ. Biochem. J. 2005;388:763–771. doi: 10.1042/BJ20042015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Ichiyanagi K, Ishino Y, Ariyoshi M, Komori K, Morikawa K. Journal of Molecular Biology. 2000;300:889–901. doi: 10.1006/jmbi.2000.3873. [DOI] [PubMed] [Google Scholar]
- 72.López-Lucendo MF, Solís D, André S, Hirabayashi J, Kasai K-i, Kaltner H, Gabius H-J, Romero A. Journal of Molecular Biology. 2004;343:957–970. doi: 10.1016/j.jmb.2004.08.078. [DOI] [PubMed] [Google Scholar]
- 73.Katsuya Y, Mezaki Y, Kubota M, Matsuura Y. Journal of Molecular Biology. 1998;281:885–897. doi: 10.1006/jmbi.1998.1992. [DOI] [PubMed] [Google Scholar]
- 74.Koch AW, Farooq A, Shan W, Zeng L, Colman DR, Zhou M-M. Structure. 2004;12:793–805. doi: 10.1016/j.str.2004.02.034. [DOI] [PubMed] [Google Scholar]
- 75.Abad MC, Arni RK, Grella DK, Castellino FJ, Tulinsky A, Geiger JH. Journal of Molecular Biology. 2002;318:1009–1017. doi: 10.1016/S0022-2836(02)00211-5. [DOI] [PubMed] [Google Scholar]
- 76.Llera AS, Viedma F, Sánchez-Madrid F, Tormo J. Journal of Biological Chemistry. 2001;276:7312–7319. doi: 10.1074/jbc.M008573200. [DOI] [PubMed] [Google Scholar]
- 77.Baeyens KJ, De Bondt HL, Raeymaekers A, Fiers W, De Ranter CJ. Acta Crystallographica Section D. 1999;55:772–778. doi: 10.1107/s0907444998018435. [DOI] [PubMed] [Google Scholar]
- 78.Lubkowski J, Hennecke F, Pluckthun A, Wlodawer A. Nat Struct Mol Biol. 1998;5:140–147. doi: 10.1038/nsb0298-140. [DOI] [PubMed] [Google Scholar]
- 79.Tegoni M, Spinelli S, Verhoeyen M, Davis P, Cambillau C. Journal of Molecular Biology. 1999;289:1375–1385. doi: 10.1006/jmbi.1999.2845. [DOI] [PubMed] [Google Scholar]
- 80.Hart PJ, Deep S, Taylor AB, Shu Z, Hinck CS, Hinck AP. Nat Struct Mol Biol. 2002;9:203–208. doi: 10.1038/nsb766. [DOI] [PubMed] [Google Scholar]
- 81.Stams T, Chen Y, Boriack-Sjodin PA, Hurt JD, Liao J, May JA, Dean T, Laipis P, Silverman DN, Christianson DW. Protein Sci. 1998;7:556–563. doi: 10.1002/pro.5560070303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Lamb AL, Wernimont AK, Pufahl RA, O'Halloran TV, Rosenzweig AC. Biochemistry. 2000;39:1589–1595. doi: 10.1021/bi992822i. [DOI] [PubMed] [Google Scholar]
- 83.Whittingtons DA, Grubb JH, Waheed A, Shah GN, Sly WS, Christianson DW. Journal of Biological Chemistry. 2004;279:7223–7228. doi: 10.1074/jbc.M310809200. [DOI] [PubMed] [Google Scholar]
- 84.Dayan SM, Van Donkelaar A, Kortt AA. Journal of Biological Chemistry. 1987;262:10287–10289. [PubMed] [Google Scholar]
- 85.Watson JN, Newstead S, Dookhun V, Taylor G, Bennet AJ. FEBS Letters. 2004;577:265–269. doi: 10.1016/j.febslet.2004.10.016. [DOI] [PubMed] [Google Scholar]
- 86.Zhang Y. BMC Bioinformatics. 2008;9:40. doi: 10.1186/1471-2105-9-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Zhang Y. Proteins: Structure, Function, and Bioinformatics. 2009;77:100–113. [Google Scholar]
- 88.Soding J, Biegert A, Lupas AN. Nucl. Acids Res. 2005;33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Wu S, Zhang Y. Nucl. Acids Res. 2007;35:3375–3382. doi: 10.1093/nar/gkm251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Fiser A, Do RKG, Scaronali A. Protein Science. 2000;9:1753–1773. doi: 10.1110/ps.9.9.1753. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.