Computational Modeling of Laminin N-Terminal Domains Using Sparse Distance Constraints from Disulfide Bonds and Chemical Cross-Linking

Stefan Kalkhof; Sebastian Haehn; Mats Paulsson; Neil Smyth; Jens Meiler; Andrea Sinz

doi:10.1002/prot.22848

. Author manuscript; available in PMC: 2016 Oct 25.

Published in final edited form as: Proteins. 2010 Oct 11;78(16):3409–3427. doi: 10.1002/prot.22848

Computational Modeling of Laminin N-Terminal Domains Using Sparse Distance Constraints from Disulfide Bonds and Chemical Cross-Linking

Stefan Kalkhof ^1,⁵, Sebastian Haehn ², Mats Paulsson ², Neil Smyth ³, Jens Meiler ^4,^*, Andrea Sinz ^1,^*

PMCID: PMC5079110 NIHMSID: NIHMS824017 PMID: 20939100

Abstract

Basement membranes are thin extracellular protein layers, which separate endothelial and epithelial cells from the underlying connecting tissue. The main non-collagenous components of basement membranes are laminins, trimeric glycoproteins, which form polymeric networks by interactions of their N-terminal (LN) domains; however, no high-resolution structure of laminin LN domains exists so far. To construct models for laminin β₁ and γ₁ LN domains 14 potentially suited template structures were determined using fold recognition methods. For each target/template-combination comparative models were created with Rosetta. Final models were selected based on their agreement with experimentally obtained distance constraints from natural cross-links, i.e., disulfide bonds as well as chemical cross-links obtained from reactions with two amine-reactive cross-linkers. We predict that laminin β₁ and γ₁ LN domains share the galactose-binding domain-like fold.

Keywords: laminin, N-terminal domains, disulfide bonds, chemical cross-linking, mass spectrometry, computational modeling

INTRODUCTION

Laminins are the major non-collagenous proteins of basement membranes and are known to form networks due to crucial non-covalent self-interactions (1). Each member of the laminin protein family consists of three polypeptide chains, with one copy of the α, β and γ polypeptide chains being present. At the N-termini of all three chains globular (LN) domains are followed by a series of laminin-type EGF-like (LE) modules (for nomenclature see (2)) that embed one or two additional domains with a total length of 40–60 nm each. Towards the C-termini the three polypeptide chains intertwine into a long coiled-coil region of 77 nm that ends in five LG domains connected to the α chain. This arrangement gives laminins their typical cross-shaped structure carrying with three short and one long arm. The laminin N-terminal (LN) globular domains require the subsequent four LE-domains for efficient expression and proper folding (3). This region of the molecule is involved in Ca²⁺ dependent laminin self-assembly and binds to the corresponding N-terminal regions of other laminin chains (4). The LN domains of all three chains are required for efficient polymerization as deletion mutants with two or fewer LN domains fail to form networks (5).

The N-terminal portions of laminin chains share sequence homology and domain structure with the netrins, a family of extracellular proteins that were originally identified as neural guidance molecules. Netrin-4 was recently shown to interact with the N-terminal portions of laminin γ1 and γ3 chains in a manner that may regulate basement membrane assembly (6).

No high-resolution three-dimensional structures for laminin LN-domains have been obtained so far. An alternative approach that provides structural insight into proteins is based on chemical cross-linking and subsequent mass spectrometric analysis of the created products. Structural information can be obtained by the insertion of a chemical cross-linker between two functional groups within a protein. The cross-linker has a defined length and is connected via covalent bonds to functional groups of amino acid side chains. The cross-linked amino acids can be identified after enzymatic digestion. This chemical cross-linking approach is also applied to study protein-protein interfaces. The sequence separation of cross-linked amino acids, combined with the cross-linker length, impose a distance constraint on the structure of protein fold or protein complex (7) (8) (9) (10). Analysis of cross-linked peptides by mass spectrometry (MS) uses several advantages associated with MS analysis: (I) The mass of the protein or the protein complex under investigation is theoretically unlimited as the proteolytic peptides of the cross-linked proteins after an enzymatic digest are analyzed (in case a standard “bottom-up” strategy for mass spectrometric protein analysis is employed), (II) the analysis is rapid, (III) it requires very small (10^™14 – 10^™15 mol) amounts of protein, and (IV) as the cross-linking reaction can be executed in a native-like environment protein structure and flexibility are accurately reflected. It is possible to study membrane proteins, post-translational modifications, or splice variants. The broad range of cross-linking reagents with different specificities (primary amines, sulfhydryls, or carboxylic acids) and the wide range of distances (0 Å – >20 Å) allow setup of fine-tuned experimental strategies (7) (8) (9) (10) (11).

However, despite the straightforwardness of the cross-linking approach, the identification of the cross-linked products can be cumbersome due to the complexity of the reaction mixtures. Several strategies have been employed to enrich cross-linker-containing species by affinity chromatography or to facilitate the identification of the cross-linked products; e.g., by using isotope-labeled cross-linkers or proteins, fluorogenic cross-linkers, or cleavable cross-linkers (10).

We recently reported the identification of the disulfide bond pattern of laminin β₁ LN-domain using offline nano-high performance liquid chromatography (nano-HPLC)/matrix-assisted laser desorption/ionization time-of-flight/time-of-flight (MALDI-TOF/TOF) mass spectrometry (12). The aim of the present study was to obtain structural models of laminin β₁ and γ₁ LN-domains based on sparse distance constraints imposed by natural cross-links, i.e., disulfide bonds, as well as by chemical cross-links obtained from reactions with two amine-reactive cross-linkers. For chemical cross-linking of laminin LN domains, we used isotope-labeled, i.e. deuterated, cross-linkers to facilitate the identification of cross-linker-containing species in the mass spectra based on their characteristic isotope patterns (13) (14) (15).

Sparse experimental distance restraints can restrict the conformational space for a protein substantially and therefore enable determination of tertiary structure via computational modeling. For example, the Rosetta de-novo protein structure prediction algorithm (16) (17) allows the prediction of protein structures with medium to high resolution (1.5–3.0 Å) with less than one restraint per amino acid (distance and orientation restraints from NMR spectroscopy (18) (19)). Even with only approximately 0.1 constraints per amino acid the correct fold was determined de novo for T4-lysozyme and αA-crystallin using EPR distance restraints (20). The present project is challenging in that even fewer restraints are available for modeling. On the other hand, the protein models are not folded de novo, but template structures define the fold of the protein. Nevertheless, we underline that the structural models presented here should be treated as working hypotheses that need to be further validated and refined. We are confident that the availability of these models as supplementary information will facilitate this process.

MATERIALS AND METHODS

Materials

The cross-linking reagents BS³-D₀/D₄ bissulfosuccinimidyl)suberate–D₀/D₄), and BS²G-D₀/D₄ bissulfosuccinimidyl)glutarate–D₀/D₄) were obtained from Pierce Inc. (Rockford, IL, USA). The proteases trypsin, chymotrypsin, LysC, endoproteinase AspN, and GluC (all sequencing grade) were obtained from Roche Diagnostics (Mannheim, Germany). MALDI matrices were obtained from Bruker Daltonik (Bremen, Germany), all other chemicals were purchased from Sigma (Taufkirchen, Germany). Nano-HPLC solvents were spectroscopic grade (Uvasol, VWR, Darmstadt, Germany). Water was purified with a Direct-Q5 water purification system (Millipore, Eschborn, Germany).

Expression and Purification of Laminin β₁ and γ₁ N-terminal Constructs

Laminin β₁ und γ₁ constructs (mouse) comprising one LN domain plus LE1-4 domains were expressed in 293-EBNA human embryonic kidney cells (3). Amino acid sequences were confirmed by peptide mass fingerprint analysis using trypsin, endoproteinase AspN, chymotrypsin, endoproteinase GluC or a mixture of trypsin and AspN (enzyme:substrate ratio 1:50) as digestion enzymes.

Cross-Linking Reactions

For chemical cross-linking of laminin β₁ and γ₁ N-terminal constructs, the homobifunctional amine-reactive cross-linkers BS²G and BS³ were employed as 1:1 mixtures of non-deuterated and four-times deuterated derivatives (D₀/D₄). Cross-linking reactions were conducted with 2 μM protein solutions in 20 mM HEPES buffer, 100 mM NaCl, 5 mM CaCl₂, pH 7.4. Freshly prepared stock solutions of the cross-linkers (10 mg/ml in DMSO) were added in 100- and 200-fold molar excess (final concentrations 200 μM and 400 μM) to the protein solution. The reactions were conducted at room temperature under gentle shaking of the reaction mixtures and were quenched after 45 and 90 min, respectively, by adding NH₄HCO₃ to a final concentration of 20 mM.

In-Solution Digestion

For in-solution digestion, the cross-linking reaction mixtures were denatured, reduced, alkylated, and digested with a mixture of AspN and trypsin (enzyme: substrate ratio 1:50) according to an existing protocol (21).

Gel Electrophoresis and In-Gel Digestion

A part of the cross-linking reaction mixtures was desalted with Microcon YM-10 filters (Millipore, Eschborn, Germany) and separated by one-dimensional SDS-PAGE (5% stacking gel / 5%, 8% or 12% resolving gel) according to Laemmli (22). The bands of monomeric laminin β₁ und γ₁ were excised, reduced, alkylated, and digested at 37°C for 16 hrs with a mixture of AspN and trypsin (enzyme: substrate ratio 1:30) as described previously (21). Peptides were extracted by adding three times 50 μl of 5% TFA (for MALDI-MS analysis) or 5% (V/V) FA (for ESI-MS analysis); samples were concentrated in a vacuum concentrator to a volume of 5–10 μl.

Nano-HPLC / MALDI TOF/TOF-Mass Spectrometry

Proteolytic peptide mixtures were analyzed by offline coupling of a nano-HPLC system (Ultimate 3000, Dionex, Idstein, Germany) to a MALDI-TOF/TOF mass spectrometer (Ultraflex III, Bruker Daltonik, Bremen, Germany). Samples were injected via an autosampler with a 200-μl sample loop onto a precolumn (PepMap, C18, 300 μm × 5 mm, 3 μm, 100 Å, Dionex) and desalted by washing the precolumn for 15 min with 0.1% TFA before the peptides were eluted onto the separation column (PepMap, C18, 75 μm × 150 mm, 3μm, 100 Å, Dionex), which had been equilibrated with 95% solvent A (A: 5% ACN, 0.05% TFA). Peptides were separated with a 30 min-gradient (0–30 min: 5–50% B, 30–31 min: 50–95% B, 31–35 min: 95% B (solvent B: 80% ACN, 0.04% TFA) at a flow rate of 300 nl/min with UV detection at 214 nm and 280 nm. Eluates were fractionated into 15-sec fractions with the fraction collector Proteineer fc (Bruker Daltonik), mixed with 1.1 μl of matrix solution (0.7 μg/μl HCCA in 90 % ACN/0.1 % TFA, 1 mM NH₄H₂PO₄) and directly prepared onto a 384 MTP 800 μm AnchorChip target (Bruker Daltonik). Hystar software 3.2 controlled data collection with the nano-HPLC system, UV data acquisition, and fraction collector sampling.

MALDI-TOF-MS analyses were conducted in the positive ionization and reflectron mode by adding 2,000 laser shots in the range m/z 800–4,000 to one mass spectrum. Mass spectra were processed (Savitzky-Golay smoothing and baseline correction) and externally calibrated using Peptide Calibration Standard II (Bruker Daltonik). Monoisotopic mass signals with a signal-to-noise ratio (S/N) > 2 were automatically labeled using the SNAP (Sophisticated Numerical Annotation Procedure) algorithm. Afterwards, peak lists of labeled monoisotopic signals were created and signals with S/N >10 were selected for laser-induced fragmentation. In the MS/MS mode, up to 2,000 laser shots were accumulated for measurement of the intact precursor ion (termination at S/N of 30); the optimum laser energy was determined for each precursor ion by fuzzy logic. Additional 2,000 laser shots were accumulated at 50% higher laser energy for acquisition of fragment ion mass spectra. Spectra were processed, calibrated based on the exact mass of the precursor ion, annotated (SNAP algorithm, S/N>2) and combined with MS data to one data file. Data acquisition was done automatically by the WarpLC 1.1 software (Bruker Daltonik) coordinating MS data acquisition (FlexControl 1.3) and data processing (FlexAnalysis 3.0) softwares.

Nano-HPLC / Nano-ESI-FTICR Mass Spectrometry

Proteolytic peptide mixtures were additionally analyzed by nano-HPLC/nano-ESI mass spectrometry using the two FTICR (Fourier transform ion cyclotron resonance) mass spectrometers Apex II (Bruker Daltonics, Billerica, MA) and LTQ-FT (ThermoFisher Scientific, Bremen, Germany). The Apex II mass spectrometer was online coupled to the nano-HPLC system (Ultimate II with Switchos II and autosampler). Samples were injected via the autosampler with a 20-μl sample loop, and were desalted and concentrated by washing the precolumn (PepMap, C18, 300 μm × 5 mm, 5 μm, 100 Å, Dionex) with 0.1% FA at a flow rate of 2 μl/min for 10 min, before peptides were eluted onto the separation column (PepMap, C18, 75 μm × 150 mm, 3 μm, 100 Å, Dionex). The peptides were separated using a 30-min gradient (0–30min: 5–50% B, 30–31 min: 50–95% B, 31–35 min: 95% B, with solvent A: 5% ACN, 0.1% FA und B: 80% ACN, 0.1% FA) at a flow rate of 200 nl/min monitoring the elution of peptides by their UV absorption at 214 nm und 280 nm. Data acquisition was controlled by the Hystar (version 2.3).

The Apex II FTICR mass spectrometer was equipped with a 7 T supra-conducting magnet and a nano-ESI source (Agilent Technologies, Waldbronn, Germany). For nano-ESI-MS measurements, fused-silica-nano-ESI needles (PicoTips, ID 8 μm, New Objective, Woburn, MA, USA) were employed. The instrument was tuned using the doubly charged signal of the LHRH peptide at m/z 592.2358. Calibration was performed using the LHRH peptide fragments, which were obtained by capillary skimmer dissociation. Data were acquired in broadband mode (m/z range 400–2,000) with 256k data points per spectrum. 10 scans were accumulated to a single spectrum (ca. 12 sec per spectrum), the XMASS software (versions 7.0.2, 7.0.3, 7.0.8, Bruker Daltonik) was used for data acquisition and data.

Nano-HPLC/nano-ESI-MS measurements with the LTQ-FT hybrid mass spectrometer were conducted in online coupling with the Ultimate 3000 Nano-HPLC-System (Dionex). The LTQ-FT combines a linear ion trap (LTQ) and an ion cyclotron resonance (ICR) analyzer and is equipped with a 7 T magnet. Samples were injected via the autosampler with a 100-μl sample loop, desalted and concentrated on a precolumn (PepMap, C18, 300 μm × 5 mm, 3 μm, 100 Å). Separation of peptides was performed on a C18 column (PepMap, C18, 75 μm × 150 mm, 3 μm, 100 Å) using a 90-min gradient (0 to 60 % solvent B in 90 min, followed by isocratic elution at 90% B for 3 min; solvent A: 5% ACN, 0.1% FA, B: 80% ACN, 0.08% FA) at 300 nl/min. The Chromeleon software (version 2.3, Dionex) was used to control the HPLC system, UV data (214 nm und 280 nm) and MS data acquisition. Data acquisition was performed over 100 min. One duty cycle comprised one high-resolution full scan spectrum (m/z 300–2000, resolution 100,000 at m/z 400) in the ICR cell and 10 fragment ion mass spectra of the most intense signals in the linear ion trap. Dynamic exclusion (exclusion time 20 sec, exclusion window ±5 ppm) was used to enhance acquisition of signals with low intensity.

Identification of Cross-Linked Products

Analysis of MS data and identification of cross-linked products was performed with the programs IsoFind, GPMAW, Biotools und MS2Assign. The in-house software tool Isofind determines signals with a defined mass difference, i.e., 4.025 u for the identification of reaction products with D₀/D₄ labeled cross-linkers, yielding peak lists with the respective mass differences and peak intensity ratios. Only signals exhibiting the characteristic D₀/D₄ isotope patterns with similar peak intensities (0.25:1 to 1:4) and similar LC retention times (±1 min) were considered as potential cross-linked products. The GPMAW (General Protein Mass Analysis for Windows, version 8.0, Lighthouse Data, Odense, Denmark) (23) software was used for assignment of cross-linked products. Maximum mass deviations between theoretical and experimental masses of 5 ppm (LTQ-FT), 10 ppm (Apex II), and 50 ppm (Ultraflex III) were allowed. MS/MS data were manually compared to the predicted fragment masses for a cross-linked product. Additionally, the freely accessible program MS2Assign (http://roswell.ca.sandia.gov/~mm-young/ms2assign.html; part of the Collaboratory for MS3D, http://ms3d.org/home.php) was used for calculation of MS/MS data and comparison to experimental data. The software package Biotools 3.1 (Bruker Daltonik) was used to identify peptides that are modified by a hydrolyzed cross-linker as well as intrapeptidal cross-linked products based on exact mass and MS/MS data. Lysines, serines, tyrosines, and threonines were considered as potential cross-linking sites (21) (24). Additionally, oxidation (Met), carboxamidomethylation (Cys), amidation (at cross-linker and at C-terminus), deamidation (Gln, Asn), cyclization (Cys) were taken into account as potential modifications as well as incomplete cleavage (up to 8 missed cleavage sites) (21).

Fold Recognition

The general workflow of computational modeling is summarized in Figure 1. Laminin β₁ and γ₁ sequences were split into separate domains as defined by the ExPASy Proteomic Server (www.expasy.ch) and modeled independently. The amino acid sequences of murine LN domains of laminin β₁ (ExPASy entry P02469, 31-270) and laminin γ₁ (ExPASy entry P02469, 44-283) were analyzed by the 3D-Jury metaserver (www.bioinfo.pl, (25), Figure 1A). Potential templates (PDB entries: 1CZT, 1D7P, 1KEX, 1XPW) were identified with a 3D-Jury score of 45–55, which roughly correlates with the number of amino acids that can be superimposed between a template and a target structure with an RMSD <3.5 Å. A score larger 50 has a 90% chance of being a correctly identified fold. Sequence identities between laminin β₁ and laminin γ₁ and the templates were calculated with the software SIM (http://www.expasy.ch/tools/sim-prot.html, (26)) and were all smaller than 18%.

General workflow of the computational modeling procedure, which includes as main steps (A) detection and identification of templates, (B) sequence-sequence-alignment between target and template sequences, (C) modeling of the LN domains of laminin β₁ and γ₁, (D) experimentally guided scoring and selection of the best models, and (E) validation.

Rectangles: intermediate results; trapeze: verifications and controls; ovals: calculation and predictions steps; parallelograms: input data.

The quality of final comparative models is highly dependent on the template that is used for modeling. Given the low sequence similarity of the initial templates two alternative approaches were employed to more comprehensively search the PDB for additional potential template structures. In the first approach, homologous sequences of laminin β₁ and γ₁ LN domains were searched with PSI-BLAST (27) (www.ncbi.nlm.nih.gov/blast/Blast.cgi; E-value threshold 0.005; ten iterative rounds). In this experiment, the sequences of netrin I, II, IV as well as these of LN domains of laminin chains α₁, α₂, α₃B, α₅, β₂, β₃, γ₃ were identified to be related to the β₁ and γ₁ LN domains. These homologues as well as the sequences of the original 3D Jury fits 1CZT, 1D7P, 1KEX, 1XPW were submitted to the 3D-Jury metaserver for a second iteration. Two additional template candidates (1SDD and 1GOF) were identified by that procedure. To test orthogonal fold recognition approaches the threading servers Phyre [www.sbg.bio.ic.ac.uk/phyre/ (28)], Libellula [www.pdg.cnb.uam.es/servers/libellula, (29)], Wurst [www.zbh.uni-hamburg.de/wurst/, (30)], HHPred [http://toolkit.tue-bingen.mpg.de/hhpred, (31)], and Loopp [http://loopp.org/, (32)] were applied. 35 putative templates were identified in addition to the previously detected structures yielding a total of 40 candidate templates (Figure 1A).

In an unbiased approach the structures of all 40 potential templates were compared pairwise with the program Mammoth (http://ub.cbm.uam.es/mammoth/ (33)) and clustered based on their structural homologies. 14 structures including 1CZT, 1D7P, 1GOF, 1KEX, 1SDD and 1XPW were grouped in one cluster of which all members share the galactose-binding domain-like fold (SCOP: 49784, CATH: 2.60.120.260, compare Table 4). These 14 structures were considered as templates in the subsequent comparative modeling efforts (Figure 1B). Other structures have been dismissed as potential templates as in structure clustering (Figure 2) only one cluster was observed containing the 14 members with a galactose-binding domain-like fold, whereas further clusters contained only two or less members. These 26 additional structures included immunoglobulin-like β-sandwiches and carbonic anhydrase among others.

Table 4.

Results of the template search; 14 structures were found using the programs 3D Jury, Superfamily –“Superfam“, Phyre, Libellula “Lib“, HHPred; the programs Wurst and Loop yielded only false positive template structures. The six templates found with 3D-Jury are printed in bold.

No	Protein Name	PDB entry	Method	Template Score for Laminin β₁; γ₁	SCOP Fold name	SCOP Fold classification code	CATH Fold name	CATH Fold classification code	Protein Function Gene Ontology Annotation	Reference
1	factor V	1CZT	3DJury HHPred*	52; 58 95; 87	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Blood Clotting	(47)
2	factor VIII	1D7P	3D Jury HHPred	42; 57 88; 82	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Blood Clotting	(48)
3	Sialidase	1EUT	Libellula, Phyre*	25%; 80%	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Hydrolase	(49)
4	Galactose oxidase	1GOF	3DJury	49; 50	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Oxidoreductase(Oxygen(A))	(50)
5	APC10/DOC1 subunit of the anaphase- promoting complex	1GQP	Phyre HHPred*	-; 30% -; 74	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Cell Cycle	(51)
6	Carbohydrate binding module from laminarinase 16A	1GUI	Libellula		Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Carbohydrate Binding Module	(52)
7	APC10/DOC1 subunit of the anaphase- promoting complex	1JHJ	Phyre HHPred*	-; 25% -; 48	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Cell Cycle	(53)
8	Fucose binding lectin	1K12	HHPred*	94; 95	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Sugar Binding Protein	(54)
9	Galactose oxidase	1K3I	Phyre HHPred	30%; 80% 96; 95	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Oxidoreductase	(55)
10	B1 domain of neuropilin-1	1KEX	3D-Jury HHPred	37; 53 96; 93	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Protein Binding	(56)
11	Coagulation factor V	1SDD	3DJury HHPred Phyre	49; 39 67; 41 94%; 94%	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Blood Clotting	(57)
12	Placental protein 25, pp25	1TVG	HHPred Phyre	54; 53 98%; 97%	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Cell Cycle	(58)
13	xrcc1	1XNA	Libellula		Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	DNA Binding Protein	(59)
14	Placental protein 25, pp25	1XPW	3DJury	47; 50	Galactose-binding domain-like	49784	Galactose-binding domain-like	2.60.120.260	Structural Genomics, Unknown Function	(58)
15	chitobiase (N- acetyl-beta- glucoseaminidase)	1QBA	Phyre	45%; -	Immunoglobulin- like beta-sandwich	[48725]	Chitobiase, domain 2	3.30.379.10	Glycosyl Hydrolase	(60)
16	Hypothetical protein TM1631	1VPQ	Wurst	9.4; -	TIM beta/alpha- barrel	[51350]	Hypothetical protein tm1631.	3.20.20.410	Structural Genomics, Unknown Function	(61)
17	Class I MHC	1HSB	Wurst	9.3; -	Immunoglobulin- like beta-sandwich	[48725]	Immunoglobulins	2.60.40.10	Histocompatibility Antigen	(62)
18	Cu,Zn superoxide dismutase	1TO4	Wurst	9.1; -	Immunoglobulin- like beta-sandwich	[48725]	Immunoglobulin- like	2.60.40.200	Oxidoreductase	(63)
19	Putative regulator protein YcfX	1AP1	Wurst	9.1; -	Ribonuclease H- like motif	[53066]		Not yet assigned	Transferase	(64)
20	Alginate-binding periplasmic protein AlgQ1	1Y3N	Wurst	9.1; -	Periplasmic binding protein-like II	[53849]	Periplasmic binding protein- like II	3.40.190.10	Sugar Binding Protein	(65)
21	Myo-inositol 1- phosphate synthase	1VKO	Wurst	-; 11.0	AD(P)-binding Rossmann-fold domains	[51734]		Not yet assigned	Isomerase	(66)
22	c-KIT receptor	1T46	Wurst	-; 10.0	Protein kinase-like (PK-like)	[56111]	Phosphorylase Kinase; domain 1	3.30.200.20	Transferase Activator	(67)
23	Interleukin-8	1ILQ	Wurst	-; 10.0	IL8-like	[54116]	OB fold (Dihydrolipoamide Acetyltransferase, E2P)	2.40.50.40	Cytokine	(68)
24	FYVE, RhoGEF and PH domain containing protein 3, FGD3	2COC	Wurst	-; 10.0	PH domain-like barrel	[50728]		Not yet assigned	Signaling Protein	(69)
25	Class delta GST	1V2A	Wurst	-; 10.0	GST C-terminal domain-like	[47615]	Glutaredoxin	3.40.30.10	Transferase	(70)
26	PI-Pfui intein	1DQ3	Superfam		Hedgehog/intein (Hint) domain	[51293]	Endonuclease - Pi- scei; Chain A, domain 1	2.170.16.10	Hydrolase	(71)
27	Galectin-1	1W6O	Superfam		Concanavalin A- like lectins/glucanases	[49898]	Jelly Rolls	2.60.120.200	Lectin	(72)
28	Isoamylase	1BF2	Libellula	2.6; -	Immunoglobulin- like beta-sandwich	[48725]	Immunoglobulins	2.60.40.10	Hydrolase	(73)
29	N-cadherin (neural)	1OP4	Libellula	9.0; -	Immunoglobulin- like beta-sandwich	[48725]	Not yet assigned		Cell Adhesion	(74)
30	Plasminogen	1KI0	Libellula	4.3; -	Kringle-like	[57439]	Plasminogen Kringle 4	2.40.20.10	Hydrolase	(75)
31	CD69	1E8I	Libellula	-; 8.6	C-type lectin-like	[56435]	Mannose-Binding Protein A, subunit A	2.60.120.40	Hematopoietic Cell Receptor	(76)
32	Tumor necrosis factor (TNF)	2TNF	Libellula	-; 10.5	TNF-like	[49841]	Jelly Rolls	2.30.27.10	Cytokine	(77)
33	minor coat protein g3p	1G3P	Libellula	-; 11.7	N-terminal domains of the minor coat protein g3p	[50175]	Phage FD Coat Protein,Membrane penetration domain	2.60.40.10	Viral Protein	(78)
34	Immunoglobulin heavy chain	1QFW	Libellula	-; 14.4	Immunoglobulin- like beta-sandwich	[48725]	Immunoglobulins	2.10.60.10	Immune System	(79)
35	TGF-beta type II receptor	1KTZ	Libellula	-; 12.2	Snake toxin-like	[57301]	CD59	3.10.200.10	Cytokine/Cytokine Receptor	(80)
36	Carbonic anhydrase	2ZNC	Loopp	1.1; -	Carbonic anhydrase	[51068]	Carbonic Anhydrase II	2.60.40.200	Lyase	(81)
37	Copper chaperone for superoxide dismutase	1DO5	Loopp	0.9; -	Immunoglobulin- like beta-sandwich	[48725]	Immunoglobulin- like	3.10.200.10	Chaperone	(82)
38	Carbonic anhydrase	1RJ6	Loopp	0.9; -	Carbonic anhydrase	[51068]	Carbonic Anhydrase II	2.80.10.50	Lyase	(83)
39	Winged bean albumin 1	1WBA	Loopp	0.9; -	beta-Trefoil	[50352]	Trefoil (Acidic Fibroblast Growth Factor, subunit A)	2.60.120.260	Seed Storage Protein	(84)
40	Micromonospora sialidase	1W8O	HHPred	94; 97	6-bladed beta- propeller	[50938]	Galactose-binding domain-like		Hydrolase	(85)

Open in a new tab

Dendrogram of the selected and discarded template structures. The clustering was performed based on the structural homology (normalized Z-Score calculated with Mammoth). The templates exhibiting a galactose-binding domain-like fold are highlighted in red.

Parametric Sequence Alignment

BCL::Align ((34), http://www.meilerlab.org) was used to create a pairwise sequence alignment between the sequences of laminin β₁ and γ₁ LN domains and the 14 template proteins. Given the low sequence similarity predicted secondary structure was used as component of the scoring function in sequence alignment. Secondary structure prediction of laminin LN domains was performed with the program JUFO ((19), (35), http://www.meilerlab.org/), Psipred ((36); http://bioinf.cs.ucl.ac.uk/psipred/) and SAM (37) yielding large content of random coil and β-strands. In order to sample the space of possible alignments densely a grid search of alignment parameters was performed (38) (Supplementary Table 1, Figure 1B) for each of the 28 combinations of a template with one LN domain creating 52,200 pairwise sequence-sequence alignments. After removal of redundant alignments 5,371 and 9,830 unique alignments were obtained for laminin β₁ and laminin γ₁, respectively. Moreover, alignments were excluded that prevent construction of all loops assuming that each amino acid can bridge approximately 3Å. 4,333 (laminin β₁) and 7,306 (laminin γ₁) target/template-alignments fulfilling this rule were carried forward (Figure 1B).

Construction of Comparative Models

For each of the 11,639 alignments the backbone coordinates of the aligned regions were copied from the template structure into the model of the target. Loops were constructed in ten independent runs of Rosetta (39). Always the best of the 10 models with correctly constructed loop regions enter construction of side chain coordinates using the feature “-find_disulf“, which favors the formation of disulfide bonds. A gradient-based relaxation of the protein backbone structure was used to enable changes of the protein backbone conformation. A total of 3,550 and 5,104 models of laminin β₁ and γ₁ LN domains were constructed, respectively, each representing one of the original alignments (Figure 1C).

Validation and Refinement of LN Domain Structures

Models, in which not all loops could be closed, were removed (criterion 1). Additionally, models were filtered by energy (Rosetta total energy – bk_tot – larger than −100, criterion 2), and models with a solvent accessible surface area larger than 20,000 Å² for lack of compactness were removed (criterion 3) (Figure 1C). Further models were removed that displayed C^A-C^A distances for cross-links larger than 30 Å (criterion 4) or C^A-C^A distances of disulfide bridges larger than 13 Å (criterion 5) (Figure 1D & Table 5). A total of seven constraints were available for the laminin β₁ LN domain, five for the laminin γ₁ LN domain. Only three models of the laminin β₁ LN domain and twelve models of the γ₁ LN domain fulfilled all experimental constraints (Figure 1D). This highlights the power of the chemical cross-linking approach enabling an enormous reduction of potential structures, even if only a small number of distance constraints are obtained. In the present study, less than 1% of the initial models fulfill all distance constraints (Table 5). The models were ranked according to an empirical composite score derived from these four criteria (Table 6). The composite score minimizes the deviation between distances observed experimentally and in the model. At the same time, it minimizes Rosetta energy and solvent accessible surface area (see Supplementary Material).

Table 5.

Remaining structures based on the number of fulfilled distance constraints (disulfide bonds and cross-links, see Table 3). Compactness (according to Table 6) was used as an additional criterion to validate the obtained structures, which fulfill all cross-linking constraints.

Protein	Modeling Parameters criterion I-III					Number of fulfilled distance constraints criterion IV and V
Protein	sequence-sequence alignments	Non-redundant alignments	Criterion I Modeled loops	Criterion II Similarity to Template and energy cut-off	Criterion III Compactness	1	2	3	4	5	6	7
Ges. β1	730,800	5,371	4,333	506		506	506	501	472	288	54	3
1CZT	52,200	499	296	43	41	43	43	43	43	28	8	0
1D7P	52,200	364	313	84	83	84	84	84	81	52	10	3
1EUT	52,200	317	294	14	7	14	14	14	13	8	1	0
1GOF	52,200	1017	799	48	10	48	44	48	47	27	7	0
1GQP	52,200	241	241	84	45	84	84	84	79	43	0	0
1GUI	52,200	344	344	44	44	44	44	44	44	42	8	0
1JHJ	52,200	300	283	25	6	25	25	25	24	13	2	0
1K12	52,200	261	201	25	24	25	25	25	24	15	3	0
1K3I	52,200	727	591	35	10	35	35	35	34	17	5	0
1KEX	52,200	177	141	16	16	16	16	16	16	11	0	0
1SDD	52,200	614	372	59	3	59	59	55	40	12	2	0
1TVG	52,200	193	184	16	16	16	16	16	16	3	0	0
1XPW	52,200	193	192	12	10	12	12	12	12	12	6	0
Ges. γ1	730,800	9,830	7,306	1,349		1,349	1,334	1,159	387	12
1CZT	52,200	695	370	150	145	150	150	144	53	3
1D7P	52,200	616	507	146	142	146	146	132	31	1
1EUT	52,200	509	432	54	34	54	54	31	14	0
1GOF	52,200	811	657	57	30	57	56	37	11	0
1GQP	52,200	354	285	180	170	180	80	142	71	1
1GUI	52,200	409	319	103	102	103	103	96	26	1
1JHJ	52,200	797	776	48	18	48	48	47	8	0
1K12	52,200	572	385	171	171	171	171	168	12	1
1K3I	52,200	704	438	52	27	52	52	39	10	0
1KEX	52,200	180	118	57	56	57	57	56	38	5
1SDD	52,200	2896	1947	208	57	208	208	147	42	(1)^*
1TVG	52,200	485	374	30	29	30	30	28	20	0
1XNA	52,200	348	263	8	8	8	8	8	5	0
1XPW	52,200	454	435	30	22	30	30	29	17	0

Open in a new tab

Structure did not fulfill the compactness criterion.

Table 6.

Composite scoring function which integrates the deviation of the experimental constraints from the theoretical maximum distance as well as the Rosetta energy and compactness of the structure

Score = L N \frac{X_{i, max} X_{i}}{X_{i, max} X_{i, opt}}

With

Optimal X_opt

Maximal allowed X_max

Chemical Cross-Link

<19 Å

<24 Å

Disulfide

<8 Å

<13 Å

Energy (bk_tot)

<−250

<−180

Compactness (SASA)

<15.000

<20.000

Open in a new tab

If X_i<X_opt than $\frac{X_{i, max} X_{i}}{X_{i, max} X_{i, opt}}$ was set to 1, where else in the case X_i>X_max the term was set to 0.05.

Leave-One-Out Cross-Validation

To test the robustness of the model selection protocol a leave-one-out (LOO) cross-validation was performed by selecting the final models using only six out of seven (for laminin β₁) and four out of five (for laminin γ₁) distance constraints. Agreement of these models with the final constraint was taken as a measure of accuracy. Moreover, these models were compared to the models selected using all experimental data using the structure-structure alignment method (33). A Z-score larger than 8 was used as criterion for high structural homology (Table 7 & Supplementary Table 2). The final assessment of the models was based on precision as measured by the composite score (Table 6 & Supplementary Table 2).

Table 7.

Summary of the filter function and the leave-one-out (LOO) experiments. “filter total“, “filter (6/7)“, and “filter (4/5)“ indicate how many models in total and how many models that fulfill 6 out of 7, respectively, 4 out of 5 distance constraints pass the filter; LOO distance and LOO fold indicate the results of LOO in respect to distance fulfillment and structural homology with a Z score of larger than 8; information content measure (sequence separation [aa] / Euclidian separation [Å]) of the distance constraints. XL: Cross-link.

Laminin β₁	Disulfide 1	Disulfide 2	Disulfide 3	Disulfide 4	XL 1	XL 2	XL 3	Mean
Filter (total)	484 95.7%	117 23.1%	467 92.3%	47 9.3%	493 97.4%	288 56.9%	435 86.0%	65%
Filter (6/7)	33 100%	27 81.8%	33 100%	9 27.3%	33 100%	33 100%	33 100%	84%
LOO distance	3/3 100%	3/9 33%	3/3 100%	3/27 11.1%	3/3 100%	3/3 100%	3/3 100%	78%
LOO fold	3/3 100%	6/9 66%	3/3 100%	14/27 52%	3/3 100%	3/3 100%	3/3 100%	88%
Information content [Å⁻¹]	0.38	0.69	0.23	1.54	0.37	1.90	0.47

Laminin γ₁	Disulfide 1	Disulfide 2	Disulfide 3	Disulfide 4	XL 1	Mean
Filter (total)	1226 90.8%	260 19.3%	1348 99.9%	218 16.1%	1193 88.4%	63%
Filter (4/5)	291 99.3%	161 54.9%	293 100.0	148 50.5%	291 99.3%	81%
LOO distance	12/14 86%	12/133 9%	12/12 100%	12/145 8%	12/12 100%	61%
LOO fold	9/14 64%	68/133 51%	9/12 75%	82/145 57%	9/12 75%	64%
Information content [Å⁻¹]	0.77	0.62	0.23	1.31	0.43

Open in a new tab

Evaluation of Structure Quality

The structure quality of these models fulfilling all experimental distance constraints was evaluated using the programs VADAR ((40), http://redpoll.pharmacy.ualberta.ca/vadar/) and MolProbity ((41), http://molprobity.biochem.duke.edu/). The evaluated structures are shown in the Supplementary Material.

Estimation of the Influence of Templates and Distance Constraints

For each of the 14 templates the most likely structure for laminin β₁ and γ₁ was determined based on the composite score discussed above (Table 6). These structures were evaluated based on the structure homology to give the overall best model. To assess the importance of each distance constraints in model selection the fraction of models that fulfill each restraint before filtering was computed. Furthermore, we determined how often a single constraint is exclusively responsible for the rejection of models, i.e., in these cases for which all except one constraint are fulfilled.

RESULTS AND DISCUSSION

The aim of the present study was to obtain structural models of laminin β₁ and γ₁ LN domains based on sparse distance constraints imposed by natural cross-links, i.e., disulfide bonds, as well as by chemical cross-links obtained from reactions with the amine-reactive cross-linkers BS²G and BS³. The amino acid sequences of recombinant laminin β₁ and laminin γ₁ fragments comprised of one LN domain and LE domains 1-4 used in this study are shown in Figure 3.

Amino acid sequences of recombinant (A) laminin β1 and (B) laminin γ₁ fragments comprised of one LN domain and LE domains 1-4. Amino acids that were detected during peptide mass fingerprint and peptide fragment fingerprint analyses are shown in bold. Putative glycosylation sites are shown in italics and underlined. The tags that were added to the laminin N-terminal constructs are printed in italics. LN domains are shaded.

Intramolecular Cross-Linked Products

The homobifunctional NHS esters BS³-D₀/D₄ and BS²G-D₀/D₄ bridging distances of ca. 7.7 Å and 11.4 Å were used for chemical cross-linking as described in the Materials and Methods. A sulfonate group at the NHS moiety improves water solubility of the reagent. NHS esters are highly reactive towards primary amines, i.e., ε-amine groups of lysines and the free N-terminus of a protein, but as a side reaction they are also susceptible to hydrolysis. The amino acids that are modified by a partially hydrolyzed cross-linkers (so-called “dead-end” cross-links) do not yield direct distance information, but give valuable insights into the solvent accessibility of a specific amino acid (9). In this work, we did not consider these ”dead-end” cross-links for structural modeling. In addition to reacting with amine groups, NHS esters have also been found to react with hydroxyl groups of tyrosine, threonine and serine residues (21) (24). Upon cross-linking, NHS esters create an amide bond with mass increases of 138.068 u (BS³-D₀) and 96.021 u (BS²G-D₀), respectively. Peptides, which are modified by a partially hydrolyzed cross-linker exhibit mass increases of 156.079 u (BS³-D₀) and 114.032 u (BS²G-D₀), respectively. BS³ and BS²G were employed as 1:1 mixtures of their non-deuterated (D₀) and deuterated (D₄) species in order to facilitate the identification of cross-linked products by means of their distinct doublet isotope patterns with mass differences of 4.025 u (D₀/D₄) in the deconvoluted mass spectra (13).

To gain insight into the three-dimensional structures of laminin β₁ and γ₁ LN domains the cross-linking reaction had to be optimized. In contrast to determining protein-protein interaction sites, intramolecular cross-linking within the laminin LN domain monomers had to be favored. Under the present conditions no formation of laminin homodimers was observed in MALDI-TOF-MS or 1D-SDS-PAGE. Gel bands of laminin LN domain monomers from cross-linking reaction mixtures, in which both laminin β₁ and γ₁ constructs were contained as well as the isolated laminin fragments in solution were digested either in-gel or in-solution and analyzed by nano-HPLC/MALDI-TOF/TOF-MS(/MS).

When isolated laminins were subjected to the cross-linking reaction with BS³ and digestion was performed in-solution and in-gel, five and two, intramolecular cross-linked products were obtained for laminin β₁ and γ₁ LN domain, respectively. Table 1 summarizes three unique cross-links for laminin β₁ and two cross-links for laminin γ₁. In addition, intramolecular cross-linking products were identified from in-gel digestions of monomeric bands and in-solution digestions of cross-linking reaction mixtures with laminin β₁ and γ₁ LN domains using the cross-linkers BS³ and BS²G (Table 2). The cross-linking products that had already been found when performing in-solution digestion of separated laminin β₁ and γ₁ LN domain were confirmed. In total, eleven cross-linked products for laminin β₁ LN domain and four cross-linked products for laminin γ₁ LN domain were observed.

Table 1.

Intramolecular cross-linked products of isolated laminin β₁ and γ₁ LN domains using BS³ as cross-linker; all cross-linked products were verified by MS/MS data. The cross-linked amino acids are printed in italics and are underlined. The cross-linked product presented in Figure 4 is highlighted.

Protein	MH+_exp	MH+_theo	Sequence	Amino acids	Δm [ppm]	MS/MS	D₀/D₄	Digestion
β₁	1293.794 1297.821	1293.793 1297.818	218–227	R/IKFVKLHTLG/D	1 2	b5–9 y4–5	1.9	In-solution
β₁	1293.786 1297.810	1293.793 1297.818	218–227	R/IKFVKLHTLG/D	5 6	b5–9 y3–5	1.8	In-gel
β₁	1672.824 1676.846	1672.822 1676.847	194–206	L.DPAFKIEDPYSPR.I	1 1	y1,y12 b3	1.4	In-solution
β₁	1672.804 1676.830	1672.822 1676.847	194–206	L.DPAFKIEDPYSPR.I	11 10	y1–3,y12; b2,b4	1.4	In-gel
β₁	1856.951 1860.976	1856.943 1860.968	192–206	R.ALDPAFKIEDPYSPR.I	4 4	y1–3;y12, b2	1.4	In-solution
β₁	2282.310 2286.329	2282.334 2286.359	194–200 + 207–217	L.DPAFKIE.D + R.IQNLLKITNLR.I	11 13	1: b3,y6 2: y1,2;b2,3	1.6	In-solution
β₁	2466.415 2470.444	2466.433 2470.458	192–200 +207–217	R.ALDPAFKIE.D +R.IQNLLKITNLR.I	7 6	2: b1,3;y1–4	1.5	In-gel
β₁	2466.455 2470.479	2466.433 2470.458	192–200 +207–217	R.ALDPAFKIE.D +R.IQNLLKITNLR.	9 9	1:b2,3 2:b2	1.6	In-solution
γ₁	2118.135 2122.156	2118.149 2122.174	128–143	R.LKFHTSRPESFAIYKR.T	7 8	y1,2,5–7,9 b7,14,15	1.8	In-solution
γ₁	2118.145 2122.166	2118.149 2122.174	128–143	R.LKFHTSRPESFAIYKR.T	2 4	y1, b15	1.7	In-gel
γ₁	2651.284 2655.307	2651.299 2655.324	189 – 212	S.DISPLTGGNVAFSTLEGRPSAYNF.D	6 6	y1,2,6– 12,14,18,21,23 b2–4,6,9–11,18	1.3	In-gel
γ₁	2651.286 2655.301	2651.299 2655.324	189 – 212	S.DISPLTGGNVAFSTLEGRPSAYNF.D	5 9	y2,6,23; b2,3,6	1.5	In-solution

Open in a new tab

Table 2.

Intramolecular cross-linked products of cross-linking reaction mixtures containing both laminin β₁ and γ₁ LN domains; the cross-linker (BS²G or BS³) is indicated. All cross-linked products were verified by MS/MS data. The cross-linked amino acids are printed in italics and are underlined.

Laminin β₁
MH+_exp	MH+_theo	Sequence	Amino acids and cross-linker	Δm [ppm]	MS/MS	D₀/D₄	Digestion
1293.787 1297.812	1293.793 1297.818	218–227	R.IKFVKLHTLG.D + BS³	5 5	y4–5; b5–9	1.3	In-solution
1293.821 1297.841	1293.793 1297.818	218–227	R.IKFVKLHTLG.D + BS³	22 18	y3-5; b5–9	1.8	In-gel
1672.823 1676.847	1672.822 1676.847	194–206	L.DPAFKIEDPYSPR.I + BS³	1 0	b3, y1–3,12	1.4	In-solution
1672.833 1676.860	1672.822 1676.847	194–206	L.DPAFKIEDPYSPR.I + BS³	7 8	b3; y1–3,9,12	3.0	In-gel
2282.322 2286.342	2282.312 2286.337	194-200+ 207-217	L.DPAFKIE.D +R.IQNLLKITNLR.I + BS³	4 2	1 :b3 2 :b2–4; y1–5,	1.8	In-gel
2466.430 2470.454	2466.433 2470.458	192-200+ 207–217	R.IQNLLKITNLR.I + R.ALDPAFKIE.D + BS³	1 2	-	1.6	In-gel
1421.842 1425.867	1421.847 1425.869	207–217	R.IQNLLKITNLR.I + BS²G	4 1	y1–5, b1–5	0.8	In-solution
1630.800 1634.793	1630.775 1634.800	194–206	L.DPAFKIEDPYSPR.I + BS²G	15 4	y1,2,12	0.6	In-gel
1836.901 1840.924	1836.865 1840.890	176–191	R.YSDIEPSTEGEVIFR.A amidated(C-term);+ BS²G	20 18	y1–8	0.7	In-solution
1836.921 1840.944	1836.865 1840.890	176–191	R.YSDIEPSTEGEVIFR.A amidated (C-term); + BS²G	30 29	y1–4	0.6	In-gel
1935.877 1939.900	1935.883 1939.908	152–168	Y.DCESSFPGISTGPMKKV.D + BS²G	3 4	y2–3,5–8,10–13, b4, b9, b6, b4	0.7	In-gel
2240.265 2244.286	2240.265 2244.290	194–200+ 207–217	L.DPAFKIE.D + R.IQNLLKITNLR.I + BS²G	0 2	b3, y1,4,17	0.7	In-solution
2240.260 2244.285	2240.265 2244.290	194–200+ 207–217	L.DPAFKIE.D + R.IQNLLKITNLR.I + BS²G	2 2	1: y1–5 ; b2–5 2 : b2–4	0.7	In-gel
2501.238 2505.257	2501.235 2505.260	136–147+ 192–200	R.SSDFGKTWGVYR.Y+R.ALDPAFKIE.D + BS²G	1 1	1 : b2, y1,2,5	0.5	In-solution
2501.216 2505.239	2501.235 2505.260	136–147+ 192–200	R.SSDFGKTWGVYR.Y+R.ALDPAFKIE.D + BS²G	8 8	A: b2,3,5; y1,4–6 b : b3	0.6	In-gel
2552.132 2556.149	2552.135 2556.160	152–173	Y.DCESSFPGISTGPMKKVDDIIC.D + BS²G	1 4	b6,9,10,17,21 y10,11,13,17	0.5	In-solution
2552.021 2556.044	2552.135 2556.160	152–173	Y.DCESSFPGISTGPMKKVDDIIC.D + BS²G	45 45	-	0.6	In-gel
2751.278 2755.315	2751.312 2755.337	33–55	K.LSVTSTCGLHKPEPYCIVSHLQE.D + BS²G	12 8	y3–y6,10–14 b2,4,10,11	0.5	In-solution
2751.272 2755.312	2751.312 2755.337	33–55	K.LSVTSTCGLHKPEPYCIVSHLQE.D + BS²G	15 9	b2,4,10 y5–6,10,12	0.6	In-gel
3078.499 3082.529	3078.502 3082.527	30–55	R.AQKLSVTSTCGLHKPEPYCIVSHLQE.D + BS²G	1 1	b15,17 y4,5,10,12	0.3	In-gel

Laminin γ₁
MH+_exp	MH+_theo	Sequence	Amino acids and cross-linker	Δm [ppm]	D₀/D₄	MS/MS	Digestion
2118.151 2122.176	2118.149 2122.174	128–143	R.LKFHTSRPESFAIYKR.T + BS³	1 1	1.3	b7,9,15 y1,7,9	In-solution
2118.172 2122.170	2118.149 2122.174	128–143	R.LKFHTSRPESFAIYKR.T + BS³	11 2	0.8	(y1,b15)	In-gel
2076.080 2080.125	2076.103 2080.128	128–143	R.LKFHTSRPESFAIYKR.T + BS²G	11 1	0.8	b1,15 y1,6,7,10	In-solution
2852.190 2856.198	2852.226 2856.251	147–169	E.DGPWIPYQYYSGSCENTYSKANR.G + BS²G	13 19	0.4	y1-3; b3– b5, b8, b16	In-solution
2949.432 2953.455	2949.431 2953.456	239–263	G.DDVFNEPKVLKSYYYAISDFAVGGR.C + BS²G	0 0	0.7	b18,b20, y1,3,19	In-solution

Open in a new tab

In Figure 4, the identification of a cross-link between lysines at positions 198 and 212 of laminin β₁ is presented exemplarily. MS/MS data unambiguously point to both lysines as reaction sites.

MALDI-TOF/TOF-MS/MS analysis of the cross-linking product with BS³ of laminin β₁ LN domain (amino acis194-200 connected with amino acids 207-217; [M+H]⁺_exp 2282.322), lysines at positions 198 and 212 were found to be connected (marked with asterisks). Upper panel: the identified fragment ions are shown in bold; lower panel: tandem MS (MS/MS); the insets show the y6 fragment ion and the precursor ion, both exhibiting the characteristic *D₀/D₄* isotope pattern.

Disulfide Bond Analysis

The analysis of disulfide bond patterns of laminin β₁ LN domain by offline nano-HPLC/MALDI-TOF/TOF mass spectrometry has been described recently (12). Briefly, after recording the mass spectra the putatively disulfide-linked peptides are subjected to LIFT (laser-induced fragmentation technique)-TOF/TOF-MS/MS to confirm the disulfide bond. Screening the fragment ion mass spectra of disulfide-linked peptides for characteristic 66-u patterns (34 u + 32 u), arising from symmetric and asymmetric cleavage of disulfide bonds, greatly facilitates their identification (42). In addition, MALDI in-source decay created the reduced ‘halves’ of the disulfide-linked peptides and subsequent LIFT-TOF/TOF-MS/MS of the reduced peptides confirmed their amino acid sequences. Using different enzymes for proteolytic digestion of the laminin γ₁ chain N-terminal protein fragment, a linear bonding pattern of the eight cysteine residues in the LN domain of the laminin γ₁ chain was observed with a strict sequential 1–2, 3–4, 5–6, 7–8 connectivity of the eight cysteines. This disulfide bonding pattern had already been reported for the recombinant laminin β1 chain fragment and a laminin fragment derived by elastase digestion of mouse tumor laminin-111, confirming that this pattern also occurs in native laminin (12). Table 3 summarizes the cysteines that are connected by disulfide bonds.

Table 3.

Distance constraints in laminin β₁ and γ₁ LN domains serving as basis for computational modeling (filtering criteria for C^A-C^A distances: disulfide bonds: 13 Å (favored distance 8 Å), cross-links: 30 Å (favored distance 24 Å); shaded areas: these constraints could not be employed as a direct structural filter because of the spatial proximity in the primary structure of laminin LN domains.

Laminin β1		Laminin γ1
Connected amino acids	Connection Type	Connected amino acids	Connection Type
C13-C18	Disulfide	C18-C28	Disulfide
K32-K43	BS²G	C47 -C55	Disulfide
C39-C48	Disulfide	C67 -C70	Disulfide
C59-C62	Disulfide	K129-K142	BS²G, BS³
K141-K198	BS²G	C160-C183	Disulfide
C153-C173	Disulfide	S208-Y210	BS³
K166-K167	BS²G	K246-K249	BS³
K198-Y203/S204	BS²G, BS³
K198-K212	BS²G, BS³
K212-T214	BS²G
K219-K222	BS³
K222-T225	BS²G

Open in a new tab

Structures of Laminin β₁ and γ₁ LN Domains

The secondary structure of LN domains was predicted using three different programs as described in the Materials and Methods resulting in a β-sheet content of ca. 30% for laminin β₁ and ca. 26% for laminin γ₁. Fold recognition using the 3D-Jury method (25) was performed using the sequences of all murine laminin LN domains and the homologous sequences of netrin I, II, and IV as input. A total of six template structures (1CZT, 1D7P, 1GOF, 1KEX, 1SDD, 1XPW) were identified with 3D-Jury scores ranging between 39 and 58 (Table 4). All six structures share a common galactose-binding domain-like fold. In a complementary approach, 35 additional templates were determined by sequence-structure alignment threading servers Phyre, Libellula, Wurst, HHPred, and Loop (28), (29), (30), (31), (32) (Table 4). In order to eliminate false positives, a pairwise structure-structure comparison of all putative templates was executed. 14 out of 40 templates displayed the galactose-binding domain-like fold (Table 4). The 26 remaining templates were removed as false positives as they exhibited similarity with one other template at most (Figure 2). Sequence identities between the templates and laminin β₁ and γ₁ LN domains ranged between 10 to 18%.

Sequence Assignment and Modeling of Loops

If sequence identities are lower than 50%, selection of the best possible template and accurate alignment of target and template sequences are crucial for success in comparative modeling (43). To comprehensively test all possible templates and alignments a parametric ensemble alignment approach was used for all 14 template proteins (38). Briefly, for each template laminin β₁ and γ₁ LN domain pair sequence-sequence alignments were created using a scoring function that includes sequence similarity (identity, Blosum45, Pam250, Blast), secondary structure (programs JUFO (19) (35), Psipred (36), SAM (37)), and sequence gaps (gap opening and gap extension) with variable weights. Out of the originally 52,200 alignments redundant alignments were removed as well as alignments that contained gaps too large to be closed by the respective loop. A total of 4,333 and 7,306 sequence-sequence alignments were input to comparative modeling for β₁ and γ₁ LN domains, respectively.

Construction of Comparative Models

The coordinates of the backbone of aligned amino acids were copied from the template structure into an initial model. Missing regions were reconstructed using Rosetta (39). Altogether, 4,333 alignments (for laminin β₁) and 7,306 alignments (for laminin γ₁) resulted in complete comparative models. Side chains were added using Rosetta rotamer libraries (44) assuming that all cysteines are involved in disulfide bonds. Models were removed if not all loop regions could be closed, the compactness of the model did not match native protein domains of the same size, not all cysteines were involved in disulfide bridges (or being at least in close distance to other cysteines <13Å), the fold deviated significantly from the starting template, or the model was energetically clearly disfavored, i.e. had substantial clashes (Table 5).

Evaluation of the Models Using Experimental Data

The models obtained by using different templates, sequence-sequence alignments, modeling of loops, and relaxation of structures were evaluated using the distance constraints imposed on the laminin LN domains by “natural” cross-links (disulfide bonds) and “chemically introduced” cross-links. The distance constraints used for verifying the models are summarized in Table 3. The identification of a disulfide bond implies that the C^A atoms of the cysteines involved are within a distance of ca. 7 Å. Every C^A-C^A distance of a disulfide bond, in which the cysteines are separated by at least two amino acids in sequence can be used as structural constraint. In case the cysteines are separated by less than two amino acids, the distance is below 9 Å and the distance constraint is always fulfilled. A similar approximation can be made for the C^A-C^A distances of amino acids that are cross-linked with BS³: If the C^A atoms are at least seven positions apart from each other, these cross-links can be used as useful distance constraints. Thus, seven out of twelve and five out of seven constraints have been applied as restraint for laminin β₁ and γ₁, respectively. Nevertheless, as already described by Alexander et al. 41) the information content of the distance constraints depend on the sequence separation [aa] / Euclidian separation [Å] ratio.

The structures of laminin β₁ and γ₁ LN domains calculated by Rosetta were compared with the experimentally obtained distance constraints (Table 3, Supplementary Table 3). All structures were grouped based on the number of fulfilled distance constraints. A distance constraint was considered to be fulfilled if the C^A-C^A distance of a cross-link was below 30 Å and the C^A-C^A distance of a disulfide bridge below 13 Å. Both filter criteria are somewhat relaxed to the distances listed above in order to account for inaccuracies in the comparative models (Table 5). Only approximately 1% of all models fulfilled all distance constraints highlighting the discriminative power of even few experimental distance measurements (Table 5).

All models fulfilling all distance constraints exhibited the same topology (more than 100 out of 240 amino acids could be superimposed with an RMSD < 6 Å), however, shifts in the sequence structure alignment of up to 40 amino acids occur. In a second step, a composite scoring function described in Materials and Methods integrating the deviation of the experimental constraints from the theoretical maximum distance as well as the Rosetta energy and compactness of the structure were used to evaluate the quality of the models and create a ranking (Supplementary Table 4). In the best scoring models of the LN domains of laminin β₁ and γ₁ all of the distance constraints deviate from the theoretical distance by less than 5 Å (Figure 5).

The lowest scoring Rosetta structures for (A) laminin β₁ and (B) γ₁ LN domains that fulfill all experimental distance constraints. The amino acids involved in the constraints are indicated, structures are visualized by PyMOL version 0.99rc6). Cysteines at positions 13 (laminin β₁) and 18 (laminin γ₁), which are not part of laminin N-terminal domains, are marked with asterisks.

For laminin β₁, three structures (Table 5) fulfilled all seven distance constraints and exhibited structural similarity, i.e., more than 110 C^A atoms are superimposed with an RMSD below 4 Å. For laminin γ₁, five distance constraints were fulfilled by 12 models (Table 5). Nine structures could be superimposed in more than 120 C^A atoms with an RMSD below 4 Å. For the remaining three models more than 50 C^A atoms could be superimposed with an RMSD below 4 Å.

Leave-One-Out Cross-Validation and Evaluation with Structure Quality

To assess the quality of laminin β₁ and γ₁, all models fulfilling 6 out of 7 or 4 out of 5 distance constraints, respectively, were pairwise compared to the best solution structures for laminin β₁ and γ₁ LN domains (Figure 5). A test of the precision of the modeling efforts yielded 17 out of 33 (laminin β₁) and 141 out of 293 models (laminin γ₁) with a high structural similarity, i.e. more than 80 C^A atom are superimposed with a RMSD below 4 Å. Accuracy was assessed by testing whether or not the left out distance constrained is fulfilled by the models. On the basis of the LOO cross-validation, the uncertainty of the predicted structures was estimated. For the models of laminin β₁ and γ₁ LN domains, which were obtained with 6 out of 7 (4 out of 5) distance constraints, 78% (61%) of all models fulfilled the missing distance constraint. If the same models were validated by their structure homology to the best model, 88% (64%) of the models exhibited the correct fold (Table 7). In conclusion, 78% and 61% of the leave-one-out structures fulfill the left-out distance constraint.

In addition, the programs VADAR and MolProbity were used for a Rosetta-independent measurement of structure quality (Supplementary Table 4). The quality criteria defined by the two programs are all in a reasonable range for the models that fulfill all distance constraints. As only minor differences in quality scores were observed between the models, these criteria were not used for further ranking the models.

Influence of Templates

To analyze the influence of the template on the final model best structures were compared for the 14 templates (Table 4). For laminin γ₁, for all 14 templates structures were obtained, which fulfilled at least 4 out of 5 distance constraints and were energetically favored as well as structurally compact. From these, five structures displayed a fold identical to the best overall model. For laminin β₁, only nine templates yielded models, which fulfilled six out of seven distance constraints. All displayed the identical fold as in the overall best structure (Supplementary Table 2).

Influence of Experimental Distance Constraints

A statistics on the effectiveness of each constraint revealed that all distance constraints reduced conformational space. However, the fraction of models excluded ranged from 0.1% to 80.7% (Table 7). As expected, the effectiveness of an experimental constraint increases with increasing sequence separation and decreasing limit for the Euclidean distance.

Implications of Our Models for Laminin Function

Laminins containing the γ1 chain are essential for mammalian development, as shown by the fact that mice lacking expression of this chain die at day 5.5 of embryonal development (45). Laminins assemble into a macromolecular network by interactions of their LN domain containing N-terminal short arms. This assembly renders them immobile in the basement membrane and allow their C-terminal domains to attach cells onto this structure by interactions with cell surface integrin or dystroglycan receptors. The crucial role of laminin LN domains in laminin network formation has led to intensive attempts to determine their structure by crystallographic techniques. However, it turned out to be extremely difficult to obtain well organized crystals for LN domains from laminins or the homologous netrins and for this reason no LN domain crystal structures have been published so far. The lack of detailed structural information has hampered the study of the crucial LN domain interactions.

The modeled structures that we present herein open new possibilities to identify binding surfaces on LN domains as well as ligand structures. Based on the models site-directed mutagenesis can be performed on surface exposed amino acid residues and the consequences of these mutations can be studied in binding assays for laminin short arm self-interactions (3). Further chemical cross-linking experiments can be performed on interacting laminin short arms to identify the domains within laminin, to which the LN domains bind. Such studies will provide a structural model of self-interacting laminin molecules, which will significantly add to our understanding of basement membrane structure. As mutations within LN domains that affect their interactions can cause severe retinal phenotypes (46) novel knowledge on pathogenic mechanisms may come about.

CONCLUSIONS

With our approach integrating chemical cross-linking, mass spectrometry, and computational modeling we were able to compute the first experimentally validated low-resolution structures of laminin N-terminal domains. After the identification of 14 potentially suited template structures for each laminin β₁ and γ₁ LN domain, several hundreds of models were created with the modeling program Rosetta. The models were filtered and ranked by both the correlation with experimentally derived distance constraints from natural cross-links, i.e., disulfide bonds, and chemical cross-links created by reaction with two amine-reactive cross-linkers and parameter for model quality i.e., energy and solvent accessible surface area.

The questions how well-defined the models are, how many, and which kind of distance constraints are needed for creating a good model, and how experimental data and template structure influence the model were addressed by leave-one-out cross-validation combined with a structure-structure alignment as well as by Rosetta-independent structure evaluation programs.

Although none of the techniques employed herein is completely new, we argue that the presented protocol is novel as it combines ten computational methods with a complex experimental setup integrating chemical cross-linking, isotope labeling, and HPLC/MS analysis. As such, we consider our approach to be a valid alternative for deriving structural models of proteins, which are not amenable to the high-resolution methods for protein 3D structure analysis, such as NMR spectroscopy or X-ray crystallography.

Supplementary Material

NIHMS824017-supplement-01.pdf^{(492.7KB, pdf)}

Acknowledgments

This work was supported by grants SI 867/7-1 and SM 65/1-3 from the DFG (Deutsche Forschungsgemeinschaft) and by the Köln Fortune program of the Medical Faculty of the University of Cologne. The authors thank Dr. Tibor Kohajda for programming the software tool IsoFind and Dr. Christian Ihling for discussions and valuable advice on MALDI-TOF/TOF-MS and ESI-FTICR-MS (Apex II) measurements. Karl Mechtler and Christoph Stingl, IMP Vienna, are acknowledged for LTQ-FT-MS measurements. SK acknowledges support from the DFG-funded Graduiertenkolleg 1026 “Conformational Transitions in Macromolecular Interactions” at the Martin Luther University Halle-Wittenberg.

ABBREVIATIONS

BS²G: Bis(sulfosuccinimidyl)glutarate
BS³: Bis(sulfosuccinimidyl)suberate
DHB: 2,5-Dihydroxybenzoic acid
DTT: Dithiothreitol
ESI-FTICR-MS: Electrospray ionization Fourier transform ion cyclotron resonance mass spectrometry
FA: Formic acid
HCCA: α-Cyano-4-hydroxy cinnamic acid
HPLC: High-performance liquid chromatography
LHRH: Luteinizing hormone releasing hormone
LIFT: laser-induced fragmentation technique
LN: laminin N-terminal
LOO: Leave-one-out
MALDI-TOF-MS: Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
NHS: N-hydroxysuccinimide
PDB: Protein data bank
RMSD: Root mean square deviation (calculated for the backbone coordinates)
SASA: Solvent accessible surface area
S/N: Signal-to-noise ratio
TFA: Trifluoroacetic acid
TIC: Total ion current

REFERENCES

1.Tunggal P, Smyth N, Paulsson M, Ott M-C. Microscopy Research and Technique. 2000;51:214–227. doi: 10.1002/1097-0029(20001101)51:3<214::AID-JEMT2>3.0.CO;2-J. [DOI] [PubMed] [Google Scholar]
2.Aumailley M, Bruckner-Tuderman L, Carter WG, Deutzmann R, Edgar D, Ekblom P, Engel J, Engvall E, Hohenester E, Jones JCR, Kleinman HK, Marinkovich MP, Martin GR, Mayer U, Meneguzzi G, Miner JH, Miyazaki K, Patarroyo M, Paulsson M, Quaranta V, Sanes JR, Sasaki T, Sekiguchi K, Sorokin LM, Talts JF, Tryggvason K, Uitto J, Virtanen I, von der Mark K, Wewer UM, Yamada Y, Yurchenco PD. Matrix Biology. 2005;24:326–332. doi: 10.1016/j.matbio.2005.05.006. [DOI] [PubMed] [Google Scholar]
3.Odenthal U, Haehn S, Tunggal P, Merkl B, Schomburg D, Frie C, Paulsson M, Smyth N. Journal of Biological Chemistry. 2004;279:44504–44512. doi: 10.1074/jbc.M402455200. [DOI] [PubMed] [Google Scholar]
4.Yurchenco PD, Tsilibary EC, Charonis AS, Furthmayr H. Journal of Biological Chemistry. 1985;260:7636–7644. [PubMed] [Google Scholar]
5.McKee KK, Harrison D, Capizzi S, Yurchenco PD. Journal of Biological Chemistry. 2007;282:21437–21447. doi: 10.1074/jbc.M702963200. [DOI] [PubMed] [Google Scholar]
6.Schneiders FI, Maertens B, Bose K, Li Y, Brunken WJ, Paulsson M, Smyth N, Koch M. Journal of Biological Chemistry. 2007;282:23750–23758. doi: 10.1074/jbc.M703137200. [DOI] [PubMed] [Google Scholar]
7.Young MM, Tang N, Hempel JC, Oshiro CM, Taylor EW, Kuntz ID, Gibson BW, Dollinger G. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:5802–5806. doi: 10.1073/pnas.090099097. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Back JW, Jong Ld, Muijsers AO, Koster CGd. Journal of Molecular Biology. 2003;331:303–313. doi: 10.1016/s0022-2836(03)00721-6. [DOI] [PubMed] [Google Scholar]
9.Sinz A. Journal of Mass Spectrometry. 2003;38:1225–1237. doi: 10.1002/jms.559. [DOI] [PubMed] [Google Scholar]
10.Sinz A. Mass Spectrometry Reviews. 2006;25:663–682. doi: 10.1002/mas.20082. [DOI] [PubMed] [Google Scholar]
11.Hermanson G, editor. Bioconjugate Techniques. 2nd. Amsterdam, NL: Elsevier; 2008. [Google Scholar]
12.Kalkhof S, Haehn S, Ihling C, Paulsson M, Smyth N, Sinz A. Rapid Communications in Mass Spectrometry. 2008;22:1933–1940. doi: 10.1002/rcm.3576. [DOI] [PubMed] [Google Scholar]
13.Kalkhof S, Ihling C, Mechtler K, Sinz A. Analytical Chemistry. 2004;77:495–503. doi: 10.1021/ac0487294. [DOI] [PubMed] [Google Scholar]
14.Schmidt A, Kalkhof S, Ihling C, Cooper D, Sinz A. Eur J Mass Spectrom (Chichester, Eng) 2005;11:525–534. doi: 10.1255/ejms.748. [DOI] [PubMed] [Google Scholar]
15.Ihling C, Schmidt A, Kalkhof S, Schulz DM, Stingl C, Mechtler K, Haack M, Beck-Sickinger AG, Cooper DMF, Sinz A. Journal of the American Society for Mass Spectrometry. 2006;17:1100–1113. doi: 10.1016/j.jasms.2006.04.020. [DOI] [PubMed] [Google Scholar]
16.Simons KT, Kooperberg C, Huang E, Baker D. Journal of Molecular Biology. 1997;268:209–225. doi: 10.1006/jmbi.1997.0959. [DOI] [PubMed] [Google Scholar]
17.Bradley P, Malmström L, Qian B, Schonbrun J, Chivian D, Kim DE, Meiler J, Misura KMS, Baker D. Proteins: Structure, Function, and Bioinformatics. 2005;61:128–134. doi: 10.1002/prot.20729. [DOI] [PubMed] [Google Scholar]
18.Bowers PM, Strauss CEM, Baker D. Journal of Biomolecular NMR. 2000;18:311–318. doi: 10.1023/a:1026744431105. [DOI] [PubMed] [Google Scholar]
19.Meiler J, Baker D. Journal of Magnetic Resonance. 2005;173:310–316. doi: 10.1016/j.jmr.2004.11.031. [DOI] [PubMed] [Google Scholar]
20.Alexander N, Al-Mestarihi A, Bortolus M, McHaourab H, Meiler J. 2008;16:181–195. doi: 10.1016/j.str.2007.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Kalkhof S, Sinz A. Anal Bioanal Chem. 2008;392:305–312. doi: 10.1007/s00216-008-2231-5. [DOI] [PubMed] [Google Scholar]
22.Laemli UK. Nature. 1970;227:680–685. doi: 10.1038/227680a0. [DOI] [PubMed] [Google Scholar]
23.Schulz DM, Kalkhof S, Schmidt A, Ihling C, Stingl C, Mechtler K, Zschörnig O, Sinz A. Proteins: Structure, Function, and Bioinformatics. 2007;69:254–269. doi: 10.1002/prot.21445. [DOI] [PubMed] [Google Scholar]
24.Mädler S, Bich C, Touboul D, Zenobi R. Journal of Mass Spectrometry. 2009;44:694–706. doi: 10.1002/jms.1544. [DOI] [PubMed] [Google Scholar]
25.Ginalski K, Elofsson A, Fischer D, Rychlewski L. Bioinformatics. 2003;19:1015–1018. doi: 10.1093/bioinformatics/btg124. [DOI] [PubMed] [Google Scholar]
26.Huang X, Miller W. Adv. Appl. Math. 1991;12:337–357. [Google Scholar]
27.Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Nucl. Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Kelley LA, Sternberg MJE. Nat. Protocols. 2009;4:363–371. doi: 10.1038/nprot.2009.2. [DOI] [PubMed] [Google Scholar]
29.Juan D, Graña O, Pazos F, Fariselli P, Casadio R, Valencia A. Proteins: Structure, Function, and Genetics. 2003;50:600–608. doi: 10.1002/prot.10322. [DOI] [PubMed] [Google Scholar]
30.Torda AE, Procter JB, Huber T. Nucl. Acids Res. 2004;32:W532–W535. doi: 10.1093/nar/gkh357. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Soding J. Bioinformatics. 2005;21:951–960. doi: 10.1093/bioinformatics/bti125. [DOI] [PubMed] [Google Scholar]
32.Teodorescu O, Galor T, Pillardy J, Elber R. Proteins: Structure, Function, and Bioinformatics. 2004;54:41–48. doi: 10.1002/prot.10474. [DOI] [PubMed] [Google Scholar]
33.Ortiz AR, Strauss CEM, Olmea O. Protein Science. 2002;11:2606–2621. doi: 10.1110/ps.0215902. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Dong E, Smith J, Heinze S, Alexander N, Meiler J. Gene. 2008;422:41–46. doi: 10.1016/j.gene.2008.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Meiler J, Prompers JJ, Peti W, Griesinger C, Bruschweiler R. Journal of the American Chemical Society. 2001;123:6098–6107. doi: 10.1021/ja010002z. [DOI] [PubMed] [Google Scholar]
36.Jones DT. Journal of Molecular Biology. 1999;292:195–202. doi: 10.1006/jmbi.1999.3091. [DOI] [PubMed] [Google Scholar]
37.Karplus K, Barrett C, Cline M, Diekhans M, Grate L, Hughey R. Proteins: Structure, Function, and Genetics. 1999;37:121–125. doi: 10.1002/(sici)1097-0134(1999)37:3+<121::aid-prot16>3.3.co;2-h. [DOI] [PubMed] [Google Scholar]
38.Chivian D, Baker D. Nucl. Acids Res. 2006;34:e112. doi: 10.1093/nar/gkl480. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Rohl CA, Strauss CEM, Chivian D, Baker D. Proteins: Structure, Function, and Bioinformatics. 2004;55:656–677. doi: 10.1002/prot.10629. [DOI] [PubMed] [Google Scholar]
40.Willard L, Ranjan A, Zhang H, Monzavi H, Boyko RF, Sykes BD, Wishart DS. Nucl. Acids Res. 2003;31:3316–3319. doi: 10.1093/nar/gkg565. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW, Arendall WB, III, Snoeyink J, Richardson JS, Richardson DC. Nucl. Acids Res. 2007;35:W375–W383. doi: 10.1093/nar/gkm216. [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Schnaible V, Wefing S, Resemann A, Suckau D, Buecker A, Wolf-Kuemmeth S, Hoffmann D. Analytical Chemistry. 2002;74:4980–4988. doi: 10.1021/ac025807j. [DOI] [PubMed] [Google Scholar]
43.Rost B, O'Donoghue S. Comput. Appl. Biosci. 1997;13:345–356. doi: 10.1093/bioinformatics/13.4.345. [DOI] [PubMed] [Google Scholar]
44.Kuhlman B, Baker D. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:10383–10388. doi: 10.1073/pnas.97.19.10383. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Edwards MM, Mammadova-Bach E, Alpy F, Klein A, Hicks WL, Roux M, Simon-Assmann P, Smith RS, Orend G, Wu J, Peachey NS, Naggert JK, Lefevbre O, Nishina PM. J. Biol. Chem. 2010;285:7697–7711. doi: 10.1074/jbc.M109.069575. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Smyth NR, Vatansever HS, Murray P, Meyer M, Frie C, Paulsson M, Edgar D. J. Cell Biol. 1999;144:151–160. doi: 10.1083/jcb.144.1.151. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Macedo-Ribeiro S, Bode W, Huber R, Quinn-Allen MA, Kim SW, Ortel TL, Bourenkov GP, Bartunik HD, Stubbs MT, Kane WH, Fuentes-Prior P. Nature. 1999;402:434–439. doi: 10.1038/46594. [DOI] [PubMed] [Google Scholar]
48.Pratt KP, Shen BW, Takeshima K, Davie EW, Fujikawa K, Stoddard BL. Nature. 1999;402:439–442. doi: 10.1038/46601. [DOI] [PubMed] [Google Scholar]
49.Gaskell A, Crennell S, Taylor G. Structure. 1995;3:1197–1205. doi: 10.1016/s0969-2126(01)00255-6. [DOI] [PubMed] [Google Scholar]
50.Ito N, Phillips SEV, Stevens C, Ogel ZB, McPherson MJ, Keen JN, Yadav KDS, Knowles PF. Nature. 1991;350:87–90. doi: 10.1038/350087a0. [DOI] [PubMed] [Google Scholar]
51.Au SWN, Leng X, Harper JW, Barford D. Journal of Molecular Biology. 2002;316:955–968. doi: 10.1006/jmbi.2002.5399. [DOI] [PubMed] [Google Scholar]
52.Boraston AB, Nurizzo D, Notenboom V, Ducros V, Rose DR, Kilburn DG, Davies GJ. Journal of Molecular Biology. 2002;319:1143–1156. doi: 10.1016/S0022-2836(02)00374-1. [DOI] [PubMed] [Google Scholar]
53.Wendt KS, Vodermaier HC, Jacob U, Gieffers C, Gmachl M, Peters J-M, Huber R, Sondermann P. Nature Structural Biology. 2001;8:784–788. doi: 10.1038/nsb0901-784. [DOI] [PubMed] [Google Scholar]
54.Bianchet MA, Odom EW, Vasta GR, Amzel LM. Nature Structural Biology. 2002;9:628–634. doi: 10.1038/nsb817. [DOI] [PubMed] [Google Scholar]
55.Firbank SJ, Rogers MS, Wilmot CM, Dooley DM, Halcrow MA, Knowles PF, McPherson MJ, Phillips SEV. Proceedings of the National Academy of Sciences of the United States of America. 2001;98:12932–12937. doi: 10.1073/pnas.231463798. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Lee CC, Kreusch A, McMullan D, Ng K, Spraggon G. Structure. 2003;11:99–108. doi: 10.1016/s0969-2126(02)00941-3. [DOI] [PubMed] [Google Scholar]
57.Adams TE, Hockin MF, Mann KG, Everse SJ. Proceedings of the National Academy of Sciences of the United States of America. 2004;101:8918–8923. doi: 10.1073/pnas.0403072101. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Ramelot TA, Raman S, Kuzin AP, Xiao R, Ma L-C, Acton TB, Hunt JF, Montelione GT, Baker D, Kennedy MA. Proteins: Structure, Function, and Bioinformatics. 2009;75:147–167. doi: 10.1002/prot.22229. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Marintchev A, Mullen MA, Maciejewski MW, Pan B, Gryk MR, Mullen GP. Nat Struct Mol Biol. 1999;6:884–893. doi: 10.1038/12347. [DOI] [PubMed] [Google Scholar]
60.Tews I, Perrakis A, Oppenheim A, Dauter Z, Wilson KS, Vorgias CE. Nat Struct Mol Biol. 1996;3:638–648. doi: 10.1038/nsb0796-638. [DOI] [PubMed] [Google Scholar]
61.JointCenterForStructuralGenomics. PDB entry 1VPQ To be published. [Google Scholar]
62.Guo H-C, Jardetzky TS, Garrettt TPJ, Lane WS, Strominger JL, Wiley DC. Nature. 1992;360:364–366. doi: 10.1038/360364a0. [DOI] [PubMed] [Google Scholar]
63.Cardoso RMF, Silva CHTP, Ulian de Araujo AP, Tanaka T, Tanaka M, Garratt RC. Acta Crystallographica Section D. 2004;60:1569–1578. doi: 10.1107/S0907444904016798. [DOI] [PubMed] [Google Scholar]
64.Setayesh FR, DeCorte BL, Horton P, Harris CM, Harris TM, Stone MP. Chemical Research in Toxicology. 1998;11:766–777. doi: 10.1021/tx9800147. [DOI] [PubMed] [Google Scholar]
65.Momma K, Mishima Y, Hashimoto W, Mikami B, Murata K. Biochemistry. 2005;44:5053–5064. doi: 10.1021/bi047781r. [DOI] [PubMed] [Google Scholar]
66.JointCenterForStructuralGenomics. PDB entry 1VKO To be published. [Google Scholar]
67.Mol CD, Dougan DR, Schneider TR, Skene RJ, Kraus ML, Scheibe DN, Snell GP, Zou H, Sang B-C, Wilson KP. Journal of Biological Chemistry. 2004;279:31655–31663. doi: 10.1074/jbc.M403319200. [DOI] [PubMed] [Google Scholar]
68.Skelton NJ, Quan C, Reilly D, Lowman H. Structure. 1999;7:157–168. doi: 10.1016/S0969-2126(99)80022-7. [DOI] [PubMed] [Google Scholar]
69.Li H, Tomizawa T, Tochio N, Muto Y, Koshiba S, Inoue M, Kigawa T, Yokoyama S. PDB entry 2COC To be published. [Google Scholar]
70.Udomsinprasert R, Pongjaroenkit S, Wongsantichon J, Oakley AJ, Prapanthadara L-a, Wilce MCJ, Ketterman AJ. Biochem. J. 2005;388:763–771. doi: 10.1042/BJ20042015. [DOI] [PMC free article] [PubMed] [Google Scholar]
71.Ichiyanagi K, Ishino Y, Ariyoshi M, Komori K, Morikawa K. Journal of Molecular Biology. 2000;300:889–901. doi: 10.1006/jmbi.2000.3873. [DOI] [PubMed] [Google Scholar]
72.López-Lucendo MF, Solís D, André S, Hirabayashi J, Kasai K-i, Kaltner H, Gabius H-J, Romero A. Journal of Molecular Biology. 2004;343:957–970. doi: 10.1016/j.jmb.2004.08.078. [DOI] [PubMed] [Google Scholar]
73.Katsuya Y, Mezaki Y, Kubota M, Matsuura Y. Journal of Molecular Biology. 1998;281:885–897. doi: 10.1006/jmbi.1998.1992. [DOI] [PubMed] [Google Scholar]
74.Koch AW, Farooq A, Shan W, Zeng L, Colman DR, Zhou M-M. Structure. 2004;12:793–805. doi: 10.1016/j.str.2004.02.034. [DOI] [PubMed] [Google Scholar]
75.Abad MC, Arni RK, Grella DK, Castellino FJ, Tulinsky A, Geiger JH. Journal of Molecular Biology. 2002;318:1009–1017. doi: 10.1016/S0022-2836(02)00211-5. [DOI] [PubMed] [Google Scholar]
76.Llera AS, Viedma F, Sánchez-Madrid F, Tormo J. Journal of Biological Chemistry. 2001;276:7312–7319. doi: 10.1074/jbc.M008573200. [DOI] [PubMed] [Google Scholar]
77.Baeyens KJ, De Bondt HL, Raeymaekers A, Fiers W, De Ranter CJ. Acta Crystallographica Section D. 1999;55:772–778. doi: 10.1107/s0907444998018435. [DOI] [PubMed] [Google Scholar]
78.Lubkowski J, Hennecke F, Pluckthun A, Wlodawer A. Nat Struct Mol Biol. 1998;5:140–147. doi: 10.1038/nsb0298-140. [DOI] [PubMed] [Google Scholar]
79.Tegoni M, Spinelli S, Verhoeyen M, Davis P, Cambillau C. Journal of Molecular Biology. 1999;289:1375–1385. doi: 10.1006/jmbi.1999.2845. [DOI] [PubMed] [Google Scholar]
80.Hart PJ, Deep S, Taylor AB, Shu Z, Hinck CS, Hinck AP. Nat Struct Mol Biol. 2002;9:203–208. doi: 10.1038/nsb766. [DOI] [PubMed] [Google Scholar]
81.Stams T, Chen Y, Boriack-Sjodin PA, Hurt JD, Liao J, May JA, Dean T, Laipis P, Silverman DN, Christianson DW. Protein Sci. 1998;7:556–563. doi: 10.1002/pro.5560070303. [DOI] [PMC free article] [PubMed] [Google Scholar]
82.Lamb AL, Wernimont AK, Pufahl RA, O'Halloran TV, Rosenzweig AC. Biochemistry. 2000;39:1589–1595. doi: 10.1021/bi992822i. [DOI] [PubMed] [Google Scholar]
83.Whittingtons DA, Grubb JH, Waheed A, Shah GN, Sly WS, Christianson DW. Journal of Biological Chemistry. 2004;279:7223–7228. doi: 10.1074/jbc.M310809200. [DOI] [PubMed] [Google Scholar]
84.Dayan SM, Van Donkelaar A, Kortt AA. Journal of Biological Chemistry. 1987;262:10287–10289. [PubMed] [Google Scholar]
85.Watson JN, Newstead S, Dookhun V, Taylor G, Bennet AJ. FEBS Letters. 2004;577:265–269. doi: 10.1016/j.febslet.2004.10.016. [DOI] [PubMed] [Google Scholar]
86.Zhang Y. BMC Bioinformatics. 2008;9:40. doi: 10.1186/1471-2105-9-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
87.Zhang Y. Proteins: Structure, Function, and Bioinformatics. 2009;77:100–113. [Google Scholar]
88.Soding J, Biegert A, Lupas AN. Nucl. Acids Res. 2005;33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
89.Wu S, Zhang Y. Nucl. Acids Res. 2007;35:3375–3382. doi: 10.1093/nar/gkm251. [DOI] [PMC free article] [PubMed] [Google Scholar]
90.Fiser A, Do RKG, Scaronali A. Protein Science. 2000;9:1753–1773. doi: 10.1110/ps.9.9.1753. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHMS824017-supplement-01.pdf^{(492.7KB, pdf)}

[R1] 1.Tunggal P, Smyth N, Paulsson M, Ott M-C. Microscopy Research and Technique. 2000;51:214–227. doi: 10.1002/1097-0029(20001101)51:3<214::AID-JEMT2>3.0.CO;2-J. [DOI] [PubMed] [Google Scholar]

[R2] 2.Aumailley M, Bruckner-Tuderman L, Carter WG, Deutzmann R, Edgar D, Ekblom P, Engel J, Engvall E, Hohenester E, Jones JCR, Kleinman HK, Marinkovich MP, Martin GR, Mayer U, Meneguzzi G, Miner JH, Miyazaki K, Patarroyo M, Paulsson M, Quaranta V, Sanes JR, Sasaki T, Sekiguchi K, Sorokin LM, Talts JF, Tryggvason K, Uitto J, Virtanen I, von der Mark K, Wewer UM, Yamada Y, Yurchenco PD. Matrix Biology. 2005;24:326–332. doi: 10.1016/j.matbio.2005.05.006. [DOI] [PubMed] [Google Scholar]

[R3] 3.Odenthal U, Haehn S, Tunggal P, Merkl B, Schomburg D, Frie C, Paulsson M, Smyth N. Journal of Biological Chemistry. 2004;279:44504–44512. doi: 10.1074/jbc.M402455200. [DOI] [PubMed] [Google Scholar]

[R4] 4.Yurchenco PD, Tsilibary EC, Charonis AS, Furthmayr H. Journal of Biological Chemistry. 1985;260:7636–7644. [PubMed] [Google Scholar]

[R5] 5.McKee KK, Harrison D, Capizzi S, Yurchenco PD. Journal of Biological Chemistry. 2007;282:21437–21447. doi: 10.1074/jbc.M702963200. [DOI] [PubMed] [Google Scholar]

[R6] 6.Schneiders FI, Maertens B, Bose K, Li Y, Brunken WJ, Paulsson M, Smyth N, Koch M. Journal of Biological Chemistry. 2007;282:23750–23758. doi: 10.1074/jbc.M703137200. [DOI] [PubMed] [Google Scholar]

[R7] 7.Young MM, Tang N, Hempel JC, Oshiro CM, Taylor EW, Kuntz ID, Gibson BW, Dollinger G. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:5802–5806. doi: 10.1073/pnas.090099097. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Back JW, Jong Ld, Muijsers AO, Koster CGd. Journal of Molecular Biology. 2003;331:303–313. doi: 10.1016/s0022-2836(03)00721-6. [DOI] [PubMed] [Google Scholar]

[R9] 9.Sinz A. Journal of Mass Spectrometry. 2003;38:1225–1237. doi: 10.1002/jms.559. [DOI] [PubMed] [Google Scholar]

[R10] 10.Sinz A. Mass Spectrometry Reviews. 2006;25:663–682. doi: 10.1002/mas.20082. [DOI] [PubMed] [Google Scholar]

[R11] 11.Hermanson G, editor. Bioconjugate Techniques. 2nd. Amsterdam, NL: Elsevier; 2008. [Google Scholar]

[R12] 12.Kalkhof S, Haehn S, Ihling C, Paulsson M, Smyth N, Sinz A. Rapid Communications in Mass Spectrometry. 2008;22:1933–1940. doi: 10.1002/rcm.3576. [DOI] [PubMed] [Google Scholar]

[R13] 13.Kalkhof S, Ihling C, Mechtler K, Sinz A. Analytical Chemistry. 2004;77:495–503. doi: 10.1021/ac0487294. [DOI] [PubMed] [Google Scholar]

[R14] 14.Schmidt A, Kalkhof S, Ihling C, Cooper D, Sinz A. Eur J Mass Spectrom (Chichester, Eng) 2005;11:525–534. doi: 10.1255/ejms.748. [DOI] [PubMed] [Google Scholar]

[R15] 15.Ihling C, Schmidt A, Kalkhof S, Schulz DM, Stingl C, Mechtler K, Haack M, Beck-Sickinger AG, Cooper DMF, Sinz A. Journal of the American Society for Mass Spectrometry. 2006;17:1100–1113. doi: 10.1016/j.jasms.2006.04.020. [DOI] [PubMed] [Google Scholar]

[R16] 16.Simons KT, Kooperberg C, Huang E, Baker D. Journal of Molecular Biology. 1997;268:209–225. doi: 10.1006/jmbi.1997.0959. [DOI] [PubMed] [Google Scholar]

[R17] 17.Bradley P, Malmström L, Qian B, Schonbrun J, Chivian D, Kim DE, Meiler J, Misura KMS, Baker D. Proteins: Structure, Function, and Bioinformatics. 2005;61:128–134. doi: 10.1002/prot.20729. [DOI] [PubMed] [Google Scholar]

[R18] 18.Bowers PM, Strauss CEM, Baker D. Journal of Biomolecular NMR. 2000;18:311–318. doi: 10.1023/a:1026744431105. [DOI] [PubMed] [Google Scholar]

[R19] 19.Meiler J, Baker D. Journal of Magnetic Resonance. 2005;173:310–316. doi: 10.1016/j.jmr.2004.11.031. [DOI] [PubMed] [Google Scholar]

[R20] 20.Alexander N, Al-Mestarihi A, Bortolus M, McHaourab H, Meiler J. 2008;16:181–195. doi: 10.1016/j.str.2007.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Kalkhof S, Sinz A. Anal Bioanal Chem. 2008;392:305–312. doi: 10.1007/s00216-008-2231-5. [DOI] [PubMed] [Google Scholar]

[R22] 22.Laemli UK. Nature. 1970;227:680–685. doi: 10.1038/227680a0. [DOI] [PubMed] [Google Scholar]

[R23] 23.Schulz DM, Kalkhof S, Schmidt A, Ihling C, Stingl C, Mechtler K, Zschörnig O, Sinz A. Proteins: Structure, Function, and Bioinformatics. 2007;69:254–269. doi: 10.1002/prot.21445. [DOI] [PubMed] [Google Scholar]

[R24] 24.Mädler S, Bich C, Touboul D, Zenobi R. Journal of Mass Spectrometry. 2009;44:694–706. doi: 10.1002/jms.1544. [DOI] [PubMed] [Google Scholar]

[R25] 25.Ginalski K, Elofsson A, Fischer D, Rychlewski L. Bioinformatics. 2003;19:1015–1018. doi: 10.1093/bioinformatics/btg124. [DOI] [PubMed] [Google Scholar]

[R26] 26.Huang X, Miller W. Adv. Appl. Math. 1991;12:337–357. [Google Scholar]

[R27] 27.Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Nucl. Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Kelley LA, Sternberg MJE. Nat. Protocols. 2009;4:363–371. doi: 10.1038/nprot.2009.2. [DOI] [PubMed] [Google Scholar]

[R29] 29.Juan D, Graña O, Pazos F, Fariselli P, Casadio R, Valencia A. Proteins: Structure, Function, and Genetics. 2003;50:600–608. doi: 10.1002/prot.10322. [DOI] [PubMed] [Google Scholar]

[R30] 30.Torda AE, Procter JB, Huber T. Nucl. Acids Res. 2004;32:W532–W535. doi: 10.1093/nar/gkh357. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] 31.Soding J. Bioinformatics. 2005;21:951–960. doi: 10.1093/bioinformatics/bti125. [DOI] [PubMed] [Google Scholar]

[R32] 32.Teodorescu O, Galor T, Pillardy J, Elber R. Proteins: Structure, Function, and Bioinformatics. 2004;54:41–48. doi: 10.1002/prot.10474. [DOI] [PubMed] [Google Scholar]

[R33] 33.Ortiz AR, Strauss CEM, Olmea O. Protein Science. 2002;11:2606–2621. doi: 10.1110/ps.0215902. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] 34.Dong E, Smith J, Heinze S, Alexander N, Meiler J. Gene. 2008;422:41–46. doi: 10.1016/j.gene.2008.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] 35.Meiler J, Prompers JJ, Peti W, Griesinger C, Bruschweiler R. Journal of the American Chemical Society. 2001;123:6098–6107. doi: 10.1021/ja010002z. [DOI] [PubMed] [Google Scholar]

[R36] 36.Jones DT. Journal of Molecular Biology. 1999;292:195–202. doi: 10.1006/jmbi.1999.3091. [DOI] [PubMed] [Google Scholar]

[R37] 37.Karplus K, Barrett C, Cline M, Diekhans M, Grate L, Hughey R. Proteins: Structure, Function, and Genetics. 1999;37:121–125. doi: 10.1002/(sici)1097-0134(1999)37:3+<121::aid-prot16>3.3.co;2-h. [DOI] [PubMed] [Google Scholar]

[R38] 38.Chivian D, Baker D. Nucl. Acids Res. 2006;34:e112. doi: 10.1093/nar/gkl480. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] 39.Rohl CA, Strauss CEM, Chivian D, Baker D. Proteins: Structure, Function, and Bioinformatics. 2004;55:656–677. doi: 10.1002/prot.10629. [DOI] [PubMed] [Google Scholar]

[R40] 40.Willard L, Ranjan A, Zhang H, Monzavi H, Boyko RF, Sykes BD, Wishart DS. Nucl. Acids Res. 2003;31:3316–3319. doi: 10.1093/nar/gkg565. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] 41.Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW, Arendall WB, III, Snoeyink J, Richardson JS, Richardson DC. Nucl. Acids Res. 2007;35:W375–W383. doi: 10.1093/nar/gkm216. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R42] 42.Schnaible V, Wefing S, Resemann A, Suckau D, Buecker A, Wolf-Kuemmeth S, Hoffmann D. Analytical Chemistry. 2002;74:4980–4988. doi: 10.1021/ac025807j. [DOI] [PubMed] [Google Scholar]

[R43] 43.Rost B, O'Donoghue S. Comput. Appl. Biosci. 1997;13:345–356. doi: 10.1093/bioinformatics/13.4.345. [DOI] [PubMed] [Google Scholar]

[R44] 44.Kuhlman B, Baker D. Proceedings of the National Academy of Sciences of the United States of America. 2000;97:10383–10388. doi: 10.1073/pnas.97.19.10383. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] 45.Edwards MM, Mammadova-Bach E, Alpy F, Klein A, Hicks WL, Roux M, Simon-Assmann P, Smith RS, Orend G, Wu J, Peachey NS, Naggert JK, Lefevbre O, Nishina PM. J. Biol. Chem. 2010;285:7697–7711. doi: 10.1074/jbc.M109.069575. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R46] 46.Smyth NR, Vatansever HS, Murray P, Meyer M, Frie C, Paulsson M, Edgar D. J. Cell Biol. 1999;144:151–160. doi: 10.1083/jcb.144.1.151. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R47] 47.Macedo-Ribeiro S, Bode W, Huber R, Quinn-Allen MA, Kim SW, Ortel TL, Bourenkov GP, Bartunik HD, Stubbs MT, Kane WH, Fuentes-Prior P. Nature. 1999;402:434–439. doi: 10.1038/46594. [DOI] [PubMed] [Google Scholar]

[R48] 48.Pratt KP, Shen BW, Takeshima K, Davie EW, Fujikawa K, Stoddard BL. Nature. 1999;402:439–442. doi: 10.1038/46601. [DOI] [PubMed] [Google Scholar]

[R49] 49.Gaskell A, Crennell S, Taylor G. Structure. 1995;3:1197–1205. doi: 10.1016/s0969-2126(01)00255-6. [DOI] [PubMed] [Google Scholar]

[R50] 50.Ito N, Phillips SEV, Stevens C, Ogel ZB, McPherson MJ, Keen JN, Yadav KDS, Knowles PF. Nature. 1991;350:87–90. doi: 10.1038/350087a0. [DOI] [PubMed] [Google Scholar]

[R51] 51.Au SWN, Leng X, Harper JW, Barford D. Journal of Molecular Biology. 2002;316:955–968. doi: 10.1006/jmbi.2002.5399. [DOI] [PubMed] [Google Scholar]

[R52] 52.Boraston AB, Nurizzo D, Notenboom V, Ducros V, Rose DR, Kilburn DG, Davies GJ. Journal of Molecular Biology. 2002;319:1143–1156. doi: 10.1016/S0022-2836(02)00374-1. [DOI] [PubMed] [Google Scholar]

[R53] 53.Wendt KS, Vodermaier HC, Jacob U, Gieffers C, Gmachl M, Peters J-M, Huber R, Sondermann P. Nature Structural Biology. 2001;8:784–788. doi: 10.1038/nsb0901-784. [DOI] [PubMed] [Google Scholar]

[R54] 54.Bianchet MA, Odom EW, Vasta GR, Amzel LM. Nature Structural Biology. 2002;9:628–634. doi: 10.1038/nsb817. [DOI] [PubMed] [Google Scholar]

[R55] 55.Firbank SJ, Rogers MS, Wilmot CM, Dooley DM, Halcrow MA, Knowles PF, McPherson MJ, Phillips SEV. Proceedings of the National Academy of Sciences of the United States of America. 2001;98:12932–12937. doi: 10.1073/pnas.231463798. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R56] 56.Lee CC, Kreusch A, McMullan D, Ng K, Spraggon G. Structure. 2003;11:99–108. doi: 10.1016/s0969-2126(02)00941-3. [DOI] [PubMed] [Google Scholar]

[R57] 57.Adams TE, Hockin MF, Mann KG, Everse SJ. Proceedings of the National Academy of Sciences of the United States of America. 2004;101:8918–8923. doi: 10.1073/pnas.0403072101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R58] 58.Ramelot TA, Raman S, Kuzin AP, Xiao R, Ma L-C, Acton TB, Hunt JF, Montelione GT, Baker D, Kennedy MA. Proteins: Structure, Function, and Bioinformatics. 2009;75:147–167. doi: 10.1002/prot.22229. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R59] 59.Marintchev A, Mullen MA, Maciejewski MW, Pan B, Gryk MR, Mullen GP. Nat Struct Mol Biol. 1999;6:884–893. doi: 10.1038/12347. [DOI] [PubMed] [Google Scholar]

[R60] 60.Tews I, Perrakis A, Oppenheim A, Dauter Z, Wilson KS, Vorgias CE. Nat Struct Mol Biol. 1996;3:638–648. doi: 10.1038/nsb0796-638. [DOI] [PubMed] [Google Scholar]

[R61] 61.JointCenterForStructuralGenomics. PDB entry 1VPQ To be published. [Google Scholar]

[R62] 62.Guo H-C, Jardetzky TS, Garrettt TPJ, Lane WS, Strominger JL, Wiley DC. Nature. 1992;360:364–366. doi: 10.1038/360364a0. [DOI] [PubMed] [Google Scholar]

[R63] 63.Cardoso RMF, Silva CHTP, Ulian de Araujo AP, Tanaka T, Tanaka M, Garratt RC. Acta Crystallographica Section D. 2004;60:1569–1578. doi: 10.1107/S0907444904016798. [DOI] [PubMed] [Google Scholar]

[R64] 64.Setayesh FR, DeCorte BL, Horton P, Harris CM, Harris TM, Stone MP. Chemical Research in Toxicology. 1998;11:766–777. doi: 10.1021/tx9800147. [DOI] [PubMed] [Google Scholar]

[R65] 65.Momma K, Mishima Y, Hashimoto W, Mikami B, Murata K. Biochemistry. 2005;44:5053–5064. doi: 10.1021/bi047781r. [DOI] [PubMed] [Google Scholar]

[R66] 66.JointCenterForStructuralGenomics. PDB entry 1VKO To be published. [Google Scholar]

[R67] 67.Mol CD, Dougan DR, Schneider TR, Skene RJ, Kraus ML, Scheibe DN, Snell GP, Zou H, Sang B-C, Wilson KP. Journal of Biological Chemistry. 2004;279:31655–31663. doi: 10.1074/jbc.M403319200. [DOI] [PubMed] [Google Scholar]

[R68] 68.Skelton NJ, Quan C, Reilly D, Lowman H. Structure. 1999;7:157–168. doi: 10.1016/S0969-2126(99)80022-7. [DOI] [PubMed] [Google Scholar]

[R69] 69.Li H, Tomizawa T, Tochio N, Muto Y, Koshiba S, Inoue M, Kigawa T, Yokoyama S. PDB entry 2COC To be published. [Google Scholar]

[R70] 70.Udomsinprasert R, Pongjaroenkit S, Wongsantichon J, Oakley AJ, Prapanthadara L-a, Wilce MCJ, Ketterman AJ. Biochem. J. 2005;388:763–771. doi: 10.1042/BJ20042015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R71] 71.Ichiyanagi K, Ishino Y, Ariyoshi M, Komori K, Morikawa K. Journal of Molecular Biology. 2000;300:889–901. doi: 10.1006/jmbi.2000.3873. [DOI] [PubMed] [Google Scholar]

[R72] 72.López-Lucendo MF, Solís D, André S, Hirabayashi J, Kasai K-i, Kaltner H, Gabius H-J, Romero A. Journal of Molecular Biology. 2004;343:957–970. doi: 10.1016/j.jmb.2004.08.078. [DOI] [PubMed] [Google Scholar]

[R73] 73.Katsuya Y, Mezaki Y, Kubota M, Matsuura Y. Journal of Molecular Biology. 1998;281:885–897. doi: 10.1006/jmbi.1998.1992. [DOI] [PubMed] [Google Scholar]

[R74] 74.Koch AW, Farooq A, Shan W, Zeng L, Colman DR, Zhou M-M. Structure. 2004;12:793–805. doi: 10.1016/j.str.2004.02.034. [DOI] [PubMed] [Google Scholar]

[R75] 75.Abad MC, Arni RK, Grella DK, Castellino FJ, Tulinsky A, Geiger JH. Journal of Molecular Biology. 2002;318:1009–1017. doi: 10.1016/S0022-2836(02)00211-5. [DOI] [PubMed] [Google Scholar]

[R76] 76.Llera AS, Viedma F, Sánchez-Madrid F, Tormo J. Journal of Biological Chemistry. 2001;276:7312–7319. doi: 10.1074/jbc.M008573200. [DOI] [PubMed] [Google Scholar]

[R77] 77.Baeyens KJ, De Bondt HL, Raeymaekers A, Fiers W, De Ranter CJ. Acta Crystallographica Section D. 1999;55:772–778. doi: 10.1107/s0907444998018435. [DOI] [PubMed] [Google Scholar]

[R78] 78.Lubkowski J, Hennecke F, Pluckthun A, Wlodawer A. Nat Struct Mol Biol. 1998;5:140–147. doi: 10.1038/nsb0298-140. [DOI] [PubMed] [Google Scholar]

[R79] 79.Tegoni M, Spinelli S, Verhoeyen M, Davis P, Cambillau C. Journal of Molecular Biology. 1999;289:1375–1385. doi: 10.1006/jmbi.1999.2845. [DOI] [PubMed] [Google Scholar]

[R80] 80.Hart PJ, Deep S, Taylor AB, Shu Z, Hinck CS, Hinck AP. Nat Struct Mol Biol. 2002;9:203–208. doi: 10.1038/nsb766. [DOI] [PubMed] [Google Scholar]

[R81] 81.Stams T, Chen Y, Boriack-Sjodin PA, Hurt JD, Liao J, May JA, Dean T, Laipis P, Silverman DN, Christianson DW. Protein Sci. 1998;7:556–563. doi: 10.1002/pro.5560070303. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R82] 82.Lamb AL, Wernimont AK, Pufahl RA, O'Halloran TV, Rosenzweig AC. Biochemistry. 2000;39:1589–1595. doi: 10.1021/bi992822i. [DOI] [PubMed] [Google Scholar]

[R83] 83.Whittingtons DA, Grubb JH, Waheed A, Shah GN, Sly WS, Christianson DW. Journal of Biological Chemistry. 2004;279:7223–7228. doi: 10.1074/jbc.M310809200. [DOI] [PubMed] [Google Scholar]

[R84] 84.Dayan SM, Van Donkelaar A, Kortt AA. Journal of Biological Chemistry. 1987;262:10287–10289. [PubMed] [Google Scholar]

[R85] 85.Watson JN, Newstead S, Dookhun V, Taylor G, Bennet AJ. FEBS Letters. 2004;577:265–269. doi: 10.1016/j.febslet.2004.10.016. [DOI] [PubMed] [Google Scholar]

[R86] 86.Zhang Y. BMC Bioinformatics. 2008;9:40. doi: 10.1186/1471-2105-9-40. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R87] 87.Zhang Y. Proteins: Structure, Function, and Bioinformatics. 2009;77:100–113. [Google Scholar]

[R88] 88.Soding J, Biegert A, Lupas AN. Nucl. Acids Res. 2005;33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R89] 89.Wu S, Zhang Y. Nucl. Acids Res. 2007;35:3375–3382. doi: 10.1093/nar/gkm251. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R90] 90.Fiser A, Do RKG, Scaronali A. Protein Science. 2000;9:1753–1773. doi: 10.1110/ps.9.9.1753. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Computational Modeling of Laminin N-Terminal Domains Using Sparse Distance Constraints from Disulfide Bonds and Chemical Cross-Linking

Stefan Kalkhof

Sebastian Haehn

Mats Paulsson

Neil Smyth

Jens Meiler

Andrea Sinz

Abstract

INTRODUCTION

MATERIALS AND METHODS

Materials

Expression and Purification of Laminin β1 and γ1 N-terminal Constructs

Cross-Linking Reactions

In-Solution Digestion

Gel Electrophoresis and In-Gel Digestion

Nano-HPLC / MALDI TOF/TOF-Mass Spectrometry

Nano-HPLC / Nano-ESI-FTICR Mass Spectrometry

Identification of Cross-Linked Products

Fold Recognition

Figure 1.

Table 4.

Figure 2.

Parametric Sequence Alignment

Construction of Comparative Models

Validation and Refinement of LN Domain Structures

Table 5.

Table 6.

Leave-One-Out Cross-Validation

Table 7.

Evaluation of Structure Quality

Estimation of the Influence of Templates and Distance Constraints

RESULTS AND DISCUSSION

Figure 3.

Intramolecular Cross-Linked Products

Table 1.

Table 2.

Figure 4.

Disulfide Bond Analysis

Table 3.

Structures of Laminin β1 and γ1 LN Domains

Sequence Assignment and Modeling of Loops

Construction of Comparative Models

Evaluation of the Models Using Experimental Data

Figure 5.

Leave-One-Out Cross-Validation and Evaluation with Structure Quality

Influence of Templates

Influence of Experimental Distance Constraints

Implications of Our Models for Laminin Function

CONCLUSIONS

Supplementary Material

Acknowledgments

ABBREVIATIONS

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Expression and Purification of Laminin β₁ and γ₁ N-terminal Constructs

Structures of Laminin β₁ and γ₁ LN Domains