Abstract
Sarcoidosis is a systemic interstitial lung disease of unknown aetiology. Less invasive diagnostics are needed to decipher disease pathology and to distinguish sub-phenotypes. Here we test if SpotLight proteomics, which combines de novo MS/MS sequencing of enriched IgG and co-extracted proteins with subsequent label-free quantification of new and known peptides, can differentiate controls and sarcoidosis phenotypes (Löfgrens and non-Löfgrens syndrome, LS and nonLS). Intra-individually matched IgG enriched from serum and bronchial lavage fluid (BALF) from controls (n = 12), LS (n = 11) and nonLS (n = 12) were investigated. High-resolution mass-spectrometry SpotLight proteomics and uni- and multivariate-statistical analyses were used for data processing. Major differences were particularly observed in control-BALF versus sarcoidosis-BALF. However, interestingly, information obtained from BALF profiles was still present (but less prominent) in matched serum profiles. By using information from orthogonal partial least squares discriminant analysis (OPLS-DA) differentiating 1) sarcoidosis-BALF and control-BALF and 2) LS-BALF vs. nonLS-BALF, control-serum and sarcoidosis-serum (p = 0.0007) as well as LS-serum and nonLS-serum (p = 0.006) could be distinguished. Noteworthy, many factors prominent in identifying controls and patients were those associated with Fc-regulation, but also features from the IgG-Fab region and novel peptide variants. Differences between phenotypes were mostly IgG-specificity related. The results support the analytical utility of SpotLight proteomics which prospectively have potential to differentiate closely related phenotypes from a simple blood test.
Keywords: IgG, sarcoidosis, Löfgrens syndrome, biomarker
1. Introduction
Sarcoidosis is a systemic inflammatory disease which is characterized by non-caseating granulomas that predominately involve lung and hilar lymph nodes. Clinically, sarcoidosis can be categorized as Löfgren syndrome (LS) and non-Löfgren syndrome (nonLS) [1]. LS has an acute disease onset, although usually a more favorable outcome compared to nonLS, particularly in patients expressing human leukocyte antigen (HLA) type HLA-DRB1*03 [2]. Although a substantial amount of research has been undertaken, the immunological aetiology behind sarcoidosis remains elusive. Sarcoidosis has conventionally been considered a T-cell mediated disease. Interestingly, studies have shown antibody recognition towards mycobacterial antigens in patients [3,4] and HLA derived peptides can stimulate T-cell response [5]. Furthermore, high frequencies of autoantigens have been reported in sarcoidosis bronchoalveolar lavage fluid (BALF) and serum [6]. By identifying exogenous peptides and autoantigens, these studies suggested a contribution of immunoglobulins (Ig) to disease pathogenesis.
We recently developed the SpotLight proteomics approach [7] that combines de novo MS/MS sequencing of enriched antibodies and co-extracted proteins with subsequent label-free quantification of new and known peptides. In a pilot study using this approach on differentiating two types of neurodegenerative disorders, the hidden proteome added almost as much information to patient stratification as the apparent proteome [7]. Intriguingly, many of the new peptide sequences were attributable to antibody variable regions, and are potential biomarkers indicative of disease aetiology. Here we test if the SpotLight approach is also applicable in differentiating two sarcoidosis phenotypes (i.e., LS vs. nonLS). Since sarcoidosis is a predominant chronic lung disorder, it would be expected that the IgG in closest proximity to the lung is more disease specific. However, since antibodies are circulating in the body, it is likely that features identified in lung-IgG can also be identified in blood-IgG. Given that diagnostic bronchoscopy with BAL can be unpleasant, easier sampling by a blood test would facilitate diagnostic procedures significantly. Thus, in order to test the hypothesis that the blood-IgG profiles prospectively can be used to differentiate sarcoidosis phenotypes in clinical samples, and thereby to identify and develop prognostic markers, we applied SpotLight proteomics on intra-individually matched samples from the lung (i.e., BALF) and blood (i.e., serum) of LS and nonLS sarcoidosis patients as well as of healthy controls. The SpotLight approach and study design are described in Figure 1.
2. Results
2.1. Data Overview
In total, 825 IgG specific peptides (with conserved or novel sequence variations), 59 IgG-Fc-glycopeptides and 2369 peptides that either could be assigned to other known protein sequences or represented novel sequences were identified in at least 50% of the serum or BALF samples (Figure 2). Of this data, 1711 features could be detected both in serum and BALF, 1053 features were only found in serum, and 696 features were only found in BALF. The biggest differences between BALF and serum were found in non-IgG peptides, for which a greater portion 38% (n = 909) were found in serum compared to 24% (n = 580) in BALF. In contrast, of the IgGome and Fc-glycan peptides (that were detected in at least 50% of all individuals), the majority could be identified in both matrices (83%, n = 730). Principal component analysis (PCA) of the complete data set (3 components, R2 = 0.63, Q2 = 0.58) differentiated BALF and serum samples along the first component and BALF-controls and BALF-sarcoidosis patients along the second component (Figure 3A). When interrogating how the features assembled along the second component (Figure 3B), it became apparent that many of the IgG conserved, variable and novel peptide sequences are correlating with sarcoidosis. A complete list of all detected features, including subgroup averages, means and p-values are listed in Supplementary Table S1. Separate PCA analyses of the BALF and serum data sets are given in Supplementary Figure S1.
2.2. Differences Between Controls and Sarcoidosis Patients
From PCA analysis it is evident that the main differences between patients and controls were observed in samples taken in close proximity to the disease location (i.e. the BALF). 1072 features were found to be significantly different between the controls and sarcoidosis patients in the BALF. Of these, 724 remained significant following FDR- and 42 following BF correction (Figure 4A). For the serum samples, the results were more modest with 393 features identified as significantly different, and of which 16 remained significant following FDR-, and 5 following BF correction (Figure 4C). Noteworthy, in the BALF many of the FDR corrected features were from (n = 125) or showed sequence homology (n = 84) with IgG. Furthermore, out of the remaining FDR corrected features ~40% (n = 233) were from novel peptide sequences. It is likely that within this pool of peptides there might be hints on disease-specific proteoforms and CDR-chain variants. In contrast to BALF, no IgG related variable peptide chain sequences remained significant following correction in the serum. Instead, and as expected, the majority of significantly different features could be linked to inflammation. Noteworthy, when examining which features were consistently different in both matrixes two distinct trends were observed. Specifically, proteins/and peptides originating from the complement cascade were elevated in sarcoidosis patients as well as agalactosylated Fc-glycan peptides. For the BALF we could also identify an elevation in proteins/peptides in neutrophil activation/mediated immunity, cholesterol, lipid, and amyloid regulating pathways (Table 1, Figure 5). Furthermore, proteins/peptides involved in peptidase related activities were elevated in the sarcoidosis patients while the peptidase inhibitors were found in lower abundances (i.e., correlating with the healthy individuals).
Table 1.
Type | Correlation | Type | Process/Function | FDR | Gene Name |
---|---|---|---|---|---|
Process | Sarcoidosis | Complement | regulation of complement activation | 2.0 × 10−3 | C2, C3, C8G, CFB, CFI, F2 |
regulation of humoral immune response | 2.0 × 10−3 | C2, C3, C8G, CFB, CFI, F2, HPX | |||
complement activation | 7.2 × 10−3 | C2, C3, C8G, CFB, CFI | |||
complement activation, classical pathway | 1.7 × 10−2 | C2, C3, C8G, CFI | |||
complement activation, alternative pathway | 1.9 × 10−2 | C3, C8G, CFB | |||
Effector | regulation of immune effector process | 4.0 × 10−2 | APOA1, APOA2, ARG1, C2, C3, C8G, CFB, CFI, F2, HPX | ||
Neutrophil | neutrophil activation | 2.4 × 10−3 | ARG1, C3, CHIT1, COPB1, GAA, GIG25, GSN, HBB, HMHA1, LTF, NPC2, PREX1, PSMD1, SERPINB10, SERPINB3 | ||
neutrophil mediated immunity | 2.4 × 10−3 | ARG1, C3, CHIT1, COPB1, F2, GAA, GIG25, GSN, HBB, HMHA1, LTF, NPC2, PSMD1, SERPINB10, SERPINB3 | |||
Lipid | cholesterol efflux | 7.2 × 10−3 | APOA1, APOA2, APOB, NPC2 | ||
chylomicron remodeling | 7.6 × 10−3 | APOA1, APOA2, APOB | |||
chylomicron assembly | 1.2 × 10−2 | APOA1, APOA2, APOB | |||
plasma lipoprotein particle clearance | 2.6 × 10−2 | APOA1, APOA2, APOB, NPC2 | |||
negative regulation of very-low-density lipoprotein particle remodeling | 3.8 × 10−2 | APOA1, APOA2 | |||
Controls | Inhibitors | negative regulation of catalytic activity | 3.7 × 10−8 | ANXA2, APC, CD109, CST6, CSTA, CSTB, F11R, FABP4, FETUB, FKBP1A, GAPDH, GCHFR, GSTP1, HSPB1, MICAL1, PARK7, PEBP1, RGS2, SCGB1A1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |
negative regulation of hydrolase activity | 5.3 × 10−8 | CD109, CST6, CSTA, CSTB, FETUB, GAPDH, MICAL1, PARK7, PEBP1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |||
negative regulation of endopeptidase activity | 7.8 × 10−8 | CD109, CST6, CSTA, CSTB, FETUB, GAPDH, MICAL1, PARK7, PEBP1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |||
negative regulation of proteolysis | 7.6 × 10−7 | CD109, CD55, CST6, CSTA, CSTB, FETUB, GAPDH, MICAL1, PARK7, PEBP1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |||
Function | Sarcoidosis | Peptidases | endopeptidase activity | 2.1 × 10−3 | ADAMTS10, ADAMTS19, ADAMTS6, C2, C3, CAPN5, CFB, CFI, F2, LTF, PLG, TLL1, TMPRSS3, USP16 |
peptidase activity, acting on L-amino acid peptides | 5.8 × 10−3 | ADAMTS10, ADAMTS19, ADAMTS6, C2, C3, CAPN5, CFB, CFI, F2, LTF, PLG, TLL1, TMPRSS3, USP16, USP43 | |||
hydrolase activity | 2.6 × 10−2 | ADAMTS10, ADAMTS19, ADAMTS6, AGAP3, AQR, ARG1, ATIC, ATP4A, C2, C3, CAPN5, CFB, CFI, CHIT1, DUSP14, DYNC2H1, ERCC6L, F2, GAA, HEXDC, KIF21B, LTF, MYO3B, PLCL1, PLG, PTPRQ, RAB26, RARRES3, RASEF, TLL1, TMEM55A, TMPRSS3, USP16, USP43 | |||
serine-type endopeptidase activity | 4.3 × 10−3 | C2, C3, CFB, CFI, F2, LTF, PLG, TLL1, TMPRSS3 | |||
Cholesterol | cholesterol transporter activity | 4.3 × 10−3 | APOA1, APOA2, APOB, NPC2 | ||
cholesterol binding | 2.6 × 10−2 | APOA1, APOA2, APOB, NPC2 | |||
Lipid | high-density lipoprotein particle receptor binding | 2.6 × 10−2 | APOA1, APOA2 | ||
apolipoprotein receptor binding | 4.1 × 10−2 | APOA1, APOA2 | |||
lipoprotein particle receptor binding | 4.1 × 10−2 | APOA1, APOA2, APOB | |||
phosphatidylcholine-sterol O-acyltransferase activator activity | 4.1 × 10−2 | APOA1, APOA2 | |||
Amyloid | amyloid-beta binding | 3.7 × 10−2 | APOA1, CLSTN1, INSR, MAPK8IP2 | ||
Controls | Inhibitors | endopeptidase inhibitor activity | 6.6 × 10−7 | CD109, CST6, CSTA, CSTB, FETUB, GAPDH, PEBP1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |
enzyme inhibitor activity | 6.6 × 10−7 | ANXA2, CD109, CST6, CSTA, CSTB, FETUB, GAPDH, GCHFR, HSPB1, PEBP1, SCGB1A1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |||
peptidase regulator activity | 6.6 × 10−7 | CD109, CST6, CSTA, CSTB, CTSH, FETUB, GAPDH, PEBP1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |||
serine-type endopeptidase inhibitor activity | 2.1 × 10−4 | CD109, PEBP1, SERPINB12, SLPI, SPINT2, WFDC2, WFIKKN2 | |||
cysteine-type endopeptidase inhibitor activity | 1.8 × 10−3 | CST6, CSTA, CSTB, FETUB, WFDC2 | |||
aspartic-type endopeptidase inhibitor activity | 1.3 × 10−2 | GAPDH, WFDC2 | |||
phospholipase A2 inhibitor activity | 1.7 × 10−2 | ANXA2, SCGB1A1 | |||
protease binding | 3.3 × 10−2 | ANXA2, CST6, CSTA, CSTB, PARK7 | |||
Antioxidant | antioxidant activity | 8.4 × 10−4 | ALB, CAT, GSTP1, PARK7, SOD1, TXN | ||
Peptidases | cysteine-type endopeptidase activity | 3.6 × 10−2 | CASP14, CTSH, CTSS, TINAG | ||
peptidase activity | 3.6 × 10−2 | CASP14, CTSH, CTSS, DCD, KLKB1, MYRF, NAPSA, PARK7, PIP, PLG, TINAG | |||
endopeptidase activity | 4.7 × 10−2 | CASP14, CTSH, CTSS, KLKB1, NAPSA, PIP, PLG, TINAG |
2.3. Differences Between LS and non-LS Patients
In contrast to the large differences between controls and sarcoidosis patients, the differences between the phenotypes were much weaker (Figure 4B,D, respectively). However, interestingly, the majority of factors differentiating nonLS and LS were observed within the portion of IgG related peptides and novel sequences (IgG+novel: 61%, IgG only: 40%) compared to factors differentiating controls and sarcoidosis patients (IgG+novel: 49%, IgG only: 19%). Thus, out of 78 significant factors (peptides) in BALF, 62 were from IgG and novel peptide sequences. For the serum, out of 102 significant factors, 57 were from IgG and novel peptides. Furthermore, out of the nine factors that were consistently different between nonLS and LS in both matrices, eight were from IgG, of which seven were from the variable regions. Noteworthy, in contrast to the big differences between patients and controls in BALF (Figure 4A) compared to serum (Figure 4C), more features were significantly different in serum (Figure 4D) than in BALF (Figure 4B) when comparing the phenotypes. However, considering that nothing remained significant following FDR correction, we knew that many of the identified factors should be interpreted with caution, as some are likely to be false positives. It is noteworthy to mention that the total profile information obtained from BALF (from which we obtained most differentiation according to the IgG profile) was still sufficient to discriminate the phenotypes in serum. For details on significant/non-significant features in the phenotypes, see Supplementary Table S2.
2.4. Predicting Sarcoidosis Disease Status in Serum via Information Obtained from BALF
The main goal of this study was to test if information obtained by circulating polyclonal IgG (and co-eluting proteins) in BALF can be used to differentiate patient phenotypes and controls in serum. First, we tested this approach to differentiate serum-control and serum- sarcoidosis patients. The samples (that were treated as unknowns), were distinguished with a significance of p = 0.0007 (Figure 6A) using BALF-control and BALF-sarcoidosis information. In the next step, we tested the same approach to differentiate the serum-nonLS and serum-LS phenotypes using BALF information. Strikingly, even though the phenotype differences were much smaller than in the previous case, the model was still able to differentiate the phenotypes (p = 0.006), Figure 6B. Noteworthy, when testing the opposite approach and predicting the phenotypes in BALF by using the serum information, the phenotypes could still be distinguished, albeit with a lower significance (p = 0.03).
3. Discussion
The primary aim of this study was to see if Melon Gel extracted IgG from serum could be used to differentiate phenotype specific sarcoidosis with the help of information obtained from Melon Gel extracted IgG from BALF. The utility of SpotLight proteomics has previously been demonstrated in differentiating two types of neurodegenerative diseases with 85%–95% accuracy [7]. Herein, the same approach was used but instead of working with serum samples only, we used BALF samples for multivariate model building and serum samples (treated as unknowns) for validation. Given that antibodies are circulating in the system from less accessible but disease-specific organs (i.e., lung), to the organs that can be easily sampled (i.e., blood), we hypothesized that even though the disease and phenotype-specific imprint or profile is likely weaker in blood, it should still be present. This hypothesis was validated on the serum enriched IgG profiles by using information from the IgG enriched lung profile (which in theory should have a stronger imprint). Strikingly, using this approach both controls and sarcoidosis patients as well as nonLS- versus LS-patients could be differentiated. We also tested the inverse, i.e., using the serum profile data to differentiate the BALF samples. As expected the results were less striking. This strengthens the hypothesis that most disease and phenotype-specific information should be extracted in close proximity to the disease. However, by gathering (and validating) this information, it can prospectively be used to differentiate the patients using their corresponding enriched IgG serum profile. Thus, opening up the possibility of a new approach for disease and phenotype characterization via blood test sampling.
When analyzing polyclonal enriched IgG, using a bottom-up proteomics approach with sensitive Orbitrap MS instrumentation, other proteins and peptides that co-elute with the IgG can be identified [7,8,9]. Of these proteins, we expect a significant portion to be either co-eluting via Fc- or most intriguing Fab- binding interactions (Figure 1C). In this study, we could find several immune effector processing proteins that interact with the IgG-Fc region to be significantly elevated in the sarcoidosis patients (Figure 5). This trend was also in line with what was observed for the IgG-Fc-glycosylation pattern which was containing a significantly lower distribution of galactosylated forms [10]. We could also identify an elevation in the sarcoidosis patients in proteins involved in neutrophil activation/mediated immunity-, as well as cholesterol-, lipid-, and amyloid-regulating pathways (Figure 5). Furthermore, many of the protein/peptide sequences that were found in higher abundance in sarcoidosis are involved in peptidase activity. In contrast, several peptidase inhibitors had higher abundance in the controls. This information is in line with several other studies [11,12,13,14,15].
Not surprisingly, both univariate and multivariate data only indicated few/weak differences between the sarcoidosis phenotypes and with no factors reaching significance following FDR correction. However, interestingly, a majority of the phenotype differentiating factors were from the IgG variable regions. This was most specifically the case for the phenotype-specific features identified in the BALF-profiles and for features that were found to be significant according to phenotype in both matrices. Similarly to the BALF and serum IgG-profile models built to distinguish controls and sarcoidosis, the BALF-phenotype information remained superior in distinguishing the serum-phenotypes compared to using the serum-phenotype information to distinguish the BALF-phenotypes. One reason for this is likely that we obtain a larger portion of true positives in BALF (which is closer to the disease source). Another reason for this could be that the IgG profile (which was more phenotype specific in the BALF), is also carried through the circulating blood system and thereby easier to target and extrapolate between the matrices.
This pilot study was designed to test the hypothesis on a small number of patients and controls. The reproducibility of the results needs to be further validated on a larger cohort that includes both genders and with controls that are better age-matched to the patient group. However, importantly and as discussed above, most of the non-IgG detected features that were identified as sarcoidosis correlating have also been observed in previous studies [11,12,13,14,15]. Furthermore, the observed alteration in Fc-galactosylation status (which is known to be age dependent) remained significant following age correction [10].
In summary, we show the potential in using the serum profile of IgG and co-enriched proteins/peptides to target a disease or to characterize a disease phenotype. Within the pool of IgG and co-enriched proteins, we can identify both disease/phenotype specific features and features that will indicate the immune regulatory state of the patient. Considering that BALF sampling and other invasive sampling procedures can be daunting both for the patient and clinician, the idea of using easily accessible polyclonal IgG from a simple blood test is promising. Thus, we suggest that this approach prospectively can have huge potential and impact on disease/phenotype characterization.
4. Material and Methods
4.1. Subject Information
Sarcoidosis patients and controls have previously been described in detail [10]. Briefly, patients with LS (n = 11) or non-LS (n = 12), and healthy controls (n = 12) underwent bronchoscopy with BAL as previously described [16]. Furthermore, one LS patient was sampled twice with one year between sampling dates. All subjects were non-smoking males of northern European descent. LS and non-LS were age-matched (LS: 42 ± 7 years, non-LS: 43 ± 9 years) while healthy subjects were significantly younger (27 ± 3 years). LS is defined via acute onset, usually, with fever, chest radiographic findings with bilateral hilar lymphadenopathy, sometimes with pulmonary infiltrates, and with erythema nodosum, or bilateral ankle arthritis. Sarcoidosis patients were diagnosed as defined by the World Association of Sarcoidosis and other Granulomatous Disorders (WASOG) [17]. Written informed consent was obtained from all participants. The study was approved by the ethical committee review board in Stockholm (2005/1031–31/2). Clinical characteristics of the subjects are described in Table 2.
Table 2.
H a | LS b | Non-LS c | |
---|---|---|---|
Age (year) | 27 ± 3 | 42 ± 7 | 43 ± 9 |
Smoking (never/ex) | 12/0 | 8/3 | 9/3 |
Gender (male/female) | 12/0 | 12/0 | 12/0 |
X-ray stage I/II/III/IV d | NM e | 5/6/0/0 | 3/9/0/0 |
FVC f % of predicteg f | 114 ± 11 | 84 ± 16 | 90 ± 14 |
FEV(1) hg % of predicted | 107 ± 9 | 86 ± 16 | 86 ± 15 |
FEV(1)/FVC i | 79 ± 7 | 78 ± 7 | 70 ± 6 |
Macrophages % * | 86 ± 9 | 72 ± 21 | 69 ± 15 |
Lymphocytes % * | 9 ± 5 | 26 ± 21 | 28 ± 16 |
Neutrophils % * | 2 ± 4 | 2 ± 1 | 1 ± 1 |
Eosinophils % * | 0 | 0 | 2 ± 4 |
CD4/CD8 | NM | 9 ± 7 | 6 ± 6 |
Va2.3j% | NM | 18 ± 15 | 8 ± 8 |
CRP k mg/L | NM | 16 ± 26 | 3 ± 2 |
ACE l U/mL | NM | 72 ± 35 | 64 ± 34 |
Alb m g/L | 42 ± 3 | 41 ± 4 | 41 ± 4 |
a: Healthy, b: Löfgren Syndrome, c: non-Löfgren Syndrome, d: Stage I indicates granuloma formation in the hilar lymph nodes. Stage II has, in addition to the hilar lymphadenopathy, also granuloma formation in the lung shown as diffuse infiltrates on the x-ray. Stage III has parenchymal infiltrates on the x-ray but an absence of hilar adenopathy, stage IV indicates irreversible pulmonary scarring. e: not measured, f: Forced vital capacity, g: LS group is missing one value. h: Forced expiratory volume 1 sec, i: LS is missing one value, the non-LS group is missing three values. j: V alpha 2.3 T-cell receptor positive, k: C reactive protein, l: Angiotensin-converting enzyme, m: Serum albumin, * Data was obtained from the BAL.
4.2. Sample Preparations
Serum (40 µL/sample) and BALF (500 µL/sample) of sarcoidosis patients and controls were aliquoted in triplicates, prepared, and analyzed in a randomized manner as previously described [10]. Two replicates were analyzed by LC-MS/MS. The third (still available) set was confirmed to be consistent with the other two sets via pool analysis. Polyclonal IgGs were enriched from blood serum and from BALF using Melon Gel IgG Spin Purification Kit (Thermo Scientific) [7,10]. For serum IgG, ten µg of IgG/sample and for BALF the complete IgG samples were used for analysis. The protein S–S bonds were reduced, the cysteines alkylated, and the proteins cleaved with trypsin into peptides. Peptides were then desalted, dried down, and stored at −20 °C until LC-MS/MS analysis.
4.3. LC-MS/MS Analysis
Approximately 1 µg of digest/sample was analyzed using an EASY-nLC system connected to a Fusion Orbitrap mass-spectrometer (both Thermo Fisher Scientific) [10]. Briefly, reversed phase nano-LC-separation of the peptides was performed on a 50 cm long EASY spray column (PepMap, C18, 2 µm, 100 Å). The chromatographic separation was achieved using a solvent system containing (A) water with 2% acetonitrile and 0.1% formic acid and (B) acetonitrile with 2% water and 0.1% formic acid and with a flow rate of 300 nL/min. The mass spectrometer was operating in positive DDA mode. A survey mass spectrum was acquired in the m/z 200–2000 range with a nominal resolution of 120,000. Precursor ion selection was performed in “top speed” mode of 2 to 7 charged ions. Precursor ion selection for MS/MS, was performed for each precursor with both higher-energy collisional dissociation (HCD) and electron transfer dissociation (ETD).
4.4. Protein and Peptide Identification and Quantification
A detailed description of the SpotLight approach including the de novo sequencing and quantitative analysis has been described in detail [7]. Briefly, all MS/MS spectra from the MG-extraction experiments were firstly searched against a human reference proteome with two missed cleavages as well as 10 ppm and 20 ppm mass tolerances for precursor and fragment peaks. Carbamidomethylation of cysteine was set as a fixed modification. Oxidation of methionine, deamidation of asparagine and glutamine, as well as acetylation of protein N-terminus were considered to be set as variable modifications. MS/MS spectra assigned to peptide sequences with <1% FDR were excluded. Remaining data underwent de novo sequencing with unassigned spectra submitted in a pair-wise (HCD-ETD) manner to pNovo + (v.1.3) [18]. Precursor mass range was set to 700–4000 Da with oxidized methionine considered to be an independent residue. Mass tolerance was set in MS/MS at 5 ppm for precursors and 15 ppm for fragments. Candidate sequences were filtered by the criteria of full backbone coverage. Three top-scoring peptide sequences were kept as candidates. Peptide candidates were homology-searched against the human UniProt protein database using BLASTp. The highest BLAST score match was reported as the final sequence. The UniProt human protein database and obtained de novo sequences were merged into the SpotLight database, on which a second database search was performed. For quantitation, raw data were processed through the DeMix-Q workflow [19,20], in which MS/MS spectra were matched against the SpotLight database. Protein abundances were calculated by averaging abundances of the three most abundant constituent peptides. Only proteins from which at least two unique peptides per protein would be quantified were used. Furthermore, for both protein and peptide data, only features found and quantified in at least 50% of either all serum- or all BALF samples were kept. The abundances of IgG peptides were re-normalized such that their total abundance in all samples was the same (100%). Non-IgG peptides, as well as proteins, were normalized separately in the same way. Assignment of complementarity determining regions (CDR) and framework regions (FR) were based on Uniprot information and by using the VBASE sequence directory (Tomlinson et al., MRC Centre for Protein Engineering, http://www2.mrc-lmb.cam.ac.uk/vbase/alignments2.php).
4.5. Fc-Glycopeptide Identification and Quantification
As previously described [9,21], 63 glycopeptide variants with glycans N-linked to tryptic-peptides EEQYNSTYR and TKPREEQYNSTYR (IgG1), EEQFNSTFR and TKPREEQFNSTFR (IgG2 or IgG3), as well as EEQFNSTYR and TKPREEQFNSTYR (IgG4) were screened. Glycopeptide ion abundances were integrated over the respective chromatographic peak of monoisotopic ions (<10 ppm) and within ±2 min retention-time interval. Glycan abundances were normalized to the total content of Fc-glycosylated IgG1 or IgG2/(3) and IgG4 peptides. Glycan nomenclature is according to Royle et al. [22].
4.6. Statistical Analysis
Univariate analyses were performed using Student’s t-test (with equal or unequal variance depending on F-test). P-values were corrected according to Bonferroni (BF) and false discovery rate (FDR). Multivariate modeling using PCA and Orthogonal projections to latent structures discrimination analysis (OPLS-DA) was performed using SIMCA 15.0 following mean centering, log transformation, and UV scaling. OPLS-DA models were further validated using predicted scores t(tPS) of samples treated as unknowns. Pathway analysis was performed using STRING with a minimum required interaction score of 0.7.
Acknowledgments
We thank the research nurses Helene Blomqvist, Margitha Dahl, and Gunnel de Forest, as well as biomedical analyst Benita Dahlberg for sample collection and preparation. We also thank the volunteering patients and healthy controls for making the study possible.
Abbreviations
ADAMTS10 | A disintegrin and metalloproteinase with thrombospondin motifs 10 |
ADAMTS19 | A disintegrin and metalloproteinase with thrombospondin motifs 19 |
ADAMTS6 | A disintegrin and metalloproteinase with thrombospondin motifs 6 |
AGAP3 | Arf-GAP with GTPase, ANK repeat and PH domain-containing protein 3 |
ALB | Serum albumin |
ANXA2 | Annexin A2 |
APC | Adenomatous polyposis coli protein |
APOA1 | Apolipoprotein A-I |
APOA2 | Apolipoprotein A-2 |
APOB | Apolipoprotein B |
AQR | RNA helicase aquarius |
ARG1 | Arginase-1 |
ATIC | Bifunctional purine biosynthesis protein PURH |
ATP4A | Potassium-transporting ATPase alpha chain 1 |
C2 | Complement 2 |
C3 | Complement 3 |
C8G | Complement component C8 gamma chain |
CASP14 | Caspase-14 |
CAT | Catalase |
CAPN5 | Calpain-5, PLG, Plasminogen |
CD109 | CD109 antigen |
CD55 | Complement decay-accelerating factor |
CFB | Complement factor B |
CFI | Complement factor I |
CHIT1 | Chitotriosidase-1 |
CLSTN1 | Calsyntenin-1 |
COPB1 | Coatomer subunit beta |
CST6 | Cystatin-M |
CSTA | Cystatin-A |
CSTB | Cystatin-B |
CTSH | Pro-cathepsin H |
CTSS | Cathepsin S |
DCD | Dermcidin |
DUSP14 | Dual specificity protein phosphatase 14 |
DYNC2H1 | Cytoplasmic dynein 2 heavy chain 1 |
ERCC6L | DNA excision repair protein ERCC-6-like |
F11R | Junctional adhesion molecule A |
F2 | Prothrombin |
FABP4 | Fatty acid-binding protein, adipocyte |
FETUB | Fetuin-B |
FKBP1A | Peptidyl-prolyl cis-trans isomerase |
GAA | Lysosomal alpha-glucosidase |
GAPDH | Glyceraldehyde-3-phosphate dehydrogenase |
GCHFR | GTP cyclohydrolase 1 feedback regulatory protein |
GIG25 | Serpin peptidase inhibitor |
GSTP1 | Glutathione S-transferase P |
GSN | Gelsolin |
HBB | Hemoglobin subunit beta |
HEXDC | Hexosaminidase D |
HMHA1 | Rho GTPase-activating protein 45 |
HSPB1 | Heat shock protein beta-1 |
HPX | Hemopexin |
INSR | Insulin receptor |
KIF21B | Kinesin-like protein KIF21B |
KLKB1 | Plasma kallikrein |
LTF | Lactotransferrin |
NPC2 | NPC intracellular cholesterol transporter 2 homolog a |
MAPK8IP2 | C-Jun-amino-terminal kinase-interacting protein 2 |
MICAL1 | [F-actin]-monooxygenase MICAL1 |
MYO3B | Myosin-IIIb |
MYRF | Myelin regulatory factor |
NAPSA | Napsin-A |
PARK7 | Protein/nucleic acid deglycase DJ-1 |
PEBP1 | Phosphatidylethanolamine-binding protein 1 |
PIP | Prolactin-inducible protein |
PLCL1 | Inactive phospholipase C-like protein 1 |
PTPRQ | Phosphatidylinositol phosphatase, PTPRQ |
RAB26 | Ras-related protein Rab-26 |
PREX1 | Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 1 protein |
PSMD1 | 26S proteasome non-ATPase regulatory subunit 1 |
RARRES3 | Retinoic acid receptor responder protein 3 |
RASEF | Ras and EF-hand domain-containing protein |
RGS2 | Regulator of G-protein signaling 2 |
SCGB1A1 | Uteroglobin |
SERPINB3 | Serpin B3 |
SERPINB10 | Serpin B10 |
SERPINB12 | Serpin B12 |
SLPI | Antileukoproteinase |
SOD1 | Superoxide dismutase [Cu-Zn] |
SPINT2 | Kunitz-type protease inhibitor 2 |
TINAG | Tubulointerstitial nephritis antigen |
TLL1 | Tolloid-like protein 1 |
TMEM55A | Type 2 phosphatidylinositol 4,5-bisphosphate 4-phosphatase |
TMPRSS3 | Transmembrane protease serine 3 |
TXN | Thioredoxin |
USP16 | Ubiquitin carboxyl-terminal hydrolase 16 |
USP43 | Ubiquitin carboxyl-terminal hydrolase 43 |
WFDC2 | WAP four-disulfide core domain protein 2 |
WFIKKN2 | WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing protein 2 |
Supplementary Materials
The following are available online at https://www.mdpi.com/1422-0067/20/9/2157/s1, Figure S1. PCA analyses of (A) the data acquired from the BALF samples and (B) the data acquired from the serum samples. Note that particularly the BALF- but also the serum samples cluster according to healthy controls (HC) and sarcoidosis (Löfgren’s Syndrome and non- Löfgren’s Syndrome; LS and nonLS), Supplemental Table S1. All charachterized features in the data set sorthed according to significant differences in the Controls versus Sarcoidosis patients; Table S2. All charachterized features in the data set sorthed according to significant differences in the nonLS versus LS patients.
Author Contributions
J.G., R.A.Z., T.H., E.W., and S.L.L. designed the study; EW characterized Sarcoid subjects, A.E. performed bronchoscopies and generated clinical data; J.G. and R.A.Z. provided financial support and essential infrastructure; T.H., E.W., and S.L.L. performed IgG purifications, S.L.L. performed MS analysis, B.Z., T.H., E.W., and S.L.L. performed data analyses, S.L.L. interpreted data and drafted the manuscript. All authors participated in manuscript editing.
Funding
This study was supported by The Swedish Heart-Lung Foundation, The Swedish Research Council, King Gustaf V:s and Queen Victoria’s Freemasons´ Foundation, The Stockholm County Council, The Center for Inflammatory Diseases, and Karolinska Institutet. RAZ and SLL are funded by the EUROPEAN Union Horizon 2020 research program.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Iannuzzi M.C., Rybicki B.A., Teirstein A.S. Sarcoidosis. N. Engl. J. Med. 2007;357:2153–2165. doi: 10.1056/NEJMra071714. [DOI] [PubMed] [Google Scholar]
- 2.Grunewald J., Eklund A. Lofgren’s syndrome: Human leukocyte antigen strongly influences the disease course. Am. J. Respir. Crit. Care Med. 2009;179:307–312. doi: 10.1164/rccm.200807-1082OC. [DOI] [PubMed] [Google Scholar]
- 3.Fang C., Huang H., Xu Z. Immunological evidence for the role of mycobacteria in sarcoidosis: A meta-analysis. PLoS ONE. 2016;11:e0154716. doi: 10.1371/journal.pone.0154716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ferrara G., Valentini D., Rao M., Wahlstrom J., Grunewald J., Larsson L.O., Brighenti S., Dodoo E., Zumla A., Maeurer M. Humoral immune profiling of mycobacterial antigen recognition in sarcoidosis and lofgren’s syndrome using high-content peptide microarrays. Int J. Infect. Dis. 2017;56:167–175. doi: 10.1016/j.ijid.2017.01.021. [DOI] [PubMed] [Google Scholar]
- 5.Wahlstrom J., Dengjel J., Winqvist O., Targoff I., Persson B., Duyar H., Rammensee H.G., Eklund A., Weissert R., Grunewald J. Autoimmune t cell responses to antigenic peptides presented by bronchoalveolar lavage cell hla-dr molecules in sarcoidosis. Clin. Immunol. 2009;133:353–363. doi: 10.1016/j.clim.2009.08.008. [DOI] [PubMed] [Google Scholar]
- 6.Haggmark A., Hamsten C., Wiklundh E., Lindskog C., Mattsson C., Andersson E., Lundberg I.E., Gronlund H., Schwenk J.M., Eklund A., et al. Proteomic profiling reveals autoimmune targets in sarcoidosis. Am. J. Respir. Crit. Care Med. 2015;191:574–583. doi: 10.1164/rccm.201407-1341OC. [DOI] [PubMed] [Google Scholar]
- 7.Lundstrom S.L., Zhang B., Rutishauser D., Aarsland D., Zubarev R.A. Spotlight proteomics: Uncovering the hidden blood proteome improves diagnostic power of proteomics. Sci. Rep. 2017;7:41929. doi: 10.1038/srep41929. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Fernandes-Cerqueira C., Renard N., Notarnicola A., Wigren E., Graslund S., Zubarev R.A., Lundberg I.E., Lundstrom S.L. Patients with anti-jo1 antibodies display a characteristic igg fc-glycan profile which is further enhanced in anti-jo1 autoantibodies. Sci. Rep. 2018;8:17958. doi: 10.1038/s41598-018-36395-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lundstrom S.L., Fernandes-Cerqueira C., Ytterberg A.J., Ossipova E., Hensvold A.H., Jakobsson P.J., Malmstrom V., Catrina A.I., Klareskog L., Lundberg K., et al. Igg antibodies to cyclic citrullinated peptides exhibit profiles specific in terms of igg subclasses, fc-glycans and a fab-peptide sequence. PLoS ONE. 2014;9:e113924. doi: 10.1371/journal.pone.0113924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Heyder T., Wiklundh E., Eklund A., James A., Grunewald J., Zubarev R.A., Lundstrom S.L. Altered fc galactosylation in igg4 is a potential serum marker for chronic lung disease. Erj. Open Res. 2018:4. doi: 10.1183/23120541.00033-2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Salazar A., Mana J., Pinto X., Argimon J.M., Hurtado I., Pujol R. Corticosteroid therapy increases hdl-cholesterol concentrations in patients with active sarcoidosis and hypoalphalipoproteinemia. Clin. Chim. Acta. 2002;320:59–64. doi: 10.1016/S0009-8981(02)00046-3. [DOI] [PubMed] [Google Scholar]
- 12.Bargagli E., Rosi E., Pistolesi M., Lavorini F., Voltolini L., Rottoli P. Increased risk of atherosclerosis in patients with sarcoidosis. Pathobiology. 2017;84:258–263. doi: 10.1159/000477736. [DOI] [PubMed] [Google Scholar]
- 13.Ivanisevic J., Kotur-Stevuljevic J., Stefanovic A., Jelic-Ivanovic Z., Spasic S., Videnovic-Ivanov J., Vucinic-Mihailovic V., Ilic J. Dyslipidemia and oxidative stress in sarcoidosis patients. Clin. Biochem. 2012;45:677–682. doi: 10.1016/j.clinbiochem.2012.03.009. [DOI] [PubMed] [Google Scholar]
- 14.Chen E.S., Song Z., Willett M.H., Heine S., Yung R.C., Liu M.C., Groshong S.D., Zhang Y., Tuder R.M., Moller D.R. Serum amyloid a regulates granulomatous inflammation in sarcoidosis through toll-like receptor-2. Am. J. Respir. Crit. Care Med. 2010;181:360–373. doi: 10.1164/rccm.200905-0696OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Agostini C., Garbisa S., Trentin L., Zambello R., Fastelli G., Onisto M., Cipriani A., Festi G., Casara D., Semenzato G. Pulmonary alveolar macrophages from patients with active sarcoidosis express type iv collagenolytic proteinase. An enzymatic mechanism for influx of mononuclear phagocytes at sites of disease activity. J. Clin. Invest. 1989;84:605–612. doi: 10.1172/JCI114205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Karimi R., Tornling G., Grunewald J., Eklund A., Skold C.M. Cell recovery in bronchoalveolar lavage fluid in smokers is dependent on cumulative smoking history. PLoS ONE. 2012;7:e34232. doi: 10.1371/journal.pone.0034232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Statement on Sarcoidosis Joint statement of the american thoracic society (ats), the european respiratory society (ers) and the world association of sarcoidosis and other granulomatous disorders (wasog) adopted by the ats board of directors and by the ers executive committee, february 1999. Am. J. Respir. Crit. Care Med. 1999;160:736–755. doi: 10.1164/ajrccm.160.2.ats4-99. [DOI] [PubMed] [Google Scholar]
- 18.Chi H., Chen H., He K., Wu L., Yang B., Sun R.X., Liu J., Zeng W.F., Song C.Q., He S.M., et al. Pnovo+: De novo peptide sequencing using complementary hcd and etd tandem mass spectra. J. Proteome Res. 2013;12:615–625. doi: 10.1021/pr3006843. [DOI] [PubMed] [Google Scholar]
- 19.Zhang B., Kall L., Zubarev R.A. Demix-q: Quantification-centered data processing workflow. Mol. Cell Proteom. 2016;15:1467–1478. doi: 10.1074/mcp.O115.055475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhang B., Pirmoradian M., Chernobrovkin A., Zubarev R.A. Demix workflow for efficient identification of cofragmented peptides in high resolution data-dependent tandem mass spectrometry. Mol. Cell Proteom. 2014;13:3211–3223. doi: 10.1074/mcp.O114.038877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lundstrom S.L., Yang H., Lyutvinskiy Y., Rutishauser D., Herukka S.K., Soininen H., Zubarev R.A. Blood plasma igg fc glycans are significantly altered in alzheimer’s disease and progressive mild cognitive impairment. J. Alzheimers Dis. 2014;38:567–579. doi: 10.3233/JAD-131088. [DOI] [PubMed] [Google Scholar]
- 22.Royle L., Campbell M.P., Radcliffe C.M., White D.M., Harvey D.J., Abrahams J.L., Kim Y.G., Henry G.W., Shadick N.A., Weinblatt M.E., et al. Hplc-based analysis of serum n-glycans on a 96-well plate platform with dedicated database software. Anal. Biochem. 2008;376:1–12. doi: 10.1016/j.ab.2007.12.012. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.