Abstract
For mass spectrometry-based diagnostics of microorganisms, matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is currently routinely used to identify urinary tract pathogens. However, it requires a lengthy culture step for accurate pathogen identification, and is limited by a relatively small number of available species in peptide spectral libraries (≤3329). Here, we propose a method for pathogen identification that overcomes the above limitations, and utilizes the MALDI-TOF/TOF MS instrument. Tandem mass spectra of the analyzed peptides were obtained by chemically activated fragmentation, which allowed mass spectrometry analysis in negative and positive ion modes. Peptide sequences were elucidated de novo, and aligned with the non-redundant National Center for Biotechnology Information Reference Sequence Database (NCBInr). For data analysis, we developed a custom program package that predicted peptide sequences from the negative and positive MS/MS spectra. The main advantage of this method over a conventional MALDI-TOF MS peptide analysis is identification in less than 24 h without a cultivation step. Compared to the limited identification with peptide spectra libraries, the NCBI database derived from genome sequencing currently contains 20,917 bacterial species, and is constantly expanding. This paper presents an accurate method that is used to identify pathogens grown on agar plates, and those isolated directly from urine samples, with high accuracy.
Keywords: uropathogenic infection, tandem mass spectrometry, de novo peptide sequencing, peptide identification software
1. Introduction
In clinical microbiology, methods and techniques for reliable, direct identification of pathogens in human body fluids are continuously investigated. Mass spectrometry-based methods, especially matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS), have been successfully used in routine clinical characterization of pathogens for the past decades [1]. In most cases, competing methods for microbial identification include 16S ribosomal rRNA gene sequencing [2] and real-time polymerase chain reaction (RT-PCR) [3]. Although 16S rRNA sequencing is a widely used method to study the phylogenetic relationships among bacteria, it has low discriminatory power between related species [4]. Another DNA-based method commonly used in clinical practice to identify pathogens is the RT-PCR method. It is a rapid and sensitive method that allows high-throughput analysis, and even the quantification of pathogens [5]. However, the primers of RT-PCR target one or more bacterial species, so non-targeted pathogens cannot be identified. Spectroscopic methods such as Fourier transform near-infrared (FT-NIR) [6] and Raman spectroscopy [7] are also applicable for pathogen identification. These methods are rapid, non-invasive, inexpensive, and, therefore, suitable for routine quality control analyzes. However, the aforementioned technologies are limited by low sensitivity, a relatively small number of reference species (approximately 100), strong background interferences (e.g., water), and, finally, the cultivation of microorganisms [8]. Moreover, the interpretation of the spectral data obtained is often ambiguous, and spectroscopic instruments must be coupled with other standard methods to ensure their reliability. Despite all the advantages and disadvantages of the alternative methods, MALDI-TOF MS is the predominant and most reliable technique in clinical practice for the rapid and cost-effective identification of a wide range of pathogens [9].
Standard urine culture analysis is the most sought-after test in clinical laboratories, as urinary tract infections (UTI) affect more than 150 million people worldwide each year. The most common pathogens of UTIs are uropathogenic Escherichia coli, followed by Klebsiella pneumoniae, Proteus mirabilis, Staphylococcus saprophyticus, Enterococcus faecalis, Group B Streptococcus, Pseudomonas aeruginosa, Staphylococcus aureus, and Candida species [9]. The main disadvantage of urine-culture-based identification methods is the long cultivation time (36–72 h) and possible non-specific identification [10]. Due to the high prevalence of urinary tract infections mentioned earlier, improvement of identification methods in terms of speed, accuracy, and more meaningful identification is an ongoing need in clinical microbiology research. Several bottom-up proteomic studies concerning MS/MS microbial identification were published. In these studies, liquid chromatography–tandem mass spectrometry (LC–MS/MS) and database search tools were used to identify pure microbial cultures [11,12,13,14,15]. Using LC–MS/MS and various database search tools, researchers compared experimentally obtained MS/MS spectra of analyzed peptides with the in-silico-generated peptide fragmentation spectra in the protein database. Although the LC–MS/MS approach to microorganism identification results in accurate and robust species identification, the protein databases used often encounter obstacles due to the small number of available species. The standard identification procedure usually consists of the step of bacterial cultivation, MALDI-TOF MS analysis of proteins in the mass range of 2–20 kDa, and spectral alignment of the obtained signals with the reference spectral library. To speed up the process, and avoid a time-consuming cultivation step, several studies adapted a MALDI-TOF MS method to identify pathogens directly from common clinical samples such as urine [10,16,17,18,19,20] or a positive blood culture [21,22,23]. However, direct analysis of samples may be subject to low confidence levels due to polymicrobial or human protein interference [17]. In addition, a recent article described a method for direct identification of UTIs using tandem mass spectrometry, machine learning [24], and bottom-up metaproteomic analysis [25].
A widely used, high-resolution LC–MS/MS technique for proteomic analysis is not a standard in clinical research because of the need for highly skilled personnel, long analysis time, and high maintenance costs [26]. Our method utilizes a MALDI-TOF/TOF MS instrument that overcomes some of the limitations of MALDI-TOF MS and LC–MS/MS strategies. Compared with LC–MS/MS, MALDI-TOF MS and MALDI-TOF MS/MS instruments are much easier to integrate into hospital pipelines, because of their relative simplicity of use and lower operating costs. In addition, the MALDI MS technique is less susceptible to contamination than LC–MS/MS [27]. Unlike MS, MS/MS analysis allows sequencing of each peptide individually, so major interferences from human proteins or other protein contaminants do not interfere with bacterial identification. From this point of view, the optimal tool for pathogen identification in clinical microbiology should include the analytical methods MALDI MS and MS/MS. Therefore, the newly developed de novo sequencing identification method in this research is a complementary improvement to the currently available identification techniques.
We present here a novel bottom-up proteomics approach for the direct identification of microorganisms from a urine sample by tandem mass spectrometry using the MALDI-TOF/TOF MS instrument and de novo peptide sequencing (Figure 1). The method is based on in-solution N-terminal tryptic peptide derivatization by 4- or 5-formyl-1,3-benzenedisulfonic acid (FBDSA), which is described in detail elsewhere [28,29,30]. Derivatization allows de novo sequencing of positive and negative peptide mass spectra obtained after analysis of MALDI-MS/MS. Newly developed software called Protein Acrobat calculates the most probable peptide sequences from the aligned negative MS/MS and positive MS/MS mass spectra using a graph-based algorithm. The resulting list of probable peptides was aligned with the NCBInr database using the protein basic local alignment search tool (BLASTp) [31], and the results were scored according to the expected (E) value. Derived peptides above the predefined E-value threshold were assigned to a species in the database using a peptide-matching algorithm based on the Apache Lucene search engine [32]. The presented positive and negative de novo peptide sequencing approach was applied to the identification of UTI pathogens directly from urine samples.
2. Results
2.1. Results of Standard Urine Culture Test and Identification of Urine Bacterial Isolates Grown on Blood Agar Plates
A standard urine culture test, described in detail in Materials and Methods, was used to determine the number of viable bacteria in urine specimens from patients with a confirmed UTI. The biochemical test “European Guidelines for Urinalysis” [33] was applied to determine the causative agent of the infection and later comparison with the results of mass spectrometry performed in this research. Urine samples from 33 patients were collected on a workday (medium-sized hospital) using the clean-catch method and yielded a bacterial count of 105 colony-forming units (CFU/mL) or more. The samples were used for comparison between direct MALDI-TOF/TOF MS UTI identification and standard urine culture test.
The microorganisms from 33 collected urine samples were previously identified by standard urine culture test as: E. coli (16 samples), P. aeruginosa (4 samples), E. faecalis (3 samples), K. pneumoniae (3 samples), P. mirabilis (3 samples), and 1 sample each of Citrobacter freundii, Stenothropomonas maltophilia, Candida albicans, and Klebsiella aerogenes.
Proof of concept for the MALDI-TOF/TOF MS method for direct identification of UTIs was confirmed and validated on pure urine colonies of E. coli, P. mirabilis, P. aeruginosa, and K. pneumoniae grown on blood agar plates. For each designated species, ten biological replicates were processed and analyzed using MALDI-TOF/TOF MS as described in Materials and Methods. Tandem mass spectra were acquired in positive and negative ion modes, and peptide sequences were determined using the Protein Acrobat de novo sequencing algorithm.
2.2. Protein Acrobat De Novo Sequencing Algorithm
The data obtained after positive and negative MS/MS analysis in mascot generic format (MGF) were processed using Protein Acrobat. First, the b-ion series were converted from MS/MS negative to y-ions in MS/MS positive, following a peptide fragmentation rule that the sum of the nominal m/z of the b-ions and the corresponding y-ions equals the nominal value of the precursor mass and the mass of two hydrogen atoms:
m/z (y) = MH+ − m/z (b) + H+ | (1) |
Both spectra were normalized to account the lower peak intensities in the positive spectrum, which was achieved by dividing the spectra by their respective peak maxima. Peaks with a mass within a specified range (0.01 Da) were merged into a single spectrum to remove redundant peaks from positive and negative MS/MS spectra. Figure 2 shows an example of processing positive and negative MS/MS spectra of the peptide belonging to the putative stress response protein (accession number STS794191) found in K. pneumoniae species. The resulting mass spectrum was converted into a graphical spectrum, in which the nodes represent m/z values. The weighting of each node corresponds to the normalized intensity value. Edges were formed between nodes with a mass difference of any amino acid mass. Dynamic programming techniques were used to find paths between the starting node (zero mass) and the ending node (MS ion m/z mass), using the sum of the maximum node weights as the path score. All possible amino acid sequences were determined and scored relative to the overall peak intensity of fragmented ions. The ten highest scoring peptide sequences were matched to a filtered NCBI nr database using the BLASTp tool integrated with Protein Acrobat. The filtered NCBI nr database consisted of 48 common uropathogenic species adapted from Pinault et al. 2019 [11]. The complete list of the 48 bacteria is shown in Table S1 in the Supplementary Material. The E-value was set in the range of 0.001–1, and was 0.1 depending on how many peptides matched the corresponding species.
Figure 3 shows graphical results of peptide matches for pure colony samples of E. coli, P. mirabilis, K. pneumoniae, and P. aeruginosa. The average number of peptides matched to the corresponding bacterial species is 129 for E. coli, 42 for P. mirabilis, 54 for K. pneumoniae, and 29 for P. aeruginosa. The data shown in the graph are combined from 10 analyses per species, and normalized to the sum of identified peptides. It shows the probability of the identified species as a function of the number of matched peptides. To simplify the plot, only the first five species with the most peptide matches are shown in the graph.
2.3. Direct Identification of Uropathogenic Bacteria from Urine Samples
To determine the accuracy of the UTI identification results, decision tree learning was used to define the confidence level of bacterial identification. For simplicity, the input dataset was obtained from the analysis of pure colonies of E. coli, P. mirabilis, K. pneumoniae, and P. aeruginosa. The original dataset of 40 measurements (10 replicates for each bacterial species) was expanded using the mixup technique [34]. An expanded dataset defines the confidence level of direct bacterial identification of urine samples as follows: high confidence, medium confidence, and low confidence, denoted by the numbers 3, 2, and 1, respectively. Of the 33 urine samples analyzed, 29 samples are successfully identified. Protein Acrobat results for all urine samples are shown in Table 1. A high-confidence level (score 3) is reported for 22 samples, a medium-confidence level (score 2) for 7 samples, and a low = confidence level (score 1) for 4 samples. The low-confidence level samples are misidentified in three of four cases; K. pneumoniae and C. freundii are identified as E. coli and K. pneuomoniae, respectively. The remaining low-confidence level sample identified as K. aerogenes by the standard urine culture test is not identified by the direct identification method. In other words, it has an equal number of peptide matches with K. aerogenes, K. pneumoniae, Enterobacter kobei, and Enterobacter cloacae species. On the other hand, all samples with a medium-confidence level are correctly matched. Only 1 high-confidence level sample is misidentified (Citrobacter freundii), and the remaining 21 high-confidence level samples match the results of the standard urine culture test. The direct bacterial identification pipeline utilizing MALDI-MS/MS and Protein Acrobat is graphically depicted in Figure S1.
Table 1.
Standard Urine Culture |
Proteomic Method | Conf. 1 | Standard Urine Culture |
Proteomic Method | Conf. 1 |
---|---|---|---|---|---|
Klebsiella pneumoniae | Escherichia coli | 1 | Escherichia coli | Escherichia coli | 3 |
Klebsiella aerogenes | /2 | 1 | Escherichia coli | Escherichia coli | 3 |
Klebsiella pneumoniae | Escherichia coli | 1 | Escherichia coli | Escherichia coli | 3 |
Escherichia coli | Escherichia coli | 1 | Escherichia coli | Escherichia coli | 3 |
Candida albicans | Candida albicans | 2 | Escherichia coli | Escherichia coli | 3 |
Escherichia coli | Escherichia coli | 2 | Escherichia coli | Escherichia coli | 3 |
Escherichia coli | Escherichia coli | 2 | Escherichia coli | Escherichia coli | 3 |
Klebsiella pneumoniae | Klebsiella pneumoniae | 2 | Escherichia coli | Escherichia coli | 3 |
Enterococcus faecalis | Enterococcus faecalis | 2 | Enterococcus faecalis | Enterococcus faecalis | 3 |
Proteus mirabilis | Proteus mirabilis | 2 | Enterococcus faecalis | Enterococcus faecalis | 3 |
Stenotrophomonas maltophilia | Stenotrophomonas maltophilia | 2 | Pseudomonas aeruginosa | Pseudomonas aeruginosa | 3 |
Citrobacter freundii | Klebsiella pneumoniae | 3 | Pseudomonas aeruginosa | Pseudomonas aeruginosa | 3 |
Escherichia coli | Escherichia coli | 3 | Pseudomonas aeruginosa | Pseudomonas aeruginosa | 3 |
Escherichia coli | Escherichia coli | 3 | Pseudomonas aeruginosa | Pseudomonas aeruginosa | 3 |
Escherichia coli | Escherichia coli | 3 | Proteus mirabilis | Proteus mirabilis | 3 |
Escherichia coli | Escherichia coli | 3 | Proteus mirabilis | Proteus mirabilis | 3 |
Escherichia coli | Escherichia coli | 3 |
1 Confidence level: high-confidence level score, 3; medium-confidence level score, 2; and low-confidence level score, 1. 2 Not identified.
3. Discussion
Pathogen identification using tandem mass spectrometry is a developing methodology with promising results. Accurate identification of samples derived from pure colonies is shown in several publications [13,35]. However, the main advantage of this methodology is the possibility to directly use clinical samples, thus, avoiding the lengthy culture step [36]. Direct identification analysis was performed on urine samples with a bacterial count of 105 CFU/mL or greater. Accurate identification at bacterial counts less than 105 CFU/mL can be achieved using more sensitive nano LC-ESI–MS/MS instruments [24]. However, it is shown that accuracy in the aforementioned studies increases dramatically when a larger amount (>105 CFU/mL) of bacteria is present in the urine. In fact, high resolution nanoLC-ESI–MS/MS allows the identification of a larger number of peptides, especially low abundance peptides, resulting in better species discrimination and, thus, better identification.
Extracted proteins from bacterial cultures and urine samples were digested prior to identification. The resulting complex peptide mixtures were separated into 36 fractions prior to mass spectra analysis, to increase proteome coverage and facilitate identification of low abundance proteins [37]. As MALDI-TOF/TOF MS is not coupled to a liquid chromatography system, peptides were separated offline in two dimensions using micro-chromatography cartridges. A strong anion exchanger, specifically a quaternary ammonium cation (QMA), was chosen as the stationary phase in the first dimension because peptide derivatization with 4-formyl-1,3-benzenedisulfonic acid (Materials and Methods, Section 4.4.), as described, enhances the binding of the peptides to the functional groups of the QMA stationary phase. Six eluted fractions in the first dimension were further separated using a SCX stationary phase, to achieve greater separation of peaks in MS analysis. Using a combination of anion and cation exchange, the samples were fractioned and desalted at the same time, producing a clean sample read for the MS analysis. Although direct UTI identification was achieved in ≤24 h, the protocol can be further accelerated by rapid trypsin digestion [38,39] and MALDI-TOF/TOF MS instruments with 2000 Hz laser repetition rates (10× faster acquisition).
Pure bacterial cultures of E. coli, K. pneumoniae, P. mirabilis, and P. aeruginosa were analyzed in ten replicates to assess the qualitative differences of the identified peptides. Although the method was validated using pure cultures, this study aimed to develop a rapid and accurate protocol for the identification of pathogens directly from urine samples, without prior cultivation. Since direct analysis results in an overall lower peptide count than pure cultures, a classification model was applied to evaluate the confidence level of species identification. It is found that 87.8% of the samples could be correctly assigned with high- and medium-confidence levels. Misidentifications occur in samples with low-confidence levels and in one sample with a high-confidence level. The incorrect results involve three samples of Klebsiella species and one C. freundii. The results could be explained by the mixed-species samples (e.g., two or more species in the sample), contaminated urine samples (e.g., large number of human cells), or the high genomic similarity between Klebsiella, Escherichia, and Citrobacter species [40,41], which makes them difficult to distinguish when only a small number of peptides are identified.
However, it is worth mentioning that the identification of Klebsiella species, which was challenging in direct analysis, was unambiguous when analyzing pure colonies. The identification method was based on de novo sequencing of tryptic peptides, and subsequent peptide sequence alignment with 48 common uropathogenic bacterial species present in the NCBInr database (Table S1). De novo peptide sequencing is a promising identification tool in proteomics that has been investigated over the past 20 years [42]. Despite various de novo sequencing algorithms and extensive research, there is an apparent lack of wider application, especially in the identification of complex protein mixtures [43,44]. This can be attributed to a few drawbacks of MS/MS spectra sequencing such as insufficient mass accuracy relative to non-linear sequences data structure consisting of determined amino acid at nodes and edges (e.g., an average MS/MS spectrum could be branched by the algorithm in several hundred sequences), and uneven fragmentation patterns caused by missing fragment ions. The named drawbacks, along with the noise present in the MS/MS spectra (Figures S2–S4), hamper the complete detection of long sequences (larger than 20 amino acids), and contribute to sequence ambiguities. Disulfonic peptide derivatization at the N-terminus improves amino acid sequence recognition, by allowing the revelation of only b- ions in negative ion mode MS/MS, and y- ions in positive ion mode MS/MS [45]. The de-novo-identified peptides were aligned with protein sequences in the database, and microorganism species were determined based on the highest number of matched peptides. The described identification method can also be used to identify other pathogens in clinical, environmental, or food samples. However, rare pathogens or other species of interest could be identified by searching the entire NCBInr database, which increases the search time but, on the other hand, allows the identification of virtually any species in the database.
4. Materials and Methods
4.1. Raw Materials
Sodium dodecyl sulphate (SDS; ≥99.9%), sodium dihydrogen phosphate (≥99.9%), formic acid (>98%), hydrochloric acid (>36.5%), DL-dithiothreitol (DTT; ≥99.9%), urea (≥99.9%), cyano-4-hydroxy cinnamic acid (CHCA; ≥99.9%), proteomic grade of trypsin from porcine pancreas, and 4-formyl-1,3-benzenedisulfonic acid disodium salt hydrate (97%) were obtained from Sigma-Aldrich (St. Louis, MO, USA); sodium cyanoborohydride from G-Biosciences (St. Louis, MO, USA); and ammonium bicarbonate (≥99.9%), acetonitrile (≥99.9%), potassium dihydrogen phosphate (≥99.9%), Tris base (≥99.9%), and Microcon-30 kDa 0.5 mL centrifugal filter devices were purchased from Merck Millipore (Burlington, MA, USA). Sep-Pak Accell Plus Vac cartridges as QMA stationary phase was obtained from Waters (Milford, MA, USA), and SCX stationary phase in bulk and resin-free AssayMAP cartridges were obtained from Agilent Technologies (St. Clara, CA, USA).
4.2. Urine Sample Collection Ethics
Clean-voided, midstream urine specimens were collected from outpatients who were referred for urine culture and analysis to the Department for Clinical Microbiology and Hospital Infections, University Hospital Dubrava, Zagreb, Croatia. The urine samples were tested for bacterial growth on culture plates. The number of bacteria was determined by colony counting on an agarose MacConkey plate, and the species were determined by biochemical identification, according to standard procedures [33].
4.3. Protein Extraction
Bacterial colonies from agar plates were collected with a microbiological loop and mixed with 400 µL of 50 mM ammonium bicarbonate. The bacterial suspension was centrifuged at 8000× g for 10 min at 4 °C to wash the cells, and the supernatant was discarded. Proteins extracted directly from urine samples were obtained by the following protocol: a volume of 15 mL of thawed urine sample was centrifuged at 500× g for 1 min at room temperature, to separate microbial cells from larger particles such as salt crystals and leukocytes. The supernatant was centrifuged at 4500× g for 30 min at 4 °C to obtain bacterial pellet, which was then resuspended in 400 µL of 50 mm ammonium bicarbonate, and again centrifuged at 8000× g for 10 min at 4 °C. The wash step with ammonium bicarbonate was repeated one more time. From this point on, further sample preparation protocol and analysis were the same for bacterial biomass obtained from pure colonies and urine samples.
Cell lysis was achieved by resuspending the bacterial pellet in 200 µL of lysis buffer containing 2% of sodium SDS, 100 mm Tris/HCl pH 7.8, and 0.1 M DTT. After incubation at room temperature for 10 min, cells were disrupted by sonication using the Sonoplus mini20 ultrasonic homogenizer (Bandelin, Berlin, Germany). Samples were sonicated for five cycles of 1 min, with a 30 s cooling interval between cycles, and the sonication amplitude was set to 50%. Afterwards, samples were placed into boiling water for 2 min to inhibit protease activity. The cell lysate was removed from each sample by centrifugation at 16,000× g for 20 min at 4 °C, and the protein fraction contained in supernatant was further processed.
4.4. Protein Purification and Digestion
The FASP protocol was adapted from Yu et al., and conducted with slight modifications [46]. In brief, the Microcon centrifugal filter devices were rinsed with 200 μL of ultrapure water and 200 μL urea solution (8 M urea in 100 mm Tris/HCL buffer). An aliquot of 100 µL of extracted protein mixture was added to a filter device, mixed with 200 μL of urea solution, and centrifuged at 12,000× g for 20 min at 20 °C. Flow through was discarded, and 200 μL of urea solution was added to a filter and spun at 12,000× g for 15 min at 20 °C. This step was repeated, and filter was further washed twice with 200 μL of ammonium bicarbonate buffer. Tryptic digestion was performed overnight by adding 2 μL of 1 mg/mL trypsin solution and 100 μL of ammonium bicarbonate buffer to the filter unit, and incubating the resulting solution overnight on a Comfort thermomixer (Eppendorf, Hamburg, Germany) at 37 °C and 500 rpm. After digestion, peptides were centrifuged of the filter device at 12,000× g for 15 min, followed by one wash step with 50 μL of ammonium bicarbonate buffer. The peptide solution was dried in vacuum concentrator at 30 °C.
4.5. Derivatization Procedure
Peptide derivatization procedure was previously described [29], and repeated here with modifications. Each sample containing dried tryptic peptides was reconstituted with 30 μL of derivatization solution. The derivatization solution contained 12.5 mM of 4-formyl-1,3-benzenedisulfonic acid disodium salt hydrate and 95.5 mm of sodium cyanoborohydride dissolved in 10 mm potassium dihydrogen phosphate pH 5. The derivatization was performed in a household microwave oven at 180 W for 8 min.
4.6. Two Dimensional Fractionation of Peptide Mixture
Sample fractionation was performed by AssayMAP Bravo Automated liquid handling platform (Agilent, St. Clara, CA, USA), using a pre-determined fractionation protocol. Dried peptides were reconstituted in 1% of ammonium hydroxide, and subjected to a strong anion exchange using QMA stationary phase. The stationary phase was transferred to a 5 µL resin-free cartridges. Eluted fractions were dried in a vacuum concentrator, then resuspended in 1% formic acid, and additionally fractioned by SCX stationary phase, according to the manufacturer’s protocol. Buffers used for QMA and SCX fractionation are shown in Table S2 in Supplementary Materials. Finally, the resulting 36 fractions from each sample were evaporated in a vacuum concentrator.
4.7. Mass Spectrometry
Dried peptide fractions were resuspended in 10 µL of 5 mg/mL CHCA dissolved in 50% acetonitrile, and a volume of 1 μL was spotted onto a MALDI plate. Mass spectrometry acquisition was performed with a MALDI-TOF/TOF MS 4800 Plus analyser (AB Sciex, Framingham, MA, USA) equipped with a 200 Hz, 355 nm neodymium-doped yttrium aluminium garnet Nd: YAGlaser. Ions in MS analysis were analyzed in reflectron negative ion mode. The instrument parameters were set using the 4000 Series Explorer software V 3.5.3 (AB Sciex, Framingham, MA, USA). Mass spectra were obtained by averaging 1800 laser shots covering a mass range of m/z 1000 to 4000. The precursor ions generated by negative ion MS were analyzed both in negative and positive MS/MS ion mode. Tandem mass spectrometry acquisition was achieved by 1 kV collision energy, without usage of collision gas. Precursor peak selection (LC–MALDI peak MS processing) was controlled by job-wide peak selection method. Peak selection was performed automatically, in order to analyze the MS spectra and acquire MS/MS data from selected peaks. Parameters of MS/MS peak selection were as follows: minimum S/N filter = 15, minimum chromatogram peak width = 3, maximum precursors per fraction = 20, fraction-to-fraction precursor mass tolerance = 200 ppm.
4.8. BLASTp Parameters
Parameters of the blast search were as follows: amino acid mass tolerance = 0.3; number of queries = 10; matrix = PAM30; word size = 2; window size = 40; threshold = 11; gap cost existence = 9; gap costs extension = 1. The E-value was set to 0.1, in order to show only relevant results. Peptides were matched to a custom urinary database by an in-house-developed application written in C#, which utilizes Apache-Lucene-based search engine as it is described elsewhere [32]. The results were grouped by the number of queries (peptides) matched to a species in the database where the highest number of peptide matches correspond to an identified species.
5. Conclusions
We successfully applied a bottom-up mass spectrometry approach to identify bacterial species causing urinary tract infections directly from urine samples. Overall, the identification of pathogens causing urinary tract infections is achieved within 24 h, without a prior cultivation step. A novel de novo peptide sequencing software was developed to provide accurate identification even when processing a large search database. Furthermore, a mathematic model was applied to estimate the reliability of the identification result. Taxonomic resolution was satisfactory, however, it can be improved with the use of more sensitive instrumentation and curated database. In addition to urinary tract infections, the strategy used in this study could also be used for the analysis of other clinical samples such as blood cultures and stool samples, therefore, providing a path for further improvement of widely used MALDI MS techniques (MALDI MS to MALDI MS/MS) in clinical microbiology.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/molecules27175461/s1, Figure S1: Workflow of five-step MALDI MS/MS identification of urine microorganisms. Figure S2: Mass spectra negative ion mode total signal-to-noise intensity per spot m/z 1000–3500. Figure S3: Tandem mass spectra negative ion mode total MS/MS signal-to-noise intensity per selected precursor ion spectrum. Figure S4: Tandem mass spectra positive ion mode total MS/MS signal-to-noise intensity per selected precursor ion spectrum. Table S1: The 48 common uropathogenic species adapted from Pinault et al. 2019 that were used to filter the NCBInr database to enable the BLASTp search, Table S2: Buffers used for two-dimensional peptide fractionation by AssayMAP Bravo liquid-handling platform.
Author Contributions
Conceptualization, L.D., E.S. and M.C.; methodology, E.S., M.C., T.T., A.Š. and A.L.B.; software, T.T., A.Š. and A.L.B.; validation, L.D., E.S., L.O. and A.J.; formal analysis, L.D., E.S. and L.O.; investigation, L.D., E.S., A.J. and M.C.; resources, M.C., A.J. and M.Č.; data curation, L.D., E.S., M.C., T.T., A.Š. and A.L.B.; writing—original draft preparation, L.D., E.S., L.O. and A.L.B.; writing—review and editing, M.C.; visualization, L.D., E.S., L.O. and A.L.B.; supervision, M.C.; funding acquisition, M.C. and T.T. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of University Hospital Dubrava (protocol name “Identification of Uropathogens to a Sub-Species Level Directly from Urine Samples Using Mass Spectrometry”, 13 November 2019).
Informed Consent Statement
Written informed consent was obtained from the patients to publish this paper.
Data Availability Statement
All data are provided in the Supplementary Materials or are available on request.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research was funded by the European Regional Development Fund “Qua/Qua” project, grant number KK.01.1.1.07.0023.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Cheng K., Chui H., Domish L., Hernandez D., Wang G. Recent Development of Mass Spectrometry and Proteomics Applications in Identification and Typing of Bacteria. Prot. Clin. Appl. 2016;10:346–357. doi: 10.1002/prca.201500086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Janda J.M., Abbott S.L. 16S RRNA Gene Sequencing for Bacterial Identification in the Diagnostic Laboratory: Pluses, Perils, and Pitfalls. J. Clin. Microbiol. 2007;45:2761–2764. doi: 10.1128/JCM.01228-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zauli D.A.G. PCR and Infectious Diseases. Synth. Biol.-New Interdiscip. Sci. 2019:137–147. doi: 10.5772/intechopen.85630. [DOI] [Google Scholar]
- 4.Church D.L., Cerutti L., Gürtler A., Griener T., Zelazny A., Emler S. Performance and Application of 16S RRNA Gene Cycle Sequencing for Routine Identification of Bacteria in the Clinical Microbiology Laboratory. Clin. Microbiol. Rev. 2020;33:e00053-19. doi: 10.1128/CMR.00053-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ni P.-X., Ding X., Zhang Y.-X., Yao X., Sun R.-X., Wang P., Gong Y.-P., Zhou J.-L., Li D.-F., Wu H.-L., et al. Rapid Detection and Identification of Infectious Pathogens Based on High-Throughput Sequencing. Chin. Med. J. (Engl.) 2015;128:877–883. doi: 10.4103/0366-6999.154281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rodriguez-Saona L.E., Khambaty F.M., Fry F.S., Calvey E.M. Rapid Detection and Identification of Bacterial Strains by Fourier Transform Near-Infrared Spectroscopy. J. Agric. Food Chem. 2001;49:574–579. doi: 10.1021/jf000776j. [DOI] [PubMed] [Google Scholar]
- 7.Ho C.-S., Jean N., Hogan C.A., Blackmon L., Jeffrey S.S., Holodniy M., Banaei N., Saleh A.A.E., Ermon S., Dionne J. Rapid Identification of Pathogenic Bacteria Using Raman Spectroscopy and Deep Learning. Nat. Commun. 2019;10:4927. doi: 10.1038/s41467-019-12898-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Locke A., Fitzgerald S., Mahadevan-Jansen A. Advances in Optical Detection of Human-Associated Pathogenic Bacteria. Molecules. 2020;25:5256. doi: 10.3390/molecules25225256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Flores-Mireles A.L., Walker J.N., Caparon M., Hultgren S.J. Urinary Tract Infections: Epidemiology, Mechanisms of Infection and Treatment Options. Nat. Rev. Microbiol. 2015;13:269–284. doi: 10.1038/nrmicro3432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.DeMarco M.L., Burnham C.-A.D. Diafiltration MALDI-TOF Mass Spectrometry Method for Culture-Independent Detection and Identification of Pathogens Directly from Urine Specimens. Am. J. Clin. Pathol. 2014;141:204–212. doi: 10.1309/AJCPQYW3B6JLKILC. [DOI] [PubMed] [Google Scholar]
- 11.Tracz D.M., McCorrister S.J., Chong P.M., Lee D.M., Corbett C.R., Westmacott G.R. A Simple Shotgun Proteomics Method for Rapid Bacterial Identification. J. Microbiol. Methods. 2013;94:54–57. doi: 10.1016/j.mimet.2013.04.008. [DOI] [PubMed] [Google Scholar]
- 12.Boulund F., Karlsson R., Gonzales-Siles L., Johnning A., Karami N., AL-Bayati O., Åhrén C., Moore E.R.B., Kristiansson E. Typing and Characterization of Bacteria Using Bottom-up Tandem Mass Spectrometry Proteomics. Mol. Cell. Proteom. 2017;16:1052–1063. doi: 10.1074/mcp.M116.061721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Alves G., Wang G., Ogurtsov A.Y., Drake S.K., Gucek M., Sacks D.B., Yu Y.-K. Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry. J. Am. Soc. Mass Spectrom. 2018;29:1721–1737. doi: 10.1007/s13361-018-1986-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Alves G., Wang G., Ogurtsov A.Y., Drake S.K., Gucek M., Suffredini A.F., Sacks D.B., Yu Y.-K. Identification of Microorganisms by High Resolution Tandem Mass Spectrometry with Accurate Statistical Significance. J. Am. Soc. Mass Spectrom. 2016;27:194–210. doi: 10.1007/s13361-015-1271-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Karlsson R., Thorsell A., Gomila M., Salvà-Serra F., Jakobsson H.E., Gonzales-Siles L., Jaén-Luchoro D., Skovbjerg S., Fuchs J., Karlsson A., et al. Discovery of Species-Unique Peptide Biomarkers of Bacterial Pathogens by Tandem Mass Spectrometry-Based Proteotyping *. Mol. Cell. Proteom. 2020;19:518–528. doi: 10.1074/mcp.RA119.001667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ferreira L., Sanchez-Juanes F., Gonzalez-Avila M., Cembrero-Fucinos D., Herrero-Hernandez A., Gonzalez-Buitrago J.M., Munoz-Bellido J.L. Direct Identification of Urinary Tract Pathogens from Urine Samples by Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry. J. Clin. Microbiol. 2010;48:2110–2115. doi: 10.1128/JCM.02215-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Pinault L., Chabrière E., Raoult D., Fenollar F. Direct Identification of Pathogens in Urine by Use of a Specific Matrix-Assisted Laser Desorption Ionization–Time of Flight Spectrum Database. J. Clin. Microbiol. 2019;57:e01678-18. doi: 10.1128/JCM.01678-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Li W., Sun E., Wang Y., Pan H., Zhang Y., Li Y., Zhang X., Li C., Du L., Wang C. Rapid Identification and Antimicrobial Susceptibility Testing for Urinary Tract Pathogens by Direct Analysis of Urine Samples Using a MALDI-TOF MS-Based Combined Protocol. Front. Microbiol. 2019;10:1182. doi: 10.3389/fmicb.2019.01182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Huang B., Zhang L., Zhang W., Liao K., Zhang S., Zhang Z., Ma X., Chen J., Zhang X., Qu P., et al. Direct Detection and Identification of Bacterial Pathogens from Urine with Optimized Specimen Processing and Enhanced Testing Algorithm. J. Clin. Microbiol. 2017;55:8. doi: 10.1128/JCM.02549-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Köhling H.L., Bittner A., Müller K.-D., Buer J., Becker M., Rübben H., Rettenmeier A.W., Mosel F. Direct Identification of Bacteria in Urine Samples by Matrix-Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry and Relevance of Defensins as Interfering Factors. J. Med. Microbiol. 2012;61:339–344. doi: 10.1099/jmm.0.032284-0. [DOI] [PubMed] [Google Scholar]
- 21.Spanu T., Posteraro B., Fiori B., D’Inzeo T., Campoli S., Ruggeri A., Tumbarello M., Canu G., Trecarichi E.M., Parisi G., et al. Direct MALDI-TOF Mass Spectrometry Assay of Blood Culture Broths for Rapid Identification of Candida Species Causing Bloodstream Infections: An Observational Study in Two Large Microbiology Laboratories. J. Clin. Microbiol. 2012;50:176–179. doi: 10.1128/JCM.05742-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Leli C., Cenci E., Cardaccia A., Moretti A., D’Alò F., Pagliochini R., Barcaccia M., Farinelli S., Vento S., Bistoni F., et al. Rapid Identification of Bacterial and Fungal Pathogens from Positive Blood Cultures by MALDI-TOF MS. Int. J. Med. Microbiol. 2013;303:205–209. doi: 10.1016/j.ijmm.2013.03.002. [DOI] [PubMed] [Google Scholar]
- 23.Idelevich E.A., Schüle I., Grünastel B., Wüllenweber J., Peters G., Becker K. Rapid Identification of Microorganisms from Positive Blood Cultures by MALDI-TOF Mass Spectrometry Subsequent to Very Short-Term Incubation on Solid Medium. Clin. Microbiol. Infect. 2014;20:1001–1006. doi: 10.1111/1469-0691.12640. [DOI] [PubMed] [Google Scholar]
- 24.Roux-Dalvai F., Gotti C., Leclercq M., Hélie M.-C., Boissinot M., Arrey T.N., Dauly C., Fournier F., Kelly I., Marcoux J., et al. Fast and Accurate Bacterial Species Identification in Urine Specimens Using LC-MS/MS Mass Spectrometry and Machine Learning. Mol. Cell. Proteom. 2019;18:2492–2505. doi: 10.1074/mcp.TIR119.001559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yu Y., Sikorski P., Bowman-Gholston C., Cacciabeve N., Nelson K.E., Pieper R. Diagnosing Inflammation and Infection in the Urinary System via Proteomics. J. Transl. Med. 2015;13:111. doi: 10.1186/s12967-015-0475-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wilson S.R., Vehus T., Berg H.S., Lundanes E. Nano-LC in Proteomics: Recent Advances and Approaches. [(accessed on 6 June 2022)];Bioanalysis. 2015 7:1799–1815. doi: 10.4155/bio.15.92. Available online: https://www.future-science.com/doi/abs/10.4155/bio.15.92. [DOI] [PubMed] [Google Scholar]
- 27.Maus A., Mignon R., Basile F. Enhanced Protein Identification Using Graphite-Modified MALDI Plates for Offline LC-MALDI-MS/MS Bottom-up Proteomics. Anal. Biochem. 2018;545:31. doi: 10.1016/j.ab.2018.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cindric M., Pavelic K., Dodig I. Mass Spectrometry-Based Protein Identification Method with Selectiven-Terminus Dervatization. 8,647,880. U.S. Patent. 2014 February 11;
- 29.Butorac A., Dodig I., Bačun-Družina V., Tishbee A., Mrvčić J., Hock K., Diminić J., Cindrić M. The Effect of Starvation Stress on Lactobacillus Brevis L62 Protein Profile Determined by de Novo Sequencing in Positive and Negative Mass Spectrometry Ion Mode: CAF−/CAF+ Peptide de novo Sequencing. Rapid Commun. Mass Spectrom. 2013;27:1045–1054. doi: 10.1002/rcm.6528. [DOI] [PubMed] [Google Scholar]
- 30.Hassell K.M., Stutzman J.R., McLuckey S.A. Gas-Phase Bioconjugation of Peptides via Ion/Ion Charge Inversion: Schiff Base Formation on the Conversion of Cations to Anions. Anal. Chem. 2010;82:1594–1597. doi: 10.1021/ac902732v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic Local Alignment Search Tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 32.Chen C., Li Z., Huang H., Suzek B.E., Wu C.H. A Fast Peptide Match Service for UniProt Knowledgebase. Bioinformatics. 2013;29:2808–2809. doi: 10.1093/bioinformatics/btt484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Aspevall O., Hallander H., Gant V., Kouri T. European Guidelines for Urinalysis: A Collaborative Document Produced by European Clinical Microbiologists and Clinical Chemists under ECLM in Collaboration with ESCMID. Clin. Microbiol. Infect. 2001;7:173–178. doi: 10.1046/j.1198-743x.2001.00237.x. [DOI] [PubMed] [Google Scholar]
- 34.Zhang H., Cisse M., Dauphin Y.N., Lopez-Paz D. Mixup: Beyond Empirical Risk Minimization. arXiv. 20181710.09412 [Google Scholar]
- 35.Karlsson R., Davidson M., Svensson-Stadler L., Karlsson A., Olesen K., Carlsohn E., Moore E.R.B. Strain-Level Typing and Identification of Bacteria Using Mass Spectrometry-Based Proteomics. J. Proteome Res. 2012;11:2710–2720. doi: 10.1021/pr2010633. [DOI] [PubMed] [Google Scholar]
- 36.Karlsson R., Gonzales-Siles L., Gomila M., Busquets A., Salvà-Serra F., Jaén-Luchoro D., Jakobsson H.E., Karlsson A., Boulund F., Kristiansson E., et al. Proteotyping Bacteria: Characterization, Differentiation and Identification of Pneumococcus and Other Species within the Mitis Group of the Genus Streptococcus by Tandem Mass Spectrometry Proteomics. PLoS ONE. 2018;13:e0208804. doi: 10.1371/journal.pone.0208804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Manadas B., Mendes V.M., English J., Dunn M.J. Peptide Fractionation in Proteomics Approaches. Expert Rev. Proteom. 2010;7:655–663. doi: 10.1586/epr.10.46. [DOI] [PubMed] [Google Scholar]
- 38.Cheng G., Zheng S.-Y. Construction of a High-Performance Magnetic Enzyme Nanosystem for Rapid Tryptic Digestion. Sci. Rep. 2014;4:6947. doi: 10.1038/srep06947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dycka F., Bobal P., Mazanec K., Bobalova J. Rapid and Efficient Protein Enzymatic Digestion: An Experimental Comparison. Electrophoresis. 2012;33:288–295. doi: 10.1002/elps.201100123. [DOI] [PubMed] [Google Scholar]
- 40.Brisse S., Verhoef J. Phylogenetic Diversity of Klebsiella Pneumoniae and Klebsiella Oxytoca Clinical Isolates Revealed by Randomly Amplified Polymorphic DNA, GyrA and ParC Genes Sequencing and Automated Ribotyping. Int. J. Syst. Evol. Microbiol. 2001;51:915–924. doi: 10.1099/00207713-51-3-915. [DOI] [PubMed] [Google Scholar]
- 41.Kumar V., Sun P., Vamathevan J., Li Y., Ingraham K., Palmer L., Huang J., Brown J.R. Comparative Genomics of Klebsiella Pneumoniae Strains with Different Antibiotic Resistance Profiles. Antimicrob. Agents Chemother. 2011;55:4267–4276. doi: 10.1128/AAC.00052-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Tran N.H., Zhang X., Xin L., Shan B., Li M. De Novo Peptide Sequencing by Deep Learning. Proc. Natl. Acad. Sci. USA. 2017;114:8247–8252. doi: 10.1073/pnas.1705691114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Allmer J. Algorithms for the de Novo Sequencing of Peptides from Tandem Mass Spectra. Expert Rev. Proteom. 2011;8:645–657. doi: 10.1586/epr.11.54. [DOI] [PubMed] [Google Scholar]
- 44.Devabhaktuni A., Elias J.E. Application of de Novo Sequencing to Large-Scale Complex Proteomics Data Sets. J. Proteome Res. 2016;15:732–742. doi: 10.1021/acs.jproteome.5b00861. [DOI] [PubMed] [Google Scholar]
- 45.Cindric M., Hamersak Z., Dodig I. Method of Detection of Amino Acid Sequence and/or Identification of Peptides and Proteins, by Use of a New Derivatization Reagent and Synthesis of 5-Formyl-Benzene-1,3-Disulphonic Acid as Derivatization Reagent. 9,581,601. U.S. Patent. 2017 February 28;
- 46.Yu Y., Pieper R. Urinary Pellet Sample Preparation for Shotgun Proteomic Analysis of Microbial Infection and Host-Pathogen Interactions. Methods Mol. Biol. 2015;1295:65–74. doi: 10.1007/978-1-4939-2550-6_6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are provided in the Supplementary Materials or are available on request.