Abstract
Due to growing concern about organic micropollutants and their transformation products (TP) in surface and drinking water, reliable identification of unknowns is required. Here, we demonstrate how non-target liquid chromatography (LC)-high-resolution tandem mass spectrometry (MS/MS) and the feature-based molecular networking (FBMN) workflow provide insight into water samples from four riverbank filtration sites with different redox conditions. First, FBMN prioritized and connected drinking water relevant and seasonally dependent compounds based on a modification-aware MS/MS cosine similarity. Within the resulting molecular networks, forty-three compounds were annotated. Here, carbamazepine, sartans, and their respective TP were investigated exemplarily. With chromatographic information and spectral similarity, four additional TP (dealkylated valsartan, dealkylated irbesartan, two oxygenated irbesartan isomers) and olmesartan were identified and partly verified with an authentic standard. In this study, sartans and TP were investigated and grouped regarding their removal behavior under different redox conditions and seasons for the first time. Antihypertensives were grouped into compounds being well removed during riverbank filtration, those primarily removed under anoxic conditions, and rather persistent compounds. Observed seasonal variations were mainly limited to varying river water concentrations. FBMN is a powerful tool for identifying previously unknown or unexpected compounds and their TP in water samples by non-target analysis.
Graphical abstract
Supplementary Information
The online version contains supplementary material available at 10.1007/s00216-021-03500-7.
Keywords: Identification of unknowns, Molecular networking, Transformation products, Tandem mass spectrometry, Environmental analysis
Introduction
Due to rising concern about the increasing number of chemical compounds present in our surface waters, the identification of organic micropollutants (OMP) and their transformation products (TP) is necessary [1]. Non-target screening theoretically allows researchers to detect hundreds of chemical compounds in a single analytical run by chromatography coupled to mass spectrometry (MS). However, identifying unknown OMP in environmental water samples is challenging and requires reliable data evaluation [2]. Initially, non-target screening does not require reference standards as compound annotation usually relies on public spectral reference libraries or in silico prediction. However, to turn an annotation into an identification, orthogonal identifiers (e.g., retention time or fragment spectra) are commonly compared to reference standards [3]. Generally, several approaches for the subsequent steps of non-target screening, namely feature detection (e.g., MZmine, enviMass), statistical analysis/prioritization (e.g., MetaboAnalyst, Matlab), and compound annotation/identification (e.g., GNPS, FOR-IDENT, MassBank) in data evaluation of large MS datasets in this context have been made. Prioritization of features (defined by mass-to-charge ratio (m/z), retention time, and intensity) offers the possibility to focus on a predefined compound group (e.g., formation during a process [4]) and therefore enables a pre-selection of features of interest. A subsequent spectral database search allows the annotation of compounds [2]. Notably, the outcome of this approach is dependent on the availability of reference spectra and therefore falls short for many TP [5]. As a solution to similar challenges, molecular networking (MN) was introduced in 2012 [6] and made publicly available through the Global Natural Products Social Molecular Networking (GNPS) web-platform (http://gnps.ucsd.edu) [7]. The recently published MN protocol provides an introduction to the topic [8]. MN creates networks of nodes, i.e., fragmentation spectra (MS/MS), which are connected based on a pairwise spectral alignment and similarity scoring of all experimental MS/MS spectra in a study, which are often acquired in data-dependent fragmentation [8]. In a second step, nodes are annotated by matching against the public GNPS spectral libraries. This workflow was applied to different non-target MS studies in various fields, including ocean water [9], agricultural [10], forensic [11], and biomedical research [12]. Despite growing MS/MS spectral libraries, annotation rates of only 5% of all experimental MS/MS spectra are common [8]. Nevertheless, many novel compounds were identified with this approach [13–15] and annotations can be propagated throughout a molecular network [16]. The GNPS platform combines multiple tools to analyze and enrich annotations in molecular networks [8, 17, 18]. Compared to classical molecular networking, where all MS/MS spectra are clustered with no regard to retention time, feature-based molecular networking (FBMN) uses only MS/MS spectra that originate from features with a chromatographic peak shape and therefore also considers retention times and peak areas. FBMN enables the identification of isobaric isomers and a more precise statistical evaluation of datasets [19]. Nevertheless, FBMN has so far mainly been used in natural product research and has not been extended towards surface and drinking water TP identification which is shown in this study. In environmental analysis, it has been used to identify TP of anthropogenic source in ocean water [20].
Riverbank filtration (RBF) is a long-used and efficient natural clean-up technique for the effective removal of OMP, bacteria, and other unwanted substances from raw water used for drinking water production [21]. OMP are transported in the groundwater and either adsorbed, transformed, or degraded during RBF and, therefore, specifically either removed or converted to other chemical structures of unknown risk. RBF performance for OMP removal and degradation is dependent on a variety of factors, such as redox conditions, travel time and distance, hydraulic conductivity, or initial OMP concentration [22–25]. Although the behavior during RBF is well studied for many compounds, e.g., commonly used pesticides [26, 27] and pharmaceuticals [28, 29], various TP still lack data on the removal during RBF and suitable removal conditions.
Therefore, we applied FBMN on a set of seasonal RBF samples from four sites at two rivers in Germany to identify unknown compounds and possible TP with non-target liquid chromatography (LC)-MS analysis. After a spectral database search, TP were identified by their spectral similarity to reference standards, data from the literature, and manual spectral evaluation. Furthermore, the identified compounds were statistically analyzed regarding their seasonal occurrence and behavior at the different sites under different redox conditions and travel distances. Therefore, the aims of this study were (1) to implement the application of FBMN on environmental samples and (2) to broaden the knowledge on OMP and TP behavior during RBF.
Materials and methods
Sample material and study sites
Four RBF sites in Germany, two at the Ems river and two at the Ruhr river, were investigated (see Supplementary Information (ESM) Figure S1). These sites were previously characterized and chemically analyzed by target analysis [25]. Grab water samples were taken from a transect comprising of river water (R), three groundwater wells (B1-3), and the abstraction well (W) at each RBF site at three different sampling times each: summer 2017 (Su, low discharge), fall 2017 (Fa, medium discharge), and spring 2018 (Sp, low discharge). The Ems river sites (Ea and Eb) consist of anoxic silty sand aquifers with medium conductivity (10−4 m/s) and travel distances of 633 and 89 m, respectively. Oxic gravel aquifers characterize the Ruhr river sites (Ra and Rb) with high conductivity (10−2 m/s) and travel distances of 72 and 42 m, respectively. The Ems river is predominantly influenced by agricultural activity and wastewater treatment plants (WWTP), whereas the Ruhr river is mainly affected by WWTP, industry, and urban runoff but less agricultural activity. The proportion of treated wastewater (TWW) at mean minimum discharge reaches 50–100% at both rivers, being higher in Ems river [30]. Detailed information on the fraction of treated wastewater in both rivers and effects of discharge can be found in our previous study Oberleitner et al., 2020 [25].
Analysis of organic micropollutants
The detailed method is described in Oberleitner et al., 2020 [25]. In brief, the samples were acidified to 0.1% formic acid (99.8%, FA from Merck, Germany) and centrifuged at 4000g for 10 min, and 490 μL was directly co-injected with 10 μL internal standard (d6-sulfadimethoxine (Dr. Ehrenstorfer), d6-diuron (Campro Scientific), d4-benzotriazole (Chiron AS)) to the HPLC system (Prominence UFLC, Shimadzu, Japan). A binary gradient consisting of acetonitrile (5% water, 0.05% formic acid) and water (0.05% formic acid) was applied on a NucleoShell RP 18plus (Macherey-Nagel, Germany) reversed-phase column and detection was achieved by a Bruker MaXis 3G qTOF tandem mass spectrometer equipped with an electrospray ionization source in positive ionization mode (m/z 50–1000). By injecting a solution of sodium formate directly into the ESI source before each measurement, mass recalibration was performed. Data-dependent fragmentation was applied, selecting the eight most intensive ions above 400 counts for fragmentation with 35 eV and active exclusion after five spectra for 0.15 min. Mass recalibration and semi-quantitative analysis were performed with DataAnalysis 4.2 (Bruker Daltonics, Germany). Analytical standards of valsartan, valsartan acid, irbesartan, candesartan, olmesartan, telmisartan, 10-hydroxycarbazepine, and 10,11-dihydro-10,11-dihydroxycarbamazepine were purchased from Neochema and Merck and diluted to 1000, 500, and 100 ng/L with 5% acetonitrile (0.1% formic acid) and measured with the same method as the samples.
Data processing and identification of components
Data files were converted to .mzML file format with CompassXport (Bruker Daltonics, Germany) and processed with MZmine 2.51 [31]. Detailed information on data processing can be derived from ESM Table S1. All data files were grouped by the attributes “Sample Type” (either “River” or “Well”) and “Season” (“Summer,” “Fall,” and “Spring”). Aligned feature lists were filtered to exclude features lacking fragmentation spectra or 13C isotope pattern as they would not match the later applied selection criteria due to signal intensity or number of fragments. The resulting lists were exported for FBMN on GNPS. Nevertheless, it cannot be excluded that the lists still contain e.g. adducts or in-source fragments as multiple signals from only one compound. Primary annotation was performed by matching experimental MS/MS spectra against the GNPS spectral database.
FBMN [19] was applied to the samples and features (nodes) were connected by a similarity score of their MS/MS spectra (cosine score ≥ 0.7 with at least 4 matched signals). Default settings given by GNPS for FBMN are cosine ≥ 0.7 and 6 matched signals [8]. The default settings are given for natural compounds of a possible higher mass than anthropogenic substances present in river water. The number of expectable fragments is lower, though, in smaller molecules. Therefore, the number of matched signals to enable connection of features was set to 4. Less than 4 signals would lead to probably more results, but also reduce the certainty of the annotated results. Regarding the cosine, the default settings were chosen since a lower cosine (< 0.7) would again lead to more connections but less certainty. Cosines above 0.7 can be individually judged later as they are given for each connection as the strength of the bond between two nodes.
Evaluation of networks
The resulting networks were exported to Cytoscape [32] and the depiction of networks was customized. The identified and unidentified nodes were visualized as squares and circles, respectively, with their size being dependent on their summed up feature area. The weight of the edges between the nodes was set according to their cosine similarity score and labeled with the m/z difference between the nodes. Area variations between the sample groups, either based on the attributes “Sample Type” or “Season,” were depicted in pie charts within the nodes.
Quality assurance
To assure the stability of the chromatography and the mass spectrometry, the abovementioned internal standards were used. Maximum time deviation of the internal standards ranged between 0.7 and 1.1% and signal intensity’s relative standard deviations ranged between 8.4 and 13.1% for the internal standards (ESM Table S2). Mass accuracy was <0.005 m/z or 10 Da. Between the measurement of each triplicate, laboratory blanks were included in the sequence. For the subsequent data evaluation, only 10 representative blank samples were considered. Features appearing in blanks in signal intensities higher than the tripled noise level were excluded from the subsequent feature lists of the samples.
Results and discussion
Creation and prioritization of networks
In total, 19,246 feature nodes containing MS/MS spectra were created for all samples (Fig. 1). Despite filtering 13C isotope patterns, these nodes include multiple ion adducts and in-source fragments for one compound and are therefore not equivalent to the number of detected compounds. In total, 507 networks were created containing at least two nodes out of which 375 consisted of exactly two nodes. The largest network contained 86 nodes. A more comprehensive view on the detected features concerning their total number and appearance in different seasons as well as a comparison of the various locations is described in Oberleitner et al., 2020b [33] (for a detailed view, use the URL provided at the end of this document to access the annotated network raw data).
Many nodes remained unconnected from any networks either because of a lack of similar compounds within the samples or insufficient quality of their corresponding MS/MS spectrum. A minimum of four signals is required for a similarity score. Therefore, most nodes form so-called singletons.
Considering only networks that contained nodes with library matches (squares), only 256 nodes remained (1.3% of all feature nodes). Similarly, most annotated nodes were singletons. The largest network contained 53 nodes with 128 connecting edges with annotations for dibutyl phthalate and diethyl phthalate. These are expected to derive from contamination during sampling and measurement from tubes containing phthalate plasticizers. These signals also occurred in blank samples with significantly lower signal intensity. Therefore, there might also be phthalate pollution in the samples derived from the river water. The same can be seen for triphenylphosphine oxide and triphenylphosphate although they did only form singletons and not larger networks. Each network can be rearranged and investigated individually. For further evaluation, two networks with a maximum size of ten nodes were chosen exemplarily. Additionally, networks that contained related annotations (e.g., other sartans) were added to the view of the respective network.
Carbamazepine and transformation products
The first identified network is depicted in Fig. 2. Three nodes were annotated as carbamazepine, 10-hydroxycarbazepine, and 10,11-dihydroxy-10,11-dihydro-carbamazepine. Carbamazepine was already determined within the samples in our previous study being mostly persistent during RBF [25]. 10-Hydroxycarbazepine has a high structural similarity to carbamazepine (ESM Fig. S2) and was only found within the river samples; hence, it is well removed during RBF. It is a metabolite of oxcarbazepine previously detected in surface waters but not detected in our study [34]. The seasonal distribution showed a typical distribution for wastewater-derived micropollutants: The highest intensities were detected in summer 2017 and spring 2018. These two sampling times showed the lowest sampled discharge (Fig. 2 in [25]), and therefore the water body consists of a high TWW proportion [30].
10,11-Dihydroxy-10,11-dihydro-carbamazepine is a metabolite of carbamazepine [35] known to be formed during wastewater treatment [36] with distribution within rivers and wells comparable to carbamazepine. Both TP have not yet been investigated concerning their behavior during RBF before. To verify their identification, 10-hydroxycarbazepine and 10,11-dihydro-10,11-dihydroxycarbamazepine were measured as reference standards at three different concentrations (100, 500, and 1000 ng/L). The semi-quantitative analysis regarding the three calibration steps (exact mass, retention time, six to seven signals per peak) showed that river concentrations were below 100 ng/L for 10-hydroxycarbazepine and below 150 ng/L for 10,11-dihydroxy-10,11-dihydro-carbamazepine. Since 10,11-dihydroxy-10,11-dihydro-carbamazepine is only partly removed during RBF, it is relevant to drinking water production in these concentrations and should be observed during the subsequent treatment steps.
Sartans and their transformation products
The second investigated network (Fig. 3) contains nodes annotated as valsartan, telmisartan, candesartan, irbesartan, and their TP valsartan acid by the GNPS database due to their spectral similarity. Annotation was secured by a search for diagnostic fragments and later proven by reference standards. Sartans are a group of antihypertensive drugs with a high production volume (>100 t/a in Germany [37]) and structural similarities (ESM Figure S3), such as biphenyl, imidazole, or tetrazole groups.
Nodes A–E were not annotated during spectral library search but further investigated in this study and manually annotated as a valsartan 13C isotope, olmesartan, an oxygenated form of irbesartan IRB_442 and a structural isomer (C1), dealkylated valsartan, and dealkylated irbesartan, respectively. An identification level 2 was achieved for nodes A–E as described by the metabolomics standards initiative (MSI) and Schymanski et al. [2]. This means that A–E are probable structures annotated by diagnostic fragments. For a level 1 confirmed structure, a reference standard would be needed, often unavailable for transformation products. Compounds at 14.76 and 11.85 min were not annotated in this study and remain unidentified. Yet, they are very likely sartan-related compounds that show similar fragment spectra, but are not yet described in literature.
Identification of transformation products
Node A showed a spectral similarity to valsartan, the same retention time (15.49 min), and a mass difference of 1.002 Da. Therefore, it was identified as a valsartan 13C isotope that was missed during isotope filtering in MZmine.
With an accurate m/z of 447.212 and a retention time of 11.06 min, node B fitted the molecular formula C24H27N6O3+ (4 ppm tolerance). Literature research identified it as olmesartan [38] and a spectral mirror match (ESM Figure S4) confirmed the identification via the spectral library MassBank of North America (MoNA) [39], although the measured mass spectrum had a low intensity and only fragments between m/z 177 and 235.
Node C showed high similarity to irbesartan and a mass difference of +13.979 Da (tR = 14.14 min). Literature research towards TP of irbesartan resulted in an oxygenated form of irbesartan IRB_442 [40]. The annotation was confirmed by a spectral investigation (ESM Figure S5). The fragment at m/z 235 resulted from a loss of the imidazole structure, followed by a loss of N2 to a fragment at m/z 207. Fragment m/z 180 is the biphenyl-structure with a bonded CN part. Node C1 has the same parent mass and fragments as C but an earlier retention time (12.81 min). Therefore, we annotated it as a structural isomer of IRB_442_C as found before by Boix et al. [41].
Node D was annotated by the similarity to valsartan and the mass difference of −100.052 Da as a dealkylated valsartan which was previously described as a transformation product of valsartan [42]. It showed the same diagnostic fragments as IRB_442_C (ESM Figure S6).
Compound E showed a high spectral similarity to irbesartan and IRB_442 and a mass difference of −42.047 Da to irbesartan. Additional to fragments at m/z 207 and 180, fragments at m/z 344 and 153 (ESM Figure S7) were diagnostic for a dealkylated TP of irbesartan (-propylene) [41].
The retention behavior of the compounds was compared to different RP-HPLC-MS studies [38, 41–43] and, except for candesartan, the elution order of the sartans was conclusive (ESM Figure S8). Candesartan eluted between irbesartan and valsartan, but in literature, it was either found to elute before [38] or after [43] both compounds depending on the applied column, as mobile phases were the same in both studies (water and methanol with 0.1% FA). Therefore, we assumed that candesartan elution was very sensitive to the chosen column and chromatographic conditions, e.g., the chosen eluents.
The identification via diagnostic fragments and literature data leads to an identification level of two (probable structure) for the compounds A–E [2]. Retention behavior was conclusive and the presence of the sartans makes the occurrence of their TP probable [2]. The identification of valsartan, valsartan acid, irbesartan, candesartan, olmesartan, and telmisartan was verified by reference standards to level 1. The semi-quantitative analysis showed concentrations for all sample types of <100 ng/L for irbesartan and telmisartan, mean concentrations <100 ng/L (maximum 200–300 ng/L) for candesartan and valsartan, mean concentrations of 400 ng/L (maximum ~425 ng/L) for olmesartan, and mean concentrations of 200–300 ng/L (maximum 600–700 ng/L) for valsartan acid. For valsartan, concentrations between 0.5 and 150 ng/L have been reported in British rivers before [44]. For Bavarian river catchments, mean concentrations in WWTP effluents of 460 ng/L for candesartan, 1250 ng/L for irbesartan, 740 ng/L for olmesartan, 680 ng/L for telmisartan, and 1100 ng/L for valsartan are known [38]. Considering the dilution of TWW in the Ems river and Ruhr river at different discharge scenarios and a different proportion of prescribed sartans per region, this results in comparable river water concentrations as mentioned for WWTP effluents. In the Ems river, telmisartan and irbesartan were frequently detected in a round robin test, as well as olmesartan, telmisartan, and irbesartan in the Ruhr river, which is in line with our frequent detection of the compounds [45]. Nödler et al. (2013) found a maximum concentration of 2119 ng/L (Creek Gonna) for valsartan acid (median 65 ng/L of 13 surface water samples across Germany) in surface water [46], which is a larger range than our results (<LOD - 700 ng/L) indicating a medium wastewater impact for the Ruhr and Ems rivers compared to other German rivers. This is in line with Karakurt et al., calculating a TWW proportion of around 50–100% at low river discharge and far less than 50% at medium discharge [30]. Valsartan acid was also suggested as a possible wastewater indicator alongside carbamazepine as it showed persistence and a correlation to carbamazepine concentration in surface water. It is concluded that the identified sartans at these concentration levels are of particular interest concerning drinking water production from riverbank filtrate. They should be included in routine analysis more frequently.
Seasonal occurrence and elimination during riverbank filtration under different redox conditions
Valsartan, telmisartan, and compounds A, C1, and D and the unspecified compound at 14.76 min were almost exclusively found in the river samples and are therefore not relevant for drinking water production. Valsartan acid, irbesartan, candesartan, and compounds B, C, and E and an unspecified compound at 11.85 min were also found in the well samples and showed a typical seasonal distribution of wastewater-derived OMP [47].
Intensities for each of the sartans were normalized to the corresponding maximum intensity across all samples from the different locations, which was set to 100%. Matrix effects were not investigated in this study but in regard to our previous study (Oberleitner et al., 2020 [25]), major matrix effects influencing the distribution of the analytes were not expected. The resulting distribution of the sartans and their TP is depicted in Fig. 4.
Compounds IRB_442_C1, telmisartan, valsartan, and dealkylated valsartan are depicted as gray-hatched bars. Maximum intensities for these compounds were found in the Ems river at site Ea. At both Ems river sites, intensities decreased in fall due to dilution by precipitation (river discharge ratio summer:fall ~ 1:2) [48]. At the Ruhr river, only telmisartan and dealkylated valsartan show decreased intensities in fall, whereas IRB_442_C1 and valsartan show increased abundances in fall. A possible explanation could be increased photodegradation in spring and summer [46, 47]. However, possible photodegradation was also reported for telmisartan [49], candesartan [50], and irbesartan [51] which show a different seasonal distribution. The four compounds were found in all river samples but were immediately eliminated after entering the hyporheic zone and therefore not or only in small intensities found in the first groundwater wells near the bank (B1). In our study, the elimination of these compounds was not dependent on redox conditions which was reported for valsartan [52]. Valsartan was also previously found to be persistent during RBF to some extent, which is conclusive with our findings [52, 53].
Irbesartan, dealkylated irbesartan, and IRB_442_C are depicted in shades of blue and were found in all river samples in comparable intensities, except for fall river samples of the Ems river. Here, the before-mentioned dilution is expected to be responsible. Notably, these compounds were found in the well samples (B1-W) of the Ruhr river sites in comparable or slightly decreased intensities in contrast to their presence only in small intensities in the wells of the Ems river sites. Since the Ems river sites are both anoxic, it is assumed that the elimination of irbesartan and its TP is increased under anoxic conditions. In contrast, irbesartan was previously found to be persistent to some extent independent from the prevailing redox conditions [52]. TP IRB_442_C is not found in the remote wells (B3 and W) of site Ra, which possibly derives from the longer travel distance (66 m) compared to site Rb (38 m).
Valsartan acid, olmesartan, and candesartan are shaded in green and found in all river samples. Decreased intensities are again found in fall. During RBF, olmesartan and candesartan are partially removed at sites Ea and Ra but not significantly at sites Eb and Rb. This coincides with the respective longer travel distance for both redox conditions. Both have previously been described to be persistent during RBF [54, 55]. Although valsartan acid shows varying intensities in the wells at site Rb, intensities tend to decrease during RBF at the oxic Ruhr river sites and the more anoxic site Ea. Contrary to literature data [52, 53] where valsartan acid was found to be persistent independent from redox conditions, in our study, valsartan acid is partially removed under rather oxic conditions.
To the best of our knowledge, this is the first report on RBF behavior for the TP IRB_442_C and C1, dealkylated valsartan, and dealkylated irbesartan.
Further identified compounds
As a result of the application of FBMN on surface and groundwater samples from two river systems, a list of 43 components (ESM Table S3, including the before-mentioned micropollutants) was annotated by matches with the GNPS spectral libraries of which five were known targets (metoprolol, diclofenac, oxazepam, caffeine, and carbamazepine) from our previous study [25]. Components that were only present in the river samples, but not in the abstraction wells, are ß-carboline-1-propionic acid (probably derived from plants [56]), 10-hydroxycarbazepine, phenylalanine-leucin (dipeptide), and flufenacet (herbicide). These components are therefore well removed during RBF at all sites. Flufenacet was reported to be eliminated during RBF up to 60% [57]. Flufenacet was only found at the Ems river as well as flurtamone (herbicide) and 7-chloro-3-methylquinoline-8-carboxylic acid (quinmerac, herbicide), which were also found in the abstraction wells of the Ems river sites. These micropollutants are therefore site-specific for the Ems river. Further pharmaceuticals (e.g., sitagliptin, amisulpride, and clindamycin), biocides (e.g., terbutryn), and metabolites (e.g., 2-hydroxyibuprofen and desethylterbutylazine) were common components for all sites and seasons. Natural organic compounds (e.g., l-tryptophane and oleoylserotonin) were found among the identified features. Some of these compounds were previously found to be persistent during RBF to some extent (compare [58–61]). Diclofenac and tris(3-chloropropyl) phosphate were identified as the 37Cl isotope of their monoisotopic masses, emphasizing the importance of collecting MS/MS spectra of molecules containing two or more chlorine atoms. Due to the settings in MZmine, the most abundant isotope was selected for the feature list. In molecules containing two or more chlorine atoms, this mass is different from the monoisotopic mass.
Conclusions
FBMN is a powerful tool to promote the identification of unknowns in environmental water samples at low concentrations. With spectral database search, a total of 43 compounds was annotated. Additionally, in comparison to other identification tools, FBMN enables the annotation of unknown species with spectral similarity search based on known annotated compounds present in the sample. Therefore, it broadens the annotation of unknowns in non-target analysis towards (unknown) transformation products at low concentrations that cannot be identified by database search due to lacking spectral data. In this case, several transformation products of sartans were annotated by their spectral similarity to their parent compounds although a spectral database search did not lead to any hits. Still, it has to be considered that thousands of features present in the samples could not yet be annotated due to lack of intensity or fragment spectra and the fact that the GNPS database is still focused on natural compounds. The number of annotated compounds could eventually increase by extending the database towards anthropogenic pollutants.
Another advantage over other molecular networking tools is the utilization of chromatographic data enabling a plausibility check of annotated features to their retention time which in this case confirmed the identification of sartan TP.
Supplementary Information
Acknowledgements
The authors would like to thank the waterworks which provided the samples.
Author contribution
Daniela Oberleitner: methodology, software, validation, formal analysis, investigation, writing—original draft, visualization. Robin Schmid: methodology, validation, formal analysis, investigation, writing—original draft, visualization. Wolfgang Schulz: methodology, resources, writing—review and editing, visualization, supervision. Axel Bergmann: conceptualization, resources, writing—review and editing, supervision. Christine Achten: conceptualization, resources, writing—review and editing, supervision.
Funding
Open Access funding enabled and organized by Projekt DEAL. Robin Schmid would like to acknowledge the Verband der Chemischen Industrie e.V. (VCI) for a Chemical Industry Fund fellowship and financial support. The other authors received no financial support.
Data availability
The annotated network raw data can be accessed through https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=a35c2a4e0408402389c48d78ab692b7a. Please note that the raw data has later been cleared of double hits and obvious false positives in the subsequent Cytoscape visualization.
Code availability
Not applicable.
Declarations
Conflict of interest
The authors declare no competing interests.
Footnotes
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Ahmed AKA, Marhaba TF. Review on river bank filtration as an in situ water treatment process. Clean Techn Environ Policy. 2017;19(2):349–359. doi: 10.1007/s10098-016-1266-0. [DOI] [Google Scholar]
- 2.Schymanski EL, Singer HP, Slobodnik J, et al. Non-target screening with high-resolution mass spectrometry: critical review using a collaborative trial on water analysis. Anal Bioanal Chem. 2015;407(21):6237–6255. doi: 10.1007/s00216-015-8681-7. [DOI] [PubMed] [Google Scholar]
- 3.Gago-Ferrero P, Krettek A, Fischer S, Wiberg K, Ahrens L. Suspect screening and regulatory databases: a powerful combination to identify emerging micropollutants. Environ Sci Technol. 2018;52(12):6881–6894. doi: 10.1021/acs.est.7b06598. [DOI] [PubMed] [Google Scholar]
- 4.Bader T, Schulz W, Lucke T, Seitz W, Winzenbacher R. Application of non-target analysis with LC-HRMS for the monitoring of raw and potable water: strategy and results. In: Drewes JE, Letzel T, editors. Assessing transformation products of chemicals by non-target and suspect screening − strategies and workflows, vol. 2. Washington: American Chemical Society; 2016. p. 49–70.
- 5.Singer HP, Wössner AE, Mcardell CS, Fenner K. Rapid screening for exposure to “Non-Target” pharmaceuticals from wastewater effluents by combining HRMS-based suspect screening and exposure modeling. Environ Sci Technol. 2016;50(13):6698–6707. doi: 10.1021/acs.est.5b03332. [DOI] [PubMed] [Google Scholar]
- 6.Watrous J, Roach P, Alexandrov T, et al. Mass spectral molecular networking of living microbial colonies. Proc Natl Acad Sci U S A. 2012;109(26):E1743–E1752. doi: 10.1073/pnas.1203689109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wang M, Carver JJ, Phelan VV, et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat Biotechnol. 2016;34(8):828–837. doi: 10.1038/nbt.3597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Aron AT, Gentry EC, McPhail KL, et al. Reproducible molecular networking of untargeted mass spectrometry data using GNPS. Nat Protoc. 2020;15:1954–1991. 10.1038/s41596-020-0317-5. [DOI] [PubMed]
- 9.Petras D, Koester I, da Silva R, et al. High-resolution liquid chromatography tandem mass spectrometry enables large scale molecular characterization of dissolved organic matter. Front Mar Sci. 2017;4:3389. doi: 10.3389/fmars.2017.00405. [DOI] [Google Scholar]
- 10.Floros DJ, Petras D, Kapono CA, et al. Mass spectrometry based molecular 3D-cartography of plant metabolites. Front Plant Sci. 2017;8:429. doi: 10.3389/fpls.2017.00429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bouslimani A, Melnik AV, Xu Z, et al. Lifestyle chemistries from phones for individual profiling. Proc Natl Acad Sci U S A. 2016;113(48):E7645–E7654. doi: 10.1073/pnas.1610019113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Quinn RA, Comstock W, Zhang T, et al. Niche partitioning of a pathogenic microbiome driven by chemical gradients. Sci Adv. 2018;4(9):eaau1908. doi: 10.1126/sciadv.aau1908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Raheem DJ, Tawfike AF, Abdelmohsen UR, Edrada-Ebel R, Fitzsimmons-Thoss V. Application of metabolomics and molecular networking in investigating the chemical profile and antitrypanosomal activity of British bluebells (Hyacinthoides non-scripta) Sci Rep. 2019;9(1):2547. doi: 10.1038/s41598-019-38940-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Fox Ramos AE, Evanno L, Poupon E, Champy P, Beniddir MA. Natural products targeting strategies involving molecular networking: different manners, one goal. Nat Prod Rep. 2019;36(7):960–980. doi: 10.1039/c9np00006b. [DOI] [PubMed] [Google Scholar]
- 15.Kalinski J-CJ, Waterworth SC, Noundou XS, et al. Molecular networking reveals two distinct chemotypes in pyrroloiminoquinone-producing Tsitsikamma favus sponges. Mar Drugs. 2019;17(1). 10.3390/md17010060. [DOI] [PMC free article] [PubMed]
- 16.Da Silva RR, Wang M, Nothias L-F, et al. Propagating annotations of molecular networks using in silico fragmentation. PLoS Comput Biol. 2018;14(4):e1006089. doi: 10.1371/journal.pcbi.1006089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Tripathi A, Vázquez-Baeza Y, Gauglitz JM, et al. Chemically informed analyses of metabolomics mass spectrometry data with Qemistree. Nat Chem Biol. 2021;17(2):146–151. doi: 10.1038/s41589-020-00677-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ernst M, Kang KB, Caraballo-Rodríguez AM, et al. MolNetEnhancer: enhanced molecular networks by integrating metabolome mining and annotation tools. Metabolites. 2019;9(7). 10.3390/metabo9070144. [DOI] [PMC free article] [PubMed]
- 19.Nothias LF, Petras D, Schmid R, et al. Feature-based molecular networking in the GNPS analysis environment. Nat Methods 2020;17:905–8. 10.1038/s41592-020-0933-6. [DOI] [PMC free article] [PubMed]
- 20.Petras D, Minich JJ, Cancelada LB, et al. Non-targeted tandem mass spectrometry enables the visualization of organic matter chemotype shifts in coastal seawater. Chemosphere. 2021;271:129450. doi: 10.1016/j.chemosphere.2020.129450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Huntscha S, Rodriguez Velosa DM, Schroth MH, Hollender J. Degradation of polar organic micropollutants during riverbank filtration: complementary results from spatiotemporal sampling and push-pull tests. Environ Sci Technol. 2013;47(20):11512–11521. doi: 10.1021/es401802z. [DOI] [PubMed] [Google Scholar]
- 22.Burke V, Greskowiak J, Asmuß T, Bremermann R, Taute T, Massmann G. Temperature dependent redox zonation and attenuation of wastewater-derived organic micropollutants in the hyporheic zone. Sci Total Environ. 2014;482-483:53–61. doi: 10.1016/j.scitotenv.2014.02.098. [DOI] [PubMed] [Google Scholar]
- 23.Burke V, Richter D, Hass U, Duennbier U, Greskowiak J, Massmann G. Redox-dependent removal of 27 organic trace pollutants: compilation of results from tank aeration experiments. Environ Earth Sci. 2014;71(8):3685–3695. doi: 10.1007/s12665-013-2762-8. [DOI] [Google Scholar]
- 24.Storck FR, Schmidt CK, Lange FT, Henson JW, Hahn K. Factors controlling micropollutant removal during riverbank filtration. J Am Water Works Assoc. 2012;104(12):E643–E652. doi: 10.5942/jawwa.2012.104.0147. [DOI] [Google Scholar]
- 25.Oberleitner D, Schulz W, Bergmann A, Achten C. Impact of seasonality, redox conditions, travel distances and initial concentrations on micropollutant removal during riverbank filtration at four sites. Chemosphere. 2020;250:126255. doi: 10.1016/j.chemosphere.2020.126255. [DOI] [PubMed] [Google Scholar]
- 26.Sjerps RMA, Kooij PJF, van Loon A, van Wezel AP. Occurrence of pesticides in Dutch drinking water sources. Chemosphere. 2019;235:510–518. doi: 10.1016/j.chemosphere.2019.06.207. [DOI] [PubMed] [Google Scholar]
- 27.Schipper PNM, Vissers MJM, van der Linden AMA. Pesticides in groundwater and drinking water wells: overview of the situation in the Netherlands. Water Sci Technol. 2008;57(8):1277–1286. doi: 10.2166/wst.2008.255. [DOI] [PubMed] [Google Scholar]
- 28.van Driezum IH, Derx J, Oudega TJ, et al. Spatiotemporal resolved sampling for the interpretation of micropollutant removal during riverbank filtration. Sci Total Environ. 2019;649:212–223. doi: 10.1016/j.scitotenv.2018.08.300. [DOI] [PubMed] [Google Scholar]
- 29.Bradley PM, Barber LB, Duris JW, et al. Riverbank filtration potential of pharmaceuticals in a wastewater-impacted stream. Environ Pollut. 2014;193:173–180. doi: 10.1016/j.envpol.2014.06.028. [DOI] [PubMed] [Google Scholar]
- 30.Karakurt S, Schmid L, Hübner U, Drewes JE. Dynamics of wastewater effluent contributions in streams and impacts on drinking water supply via riverbank filtration in Germany - a national reconnaissance. Environ Sci Technol. 2019;53(11):6154–6161. doi: 10.1021/acs.est.8b07216. [DOI] [PubMed] [Google Scholar]
- 31.Pluskal T, Castillo S, Villar-Briones A, et al. MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinformatics 2010;11:395. 10.1186/1471-2105-11-395. [DOI] [PMC free article] [PubMed]
- 32.Shannon P, Markiel A, Ozier O, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Oberleitner D, Stütz L, Schulz W, Bergmann A, Achten C. Seasonal performance assessment of four riverbank filtration sites by combined non-target and effect-directed analysis. Chemosphere. 2020;261:127706. doi: 10.1016/j.chemosphere.2020.127706. [DOI] [PubMed] [Google Scholar]
- 34.de Jongh CM, Kooij PJF, de Voogt P, Ter Laak TL. Screening and human health risk assessment of pharmaceuticals and their transformation products in Dutch surface waters and drinking water. Sci Total Environ. 2012;427-428:70–77. doi: 10.1016/j.scitotenv.2012.04.010. [DOI] [PubMed] [Google Scholar]
- 35.Jelic A, Michael I, Achilleos A, et al. Transformation products and reaction pathways of carbamazepine during photocatalytic and sonophotocatalytic treatment. J Hazard Mater. 2013;263(Pt 1):177–186. doi: 10.1016/j.jhazmat.2013.07.068. [DOI] [PubMed] [Google Scholar]
- 36.Leclercq M, Mathieu O, Gomez E, Casellas C, Fenet H, Hillaire-Buys D. Presence and fate of carbamazepine, oxcarbazepine, and seven of their metabolites at wastewater treatment plants. Arch Environ Contam Toxicol. 2008;56(3):408. doi: 10.1007/s00244-008-9202-x. [DOI] [PubMed] [Google Scholar]
- 37.Letzel M. Verbundprojekt RISK-IDENT: Bewertung bislang nicht identifizierter anthropogener Spurenstoffe sowie Handlungsstrategien zum Risikomanagement im aquatischen System. 2. Statusseminar der BMBF-Fördermaßnahme RiSKWa – Risikomenagement von neuen Schadstoffen und Krankheitserregern im Wasserkreislauf. 2013;28–29.
- 38.Bayer A, Asner R, Schüssler W, et al. Behavior of sartans (antihypertensive drugs) in wastewater treatment plants, their occurrence and risk for the aquatic environment. Environ Sci Pollut Res Int. 2014;21(18):10830–10839. doi: 10.1007/s11356-014-3060-z. [DOI] [PubMed] [Google Scholar]
- 39.MoNA - MassBank of North America. Olmesartan - Originally submitted to the Fiehn Lab HILIC Library. Submitter Megan Showalter. https://mona.fiehnlab.ucdavis.edu/spectra/display/FiehnHILIC002199Accessed 12 Dec 2019.
- 40.Letzel T, Bayer A, Schulz W, et al. LC-MS screening techniques for wastewater analysis and analytical data handling strategies: sartans and their transformation products as an example. Chemosphere. 2015;137:198–206. doi: 10.1016/j.chemosphere.2015.06.083. [DOI] [PubMed] [Google Scholar]
- 41.Boix C, Ibáñez M, Sancho JV, Parsons JR, de Voogt P, Hernández F. Biotransformation of pharmaceuticals in surface water and during waste water treatment: identification and occurrence of transformation products. J Hazard Mater. 2016;302:175–187. doi: 10.1016/j.jhazmat.2015.09.053. [DOI] [PubMed] [Google Scholar]
- 42.Helbling DE, Hollender J, Kohler H-PE, Singer H, Fenner K. High-throughput identification of microbial transformation products of organic micropollutants. Environ Sci Technol. 2010;44(17):6621–6627. doi: 10.1021/es100970m. [DOI] [PubMed] [Google Scholar]
- 43.Castro G, Rodríguez I, Ramil M, Cela R. Selective determination of sartan drugs in environmental water samples by mixed-mode solid-phase extraction and liquid chromatography tandem mass spectrometry. Chemosphere. 2019;224:562–571. doi: 10.1016/j.chemosphere.2019.02.137. [DOI] [PubMed] [Google Scholar]
- 44.Kasprzyk-Hordern B, Dinsdale RM, Guwy AJ. The occurrence of pharmaceuticals, personal care products, endocrine disruptors and illicit drugs in surface water in South Wales, UK. Water Res. 2008;42(13):3498–3518. doi: 10.1016/j.watres.2008.04.026. [DOI] [PubMed] [Google Scholar]
- 45.Brüggen S, Schmitz OJ. A new concept for regulatory water monitoring via high-performance liquid chromatography coupled to high-resolution mass spectrometry. J Anal Test. 2018;2(4):342–351. doi: 10.1007/s41664-018-0081-5. [DOI] [Google Scholar]
- 46.Nödler K, Hillebrand O, Idzik K, et al. Occurrence and fate of the angiotensin II receptor antagonist transformation product valsartan acid in the water cycle--a comparative study with selected β-blockers and the persistent anthropogenic wastewater indicators carbamazepine and acesulfame. Water Res. 2013;47(17):6650–6659. doi: 10.1016/j.watres.2013.08.034. [DOI] [PubMed] [Google Scholar]
- 47.Eckert P, Lamberts R, Wagner C. The impact of climate change on drinking water supply by riverbank filtration. Water Sci Technol Water Supply. 2008;8(3):319–324. doi: 10.2166/ws.2008.077. [DOI] [Google Scholar]
- 48.Derx J, Blaschke AP, Blöschl G. Three-dimensional flow patterns at the river–aquifer interface — a case study at the Danube. Adv Water Resour. 2010;33(11):1375–1387. doi: 10.1016/j.advwatres.2010.04.013. [DOI] [Google Scholar]
- 49.Shah RP, Singh S. Identification and characterization of a photolytic degradation product of telmisartan using LC-MS/TOF, LC-MSn, LC-NMR and on-line H/D exchange mass studies. J Pharm Biomed Anal. 2010;53(3):755–761. doi: 10.1016/j.jpba.2010.05.005. [DOI] [PubMed] [Google Scholar]
- 50.Bhagwate S, Gaikwad NJ, Tarte P. Stability indicating HPLC method for the determination of candesartan in pharmaceutical dosages form. IJSPR. 2013;4(3):1079–1085. [Google Scholar]
- 51.Shah RP, Sahu A, Singh S. Identification and characterization of degradation products of irbesartan using LC-MS/TOF, MS(n), on-line H/D exchange and LC-NMR. J Pharm Biomed Anal. 2010;51(5):1037–1046. doi: 10.1016/j.jpba.2009.11.008. [DOI] [PubMed] [Google Scholar]
- 52.Schaffer M, Kröger KF, Nödler K, et al. Influence of a compost layer on the attenuation of 28 selected organic micropollutants under realistic soil aquifer treatment conditions: insights from a large scale column experiment. Water Res. 2015;74:110–121. doi: 10.1016/j.watres.2015.02.010. [DOI] [PubMed] [Google Scholar]
- 53.Berkner S, Thierbach C. Biodegradability and transformation of human pharmaceutical active ingredients in environmentally relevant test systems. Environ Sci Pollut Res Int. 2014;21(16):9461–9467. doi: 10.1007/s11356-013-1868-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hellauer K, Karakurt S, Sperlich A, et al. Establishing sequential managed aquifer recharge technology (SMART) for enhanced removal of trace organic chemicals: experiences from field studies in Berlin, Germany. J Hydrol. 2018;563:1161–1168. doi: 10.1016/j.jhydrol.2017.09.044. [DOI] [Google Scholar]
- 55.Burke V, Schneider L, Greskowiak J, et al. Trace organic removal during river bank filtration for two types of sediment. Water. 2018;10(12):173. doi: 10.3390/w10121736. [DOI] [Google Scholar]
- 56.Kardono LBS, Angerhofer CK, Tsauri S, Padmawinata K, Pezzuto JM, Kinghorn AD. Cytotoxic and antimalarial constituents of the roots of Eurycoma longifolia. J Nat Prod. 1991;54(5):1360–1367. doi: 10.1021/np50077a020. [DOI] [PubMed] [Google Scholar]
- 57.Verstraeten IM, Thurman EM, Lindsey ME, Lee EC, Smith RD. Changes in concentrations of triazine and acetamide herbicides by bank filtration, ozonation, and chlorination in a public water supply. J Hydrol. 2002;266:190–208. doi: 10.1016/S0022-1694(02)00163-4. [DOI] [Google Scholar]
- 58.Nagy-Kovács Z, László B, Fleit E, et al. Behavior of organic micropollutants during river bank filtration in Budapest, Hungary. Water. 2018;10(12):1861. doi: 10.3390/w10121861. [DOI] [Google Scholar]
- 59.Dragon K, Drozdzynski D, Gorski J, Kruc R. The migration of pesticide residues in groundwater at a bank filtration site (Krajkowo well field, Poland) Environ Earth Sci. 2019;78(20):109. doi: 10.1007/s12665-019-8598-0. [DOI] [Google Scholar]
- 60.Kiefer K, Müller A, Singer H, Hollender J. New relevant pesticide transformation products in groundwater detected using target and suspect screening for agricultural and urban micropollutants with LC-HRMS. Water Res. 2019;165:114972. doi: 10.1016/j.watres.2019.114972. [DOI] [PubMed] [Google Scholar]
- 61.Mueller TC, Banks PA, Steen WC. Microbial degradation of flurtamone in three Georgia soils. Weed Sci. 1991;39(2):270–274. doi: 10.1017/S0043174500071599. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The annotated network raw data can be accessed through https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=a35c2a4e0408402389c48d78ab692b7a. Please note that the raw data has later been cleared of double hits and obvious false positives in the subsequent Cytoscape visualization.
Not applicable.