Skip to main content
Applied and Environmental Microbiology logoLink to Applied and Environmental Microbiology
. 2025 Oct 17;91(11):e01508-25. doi: 10.1128/aem.01508-25

Gilthead sea bream gut bacteriome as a valuable tool for seafood provenance analysis

Eduardo Feijão 1,2,, Irina A Duarte 1, Marcelo Pereira 2, Pedro Pascoal 2, Mónica Nunes 2, Susanne E Tanner 1,3, Ricardo Dias 2,4,5, Bernardo Duarte 1,4, Ana Rita Matos 2,4, Andreia Figueiredo 2,4, Vanessa F Fonseca 1,3
Editor: Knut Rudi6
PMCID: PMC12628811  PMID: 41104935

ABSTRACT

The increasing demand for high-quality seafood underscores the significant challenges posed by rampant seafood fraud. This study aimed to identify regional capture biomarkers by using the gut bacteriome of Sparus aurata specimens through state-of-the-art long-read sequencing allied to machine learning tools. The gut bacteriomes of animals from four different fishing areas on the Portuguese coast were sequenced. The alpha and beta diversity analyses were shown to enable Center-South gut bacteriome differentiation from other fishing areas due to higher abundance of species of the phyla Pseudomonadota, Bacteroidota, and Bacillota and classes such as Alphaproteobacteria, Betaproteobacteria, and Bacilli. The gradient boosting machine (GBM) model selected by the H2O automatic machine learning pipeline presented a high global accuracy during training and validation phases, identifying Center-South and South sample provenance with 100% and 71.1% accuracy, respectively. By integrating the most important OTUs to the GBM model with the regional biomarkers identified through point biserial correlation analysis (indicspecies packages), a reduced set of five provenance biomarkers was identified, belonging to Gammaproteobacteria, Betaproteobacteria, and Bacilli classes, possibly highlighting the anthropogenic activities surrounding the fishing areas and local environmental abiotic factors. This study highlights the extensive and valuable information obtained by long-read sequencing and couples it with the potential of machine learning algorithms to ultimately demonstrate its efficiency in providing efficient and highly accurate seafood provenance biomarkers. This study also reports the likely influence of industrial and recreational activities, population density, and water management facilities on the gut bacteriome of S. aurata.

IMPORTANCE

This study significantly contributes to a topic of utmost importance—seafood provenance analysis and seafood fraud—by leveraging gut bacteriome profiling. Through the application of long-read sequencing and machine learning, it identifies reliable biomarkers that distinguish gilthead sea bream from different fishing areas. These findings enhance traceability methods by providing a robust tool to combat seafood fraud and ensure food authenticity, thereby protecting the supply chain, the consumer, and the environment. Additionally, this study explores the possible interactions between the gut bacteriome and the industrial, recreational, and environmental factors that could influence the identified biomarkers of regional provenance while also offering insights into the composition of the marine ecosystems surrounding the fishing areas. This approach has broader implications for fishery management, sustainable sourcing, and regulatory enforcement.

KEYWORDS: biomarkers, fish provenance, gut bacteriome, metagenomic, Sparus aurata

INTRODUCTION

According to the Food and Agriculture Organization (FAO), seafood-derived proteins account for 17% of global protein consumption (1). The increasing demand for high-quality seafood availability by an ever-growing human population has brought to the attention of stakeholders, consumers, producers, fishers, and managers the challenges induced by widespread seafood fraud activity and unsustainable fishing practices. Naturally, with consumption rates and revenue increasing, so does the potential for food fraud motivation, which involves misleading the consumer for financial gain and poses a risk to food integrity, the economy, and public health (2). The seafood supply chain is vulnerable to various types of fraud related to adulteration, provenance, and ethical trade of seafood products, including catch method fraud, illegal, unreported, and unregulated (IUU) fishing practices, and species substitution (3, 4).

To overcome these issues, an array of different practices and countermeasures from multiple stakeholders, including private label owners, international institutions, and lawmakers, have been implemented throughout the supply chain to detect and impair malpractices (3, 5). Nonetheless, biochemical and high-throughput screening techniques, including elemental fingerprinting (68), fatty acid analysis (9, 10), stable isotope analysis, and DNA profiling (11), have already been applied with promising results in the field of food traceability.

As an emerging tool for evaluating fish provenance, the gut bacteriome has been the subject of studies to evaluate the complex relationships between microorganisms and their hosts (12, 13). Early studies revealed the significant influence of microbial communities on the digestive and immune functions of host organisms as well as their susceptibility to changes influenced by various host-related factors (14, 15). Notably, researchers have found that habitat and host selection play crucial roles in shaping the composition of gut bacteria in fish (14, 15). In recent decades, long-read Oxford Nanopore sequencing has been rapidly evolving, bypassing the pitfalls that involve high error rates, flow cell compatibility, and cost management, garnering attention owing to its potential, reliability, and on-site use (1618), making it a tempting high-throughput sequencing tool to analyze the genetic diversity and ecosystem structure (19, 20). This potential has also recently been explored for the detection and characterization of various biomarkers of seafood authenticity and provenance (19, 21, 22).

Sparus aurata, commonly known as the gilthead sea bream, belongs to the Sparidae family and Perciformes order and can be found in the warm and euryhaline waters of the Mediterranean Sea and North-East Atlantic Ocean (23). S. aurata is a highly valuable species, considering both capture fisheries and large-scale aquaculture, with considerable consumer value attributed to its excellent flesh quality and availability (rapid growth). Global catches of Gilthead sea bream were estimated at 13.9 tons in 2021, with the EU providing over 27% of these catches, from countries including France, Greece, Spain, Italy, and Portugal (24).

In Portugal, a country that consistently appears in the top tier globally for fish consumption per capita, the capture of wild Sparus aurata specimens amounted to 219 tons in 2022. Notably, the price of these specimens increased by 8.7% compared with that in the previous year. As of 2022, S. aurata specimens were sold at an approximate price of 13.74 €/kg (25). Owing to the increasing importance of commercial and recreational fishing, and the consequent high value of wild-caught specimens, cases of fraud, especially those related to the catch method, have been reported (26).

Considering the pitfalls and challenges within the current fishing industry, and with the goal of fostering controlled, regulated, and environmentally conscious commerce, this study aimed to identify robust biomarkers for ascertaining the geographical origin of wild-caught sea bream along the coast of Portugal by applying long-read sequencing of the gut metagenome together with machine learning models.

MATERIALS AND METHODS

Study areas and sample collection

Sparus aurata samples were collected in 2020 at four locations ranging from the upper geographical limit of its distribution at the Center-North (Peniche) of Portugal to the Center (Sesimbra), Center-South (Sines), and including the South of Portugal (Olhão), encompassing a fine-scale spatial distribution of approximately 500 km of marine coastal waters (Fig. 1). Maps were generated in R-Studio (R version 4.3.2 – “Eye Holes”) using coastline data from the Global Self-consistent, Hierarchical, High-resolution Shoreline Database (GSHHS, 27).

Fig 1.

Map depicts Portugal with four marked regions along Atlantic coast named center-north, center, center-south, and south, bordered by Spain to east and Atlantic Ocean to west.

Location of the four sampling areas along the Portuguese west and south coasts. Center-North sampling area (green); Center sampling area (blue); Center-South sampling area (red); South sampling area (yellow). Map generated using coastline data from the Global Self-consistent, Hierarchical, High-resolution Shoreline Database (GSHHS).

Thirty specimens with similar size and weight range (mean length = 331.78 mm ±21.73 mm; mean weight = 538.39 g ±104.71 g) were collected in each area, transported immediately to the laboratory, and post-mortem dissected to sample the intestinal tract, specifically 1 cm of intestine just above the anus, and stored at −80°C until further analysis. For morphological parameters, see Table S1.

DNA extraction and amplification

Total DNA extraction was performed using the MGIEasy Nucleic Acid Extraction kit (MGI Tech, USA), according to the manufacturer’s instructions using the MGISP-960 robot. Genetic material quality and quantity were assessed using the NanoDrop One and Qubit 4 Fluorometer analysis. For the rDNA gene amplification, Long Amp hot start Taq 2 × master mix (New England Biolabs, MA, USA) was used at 1 x along with 50 ng/µL of genomic DNA. To amplify the full-length 16S rDNA bacterial gene, 0.25 µM of the primer pair 27F (5′-AGAGTTTGATCMTGGCTCAG-3′) and 2R (5′-CGGTTACCTTGTTACGACTT-3′) was used. PCR was conducted on a Biometra UNO II, using the following conditions: 1 cycle of 94°C for 1  min, 35 cycles of 94°C for 20 s, 55°C for 30 s, and 65°C for 2 min and a final extension of 65°C for 5 min. Subsequently, amplification products were visualized through gel electrophoresis and purified using the Solid Phase Reversible Immobilization (SPRI) technique with magnetic beads (28, 29). Quantification steps were performed using the 1x dsDNA HS assay for Qubit.

Metagenomic sequencing by long-read Nanopore quantification

DNA was end-repaired with NEBNext End Repair Module (New England BioLabs, MA, USA), cleaned with Agencourt AMPure XP Beads (Beckman Coulter, High Wycombe, UK), and dA-tailed (New 160 England BioLabs, MA, USA). The library was prepared from 300  ng input DNA from each sample using the Sequencing Native Barcoding Kit 24 V14 (SQK-NBD114.24) (Oxford 162 Nanopore Technologies, Oxford, UK) in accordance with the manufacturer’s protocol. The library was quantified and prepared for PromethION sequencing, using FLO-PRO114M flowcells, MinKNOW v22.12.4, and standard 72 h run script with active channel selection enabled. DNA flow rate was 400 bases/s. After 24 h, 3,000,000 passed reads were yielded with an estimated N50 of 1,500 bp, and the mean quality score was 14. In total, 4.5 Gb of data was produced, with an average of 750,000 reads per sample. Data were collected using the Epi2Me software (Oxford Nanopore Technologies, UK).

Bioinformatic analysis

Following the removal of low-quality gDNA reads, the remaining reads were filtered using Prinseq-lite version 0.20.4, keeping reads with length and Phred score higher than 300 bps and 7, respectively (30). Taxonomical classification was performed using an in-house analytical pipeline for long-read Nanopore sequencing, as described by (31). Taxonomic classification was obtained through Kraken2 (version 2.1.2) with default settings and the NCBI Refseq database (Bacteria) to merge assignments into the highest common taxonomic rank (LCA) (31). Each taxonomic rank classification was considered an operational taxonomic unit (OTU), allowing for a direct interpretation of taxonomic diversity without the need for traditional clustering methods.

The following analyses were performed using the R-Studio version 4.3.2 – “Eye Holes.” Only the OTUs present in more than 25% of the samples were included in the alpha and beta diversity analyses.

Rarefaction, alpha, and beta diversity analyses were performed using the phyloseq package (32). Statistical analysis of the different diversity parameters and data abundances across all sampling sites was conducted using the Kruskal-Wallis tests, with the ggpubr package used for improved data visualization (33). Further comparative analysis of taxa abundance was performed using phylogenetic heat trees and the metacoder package (34). To limit potential taxonomic bias at higher taxonomic levels, only OTUs with a complete taxonomic lineage were used, and standard data preprocessing procedures following package guidelines were implemented. OTUs with zero and low abundance (counts < 5) were excluded, and data normalization for uneven sampling was performed following the method proposed by (34). Visualization was accomplished using the Davidson-Harel layout, with node placement facilitated by the Reingold-Tilford algorithm. Further analysis involved evaluation of significant differences in taxonomic abundance between sample groups using the built-in Wilcoxon test within the metacoder package.

For the traceability model development, the H2O Automatic Machine Learning pipeline was used (35, 36). The training and prediction phases were conducted after randomly partitioning the data set into training (70%) and testing (30%) subsets. The tested models included deep learning, distributed random forests (DRF), generalized linear models (GLM), extreme gradient boosting (XGBoost), gradient boosting machines (GBM), and stacking ensemble. A maximum of 20 models were generated, and their performance was evaluated using the H2O Automatic Machine Learning pipeline. The generated models were evaluated based on their test and training accuracy. The GBM model was selected using the following parameters. Fivefold cross-validation used, and the score of the model was evaluated after every five trees. A total number of 40 trees was used with a maximum tree depth of 8. Feature importance was calculated using the H2O automatic machine-learning pipeline for the best-performing model.

The indicator OTUs for each fishing area gut bacteriome were extracted using point biserial correlation analysis using the indicspecies packages (37, 38) and filtered according to the most important OTUs in the prediction model. The most important OTUs in the model were those that cumulatively explained 80% of the results. Biomarker abundance within the provenance categories and accumulative abundance were assessed using the agricolae package, and statistical analysis was performed using the Kruskal-Wallis analysis (39).

RESULTS

Long-read sequencing analysis

The gut bacteriome of S. aurata specimens was sequenced using long-read Oxford Nanopore sequencing technology. In total, 187 thousand reads were registered, 180 thousand of which were used for taxonomic construction using a custom in-house pipeline that divided the obtained data into sets of sequences of length ~1,300 bp. As a result of the first data processing, it was possible to identify 47,390 OTUs that belonged to 92 phyla, 91 classes, 225 orders, 586 families, 2,935 genera, and 38,607 identified species. After further processing, which included discarding OTUs that were not present in at least 25% of the samples, 1,804 OTUs that belonged to 5 kingdoms, 23 phyla, 40 classes, 90 orders, 190 families, 446 genera, and 1,577 species were used to analyze alpha and beta diversities.

Alpha and beta diversity analysis

When comparing the gut bacteriome species richness in S. aurata samples from different fishing areas along the coast of Portugal, as represented by the Chao1 index (Fig. 2A), the Center-South samples tended to have a significantly higher number of species when compared to other areas, excluding the Center-North fishing area. Using the abundance-based coverage estimator index (ACE) to assess OTU abundance, a similar pattern was observed in Sparus aurata samples from the Center-South region, which differed significantly from all other fishing areas. In contrast, no significant variation in species richness estimators was detected among the remaining fishing areas. When taking into consideration Shannon’s diversity index (H), which indicates how difficult it is to correctly predict the OTUs of a sample, significantly higher H values were observed in samples from the Center-North, followed by samples from the South and Center areas, revealing no statistical significance among them. The gut microbiome of the Center-South samples shows less diversity when compared to the gut microbiomes of the specimens prevenient of the other fishing areas (P < 0.01).

Fig 2.

Plots depict alpha diversity metrics across regions, stacked bar chart of phylum relative abundance, hierarchical clustering tree, and PCoA plot comparing microbial community composition across center-north, center, center-south, and south regions.

Diversity analysis of S. aurata gut bacteriome samples from four fishing areas along the coast of Portugal. (A) Alpha diversity analysis; (B) relative abundance (%) of the top 10 most abundant phyla; (C) cluster analysis based on Jaccard dissimilarities; (D) principal coordinate analysis (PCoA); N = 30 replicates per site. Asterisks denote significant differences at P < 0.05 (*), P < 0.01 (**), and P < 0.001 (***) using the Kruskal-Wallis test.

Considering Pielou’s evenness index, when compared to the gut bacteriome of specimens from the Center-South fishing area, samples from the Center presented significantly higher evenness, followed by samples from the Center-North and South, and Center (P < 0.05). However, no statistical difference was observed in evenness among the Center-North, Center, and South gut bacteriomes. Overall, the alpha-diversity analysis highlighted the Center-South as the region where the gut bacteriome of S. aurata samples presented higher species richness but the lowest evenness and diversity.

The most abundant phylum was Pseudomonadota, accounting for more than 90% of the population in each sampling group, followed by Bacillota, Mycoplasmatota, and Actinomycetota (Fig. 2B). Center and Center-South presented a lower relative abundance of Pseudomonadota but higher abundance of Bacillota and Actinomycetota, while the specimens from the South of Portugal presented a higher relative abundance of Mycroplasmatota.

The β-diversity analysis was evaluated through cluster analysis using the Jaccard dissimilarity matrix and PCoA, and results indicated a separation between S. aurata samples from the Center-North and Center-South of Portugal with overall variability in samples from the South (Fig. 2C and D). PCoA explained data variance (54.4%) in the first two components, with the first axis (PCo1) explaining the highest variance proportion (39.43%) to the overall separation of Center-South samples from most of the other samples.

Analysis of taxa abundance through comparative phylogenetic trees (Fig. 3) revealed differences in branch abundance across phylogenetic trees derived from samples originating from the four fishing areas. Sparus aurata samples from the Center-South region exhibited a significantly higher abundance of OTUs from Pseudomonadota, Bacteroidota, and Bacillota phyla when compared to other regions, which was also evident from Fig. 2B. Among the most abundant classes from these phyla present in the Center-South samples, when compared to the other fishing areas, are Alphaproteobacteria, Betaproteobacteria, and Bacilli, with a higher abundance of OTUs from orders such as Rhodocyclales, Burkholderiales, and Spirillales. It is evident that the gut bacteriome of the S. aurata specimens from the Center-North, Center, and South fishing areas exhibits a higher abundance of OTUs belonging to the phylum Pseudomonadota when compared to the specimens from the Center-South region. Notably, the most substantial differences are found at the class and order levels, with great abundance of OTUs belonging to classes such as Gammaproteobacteria and orders such as Enterobacterales, Vibrionales, and Oceanospirillales. Among these central regions, the abundance of OTUs related to the phyla Bacillota, Actinomycetota, Bacteroidota, and Spirochaetota seems to be significantly and progressively increasing from the Center-North to Center-South areas.

Fig 3.

Phylogenetic trees depict microbial taxonomic composition, with branches marking differences across center-north, center, center-south, and south regions, scaled by log2 ratio median proportions and number of operational taxonomic units.

Comparative phylogenetic heat trees of the gut bacteriome of S. aurata samples from four different fishing areas along the coast of Portugal. N = 30 replicates per site; colored branches denote significantly different abundances of a specific taxon at P < 0.05, as determined by the Wilcoxon test.

Pattern recognition and data modeling

Among all the models tested, the gradient boosting machine (GBM) model achieved the best results. The results obtained from the training phase of the GBM model present in Fig. 4 (left panel) showed a 100% score of training accuracy. When using a blind test data set to test the trained model, the global accuracy of the obtained model for assigning samples to the provenance area of each test sample was still high (73.3%) (Fig. 4, right panel).

Fig 4.

Confusion matrices depict classification accuracy of training with 100 percent global accuracy and testing with 73.3 percent global accuracy across regions: center-north, center, center-south, and south.

Gradient boosting machine (GBM) model training and testing accuracy heatmaps of the gut bacteriome of Sparus aurata samples from four different fishing areas along the coast of Portugal.

The model enabled 100% accuracy in correctly classifying samples from the Center-South area, followed by 71.4% accuracy in the classification of samples from the South fishing zone. Lower efficiencies when applying the test data set were observed in samples from the Center (66.7%) and Center-North (50%) areas, with some misclassification with the assignment of samples to the Center-North area in the first case, and in the Center-South and South fishing areas in the latter. Most misclassification errors result from sample mismatches in adjacent regions (for example, between Center and Center-North areas and between South and Center-South areas).

Biomarker identification

To obtain the best possible biomarkers for identifying fish provenance, a chained variable selection based on two statistical methodologies was employed: the point biserial indicator species model (Indicspecies), which identifies OTUs sensitive to changes according to the grouping variable, and GBM Feature Importance, which allows the identification of the most influential variables in the abovementioned classification model.

Fish gut bacteriome indicator species analysis resulted in a selected number of exclusive OTUs in each sampling area. This analysis provided the highest number of possible biomarker candidates for S. aurata provenance from the Center-South (444 biomarker candidates) and Center (138 biomarker candidates) areas, with only a few biomarker candidates from the Center-North (five biomarker candidates) and South (17 biomarker candidates) fishing areas (Fig. S1, in the supplement).

The most relevant OTUs were selected until 80% cumulative feature importance was achieved, resulting in the selection of 54 candidate OTUs with higher explanatory power, considering the traceability of the GBM model produced (Fig. 5). Subsequently, the top five biomarkers for each fishing area were obtained after merging the indicator species results with the most important OTU data for the prediction model, highlighting the common biomarkers.

Fig 5.

Bar chart depicts variable importance percentages of operational taxonomic units with higher contributions from uncultured Photobacterium, uncultured marine bacterium, and Vibrio species, followed by other bacterial taxa with decreasing importance values.

Feature importance of the most important OTUs for the prediction model (cumulative importance = 80%).

To assess the suitability of the proposed biomarkers individually and as part of a set of biomarkers to trace seafood provenance, the relative abundance (%) of each candidate OTU biomarker in every sample was calculated, and its abundance was compared across sampling sites (Fig. 6). A suitable biomarker will be most abundant in the region for which it is proposed as a provenance biomarker and ideally absent or with very low abundance in the gut microbiomes of the fish specimens collected in other areas.

Fig 6.

Violin and scatterplots depict relative abundance of bacterial operational taxonomic units across center-north, center, center-south, and south regions with variation across taxa, including Pseudomonas, Vibrio, Haemophilus, Acinetobacter, and others.

Top five proposed biomarkers of the Sparus aurata gut bacteriome for each fishing area to evaluate the provenance. (A–F) Center-North biomarkers; (G–L) North biomarkers; (M–R) Center-South biomarkers; (S–X) South biomarkers. Different letters denote statistically significant differences among the fishing areas obtained through a Kruskal-Wallis statistical analysis.

Upon merging the two statistical approaches, it was evident that most of the OTUs identified as biomarkers of geographical origin belonged to the Gammaproteobacteria class (75%), with only a few corresponding to the Betaproteobacteria (20%) and Bacilli (5%) classes. Of all the suggested biomarkers, the Center-North area presented the only OTUs belonging to the order Vibrionales. The Center fishing area presented only OTUs from the Aeromonadales, Pasteurellales, and Caryophanales orders, while the Center-South and South regions presented only OTUs belonging to the orders Burkholderiales and Pseudomonadales, respectively.

For Center-North, this approach selected OTUs corresponding to Pseudomonas resinovorans, Aliivibrio salmonicida, Vibrio sp., BL112, Bacterium E1-161, and Pseudomonas sp., B35. The relative abundance of the sum of the five selected biomarker OTUs was significantly higher in the samples collected in this area than in the remaining test sites. It is important to acknowledge that the best provenance biomarkers to classify S. aurata samples from the Center-North are Aliivibrio salmonicida and Vibrio sp. BL112.

To evaluate the S. aurata provenance of the Center samples, the proposed biomarkers were Xenorhabdus stockiae, Planococcus sp., Ka-3, Aeromonas sobria, Haemophilus influenzae, and Pseudomonas libanensis. Upon suitability assessment, it is possible to observe that these biomarkers do not properly discriminate center-captured samples from specimens captured in the Center-North, Center-South, and South, individually and as a set of biomarkers, in line with the higher degree of misclassification observed in the GBM test set confusion matrix and from where the feature importance was extracted.

The proposed biomarkers to trace S. aurata gut bacteriome samples provenance to Center-South included the Brachymonas sp, Nitrosomonadaceae bacterium, Proteobacterium Core-1, Delftia sp, JD2, and Acidovorax avenae. These biomarkers showed statistically significant higher relative abundances in samples from the Center-South region when compared to the other areas. As a set of provenance biomarkers, these species demonstrated strong discriminatory power, both individually and collectively, effectively distinguishing between fishing areas.

To trace S. aurata specimens to the South fishing area, the proposed biomarkers included Acinetobacter haemolyticus, Acinetobacter johnsonii, Pseudomonas sp., ADPe, Acinetobacter sp., phenon 11, and Azomonas agilis. Acinetobacter johnsonii showed limited resolution, failing to distinguish between the South and Center-South regions. In contrast, the other proposed biomarkers exhibited higher relative abundance in the gut bacteriome of specimens from the South region, effectively discriminating this area from the others when analyzed individually. However, the cumulative relative abundance of the biomarkers did not show a statistically significant difference between the South and Center-South regions.

DISCUSSION

The escalating prevalence of fish fraud poses significant risks to the environment, economy, and human health (2). This underscores the critical need to counter fraudulent activities by implementing stringent regulations. Additionally, it is of utmost importance to explore innovative techniques and determine new biomarkers to accurately determine fish identity and provenance so that we can safeguard the environment from resource exploitation. Through this approach to explore innovative techniques and determine new biomarkers, it is also possible to link high-quality fresh seafood to geographical provenance, increasing the value of current seafood products and investing in commerce, recreational fishing activities, and local seafood markets and restaurants. Ultimately, the biggest goal is to mitigate health hazards for consumers while also protecting the environment and economy. This study used long-read Nanopore sequencing to analyze the gut bacteriome of four groups of Sparus aurata from different fishing regions along the Portuguese coast, spanning approximately 500 km. The aim was to identify distinct biomarkers indicative of regional provenance based on their geographical distribution. The gut bacteria of fish are heavily influenced by factors such as water temperature, salinity, oxygen concentration, diet, feeding habits, habitat community, and intrinsic factors such as genetics and developmental stages (15, 40). However, it has been reported that most of the gut bacteriome of fish is composed of bacteria with a high abundance of the phyla Pseudomonadota, Bacillota, and Bacteroidota (41), with carnivorous fish showing a higher abundance of Pseudomonadota and Bacillota than herbivorous species (13, 42). While the alpha diversity analysis results of the different S. aurata gut bacteriome groups revealed key differences in species richness, diversity, and equitability among the gut bacteriomes, the most abundant OTUs belonged to the phyla Pseudomonadota, Bacillota, Mycoplasmatota, and Actinomycetota, which is in accordance with the aforementioned literature. When compared at the species level, our results are supported by other published studies, although the abundance of each phylum may vary according to the body part analyzed (e.g., flesh, stomach, and intestines), capture, and production methods (40, 4345). Interestingly, comparative phylogenetic heat tree analysis revealed that gut bacteriome samples of specimens from the Center-North, Center, and South presented a higher abundance of OTUs belonging to the Gammaproteobacteria class and orders Vibrionales and Oceanospirillales, which have been reported to increase in acidic environments and are associated with various seaweed species (46). Although the algal community, surface water temperature, and ocean acidification are interactive factors in marine ecosystems, it is not surprising that these OTUs were higher in the gut bacteriome of Southern samples because of the overall higher surface water temperature that bathes the South coast with the Mediterranean Sea influence (4751). However, it is expected that colder surface water temperatures might influence the abundance of other bacteria belonging to these orders, specifically the order Vibrionales, which includes bacteria known to thrive in cold waters (52). It is also important to note that outside of the Center-South region, S. aurata specimens seem to have more OTUs belonging to the order Enterobacterales, which includes cellulolytic enzyme-producing bacteria commonly found in herbivorous fish, possibly highlighting dietary differences between Center-South and the other fishing areas, as opposed to OTUs belonging to bacteria of the Bacilli class (53).

While diversity analysis was used to assess possible differences among the gut bacteriomes of S. aurata samples, machine learning algorithms were used to identify patterns and interactions between variables that allow extraction of fish provenance prediction models and biomarkers (54). In the training phase, the GBM model presented 100% accuracy, discriminating patterns among OTU abundance, taxonomic identification, and fishing areas. When applying the blind sample data set, the global model accuracy remained high at 73.3%, with three sites classified with accuracies above 66.7%. Notably, samples from the Center-South area presented the highest accuracy, aligned with the differences in the various diversity indices explored in the gut bacteriome. The lower accuracy of the model predictions among the other three fishing areas is also supported by the alpha diversity, cluster, and PCA analyses. These results highlight a few challenges in evaluating fish provenance, including species mobility and geographical distribution. S. aurata is a mobile species and therefore difficult to control its movement toward other fishing areas. The presence of this species above the Central-Northern area is not common, although it has been reported to occur more often in the Celtic Sea and in the cold waters of the English Channel (5557).

After crossing the most important OTUs for the model with the Indicator Species analysis, it was possible to extract five biomarkers that can be used as a package tool for easier, faster, and cheaper fish food provenance analysis. Overall, the proposed biomarkers linking gut bacteriome samples to fishing areas appear to be influenced by anthropogenic activities, such as primary and extractive industries including agriculture, fishing, and mining, as well as recreational activities and environmental factors such as surface water temperature, and resistome (5862). This approach aims to reduce the number of OTUs to be analyzed to confirm or assess the provenance of a certain S. aurata sample using targeted analysis while maintaining the explanatory resolution of the analyzed traits.

The Center-North region is a densely populated area, with an increased seasonal population density due to its touristic characteristics, but also includes areas of intense fishing activities, industries related to the transformation of primary products, and agricultural activities (61, 63). The high relative abundance of Aliivibrio salmonicida, a gram-negative bacterium and, to a lesser extent, Vibrio sp. BL112 may be related to water temperature because it thrives in cold waters (52). The Portuguese coast experiences some variation in surface water temperature; therefore, it is not surprising to find these bacteria more commonly near the northern coast and outside the warmer waters of the southern Portuguese coast (47, 62). Information regarding bacterium E1-161 is scarce, and its taxonomical classification places it in the order Enterobacterales. Bacteria from this order are often reported to develop resistance to antibiotics due to their ability to produce β-lactamases, β-lactam antibiotic degrading enzymes, rendering their efficacy null (64). Additionally, it is interesting to observe that the relative abundance of this specific biomarker was similar in the Center-North and Center regions, neighboring areas with high coastal population density. This is in line with previous studies, where the occurrence of antibiotic resistance genes was directly related to environments subjected to high anthropogenic pressure (58). Agricultural activities are also prominent throughout the country, so it was no surprise that biomarkers related to the Pseudomonas genus (P. resinovorans and Pseudomonas sp. B35), including pesticide-tolerant bacteria (e.g., Pseudomonas resinovorans SZMC 25872), did not show differences in relative abundance among the fishing areas (65). Although individually, these two biomarkers present lower resolution compared to the other biomarkers, to assign a certain gut microbiome sample as originating from the Center-North area, when grouped as a provenance discrimination set, the discrimination success was statistically validated.

The proposed biomarkers for tracing S. aurata provenance to the central region included Aeromonas sobria and Haemophilus influenzae, which have been shown to develop antibiotic resistance (66, 67), often correlated with high population densities (68). This could be related to previously reported high concentrations of antibiotics present in the waters of the Tagus estuary, which is surrounded by the Lisbon metropolitan area, an area with approximately 2.5 million inhabitants (59). The other proposed biomarkers include Pseudomonas libanensis, which is a denitrifying bacterium that has also been isolated from wastewater. Again, the large wastewater outfall from the Lisbon Metropolitan area may be responsible for the high abundance of this bacteria in the specimens captured in the vicinity of this large urban center (6971); the Xenorhabdus stockiae bacterium could also be related to the extensive agricultural activity in this region (72). Information regarding Panococcus sp., Ka-3 is scarce, although some strains have been reported to be able to use hydrocarbons as carbon sources, such as those abundant near port areas (73). Two of the most prominent ports in Portugal (Lisbon and Setúbal) could explain the higher abundance of these bacteria in the gut microbiomes of species captured in the Center area (74). Although the resolution and discrimination power of this group of biomarkers were low, tracing S. aurata specimens to the Center fishing area will ultimately rely on the strength of the biomarkers from the other areas. Specimens caught in the Center can be identified by a process of exclusion when they do not match the biomarker profiles of the other areas.

The biomarkers used to trace S. aurata samples to the Center-South region were successfully identified, and both their resolution as an individual and a set of provenance biomarkers were remarkable. The relative abundance of the top five biomarkers was higher in samples from the Center-South than in any other fishing area, and the same pattern was observed for each biomarker. This results from the high accuracy of the GBM classification for this area (100% accuracy) and is thus reflected in the selected high-importance features. This group of biomarkers is likely linked to three major industrial and commercial activities in the study area, a highly populated and industrialized region. These activities include: (i) the commercial port of Sines, the main port in the Ibero-Atlantic front, which is the country’s leading energy supplier of crude and its derivatives (coal and natural gas); (ii) the remaining cattle breeding activities and/or intense agricultural exploitation; and (iii) the mining activity that surrounds the area that includes the Aljustrel mines (60, 75).

These activities could be sources of environmental contamination with heavy metals such as Cr, As, Pb, Cd, Cu, Fe, and Zn, as well as other contaminants such as ammonia and other nitrogen compounds (60, 76). This group of biomarkers includes heavy metal-tolerant bacteria such as the uncultured Brachymonas sp. and the ammonia-oxidizing Nitrosomonadaceae bacterium, which has been reported to be resistant to a wide array of metals and metalloids commonly found near industrial areas (As, Pb, Cd, Cu, and Zn) (77); and Pb(II) and Cr(II)-resistant Delftia sp. bacteria (78). The bacterium Acidovorax avenae has been reported to degrade chlorobenzenes, which are widely used in many industries (e.g., pharmaceuticals, agriculture, dyes, and petrochemical industries) and often reach water bodies through involuntary discharges (7981). Finally, Proteobacterium Core-1 belongs to the Gammaproteobacteria class, and while this specific bacterium and its presence in the gut bacteriome of S. aurata requires more extensive research, possible cross-contamination between soil and surface water bodies cannot be excluded since this class has been known to respond to different concentrations of environmental soil pollution (82, 83).

The Acinetobacter genus includes three reported bacteria in the set of biomarkers presented as candidates for classifying gut bacteriome samples from specimens captured in the southern area of Portugal. This group of biomarkers has been reported to be heavy metal-tolerant and abundant in WWTPs, similar to the other OTUs mentioned above. Nevertheless, the higher water temperatures in this area of Portugal could influence their abundance (84, 85). The Acinetobacter johnsonii and Acinetobacter haemolyticus are tolerant to metals such as copper, lead, and chromium (8486). Nonetheless, bacteria of this genus seem to prefer warmer temperatures around 25°C, which can be found in the Southern Portuguese coastal waters and match the optimal growth temperatures (22°C to 26°C) of S. aurata (62). This abiotic factor seems to be influencing other biomarkers such as Azomonas agilis, which prefers warmer surface water temperatures (87).

The last proposed biomarker is the Pseudomonas sp. ADPe, which could be related to the high abundance of golf courses along the South region, specifically in Algarve, because of its capacity to degrade atrazine, a chlorinated herbicide used to control broadleaf weed growth (8890). However, this genus of bacteria is often naturally present in golf courses, and one cannot exclude environmental cross-contamination (91).

Overall, long-read Oxford Nanopore sequencing provided comprehensive insights into the gut bacteriome of Sparus aurata, leveraging detailed OTU abundance data to identify a practical and accurate set of biomarkers for determining fish provenance. Additionally, this study highlights the impact of industrial and recreational activities, population density, and water management infrastructure on coastal ecosystems, with a particular focus on their effects on the gut bacteriome of S. aurata. Future work should improve on the proposed model (e.g., by increasing sample size), evaluate biomarker stability through time and seasons, test the present or improved methodology on other species and/or farming practices, and validate these biomarkers.

Conclusion

The gut bacteriome of S. aurata has emerged as a valuable tool for discerning regional variation among different fishing areas. Through diversity analysis, distinct patterns in OTU abundance were uncovered in the Center-South samples, which exhibited a higher prevalence of OTUs associated with Rhodocyclales, Burkholderiales, and Spirillales orders. Conversely, other fishing areas displayed an elevated abundance of OTUs attributed to orders Enterobacterales, Vibrionales, and Oceanospirillales. These differences were also evident in the top five biomarkers in each fishing area.

Employing machine learning techniques yielded remarkable efficiency during the training phase, achieving a perfect accuracy score. Furthermore, these methods successfully identified the origin of the gut bacteriome from blind samples with a commendable accuracy of 73.3%, notably by linking the provenance of blind and random S. aurata gut bacteriome samples to the Center-South.

Building on these findings, this study proposed 20 potential biomarkers for evaluating the provenance of Sparus aurata. These biomarkers offer promising avenues for enhancing traceability efforts within the fishery industry, facilitating more precise regional assessments, and bolstering quality control measures along the supply chain.

The proposed biomarkers for tracing S. aurata provenance appear to reflect regional environmental pressures, including seasonal human activity, water temperature gradients (e.g., Aliivibrio salmonicida and Vibrio sp., BL112), and agriculture (e.g., Pseudomonas resinovorans and Pseudomonas sp., B35) in the Center-North; urbanization (e.g., Aeromonas sobria and Haemophilus influenzae), wastewater discharge (e.g., Pseudomonas libanensis), agricultural runoff (e.g., Xenorhabdus stockiae), and port activity (e.g., Panococcus sp., Ka-3) in the Center; industrial and port activity pollution, adjacent mining activities, and heavy metal exposure (e.g., Brachymonas sp., Nitrosomonadaceae, and Delftia sp.) in the Center-South; and warmer water temperatures (e.g., Azomonas agilis), WWTP density (e.g., Acinetobacter johnsonii and A. haemolyticus), and recreational land use (e.g., golf courses; Pseudomonas sp., ADPe) in the South.

ACKNOWLEDGMENTS

We thank Fundação para a Ciência e a Tecnologia (FCT) for funding MARE (Marine and Environmental Sciences Center, http://doi.org/10.54499/UIDB/04292/2020 and http://doi.org/10.54499/UIDP/04292/2020), ARNET (Aquatic Research Network Associated Laboratory, http://doi.org/10.54499/LA/P/0069/2020), BioISI (Biosystems and Integrative Sciences Institute, http://doi.org/10.54499/UIDB/04046/2020 and http://doi.org/10.54499/UIDP/04046/2020), Ce3C (Center for Ecology, Evolution and Environmental Changes, http://doi.org/10.54499/UIDB/00329/2020 and http://doi.org/10.54499/UIDP/00329/2020) and CHANGE (Global Change and Sustainability Institute, http://doi.org/10.54499/LA/P/0121/2020). E. Feijão was supported by FCT through a Ph.D. Grant (https://doi.org/10.54499/2022.11916.BD).

Contributor Information

Eduardo Feijão, Email: emfeijao@ciencias.ulisboa.pt.

Knut Rudi, Norwegian University of Life Sciences, Ås, Norway.

DATA AVAILABILITY

Raw data were submitted to the National Center for Biotechnology Information (NCBI) under the Sequence Read Archive (SRA) accession no. PRJNA1330551.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/aem.01508-25.

Supplemental material. aem.01508-25-s0001.docx.

Table 1, S. aurata morphological parameters; Fig. S1, indicator species results.

aem.01508-25-s0001.docx (449.5KB, docx)
DOI: 10.1128/aem.01508-25.SuF1

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. FAO . 2020. The state of world fisheries and aquaculture 2020 INFORM. FAO. Available from: http://www.fao.org/documents/card/en/c/ca9229en [Google Scholar]
  • 2. van Ruth SM, Huisman W, Luning PA. 2017. Food fraud vulnerability and its key factors. Trends Food Sci Technol 67:70–75. doi: 10.1016/j.tifs.2017.06.017 [DOI] [Google Scholar]
  • 3. Fox M, Mitchell M, Dean M, Elliott C, Campbell K. 2018. The seafood supply chain from a fraudulent perspective. Food Sec 10:939–963. doi: 10.1007/s12571-018-0826-z [DOI] [Google Scholar]
  • 4. Lawrence S, Elliott C, Huisman W, Dean M, van Ruth S. 2022. The 11 sins of seafood: assessing a decade of food fraud reports in the global supply chain. Compr Rev Food Sci Food Saf 21:3746–3769. doi: 10.1111/1541-4337.12998 [DOI] [PubMed] [Google Scholar]
  • 5. European Parliament and Council . 2013. Regulation (EU) No 1379/2013 of the European Parliament and of the Council of 11 December 2013 on the common organisation of the markets in fishery and aquaculture products, amending Council Regulations (EC) No 1184/2006 and (EC) No 1224/2009 and repealing Council Regulation (EC) No 104/2000. Official Journal of the European Union, L354/1–L354/18
  • 6. Duarte B, Duarte IA, Caçador I, Reis-Santos P, Vasconcelos RP, Gameiro C, Tanner SE, Fonseca VF. 2022. Elemental fingerprinting of thornback ray (Raja clavata) muscle tissue as a tracer for provenance and food safety assessment. Food Control 133:108592. doi: 10.1016/j.foodcont.2021.108592 [DOI] [Google Scholar]
  • 7. Mamede R, Duarte IA, Caçador I, Reis-Santos P, Vasconcelos RP, Gameiro C, Canada P, Ré P, Tanner SE, Fonseca VF, Duarte B. 2022. Elemental fingerprinting of wild and farmed fish muscle to authenticate and validate production method. Foods 11:3081. doi: 10.3390/foods11193081 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Duarte B, Mamede R, Carreiras J, Duarte IA, Caçador I, Reis-Santos P, Vasconcelos RP, Gameiro C, Ré P, Tanner SE, Fonseca VF. 2022. Harnessing the full power of chemometric-based analysis of total reflection X-ray fluorescence spectral data to boost the identification of seafood provenance and fishing areas. Foods 11:2699. doi: 10.3390/foods11172699 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Farabegoli F, Pirini M, Rotolo M, Silvi M, Testi S, Ghidini S, Zanardi E, Remondini D, Bonaldo A, Parma L, Badiani A. 2018. Toward the authentication of European sea bass origin through a combination of biometric measurements and multiple analytical techniques. J Agric Food Chem 66:6822–6831. doi: 10.1021/acs.jafc.8b00505 [DOI] [PubMed] [Google Scholar]
  • 10. Fonseca VF, Duarte IA, Matos AR, Reis-Santos P, Duarte B. 2022. Fatty acid profiles as natural tracers of provenance and lipid quality indicators in illegally sourced fish and bivalves. Food Control 134:108735. doi: 10.1016/j.foodcont.2021.108735 [DOI] [Google Scholar]
  • 11. Gopi K, Mazumder D, Sammut J, Saintilan N. 2019. Determining the provenance and authenticity of seafood: a review of current methodologies. Trends Food Sci Technol 91:294–304. doi: 10.1016/j.tifs.2019.07.010 [DOI] [Google Scholar]
  • 12. Del Rio-Lavín A, Monchy S, Jiménez E, Pardo MÁ. 2023. Gut microbiota fingerprinting as a potential tool for tracing the geographical origin of farmed mussels (Mytilus galloprovincialis). PLoS One 18:e0290776. doi: 10.1371/journal.pone.0290776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Gao Y-M, Zou K-S, Zhou L, Huang X-D, Li Y-Y, Gao X-Y, Chen X, Zhang X-Y. 2020. Deep insights into gut microbiota in four carnivorous coral reef fishes from the South China Sea. Microorganisms 8:426. doi: 10.3390/microorganisms8030426 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Kim PS, Shin NR, Lee JB, Kim MS, Whon TW, Hyun DW, Yun JH, Jung MJ, Kim JY, Bae JW. 2021. Host habitat is the major determinant of the gut microbiome of fish. Microbiome 9:166. doi: 10.1186/s40168-021-01113-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Luan Y, Li M, Zhou W, Yao Y, Yang Y, Zhang Z, Ringø E, Erik Olsen R, Liu Clarke J, Xie S, Mai K, Ran C, Zhou Z. 2023. The fish microbiota: research progress and potential applications. Engineering (Beijing) 29:137–146. doi: 10.1016/j.eng.2022.12.011 [DOI] [Google Scholar]
  • 16. Ho JKI, Puniamoorthy J, Srivathsan A, Meier R. 2020. MinION sequencing of seafood in Singapore reveals creatively labelled flatfishes, confused roe, pig DNA in squid balls, and phantom crustaceans. Food Control 112:107144. doi: 10.1016/j.foodcont.2020.107144 [DOI] [Google Scholar]
  • 17. Johri S, Solanki J, Cantu VA, Fellows SR, Edwards RA, Moreno I, Vyas A, Dinsdale EA. 2019. “Genome skimming” with the MinION hand-held sequencer identifies CITES-listed shark species in India’s exports market. Sci Rep 9:4476. doi: 10.1038/s41598-019-40940-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Menegon M, Cantaloni C, Rodriguez-Prieto A, Centomo C, Abdelfattah A, Rossato M, Bernardi M, Xumerle L, Loader S, Delledonne M. 2017. On site DNA barcoding by nanopore sequencing. PLoS One 12. doi: 10.1038/s41598-019-40940-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Kumar G, Kocour M. 2017. Applications of next-generation sequencing in fisheries research: a review. Fish Res 186:11–22. doi: 10.1016/j.fishres.2016.07.021 [DOI] [Google Scholar]
  • 20. MacKenzie M, Argyropoulos C. 2023. An introduction to nanopore sequencing: past, present, and future considerations. Micromachines (Basel) 14:459. doi: 10.3390/mi14020459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Franco CM, Ambrosio RL, Cepeda A, Anastasio A. 2021. Fish intended for human consumption: from DNA barcoding to a next-generation sequencing (NGS)-based approach. Curr Opin Food Sci 42:86–92. doi: 10.1016/j.cofs.2021.05.005 [DOI] [Google Scholar]
  • 22. Haynes E, Jimenez E, Pardo MA, Helyar SJ. 2019. The future of NGS (Next Generation Sequencing) analysis in testing food authenticity. Food Control 101:134–143. doi: 10.1016/j.foodcont.2019.02.010 [DOI] [Google Scholar]
  • 23. Maroso F, Gkagkavouzis K, De Innocentiis S, Hillen J, do Prado F, Karaiskou N, Taggart JB, Carr A, Nielsen E, Triantafyllidis A, Bargelloni L, Aquatrace Consortium . 2021. Genome-wide analysis clarifies the population genetic structure of wild gilthead sea bream (Sparus aurata). PLoS One 16:e0236230. doi: 10.1371/journal.pone.0236230 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. EUMOFA: European Commission, “European market observatory for fisheries and aquaculture products”. 2021. Available from: https://www.eumofa.eu/. Retrieved 20 Dec 2024.
  • 25. Instituto Nacional de Estatística - Estatísticas da Pesca: 2022. 2023. Lisboa: INE. Available from: https://www.ine.pt/xurl/pub/66322600 [Google Scholar]
  • 26. Jacquet JL, Pauly D. 2008. Trade secrets: renaming and mislabeling of seafood. Mar Policy 32:309–318. doi: 10.1016/j.marpol.2007.06.007 [DOI] [Google Scholar]
  • 27. Wessel P, Smith WHF. 1996. A global, self‐consistent, hierarchical, high‐resolution shoreline database. J Geophys Res 101:8741–8743. doi: 10.1029/96JB00104 [DOI] [Google Scholar]
  • 28. DeAngelis MM, Wang DG, Hawkins TL. 1995. Solid-phase reversible immobilization for the isolation of PCR products. Nucleic Acids Res 23:4742–4743. doi: 10.1093/nar/23.22.4742 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Stortchevoi A, Kamelamela N, Levine SS. 2020. SPRI beads-based size selection in the range of 2-10kb. J Biomol Tech 31:7–10. doi: 10.7171/jbt.20-3101-002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Schmieder R, Edwards R. 2011. Quality control and preprocessing of metagenomic datasets. Bioinformatics 27:863–864. doi: 10.1093/bioinformatics/btr026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Duarte B, Feijão E, Cruz-Silva A, Pascoal P, Nunes M, Pereira M, Figueiredo A, Dias RP, Tanner SE, Fonseca VF. 2025. Reveal your microbes, and i’ll reveal your origins: geographical traceability via Scomber colias intestinal tract metagenomics. Anim Microbiome 7:43. doi: 10.1186/s42523-025-00398-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. McMurdie PJ, Holmes S. 2013. Phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One 8:e61217. doi: 10.1371/journal.pone.0061217 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Kassambara A. 2023. Ggpubr: “ggplot2” Based Publication Ready Plots [Google Scholar]
  • 34. Foster ZSL, Sharpton TJ, Grünwald NJ. 2017. Metacoder: an R package for visualization and manipulation of community taxonomic diversity data. PLoS Comput Biol 13:e1005404. doi: 10.1371/journal.pcbi.1005404 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Candel A, Ledell E, Bartz A. n.d. Deep Learning with H2O
  • 36. Malohlava M, Candel A, Bartz A, Roark H, Parmar V. 2024. Gradient Boosting Machine with H2O: Seventh Edition Gradient Boosting Machine with H2O by Michal Malohlava & Arno Candel with assitance from Cliff Click [Google Scholar]
  • 37. De Cáceres M, Legendre P. 2009. Associations between species and groups of sites: indices and statistical inference. Ecology 90:3566–3574. doi: 10.1890/08-1823.1 [DOI] [PubMed] [Google Scholar]
  • 38. De Cáceres M, Legendre P, Moretti M. 2010. Improving indicator species analysis by combining groups of sites. Oikos 119:1674–1684. doi: 10.1111/j.1600-0706.2010.18334.x [DOI] [Google Scholar]
  • 39. Mendiburu F, Simon R, Mendiburu F. 2015. Agricolae-Ten years of an Open source Statistical tool for experiments in Breeding, agriculture and biology. Available from: 10.7287/peerj.preprints.1404v1 [DOI]
  • 40. Egerton S, Culloty S, Whooley J, Stanton C, Ross RP. 2018. The gut microbiota of marine fish. Front Microbiol 9:873. doi: 10.3389/fmicb.2018.00873 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Ghanbari M, Kneifel W, Domig KJ. 2015. A new view of the fish gut microbiome: advances from next-generation sequencing. Aquaculture 448:464–475. doi: 10.1016/j.aquaculture.2015.06.033 [DOI] [Google Scholar]
  • 42. Ofek T, Lalzar M, Laviad-Shitrit S, Izhaki I, Halpern M. 2021. Comparative study of intestinal microbiota composition of six edible fish species. Front Microbiol 12:760266. doi: 10.3389/fmicb.2021.760266 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Nerini M, Russo A, Decorosi F, Meriggi N, Viti C, Cavalieri D, Marvasi M. 2024. A microbial phenomics approach to determine metabolic signatures to enhance seabream Sparus aurata traceability, differentiating between wild-caught and farmed. Foods 13:2726. doi: 10.3390/foods13172726 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Meriggi N, Russo A, Renzi S, Cerasuolo B, Nerini M, Ugolini A, Marvasi M, Cavalieri D. 2024. Enhancing seafood traceability: tracking the origin of seabass and seabream from the tuscan coast area by the analysis of the gill bacterial communities. Anim Microbiome 6. doi: 10.13140/2.1.1884.3526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Parlapani FF, Meziti A, Kormas KA, Boziaris IS. 2013. Indigenous and spoilage microbiota of farmed sea bream stored in ice identified by phenotypic and 16S rRNA gene analysis. Food Microbiol 33:85–89. doi: 10.1016/j.fm.2012.09.001 [DOI] [PubMed] [Google Scholar]
  • 46. Aires T, Serebryakova A, Viard F, Serrão EA, Engelen AH. 2018. Acidification increases abundances of Vibrionales and Planctomycetia associated to a seaweed-grazer system: potential consequences for disease and prey digestion efficiency. PeerJ 6:e4377. doi: 10.7717/peerj.4377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Baptista V, Silva PL, Relvas P, Teodósio MA, Leitão F. 2018. Sea surface temperature variability along the Portuguese coast since 1950. Int J Climatol 38:1145–1160. doi: 10.1002/joc.5231 [DOI] [Google Scholar]
  • 48. Domingues RB, Lima MJ. 2023. Unusual red tide of the dinoflagellate Lingulodinium polyedra during an upwelling event off the Algarve coast (SW Iberia). Regional Studies in Marine Science 63:102998. doi: 10.1016/j.rsma.2023.102998 [DOI] [Google Scholar]
  • 49. Fu FX, Tatters AO, Hutchins DA. 2012. Global change and the future of harmful algal blooms in the ocean. Mar Ecol Prog Ser 470:207–233. doi: 10.3354/meps10047 [DOI] [Google Scholar]
  • 50. Hyun B, Kim JM, Jang PG, Jang MC, Choi KH, Lee K, Yang EJ, Noh JH, Shin K. 2020. The effects of ocean acidification and warming on growth of a natural community of coastal phytoplankton. JMSE 8:821. doi: 10.3390/jmse8100821 [DOI] [Google Scholar]
  • 51. Lima MJ, Relvas P, Barbosa AB. 2022. Variability patterns and phenology of harmful phytoplankton blooms off southern Portugal: looking for region-specific environmental drivers and predictors. Harmful Algae 116:102254. doi: 10.1016/j.hal.2022.102254 [DOI] [PubMed] [Google Scholar]
  • 52. Thode SK, Kahlke T, Robertsen EM, Hansen H, Haugen P. 2015. The immediate global responses of Aliivibrio salmonicida to iron limitations. BMC Microbiol 15. doi: 10.1186/s12866-015-0342-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Liu H, Guo X, Gooneratne R, Lai R, Zeng C, Zhan F, Wang W. 2016. The gut microbiome and degradation enzyme activity of wild freshwater fishes influenced by their trophic levels. Sci Rep 6. doi: 10.1038/srep24340 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Asnicar F, Thomas AM, Passerini A, Waldron L, Segata N. 2024. Machine learning for microbiologists. Nat Rev Microbiol 22:191–205. doi: 10.1038/s41579-023-00984-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Baudron AR, Brunel T, Blanchet MA, Hidalgo M, Chust G, Brown EJ, Kleisner KM, Millar C, MacKenzie BR, Nikolioudakis N, Fernandes JA, Fernandes PG. 2020. Changing fish distributions challenge the effective management of European fisheries. Ecography 43:494–505. doi: 10.1111/ecog.04864 [DOI] [Google Scholar]
  • 56. Martinsohn J. 2013. Using new analytical approaches to verify the origin of fish, p 189–215. In New analytical approaches for verifying the origin of food. Elsevier Ltd. [Google Scholar]
  • 57. Mhalhel K, Levanti M, Abbate F, Laurà R, Guerrera MC, Aragona M, Porcino C, Briglia M, Germanà A, Montalbano G. 2023. Review on gilthead seabream (Sparus aurata) aquaculture: life cycle, growth, aquaculture practices and challenges. JMSE 11:2008. doi: 10.3390/jmse11102008 [DOI] [Google Scholar]
  • 58. Duarte B, Figueiredo A, Ramalhosa P, Canning-Clode J, Caçador I, Fonseca VF. 2022. Unravelling the Portuguese coastal and transitional waters’ microbial resistome as a biomarker of differential anthropogenic impact. Toxics 10:613. doi: 10.3390/toxics10100613 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Fonseca VF, Duarte IA, Duarte B, Freitas A, Pouca ASV, Barbosa J, Gillanders BM, Reis-Santos P. 2021. Environmental risk assessment and bioaccumulation of pharmaceuticals in a large urbanized estuary. Sci Total Environ 783:147021. doi: 10.1016/j.scitotenv.2021.147021 [DOI] [PubMed] [Google Scholar]
  • 60. Galego Fernandes P, Carreira P, da Silva MO. 2008. Anthropogenic sources of contamination recognition — Sines coastal aquifer (SW Portugal). J Geochem Explor 98:1–14. doi: 10.1016/j.gexplo.2007.10.003 [DOI] [Google Scholar]
  • 61. Marteleira R, Bicudo P, Ant& NA, oacute NA, Brito nio G, Coelho PS. 2018. Surfing safe waves: integration of water quality modelling in the evaluation of potential health risks. IJHST 8:105. doi: 10.1504/IJHST.2018.090892 [DOI] [Google Scholar]
  • 62. Mosqueira M, Pombo A, Borges C, Brito AC, Zacarias N, Esteves R, Palma C. 2022. Potential for coastal and offshore aquaculture in Portugal: insights from physico-chemical and oceanographic conditions. Appl Sci (Basel) 12:2742. doi: 10.3390/app12052742 [DOI] [Google Scholar]
  • 63. Jorge JP, Oliveira V, Oliveira F, Lima Santos L, Viana AS, Malheiros C. 2022. The planning system in Portugal Contemporary challenges of spatial planning in tourism destinations. Wydawnictwo Uniwersytetu Łódzkiego. 10.18778/8331-149-4.01. [DOI] [Google Scholar]
  • 64. Lepe JA, Martínez-Martínez L. 2022. Resistance mechanisms in Gram-negative bacteria. Medicina Intensiva (English Edition) 46:392–402. doi: 10.1016/j.medine.2022.05.004 [DOI] [PubMed] [Google Scholar]
  • 65. Zhumakayev AR, Varga M, Vörös M, Kocsubé S, Ramteke PW, Szekeres A, Vágvölgyi C, Hatvani L, Marik T. 2022. Characterization of the antagonistic potential of the glyphosate-tolerant Pseudomonas resinovorans SZMC 25872 strain against the plant pathogenic bacterium Agrobacterium tumefaciens. Front Plant Sci 13:1034237. doi: 10.3389/fpls.2022.1034237 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Lim J, Hong S. 2020. Characterization of Aeromonas salmonicida and A. sobria isolated from cultured salmonid fish in Korea and development of a vaccine against furunculosis. J Fish Dis 43:609–620. doi: 10.1111/jfd.13158 [DOI] [PubMed] [Google Scholar]
  • 67. Tristram S, Jacobs MR, Appelbaum PC. 2007. Antimicrobial resistance in Haemophilus influenzae. Clin Microbiol Rev 20:368–389. doi: 10.1128/CMR.00040-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Bruinsma N, Hutchinson JM, van den Bogaard AE, Giamarellou H, Degener J, Stobberingh EE. 2003. Influence of population density on antibiotic resistance. J Antimicrob Chemother 51:385–390. doi: 10.1093/jac/dkg072 [DOI] [PubMed] [Google Scholar]
  • 69. Nzila A, Thukair A, Sankara S, Abdur Razzak S. 2017. Characterization of aerobic oil and grease-degrading bacteria in wastewater. Environ Technol 38:661–670. doi: 10.1080/09593330.2016.1207712 [DOI] [PubMed] [Google Scholar]
  • 70. Prata JC, da Costa JP, Lopes I, Duarte AC, Rocha-Santos T. 2020. Environmental status of (micro)plastics contamination in Portugal. Ecotoxicol Environ Saf 200:110753. doi: 10.1016/j.ecoenv.2020.110753 [DOI] [PubMed] [Google Scholar]
  • 71. Silva AM, Vieira H, Martins N, Granja ATS, Vale MJ, Vale FF. 2010. Viral and bacterial contamination in recreational waters: a case study in the Lisbon bay area. J Appl Microbiol 108:1023–1031. doi: 10.1111/j.1365-2672.2009.04503.x [DOI] [PubMed] [Google Scholar]
  • 72. Zhang C, Chen H, Hüttel S, Hu S, Zhang W, Ding X, Yin J, Yin Y, Müller R, Xia L, Zhang Y, Tu Q. 2022. A novel tumor-targeting strain of Xenorhabdus stockiae exhibits potent biological activities. Front Bioeng Biotechnol 10. doi: 10.3389/fbioe.2022.984197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Waghmode S, Suryavanshi M, Sharma D, Satpute SK. 2020. Planococcus species – an imminent resource to explore biosurfactant and bioactive metabolites for industrial applications. Front Bioeng Biotechnol 8:996. doi: 10.3389/fbioe.2020.00996 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Santos AMP, Salvador R, Dias JCQ, Soares CG. 2018. Assessment of port economic impacts on regional economy with a case study on the Port of Lisbon. Marit Policy Manag 45:684–698. doi: 10.1080/03088839.2018.1471536 [DOI] [Google Scholar]
  • 75. Azevedo S, Ferreira JJM. 2008. Competitiveness of the port of Sines: the RBV contribution. SSRN Journal. doi: 10.2139/ssrn.1086394 [DOI] [Google Scholar]
  • 76. Gomes DCF. 2018. DINA CLARA FIGUEIRA GOMES AS MINAS DE ALJUSTREL Território subterrâneo, paisagem a descobrir. CIMA-Centro Interpretativo da Mina de Algares. [Google Scholar]
  • 77. Chun SJ, Kim YJ, Cui Y, Nam KH. 2021. Ecological network analysis reveals distinctive microbial modules associated with heavy metal contamination of abandoned mine soils in Korea. Environ Pollut 289:117851. doi: 10.1016/j.envpol.2021.117851 [DOI] [PubMed] [Google Scholar]
  • 78. Ubalde MC, Braña V, Sueiro F, Morel MA, Martínez-Rosales C, Marquez C, Castro-Sowinski S. 2012. The versatility of Delftia sp. isolates as tools for bioremediation and biofertilization technologies. Curr Microbiol 64:597–603. doi: 10.1007/s00284-012-0108-5 [DOI] [PubMed] [Google Scholar]
  • 79. Monferrán MV, Echenique JR, Wunderlin DA. 2005. Degradation of chlorobenzenes by a strain of Acidovorax avenae isolated from a polluted aquifer. Chemosphere 61:98–106. doi: 10.1016/j.chemosphere.2005.03.003 [DOI] [PubMed] [Google Scholar]
  • 80. Padmanabhan J, Parthasarathi R, Subramanian V, Chattaraj PK. 2005. Molecular structure, reactivity, and toxicity of the complete series of chlorinated benzenes. J Phys Chem A 109:11043–11049. doi: 10.1021/jp0538621 [DOI] [PubMed] [Google Scholar]
  • 81. Wang M-J, Jones KC. 1994. Behaviour and fate of chlorobenzenes (CBs) introduced into soil–plant systems by sewage sludge application: a review. Chemosphere 28:1325–1360. doi: 10.1016/0045-6535(94)90077-9 [DOI] [Google Scholar]
  • 82. Niepceron M, Martin-Laurent F, Crampon M, Portet-Koltalo F, Akpa-Vinceslas M, Legras M, Bru D, Bureau F, Bodilis J. 2013. GammaProteobacteria as a potential bioindicator of a multiple contamination by polycyclic aromatic hydrocarbons (PAHs) in agricultural soils. Environ Pollut 180:199–205. doi: 10.1016/j.envpol.2013.05.040 [DOI] [PubMed] [Google Scholar]
  • 83. Zhang Q, Zhang Z, Lu T, Yu Y, Penuelas J, Zhu YG, Qian H. 2021. Gammaproteobacteria, a core taxon in the guts of soil fauna, are potential responders to environmental concentrations of soil pollutants. Microbiome 9:196. doi: 10.1186/s40168-021-01150-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Pei QH, Shahir S, Santhana Raj AS, Zakaria ZA, Ahmad WA. 2009. Chromium(VI) resistance and removal by Acinetobacter haemolyticus. World J Microbiol Biotechnol 25:1085–1093. doi: 10.1007/s11274-009-9989-2 [DOI] [Google Scholar]
  • 85. Timková I, Maliničová L, Nosáľová L, Kolesárová M, Lorková Z, Petrová N, Pristaš P, Kisková J. 2024. Genomic insights into the adaptation of Acinetobacter johnsonii RB2-047 to the heavy metal-contaminated subsurface mine environment. Biometals 37:371–387. doi: 10.1007/s10534-023-00555-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Zakaria ZA, Zakaria Z, Surif S, Ahmad WA. 2007. Hexavalent chromium reduction by Acinetobacter haemolyticus isolated from heavy-metal contaminated wastewater. J Hazard Mater 146:30–38. doi: 10.1016/j.jhazmat.2006.11.052 [DOI] [PubMed] [Google Scholar]
  • 87. Kennedy C, Rudnick P. 2015. Azomonas, p 1–10. In Bergey’s manual of systematics of archaea and bacteria. Wiley. [Google Scholar]
  • 88. Abarikwu SO, Ezim OE, Ikeji CN, Farombi EO. 2023. Atrazine: cytotoxicity, oxidative stress, apoptosis, testicular effects and chemopreventive Interventions. Front Toxicol 5:1246708. doi: 10.3389/ftox.2023.1246708 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Billet L, Devers-Lamrani M, Serre R-F, Julia E, Vandecasteele C, Rouard N, Martin-Laurent F, Spor A. 2021. Complete genome sequences of four atrazine-degrading bacterial strains, Pseudomonas sp. strain ADPe, Arthrobacter sp. strain TES, Variovorax sp. strain 38R, and Chelatobacter sp. strain SR38. Microbiol Resour Announc 10. doi: 10.1128/MRA.01080-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Rostami S, Jafari S, Moeini Z, Jaskulak M, Keshtgar L, Badeenezhad A, Azhdarpoor A, Rostami M, Zorena K, Dehghani M. 2021. Current methods and technologies for degradation of atrazine in contaminated soil and water: a review. Environ Technol Innov 24:102019. doi: 10.1016/j.eti.2021.102019 [DOI] [Google Scholar]
  • 91. Allan‐Perkins E, Manter DK, Jung G. 2019. Soil microbial communities on roughs, fairways, and putting greens of cool-season golf courses. Crop Sci 59:1753–1767. doi: 10.2135/cropsci2018.04.0220 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental material. aem.01508-25-s0001.docx.

Table 1, S. aurata morphological parameters; Fig. S1, indicator species results.

aem.01508-25-s0001.docx (449.5KB, docx)
DOI: 10.1128/aem.01508-25.SuF1

Data Availability Statement

Raw data were submitted to the National Center for Biotechnology Information (NCBI) under the Sequence Read Archive (SRA) accession no. PRJNA1330551.


Articles from Applied and Environmental Microbiology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES