Abstract
Fish allergy is a significant health concern, with diagnosis and management complicated by diverse fish species and allergens. We conducted a comprehensive RNA-seq analysis of eight fish species to identify allergen profiles, integrating ImmunoCAP sIgE data to explore associations with allergen expression and diagnostic performance. Over 30 putative fish allergens were identified, with varying sequence similarities and expression levels, roughly classifying fish into two groups based on parvalbumin (PV) expression. Higher similarities in allergen expression correlated with stronger sIgE data relationships among fish extracts. High PV expression and conserved PV sequences were linked to elevated sIgE measurements, potentially indicating higher allergenicity. For diagnosis, species-specific extract sIgE remained the best indicator of corresponding fish allergy diagnosis, while incorporating multiple sIgE data enhanced performance. In component-resolved diagnosis (CRD), the current panel with PV alone showed comparable performance to fish extract for PV-high fish allergy, while PV-low fish may require the inclusion of more minor allergens for improved CRD accuracy. This RNA-seq allergen analysis helps reveal fish allergen profiles, classify fish groups, and predict allergenicity, potentially improving CRD design and food management in fish allergy.
Keywords: fish allergy, RNA-seq, allergen discovery, parvalbumin, ImmunoCAP sIgE, component-resolved diagnosis, fish allergenicity ladder
1. Introduction
Fish, with over 30,000 species, are crucial in modern nutrition but are also among the “Big Nine” allergenic foods, alongside milk, eggs, crustacean shellfish, tree nuts, peanuts, wheat, soybeans, and sesame [1,2]. The reported prevalence of fish allergy varies between 0.2% and 7%, influenced by different consumption habits [3,4,5]. Clinically, fish allergy is a major cause of food-induced anaphylaxis and is less likely to be outgrown than milk or egg allergies [6,7]. Because of the significant cross-reactivity among fish, patients are often advised to avoid all fish, potentially leading to nutritional deficiencies.
Whole fish extracts are commonly used in skin prick tests (SPTs) and ImmunoCAP specific IgE (sIgE) assays [8]. Commercial extracts are currently available for over 20 fish species. Multiple fish extracts are often employed to evaluate whether the subject is allergic to multiple fish [9]. However, it is unclear if these tests, based on representative fish species, effectively cover potential cross-reactivity with other untested fish. Better understanding of fish allergenicity differences could improve allergy management and dietary strategies.
Component-resolved diagnosis (CRD) at the molecular or epitope level has gained attention for precision allergy diagnosis [10]. Fish parvalbumins (PVs) are the major fish allergens with reported sensitization rates of around 90% in different patient cohorts [11,12]. Currently, two recombinant PVs from cod and common carp (rGad c 1 and rCyp c 1) are commercially available. However, it remains uncertain whether they can represent other fish PVs or act as suitable alternatives to fish extracts for different fish allergies. The way towards CRD requires comprehensive understanding of fish allergens at the sequence level.
We hypothesized that fish allergen profiling may reveal different fish allergenicity for better food management and provide novel insights for CRD design. Traditional wet-lab methods involve the preparation of whole protein extracts, immunoblotting with patient serum, and identifying signal-positive bands via mass spectrometry, which require significant time and resource consumption for large-scale studies, given the vast diversity of consumable fish around the world [13,14].
Next-generation sequencing (NGS) methods, particularly RNA sequencing (RNA-seq), show promise in allergen discovery and profiling. In 2020, a study using RNA-seq for five shrimp species identified all seven known crustacean allergens and over 30 potential novel allergens [15]. This approach involves the de novo assembly of transcripts and alignment with known allergens, predicting their allergenicity based on sequence similarity. Compared to classical protein-based methods, the RNA-seq method can reveal comprehensive allergen expression profiles without extraction bias in a more cost-effective way for parallel studies.
In this study, we used RNA-seq to characterize allergen expression profiles of eight widely consumed fish species. Combining clinical ImmunoCAP sIgE data, we explored the association between fish allergen profiles and serological sIgE sensitization patterns, offering insights into rational CRD design and food management for fish allergy.
2. Results
2.1. Overview of Potential Allergen Transcripts Identified by Fish RNA-Seq Analysis
RNA-seq data of eight fish species (yellowfin tuna, salmon, halibut, cod, grouper, grass carp, catfish, and tilapia) were obtained from public datasets [16,17,18,19], each comprising 2–3 samples of fish muscles or whole bodies [Table S1]. Fish transcripts were de novo assembled and aligned against known allergens in the AllergenOnline database [20], following a pipeline similar to that of the previous shrimp allergy study [15]. On average, around 40 potential allergen transcripts were identified per fish species [Figure 1A], mapped to 34 known allergens at the molecular level [Figure 1B]. Cyclophilin had the highest number of transcripts identified, followed by PV, porin, and ferritin. These transcripts exhibited varying sequence identities to their target allergens [Figure 1C]. Cyclophilin transcripts, despite being numerous, had less than 70% identity to their target allergens, indicating lower probabilities of being truly allergenic. Generally, proteins with more than 70% identity to known allergens were considered medium to high probability allergens [20,21,22], including PV, heat shock protein 70 (Hsp70), aldolase, glyceraldehyde-3-phosphate dehydrogenase (GAPDH), enolase, L-lactate dehydrogenase (LDH), creatine kinase, triosephosphate isomerase (TPI), tropomyosin, pyruvate kinase (PK), glucose-6-phosphate isomerase (GPI), alpha-actinin, tubulin, and glycogen phosphorylase-like protein (PG). These putative allergens offered possibilities for CRD design and better characterization of allergen sensitization at the molecular level. In total, approximately 35% of known target allergens were originally identified in fish, with others from distant sources such as fungi (22%) and mites (9%), indicating successful fish allergen identification and the possibility of exploring cross-reactivity between different allergen sources [Figure 1D].
To assess the comprehensiveness of transcriptomic allergen profiles, we compared the putative salmon allergen transcripts with the known salmon allergens in the database [Table S2]. Six of the seven recorded salmon allergens were successfully identified; the exception was the Sal s 6 collagen [23], likely because its repetitive GXY sequences confuse de novo assembly algorithms [24]. Despite the loss of collagens in all fish, an additional 14–28 potential allergens were identified, still revealing informative allergen profiles for addressing allergenic differences.
2.2. Overall Fish Allergen Profiles and Top Expressed Allergens Distinguish Fish into Two Categories
The expression levels of these potential allergens were measured by Transcripts Per Million (TPM) to calculate their relative abundances. The six most highly expressed allergens accounted for over 70% of total allergen expression in all fish [Figure 2A]. They were PV, GAPDH, aldolase, enolase, creatine kinase, and tropomyosin, all of which had been previously reported as major or minor fish allergens [25,26,27]. Pearson correlation analysis of allergen expression profiles clustered these fish samples into two distinct groups [Figure 2B], consistent with their different PV expression levels [Figure 2A]. Group 1 (tuna, salmon, halibut) had lower PV expression (“PV-low”) and higher expression of other minor allergens, while Group 2 (cod, grouper, grass carp, tilapia, catfish) had dominant PV expression (“PV-high”). This classification of fish by allergen profile revealed the molecular basis of different fish allergenicity and offered insights on fish avoidance that complement the traditional classifications based on fish source (sea or fresh water) and meat color (white or red muscle) [28,29].
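The two quantities used in this subsection, relative abundance from TPM and the Pearson correlation between two expression profiles, can be sketched as follows. This is a minimal illustration in Python rather than the R code used in the study; the function names and the dictionary layout are hypothetical, and whether the study correlated raw or transformed TPM values is not stated, so raw values are assumed here.

```python
import math

def relative_abundance(tpm_by_allergen):
    """Convert allergen TPM values (hypothetical dict: allergen -> TPM)
    into fractions of the total allergen expression of one sample."""
    total = sum(tpm_by_allergen.values())
    if total <= 0:
        raise ValueError("total allergen TPM must be positive")
    return {name: tpm / total for name, tpm in tpm_by_allergen.items()}

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (>= 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("a constant profile has no defined correlation")
    return cov / (sd_x * sd_y)
```

Applying `pearson` to every pair of samples, with allergens in a fixed order, gives the kind of correlation matrix that is clustered in Figure 2B.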
We further explored the isoform expression profiles of these top expressed allergens. For instance, cod expressed up to nine PV isoforms, yet the most highly expressed isoform accounted for 50–90% of the total PV expression [Figure 2C]. For other allergens, the top isoform exhibited a similarly dominant expression [Figure S1]. Focusing on these top expressed isoforms may simplify the design of CRD.
Additionally, many identified allergens were clustered together as the “other” group due to their low expression [Figure 1A], indicating a lower likelihood of these allergens triggering allergic reactions. Combining sequence similarity and expression level, we manually selected as potential “true” fish allergens those transcripts that had more than 70% sequence identity to known allergens and whose TPM accounted for more than 1% of total allergen expression in at least three fish samples. Beyond the six highly expressed allergens, this strategy additionally identified LDH, TPI, PK, PG, GPI, and alpha-actinin as minor fish allergens [Figure 2D]. Notably, PG, which was recently identified as a shrimp allergen [30], may contribute to fish–shrimp cross-reactivity [31].
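The selection rule described above (identity above 70%, more than 1% of total allergen expression, at least three samples) can be expressed as a short filter. The sketch below is in Python and is only one possible reading of the rule: the record layout is hypothetical, and it assumes that TPM values of qualifying transcripts are summed per allergen within each sample before the 1% threshold is applied, a detail the text does not specify.

```python
from collections import defaultdict

def select_candidate_allergens(records, min_identity=70.0,
                               min_share=0.01, min_samples=3):
    """records: iterable of dicts with hypothetical keys
    'sample', 'allergen', 'identity' (percent) and 'tpm'."""
    total_tpm = defaultdict(float)     # sample -> total allergen TPM
    allergen_tpm = defaultdict(float)  # (sample, allergen) -> TPM above identity cutoff
    for rec in records:
        total_tpm[rec["sample"]] += rec["tpm"]
        if rec["identity"] > min_identity:
            allergen_tpm[(rec["sample"], rec["allergen"])] += rec["tpm"]

    # Collect, for each allergen, the samples in which it exceeds the share cutoff.
    passing_samples = defaultdict(set)
    for (sample, allergen), tpm in allergen_tpm.items():
        if total_tpm[sample] > 0 and tpm / total_tpm[sample] > min_share:
            passing_samples[allergen].add(sample)

    return sorted(a for a, samples in passing_samples.items()
                  if len(samples) >= min_samples)
```

With these defaults, an allergen represented only by transcripts below 70% identity, such as cyclophilin in this dataset, is never selected regardless of its expression.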
2.3. Fish Allergen Profiles Improve Interpretation of sIgE Sensitization Patterns
Routine ImmunoCAP sIgE assays often use multiple fish extracts to address potential cross-reactivity. Our previous study [32] reported a Hong Kong fish allergy cohort tested with nine fish extracts: tuna (f40), halibut (f303), salmon (f41), cod (f3), grouper (f410), herring (f205), catfish (f369), tilapia (f414), and grass carp, along with two PV recombinants: rGad c 1 (f426) and rCyp c 1 (f355). Here, we included sIgE data of 188 subjects who were serologically sensitized (sIgE > 0.35 kUA/L) to at least one fish extract or PV [Figure 3A and Table S3]. We explored whether the fish allergen profile revealed by transcriptomic analysis could help interpret the sIgE sensitization patterns observed.
First, the sIgE levels of tuna, salmon, and halibut extracts were significantly lower than those of grass carp, catfish, and tilapia [Figure 3A]. This serological sIgE gradient was typically attributed to the variable PV content in previous wet-lab studies [33,34], which was also in line with our transcriptomic-based fish classification.
Second, the correlation heatmap of fish sIgE data [Figure 3B] appeared to mimic the correlation figure of allergen expression [Figure 2B], where tuna, salmon, and halibut were clustered together. This observation indicated that paired fish with higher correlations of allergen expression might lead to higher sIgE correlations. Indeed, paired fish within the same fish group (e.g., within Group 1 fish: tuna, salmon, halibut) had significantly higher sIgE correlation coefficients than those of paired fish from different fish groups (e.g., Group 1 fish tuna and Group 2 fish cod) (p = 0.00056) [Figure 3C]. The overall association between allergen expression correlation and sIgE correlation was significant (p = 0.02), but retained a relatively low coefficient (Pearson r = 0.437) [Figure S2]. Given the variability and potential bias due to fish protein extracts [35], this kind of moderate correlation was considered to be acceptable. The observed differences in fish allergen profiles may help to predict variations in sIgE measurements and reveal distinct fish allergenicity.
Third, the sIgE levels of rCyp c 1 were higher than those of rGad c 1 (p < 0.0001) [Figure 3A,D], indicating different binding affinities or sensitization abilities due to differences in PV sequences. A phylogenetic tree of PVs showed that Cyp c 1 is closer to other fish PVs than Gad c 1 [Figure S3A]. Cyp c 1 also had higher pairwise sequence similarities to fish PVs (p = 0.0038) [Figure 3E and Table S4]. The higher sIgE levels of rCyp c 1 may therefore result from its higher sequence conservation.
To generalize this insight, we constructed the consensus sequence of all PVs in the allergen database [Figure S3B], enabling the calculation of conservation scores for Cyp c 1, Gad c 1, and other PVs to quantify the sequence differences among PVs. The results showed that Cyp c 1 had a higher conservation score than Gad c 1 (0.75 vs. 0.62), while chicken (Gal d 8) and frog PVs (Ran e 1, Ran e 2) had nearly the lowest scores (0.52–0.63) [Figure 3F], in line with the fact that only a few cross-reactive allergy cases have been reported among frog, chicken, and fish [36,37]. The top expressed PV isoforms of grass carp, tilapia, and catfish, which have higher conservation scores, may elicit higher sIgE levels. This prediction matches our previous finding of higher IgE reactivity to grass carp PV (Cten i 1) compared to cod (Gad m 1) and salmon (Sal s 1) [38]. Of note, although tuna PV has a relatively high conservation score (0.73), the expression of tuna PV accounted for less than 1% of total allergen expression [Figure 2A]. Given the low sIgE measurement of tuna extract, both PV expression and PV sequence difference should be taken into consideration for fish allergenicity evaluation.
Overall, the transcriptomic allergen analysis provided comprehensive information about fish allergen expression and allergen sequences for cross-reactivity discussion, advancing the interpretation of clinical sIgE results. Fish with low PV expression and low PV sequence conservation scores, such as salmon and halibut, tended to yield a lower sIgE measurement and may be less allergenic for the fish allergy patients in our cohort, suggesting the potential application of RNA-seq analysis for fish allergenicity prediction and tailored food management.
2.4. Diagnosis Performance of sIgE Data Indicated Rational CRD Designs for Different Fish Allergy
Transcriptome analysis revealed certain similarities of allergen expression among different fish. We further explored whether the sIgE data of one representative fish extract was applicable for the allergy diagnosis of other fish, and checked if multiple fish extracts and PVs together could improve sIgE diagnosis performance.
In our fish allergy cohort, 78 subjects underwent a grass carp (GC) or salmon oral food challenge (OFC). Among them, 56 of 74 subjects (75.7%) were confirmed as having a GC allergy, and 19 of 72 (26.4%) were confirmed as having a salmon allergy [Figure 4A]. The majority of salmon-allergic patients (16/19) were also allergic to GC, while most GC-allergic patients (35/56) were allergic to GC but not to salmon. For the corresponding fish sIgE, allergic patients had significantly higher levels than tolerant subjects (p < 0.05) [Figure 4B and Table S5].
This OFC-confirmed cohort was selected for Receiver Operating Characteristic (ROC) analysis to evaluate the diagnostic performance of different fish sIgE data by Area Under Curve (AUC) comparison. The results showed that salmon sIgE achieved an AUC of 0.731 for salmon allergy diagnosis, and GC sIgE achieved an AUC of 0.788 for GC allergy diagnosis [Figure 4C]. No other fish or PV sIgE could exhibit a better AUC [Figure 4D].
For GC allergy diagnosis, the sIgE data of two PVs demonstrated similar diagnostic performance (AUC~0.76–0.77) to the GC fish extract. However, for salmon allergy, neither rGad c 1 nor rCyp c 1 sIgE reached comparable performance (AUC < 0.65) to the salmon fish extract. PVs may work as potential alternatives to fish extracts or as CRD candidates for GC and other “PV-high” fish, but not for salmon or “PV-low” fish.
We then applied multi-factor logistic regression models to include all types of sIgE data to explore their best combinations for allergy diagnosis. The results showed that incorporating multiple sIgE data improved diagnostic performance for both salmon and GC allergy, although a maximum AUC ceiling existed even when all sIgE data were included [Figure 4E,F]. For salmon allergy, a combination of other fish sIgE data, excluding salmon data, did not surpass the AUC of salmon sIgE alone [Figure 4E]. For GC allergy diagnosis, using other fish sIgE data achieved an equal or slightly higher AUC than GC sIgE alone [Figure 4F]. These results suggested that sIgE assays based on specific fish extracts remained the best choice for corresponding fish allergy diagnosis. However, sIgE data based on one “representative fish extract” may not be reliable for predictive diagnosis to another fish, especially for “PV-low” fish. Thus, the strategy of CRD, selecting “representative allergens”, seems to be a more promising and complete solution for universal fish allergy diagnosis and prediction.
3. Discussion
This study utilized a transcriptomic approach to profile allergen repertoires across eight fish species commonly implicated in fish allergies. It identified and quantified 34 putative allergens, highlighting six highly expressed allergens (PV, GAPDH, aldolase, enolase, creatine kinase, and tropomyosin) and six allergens with relatively low expression (LDH, TPI, PK, PG, GPI, and alpha-actinin).
These allergens have demonstrated varying allergenic potencies. Serum IgE binding to PVs has been widely reported and characterized in many different fish allergies [39], alongside more clinically relevant support such as SPT and basophil activation tests (BAT) [34], while co-sensitization to collagen, GAPDH, aldolase, enolase, creatine kinase, tropomyosin, LDH, TPI, PK, GPI, or alpha-actinin has been only partially described in specific fish allergy studies [25,28,40,41]. Previous studies have reported enolases and aldolases as important fish allergens in cod, salmon, and tuna allergy patients [26]. The 27.4% of patients who had no sIgE to PVs but did have sIgE to enolases and aldolases did not show milder reactions than PV-sensitized patients and still experienced mild to severe asthmatic symptoms. In addition, PV and tropomyosin are more heat-stable than other allergens such as GAPDH, aldolase, enolase, and creatine kinase [42,43]. Molecular allergenicity can therefore vary with cooking and consumption scenarios [25,43]. Although PVs are currently regarded as the dominant fish allergen components, the allergenic potency of other minor allergens should not be ignored and requires further clarification with more clinical evidence in cohorts with diverse consumption habits.
Except for the loss of collagens, our RNA-seq study offered the first comprehensive allergen profile for multiple fish species under a parallel and comparable framework, providing hints for future validation. Together with feasible access to full-length sequences of these allergens, this high-resolution allergen map provided unlimited possibilities to explore different fish allergenicities in multiple dimensions. The RNA-seq-derived allergen repertoire can serve as a valuable complement to current protein allergen databases established through wet-lab studies. An interesting observation regarding dominantly expressed isoforms helps simplify the candidate selection process for CRD design. For instance, the top expressed isoforms that also share close evolutionary distances with each other may more likely function as pan-allergens across multiple fish species and be useful in clinical practice. Given the transferable features of NGS technology, these results reinforced the potential of the RNA-seq method for allergen mapping among cross-reactive food sources, fish and shellfish for instance, which may share allergens such as tropomyosin, TPI, and PG, etc. [15,44].
Notably, previous studies have shown that fish muscles in different colors (white or red) and different body parts (dorsal, ventral, rostral, or caudal) have variable content of PV proteins [45]. Our current analysis mainly relied on publicly available RNA-seq data of fish muscle or whole-body samples. More comprehensive comparisons of different fish tissues at various fish developmental stages may help reveal more dynamic allergen profiles and potentially identify clinically relevant allergen isoforms in each fish.
Fish can be roughly classified into two groups based on their allergen expression patterns: “PV-high” and “PV-low”. Fish with high PV expression, such as grass carp, tilapia, and catfish, exhibited higher ImmunoCAP sIgE levels in our allergy cohort, in contrast to low PV expression fish, including tuna, salmon, and halibut. A higher correlation of allergen expression was associated with a higher correlation in sIgE values, although the predictive power was moderate. These integrative observations about the allergen profile and sIgE measurement were in line with previous discussion about the “fish allergenicity ladder” [32]. The RNA-seq method potentially provided a cost-effective solution for large-scale fish classification and food management for allergy patients.
As an attempt at fish allergenicity evaluation, we proposed PV consensus sequence and conservation scores to quantify a PV sequence difference. This approach suggested that fish PVs with higher conservation scores might lead to higher sIgE measurements, indicating higher allergenicity or cross-reactivity. Together with the PV expression levels measured by TPMs, these numerical methods may help predict and rank the different fish allergenicities in the future. Of course, the reliability and limitations of proposed PV conservation scores require further tests and clarification. Given the complexity of real-world labeling and processing of fish products, allergic patients should still be reminded to avoid all types of fish in daily life to prevent risky allergic reactions following accidental exposure to their allergic fish. The insight regarding differences in fish allergenicity may benefit better diagnosis and oral immunotherapy for fish allergy in clinical practice.
We also explored whether the sIgE data of one specific fish extract could serve as an equivalent or alternative for diagnosing other fish allergies. Analysis showed that the sIgE data of salmon or GC extract remained the best indicator for diagnosing the corresponding fish allergy; however, their predictive power for other fish allergies may be very limited. Moving towards CRD, PV recombinants displayed comparable performance to GC extract for GC allergy diagnosis but performed much worse in salmon allergy diagnosis. This suggests that a rational CRD design may need to include other minor allergens for salmon and “PV-low” fish allergy diagnosis. For instance, selectively incorporating the most highly expressed allergens in salmon, tuna, and halibut, such as aldolase, enolase, and GAPDH, could expand the current CRD panel beyond the two PVs alone. This approach would provide more clinically relevant results that help to clarify the allergenic potential of non-PV allergens as well.
It is important to note that the results regarding sIgE sensitization and OFC diagnosis may only apply to our allergy cohort. Cohorts in other regions with distinct fish consumption habits may exhibit different allergen sensitization patterns [46]. In our cohort, subjects reported the highest frequency of salmon consumption followed by grass carp and grouper, with catfish and tilapia being the least consumed [32]. There is a possibility that individuals initially sensitized to grass carp may have lower reactivity to salmon allergens. The connection between fish consumption habits and differences in intrinsic fish allergenicity remains an open question for further exploration. Moreover, although commercial fish extracts are widely applied for sIgE assays, OFC tests in different cohorts may involve regionally dominant fish species under varying cooking methods [47,48]. For instance, grass carp may be less consumed in European regions, while salmon is more popular and often served raw, potentially exposing consumers to more heat-labile allergens. All these differences should be considered when interpreting sIgE data and OFC results together.
This study has certain limitations and weaknesses related to the use of RNA-seq for allergen discovery. As a sequence alignment method relying on known allergen databases, it is not capable of identifying entirely novel allergens that do not exist in the database or allergens sharing structural epitopes. It may also have limitations in dealing with complex allergen sequences, such as collagen with numerous GXY repeats, due to the short-read nature of current NGS methods. This issue might be better addressed with third-generation long-read sequencing technologies. Compared to traditional wet-lab methods, which may have limitations such as potential bias in protein extraction due to different buffers, relatively low sensitivity due to limited patient serum involvement, and the inability to provide full-length identification with mass spectrometry, the RNA-seq method can serve as a complementary approach. It provides an unbiased allergen profile in large-scale studies, offering full-length allergen sequences and relative abundance at the RNA level, which should benefit the future exploration of possible cross-reactivity among different food sources. Analysis of the sequence details of food and human homologs also offers the opportunity to explore fundamental questions about the molecular basis of allergens with respect to their specificity over other non-allergen molecules. It is worth noting that the association between RNA stability and protein abundance was not discussed in this study. Given the potential variability in protein abundance regarding protein extracts or real-world cooking, the initial state revealed by RNA expression levels may serve as a good reference starting point to explore these effects in various real-world consumption scenarios. 
Despite these limitations, the RNA-seq approach offers valuable insights into the allergen profiles of various fish species, providing a foundation for further investigations into fish allergenicity and the development of more precise diagnostic tools.
4. Materials and Methods
4.1. Collection of Fish RNA-Seq and ImmunoCAP sIgE Data
The RNA-seq allergen study method offers significant advantages for large-scale profiling. The rapid development of NGS and the numerous public NGS resources also offer broad possibilities to explore allergen differences across various species. In this study, we focused on fish species that are commonly consumed in Hong Kong, highly consumed worldwide, and widely studied in sIgE assays, to explore their differing allergenicity [49,50]. We obtained RNA-seq data from 8 fish species (yellowfin tuna, halibut, salmon, cod, grouper, catfish, tilapia, and grass carp) from public datasets [16,17,18,19]. For each fish, two or three biological replicates from muscle or whole fish tissue samples were included, labeled with the fish name followed by 1, 2, or 3 for further analysis. The RNA-seq data details are summarized in Table S1.
The fish allergy cohort was recruited between 2016 and 2023 from six hospitals in Hong Kong, namely the Prince of Wales Hospital (PWH), Queen Elizabeth Hospital (QEH), Queen Mary Hospital (QMH), Princess Margaret Hospital (PMH), Yan Chai Hospital (YCH), and United Christian Hospital (UCH), as described in our previous report [32]. Subjects with complete ImmunoCAP sIgE data were included, comprising nine fish whole extracts: tuna (f40), halibut (f303), salmon (f41), cod (f3), grouper (f410), herring (f205), catfish (f369), tilapia (f414), and grass carp (for research use only, developed by Thermo Fisher Scientific) [25], along with two recombinant parvalbumins, Gad c 1 (f426) and Cyp c 1 (f355). All sIgE measurements were conducted at the Paediatric Research Laboratory in PWH equipped with the Phadia 200 system. In total, 188 subjects sensitized to at least one fish extract or PV (sIgE > 0.35 kUA/L) were selected for serological sensitization analysis. This sIgE cutoff is widely used in clinical practice to confirm allergic sensitization and report negative results, derived from the initial detection limit of the first sIgE assay [51]. Among them, 74 subjects underwent a grass carp OFC, 72 subjects underwent a salmon OFC, and 68 subjects underwent OFCs for both grass carp and salmon [Figure 4A]. These subjects were included for evaluating the diagnostic performance of different fish sIgE data.
4.2. RNA-Seq Data Analysis and Allergen Identification
The RNA-seq data analysis and allergen identification adopted a pipeline similar to that of the previous study on shrimp allergen discovery [15]. Briefly, the initial quality assessment of raw RNA-seq data employed FastQC, followed by adapter trimming and low-quality read filtering using fastp [52]. Rcorrector was then used to correct random sequencing errors [53]. De novo assembly utilized the Trinity toolkit, and transcript expression was quantified as TPMs using Trinity scripts [54]. TransRate and BUSCO (against the arthropoda odb9 database) evaluated assembly quality [55,56].
A reference allergen database was constructed based on the AllergenOnline database [20]. The latest version (v22) of the AllergenOnline database contained 2290 allergens identified over the past decades. BLAST searches, with a pairwise identity threshold of 50% and subject coverage exceeding 90%, identified transcripts homologous to known allergens, based on a previous report about the criteria of sequence similarities for allergen prediction [21]. For each transcript and known allergen, only the best BLAST match (with the lowest E-value) was retained to remove duplicate alignments. For allergen isoform analysis, isoforms were numbered in descending order of TPM expression level. Fish allergen transcript sequences were extracted and translated to amino acid (AA) sequences using the ExPASy translate tool [57] for sequence similarity analysis, multiple alignment, and phylogenetic tree construction with MEGA 14 [58].
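The hit-filtering step can be illustrated with a short Python sketch. It is not the pipeline code used in the study. It assumes a hypothetical tab-separated hit table with seven columns (transcript ID, allergen ID, percent identity, E-value, subject start, subject end, subject length), computes subject coverage as the aligned span divided by the allergen length, and keeps one best hit per transcript, which is one interpretation of the deduplication rule described above.

```python
def filter_allergen_hits(lines, min_identity=50.0, min_subject_cov=0.90):
    """Return {transcript_id: best_hit} for hits passing both thresholds.

    Each line is assumed (hypothetical layout) to hold, tab-separated:
    transcript, allergen, pident, evalue, sstart, send, slen.
    """
    best = {}
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        q, s, pident, evalue, sstart, send, slen = line.rstrip("\n").split("\t")
        identity = float(pident)
        e_value = float(evalue)
        # Fraction of the known allergen (subject) covered by the alignment.
        coverage = (abs(int(send) - int(sstart)) + 1) / int(slen)
        if identity < min_identity or coverage <= min_subject_cov:
            continue
        # Keep only the lowest E-value hit for each transcript.
        if q not in best or e_value < best[q]["evalue"]:
            best[q] = {"allergen": s, "identity": identity,
                       "evalue": e_value, "coverage": coverage}
    return best
```

Requiring high coverage of the subject rather than the query means a long transcript can still match a short allergen, while fragmentary matches to only part of an allergen are discarded.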
4.3. Parvalbumin Conservation Score Calculation
In total, 80 unique parvalbumin sequences identified from fish transcriptomes or the AllergenOnline database were included. Multiple sequence alignment was performed using the R msa package, and WebLogo was used to identify the parvalbumin consensus sequence [59,60], which had a length of 110 AA. For the AA at each position, the conservation score was calculated as the frequency of that AA among all parvalbumins. For a given parvalbumin, the parvalbumin conservation score was calculated as the average AA conservation score across the full length. The conservation scores for chicken (Gal d 8) and frog (Ran e 1 and Ran e 2) parvalbumins were also calculated as a reference for fish parvalbumins.
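The conservation score defined above can be sketched in a few lines of Python. This is an illustrative reading of the definition, not the study's R code: it assumes the sequences are already aligned to equal length, uses hypothetical function names, and skips gap positions in the scored sequence, since gap handling is not specified in the text.

```python
from collections import Counter

def column_frequencies(alignment):
    """Per-column residue frequencies for a list of aligned, equal-length sequences."""
    if not alignment:
        raise ValueError("alignment is empty")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("aligned sequences must have equal length")
    n = len(alignment)
    return [
        {aa: count / n for aa, count in Counter(seq[i] for seq in alignment).items()}
        for i in range(length)
    ]

def conservation_score(seq, freqs, gap="-"):
    """Mean frequency of the sequence's own residue at each aligned position.

    Gap positions in `seq` are skipped (an assumption of this sketch).
    """
    if len(seq) != len(freqs):
        raise ValueError("sequence length must match the alignment length")
    scores = [freqs[i].get(aa, 0.0) for i, aa in enumerate(seq) if aa != gap]
    if not scores:
        raise ValueError("sequence contains only gaps")
    return sum(scores) / len(scores)
```

A sequence that carries the most common residue at every position scores highest, so the score measures how close a parvalbumin is to the consensus of the whole set.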
4.4. ROC Analysis
Subjects who underwent grass carp or salmon OFCs were included for sIgE ROC analysis using the R package multipleROC. For multiple sIgE analysis, logistic regression models were applied [61]. The R package bestglm was used to identify the best subset from multiple fish sIgE data under each assigned subset size using the Akaike Information Criterion (AIC) [62]. For comparing the diagnosis performance, the AUC was calculated based on either single fish sIgE data or multiple sIgE logistic regression models accordingly.
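For a single sIgE measurement, the AUC has a simple rank interpretation: it is the probability that a randomly chosen allergic subject has a higher sIgE value than a randomly chosen tolerant subject, with ties counted as one half. The Python sketch below computes the AUC this way; it is an illustration of the metric only, not a reimplementation of the multipleROC or bestglm workflows, and the function name is hypothetical.

```python
def roc_auc(scores, labels):
    """AUC as the fraction of (allergic, tolerant) pairs ranked correctly.

    scores: sIgE values (or model probabilities); labels: truthy for
    OFC-confirmed allergic subjects, falsy for tolerant subjects.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    positives = [s for s, y in zip(scores, labels) if y]
    negatives = [s for s, y in zip(scores, labels) if not y]
    if not positives or not negatives:
        raise ValueError("both allergic and tolerant subjects are required")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties contribute half a win
    return wins / (len(positives) * len(negatives))
```

For a multi-sIgE logistic regression model, the same function can be applied to the fitted probabilities in place of a single sIgE value.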
4.5. Statistics and Figure Plots
Correlation was calculated using the Pearson method with the R function cor. Linear regression models and corresponding parameters were computed using the R function lm. For two-group comparisons, Wilcoxon tests were applied in either unpaired (rank-sum) or paired (signed-rank) mode, as applicable. R packages such as ggplot2 and heatmap were used to generate bar plots, boxplots, dot plots, heatmaps, and other figures presented in this study.
5. Conclusions
This study provides comprehensive allergen profiles of eight common fish species using RNA-seq analysis. The high-resolution allergen map, featuring full-length sequences and relative expression abundance at the isoform level, systematically reveals molecular differences in fish allergenicity. This RNA-seq-based allergen repertoire serves as a valuable complement to current protein-based allergen databases and could be further expanded by including more food species in future studies.
The fish allergen profiles facilitated a better interpretation of clinical ImmunoCAP sIgE data, demonstrating potential applications in fish allergenicity prediction and improved fish classification for food avoidance strategies in fish allergy management. Regarding future CRD design for different fish allergies, our analysis suggests that the current PV-only panel may be applicable primarily to PV-high fish species. Including minor allergens such as aldolase and enolase could be a promising direction for future CRD development.
These findings highlight the potential of large-scale RNA-seq analysis for fish allergenicity prediction, classification, and food management. Future studies should incorporate more clinical evidence to clarify the allergenic potency of different minor allergens across multiple allergy cohorts and at the protein level if possible. Such research will advance our understanding of the molecular basis of fish allergies and contribute to improved diagnostic and management strategies for fish allergy patients.
Acknowledgments
We thank the subjects and their parents for participating in our fish allergy studies. We also thank Nancy Cheng, Tiffany Tavares, Suk T Lee, Rain Cheng, Annerliza Kwok, Belinda Tang, Nicole PF Li, Chloris HW Leung, Ann WS Au, and Yuki Shum for supporting oral food challenges for fish, and Brian Fong and Maco Lam for helping with sample processing for the sIgE measurement.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms251910784/s1.
Author Contributions
Z.-Y.L. and C.Y.Y.W. contributed equally to this work as co-first authors. Conceptualization, Z.-Y.L., C.Y.Y.W., A.S.Y.L. and T.F.L.; methodology, Z.-Y.L., C.Y.Y.W., A.S.Y.L. and T.F.L.; software, Z.-Y.L.; validation, Z.-Y.L., C.Y.Y.W. and A.S.Y.L.; formal analysis, Z.-Y.L. and C.Y.Y.W.; investigation, Z.-Y.L., C.Y.Y.W., A.S.Y.L. and T.F.L.; resources, Z.-Y.L., C.Y.Y.W., A.S.Y.L., W.H.C., J.S.R.D., I.C.S.L., J.W.C., N.A.N., P.K.H., G.T.C., Q.U.L., O.M.C., Y.S.Y., J.S.C.W., D.C.K.L., M.H.K.H. and M.Y.W.K.; data curation, Z.-Y.L. and C.Y.Y.W.; writing—original draft preparation, Z.-Y.L.; writing—review and editing, C.Y.Y.W., A.S.Y.L., J.K.C.S., M.F.T., N.Y.H.L. and T.F.L.; visualization, Z.-Y.L.; supervision, T.F.L.; project administration, Z.-Y.L., C.Y.Y.W. and M.F.T.; funding acquisition, C.Y.Y.W. and T.F.L. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
The study was conducted in accordance with the Declaration of Helsinki, and approved by the institutional review boards of respective hospitals and joint institutes: PWH (2017.542); QEH (KC/KE-17-0217/FR-4); QMH (UW16-2003); PMH (KW/EX-18-116[127-12]); UCH (KC/KE-20-0355/ER-1).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
Summary statistics of allergen profiles and sIgE patterns are presented in the main figures and Supplementary Materials. Raw data of fish RNA-seq can be freely downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/geo/, accessed on 1 July 2023) using the accession numbers provided in Table S1. Clinical demographic data, symptoms, sIgE measurements, and OFC outcomes of allergy subjects can be obtained from the authors upon reasonable request, subject to ethical restrictions.
Conflicts of Interest
The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Funding Statement
Zhong-Yi Liu was supported by the Faculty Postdoctoral Fellowship Scheme (references FPFS/22-23/054C and FPFS/23-24/R/026) of the Faculty of Medicine, The Chinese University of Hong Kong. This research was funded by the Health and Medical Research Fund (grant numbers 06170856, 08191356, 08191436, and 09202866) of the Health Bureau and Research Impact Fund (grant number R4035-19) of the Research Grants Council, Hong Kong Special Administrative Region Government and Direct Grant for Research (grant number 2024.173) of The Chinese University of Hong Kong.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
1. Lin H.Y., Wright S., Costello M.J. Numbers of fish species, higher taxa, and phylogenetic similarity decrease with latitude and depth, and deep-sea assemblages are unique. PeerJ. 2023;11:e16116. doi: 10.7717/peerj.16116.
2. Chang F., Eng L., Chang C. Food Allergy Labeling Laws: International Guidelines for Residents and Travelers. Clin. Rev. Allergy Immunol. 2023;65:148–165. doi: 10.1007/s12016-023-08960-6.
3. Moonesinghe H., Mackenzie H., Venter C., Kilburn S., Turner P., Weir K., Dean T. Prevalence of fish and shellfish allergy: A systematic review. Ann. Allergy Asthma Immunol. 2016;117:264–272.e4. doi: 10.1016/j.anai.2016.07.015.
4. Connett G.J., Gerez I., Cabrera-Morales E.A., Yuenyongviwat A., Ngamphaiboon J., Chatchatee P., Sangsupawanich P., Soh S.E., Yap G.C., Shek L.P., et al. A population-based study of fish allergy in the Philippines, Singapore and Thailand. Int. Arch. Allergy Immunol. 2012;159:384–390. doi: 10.1159/000338940.
5. Wai C.Y.Y., Leung N.Y.H., Leung A.S.Y., Wong G.W.K., Leung T.F. Seafood Allergy in Asia: Geographical Specificity and Beyond. Front. Allergy. 2021;2:676903. doi: 10.3389/falgy.2021.676903.
6. Okamoto M., Takafuji S., Inoue S., Tanaka Y. Fish allergy tolerance 16 months after diagnosis. Allergol. Immunopathol. 2021;49:25–27. doi: 10.15586/aei.v49i5.313.
7. Sharp M.F., Lopata A.L. Fish allergy: In review. Clin. Rev. Allergy Immunol. 2014;46:258–271. doi: 10.1007/s12016-013-8363-1.
8. Tong W.S., Yuen A.W., Wai C.Y., Leung N.Y., Chu K.H., Leung P.S. Diagnosis of fish and shellfish allergies. J. Asthma Allergy. 2018;11:247–260. doi: 10.2147/JAA.S142476.
9. Yuk J.E., Lee J., Jeong K.Y., Park K.H., Kim J.D., Kim J.T., Lee J.H., Park J.W. Allergenicity and Stability of 6 New Korean Bony Fish Extracts. Allergy Asthma Immunol. Res. 2021;13:623–637. doi: 10.4168/aair.2021.13.4.623.
10. Armentia A., Santos J., Serrano Z., Martin B., Martin S., Barrio J., Fernandez S., Gonzalez-Sagrado M., Pineda F., Palacios R. Molecular diagnosis of allergy to Anisakis simplex and Gymnorhynchus gigas fish parasites. Allergol. Immunopathol. 2017;45:463–472. doi: 10.1016/j.aller.2016.12.008.
11. Klueber J., Schrama D., Rodrigues P., Dickel H., Kuehn A. Fish Allergy Management: From Component-Resolved Diagnosis to Unmet Diagnostic Needs. Curr. Treat. Options Allergy. 2019;6:322–337. doi: 10.1007/s40521-019-00235-w.
12. Leung A.S.-Y., Fusayasu N., Álvarez L.A., Gu Y., Ebisawa M., Wong G.W.-K. Fish allergy management: Should fish be completely avoided? The pros and cons debate. J. Allergy Hypersensitivity Dis. 2024;2:100008. doi: 10.1016/j.jahd.2024.100008.
13. Dramburg S., Hilger C., Santos A.F., de Las Vecillas L., Aalberse R.C., Acevedo N., Aglas L., Altmann F., Arruda K.L., Asero R., et al. EAACI Molecular Allergology User’s Guide 2.0. Pediatr. Allergy Immunol. 2023;34(Suppl. 28):e13854. doi: 10.1111/pai.13854.
14. Seth D., Poowutikul P., Pansare M., Kamat D. Food Allergy: A Review. Pediatr. Ann. 2020;49:e50–e58. doi: 10.3928/19382359-20191206-01.
15. Karnaneedi S., Huerlimann R., Johnston E.B., Nugraha R., Ruethers T., Taki A.C., Kamath S.D., Wade N.M., Jerry D.R., Lopata A.L. Novel Allergen Discovery through Comprehensive De Novo Transcriptomic Analyses of Five Shrimp Species. Int. J. Mol. Sci. 2020;22:32. doi: 10.3390/ijms22010032.
16. Vo T.T.M., Amoroso G., Ventura T., Elizur A. Histological and transcriptomic analysis of muscular atrophy associated with depleted flesh pigmentation in Atlantic salmon (Salmo salar) exposed to elevated seawater temperatures. Sci. Rep. 2023;13:4218. doi: 10.1038/s41598-023-31242-2.
17. Vo T.T.M., Nguyen T.V., Amoroso G., Ventura T., Elizur A. Deploying new generation sequencing for the study of flesh color depletion in Atlantic Salmon (Salmo salar). BMC Genom. 2021;22:545. doi: 10.1186/s12864-021-07884-9.
18. Wang X., Liu G., Xie S., Pan L., Tan Q. Growth and Meat Quality of Grass Carp (Ctenopharyngodon idellus) Responded to Dietary Protein (Soybean meal) Level Through the Muscle Metabolism and Gene Expression of Myosin Heavy Chains. Front. Nutr. 2022;9:833924. doi: 10.3389/fnut.2022.833924.
19. Lin G., Thevasagayam N.M., Wan Z.Y., Ye B.Q., Yue G.H. Transcriptome Analysis Identified Genes for Growth and Omega-3/-6 Ratio in Saline Tilapia. Front. Genet. 2019;10:244. doi: 10.3389/fgene.2019.00244.
20. Goodman R.E., Ebisawa M., Ferreira F., Sampson H.A., van Ree R., Vieths S., Baumert J.L., Bohle B., Lalithambika S., Wise J., et al. AllergenOnline: A peer-reviewed, curated allergen database to assess novel food proteins for potential cross-reactivity. Mol. Nutr. Food Res. 2016;60:1183–1198. doi: 10.1002/mnfr.201500769.
21. Aalberse R.C. Structural biology of allergens. J. Allergy Clin. Immunol. 2000;106:228–238. doi: 10.1067/mai.2000.108434.
22. Nugraha R., Kamath S.D., Johnston E., Zenger K.R., Rolland J.M., O’Hehir R.E., Lopata A.L. Rapid and comprehensive discovery of unreported shellfish allergens using large-scale transcriptomic and proteomic resources. J. Allergy Clin. Immunol. 2018;141:1501–1504.e8. doi: 10.1016/j.jaci.2017.11.028.
23. Hamada Y., Nagashima Y., Shiomi K. Identification of collagen as a new fish allergen. Biosci. Biotechnol. Biochem. 2001;65:285–291. doi: 10.1271/bbb.65.285.
24. Kleinnijenhuis A.J., van Holthoon F.L. Domain-Specific Proteogenomic Analysis of Collagens to Evaluate De Novo Sequencing Results and Database Information. J. Mol. Evol. 2018;86:293–302. doi: 10.1007/s00239-018-9844-x.
25. Wai C.Y.Y., Leung N.Y.H., Leung A.S.Y., Fusayasu N., Sato S., Xu K.J.Y., Yau Y.S., Rosa Duque J.S., Kwan M.Y.W., Cheng J., et al. Differential patterns of fish sensitization in Asian populations: Implication for precision diagnosis. Allergol. Int. 2023;72:458–465. doi: 10.1016/j.alit.2023.03.003.
26. Kuehn A., Hilger C., Lehners-Weber C., Codreanu-Morel F., Morisset M., Metz-Favre C., Pauli G., de Blay F., Revets D., Muller C.P., et al. Identification of enolases and aldolases as important fish allergens in cod, salmon and tuna: Component resolved diagnosis using parvalbumin and the new allergens. Clin. Exp. Allergy. 2013;43:811–822. doi: 10.1111/cea.12117.
27. Ruethers T., Kamath S., Taki A., Le T., Karnaneedi S., Nugraha R., Cao T., Nie S., Williamson N., Mehr S., et al. Tropomyosin Is a Novel Major Fish Allergen of Unrecognized Importance. J. Allergy Clin. Immunol. 2020;145:AB226. doi: 10.1016/j.jaci.2019.12.187.
28. Liu R., Krishnan H.B., Xue W., Liu C. Characterization of allergens isolated from the freshwater fish blunt snout bream (Megalobrama amblycephala). J. Agric. Food Chem. 2011;59:458–463. doi: 10.1021/jf103942p.
29. Kobayashi A., Tanaka H., Hamada Y., Ishizaki S., Nagashima Y., Shiomi K. Comparison of allergenicity and allergens between fish white and dark muscles. Allergy. 2006;61:357–363. doi: 10.1111/j.1398-9995.2006.00966.x.
30. Li S., Chu K.H., Wai C.Y.Y. Genomics of Shrimp Allergens and Beyond. Genes. 2023;14:2145. doi: 10.3390/genes14122145.
31. Li J., Li Z., Kong D., Li S., Yu Y., Li H. IgE and IgG4 responses to shrimp allergen tropomyosin and its epitopes in patients from coastal areas of northern China. Mol. Med. Rep. 2020;22:371–379. doi: 10.3892/mmr.2020.11084.
32. Leung A.S.Y., Wai C.Y.Y., Leung N.Y.H., Ngai N.A., Chua G.T., Ho P.K., Lam I.C.S., Cheng J., Chan O.M., Li P.F., et al. Real-World Sensitization and Tolerance Pattern to Seafood in Fish-Allergic Individuals. J. Allergy Clin. Immunol. Pract. 2023;12:633–642. doi: 10.1016/j.jaip.2023.09.038.
33. Kuehn A., Scheuermann T., Hilger C., Hentges F. Important variations in parvalbumin content in common fish species: A factor possibly contributing to variable allergenicity. Int. Arch. Allergy Immunol. 2010;153:359–366. doi: 10.1159/000316346.
34. Kalic T., Morel-Codreanu F., Radauer C., Ruethers T., Taki A.C., Swoboda I., Hilger C., Hoffmann-Sommergruber K., Ollert M., Hafner C., et al. Patients Allergic to Fish Tolerate Ray Based on the Low Allergenicity of Its Parvalbumin. J. Allergy Clin. Immunol. Pract. 2019;7:500–508.e11. doi: 10.1016/j.jaip.2018.11.011.
35. Ruethers T., Taki A.C., Nugraha R., Cao T.T., Koeberl M., Kamath S.D., Williamson N.A., O’Callaghan S., Nie S., Mehr S.S., et al. Variability of allergens in commercial fish extracts for skin prick testing. Allergy. 2019;74:1352–1363. doi: 10.1111/all.13748.
36. Kuehn A., Codreanu-Morel F., Lehners-Weber C., Doyen V., Gomez-Andre S.A., Bienvenu F., Fischer J., Ballardini N., van Hage M., Perotin J.M., et al. Cross-reactivity to fish and chicken meat—A new clinical syndrome. Allergy. 2016;71:1772–1781. doi: 10.1111/all.12968.
37. Hilger C., Thill L., Grigioni F., Lehners C., Falagiani P., Ferrara A., Romano C., Stevens W., Hentges F. IgE antibodies of fish allergic patients cross-react with frog parvalbumin. Allergy. 2004;59:653–660. doi: 10.1111/j.1398-9995.2004.00436.x.
38. Leung N.Y.H., Leung A.S.Y., Xu K.J.Y., Wai C.Y.Y., Lam C.Y., Wong G.W.K., Leung T.F. Molecular and immunological characterization of grass carp (Ctenopharyngodon idella) parvalbumin Cten i 1: A major fish allergen in Hong Kong. Pediatr. Allergy Immunol. 2020;31:792–804. doi: 10.1111/pai.13259.
39. Kuehn A., Swoboda I., Arumugam K., Hilger C., Hentges F. Fish allergens at a glance: Variable allergenicity of parvalbumins, the major fish allergens. Front. Immunol. 2014;5:179. doi: 10.3389/fimmu.2014.00179.
40. Ruethers T., Taki A.C., Karnaneedi S., Nie S., Kalic T., Dai D.Y., Daduang S., Leeming M., Williamson N.A., Breiteneder H., et al. Expanding the allergen repertoire of salmon and catfish. Allergy. 2021;76:1443–1453. doi: 10.1111/all.14574.
41. Sun Y.B., Luo Y.Q., Chen J., Liu X., Gao J.Y., Xie Y.H., Chen H.B. Fish Allergy: A Review of Clinical Characteristics, Mechanism, Allergens, Epitopes, and Cross-Reactivity. ACS Food Sci. Technol. 2024;4:304–315. doi: 10.1021/acsfoodscitech.3c00572.
42. Mukherjee S., Horka P., Zdenkova K., Cermakova E. Parvalbumin: A Major Fish Allergen and a Forensically Relevant Marker. Genes. 2023;14:223. doi: 10.3390/genes14010223.
43. Taki A.C., Ruethers T., Nugraha R., Karnaneedi S., Williamson N.A., Nie S., Leeming M.G., Mehr S.S., Campbell D.E., Lopata A.L. Thermostable allergens in canned fish: Evaluating risks for fish allergy. Allergy. 2023;78:3221–3234. doi: 10.1111/all.15864.
44. Thalayasingam M., Lee B.W. Food Allergy: Molecular Basis and Clinical Practice. Volume 101. Karger Publishers; Basel, Switzerland: 2015. Fish and Shellfish Allergy; pp. 152–161.
45. Kobayashi Y., Yang T., Yu C.T., Ume C., Kubota H., Shimakura K., Shiomi K., Hamada-Sato N. Quantification of major allergen parvalbumin in 22 species of fish by SDS-PAGE. Food Chem. 2016;194:345–353. doi: 10.1016/j.foodchem.2015.08.037.
46. Kalic T., Radauer C., Lopata A.L., Breiteneder H., Hafner C. Fish Allergy Around the World-Precise Diagnosis to Facilitate Patient Management. Front. Allergy. 2021;2:732178. doi: 10.3389/falgy.2021.732178.
47. Bernhisel-Broadbent J., Strause D., Sampson H.A. Fish hypersensitivity. II: Clinical relevance of altered fish allergenicity caused by various preparation methods. J. Allergy Clin. Immunol. 1992;90:622–629. doi: 10.1016/0091-6749(92)90135-O.
48. Dijkema D., Emons J.A.M., Van de Ven A.A.J.M., Elberink J.N.G.O. Fish Allergy: Fishing for Novel Diagnostic and Therapeutic Options. Clin. Rev. Allergy Immunol. 2022;62:64–71. doi: 10.1007/s12016-020-08806-5.
49. Wang M.P., Thomas G.N., Ho S.Y., Lai H.K., Mak K.H., Lam T.H. Fish consumption and mortality in Hong Kong Chinese—The LIMOR study. Ann. Epidemiol. 2011;21:164–169. doi: 10.1016/j.annepidem.2010.10.010.
50. Zhao H., Wang M., Peng X., Zhong L., Liu X., Shi Y., Li Y., Chen Y., Tang S. Fish consumption in multiple health outcomes: An umbrella review of meta-analyses of observational and clinical studies. Ann. Transl. Med. 2023;11:152. doi: 10.21037/atm-22-6515.
51. Schoos A.M., Hansen S.M., Skov F.R., Stokholm J., Bonnelykke K., Bisgaard H., Chawes B.L. Allergen Specificity in Specific IgE Cutoff. JAMA Pediatr. 2020;174:993–995. doi: 10.1001/jamapediatrics.2020.0944.
52. Chen S., Zhou Y., Chen Y., Gu J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560.
53. Song L., Florea L. Rcorrector: Efficient and accurate error correction for Illumina RNA-seq reads. Gigascience. 2015;4:48. doi: 10.1186/s13742-015-0089-y.
54. Grabherr M.G., Haas B.J., Yassour M., Levin J.Z., Thompson D.A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q., et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883.
55. Simao F.A., Waterhouse R.M., Ioannidis P., Kriventseva E.V., Zdobnov E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351.
56. Smith-Unna R., Boursnell C., Patro R., Hibberd J.M., Kelly S. TransRate: Reference-free quality assessment of de novo transcriptome assemblies. Genome Res. 2016;26:1134–1144. doi: 10.1101/gr.196469.115.
57. Gasteiger E., Gattiker A., Hoogland C., Ivanyi I., Appel R.D., Bairoch A. ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003;31:3784–3788. doi: 10.1093/nar/gkg563.
58. Kumar S., Stecher G., Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol. Biol. Evol. 2016;33:1870–1874. doi: 10.1093/molbev/msw054.
59. Bodenhofer U., Bonatesta E., Horejs-Kainrath C., Hochreiter S. msa: An R package for multiple sequence alignment. Bioinformatics. 2015;31:3997–3999. doi: 10.1093/bioinformatics/btv494.
60. Crooks G.E., Hon G., Chandonia J.M., Brenner S.E. WebLogo: A sequence logo generator. Genome Res. 2004;14:1188–1190. doi: 10.1101/gr.849004.
61. Sperandei S. Understanding logistic regression analysis. Biochem. Med. 2014;24:12–18. doi: 10.11613/BM.2014.003.
62. Zhang Z. Variable selection with stepwise and best subset approaches. Ann. Transl. Med. 2016;4:136. doi: 10.21037/atm.2016.03.35.