Abstract
The emergence of an infectious viral disease caused by the Chinese giant salamander iridovirus (GSIV) has led to substantial economic losses. However, no more molecular information is available for the understanding of the mechanisms associated with virus–host interaction. In this study, de novo sequencing was used to obtain abundant high-quality ESTs and investigate differentially-expressed genes in the spleen of Chinese giant salamanders that were either infected or mock infected with GSIV. Comparative expression analysis indicated that 293 genes were down-regulated and 220 genes were up-regulated. Further enrichment analysis showed that the most enriched pathway is “complement and coagulation cascades”, and significantly enriched diseases include “inherited thrombophilia”, “immune system diseases”, “primary immunodeficiency”, “complement regulatory protein defects”, and “disorders of nucleotide excision repair”. Additionally, 30 678 simple sequence repeats (SSRs) from all spleen samples, 26 355 single nucleotide polymorphisms (SNPs) from the spleens of uninfected animals and 36 070 SNPs from the spleens of infected animals were detected. The large amount of variation was specific for the Chinese giant salamanders that were infected with GSIV. The results reported herein provided significant and new EST information that could contribute greatly in investigations into the molecular functions of immune genes in the Chinese giant salamander.
Electronic supplementary material
The online version of this article (doi:10.1186/s13567-015-0279-8) contains supplementary material, which is available to authorized users.
Introduction
Amphibians are an important evolutionary bridge between aquatic and terrestrial vertebrates [1]. The Chinese giant salamander, Andrias davidianus, is the largest extant amphibian species and is considered a living fossil because it has existed for more than 350 million years [2]. The phylogenetic position of the Chinese giant salamander makes it an invaluable model for evolutionary and comparative studies. The Chinese giant salamander also has significant economic value as an edible delicacy and for medicinal purposes. However, in the past 50 years the Chinese giant salamander population has declined sharply due to deterioration of habitat, environmental pollution, climate change, infectious diseases and commercial trade [3, 4]. Currently, artificial breeding and culture is being used to protect the Chinese giant salamander. Approximately two million Chinese giant salamanders are bred annually in China.
With the rapid expansion of Chinese giant salamander farming, emerging infectious diseases have been increasing. Viral diseases, including the iridovirus, have caused major impacts to the Chinese giant salamander industry [5, 6]. The economic losses caused by the Chinese giant salamander iridovirus (GSIV) reached 300 million RMB (48 million USD) in 2010. There is an urgent need to understand the immune system of the Chinese giant salamander and the pathogenic mechanism(s) associated with a GSIV infection. In previous studies, the morphogenesis, pathological changes, rapid detection methods and virion-associated viral proteins of GSIV have been reported [7–12]. Although a ranavirus-induced thymus cDNA library and two immune organs (skin and spleen) from the healthy Chinese giant salamanders were sequenced [13, 14], and several immune genes were reported [15–17], the molecular information available is still limited for the Chinese giant salamander and that has hindered the understanding of the molecular mechanisms associated with viral infection and virus–host interactions.
Next-generation sequencing technology, such as de novo transcriptome sequencing, can be used for large-scale efficient and economical production of ESTs, and has become an important method for studying non-model species [18, 19]. Transcriptome sequencing facilitates functional genomic studies, including global gene expression, novel gene discovery, assembly of full-length genes, simple sequence repeats (SSRs) and single nucleotide polymorphism (SNP) discovery [20]. This technology has also been used widely in comparative transcriptomics to identify differences in transcript abundance among different developmental stages and under different treatment conditions [21, 22]. In the present study, the spleen transcriptome of the Chinese giant salamander was sequenced by de novo sequencing technology, and a comparative analysis of transcriptome data was performed between a control and a group infected with GSIV. The results provided a significant amount of information on the genes in the Chinese giant salamander, and suggested the conservation and divergence of several important immune signaling pathways. The different genes expressed and enrichment analysis of pathways could contribute significant new information regarding the pathogenic mechanism(s) of the virus and the interaction(s) of the virus and the host.
Materials and methods
Chinese giant salamanders, viral challenge and sample collection
Chinese giant salamanders (average weight, ~180 g) were obtained from the research farm of the Yangtze River Fisheries Research Institute in Wuhan, China. Prior screening indicated that these animals were free of GSIV. All the salamanders were kept in aerated, tap water supplied tanks at 20 °C and fed with diced bighead carp (Hypophthalmichthys nobilis) for 2 weeks prior to the experiment. The virus suspension used to infect a portion of the salamanders was obtained from GSIV-infected EPC cells. The fish in infected group (GS_TS) were injected intramuscularly with 0.2 mL of the GSIV suspension (5 × 107 TCID50/mL), and fish in control group (GS_CS) were injected intramuscularly with equal volume of DPBS (Sigma). The spleens from three individual salamanders from each group were collected at 48 h post-injection.
RNA isolation and cDNA synthesis
Total RNA was extracted from the spleens using TRIzol® Reagent (Invitrogen, USA). Samples of the three individuals from each treatment group were pooled in equal amounts to generate one RNA sample per group. These two RNA samples were sent to Shanghai Majorbio Bio-pharm Biotechnology Co., Ltd. (Shanghai, China) for the cDNA library construction and Illumina deep sequencing.
cDNA library construction and Illumina deep sequencing
Two cDNA libraries were prepared using the TruseqTM RNA sample prep Kit (Illumina, San Diego, CA, USA) following the manufacturer’s instructions. Briefly, poly (A)+ RNA was purified from 5 μg of pooled total RNA using oligo (dT) magnetic beads, sheared into short fragments, and primed for cDNA library synthesis using the TruSeq RNA sample preparation kit according to the manufacturer’s instructions (Illumina). After quantitation using a TBS-380 minifluorometer (PicoGreen), the samples were clustered (TruSeq paired-end cluster kit v3-cBot-HS; Illumina) and sequenced on the HiSeq 2000 platform (100 bp, TruSeq SBS kit v3-HS 200 cycles; Illumina).
Data analysis
The raw reads from the images were generated using Solexa GA pipeline 1.6. After the removal of low-quality reads, processed reads with an identity value of 95% and a coverage length of 100 bp were assembled using the Trinity de novo assembler [23]. The isogenes generated were compared with the NCBI non-redundant (nr) database using the BLASTx algorithm, with a cut-off E value of ≤10−5. GO terms were extracted from the best hits obtained from the BLASTx against the nr database using Blast2GO [24]. These results were then sorted by GO categories using in-house Perl scripts. BLASTx was also used to align unique sequences to the Swiss-Prot database, Kyoto Encyclopedia of Genes and Genomes (KEGG) and Clusters of Orthologous Groups (COG) (with the e value of 10−6) to predict possible functional classifications and molecular pathways [25].
Differential expression analysis
To identify differentially expressed genes/isogenes between infected and uninfected groups, genes/isogenes expression levels were measured by using numbers of fragments per kilobase of transcript per million fragments sequenced (FPKM) [26], similar to RPKM (reads per kilobase of gene model exon per million mapped reads) measure used earlier [27]. The differential expression analysis was carried out using RSEM [28] and edgeR [29] softwares. For each gene/isogene, the p value was computed, and then Benjamini–Hochberg false discovery rate (FDR) was applied to correct the results for p value. The transcripts that were increased or decreased at an estimated absolute log2-fold change of >1 and FDR adjusted p value ≤ 0.05 were considered to be differentially expressed.
Identification of EST-SSR motifs and EST-SNPs
MSATCOMMANDER V. 0.8.2 [30] was used to analyze the microsatellite (SSR) distribution. The minimum number of repeats for SSR detection was six for di-SSRs and four repeats for tri-, tetra-, penta-, and hexa-SSRs. The open reading frame (ORF) and untranslated region (UTR) within the isotig were identified using Trinity [23]. The location of SSRs was estimated based on ORFs and UTRs. SSR-containing isotigs were annotated based on BLAST similarity searches. SNPs were detected based on alignment using BWA V. 0.5.9 [31] and SAMtools V. 0.1.18 [32]. From the “pileup” output of SAMtools, VarScan V.2.2.7 filtered SNPs based on the following criteria including (1) the total coverage and the number of reads to cover a candidate SNP (>8 reads); (2) the base quality where base calls with low Phred quality (<25) were removed from the coverage; and (3) frequency of mutated bases higher than 30% among all reads covering the position.
Quantitative real-time PCR
Quantitative real time PCR was performed using iQ™ SYBR Green Supermix (Bio-Rad, Singapore) on a BIO-RAD CFX96 Real-Time System under the following conditions: 3 min at 95 °C, followed by 45 cycles of 15 s at 94 °C, 15 s at 55 °C and 30 s at 72 °C. Different genes including complement component C1R, C1S, C1S-like, C2, C3, C4, C5, C7, C8A and C9 were used for validation. An additional file shows the primer sequences used in this study (Additional file 1). The relative expression levels of the selected genes were normalized to β-Actin and calculated using 2−ΔΔCt method.
Results
De novo sequencing and assembly
Two sequencing libraries were prepared from spleen samples obtained from control (GS_CS) and GSIV-infected (GS_TS) Chinese giant salamanders that were sequenced using an Illumina Hiseq 2000. In total, 122.48 million raw reads were generated from GS_CS and 154.75 million for GS_TS. The data was refined by discarding low-quality reads that contained unknown bases or whose length was lower than 20 nt after removal of the adaptors and low-quality bases. The resulting high-quality reads numbered 113.45 million and 143.78 million for the GS_CS and GS_TS samples, respectively. The total length of these reads was 9.6 × 109 and 11.97 × 109 bp for GS_CS and GS_TS samples, respectively and the Q20 percentage (the percentage of sequences with a sequencing error rate lower than 1%) was over 98% for both samples (Table 1). All high-quality reads were deposited in the National Central for Biotechnology Information (NCBI) and can be accessed under the accession number SRP047398.
Table 1.
GSIV-infected | Control | |
---|---|---|
Total raw reads | 154 748 118 | 122 483 450 |
Total clean reads | 143 776 686 | 113 450 692 |
Total clean nucleotides (bp) | 11 967 607 097 | 9 600 577 224 |
Q20 percentage | 98.44% | 98.43% |
De novo assembly was performed using Trinity that resulted in 80 367 genes and 123 440 isogenes. The total length was 182 916 518 bp, with an average length of 1481 bp (Table 2). Each isogene was longer than 351 bp, and 71 295 (57.76%) of the isogenes were 350–1000 bp. Additionally, 27 826 (22.5%) of the isogenes were longer than 2000 bp. The size distribution of isogenes is shown in Figure 1A.
Table 2.
Type | Number |
---|---|
Total gene | 80 367 |
Total isogenes | 123 440 |
Total residues | 182 916 518 |
Average length | 1481.83 |
Largest isogene | 18 965 |
Smallest isogene | 351 |
Functional annotation and classification
All assembled high-quality isogenes had their ORFs predicted using Trinity. A total of 58 979 (47.8%) isogenes contained an ORF. The predicted protein sequences were blasted against NR (non-redundant protein sequences in NCBI), String and Gene databases using BLASTP with a cut-off E value of 10−5. There were 38 106 (64.6%) isogenes with homologous sequences in at least one of the databases. Among them, 38 106 (64.6%), 15 579 (26.4), and 20 352 (34.5%) isogenes were found in NR, String and Gene databases, respectively. A total of 9629 (16.3%) isogenes were found in all three databases, while 20 873 (35.4%) isogenes were not identified (Figure 1B).
The ORFs of 64 461 (52.2%) isogenes could not be predicted, and the DNA sequences of 64 461 isogenes were blasted against the NR, String and Gene databases using BLASTX with a cut-off E value of 10−5. There were 10 138 (15.7%) isogenes with homologous sequences in at least one of above databases. Among them, 10 138 (15.7%), 959 (1.5%), and 3351 (5.2%) isogenes were found in the NR, String and Gene databases, respectively. A total of 516 (0.8%) isogenes were found in all three databases, while 54 323 (84.3%) isogenes were not identified (Figure 1C).
Based on NR annotations, the Gene Ontology (GO) classification system was used to classify the possible functions of the isogenes. A total of 31 356 (25.4%) isogenes were successfully assigned to at least one GO term annotation and were classified into three main categories including biological process, cellular component and molecular function (Figure 2A). For biological process, the top six largest categories were cellular process (23 645), single-organism process (19 125), metabolic process (17 849), biological regulation (15 665), regulation of biological process (14 954) and response to stimulus (11 342). For the cellular component category, the top three largest categories were cell (22 548), cell part (22 544) and organelle (16 663). Only a few isogenes belonged to the virion (29), and virion part (24) sub-categories. Interestingly, for the molecular function category, 20 914 and 10 902 isogenes were classified into the sub-categories “binding” and “catalytic activity”, respectively.
Isogenes sequences in our transcriptome library were also analyzed by the Clusters of Orthologous Groups of proteins (COG). Out of 38 106 NR hits, 7719 (20.3%) of the sequences showed a COG classification (Figure 2B). The largest category was “general function prediction only” (2875 of 7719 isogenes or 37.2%), followed by “signal transduction mechanisms” (1224 isogenes, 15.9%), “transcription” (1126 isogenes, 14.6%), “replication, recombination and repair” (1105 isogenes, 14.3%) and “post-translational modification, protein turnover, chaperones” (706 isogenes, 9.1%). The sub-categories “extracellular structures” (0, 0), “nuclear structure” (4, 0.05%) and “cell motility” (8, 0.1%) had the fewest related genes. Additionally, 310 (4.02%) isogenes were annotated as “function unknown”.
The 38 106 annotated sequences were mapped to the reference canonical pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG). Among those, 23 712 isogenes were assigned to 317 KEGG pathways (Additional file 2). Of the 23 712 isogenes, 2195 (9.26%) were related to metabolic pathways, 784 (3.31%) to pathways in cancer, 750 (3.16%) to PI3K-Akt signaling pathway, 643 (2.71%) to MAPK signaling pathway, 614 (2.59%) to neuroactive ligand-receptor interaction, and 604 (2.55%) to HTLV-I infection.
Differential expression analysis
To identify differential expression changes between GS_CS and GS_TS samples, FPKM method was used to calculate the expression levels of genes and isogenes. The results showed that 293 genes were down-regulated and 220 genes were up-regulated with an FDR < 0.05 and ratios larger than 2 (Figures 3A and B). Among them, 102 down-regulated genes were detected only in the GS_CS samples (Additional file 3A), and 67 up-regulated genes specific for the GS_TS samples (Additional file 3B). For the isogenes, 2888 down-regulated and 3588 up-regulated isogenes were identified (Figures 3C and D).
Enrichment analysis was conducted to help clarify the biological functions of all differentially expressed isogenes (DEGs) that were identified. All DEGs were mapped to each term of the GO database, and the GO terms with a corrected P value ≤0.05 were defined as significantly enriched in DEGs. The results indicated that 2493 DEGs were enriched in 433 GO terms (Additional file 4). Among these GO terms, “immune system process” (172 DEGs), “regulation of immune system process” (123 DEGs), “response to stimulus” (962 DEGs), “response to DNA damage stimulus” (78 DEGs), “lymphocyte activation” (45 DEGs), “T cell activation” (32 DEGs), “leukocyte migration” (30 DEGs) and “chemotaxis” (81 DEGs) were significantly enriched among DEGs compared with the whole transcriptome background.
We also performed an enrichment analysis of the KEGG pathways and diseases. These DEGs were significantly enriched in 7 pathways including “complement and coagulation cascades”, “hematopoietic cell lineage”, “Staphylococcus aureus infection”, “tight junction”, “viral myocarditis”, “transcriptional misregulation in cancer” and “allograft rejection” (Additional file 5). The most enriched pathway was “complement and coagulation cascades” (66 DEGs), with 7 DEGs up-regulated and 59 DEGs down-regulated (Figure 4). The significantly enriched disease terms included inherited thrombophilia (11 DEGs), immune system diseases (94 DEGs), primary immunodeficiency (71 DEGs), complement regulatory protein defects (9 DEGs), afibrinogenemia (5 DEGs), congenital disorders of the DNA repair systems (15 DEGs), agammaglobulinemias (9 DEGs), disorders of nucleotide excision repair (13 DEGs), congenital disorders of metabolism (112 DEGs), cockayne syndrome (7 DEGs), H00821 (15 DEGs), familial combined hyperlipidemia (5 DEGs), H00978 (8 DEGs) and epidermolysisbullosa, simplex (7 DEGs) (Additional file 5). Among those DEGs involved in immune system diseases, 41 DEGs were up-regulated and 53 DEGs were down-regulated (Figure 5A). Those DEGs associated with inherited thrombophilia, complement regulatory protein defects, afibrinogenemia, H00821, epidermolysisbullosa, and simplex diseases were strongly suppressed in the spleen of GSIV-infected Chinese giant salamanders (Figure 5).
SSR and SNP discovery
Simple sequence repeats (SSRs) have been shown to be an efficient tool for performing quantitative trait loci (QTL) analysis and constructing genetic linkage(s) due to their high diversity and abundance. In the present study, 30 678 SSRs were obtained from the transcriptomic information from the Chinese giant salamander. Among them, the most frequent repeat motifs were mononucleotide SSR motifs (76.2%), followed by trinucleotide (11.95%), dinucleotide (10.06%), tetranucleotide (1.6%), pentanucleotide (0.14%) and hexanucleotide (0.02%) (Figure 6A). Based on the distribution of SSR motifs, (A)^n, (G)^n and (T)^n (n ≥ 10) were the three predominant types among mononucleotide SSR motifs, with frequencies of 30.93, 30.14 and 27.77%, respectively (Figure 6B). In the six types of dinucleotide repeat motifs, GT (27.93%) was the most common motif, followed by AT (26.34%) and AC (22.2%) (Figure 6C). Among 20 types of trinucleotide repeats, AGC (13.91%), GCT (13.64%) and AGG (11.86%) were the three predominant types (Figure 6D). Tetranucleotide and pentanucleotide SSR motifs contained 35 and 23 types of repeats, respectively (Figures 6E and F).
SNPs were identified from alignments of multiple sequences used for contig assembly. A total of 26 355 and 36 070 SNPs were obtained from the GS_CS and GS_TS samples, respectively. In the GS_CS samples, 16 524 were putative transitions and 9831 were putative transversions (Figure 7A). In the GS_TS samples, the numbers of various SNP types were higher than that in GS_CS samples (Figures 7A and B), with 23 030 SNPs for transitions and 13 040 SNPs for transversions (Figure 7B). No insertion–deletion polymorphisms (indels) were found in either the GS_CS or GS_ TS samples.
DEGs, SSRs and SNPs analysis related to disorders of nucleotide excision repair
Among the sequencing libraries, we identified 75 sequences related to nucleotide excision repair. The members of the DNA repair gene family including ERCC1-6, ERCC8, XPA, POLH, BLM, TFIIH1-4, RBX1, Cul4, DDB1, DDB2, XPC, HR23, CETN2, CDK7, MNAT1, CCNH, TTDA, RPA, PCNA, RFC and LIG1 were all expressed in the spleen of the Chinese giant salamander. However, 10 genes including BLM, DDB2, ERCC4, ERCC5, ERCC6, ERCC8, POLH, RPA1, TFIIH3 and XPA were DEGs, with 11 isogenes up-regulated and 6 isogenes down-regulated (Table 3).
Table 3.
Transcript_id | KEGG GENE NAME | CS fpkm | TS fpkm | log2FC1 (TS/CS) | Significant |
---|---|---|---|---|---|
comp90201_c0_seq1 | BLM | 1.18 | 0 | −8.73 | Yes |
comp90201_c0_seq2 | BLM | 0 | 0.99 | 8.29 | Yes |
comp90773_c0_seq2 | DDB2 | 0 | 1.18 | 6.51 | Yes |
comp93741_c0_seq2 | ERCC4, XPF | 0 | 0.25 | 6.27 | Yes |
comp92742_c0_seq18 | ERCC5, XPG | 0.54 | 0 | −7.86 | Yes |
comp91446_c0_seq12 | ERCC6, CSB | 0 | 0.35 | 6.94 | Yes |
comp91446_c0_seq15 | ERCC6, CSB | 0.45 | 0 | −7.57 | Yes |
comp91446_c0_seq3 | ERCC6, CSB | 0.57 | 0 | −7.89 | Yes |
comp91446_c0_seq4 | ERCC6, CSB | 0 | 0.61 | 7.82 | Yes |
comp91446_c0_seq8 | ERCC6, CSB | 0 | 0.63 | 7.85 | Yes |
comp81303_c0_seq1 | ERCC8, CSA | 0 | 0.59 | 6.14 | Yes |
comp81303_c0_seq2 | ERCC8, CSA | 0.82 | 0 | −6.78 | Yes |
comp92141_c0_seq1 | POLH | 0 | 0.18 | 5.53 | Yes |
comp92141_c0_seq3 | POLH | 0 | 0.19 | 5.53 | Yes |
comp88203_c0_seq1 | RPA1, POLR1A | 0 | 3.78 | 7.76 | Yes |
comp86879_c0_seq2 | TFIIH3, TFB4 | 1.61 | 0 | −7.53 | Yes |
comp91081_c0_seq4 | XPA | 0 | 1.46 | 7.99 | Yes |
DEGs was filtered using threshold of false discovery rate (FDR) ≤0.05 and absolute value of log2Ratio ≥1.
Further analysis of SSRs and SNPs involved in disorders of nucleotide excision repair indicated a total of 12 genes that were related to disorders of nucleotide excision repair that contained 21 SSRs motifs (Table 4). Among them, DEGs including BLM, ERCC6 and ERCC8 contained 1 mononucleotide SRR and 4 trinucleotide SRR motifs. Thirteen genes including ERCC1, ERCC2, CCNH, Cul4, DDB1, LIG1, PCNA, RBX1, RPA, RFC, TFIIH2, TTDA and XPA contained 76 SNP sites (Table 5). Up to 15 SNPs existed in the isogene of PCNA (comp90786_c1_seq5). Among 76 SNPs, 33 (43.42%) and 29 (38.16%) of the SNPs were specific for the uninfected and infected Chinese giant salamanders, respectively. There were 14 SNPs, that were identical for the GS_CS and GS_TS samples but different from the reference sequences.
Table 4.
Transcript_id | KEGG GENE NAME | Start BP | Repeat | End BP | Type |
---|---|---|---|---|---|
comp90201_c0_seq1 | BLM | 5420 | (GCT)^4 | 5432 | Trinucleotide |
comp90201_c0_seq1 | BLM | 3386 | (ATC)^4 | 3398 | Trinucleotide |
comp57681_c0_seq1 | CDK7 | 43 | (AT)^6 | 55 | Dinucleotide |
comp71102_c0_seq1 | CUL4 | 944 | (A)^10 | 954 | Mononucleotide |
comp107286_c0_seq1 | ERCC3, XPB | 2478 | (G)^10 | 2488 | Mononucleotide |
comp107286_c0_seq1 | ERCC3, XPB | 2204 | (AGC)^4 | 2216 | Trinucleotide |
comp93741_c0_seq1 | ERCC4, XPF | 3807 | (T)^10 | 3817 | Mononucleotide |
comp91446_c0_seq12 | ERCC6, CSB | 3738 | (AGG)^6 | 3756 | Trinucleotide |
comp91446_c0_seq15 | ERCC6, CSB | 3727 | (AGG)^6 | 3745 | Trinucleotide |
comp81303_c0_seq1 | ERCC8, CSA | 66 | (T)^14 | 80 | Mononucleotide |
comp33205_c0_seq1 | RAD23, HR23 | 1046 | (AGC)^4 | 1058 | Trinucleotide |
comp33042_c0_seq1 | RBX1, ROC1 | 1 | (T)^11 | 12 | Mononucleotide |
comp89984_c0_seq1 | rpa | 577 | (G)^10 | 587 | Mononucleotide |
comp33075_c0_seq1 | RPA2 | 191 | (GGT)^4 | 203 | Trinucleotide |
comp57325_c0_seq1 | RPA12 | 1 | (G)^21 | 22 | Mononucleotide |
comp72565_c0_seq1 | RPA3 | 59 | (A)^12 | 71 | Mononucleotide |
comp56791_c0_seq1 | RFC | 1 | (T)^13 | 14 | Mononucleotide |
comp36422_c0_seq1 | RFC | 1 | (T)^11 | 12 | Mononucleotide |
comp37382_c0_seq1 | RFC | 192 | (GGT)^4 | 204 | Trinucleotide |
comp53798_c0_seq1 | TFIIH1, TFB1 | 2301 | (AC)^7 | 2315 | Dinucleotide |
comp82958_c1_seq1 | TFIIH2, SSL1 | 3 | (A)^19 | 22 | Mononucleotide |
Table 5.
Transcript_id | KEGG GENE NAME | Site | Ref | CS_type | TS_type |
---|---|---|---|---|---|
comp74141_c0_seq1 | ERCC1 | 2223 | G | – | A |
comp74141_c0_seq2 | 2070 | G | – | A | |
comp74141_c0_seq2 | 847 | G | T | T | |
comp88233_c0_seq1 | ERCC2 | 128 | G | A | – |
comp88233_c0_seq1 | 2140 | T | – | C | |
comp88233_c0_seq1 | 2450 | G | – | T | |
comp108677_c0_seq1 | CCNH | 190 | C | T | – |
comp61358_c0_seq1 | Cul4A | 1710 | A | C | – |
comp61358_c0_seq1 | 871 | C | – | A | |
comp71102_c0_seq1 | Cul4B | 1738 | A | – | G |
comp71102_c0_seq1 | 811 | A | – | G | |
comp32714_c0_seq1 | DDB1 | 129 | G | – | A |
comp32717_c0_seq1 | 1429 | G | – | A | |
comp32717_c0_seq1 | 433 | T | C | – | |
comp32741_c0_seq1 | 1157 | G | C | C | |
comp80349_c0_seq1 | LIG1 | 3623 | C | – | T |
comp90786_c1_seq5 | PCNA | 105 | T | C | C |
comp90786_c1_seq5 | 13 | G | – | A | |
comp90786_c1_seq5 | 1314 | G | – | A | |
comp90786_c1_seq5 | 1683 | A | G | – | |
comp90786_c1_seq5 | 1934 | G | A | A | |
comp90786_c1_seq5 | 2038 | A | G | – | |
comp90786_c1_seq5 | 2068 | A | G | G | |
comp90786_c1_seq5 | 22 | A | G | – | |
comp90786_c1_seq5 | 2395 | A | G | – | |
comp90786_c1_seq5 | 2465 | G | – | A | |
comp90786_c1_seq5 | 2487 | G | A | A | |
comp90786_c1_seq5 | 2501 | A | G | – | |
comp90786_c1_seq5 | 2543 | G | – | A | |
comp90786_c1_seq5 | 2562 | A | G | – | |
comp90786_c1_seq5 | 93 | G | C | – | |
comp90786_c1_seq7 | PCNA | 105 | T | C | C |
comp90786_c1_seq7 | 1240 | T | C | C | |
comp90786_c1_seq7 | 1244 | C | A | A | |
comp90786_c1_seq7 | 13 | G | – | A | |
comp90786_c1_seq7 | 1818 | G | A | A | |
comp90786_c1_seq7 | 2077 | A | C | C | |
comp90786_c1_seq7 | 22 | A | G | – | |
comp90786_c1_seq7 | 2401 | A | – | G | |
comp90786_c1_seq7 | 2615 | C | – | G | |
comp90786_c1_seq7 | 93 | G | C | – | |
comp33042_c0_seq1 | RBX1 | 89 | G | T | – |
comp89984_c0_seq1 | RPA | 1679 | C | – | T |
comp89984_c0_seq1 | 1829 | G | A | – | |
comp89984_c0_seq1 | 2071 | G | – | A | |
comp89984_c0_seq1 | 2600 | C | G | – | |
comp89984_c0_seq1 | 2645 | G | T | – | |
comp89984_c0_seq1 | 3312 | A | G | – | |
comp89984_c0_seq1 | 799 | C | T | – | |
comp89984_c0_seq1 | 815 | G | T | – | |
comp33075_c0_seq1 | RPA2 | 100 | T | C | C |
comp57325_c0_seq1 | RPA12 | 385 | G | – | C |
comp59041_c0_seq1 | RPA49 | 757 | G | A | – |
comp59041_c0_seq1 | 864 | A | – | G | |
comp59044_c0_seq1 | 380 | G | – | C | |
comp72565_c0_seq1 | RPA3 | 475 | A | – | G |
comp72565_c0_seq1 | 802 | G | A | – | |
comp88203_c0_seq2 | RPA | 4663 | G | C | – |
comp81979_c0_seq1 | RPA43 | 1011 | C | – | T |
comp81979_c0_seq1 | 1032 | C | T | – | |
comp81979_c0_seq1 | 1098 | G | – | A | |
comp56791_c0_seq1 | RFC1 | 3527 | T | C | C |
comp56791_c0_seq1 | 3757 | A | – | G | |
comp36422_c0_seq1 | RFC2 | 1188 | C | – | T |
comp85228_c0_seq1 | RFC2 | 1319 | T | G | – |
comp85228_c0_seq1 | 1744 | A | C | – | |
comp85228_c0_seq1 | 1844 | C | T | – | |
comp36017_c0_seq1 | RFC3 | 335 | A | C | C |
comp37382_c0_seq1 | RFC3 | 49 | C | T | – |
comp82958_c1_seq1 | TFIIH2 | 122 | G | – | A |
comp57951_c0_seq1 | TTDA | 513 | G | T | – |
comp91081_c0_seq1 | XPA | 1705 | T | C | – |
comp91081_c0_seq3 | 1633 | T | G | – | |
comp91081_c0_seq3 | 2827 | A | T | – | |
comp91081_c0_seq4 | 2669 | A | T | – | |
comp91081_c0_seq4 | 2893 | C | A | A |
The identical bases with reference sequences were indicated with “–”.
DEGs, SSRs and SNPs analysis related to antiviral activity
RIG-I-like receptor and Toll-like receptor signaling pathways are two pivotal pathways involved in antiviral immune responses. In the Chinese giant salamander, 143 isogenes involved in the RIG-I-like receptor signaling pathway and 201 isogenes involved in the Toll-like receptor signaling pathway were identified. However, only 20 and 18 isogenes were DEGs for the RIG-I-like receptor and Toll-like receptor signaling pathways, respectively (Additional file 6). Interestingly, the expression of antiviral pattern recognition receptors such as RIG-I, MDA5, LGP2, TLR3, TLR8, TLR7 and TLR9 did not change in response to the GSIV infection. The expression of NLRX1 was significantly increased, whereas TRIM25 decreased (Additional file 6).
For the RIG-I-like receptor signaling pathway, 21 genes contained 38 SSR motifs. Of these, IL8 contained the most SSRs (6 motifs), followed by IRF7 (5 motifs) and RIG-I (4 motifs). For the Toll-like receptor signaling pathway, 21 genes that included AKT, CCL3,ERK1, FADD, IKBKE,IKBKG, IL8, IRF7, JUN, MAP2K7, MAP3K7IP2, MYD88, NFKB1, NFKBIA, P38, PIK3C, PIK3R, TBK1, TBK1, TLR1 and TLR5 contained 59 SSR motifs. Among them, AKT contained the most mononucleotide SSRs, and PIK3C had the most trinucleotide SSRs (Additional file 7).
For SNPs analysis, 25 genes involved in the RIG-I-like receptor signaling pathway contained 272 motifs. Up to 75 and 52 SNPs existed in the genes of IKBKG and OTUD5, respectively. Among 272 SNPs, 207 (76.1%) SNPs were specific for the GSIV-infected Chinese giant salamanders. The antiviral PRRs MDA5, LGP2 and RIG-I also contained SNP motifs. For the Toll-like receptor signaling pathway, 41 genes contained 445 SNPs that consisted of 141 SNPs (31.69%) specific for the GS_CS samples and 228 SNPs (51.24%) specific for the GS_TS samples, and 73 common SNPs (16.4%) for both the GS_CS and GS_TS samples (Additional file 8). The genes involved in both the RIG-I-like receptor and Toll-like receptor signaling pathways, which included IRF3, IRF7, NFKBIA, FADD, JNK, RELA, IL12B, IL8, CASP8, P38, IKBKA, IKBKE and IKBKG contained a total of 141 SNPs (Additional files 8 and 9).
Experimental validation
Since the most enriched pathway was the “complement and coagulation cascades”, 10 genes belonging to the complement components were selected and used for qPCR validation. PCR amplification showed that all qPCR primers produced single fragments of the expected lengths (141–354 bp). Except for complement components C1R (comp81391_c0_seq2), C1S (comp72777_c0_seq1) and C7 (comp93878_c1_seq8), the expression of other 7 complement components including C1S-like, C2, C3, C4, C5, C8A and C9 was in agreement with their transcript abundance changes determined by RNA-seq (Figure 8).
Discussion
Transcriptome sequencing of spleen samples from uninfected and infected Chinese giant salamanders
The complete genome of GSIV has been sequenced [12]. However, the available molecular information is limited for the Chinese giant salamander, and not sufficient for the investigation of the mechanisms of immune reactivity against pathogen infection. In recent years, a number of reports reveal that the transcriptome sequencing of cDNA is an efficient method for generating large sequences that represent expressed genes [33]. Given that the spleen is one of the most important immune organs and is also the main target organ for the iridovirus GSIV [10, 22, 34, 35], the transcriptome sequencing of spleen samples from uninfected and infected Chinese giant salamanders was expected to provide abundant ESTs for amphibian immune genes and contribute to the understanding of GSIV-host interactions. In the study reported herein, using the recently developed Solexa sequencing technology and Trinity RNA-Seq assembly, the results of transcriptome sequencing are reported for the Chinese giant salamanders.
TLR, NLR and the RLR systems that are involved in immune responses
To begin this line of research, we first checked a set of sequences that encode components of TLR, NLR and the RLR systems that are involved in immune responses. In the spleen transcriptome of the Chinese giant salamanders, a total of 201, 206 and 143 isogenes were identified to be involved in the TLR, NLR and RLR pathways, respectively. We used MEGA software to compare the encoded proteins for these three pathways from the Chinese giant salamanders with their mammalian and teleost counterparts. For some proteins, the annotated sequences were not complete, therefore the phylogenetic trees constructed show relationships, but do not show precise evolutionary distances. The conservation and divergence of the TLR, NLR and RLR families are summarized below and in the figure legends.
The TLR system
The Toll-like Receptors (TLRs) are present throughout virtually the entire animal kingdom and have important functions in initiating inflammatory responses and shaping adaptive immunity [36]. A typical TLR is a type I transmembrane protein with many extracellular leucine-rich repeat (LRR) motifs for ligand recognition, and a cytoplasmic TIR domain for signal transduction. In invertebrates, only one or two TLR genes exist in sea squirt Cionaintestinalis and the nematode Caenorhabditiselegans [37, 38], however the sea urchin Strongylocentrotus purpuratus and amphioxus Branchiostoma lanceolatum possess a large number of TLRs [39, 40]. In mammals, the human genome contains 10 functional TLRs whereas the mouse genome contains 12 TLRs, with TLR10 being a pseudogene, and TLR11, TLR12 and TLR13 being mouse-specific genes [41]. Most fish species possess a higher number of TLR genes than mammalian species due to the presence of duplicated TLRs and fish-specific TLRs [42, 43].
Six major TLR families were identified in all vertebrate taxa designated as TLR1, TLR3, TLR4, TLR5, TLR7 and TLR11 [44]. In the Chinese giant salamanders, the members of TLR3, TLR5 and TLR7 have clear orthologs with fish and mammals that was supported by the phylogenetic analysis (Figure 9A). For TLR1 family, clear orthologous relationships were found among fish, amphibians and mammals for TLR2 gene, and two sequences were found to group with mammalian TLR10, TLR1 and TLR6. Interestingly, the ortholog of the mouse-specific TLR13 and fish-specific TLR21 and 22 appear to exist in the Chinese giant salamander. Similar to fish, the Chinese giant salamander has duplicated copies of some TLRs, such as TLR2, TLR5 and TLR21. These results suggest that amphibians may have a greater TLR repertoire than fish or mammals as a result of living in both aquatic and terrestrial environments.
The NLR system
A large family of cytoplasmic NACHT-LRR receptors (NLRs), characterized by the presence of a nucleotide-binding protein (NACHT domain), have important functions in apoptosis, inflammation and innate immune signaling [45–48]. There are 20–30 NLR genes in vertebrates, which are divided into three distinct subfamilies with regard to their phylogenetic relationships including the NODs (NOD1-2, NOD3/NLRC3, NOD4/NLRC5, NOD5/NLRX1, CIITA), the NLRPs (NLRP1–14, also called NALPs) and the IPAF subfamily, consisting of IPAF (NLRC4) and NAIP [49]. In the Chinese giant salamander, the canonical NOD proteins (NOD1, NOD2, NOD3, NOD5, CIITA), IPAF, NAIP, as well as APAF1 (apoptotic protease activating factor 1) exhibit clear orthologous relationships (Figure 9B). However, no homologous sequences were found for NOD4 in the spleen transcriptome of the Chinese giant salamander.
The expansion of NLR-encoding genes has been described in the sea urchin, amphioxus and zebrafish [39, 40, 50, 51]. In their genome, at least 92 (amphioxus) and 200 (sea urchin and zebrafish) NLR genes were predicted. The large majority of the sea urchin NLR proteins consist of a central NACHT domain, an N-terminal DEATH domain and C-terminal LRRs [39]. However, the large groups of fish-specific NLR proteins do not contain any the amino-terminal effector domains [51]. Additionally, NLRP proteins with a PYD domain in the N-terminal region were not found in the sea urchin, amphioxus and zebrafish [39, 40, 51]. Unlike NLRs in the species mentioned above, the expansion of NLR proteins were not found in the spleen transcriptome of the Chinese giant salamander, however many annotated sequences grouped with the mammalian NLRPs in the phylogenetic tree (Figure 9B).
In the Chinese giant salamander, many NLRs such as APAF1, IPAF, NOD2, NOD3, CIITA and NALP3/NALP12 existed as multiple isoforms. Our previous studies have shown that the isoforms of immune genes regulated positively or negatively antibacterial and antiviral immunity [52–54]. The multiple isoforms of NLR genes in the present study suggested that the complexity and diversity of the innate immunity may be achieved in Chinese giant salamander through the use of alternative splicing or gene duplication.
The RLR system
Besides NLRs, the RIG-I-like receptors (RLRs) represent another crucial family of intracellular pattern recognition receptors, which use C-terminal RNA helicases to recognize viral RNA and N-terminal CARD domains for signaling [55]. The RLR family members include retinoic acid inducible gene-I (RIG-I), melanoma differentiation gene-5 (MDA5) and laboratory of genetics and physiology-2 (LGP2), which have been identified in teleost species [52, 56–59]. These three RLRs were present in the Chinese giant salamander. As for adapters, the CARD-containing IPS-1 gene wasn’t identified in the spleen transcriptome of the Chinese giant salamander (Additional file 10A).
Comparative transcriptome analysis revealed the immune anergy of TLR, NLR and the RLR systems in the spleen
Iridoviruses are large double-stranded DNA (dsDNA) viruses that can infect invertebrates and poikilothermic vertebrates, including insects, fish, amphibians and reptiles. A real-time polymerase chain reaction (PCR) assay for a marine fish iridovirus showed that the spleen and kidney contained the largest number of viral particles while no viral DNA was detected in the muscle tissue [34]. A comparison of the genes expressed during a red seabream iridovirus (RSIV) infection in the spleen and kidney suggested that RSIV preferentially targets the spleen [36]. In the Chinese giant salamander, our previous study showed tissue necrosis and the existence of GSIV viral particles in the spleen, liver and kidney [10]. Many studies have shown that the spleen and kidney are the target organs of iridovirus infections.
Using 454 pyrosequencing, 755 up-regulated genes and 695 down-regulated genes were identified in the two spleen-complementary DNA libraries, that were constructed from Singapore grouper iridovirus (SGIV) infected and control orange-spotted grouper [22]. Although 80 367 genes were identified in the spleen transcriptome of the Chinese giant salamander using Solexa sequencing technology, comparative transcriptome analysis indicated that only 293 genes were down-regulated and 220 genes were up-regulated in response to the GSIV infection. In addition, a large number of genes were involved in TLR (72 hits), NLR (63 hits) and RLR (44 hits) pathways, however only 11 (TLR), 12 (NLR) and 10 (RLR) genes in these pathways showed significant changes in their transcripts after GSIV infection. The non-significant changes for the majority of the genes including PRRs and the non-significant enrich for these pathways suggested that TLR, NLR and RLR systems in the spleen of Chinese giant salamanders were immune anergic during GSIV infection.
Comparative transcriptome analysis revealed the primary immunodeficiency of the complement system in the spleen
The vertebrate complement system is a humoral and proteolytic system that is composed of approximately 40 soluble and membrane-bound proteins. It is an integral part of the innate immune system protecting the host against invasion and proliferation of various pathogens. The mammalian complement system has three different activation pathways which include classical, alternative and lectin. Activation of the classical pathway is triggered by the binding of C1q proteins to immune complexes or aggregates containing IgG or IgM [60]. The lectin pathway parallels the classical pathway, the difference being at the initial step of target recognition and subsequent activation [61]. Activation of the lectin pathway occurs through the binding of the mannose-binding lectins (MBL) to their target, which results in the activation of the MBL-associated serine proteases (MASPs) [62]. Different from the classical and lectin pathways, the alternative pathway of complement activation is triggered spontaneously, and primarily makes use of the recognition of host-associated molecular patterns (HAMPs), not pathogen-associated molecular patterns (PAMPs) [63].
In the study reported herein, the majority of genes involved in the complement and coagulation cascades (53 of 59 total genes or 90%) were identified, and the alternative, lectin and classical pathways appeared conserved in the Chinese giant salamander (Additional file 10B). The KEGG enrichment analysis indicated that the most enriched pathway is the “complement and coagulation cascades” and significantly enriched diseases include “primary immunodeficiency” and “complement regulatory protein defects”. In mammals, primary immunodeficiencies (PIDs) are severe defects in the capacity of the host to mount a proper immune response and are characterized by an increased susceptibility to infections. Common PIDs include disorders of humoral immunity, T-cell defects, combined B- and T-cell defects, phagocytic disorders, and complement deficiencies [64]. In this study, the different genes that were expressed in the “primary immunodeficiency” were mainly the components of complement system, with down-regulated expression of C1s-like, C2, C3, C4, c8g.1, C9, complement component (3b/4b) receptor 1-like, complement receptor type 2-like, complement factor H and hemolytic complement. The data from the qRT-PCR analysis also validated the down-regulated expression of the complement components C1s-like, C2, C3, C4, C5, C8A and C9 in response to the GSIV infection, however the weak up-regulated expression of C1r, C1s and C7 in the transcriptome analysis could not be validated by qRT-PCR. The results supported that GSIV infection can trigger primary immunodeficiency of the complement system in the spleen of the Chinese giant salamander.
Acknowledgements
The work was supported by Special Fund for Agro-scientific Research in the Public Interest (201203086).
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
LZ and YF designed the experiments. YF executed the experiments, submitted data and drafted the manuscript. MXC performed primer design, the bioinformatics analysis and drafted the manuscript. YWH performed the bioinformatics analysis and qRT-PCR validation. JM and LH participated in sample collection and RNA extraction. SEL and PN contributed to manuscript editing. LZ conceived the study and revised the manuscript. All authors read and approved the final manuscript.
Additional files
Footnotes
Yuding Fan and Ming Xian Chang contributed equally to this work
Contributor Information
Yuding Fan, Email: fanyd@yfi.ac.cn.
Ming Xian Chang, Email: mingxianchang@ihb.ac.cn.
Jie Ma, Email: jiema@yfi.ac.cn.
Scott E. LaPatra, Email: scott.lapatra@clearsprings.com
Yi Wei Hu, Email: huyiweicsu@163.com.
Lili Huang, Email: lilihuang0520@hotmail.com.
Pin Nie, Email: pinnie@ihb.ac.cn.
Lingbing Zeng, Email: zlb@yfi.ac.cn.
References
- 1.Vogel G. Frog is a prince of a new model organism. Science. 1999;285:25. doi: 10.1126/science.285.5424.25. [DOI] [PubMed] [Google Scholar]
- 2.Gao KQ, Shubin NH. Earliest known crown-group salamanders. Nature. 2003;422:424–428. doi: 10.1038/nature01491. [DOI] [PubMed] [Google Scholar]
- 3.Zhang KJ, Wang XM, Wu W, Wang ZH, Huang S. Advances in conservation biology of Chinese giant salamander. Biodivers Sci. 2002;10:291–297. [Google Scholar]
- 4.Wang XM, Zhang KJ, Wang ZH, Ding YZ, Wu W, Huang S. The decline of the Chinese giant salamander Andrias davidianus and implications for its conservation. ORYX. 2004;38:197–202. doi: 10.1017/S0030605304000341. [DOI] [Google Scholar]
- 5.Dong W, Zhang X, Yang C, An J, Qin J, Song F, Zeng W. Iridovirus infection in Chinese giant salamanders, China. Emerg Infect Dis. 2011;17:2388–2389. doi: 10.3201/eid1712.101758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Geng Y, Wang KY, Zhou ZY, Li CW, Wang J, He M, Yin ZQ, Lai WM. First report of a Ranavirus associated with morbidity and mortality in farmed Chinese giant salamanders (Andrias davidianus) J Comp Pathol. 2011;145:95–102. doi: 10.1016/j.jcpa.2010.11.012. [DOI] [PubMed] [Google Scholar]
- 7.Zhou Y, Zeng LB, Meng Y, Zhou QL, Zhang H, Gao ZY, Xiao Y, Sun JB. Establishment of a TaqMan real-time PCR assay for detecting the giant salamander iridovirus. J Fish Chin. 2012;36:772–778. doi: 10.3724/SP.J.1231.2012.27923. [DOI] [Google Scholar]
- 8.Zhou ZY, Geng Y, Ren SY, Wang KY, Huang XL, Chen DF, Liu XX, Lai WM. Ranavirus (family Iridoviridae) detection by polymerase chain reaction (PCR) in Chinese giant salamander (Andrias davidianus, Blanchard, 1871), China. Afr J Biotechnol. 2012;11:15130–15134. [Google Scholar]
- 9.Meng Y, Zhang H, Liang H, Zeng L, Xiao H, Xie C. Development of a loop–mediated isothermal amplification assay for rapid detection of iridovirus in the Chinese giant salamander. J Virol Methods. 2013;194:211–216. doi: 10.1016/j.jviromet.2013.08.024. [DOI] [PubMed] [Google Scholar]
- 10.Meng Y, Ma J, Jiang N, Zeng LB, Xiao HB. Pathological and microbiological findings from mortality of the Chinese giant salamander (Andrias davidianus) Arch Virol. 2014;159:1403–1412. doi: 10.1007/s00705-013-1962-6. [DOI] [PubMed] [Google Scholar]
- 11.Ma J, Zeng L, Zhou Y, Jiang N, Zhang H, Fan YD, Meng Y, Xu J. Ultrastructural morphogenesis of an amphibian iridovirus isolated from Chinese giant salamander (Andrias davidianus) J Comp Pathol. 2014;150:325–331. doi: 10.1016/j.jcpa.2013.09.007. [DOI] [PubMed] [Google Scholar]
- 12.Li W, Zhang X, Weng S, Zhao G, He J, Dong C. Virion-associated viral proteins of a Chinese giant salamander (Andriasdavidianus) iridovirus (genus Ranavirus) and functional study of the major capsid protein (MCP) Vet Microbiol. 2014;172:129–139. doi: 10.1016/j.vetmic.2014.05.009. [DOI] [PubMed] [Google Scholar]
- 13.Zhu R, Chen ZY, Wang J, Yuan JD, Liao XY, Gui JF, Zhang QY. Thymus cDNA library survey uncovers novel features of immune molecules in Chinese giant salamander Andrias davidianus. Dev Comp Immunol. 2014;46:413–422. doi: 10.1016/j.dci.2014.05.019. [DOI] [PubMed] [Google Scholar]
- 14.Li FG, Wang LX, Lan QJ, Yang H, Li Y, Liu XL, Yang ZX. RNA-Seq analysis and gene discovery of Andrias davidianus using Illumina short read sequencing. PLoS One. 2015;10:e0123730. doi: 10.1371/journal.pone.0123730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhu R, Chen ZY, Wang J, Yuan JD, Liao XY, Gui JF, Zhang QY. Extensive diversification of MHC in Chinese giant salamanders Andrias davidianus (Anda-MHC) reveals novel splice variants. Dev Comp Immunol. 2014;42:311–322. doi: 10.1016/j.dci.2013.10.001. [DOI] [PubMed] [Google Scholar]
- 16.Wang L, Yang H, Li F, Zhang Y, Yang Z, Li Y, Liu X. Molecular characterization, tissue distribution and functional analysis of macrophage migration inhibitory factor protein (MIF) in Chinese giant salamanders Andrias davidianus. Dev Comp Immunol. 2013;39:161–168. doi: 10.1016/j.dci.2012.10.012. [DOI] [PubMed] [Google Scholar]
- 17.Huang LL, Fan YD, Zhou Y, Jiang N, Liu WZ, Meng Y, Zeng LB. Cloning, sequence analysis and expression profiles of Toll-like receptor 7 from Chinese giant salamander Andrias davidianus. Comp Biochem Phys. 2015;184:52–57. doi: 10.1016/j.cbpb.2015.02.006. [DOI] [PubMed] [Google Scholar]
- 18.Alagna F, D’Agostino N, Torchia L, Servili M, Rao R, Pietrella M, Perrotta G. Comparative 454 pyrosequencing of transcripts from two olive genotypes during fruit development. BMC Genomics. 2009;10:399. doi: 10.1186/1471-2164-10-399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Salem M, Rexroad CE, Wang J, Thorgaard GH, Yao J. Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches. BMC Genomics. 2010;11:564. doi: 10.1186/1471-2164-11-564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Emrich SJ, Barbazuk WB, Li L, Schnable PS. Gene discovery and annotation using LCM-454 transcriptome sequencing. Genome Res. 2007;17:69–73. doi: 10.1101/gr.5145806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Li PH, Ponnala L, Gandotra N, Wang L, Si YQ, Tausta SL, Kebrom TH, Provart N, Patel R, Myers CR, Reidel EJ, Turgeon R, Liu P, Sun Q, Nelson T, Brutnell TP. The developmental dynamics of the maize leaf transcriptome. Nat Genet. 2010;42:1060–1067. doi: 10.1038/ng.703. [DOI] [PubMed] [Google Scholar]
- 22.Huang Y, Huang X, Yan Y, Cai J, Ouyang Z, Cui H, Wang P, Qin QW. Transcriptome analysis of orange-spotted grouper (Epinepheluscoioides) spleen in response to Singapore grouper iridovirus. BMC Genomics. 2011;12:556. doi: 10.1186/1471-2164-12-556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–3676. doi: 10.1093/bioinformatics/bti610. [DOI] [PubMed] [Google Scholar]
- 25.Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, Yamanishi Y. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36:D480–D484. doi: 10.1093/nar/gkm882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28:511–515. doi: 10.1038/nbt.1621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5:621–628. doi: 10.1038/nmeth.1226. [DOI] [PubMed] [Google Scholar]
- 28.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Faircloth BC. msatcommander: detection of microsatellite repeat arrays and automated, locus-specific primer design. Mol Ecol Resour. 2008;8:92–94. doi: 10.1111/j.1471-8286.2007.01884.x. [DOI] [PubMed] [Google Scholar]
- 31.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Morozova O, Hirst M, Marra MA. Applications of new sequencing technologies for transcriptome analysis. Annu Rev Genomics Hum Genet. 2009;10:135–151. doi: 10.1146/annurev-genom-082908-145957. [DOI] [PubMed] [Google Scholar]
- 34.Wang XW, Ao JQ, Li QG, Chen XH. Quantitative detection of a marine fish iridovirus isolated from large yellow croaker, Pseudosciaenacrocea, using a molecular beacon. J Virol Methods. 2006;133:76–81. doi: 10.1016/j.jviromet.2005.10.025. [DOI] [PubMed] [Google Scholar]
- 35.Dang TL, Yasuike M, Hirono I, Kondo H, Aoki T. Transcriptional profile of red seabream iridovirus in a fish model as revealed by viral DNA microarrays. Virus Genes. 2007;35:449–461. doi: 10.1007/s11262-007-0090-3. [DOI] [PubMed] [Google Scholar]
- 36.Iwasaki A, Medzhitov R. Toll-like receptor control of the adaptive immune responses. Nat Immunol. 2004;5:987–995. doi: 10.1038/ni1112. [DOI] [PubMed] [Google Scholar]
- 37.Sasaki N, Ogasawara M, Sekiguchi T, Kusumoto S, Satake H. Toll-like receptors of the ascidian Ciona intestinalis: prototypes with hybrid functionalities of vertebrate Toll-like receptors. J Biol Chem. 2009;284:27336–27343. doi: 10.1074/jbc.M109.032433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Satake H, Sekiguchi T. Toll-like receptors of deuterostome invertebrates. Front Immunol. 2012;3:34. doi: 10.3389/fimmu.2012.00034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hibino T, Loza-Coll M, Messier C, Majeske AJ, Cohen AH, Terwilliger DP, Buckleyb KM, Brocktonb V, Naird SV, Berneye K, Fugmann SD, Anderson MK, Pancer Z, Cameron RA, Smith LC, Rast JP. The immune gene repertoire encoded in the purple sea urchin genome. Dev Biol. 2006;300:349–365. doi: 10.1016/j.ydbio.2006.08.065. [DOI] [PubMed] [Google Scholar]
- 40.Huang S, Yuan S, Guo L, Yu Y, Li J, Wu T, Ge J, Yu Y, Huang H, Dong M, Yu C, Chen S, Xu A. Genomic analysis of the immune gene repertoire of amphioxus reveals extraordinary innate complexity and diversity. Genome Res. 2008;18:1112–1126. doi: 10.1101/gr.069674.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ariffin JK, Sweet MJ. Differences in the repertoire, regulation and function of Toll-like Receptors and inflammasome-forming Nod-like Receptors between human and mouse. Curr Opin Microbiol. 2013;16:303–310. doi: 10.1016/j.mib.2013.03.002. [DOI] [PubMed] [Google Scholar]
- 42.Quiniou SM, Boudinot P, Bengtén E. Comprehensive survey and genomic characterization of Toll-like receptors (TLRs) in channel catfish, Ictaluruspunctatus:identification of novel fish TLRs. Immunogenetics. 2013;65:511–530. doi: 10.1007/s00251-013-0694-9. [DOI] [PubMed] [Google Scholar]
- 43.Kanwal Z, Wiegertjes GF, Veneman WJ, Meijer AH, Spaink HP. Comparative studies of Toll-like receptor signalling using zebrafish. Dev Comp Immunol. 2014;46:35–52. doi: 10.1016/j.dci.2014.02.003. [DOI] [PubMed] [Google Scholar]
- 44.Roach JC, Glusman G, Rowen L, Kaur A, Purcell MK, Smith KD, Hood LE, Aderem A. The evolution of vertebrate Toll-like receptors. Proc Natl Acad Sci U S A. 2005;102:9577–9582. doi: 10.1073/pnas.0502272102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Fritz JH, Ferrero RL, Philpott DJ, Girardin SE. Nod-like proteins in immunity, inflammation and disease. Nat Immunol. 2006;7:1250–1257. doi: 10.1038/ni1412. [DOI] [PubMed] [Google Scholar]
- 46.Kanneganti TD, Lamkanfi M, Núñez G. Intracellular NOD-like receptors in host defense and disease. Immunity. 2007;27:549–559. doi: 10.1016/j.immuni.2007.10.002. [DOI] [PubMed] [Google Scholar]
- 47.Franchi L, Park JH, Shaw MH, Marina-Garcia N, Chen G, Kim YG, Núñez G. Intracellular NOD-like receptors in innate immunity, infection and disease. Cell Microbiol. 2008;10:1–8. doi: 10.1111/j.1462-5822.2007.01059.x. [DOI] [PubMed] [Google Scholar]
- 48.Chen G, Shaw MH, Kim YG, Nuñez G. NOD-like receptors: role in innate immunity and inflammatory disease. Ann Rev Pathol. 2009;4:365–398. doi: 10.1146/annurev.pathol.4.110807.092239. [DOI] [PubMed] [Google Scholar]
- 49.Schroder K, Tschopp J. The inflammasomes. Cell. 2010;140:821–832. doi: 10.1016/j.cell.2010.01.040. [DOI] [PubMed] [Google Scholar]
- 50.Rast JP, Smith LC, Loza-Coll M, Hibino T, Litman GW. Genomic insights into the immune system of the sea urchin. Science. 2006;314:952–956. doi: 10.1126/science.1134301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Stein C, Caccamo M, Laird G, Leptin M. Conservation and divergence of gene families encoding components of innate immune response systems in zebrafish. Genome Biol. 2007;8:R251. doi: 10.1186/gb-2007-8-11-r251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Chang M, Collet B, Nie P, Lester K, Campbell S, Secombes CJ, Zou J. Expression and functional characterization of the RIG-I-like receptors MDA5 and LGP2 in Rainbow trout (Oncorhynchus mykiss) J Virol. 2011;85:8403–8412. doi: 10.1128/JVI.00445-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Chang MX, Zou J, Nie P, Huang B, Yu Z, Collet B, Secombes CJ. Intracellular interferons in fish: a unique means to combat viral infection. PLoS Pathog. 2013;9:e1003736. doi: 10.1371/journal.ppat.1003736. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Yu ZL, Li JH, Xue NN, Nie P, Chang MX. Expression and functional characterization of PGRP6 splice variants in grass carp Ctenopharyngodonidella. Dev Comp Immunol. 2014;47:264–274. doi: 10.1016/j.dci.2014.08.006. [DOI] [PubMed] [Google Scholar]
- 55.Bruns AM, Horvath CM. Activation of RIG-I-like receptor signal transduction. Crit Rev Biochem Mol Biol. 2012;47:194–206. doi: 10.3109/10409238.2011.630974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Su J, Huang T, Dong J, Heng J, Zhang R, Peng L. Molecular cloning and immune responsive expression of MDA5 gene, a pivotal member of the RLR gene family from grass carp Ctenopharyngodon idella. Fish Shellfish Immunol. 2010;28:712–718. doi: 10.1016/j.fsi.2010.01.009. [DOI] [PubMed] [Google Scholar]
- 57.Ohtani M, Hikima J, Kondo H, Hirono I, Jung TS, Aoki T. Characterization and antiviral function of a cytosolic sensor gene, MDA5, in Japanese flounder, Paralichthys olivaceus. Dev Comp Immunol. 2011;35:554–562. doi: 10.1016/j.dci.2010.12.013. [DOI] [PubMed] [Google Scholar]
- 58.Rajendran KV, Zhang J, Liu S, Peatman E, Kucuktas H, Wang X, Liu H, Wooda T, Terhunea J, Liu ZJ. Pathogen recognition receptors in channel catfish: II Identification, phylogeny and expression of retinoic acid-inducible gene I (RIG-I)-like receptors (RLRs) Dev Comp Immunol. 2012;37:381–389. doi: 10.1016/j.dci.2012.02.004. [DOI] [PubMed] [Google Scholar]
- 59.Zou PF, Chang MX, Xue NN, Liu XQ, Li JH, Fu JP, Chen SN, Nie P. Melanoma differentiation-associated gene 5 in zebrafish provoking higher interferon-promoter activity through signalling enhancing of its shorter splicing variant. Immunology. 2014;141:192–202. doi: 10.1111/imm.12179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Gaboriaud C, Juanhuix J, Gruez A, Lacroix M, Darnault C, Pignol D, Verger D, Fontecilla-Camps JC, Arlaud GJ. The crystal structure of the globular head of complement protein C1q provides a basis for its versatile recognition properties. J Biol Chem. 2003;278:46974–46982. doi: 10.1074/jbc.M307764200. [DOI] [PubMed] [Google Scholar]
- 61.Degn SE, Thiel S. Humoral pattern recognition and the complement system. Scand J Immunol. 2013;78:181–193. doi: 10.1111/sji.12070. [DOI] [PubMed] [Google Scholar]
- 62.Fujita T. Evolution of the lectin-complement pathway and its role in innate immunity. Nat Rev Immunol. 2002;2:346–353. doi: 10.1038/nri800. [DOI] [PubMed] [Google Scholar]
- 63.Pangburn MK, Ferreira VP, Cortes C. Discrimination between host and pathogens by the complement system. Vaccine. 2008;8:I15–I21. doi: 10.1016/j.vaccine.2008.11.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Cooper MA, Pommering TL, Korányi K. Primary immunodeficiencies. Am Fam Physician. 2003;68:2001–2008. [PubMed] [Google Scholar]