Skip to main content
Genes logoLink to Genes
. 2020 May 19;11(5):570. doi: 10.3390/genes11050570

A Reference Genome of Bursaphelenchus mucronatus Provides New Resources for Revealing Its Displacement by Pinewood Nematode

Shuangyang Wu 1,, Shenghan Gao 2,, Sen Wang 2, Jie Meng 1,3, Jacob Wickham 1, Sainan Luo 2,3, Xinyu Tan 2, Haiying Yu 2, Yujia Xiang 1,3, Songnian Hu 2,3, Lilin Zhao 1, Jianghua Sun 1,3,4,*
PMCID: PMC7288286  PMID: 32438771

Abstract

The Bursaphelenchus mucronatus, which was highly similar with Bursaphelenchus xylophilus in terms of morphological characteristics and biological properties—but had weaker pathogenicity to forests—was a native species often displaced by B. xylophilus when occupying the same niche. Since the draft genome of the invasive B. xylophilus has been published, the absence of a reference genome of B. mucronatus still prevents us from understanding the molecular evidences behind competitive displacement. In this study, we employed Single Molecule, Real-Time (SMRT) sequencing and a Hi-C scaffolding approach to yield a near chromosome-level assembly of B. mucronatus, including six pseudo-chromosomes. The assembly size is 73 Mb, with scaffold N50 of 11.50 Mb and contig N50 of 1.48 Mb. Comparative genomics results showed high similarity between B. xylophilus and B. mucronatus. However, the losing of orphan genes and species-specific orthologous genes in B. mucronatus may indicate weaker adaptability to the environment. The gene family contractions of GPCRs (G Protein-Coupled Receptors) and cellulases in B. mucronatus may jointly contribute to its displacement by B. xylophilus. Overall, we introduced a valuable genomic resource for molecular and evolutionary studies of B. mucronatus, especially for studying the competitive displacement by the pinewood nematode, which could help us control the pathogenicity of pine wilt diseases.

Keywords: Bursaphelenchus mucronatus, Bursaphelenchus xylophilus, Comparative genomics, GPCR

1. Introduction

The pinewood nematode, Bursaphelenchus xylophilus, was introduced from North America to Asia and Europe; it is now a global quarantine pest and the main cause of pine wilt disease. It has caused severe disaster to coniferous forest ecosystems [1,2]. In contrast with B. xylophilus, there was another native sympatric sibling species, Bursaphelenchus mucronatus, which was highly similar with B. xylophilus in terms of morphological characteristics and biological properties [3], but has weaker pathogenicity to forests [4,5,6,7]. A previous study showed that the juveniles and males of the two nematode species are difficult to distinguish morphologically, although the tail of adult female worms between B. xylophilus and B. mucronatus are different. Specifically, the tail of female B. mucronatus is conical and has a distinct tail tip [6,8], whereas B. xylophilus has a rounded terminus [8]. They also occupy the same niche, reproduce in the resin canals, feed on epithelial cells in living hosts, consume the host’s symbiotic fungi in dead hosts, and are even transported to host pine trees by the same insect vectors, Monochamus beetles [4,9]. Interestingly, even the two nematodes are highly similar and reside usually in the shared ecosystems; some evidences indicate that B. xylophilus usually displace the native nematode B. mucronatus in forests during its invasion process [10,11]. Therefore, previous work, which mostly focused on comparisons of biological characteristics (morphology, pathogenicity, reproduction, etc.) between B. xylophilus and B. mucronatus [7,11,12,13] could not sufficiently explain why and how the competitive replacement took place.

Genomic resources have proven to be useful tools to explain the successful expansion of invasive species. The Cydia pomonella genome reveals the genetic basis of its invasiveness with three single nucleotide polymorphisms (SNPs) found in the promoter of CYP6B2. These are potentially associated with insecticide resistance that may explain its worldwide expansion [14]. The Hyphantria cunea genome infers that the fall webworm can colonize novel hosts, mediated by plasticity in their gustatory receptor, which could utilize more novel nutrition sources [15]. The genome of the invasive pest Lycorma delicatula has served as a reference for studying olfactory genes to restrict its expansion on the fruit or horticultural crops [16]. The comparative genomics study of four Strongyloides species exploited the unique Strongyloides life cycle and found the expansion gene family of astacin-like and SCP/TAPS (Sperm-coating protein (SCP)-like extracellular proteins, also called SCP/Tpx-1/Ag5/PR-1/Sc7) is associated with the evolution of parasitism [17]. Given these golden examples, the comparative genomic analysis between B. xylophilus and B. mucronatus is expected to be a key to the answers. Since the draft genome of the invasive species B. xylophilus has already been published [18], the absence of a reference genome of B. mucronatus still prevents us from understanding the molecular evidences behind this competitive displacement, or even, much simply, are the two nematodes the same species or not? A reference genome for B. mucronatus is a necessary resource and a better way to understand the competition between the two species.

Thus, we hereby have sequenced and assembled a reference genome for B. mucronatus using PacBio SMRT (Single Molecule, Real-Time) technology, and then carried out a comparative genomic analysis with the previously released B. xylophilus reference genome. This reference genome is to investigate evidences for the evolution, the pathogenicity, and the competitive displacement between B. xylophilus and B. mucronatus, which may help us better understand the pathogenesis of pinewood nematode and control the pine wilt disease.

2. Materials and Methods

2.1. Sample Collection, Cultivation, and Growth Synchronization

The nematode samples of B. mucronatus, which originated from Zhejiang Province, China, were cultivated on the fungus Botrytis cinerea using potato dextrose agar (PDA) plates. About 100 adult nematodes were added to the PDA plate with B. cinerea. Moreover, 1X PBST buffer (Phosphate Buffer Saline Tween-20) was used to wash out the nematodes after feeding on B. cinerea; the suspension was poured into a clean glass petri dish and stored for 30 min. The excess suspension was poured out when most of the eggs were adsorbed to the bottom of the glass. Sterile distilled water was added to the glass to form a water layer with a height of about 5 mm, and cultured in the dark at 25 °C until most of the eggs hatched and became the second instar. Then, the second instar larvae (L2) were cultured with B. cinerea. After 24 h–30 h, L2 grew into the third instar larvae (L3); after 48 h–54 h, L2 grew into the fourth instar larvae (L4); and after 72 h–78 h, L2 grew into the adults. Different stages of nematodes were collected by the Baermann funnel method.

2.2. Library Construction and Sequencing

Total genomic DNA was extracted from about 20,000 adult nematodes of mixed age using MagAttract HMW DNA Kit (QIAGEN, Valencia, CA, USA), according to the manufacturer’s protocol. Agarose gel electrophoresis was used to check the average length of genomic DNA molecules; the main band was around 20 kb, which satisfied the requirement of assembling sequences with PacBio long read. SMRTbell™ Template Prep Kit was subsequently used to construct the library for the PacBio RS II platform (Pacific Biosciences of California, Menlo Park, CA, USA). A total of 24 single-molecule real-time cells were produced with P6/C4 chemistry, generating 720 million reads with a total length of 11 Gb subreads, with a coverage of 160x of the B. mucronatus genome. Meanwhile, a paired-end library with an insert size of 300 bp was constructed using KAPA Hyper Prep Kit (KK8504) and then sequenced on HiSeq 4000 platform (Illumina, San Diego, CA, USA), which was used for assembly survey and assembly correction.

To anchor the assembled sequences into chromosomal level pseudo-molecules, a Hi-C library was constructed using another batch of 20,000 nematodes of mixed stage. Nuclei were extracted and subsequently digested with HindIII and DpnII enzymes, incubated with biotin-14-dATP, and then ligated by T4 DNA polymerase. The extracted DNA was sheared by Proteinase K and purified to 350 bp sizes. Magnetic beads were used to conduct biotin pull down and tail the adapter. PCR amplification was performed after adapters were ligated to the Hi-C products. The PCR products were purified with AMPure XP beads, and the Hi-C libraries were quantified by qPCR for Illumina HiSeq X-ten PE150 sequencing.

Five transcriptomic libraries of B. mucronatus in different life stage (Adult, L2, L3, and L4, mixed age) was built by KAPA Stranded mRNA-Seq Library Preparation kit (KK8421) and quantified by qPCR for Illumina HiSeq 4000 PE150 sequencing. The mixed age of transcriptomic library of B. xylophilus was built using the same method and the same cultivation method as B. mucronatus. Note that these age-specific RNA sequencing (RNA-seq) libraries were designated for constructing transcripts for training and predicting coding genes; thus, no biological replicates were employed.

2.3. Genome Size Estimation

The genome size was estimated based on the K-mer frequency distribution counted from Illumina pair-end libraries. The K-mer frequency was counted by Jellyfish [19] with parameter “-m 17 -C” and the result was uploaded to GenomeScope [20] web portal to estimate the genome size. For the GenomeScope parameter, we used a 4-peak model that converged in 4 rounds. The estimated genome size was around 67.5 Mb (Table S1).

2.4. Genome Assembly

A total of 11 Gb (~160X) whole genome shotgun PacBio P6/C4 reads generated from PacBio RS II platform were used for genome assembly (Table S2). We used Canu (version 1.8) [21] to generate the de novo assembly of B. mucronatus. To obtain an optimal result, we separated the assembling progress into two stages. First, we generated the corrected long reads using Canu with default parameters. Second, we carried out a series of runs using the corrected reads with different combinations of parameters, in the hope of searching for better integrity (higher N50 length and fewer numbers of contigs) of the genome assembly (Table S3). Finally, we chose the parameter combination of “minReadLength=2000, minOverlapLength=2000, correctedErrorRate=0.06, batOptions=-dg 3 -db 3 -dr 1 -ca 500 -cp 50” for the assembly. The assembled contigs were further corrected by Illumina pair-end reads using Pilon [22] with default parameters. The initial assembly contained 1,030 contigs, which sum up to 83,092,420 bp (83 Mb), including 28 repeats with a total length of 606,152 bp. The N50 length of the assembly was 1.48 Mb and the L50 number was 12 (Table S4).

2.5. Pseudo-Chromosomal Level Scaffolding with Hi-C Data

The reads from the Hi-C library (Table S5) sequencing were mapped to the assembled contigs using Juicer [23], visualized using Juicebox [23] and anchored into pseudo-chromosome using 3D-DNA pipeline [24]. Juicebox Assembly Tools (JBAT) [23] was used to manually polish and finalize the assembly. Finally, we obtained six pseudo-chromosomes and assigned the chromosome number by their sorted length. In addition to the pseudo-chromosomes, there remains 66 unplaced contigs (Sequence length>=30 Kb) (Table S6).

2.6. Genome Annotation

2.6.1. Protein-Coding Gene Annotation

Protein-coding genes were annotated by integrating transcriptomic evidence (assembled transcripts from RNA-seq) with the result from ab initio gene predictors and homologous mapping. Four stages of RNA-seq data were respectively mapped to the genome using Hisat2, and transcripts were constructed using stringtie [25]. The transcripts from all stages were further combined using TACO [26]. Transdecoder was then used to extract candidate ORFs (Open Reading Frame) from TACO results [27]. PASA (version 2.3.3) [28] was used to generate a training set for ab initio gene predictors. Augustus (version 3.3) [29] and GeneMark-ET (version 2.3) [30] were used for ab initio gene prediction. All predicted ORFs were integrated with EVM (EVidenceModeler) [31] and polished with GFFread [32] as the final set of protein-coding genes. All of these protein-coding genes were characterized for their putative function against public database by BLASTP (BLAST: Basic Local Alignment Search Tool for Proteins) (1e-5), including NR (Non-Redundant Protein Sequence Database) [33], UniProt [34], and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases [35]. Meanwhile, InterProScan was used to align protein-coding genes against the InterPro database [36] to identify protein domains. Completeness of protein-coding genes was evaluated by the BUSCO pipeline (Version 3.0.2, lineage dataset is: nematoda_odb9) [37] using both the proteins and genome mode, which may also asses the quality of the assembly. The final assembly of Bursaphelenchus mucronatus has been deposited in GenBank with accession number of VUOU00000000. Raw reads are deposited in the Sequence Read Archive (SRA) database under BioProject accession PRJNA562094, including RNA-seq, genome survey sequencing and PacBio sequencing.

2.6.2. Repeat and Non-Coding RNA Identification

RepeatModeler (version 1.0.5, http://www.repeatmasker.org/) was used to construct a species-specific repetitive sequence database. RepeatMasker (version 4.0.5, http://www.repeatmasker.org/) was subsequently used (WUBLAST engine, default parameters) to identify repetitive sequence using database from RepeatModeler.

The rRNA, miRNA, and spliceosomal RNA genes were identified using rfam_scan (version 1.2) with known sequence from Rfam [38]. The tRNAs were identified using the tRNAscan-SE (version 1.3.1) [39] with default parameters.

2.7. Evolution and Comparative Analysis

2.7.1. Reconstruction of Gene Families

In addition to the B. mucronatus proteins, we also collected the protein sequences from 11 other species on Wormbase (https://www.wormbase.org/), including Trichinella spiralis (I), Ascaris suum (III), Brugia malayi (III), Meloidogyne hapla (IV), Steinernema carpocapsae (IV), B. xylophilus (IV), Panagrellus redivivus (IV), Strongyloides ratti (IV), Parastrongyloides trichosuri (IV), Haemonchus contortus (V), Caenorhabditis elegans (V). After removing protein sequences shorter than 30 aa, or have frameshift, an All-to-All alignment of protein sequences was constructed using BLASTP (1e−5). OrthoFinder (version 2.2.7) [40] was then used to infer the orthologous genes and obtain the gene family based on the similarity of protein sequence. Café (version 3.0) [41] program was used to analyze expansion and contraction of gene families, in which the λ value was estimated by the program. The clade family number is defined as the shared family number of each species in each clade.

2.7.2. Reconstruction of Phylogenetic Tree

Single-copy genes were extracted from OrthoFinder results and concatenated for reconstruction of the phylogenetic tree. The program Clustal Omega (Version 1.2.1) [42] was used for multiple sequence alignment. The gaps in the alignment were removed using the program trimAl with “–nogaps” (version 1.4) [43]. The optimal substitution model for amino acids was estimated by ProtTest (version 3.4.2) [44]. RAxML (version 8.0.24) [45] was used to reconstruct a phylogenetic tree based on maximum likelihood method using LG+I+G+F (LG model with +I: invariable sites; +G: gamma-distributed rates and +F: observed amino acid frequencies) substitution model suggested by ProtTest with 200 bootstrap replicates. Trichinella spiralis was used as the outgroup. Finally, iTOL [46] and Evolview [47] were used to visualize and annotated the phylogenetic tree.

2.7.3. Collinear and Similarity Analysis of B. mucronatus and B. xylophilus

MCscan (https://github.com/tanghaibao/jcvi) was used to perform collinear analysis between B. xylophilus and B. mucronatus. LAST (Version 1021) [48] was used to perform the sequence alignments.

To assess the similarity of B. mucronatus and B. xylophilus, we compared the average mismatch rate of mapped reads when cross-mapping the sequencing reads of B. mucronatus and B. xylophilus to each other. BWA (Burrows-Wheeler Aligner) (Version 0.7.17-r1188) [49] was used to map sequencing reads to each reference genome. BAM (Binary Alignment Map) records that flagged as unmapped segments, secondary alignment, and supplementary alignment were filtered from mapping results using SAMtools [50]. The average mismatch rate was calculated by extracting “NM:i” tag from the BAM file to count the number mismatches and divide it by total number of mapped bases (sum of M, I, and D in the CIGAR string of BAM files). Meanwhile, the overall mapping rate was also calculated by SAMtools and compared between the two species.

2.8. Identification and Comparison of GPCR, Cellulase, and Pectin Genes of B. xylophilus and B. mucronatus

InterProScan results from gene annotation stage were used to identify GPCR, cellulase, and pectin genes in both B. xylophilus and B. mucronatus. As for GPCR genes, TMHMM and Pfam records were used to predict the transmembrane domain in the protein sequence. Proteins with seven transmembrane pattern and G Protein-Coupled Receptor domains were hypothetically identified as GPCR genes. As for cellulose and pectin genes, CAZy database [51] was used to search genes belongs to the carbohydrate active enzymes with BLASTP (1e−5).

The TPM (Transcripts Per Kilobase of exon model per Million mapped reads) of GPCRs for each gene was calculated using RSEM (Version 1.2.31) [52] with the default parameter. BINGO [53] was carried out for functional enrichment analysis.

3. Results and Discussion

3.1. Genome Assembly and Quality Assessment

Prior to the genome assembly, the genome size and heterozygosity rate were estimated using Illumina paired-end survey data. The estimated genome size based on 17-mer distribution was ~67.5 Mb (Figure S1), and the estimated heterozygosity rates was 0.257%. A total of 11 G single-molecule long reads (~160× based on estimated) were assembled using CANU. After pseudo-chromosome scaffolding using Hi-C data, we obtained a reference genome assembly of B. mucronatus, which consists of 72 super-scaffolds, including six pseudo-chromosomes (Figure 1 and Figure 2). The total size of the assembly was 73 Mb and scaffold N50 were 11.5 Mb (Table S6).

Figure 1.

Figure 1

Circos view of the Bursaphelenchus mucronatus genome. The outer grey lanes depict circular representation of pseudo-molecules. The tracks indicate the following character with window size in 100 Kb: (a) the density of genes (darker color indicates more genes), (b) TE (Transposable element) density, (c) GC (Guanine and Cytosine) content, (d) genome coverage, (e) transcriptome coverage (from inner to outer indicate the different stage: adult, L2 (Second instar larvae), L3 (Third instar larvae), L4. (Fourth instar larvae). All values are normalized by Z-score).

Figure 2.

Figure 2

Hi-C contact heatmap of pseudo-molecules. The blocks represent signal associated with the two contact positions. All six blocks are evaluated by the genome coverage of Hi-C library of HindIII and DpnII.

To assess the integrity and accuracy of the assembly, Illumina genome survey data and PacBio data were mapped to the assembly; the genome coverage was 97.11% and 95.68%, indicating that the assembly was relatively complete. The coverage map is showed in Figure S2. The number of complete and single-copy BUSCOs (Benchmarking Universal Single-Copy Orthologs) were 760 (77.4%) and 680 (69.2%), respectively, using protein and genomic models, which were comparable with B. xylophilus (Table S7). Meanwhile, the average transcriptomic-mapping rate was over 95% (Table S8), which also demonstrates that the protein-coding gene regions of the genome were also relatively complete.

3.2. Repeat and Non-Coding RNA Identification

The genome of B. mucronatus contained 21.74% repeat sequences (Table S9), similar to B. xylophilus [18]. We also identified 626 non-coding RNA (ncRNA) including 389 transfer RNAs (tRNAs), 183 ribosomal RNAs (rRNAs), 18 microRNAs (miRNAs), and 36 spliceosomal RNAs (Table S10).

3.3. Gene Annotation and Functional Analysis

A total of 13,696 non-redundant protein-coding genes were identified (Table S11). The average gene length is 3,564.61 bp and gene density is 187.48/Mb. The average exon length is 229.68 bp, exon GC (guanine and cytosine) content is 45.55%. We also performed functional analysis using mainstream public database and added annotations to the identified genes (Table S12). About 42% (5,761) and 32% (4,391) of genes were assigned to the GO (Gene Ontology) terms and KO (KEGG Orthology) identifiers, respectively. More than 60% of the genes were assigned to Pfam (11,317), NR (10,553) and UniProt (8,353). Among the gene set, 4,496 proteins contain a transmembrane domain from TMHMM annotation; these genes are candidates for receptor genes, such as GPCRs, which may sense external stimuli and chemical communication between populations [54,55].

3.4. Construction of Gene families

A total of 11 species were selected from Wormbase to perform gene family reconstruction using OrthoFinder (see method 2.7.1). Results showed that 252,937 protein sequences were classified into 14,951 families, of which B. xylophilus and B. mucronatus possessed 10,092 and 9,371 families, respectively. Results showed that Clade IV contains most of the species and possessed 4343 gene families, while Clade III and Clade V had 7,144 and 7,781 gene families, respectively. Clade IV shared 3,923 with Clade III and 4,101 with Clade V. These three clades shared 3756 gene families (Figure 3) that were enriched in the basic metabolic and biological process such as gene expression (p = 2.3e−22), RNA metabolic process (p = 7.5e−13), macromolecule biosynthetic process (p = 9.7e−11), structural molecule activity (p = 7.8e−12), ribonucleotide binding (p = 1.7e−9) and nucleotide binding (p = 3e−9) (Figure S3, Figure S4, Figure S5 and Table S13), which may be the fundamental functional genes for nematodes.

Figure 3.

Figure 3

Gene family numbers of three nematode clades. The number of gene families for Clade IV shared 3923 with Clade III and 4101 with Clade V. The three clades shared 3756 gene families.

3.5. Genome Phylogeny of B. mucronatus

From OrthoFinder results, we collected 497 conserved single-copy genes in 12 species for reconstructing the phylogenetic tree. T. spiralis was selected as the outgroup to determine the root of the phylogenetic tree (Figure S6). The branch length of the topological tree indicated the expected value of substitutions per site. As shown in the phylogenetic tree, B. xylophilus and B. mucronatus were clustered together as neighborhood, which indicates that these two species were highly similar. The branch length of B. xylophilus and B. mucronatus were 0.0129 and 0.0202, respectively. This may indicate that mutations in the B. mucronatus may go a little bit further since last separation with B. xylophilus.

3.6. Dynamics of the Gene Families

The dynamic change of gene families is a manifestation of species adaptive evolution at the gene level. In Figure 4, we showed the expansion/contraction of gene families on each branch of the species tree. Family expansions and contractions in Clade III and Clade V were 294 expansions vs. 242 contractions in Clade III and 575 expansions vs. 194 contractions in Clade V, respectively. As for Clade IV, each branch had a different number of gene family expansions vs. contractions. Looking at the numbers between B. xylophilus and B. mucronatus, B. xylophilus had 1,116 family expansions (4,462 genes) vs. 98 family contractions (215 genes), while B. mucronatus has 441 family expansions (1,417 genes) vs. 422 family contractions (1,184 genes). This may indicate that, in comparison with gene family enhancement in B. xylophilus, B. mucronatus are likely losing more species-specific genes, which might result in a relative lower ability to adopt to specific factor in environment than B. xylophilus.

Figure 4.

Figure 4

Phylogeny and gene families of B. mucronatus and related species. The phylogenetic relationship was inferred by RAxML, based on orthologous proteins of the 497 single-copy gene families. The circle size represents gene number, and the different colors indicate different gene types showed in the right top of the figure. The numbers above (red) and below (blue) indicate number of expansed and contracted gene families, respectively. The colors of the strips within the phylogenetic tree indicate different classes of the nematodes.

Moreover, by looking at the functional enrichment of the gene families, we found that the family contractions of B. mucronatus were mainly enriched in transmembrane transport related processes such as transmembrane transport (Biological process, p = 6e−5), membrane (Cellular component, p = 5e−6), and transmembrane receptor activity (Molecular function, p = 1e−4), which may indicate B. mucronatus have weaker ability to respond to external stimuli (Figure S7, Figure S8, Figure S9, Table S14). Interestingly, in biological process, the G-protein coupled receptor (GPCR) protein signaling pathway was significantly enriched (p = 7e−4). Since GPCRs were related to the ability in the process of pheromone binding to the receptor or chemical communication, which is crucial with activities of nematodes, this might contribute to the displacement of B. mucronatus by B. xylophilus (Figure S7, Table S14).

The family expansions of B. xylophilus were mainly enriched in proteolysis (p = 2.2829e−28), peptidase activity (p = 2.0682e−30), and extracellular region (p = 3.0338e−13) (Figure S10, Figure S11, Figure S12, and Table S15). Interestingly, cellulase activity (p = 2e−7) showed significant enrichment in molecular function, in particular, the plant cell wall degradation (Figure S12, Table S15). We further annotated carbohydrate active enzymes (CAZymes) in both B. mucronatus and B. xylophilus. We found that GH45 family of cellulase was contracted in B. mucronatus (7 genes) while expanded in B. xylophilus (11 genes). Moreover, the PL3 family of pectinase showed a similar situation with GH4. In B. mucronatus, there were five genes, while in B. xylophilus there were 22 (Table S16). These results may indicate a considerable shift in the ability of plant cell wall degradation between B. mucronatus and B. xylophilus and, thus, may lead to weaker pathogenicity in B. mucronatus.

3.7. Genome Comparison of B. mucronatus and B. xylophilus

By analyzing the collinear and homology segments (Sequence length≥1 Mb) of the B. mucronatus and B. xylophilus using MCscan, we found that there are 183 homologous blocks between B. xylophilus and B. mucronatus, with a total length of about 44.86 Mb and a similarity of homologous segments of 88.71%. Genome structure showed high consensus of synteny between B. xylophilus and B. mucronatus (Figure 5), but the cross-mapping showed that the sequence varied greatly on the genome level and even greater on gene regions (Table S17). The synteny confirms that they originate from the same ancestor; however, their sequences have changed a lot before great structural variations taken place.

Figure 5.

Figure 5

Genome synteny between B. mucronatus and B. xylophilus. Genomic alignments between B. xylophilus (Sequence length >1 Mb) and B. mucronatus showed high similarity of synteny. The alignments regions are marked with grey ribbons.

By analyzing the collinear and homology segments (Sequence length≥1 Mb) of the B. mucronatus and B. xylophilus using MCscan, we found that there are 183 homologous blocks between B. xylophilus and B. mucronatus, with a total length of about 44.86 Mb and a similarity of homologous segments of 88.71%. Genome structure showed high consensus of synteny between B. xylophilus and B. mucronatus (Figure 5), but the cross-mapping showed that the sequence varied greatly on the genome level and even greater on gene regions (Table S17). The synteny confirms that they originate from the same ancestor; however, their sequences have changed a lot before great structural variations taken place.

From the gene family results, we allocated all genes into three types, including species-specific orthologous clusters genes, un-clustered genes, and orphan genes as described in a previous study [56,57]. For B. mucronatus, there are 42 species-specific orthologous clusters genes, 241 un-clustered genes and 758 orphan genes, while the number in B. xylophilus is 194, 1,301, and 1,340. Functional analysis on species-specific orthologous clusters genes show that in B. mucronatus they mainly have kinase related function (PF00069|PF07714) (Table S18), while B. xylophilus have more than 10 varied conserved domains (Table S19). Interestingly, we found a 7TM GPCR, which belongs to the srh family in B. xylophilus species-specific orthologous clusters genes.

To investigate the possibility of gene variations which may contribute to the separation of the two species, we set three sequence identity cutoffs including (high, 0.5–0.75; medium, 0.33–0.50; and low, <0.33) to find out the number of ortholog pairs between the two species that have lower identity below the cutoff. We got 1,239, 354, and 104 genes in B. mucronatus and 1,633, 473, and 167 genes in B. xylophilus under each cutoff, respectively. Functional enrichment analysis results showed that in B. xylophilus, genes with medium identity were enriched in several stress responding processes including: responding to DNA damage stimulus stress, responding to stress, DNA repair, cellular response to stress, and cellular response to stimulus (Table S20). The gene function under the other two cutoffs of two species was mainly a basic biological process (Table S21). This result indicates that orthologs of B. xylophilus in medium identity may contribute to its stronger environment adaption than B. mucronatus, which promotes its displacement of B. mucronatus during the invasion process.

3.8. GPCR Contractions May Be the Reason That B. mucronatus Was Displaced by B. xylophilus

We further extracted the GPCR genes of the two nematodes according to the Pfam and TMHMM results. In total, 551 GPCRs were found in B. mucronatus (Table S22 and Table S23) and 596 GPCRs were found in B. xylophilus (Table S24 and Table S25).

We further classified GPCRs based on Pfam annotation, including rhodopsin (104/106, B. mucronatus/B. xylophilus), secretin (6/5), srbc (2/2), srd (76/79), srh (82/98), sri (1/4), srj (3/3), srsx (42/44), srt (30/31), srv (5/7), srw (16/15), srx (14/16), str (117/125), class ab chemoreceptor (53/61). This number distribution (Figure S13) indicates that some type of GPCRs may be losing diversity in B. mucronatus, which may indicate overall reduced sensitivity in response to chemical stimulation. Combined with gene family results, we found that the number of single-copy genes, srn-1(srh family), whose function in C. elegans is amphid sensory neurons [58] is much higher in B. xylophilus (22 genes) than in B. mucronatus (13 genes). This result may indicate that B. xylophilus has better migration ability than B. mucronatus, which confirms the previous phenotypic experiments [59]. Meanwhile, npr-1 (rhodopsin family), the gene function of feeding behavior, aerotaxis, thermal avoidance, ethanol tolerance, innate immunity in C. elegans [60], has single copy in B. xylophilus and no ortholog in B. mucronatus. The npr-5 (rhodopsin family), whose function is related to fat storage and dauer formation in C. elegans [60], has the same situation as npr-1. The loss of the npr genes may lead to weaker stress resistance of B. mucronatus. When the environment is poor and the food is lacking, this may lead B. mucronatus to enter the dauer formation stage at a much slower speed, after the quickly-responding B. xylophilus and the weaker fat synthesis and storage capacity will lead to death. However, B. xylophilus will quickly enter the dauer formation stage, the fat particles in the body increase in order to endure the stress. Furthermore, we find that the daf-38 (rhodopsin family) that was mediating ascaroside perception in C. elegans has five copies in B. xylophilus, and two copies in B. mucronatus, which may result in a weaker ability to conduct chemical communications within B. mucronatus populations.

In a recent study, we found that displacement between B. xylophilus and B. mucronatus was facilitated by ascarosides [61]. The two nematode species reacted differently to the same concentration of ascarosides. Low concentration of ascarosides could promote the growth and the length of the female body of B. xylophilus, while the same pheromone had no effect on the B. mucronatus [61].

We also checked the RNA-seq data for any evidence to support this hypothesis. Because of no biological replicate and potential wild sampling bias, we simply used mixed age RNA-seq of B. xylophilus as well as B. mucronatus to compare their overall expression levels. The results showed that the overall expression level of all GPCRs of B. xylophilus is relatively higher than B. mucronatus (Table S26, Table S27, Figure S14). This indicates that during the development stages, in addition to the copy number expansion, the expression level may also contribute to a higher quantity of GPCRs in B. xylophilus, which may lead to higher sensitivity to external pheromones stimuli in B. xylophilus.

4. Conclusions

In this study, we introduced the first chromosome-level B. mucronatus genome using PacBio and Hi-C technology, which brings a valuable genomic resource for molecular and evolutionary studies of B. mucronatus, especially a fundamental resource for studying the competitive displacement by the pinewood nematode B. xylophilus. With this new resource, previous comparative studies that focused on biological characteristics between B. mucronatus and B. xylophilus [7,11,12,13] could be extended to the genomic and molecular level in the near future. Moreover, the comparative genomic analysis would also finally decipher the code behind the competitive displacement between these two nematodes, which is the key role to understand and control the pathogenicity of pine wilt diseases.

In this study, we identified ‘Species-identity-specific genes’ from the orthologs under different identity cutoffs. Functional enrichment analysis results showed that medium identity of B. xylophilus genes are enriched in several stress responding process. Meanwhile, species-identity-specific genes of the B. mucronatus are mainly enriched in fundamental protein biological processes. This result may indicate that the sequence difference in orthologues may contribute to the speciation process.

As shown in the phylogenetic tree, B. xylophilus and B. mucronatus were clustered together as neighborhood, which indicates that these two species were highly similar. However, we noticed that the phylogenetic tree showed slightly different structure from previous nematode study [62]. This may be due to the different datasets used for tree inferences. In prior to the whole genome, mitochondrial genes were often used [63]. In our study, all coding genes were used for the tree inference. We also checked and confirmed the similar situation in some other studies [64,65]. We cannot justify which one is more correct, but this at least indicates different mutation rates on the mitochondrion and the whole genome.

The gene family contraction of B. mucronatus and the gene family expansion of B. xylophilus showed that some type of GPCRs may be missing or losing diversity in B. mucronatus, which may result in a weaker ability of B. mucronatus in the process of pheromone binding to the receptor or chemical communication and pathogenicity compared with B. xylophilus. In previous studies, it was found that B. mucronatus was the weaker contender in the competitive displacement by B. xylophilus [12,66]. From the RNA-seq results, it is implied that higher sensitivity of GPCRs in B. xylophilus makes it easier for the signal transduction to reach its saturation. This may also trigger the feedback regulation on expression of GPCRs once saturated [67], which eventually reduce the expression levels of GPCRs and, thus, lower its sensitivity to external pheromones stimuli. On the opposite, the lower expression levels of GPCRs in B. mucronatus may lower its sensitivity and raise the threshold of ascarosides, thus, making it harder to response to external stimuli, which may result in a null effect on fecundity or even suppression. This may indicate that the gene family expansion and gene regulation may be responsible for the higher baseline expression level of GPCRs in B. xylophilus than in B. mucronatus. Overall, the enhanced chemical sensing of ascarosides in B. xylophilus and diminished ability in B. mucronatus, which result in physiological changes that favor the enhanced fecundity and length growth of B. xylophilus, may be an explanation, and indicate that the displacement of B. mucronatus could be a chemical communications-driven procedure.

Overall, we introduced a high quality reference genome for B. mucronatus, compared it with the released reference genome of B. xylophilus [18], and discussed the phenomenon of displacement from the view of gene families, functions, and expressions. In the process of invasion by alien species, chemical communication plays a critical role in the displacement between invasive and indigenous species [68]. In the B. xylophilusB. mucronatus system, we believe that differences in the contraction (B. mucronatus) and expansion (B. xylophilus) of the GPCR families may be related to the different responses to ascarosides, which is one of the key factors in their apparent displacement. With these changes, B. mucronatus may be less sensitive to ascarosides than B. xylophilus. Meanwhile, our research helps to reveal that the internal essence of the displacement between these two species is likely highly facilitated by chemical communication, which potentially provides new perspectives for pine wilt disease control, and mitigating the negative impacts of such biological invasion.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/11/5/570/s1, Figure S1: GenomeScope k-mer frequency and coverage profiles of genome survey data (k = 17). Figure S2: Hi-C contact heatmap with PacBio and Illumina data evaluated of pseudo-molecules. The largest scale of PacBio data and Illumina data are 120 and 150, respectively. Figure S3: Biological process enrichment of shared gene family in three Clades. The annotation background is whole gene GO annotation result from BINGO. Figure S4: Cellular component enrichment of shared gene family in three Clades. The annotation background is whole gene GO annotation result from BINGO. Figure S5: Molecular function enrichment of shared gene family in three Clades. The annotation background is whole gene GO annotation result from BINGO. Figure S6: Phylogenetic tree of B. mucronatus and related species. The phylogeny was constructed based on the protein sequences of the 497 single-copy genes. The values above the branches indicate the branch (solid line) lengths. Figure S7: Biological process enrichment of the contracted genes families of B. mucronatus. Figure S8: Cellular component enrichment of the contracted genes families of B. mucronatus. Figure S9: Molecular function enrichment of the contracted genes families of B. mucronatus. Figure S10: Biological process enrichment of the expanded families of B. xylophilus. Figure S11: Cellular component enrichment of the expanded families of B. xylophilus. Figure S12: Molecular function enrichment of the expanded families of B. xylophilus. Figure S13: Distribution of GPCR subfamily in B. xylophilus and B. mucronatus. Figure S14: Boxplot of GPCRs expression in B. xylophilus and B. mucronatus. The average expression of B. xylophilus is higher than B. mucronatus. Table S1: Statistics of genome survey data of B. mucronatus. The 4-peak model was used to estimate the heterozygosity, genome haploid length (bp), genome repeat length (bp), genome unique length (bp) and read error rate, which was converged in 4 rounds. Table S2: Statistics of PacBio sequencing data of B. mucronatus. Table S3: Statistics of different parameter combination of Canu assembly. Spec.file is the specific parameter in each round. Repeat-dove reads are short for dovetail, where the ends of reads align fully. Repeat-cont are short for contained, where a read aligns fully within another read. Uniq-repeat-cont reads are fully contained in other read. Uniq-repeat-dove reads are not fully contained in other read, which was used to form initial contigs. Span-repeat is the spanning repeat reads. Uniq-anchor reads are used for anchoring contig. Contigs are the initial contig of Canu assembly. Consensus is the final contig assembly of Canu. Table S4: Statistics of B. mucronatus contig feature. Table S5: Statistics of Hi-C sequencing data of B. mucronatus. Table S6: Statistics of B. mucronatus genome feature. Table S7: The result of BUSCO. Table S8: The mapping rate of RNA-seq data. Table S9: Summary of repeat elements identified in the B. mucronatus genome. LINEs: long interspersed elements, SINEs: short interspersed elements, LTR elements: long terminal repeat, DNA elements: DNA type repeat. Table S10: Non-coding RNAs in the B. mucronatus genome. Table S11: Statistics of B. mucronatus gene annotation. Table S12: Function annotation of B. mucronatus. Table S13: Functional enrichment of shared gene family in three Clades. Table S14: Functional enrichment of contraction family of B. mucronatus. Table S15: Functional enrichment of expansion family of B. xylophilus. Table S16: Statistics of cellulase family of B. xylophilus and B. mucronatus. Table S17: Statistics of average error rate of cross-mapping between B. mucronatus and B. xylophilus. Table S18: Functional annotation of species-specific cluster genes of B. mucronatus. Table S19: Functional annotation of species-specific cluster genes of B. xylophilus. Table S20: Functional enrichment of median identity ortholog genes of B. xylophilus. Table S21: Functional enrichment of different identity cutoff of ortholog genes in B. xylophilus and B. mucronatus. Table S22: Pfam annotation of GPCR genes of B. mucronatus. Table S23: TMHMM annotation of GPCR genes of B. mucronatus. Table S24: Pfam annotation of GPCR genes of B. xylophilus. Table S25: TMHMM annotation of GPCR genes of B. xylophilus. Table S26: The expression of GPCRs in B. xylophilus. Table S27: The expression of GPCRs in B. mucronatus.

Author Contributions

S.W. (Shuangyang Wu), S.H., L.Z., and J.S. conceived the study; S.W. (Shuangyang Wu), S.G., S.W. (Sen Wang), H.Y. and J.M. performed data analysis and experiments. S.W. (Shuangyang Wu), J.W., S.G., S.W. (Sen Wang), and J.M. draft manuscript. X.T., Y.X., and S.L. perform the sequencing library and format manuscript. S.W. (Shuangyang Wu), S.G., J.W., and J.S. revised manuscript. All authors read and approved the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the Natural Science Foundation of China (31630013) and the Frontier Science Key Project of the Chinese Academy of Sciences (QYZDJ-SSW-SMC024).

Conflicts of Interest

The authors declare that they have no competing interests

References

  • 1.Zhao L., Mota M., Vieira P., Butcher R.A., Sun J. Interspecific communication between pinewood nematode, its insect vector, and associated microbes. Trends Parasitol. 2014;30:299–308. doi: 10.1016/j.pt.2014.04.007. [DOI] [PubMed] [Google Scholar]
  • 2.Zhao L., Zhang X., Wei Y., Zhou J., Zhang W., Qin P., Chinta S., Kong X., Liu Y., Yu H., et al. Ascarosides coordinate the dispersal of a plant-parasitic nematode with the metamorphosis of its vector beetle. Nat. Commun. 2016;7:12341. doi: 10.1038/ncomms12341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Guiran G.D., Bruguier N. Hybridization and Phylogeny of the Pine Wood Nematode (Bursaphelenchus Spp.) Nematologica. 1989;35:321–330. doi: 10.1163/002825989X00421. [DOI] [Google Scholar]
  • 4.Yan X., Cheng X.Y., Wang Y.S., Luo J., Mao Z.C., Ferris V.R., Xie B.Y. Comparative transcriptomics of two pathogenic pinewood nematodes yields insights into parasitic adaptation to life on pine hosts. Gene. 2012;505:81–90. doi: 10.1016/j.gene.2012.05.041. [DOI] [PubMed] [Google Scholar]
  • 5.Pereira F., Moreira C., Fonseca L., van Asch B., Mota M., Abrantes I., Amorim A. New insights into the phylogeny and worldwide dispersion of two closely related nematode species, Bursaphelenchus xylophilus and Bursaphelenchus mucronatus. PLoS ONE. 2013;8:e56288. doi: 10.1371/journal.pone.0056288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mamiya Y., Enda N. Bursaphelenchus Mucronatus N. Sp. (Nematoda: Aphelenchoididae) from pine wood and its biology and pathogenicity to pine trees. Nematologica. 1979;25:353–361. doi: 10.1163/187529279X00091. [DOI] [Google Scholar]
  • 7.Niu H.T., Zhao L.L., Sun J.H. Phenotypic plasticity of reproductive traits in response to food availability in invasive and native species of nematode. Biol. Invasions. 2013;15:1407–1415. doi: 10.1007/s10530-012-0379-6. [DOI] [Google Scholar]
  • 8.Nickle W.R., Golden A.M., Mamiya Y., Wergin W.P. On the Taxonomy and Morphology of the Pine Wood Nematode, Bursaphelenchus xylophilus (Steiner &Buhrer 1934) Nickle 1970. J. Nematol. 1981;13:385–392. [PMC free article] [PubMed] [Google Scholar]
  • 9.Mamiya Y., Kiyohara T., Mamiya Y., Kiyohara T. Description of Bursaphelenchus Lignicolus N. Sp. (Nematoda: Aphelenchoididae) From Pine Wood and Histopathology of Nematode-Infested Trees. Nematologica. 1972;18:120–124. doi: 10.1163/187529272X00296. [DOI] [Google Scholar]
  • 10.Kishi Y. Pine Wood Nematode and the Japanese Pine Sawyer. Thomas Company; Tokyo, Japan: 1995. [Google Scholar]
  • 11.Cheng X.Y., Xie P.Z., Cheng F.X., Xu R.M., Xie B.Y. Competitive displacement of the native species Bursaphelenchus mucronatus by an alien species Bursaphelenchus xylophilus (Nematoda: Aphelenchida: Aphelenchoididae): A case of successful invasion. Biol. Invasions. 2009;11:205–213. doi: 10.1007/s10530-008-9225-2. [DOI] [Google Scholar]
  • 12.Jikumaru S., Togashi K. Inhibitory Effect of Bursaphelenchus mucronatus (Nematoda: Aphelenchoididae) on B. xylophilus Boarding Adult Monochamus alternatus (Coleoptera: Cerambycidae) J. Nematol. 2004;36:95–99. [PMC free article] [PubMed] [Google Scholar]
  • 13.Vincent B., Altemayer V., Rouxmorabito G., Naves P., Sousa E., Lieutier F. Competitive interaction between Bursaphelenchus xylophilus and the closely related species Bursaphelenchus mucronatus. Nematology. 2008;10:219–230. [Google Scholar]
  • 14.Wan F., Yin C., Tang R., Chen M., Wu Q., Huang C., Qian W., Rota-Stabelli O., Yang N., Wang S., et al. A chromosome-level genome assembly of Cydia pomonella provides insights into chemical ecology and insecticide resistance. Nat. Commun. 2019;10:4237. doi: 10.1038/s41467-019-12175-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Wu N., Zhang S., Li X., Cao Y., Liu X., Wang Q., Liu Q., Liu H., Hu X., Zhou X.J., et al. Fall webworm genomes yield insights into rapid adaptation of invasive species. Nat. Ecol. Evol. 2019;3:105–115. doi: 10.1038/s41559-018-0746-5. [DOI] [PubMed] [Google Scholar]
  • 16.Kingan S.B., Urban J., Lambert C.C., Baybayan P., Childers A.K., Coates B., Scheffler B., Hackett K., Korlach J., Geib S.M. A high-quality genome assembly from a single, field-collected spotted lanternfly (Lycorma delicatula) using the PacBio Sequel II system. Gigascience. 2019;8:10. doi: 10.1093/gigascience/giz122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hunt V.L., Tsai I.J., Coghlan A., Reid A.J., Holroyd N., Foth B.J., Tracey A., Cotton J.A., Stanley E.J., Beasley H., et al. The genomic basis of parasitism in the Strongyloides clade of nematodes. Nat. Genet. 2016;48:299–307. doi: 10.1038/ng.3495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kikuchi T., Cotton J.A., Dalzell J.J., Hasegawa K., Kanzaki N., McVeigh P., Takanashi T., Tsai I.J., Assefa S.A., Cock P.J., et al. Genomic insights into the origin of parasitism in the emerging plant pathogen Bursaphelenchus xylophilus. PLoS Pathog. 2011;7:e1002219. doi: 10.1371/journal.ppat.1002219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Marcais G., Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27:764–770. doi: 10.1093/bioinformatics/btr011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Vurture G.W., Sedlazeck F.J., Nattestad M., Underwood C.J., Fang H., Gurtowski J., Schatz M.C. GenomeScope: Fast reference-free genome profiling from short reads. Bioinformatics. 2017;33:2202–2204. doi: 10.1093/bioinformatics/btx153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Koren S., Walenz B.P., Berlin K., Miller J.R., Bergman N.H., Phillippy A.M. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Walker B.J., Abeel T., Shea T., Priest M., Abouelliel A., Sakthikumar S., Cuomo C.A., Zeng Q., Wortman J., Young S.K., et al. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Durand N.C., Robinson J.T., Shamim M.S., Machol I., Mesirov J.P., Lander E.S., Aiden E.L. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 2016;3:1. doi: 10.1016/j.cels.2015.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Dudchenko O., Batra S.S., Omer A.D., Nyquist S.K., Hoeger M., Durand N.C., Shamim M.S., Machol I., Lander E.S., Aiden A.P., et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017;356:92–95. doi: 10.1126/science.aal3327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Pertea M., Kim D., Pertea G.M., Leek J.T., Salzberg S.L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 2016;11:1650–1667. doi: 10.1038/nprot.2016.095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Niknafs Y.S., Pandian B., Iyer H.K., Chinnaiyan A.M., Iyer M.K. TACO produces robust multisample transcriptome assemblies from RNA-seq. Nat. Methods. 2017;14:68–70. doi: 10.1038/nmeth.4078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Haas B.J., Papanicolaou A., Yassour M., Grabherr M., Blood P.D., Bowden J., Couger M.B., Eccles D., Li B., Lieber M., et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 2013;8:1494–1512. doi: 10.1038/nprot.2013.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Haas B.J., Delcher A.L., Mount S.M., Wortman J.R., Smith R.K., Jr., Hannick L.I., Maiti R., Ronning C.M., Rusch D.B., Town C.D., et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 2003;31:5654–5666. doi: 10.1093/nar/gkg770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Stanke M., Diekhans M., Baertsch R., Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008;24:637–644. doi: 10.1093/bioinformatics/btn013. [DOI] [PubMed] [Google Scholar]
  • 30.Borodovsky M., Mills R., Besemer J., Lomsadze A. Prokaryotic gene prediction using GeneMark and GeneMark.hmm. Curr. Protoc. Bioinform. 2003;1:4.5.1–4.5.16. doi: 10.1002/0471250953.bi0405s01. [DOI] [PubMed] [Google Scholar]
  • 31.Haas B.J., Salzberg S.L., Zhu W., Pertea M., Allen J.E., Orvis J., White O., Buell C.R., Wortman J.R. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008;9:R7. doi: 10.1186/gb-2008-9-1-r7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Trapnell C., Roberts A., Goff L., Pertea G., Kim D., Kelley D.R., Pimentel H., Salzberg S.L., Rinn J.L., Pachter L., et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Coordinators N.R. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2018;46:D8–D13. doi: 10.1093/nar/gkx1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.The UniProt C. UniProt: The universal protein knowledgebase. Nucleic Acids Res. 2017;45:D158–D169. doi: 10.1093/nar/gkw1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kanehisa M., Furumichi M., Tanabe M., Sato Y., Morishima K. KEGG: New perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;45:D353–D361. doi: 10.1093/nar/gkw1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Finn R.D., Attwood T.K., Babbitt P.C., Bateman A., Bork P., Bridge A.J., Chang H.Y., Dosztanyi Z., El-Gebali S., Fraser M., et al. InterPro in 2017-beyond protein family and domain annotations. Nucleic Acids Res. 2017;45:D190–D199. doi: 10.1093/nar/gkw1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Waterhouse R.M., Seppey M., Simao F.A., Manni M., Ioannidis P., Klioutchnikov G., Kriventseva E.V., Zdobnov E.M. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 2018;35:543–548. doi: 10.1093/molbev/msx319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kalvari I., Argasinska J., Quinones-Olvera N., Nawrocki E.P., Rivas E., Eddy S.R., Bateman A., Finn R.D., Petrov A.I. Rfam 13.0: Shifting to a genome-centric resource for non-coding RNA families. Nucleic Acids Res. 2018;46:D335–D342. doi: 10.1093/nar/gkx1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lowe T.M., Chan P.P. tRNAscan-SE On-line: Integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016;44:W54–W57. doi: 10.1093/nar/gkw413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Emms D.M., Kelly S. OrthoFinder: Solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015;16:157. doi: 10.1186/s13059-015-0721-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Han M.V., Thomas G.W., Lugo-Martinez J., Hahn M.W. Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol. Biol. Evol. 2013;30:1987–1997. doi: 10.1093/molbev/mst100. [DOI] [PubMed] [Google Scholar]
  • 42.Sievers F., Higgins D.G. Clustal Omega for making accurate alignments of many protein sequences. Protein Sci. 2018;27:135–145. doi: 10.1002/pro.3290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Capella-Gutierrez S., Silla-Martinez J.M., Gabaldon T. trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Darriba D., Taboada G.L., Doallo R., Posada D. ProtTest 3: Fast selection of best-fit models of protein evolution. Bioinformatics. 2011;27:1164–1165. doi: 10.1093/bioinformatics/btr088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Letunic I., Bork P. Interactive Tree of Life (iTOL) v4: Recent updates and new developments. Nucleic Acids Res. 2019;47:W256–W259. doi: 10.1093/nar/gkz239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Subramanian B., Gao S., Lercher M.J., Hu S., Chen W.H. Evolview v3: A webserver for visualization, annotation, and management of phylogenetic trees. Nucleic Acids Res. 2019;47:W270–W275. doi: 10.1093/nar/gkz357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Kielbasa S.M., Wan R., Sato K., Horton P., Frith M.C. Adaptive seeds tame genomic sequence comparison. Genome Res. 2011;21:487–493. doi: 10.1101/gr.113985.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Li H., Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Lombard V., Golaconda Ramulu H., Drula E., Coutinho P.M., Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42:D490–D495. doi: 10.1093/nar/gkt1178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Li B., Dewey C.N. RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Maere S., Heymans K., Kuiper M. BiNGO: A Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005;21:3448–3449. doi: 10.1093/bioinformatics/bti551. [DOI] [PubMed] [Google Scholar]
  • 54.Tran A., Tang A., O’Loughlin C.T., Balistreri A., Chang E., Coto Villa D., Li J., Varshney A., Jimenez V., Pyle J., et al. C. elegans avoids toxin-producing Streptomyces using a seven transmembrane domain chemosensory receptor. Elife. 2017;6:e23770. doi: 10.7554/eLife.23770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Tan F.J., Fire A.Z., Hill R.B. Regulation of apoptosis by C. elegans CED-9 in the absence of the C-terminal transmembrane domain. Cell Death Differ. 2007;14:1925–1935. doi: 10.1038/sj.cdd.4402215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Rodelsperger C., Prabh N., Sommer R.J. New Gene Origin and Deep Taxon Phylogenomics: Opportunities and Challenges. Trends Genet. 2019;35:914–922. doi: 10.1016/j.tig.2019.08.007. [DOI] [PubMed] [Google Scholar]
  • 57.Prabh N., Roeseler W., Witte H., Eberhardt G., Sommer R.J., Rodelsperger C. Deep taxon sampling reveals the evolutionary dynamics of novel gene families in Pristionchus nematodes. Genome Res. 2018;28:1664–1674. doi: 10.1101/gr.234971.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Vidal B., Aghayeva U., Sun H.S., Wang C., Glenwinkel L., Bayer E.A., Hobert O. An atlas of Caenorhabditis elegans chemoreceptor expression. PLoS Biol. 2018;16:e2004218. doi: 10.1371/journal.pbio.2004218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Son J.A., Jung C.S., Han H.R. Migration and Attacking Ability of Bursaphelenchus mucronatus in Pinus thunbergii Stem Cuttings. Plant Pathol. J. 2016;32:340–346. doi: 10.5423/PPJ.OA.01.2016.0025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Frooninckx L., Rompay L.V., Temmerman L., Sinay E.V., Beets I., Janssen T., Husson S.J., Schoofs L. Neuropeptide GPCRs in C. elegans. Front. Endocrinol. (Lausanne) 2012;3:167. doi: 10.3389/fendo.2012.00167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Meng J., Wickham J.D., Ren W., Zhao L., Sun J. Species displacement facilitated by ascarosides between two sympatric sibling species: A native and invasive nematode. J. Pest Sci. 2020;93:1059–1071. doi: 10.1007/s10340-020-01206-w. [DOI] [Google Scholar]
  • 62.Smythe A.B., Holovachov O., Kocot K.M. Improved phylogenomic sampling of free-living nematodes enhances resolution of higher-level nematode phylogeny. BMC Evol. Biol. 2019;19:121. doi: 10.1186/s12862-019-1444-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.van den Elsen S., Holovachov O., Karssen G., van Megen H., Helder J., Bongers T., Bakker J., Holterman M., Mooyman P. A phylogenetic tree of nematodes based on about 1200 full-length small subunit ribosomal DNA sequences. Nematology. 2009;11:927–950. doi: 10.1163/156854109X456862. [DOI] [Google Scholar]
  • 64.Desjardins C.A., Cerqueira G.C., Goldberg J.M., Dunning Hotopp J.C., Haas B.J., Zucker J., Ribeiro J.M., Saif S., Levin J.Z., Fan L., et al. Genomics of Loa loa, a Wolbachia-free filarial parasite of humans. Nat. Genet. 2013;45:495–500. doi: 10.1038/ng.2585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.International Helminth Genomes Consortium Comparative genomics of the major parasitic worms. Nat. Genet. 2019;51:163–174. doi: 10.1038/s41588-018-0262-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Rutherford T.A., Riga E., Webster J.M. Temperature-mediated Behavioral Relationships in Bursaphelenchus xylophilus, B. mucronatus, and Their Hybrids. J. Nematol. 1992;24:40–44. [PMC free article] [PubMed] [Google Scholar]
  • 67.Ji N., Middelkoop T.C., Mentink R.A., Betist M.C., Tonegawa S., Mooijman D., Korswagen H.C., van Oudenaarden A. Feedback control of gene expression variability in the Caenorhabditis elegans Wnt pathway. Cell. 2013;155:869–880. doi: 10.1016/j.cell.2013.09.060. [DOI] [PubMed] [Google Scholar]
  • 68.Raw J.L., Miranda N.A., Perissinotto R. Chemical cues released by an alien invasive aquatic gastropod drive its invasion success. PLoS ONE. 2013;8:e64071. doi: 10.1371/journal.pone.0064071. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Genes are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES