Abstract
Wolbachia, a gram-negative α-proteobacterium, is an endosymbiont found in some arthropods and nematodes. Diaphorina citri Kuwayama, the vector of ‘Candidatus Liberibacter asiaticus’ (CLas), is naturally infected with a strain of Wolbachia (wDi), which has been shown to colocalize with CLas, the bacterial pathogen associated with huanglongbing (HLB) disease of citrus. The relationship between wDi and CLas is poorly understood, in part because the complete genome of wDi has not been available. Using high-quality long-read PacBio circular consensus sequences, we present a complete circular wDi genome, the largest among supergroup-B members. The assembled circular chromosome is 1.52 megabases with 95.7% genome completeness and 1.45% contamination, as assessed by checkM. We identified Insertion Sequences (ISs) and prophage genes scattered throughout the genome. The proteins were annotated using Pfam, eggNOG, and COG, which assigned unique domains and functions. The wDi genome was compared with previously sequenced Wolbachia genomes using pangenome and phylogenetic analyses. The availability of a complete circular chromosome of wDi will facilitate understanding of its role within the insect vector, which may assist in developing tools for disease management. This information also provides a baseline for understanding phylogenetic relationships among Wolbachia of other insect vectors.
Subject terms: Symbiosis, Genomics
Introduction
The Asian citrus psyllid, Diaphorina citri Kuwayama (Hemiptera: Liviidae), is a vector of ‘Candidatus Liberibacter asiaticus’ (CLas), a gram-negative α-proteobacterium that putatively causes citrus greening disease, also known as huanglongbing (HLB)1. D. citri also harbors three endosymbionts: ‘Candidatus Carsonella ruddii’, ‘Candidatus Profftella armatura’, and Wolbachia (wDi)2. Infected D. citri transmit CLas while feeding on citrus trees. Infection with CLas reduces fruit quality and yield, and eventually kills the citrus tree1. CLas also interacts with its host D. citri and the host's endosymbionts, including Wolbachia, a gram-negative α-proteobacterium3–5. These studies reported that the abundance of wDi is related to the abundance of CLas in D. citri, as D. citri infected with CLas had a higher Wolbachia titer than non-infected ones, and that wDi regulates the phage lytic cycle genes in CLas5,6. The 56-amino-acid repressor protein of Wolbachia in the psyllid represses the SC1_gp110 (holin) gene of CLas, which is critical for the survival of both endosymbionts in the psyllid5. This suggests a potential role of Wolbachia in CLas transmission and underscores the need for a well characterized Wolbachia genome4 to better understand and combat this devastating citrus disease.
In some arthropods, such as Drosophila melanogaster, Aedes aegypti, Culex pipiens, Acraea encedon, Armadillidium vulgare, and Asobara tabida, Wolbachia can alter host reproduction and increase viral resistance7–9. Wolbachia can manipulate host cellular and reproductive processes by inducing cytoplasmic incompatibility, parthenogenesis, feminization, or male killing8. The infection of Aedes aegypti by the Wolbachia strains wMelCS (D. melanogaster), wRi (D. simulans) and wPip (Culex quinquefasciatus) had effects on fitness, maternal transmission, cytoplasmic incompatibility, tissue tropism and dengue virus blocking10. In addition, a recent study showed that Wolbachia-infected A. aegypti were resistant to Zika and dengue virus co-infection and were suitable for mitigating mosquito-borne diseases11. The role of Wolbachia in Hemiptera, including D. citri, remains poorly understood. The previously released draft wDi genome used paired-end and mate-pair Illumina datasets for the D. citri metagenome12. The draft wDi genome (AMZJ01000000.1) was estimated to be 1.25 Mb, comprising 124 contigs with gaps. In this study, we utilized single molecule real-time (SMRT) sequencing by Pacific Biosciences (PacBio) technology to generate long reads from wDi isolated from host cells13. Whole genome sequencing and de novo assembly of the wDi genome face several challenges, including: (1) difficulties in culturing and isolating large amounts of high quality wDi DNA, (2) the incidence of many long repetitive elements and lateral gene transfers (LGTs) from Wolbachia to the host genome, and (3) the presence of Insertion Sequences (IS) and WO-prophage sequences that complicate the complete genome assembly14–17. The obstacles to generating a single complete contig have been overcome using long-read sequencing methods, such as PacBio, that generate reads long enough to span the repeats15.
In this study, we utilized HiCanu18, which can resolve near-identical genomic repeats, for the complete assembly of the wDi genome. The assembly resulted in a circular genome of 1.52 Mb, which is the largest complete genome among supergroup-B members assembled to date. Among all complete Wolbachia genomes, only those from Folsomia candida (wFol; 1.8 Mb; supergroup-E)19 and the invasive cherry fruit fly Rhagoletis cingulata (wCin2; 1.53 Mb; supergroup-A)20 are larger. The genome dataset will enhance our ability to elucidate the interactions of wDi with its D. citri host and associated endosymbionts.
Results and discussion
wDi genome assembly
The purpose of this study was to obtain a closed Wolbachia genome from D. citri. Recently, we published wDi genomes from a single collection point of the same wDi culture used in this study, which were near complete but could not be circularized21. Sequencing obligate endosymbionts such as Wolbachia is not an easy task because of their very low abundance, their inability to grow outside a host, and the inability to culture them axenically22. In addition, collecting large amounts of high-quality DNA for whole genome sequencing requires a large quantity of bacteria, and therefore a high number of infected host cells from which to obtain the obligate endosymbiont22. Thus, in this study, wDi samples were collected from a combination of two collection points (cell passages) from the same culture to obtain high quality wDi DNA for whole genome sequencing. An overview of the wDi extraction and genome assembly pipeline is shown in Fig. 1.
To produce a high-quality assembly, circular consensus sequences (CCS) were used. CCS reads are accurate consensus sequences derived from the noisy individual subreads obtained from multiple passes over a single template molecule23,24. The raw PacBio sequencing data obtained from the SMRT cells produced 899,643 filtered subreads and a total of approximately four billion bases, with the longest subread length of 118 kb. High quality CCS reads of up to 32 kb were generated from the raw PacBio reads for high quality assembly. The largest number of CCS reads (> 4,000) generated with SMRT® LINK v7.0 on the Sequel II system were of high quality, at Q60 (Fig. 2a,b). Further, 45-bp left adapter sequences were trimmed from the CCS reads. In addition, short reads (< 1000 bp) and the worst 10% of read bases were discarded to ensure a high-quality assembly, leaving a coverage of 72.89×. We utilized the pacbio-hifi parameter in Canu v1.9 to resolve the complexity of Wolbachia genomes and generate a complete assembly with overlapping ends that can be trimmed for circularization. The pacbio-hifi option, recently integrated in Canu v1.9, provides higher repeat resolution than pacbio-corrected, at least on complex genomes like that of Wolbachia18. By default, Canu v1.9 with the pacbio-hifi option uses only overlaps that are below 0.03% error, which is much lower than the threshold used with the pacbio-corrected option. In this study, we applied an even lower rate, correctedErrorRate = 0.001, which reduces the risk of mis-assembly. Before trimming, the assembled genome size was 1,530,940 bp. The genome after circularization was checked for potential errors using Illumina sequencing data. First, the quality of the trimmed Illumina data was assessed with FastQC using various quality metrics. Per-base Phred quality scores for the sample were higher than 30, and the GC content was 33%, following a normal distribution. The Illumina data provided a median coverage of 925× for the sample.
The analyses corrected 91 SNPs, 10 small insertions totaling 73 bases, and three small deletions totaling 41 bases. The de novo assembled genome after correction was 1,528,786 bp in size with an average GC content of 34.08% (Table 1).
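The read preparation described above (trimming a 45-bp left adapter and discarding reads shorter than 1000 bp) can be illustrated with a minimal Python sketch. This is not the pipeline used in the study: the function name, the in-memory read tuples, and the choice to apply the length filter after trimming are assumptions made for illustration, and the removal of the worst 10% of read bases is not reproduced here.

```python
from typing import Iterable, Iterator, Tuple

# A read is represented as (name, sequence, quality string); hypothetical layout.
Read = Tuple[str, str, str]

ADAPTER_LEN = 45      # left adapter length stated in the text
MIN_READ_LEN = 1000   # reads shorter than this are discarded


def trim_and_filter(reads: Iterable[Read],
                    adapter_len: int = ADAPTER_LEN,
                    min_len: int = MIN_READ_LEN) -> Iterator[Read]:
    """Remove a fixed-length left adapter, then drop reads that are too short."""
    for name, seq, qual in reads:
        seq, qual = seq[adapter_len:], qual[adapter_len:]
        # Assumption: the length cutoff is applied to the trimmed read.
        if len(seq) >= min_len:
            yield name, seq, qual


# Toy input: read1 stays above the cutoff after trimming, read2 does not.
example = [
    ("read1", "A" * 1500, "~" * 1500),
    ("read2", "C" * 1040, "~" * 1040),
]
kept = list(trim_and_filter(example))
print([(name, len(seq)) for name, seq, _ in kept])
```

In practice the reads would be streamed from a FASTQ file rather than held in a list; the generator form keeps memory use flat in either case.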
Table 1.
Parameter | wDi |
---|---|
Sequencing instrument | PacBio Sequel II |
Polymerase reads | 72,696 |
Subreads | 899,643 |
Bases (Mb) | 3991.2 |
Mean read length (bp) | 55,808 |
Longest subread length (bp) | 118,419 |
CCS bases (Mb) | 109.34 |
CCS reads (bp) | 32,392 |
CCS coverage (×) | 72.89 |
Assembler | Canu v1.9 |
Assembled chromosome (bp) | 1,528,786 |
Circularity | Yes |
G + C % | 34.07 |
Genes | 1,435 |
CDSsa (Total) | 1,394 |
CDSs (with protein) | 1,202 |
Genes (RNA) | 41 |
rRNAs | 3 |
tRNAs | 34 |
ncRNAsb | 4 |
Pseudo genes (total) | 192 |
PacBio accession | SRR10985324 |
Illumina accession | SRR11075881 |
GenBank accession no | CP048819 |
Bioproject | PRJNA603775 |
Project ID | SRP245886 |
aCDSs, coding DNA sequences.
bncRNAs, noncoding RNAs.
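As a quick check on Table 1, sequencing depth can be estimated as total bases divided by genome size. The sketch below assumes a round 1.5 Mb genome-size estimate, which is not stated in the text but reproduces the reported CCS coverage; the same formula with the final assembled chromosome length gives a slightly lower figure.

```python
def coverage(total_bases: float, genome_size: float) -> float:
    """Estimate sequencing depth as total sequenced bases over genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_bases / genome_size


ccs_bases = 109.34e6            # CCS bases from Table 1 (109.34 Mb)
assumed_genome_size = 1.5e6     # assumption: round pre-assembly size estimate
assembled_size = 1_528_786      # assembled chromosome length from Table 1

depth_estimate = coverage(ccs_bases, assumed_genome_size)
depth_assembled = coverage(ccs_bases, assembled_size)
print(round(depth_estimate, 2), round(depth_assembled, 2))
```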
The complete genome is longer than the previously reported draft contigs of wDi, which were estimated to total 1.25 Mb12. The wDi genome is among the largest assembled Wolbachia genomes from arthropods and nematodes. Previously, the largest Wolbachia genomes were from Folsomia candida (1.8 Mb)19, the invasive cherry fruit fly Rhagoletis cingulata (1.53 Mb)20 and embryos of Aedes albopictus (1.48 Mb)25.
Genome annotations and assessments
The wDi genome was annotated for protein coding genes, 5S, 16S, and 23S rRNA genes, and tRNA genes. An overview of the genome features, including CDSs, rRNAs, and tRNAs, was visualized with the CGView Server (Fig. 3). PGAP annotation showed that the assembled wDi chromosome contains a total of 1,435 genes, of which 1,394 are coding sequences, including 1,202 protein-coding genes. Forty-one genes encode RNAs (three rRNAs, 34 tRNAs, and four noncoding RNAs) and 192 are pseudogenes. We compared the complete wDi chromosome with the draft wDi from several perspectives using tools implemented in the Microscope platform26. The core genes and genome-specific genes were identified by comparison with wDi_AMZJ.112 based on Microscope gene families, with parameters of 80% amino acid identity and 80% alignment coverage. A total of 1,073 genes were shared between the two wDi genomes, while 239 and 183 genes were specific to the wDi assembled in this study and wDi_AMZJ.112, respectively, based on single transitive links (single linkage) with alignment coverage constraints, as implemented in the SiLiX software package (SIngle LInkage Clustering of Sequences) (Figure S1; Table S1). Notably, dnaK (fragment of chaperone protein), metC (fragment of cystathionine beta-lyase/L-cysteine desulfhydrase), ylbg (putative DNA-binding transcriptional regulator), insF (transposase), rpoC (fragment of RNA polymerase subunit beta), and kefB (fragment of K+:H+ antiporter) constituted the largest fraction of genes in complete wDi. However, the Microscope platform's gene phyloprofile analysis revealed that homologs of those genes exist in draft wDi, with homology constraints of identity greater than or equal to 35 percent (Table S2). In complete wDi, the tandem duplication analysis revealed 36 regions containing 286 genes, whereas draft wDi had just 20 regions involving 64 genes. Tandem duplicated genes have an identity ≥ 35% with a minLRap ≥ 0.8 and are separated by a maximum of five consecutive genes.
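The single-linkage grouping of genes into families described above can be sketched with a small union-find structure. This is an illustrative Python sketch, not SiLiX or the Microscope implementation: the function name, the hit tuple layout, and the toy gene identifiers are hypothetical, and only the 80% identity and 80% coverage thresholds come from the text.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# A pairwise hit: (gene_a, gene_b, percent identity, percent alignment coverage)
Hit = Tuple[str, str, float, float]


def single_linkage_families(genes: Iterable[str],
                            hits: Iterable[Hit],
                            min_identity: float = 80.0,
                            min_coverage: float = 80.0) -> List[List[str]]:
    """Group genes into families: any hit passing both thresholds links two genes,
    and links are transitive (single linkage)."""
    genes = list(genes)
    parent: Dict[str, str] = {g: g for g in genes}

    def find(g: str) -> str:
        # Follow parent pointers to the family root, compressing the path.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for a, b, identity, cov in hits:
        if identity >= min_identity and cov >= min_coverage:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    families: Dict[str, set] = defaultdict(set)
    for g in genes:
        families[find(g)].add(g)
    return sorted(sorted(members) for members in families.values())


# Toy example: c* are genes from the complete genome, d* from the draft.
toy_genes = ["c1", "c2", "d1", "d2"]
toy_hits = [
    ("c1", "d1", 95.0, 90.0),   # passes both thresholds
    ("d1", "c2", 85.0, 88.0),   # passes, so c2 joins c1 and d1 transitively
    ("c2", "d2", 70.0, 95.0),   # identity too low, no link
]
families = single_linkage_families(toy_genes, toy_hits)
print(families)
```

A family containing genes from both genomes would count as shared, while a family whose members come from only one genome would count as specific to that genome.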
It is evident that tandem duplications play a major role in the expansion of gene families27. In addition, the complete and draft wDi genomes were compared using lineplots, dotplots, and Mauve alignment. The lineplot showed the strand conservation and inversions in the syntenic regions, as well as a high prevalence of transposases and insertion sequences throughout the complete wDi genome that are absent in the draft wDi (Fig. 4a). The dot plot showed the breaks and inversions relative to the draft wDi (Fig. 4b). Mauve alignment showed some regions in the complete wDi genome whose locally collinear blocks (LCBs) were absent in the draft wDi (Fig. 4c). Each LCB is a homologous sequence region shared by two or more of the genomes under investigation and does not contain any homologous sequence rearrangements28. We also looked at a number of critical elements, such as transposases, ankyrin repeat proteins, DNA-repair genes, and resolvases, in complete and draft wDi that are responsible for both the difficulty in assembly and genome expansion. In complete versus draft wDi, we found 109 versus 15 transposases, 57 versus 54 proteins with ankyrin repeats, 14 versus 11 DNA repair proteins, and six versus one resolvases. The homolog of the 56-amino-acid repressor protein (WP_017531870) of Wolbachia in the psyllid that represses the SC1_gp110 (holin) gene of Ca. Liberibacter asiaticus was also found in the complete wDi genome (GZ065_v1_1041).
The BUSCO completeness score of the assembled wDi genome was also compared to those of Wolbachia reference genomes using the bacteria_odb10 database (calculated in this study). The BUSCO completeness of the final assembled wDi genome was 80.6%, compared to the reference Wolbachia genomes wOo (78.2%), wOv (78.2%), wFol (81.5%), wAlbB (84.7%), wBm (79.8%), wMau (83.9%), wMel (83.1%), wPip (86.3%) and wRi (83.9%). This suggests that a similar number of ‘complete and single-copy’ genes was recovered in the wDi genome as in the reference Wolbachia genomes, a level that is typical and reliable for comparative genomics among Wolbachia genomes25 (Figure S2). It has been suggested that even complete Wolbachia genomes lack 9 to 25 genes from the BUSCO set because their endosymbiotic lifestyle makes these genes redundant, and that these genes are probably not missing from the assemblies and annotations29. The final assembled wDi genome showed 94.0% completeness when the subset database rickettsiales_odb10 was used for the BUSCO analysis. In addition, the checkM completeness of the assembled wDi genome was 95.73% with 1.45% contamination. The checkM completeness and contamination fall within the range of ≥ 95% completeness with ≤ 5% contamination that characterizes an excellent reference genome for analysis30,31. The checkM contamination of the previously published complete wFol genome (1.8 Mb)19, which was assembled from filtered reads obtained from the F. candida genome sequenced using PacBio technology, was 1.82% (calculated in this study) (Table S3). In addition, the taxonomic assignment to Wolbachia sp. was confirmed using the Centrifuge v1.0.3 tool, which showed that all sequences belong to Wolbachia species.
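The quality criterion cited above (≥ 95% completeness with ≤ 5% contamination) amounts to a simple threshold check. The following Python sketch applies it to the checkM values reported for wDi; the function name and default arguments are illustrative and not part of checkM.

```python
def meets_reference_quality(completeness: float,
                            contamination: float,
                            min_completeness: float = 95.0,
                            max_contamination: float = 5.0) -> bool:
    """Return True if both thresholds cited in the text are satisfied."""
    return completeness >= min_completeness and contamination <= max_contamination


# checkM values reported for the assembled wDi genome
wdi_ok = meets_reference_quality(completeness=95.73, contamination=1.45)
print(wdi_ok)
```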
Insertion sequences (ISs), prophage genes, ORF7 and Ankyrin proteins
Insertion sequences are bacterial class-II transposons that are capable of replication and can spread throughout the genome using a cut-and-paste mechanism32. ISs are classified into about 20 families and play a key role in genome evolution32,33. Specifically, 10% of the Wolbachia genomes consist of insertion sequence elements34. A total of 138 ORFs related to ISs were found in the wDi genome, belonging to 14 different IS families (Figure S3; Table S4). The most represented IS families were IS982 (28 copies; 20.3%), IS481 (26 copies; 18.8%), and IS110 (25 copies; 18.1%). Although the ISs in the wDi genome are diverse, they comprise fewer ORFs than in the complete circular wAlbB (CP031221) chromosome belonging to supergroup B, which has nine IS families and 216 ORFs associated with IS elements, with IS982 and IS481 having 99 and 76 copies, respectively. Among the other supergroup B members, wPip possesses IS982, wNo and wMau possess IS110, and wRi possesses IS66_ssgr_ISBst12 as the dominant IS family. The majority of the supergroup A members (wWpum, wCin2, wMel, and wMel_I23) possess IS5_ssgr_IS1031 as the dominant IS family, while wDAna possesses IS110, and wCsol and wHa possess IS5_ssgr_IS903. Wolbachia belonging to supergroups C and D that infect filarial nematodes, such as wOo (one IS ORF) and wBm (three IS ORFs), possess highly reduced IS elements, with IS4_ssgr_IS231 and IS630 as the dominant IS family, respectively. The supergroup E and F members wFol and wCle possess IS5_ssgr_IS1031 as the dominant IS family, with 117 and 231 IS ORFs, respectively (Fig. 5).
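The percentages quoted for the three most represented IS families follow directly from the copy numbers and the total of 138 IS-related ORFs. A short Python sketch of that arithmetic, using only the counts given in the text (the variable names are illustrative):

```python
TOTAL_IS_ORFS = 138  # IS-related ORFs reported for the wDi genome

# Copy numbers of the three most represented IS families, as given in the text
top_families = {"IS982": 28, "IS481": 26, "IS110": 25}

# Share of each family among all IS-related ORFs, in percent (one decimal place)
shares = {family: round(100 * copies / TOTAL_IS_ORFS, 1)
          for family, copies in top_families.items()}
print(shares)
```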
Prophages are subjected to selective pressure from their hosts, resulting in a variety of partial DNA genomic abnormalities such as recombination, gene loss, and progressive disintegration35. The prophage genes are dynamic elements that mediate horizontal gene transfer and are widespread in Wolbachia genomes36,37. Defective genomic prophages, also known as cryptic prophages, are prophages that have lost their ability to generate virions and lyse host cells35,38. The main difference between intact and cryptic WO is that intact WO possesses a rather complete gene module that codes for head, baseplate, and tail proteins, allowing it to generate active virions39. The wDi genome contained five prophage regions (four intact and one incomplete or cryptic) sized 55.8 kb, 23.1 kb, 32.2 kb, 11.9 kb and 34.6 kb and containing 64, 33, 21, 18, and 24 proteins, respectively (Figure S2; Table S5). Altogether, the prophage regions comprised a total of 164 prophage-associated loci scattered across four intact regions and one incomplete region, with a combined size of 137.9 kb (10.3%) in the wDi genome. Based on the existence of all genomic structures (phage attachment sites, genes encoding structural phage proteins, and genes coding for proteins involved in DNA regulation, insertion into the host genome, and lysis), the four intact WOwDi phages have the ability to create virions. One cryptic WOwDi sized 11.8 kb (location: 522,438–534,303) lacks phage baseplate and tail assembly proteins. The wDi genome supports the widely held belief that Wolbachia with cryptic prophages usually have at least one intact WO prophage40. This shows an expansion of the prophage region when compared to other supergroup B members such as wAlbB_CP031221 (1.47%), wNo (4.09%), and wMau (4.07%), although it is comparable to wPip (1.48 Mb genome size) with 9.25% prophage sequences (with only one 59.8 kb intact prophage region and four other cryptic prophage regions) (Fig. 5).
Surprisingly, PHASTER analysis revealed two cryptic prophage regions of 6.4 kb and 15.4 kb in wAlbB_CP031221, with no intact prophage region. However, four WO-like islands (designated wAlbB WO like island 01 through wAlbB WO like island 04) and 19 prophage-associated loci (13 CDS, 6 pseudogenes) were discovered by BLAST comparisons to several WO phages, totaling 111 prophage-associated loci with a combined size of 116 kb (8%) without active prophages25. Another Wolbachia genome with only cryptic prophages was found in the supergroup A member wWpum (Wolbachia in Wiebesia pumilae)39, which has no ability to produce active virions.
The WO prophage areas are sometimes used in cytoplasmic incompatibility genetic investigations41. The BLASTp searches of WOMelB WD0631 (NCBI accession number AAS14330.1) and WD0632 (AAS14331.1) in Microscope platform for CifA and CifB protein sequences, respectively41 found no homologs in the wDi strain for CifA but a few for CifB. Among CifB hits using HHpred42, GZ065_v1_1517, GZ065_v1_0240 follow Module B-1 (ModB-1 with PDDEXK nuclease family, and various other restriction endonucleases such as NucS, HSDR_N, and MmeI), and GZ065_v1_0695, GZ065_v1_0696, GZ065_v1_0704 follow Module B-3 [with ubiquitin-modification (Ulp-1) and protease-like domains (Sentrin-specific protease)]41.
In addition, the wDi genome revealed the presence of four different minor capsid gene ORF7 paralogs (GZ065_00870, GZ065_01245, GZ065_01575, and GZ065_6965) (Figure S3), as in Nasonia vitripennis A Wolbachia37, which are present in the four different prophage sequence regions. The protein domain annotations of the assembled genome showed that 57 (4.0%) proteins in the wDi genome contain at least one copy of an ankyrin repeat domain (Figure S3; Table S6), which is comparable to the ANK proteins of wMel, wRi, and wPip, which account for about 4% of the total genes43. These ANK repeats of about 33 amino acids play a significant role in interactions between host and symbionts34,44 and are found abundantly in genes of WO-prophage44.
Many contemporary hypotheses propose that obligate endosymbionts should have limited genome sizes45, similar to Wolbachia strains in filarial nematodes, which contain no or few insertion sequences, transposable elements, and prophage sequences, due to their obligate association with the host46. A recent study has shown, on the other hand, that the genome of the obligate wFol29 strain is the largest complete Wolbachia genome identified to date, with 1,801,626 base pairs (bp), and is highly enriched in repeated and mobile elements (124 transposases, 96 ankyrin repeat proteins, 34 DNA-repair genes, and 19 resolvases). The wDi genome is likewise more highly enriched in repeated and mobile elements (109 transposases, 57 proteins with ankyrin repeats, 14 DNA repair proteins, and six resolvases) than other supergroup-B members29. All known Wolbachia strains are in a similar transitional stage, in which they are primarily vertically transferred and do not exist in specialized structures47. As a result, their genome size is expected to vary depending on the host47.
COG, eggNOG, and Pfam annotations
COG automatic classification revealed 1,092 CDSs classified in at least one COG group in the wDi genome (Table S7). eggNOG annotation assigned functions to 1,221 protein coding genes (Table S8). The top five pathways were related to “replication, recombination and repair”, “translation, ribosomal structure and biogenesis”, “energy production and conversion”, “posttranslational modification, protein turnover, chaperones”, and “coenzyme transport and metabolism”. Pathway Tools was used to assess whether the metabolic pathways were complete. The analysis showed 40 complete metabolic pathways and 62 incomplete metabolic pathways (Table S9). The Pfam annotation of wDi identified 1,075 protein coding genes with unique Pfam domains. Important Pfam domains for mobile genetic elements, such as the DDE transposase domains DDE_Tnp_1 (PF01609), DDE_Tnp_1_3 (PF13612), DDE_Tnp_4 (PF13359), and DDE_Tnp_IS240 (PF13610.6), the retroviral integrase domains rve (PF00665) and rve_3 (PF13683), and the reverse transcriptase domain RVT_1 (PF00078), were found abundantly in the wDi genome (Table S10).
Toxin-antitoxin system and Type IV Secretion System (T4SS) genes
Toxin–antitoxin (TA) systems are genetic components that consist of a toxin gene (protein) and its antitoxin counterpart (protein or non-coding RNA). In bacteria, various processes, such as translation, replication, cytoskeleton development, membrane integrity, and cell wall biosynthesis, are affected by TA toxins48. PGAP annotation of the wDi genome revealed the presence of the Type II RelE/ParE toxin genes GZ065_00055 and GZ065_03670 (pseudogene) and one Type II RatA family toxin gene, GZ065_04425. Based on a BLASTp search using the wPip antitoxin gene WP_007302904.1, we identified GZ065_00050 as a possible antitoxin gene for the RelE toxin. The Type II RatA family toxin gene GZ065_04425 was situated immediately adjacent to the ssrS noncoding RNA gene (Rfam RF00013), separated by fewer than 18 nucleotides. Previously, RelE/ParE and RatA/ssrS toxin-antitoxin modules were also reported in wCle, wFol, wPip, wMel, wRi, wAu, wHa, and wNo49.
Genes related to the Type IV Secretion System (T4SS) are another important group represented in Wolbachia. Bacteria utilize T4SSs to proliferate and survive inside the host by secreting protein effectors and protein-DNA complexes50. The wDi genome revealed the presence of 14 genes associated with T4SSs (Table S11). These genes were organized in two operons in each wDi genome. Operon 1 contains virB8, virB9-1, virB10, virB11, and virD4. Operon 2 contains virB3, virB4, virB6-1, and virB6-2. The virB2 and virB7 genes were found scattered elsewhere in the genomes. Interestingly, we found both virB2 (three copies) and virB7 (one copy) genes in the wDi genome. These genes have been reported as absent among Wolbachia and most members of the order Rickettsiales51,52. However, recent studies have shown the presence of the virB2 gene (pilus component) in Wolbachia pipientis from Ae. albopictus (wAlbB)25, Wolbachia from Laodelphax striatellus53, Candidatus Wolbachia bourtzisii (wDacA), Wolbachia pipientis wDacB from Dactylopius coccus54, and Wolbachia from Muscidifurax uniraptor (wUni)55. In addition, the virB7 gene (pilus-associated protein) was previously observed only in Wolbachia from Laodelphax striatellus (wStri)53. Bing et al.53 also showed that wDi clustered together with wStri with strong support in a monophyletic clade and suggested that these strains share the same ancestor.
Comparative genomics of wDi with reference Wolbachia genomes
The Wolbachia pangenome comprises 2,112 gene clusters with 18,800 genes identified in 15 Wolbachia genomes. The pangenome study resulted in three bins that were unique to the wDi genomes. Bin_1 consisted of 58 gene clusters with 127 genes common to both the complete wDi genome and the incomplete wDi_AMZJ.112 genome, Bin_2 consisted of 29 gene clusters with 62 genes that were unique to the complete wDi genome, and Bin_3 consisted of 12 gene clusters with 13 genes that were unique to the incomplete wDi_AMZJ.112 genome (Fig. 6a, Table S12). The largest fraction of genes in the three bins comprised ankyrin repeat proteins (n = 28; play an important role in interactions between host and symbionts) and IS4 transposases (n = 11; play a role in DNA mobility using a “cut and paste” mechanism), along with chromosome segregation ATPases (n = 5; play an important role in chromosome condensation and segregation during cytoplasmic incompatibility in male insects), curved DNA-binding protein CbpA, containing a DnaJ-like domain (n = 2; acts as a molecular chaperone in an adaptive response to environmental stresses other than heat shock), DNA repair protein RadC (n = 2), DNA-directed RNA polymerase (n = 2), RecA-family ATPase (n = 6), REP element-mobilizing transposase (n = 2), transcriptional regulator with XRE-family HTH domain (n = 2), and Mg/Co/Ni transporter MgtE (n = 2; important in inorganic ion transport and metabolism); the rest were conserved proteins with unknown function.
The ANI values among the wDi genome and reference Wolbachia genomes indicated the similarity in the range of 82% (supergroup D-wBm) to 95% (supergroup B-wAlbB) and 99.8% to incomplete wDi_AMZJ.112 genome (Fig. 6b). OrthoFinder assigned 21,264 genes (96.3% of total) to 1,924 orthogroups (Table S13) in the 15 Wolbachia genomes. There were 626 orthogroups with all species present and 407 of these consisted entirely of single-copy genes (Fig. 6c). The analysis showed 43 orthogroups unique to complete and draft wDi genomes.
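The distinction drawn above between orthogroups present in all genomes and the subset consisting entirely of single-copy genes can be sketched from a table of per-genome gene counts. This Python sketch is illustrative only: the function name, the nested-dictionary layout, and the toy orthogroup identifiers are assumptions and do not reflect OrthoFinder's actual output format.

```python
from typing import Dict, List, Tuple


def classify_orthogroups(counts: Dict[str, Dict[str, int]],
                         genomes: List[str]) -> Tuple[List[str], List[str]]:
    """Return (orthogroups present in every genome,
               the subset with exactly one gene copy in every genome)."""
    shared = [og for og, per_genome in counts.items()
              if all(per_genome.get(g, 0) >= 1 for g in genomes)]
    single_copy = [og for og in shared
                   if all(counts[og].get(g, 0) == 1 for g in genomes)]
    return shared, single_copy


# Toy counts: number of genes each genome contributes to each orthogroup.
toy_genomes = ["wDi", "wStri", "wPip"]
toy_counts = {
    "OG1": {"wDi": 1, "wStri": 1, "wPip": 1},   # shared and single-copy
    "OG2": {"wDi": 3, "wStri": 1, "wPip": 2},   # shared, but multi-copy
    "OG3": {"wDi": 2},                          # restricted to one genome
}
shared, single_copy = classify_orthogroups(toy_counts, toy_genomes)
print(shared, single_copy)
```

Single-copy orthogroups of this kind are the usual starting point for concatenated-protein phylogenies, since each genome contributes exactly one sequence per group.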
Phylogenetics of wDi and other Wolbachia genomes
The IQ-TREE v 1.6.8 tool was used to construct a maximum likelihood (ML) phylogenetic tree using the concatenated protein sequences of single copy genes, including ribosomal proteins, of reference Wolbachia genomes obtained from the NCBI database (Table S14) together with the wDi genome. The single copy genes were utilized instead of the multilocus sequence typing loci (gatB, coxA, hcpA, fbpA, and ftsZ)58, which are problematic in phylogenetic analyses and may not accurately represent the properties of different Wolbachia strains59. With the advent of sequencing technology and the availability of complete and draft genomes of Wolbachia, recent phylogenetic studies have utilized single copy gene sets53,59,60 rather than whole-genome sequence typing61. Although comparison of whole Wolbachia genome sequences is useful for strain differentiation, diversity estimates, and phylogenetic analyses, the size is cumbersome and not necessary to answer specific questions that can be addressed using genetic marker loci59. The obtained tree (Fig. 7) indicated that the wDi genome belongs to the supergroup-B Wolbachia strains (wVulC, wCon, wLug, wBta, wStri, wAlbB, wDacB, wLcl, wNo, wMau, wAus, Ob_Wba, wBol1-b, wMeg, and wPip) and forms a clade with wStri (the Wolbachia from a Korean Laodelphax striatellus population) and wStri_1 (the Wolbachia from a Chinese L. striatellus population). Wolbachia are divided into supergroups (A, B, E–H); the Wolbachia endosymbionts of arthropods belong to supergroup-A and -B, and those of filarial nematodes belong to supergroup-C and -D8,62. wPpe belongs to supergroup-L63, whereas the wCfeT strain is ancestral to most other Wolbachia lineages (used as an outgroup)64. The phylogenetic analysis by Saha et al.12 also indicated that Wolbachia from D. citri belongs to supergroup-B using FtsZ and Wsp genes.
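Concatenating per-gene protein alignments into a single supermatrix, as used for the ML tree above, can be sketched as follows. This is a minimal Python illustration, not the pipeline used in the study: the function name and toy sequences are hypothetical, and padding a missing taxon with gap characters is an assumption (with strictly single-copy genes present in every genome, no padding would occur).

```python
from typing import Dict, List


def concatenate_alignments(alignments: List[Dict[str, str]],
                           taxa: List[str],
                           gap: str = "-") -> Dict[str, str]:
    """Join per-gene alignments column-wise into one sequence per taxon."""
    parts: Dict[str, List[str]] = {t: [] for t in taxa}
    for aln in alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("all sequences in one alignment must have equal length")
        width = lengths.pop()
        for t in taxa:
            # Assumption: a taxon missing from a gene alignment is gap-padded.
            parts[t].append(aln.get(t, gap * width))
    return {t: "".join(chunks) for t, chunks in parts.items()}


# Toy alignments for two genes across three strains.
gene1 = {"wDi": "MKT-A", "wStri": "MKTLA", "wPip": "MRTLA"}
gene2 = {"wDi": "GG", "wStri": "GA"}          # wPip lacks this gene
supermatrix = concatenate_alignments([gene1, gene2], ["wDi", "wStri", "wPip"])
print(supermatrix)
```

The resulting dictionary can be written out in FASTA or PHYLIP format and passed to a tree-building program.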
Conclusions
The genome sequence of the Wolbachia culture isolated from D. citri was completely assembled and compared with other Wolbachia genomes available in the NCBI database. This study is in accordance with the study by Sinha et al.25, which demonstrated that high quality, complete Wolbachia genome assemblies can be achieved from long-read sequences of high coverage without enrichment, such as through Large Enriched Fragment Targeted Sequencing67 and other target genome enrichment techniques68,69. In this study, we used DNA from axenic Wolbachia cultures for whole genome sequencing rather than filtering Wolbachia sequence reads from the whole insect genome sequence. The latter, referred to as a metagenomic sequencing approach, is a frequent practice that generates low coverage reads for Wolbachia genome assembly70,71. The recent integration of the pacbio-hifi option in Canu (HiCanu) facilitates the generation of complete assemblies with improved repeat resolution on complex genomes like that of Wolbachia, compared with the pacbio-corrected assemblies of previous versions. In addition, concatenated protein sequences of single copy genes generated using the hmm source from Campbell et al.65 delineated the supergroup-B Wolbachia of D. citri from other supergroups. The availability of a complete circular genome of the D. citri endosymbiont, Wolbachia, will facilitate the development of endosymbiont-mediated strategies for pest and disease management. This study expands the list of complete Wolbachia reference genomes that can be useful in studying evolutionary relationships among Wolbachia of arthropods and nematodes.
Materials and methods
Extraction of Wolbachia from D. citri (wDi)
D. citri were collected from a laboratory culture established in 2005 from a population collected in Polk Co. (28.0′ N, 81.9′ W), Lake Alfred, Florida, USA. Individual psyllids were placed on sterile diet rings for two days prior to Wolbachia extraction. The surface sterilized psyllid was homogenized in 1.0 mL of Schneider’s Drosophila (S2) medium (catalog number 21720024, Gibco) followed by centrifugation at 100 × g for five minutes. The supernatant was further centrifuged at 400 × g for five minutes to pellet wDi with insect debris. The pellet was resuspended in 1.0 mL of S2 medium to separate wDi from impurities. The samples were centrifuged at 100 × g for five minutes to pellet impurities, and the supernatant was transferred to a new tube. The final centrifugation step was conducted at 4000 × g for five minutes, and the pelleted wDi was resuspended in 1.0 mL of fresh S2 medium.
Infection of wDi in S2 cells and isolation of wDi from cell culture
Drosophila S2 cells (catalog number R69007, Invitrogen) were infected with Wolbachia extracted from Diaphorina citri (S2 + wDi)72 and maintained in Schneider’s Drosophila medium (catalog number 21720024, Gibco) containing 10% heat inactivated fetal bovine serum (catalog number 10082147, Gibco) and 50 units of penicillin and 50 μg streptomycin sulfate (catalog number 15070063, Gibco) per mL (S2 complete media), as described by Dobson et al.73, according to standard procedures74. The S2 + wDi cells were harvested and lysed by vortexing with 3 mm borosilicate glass beads to isolate wDi. The supernatant samples were processed as described by Rasgon et al.75. wDi cells from the same culture were collected on different dates (different cell passages, 26 and 28) and combined to obtain enough wDi DNA to produce a complete genome21.
wDi Genomic DNA (gDNA) extraction
The wDi gDNA was extracted using the MagAttract HMW DNA Mini kit (catalog number 67563, Qiagen) following the manufacturer’s protocol with a few modifications. The modifications were as follows: The bacterial pellet was resuspended in 180 µl ATL buffer [from the DNeasy® Blood and Tissue Kit (catalog number 69506, Qiagen)] with 20 µl Proteinase K and incubated for 30 min at 56 °C. Then, 15 μl MagAttract Suspension and 280 μl Buffer MB were added to the sample and mixed by pulse vortexing. The sample tubes were transferred to the tube holder of the Magnetic Rack (without the magnetic insert), which was placed onto the mixer and incubated at room temperature (15–25 °C) for 3 min at 1400 rpm. The magnetic insert was then placed into the tube holder of the Magnetic Rack and, once bead separation was complete (~ 1 min), the supernatant was removed. The extracted gDNA was purified using the DNeasy PowerClean Cleanup kit (catalog number 1287750, Qiagen). gDNA was quantified using the Qubit 1 × dsDNA HS Assay kit (ThermoFisher Scientific) and DNA quality was assessed using the TapeStation Genomic DNA ScreenTape (Agilent Technologies).
Long-read (PacBio) sequencing
Sequencing of wDi gDNA was performed on six replicate samples (five samples are not included in this study). wDi gDNA (4–8 µg in 150 µl TE) was sheared down to 10 kb using Covaris g-TUBES (catalog number 520079, Covaris Inc.), using two passes at 7,000 rpm. The resulting size of the fragments was verified on the TapeStation Genomic DNA ScreenTape (Agilent Technologies). Barcoded, 10 kb insert-size libraries were constructed using 600–700 ng of pure, fragmented (10 kb) gDNA from each bacterial sample using the PacBio protocol for multiplex SMRT sequencing of bacterial genomes (PacBio Manual PN 101-069-200-02) in conjunction with barcodes from the Barcoded Adaptor Kit 8A (PacBio PN 101-081-300). Briefly, the library construction reactions consisted of the following sequential steps: ExoVII treatment, DNA Damage Repair, End Repair, and blunt-end ligation of barcoded SMRTbell adaptors. After ligation, samples were pooled, purified using AMPure, and treated with ExoIII/ExoVII to eliminate excess adaptors and any damaged DNA. This procedure resulted in ~ 800 ng of adaptor-ligated SMRTbell library. The final library was further size selected in the SageELF™ instrument (catalog number ELD7510), using 0.75% agarose gel cassettes and the 1–18 kb v2 cassette definition program. The desired SageELF™ fractions in the 5–20 kb range, averaging 10 kb (TapeStation), were cleaned using AMPure magnetic beads (0.6:1.0 beads to sample ratio) and eluted in 15 μl of 10 nM Tris HCl, pH 8.0. The ELF size-selection step yielded 126 ng of ready-to-sequence material. Sequencing was performed on the PacBio SEQUEL instrument using the Chemistry 3.0 reagents in combination with the SMRT® LINK v 6.0 software. The library was loaded onto the PacBio SEQUEL sample plate at 8 pM by diffusion loading, with a 224 min pre-extension time, for sequencing in LR-SMRT cells with 20-h data collection. All other sequencing steps followed the protocol recommended by the PacBio sequencing calculator.
Short-read (Illumina) sequencing
The gDNA samples for Illumina sequencing were fragmented to 400 bp using the Covaris following the manufacturer’s recommended protocol. The genomic libraries were constructed using 100 ng of input DNA and the NEBNext Ultra II DNA library prep kit for Illumina (New England Biolabs). Three PCR cycles were performed with each library prior to library validation using the TapeStation High Sensitivity D5000 ScreenTape (Agilent Technologies). Libraries were quantified using the Qubit 1 × dsDNA HS Assay kit (ThermoFisher Scientific) and the molar concentration was calculated to pool the libraries in equimolar ratios. The pool was then quantified and 14 pM was loaded into the MiSeq flow cell. The run was set as a 2 × 300 bp paired-end run using the 600-cycle v3 kit.
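The equimolar pooling calculation mentioned above can be sketched as follows. This is a minimal illustration, not the procedure used in this study: the function names and the example concentrations are hypothetical, and the only fixed quantity is the common approximation of about 660 g/mol per base pair of double-stranded DNA.

```python
# Hypothetical helper names and example values; only the dsDNA mass constant
# (about 660 g/mol per base pair) is a standard approximation.

def library_molarity_nM(conc_ng_per_ul: float, mean_fragment_bp: float) -> float:
    """Convert a dsDNA mass concentration (ng/µL) to nanomolar."""
    return conc_ng_per_ul * 1e6 / (660.0 * mean_fragment_bp)


def equimolar_volumes(libraries: dict, fmol_each: float) -> dict:
    """Volume (µL) of each library that contributes `fmol_each` femtomoles.

    `libraries` maps a library name to (ng/µL, mean fragment length in bp).
    Because 1 nM equals 1 fmol/µL, the volume is simply fmol / nM.
    """
    volumes = {}
    for name, (conc, size_bp) in libraries.items():
        volumes[name] = fmol_each / library_molarity_nM(conc, size_bp)
    return volumes


# Made-up Qubit and TapeStation readings for two libraries.
example = {"lib_A": (6.6, 500), "lib_B": (3.3, 500)}
volumes = equimolar_volumes(example, fmol_each=20.0)
```

In this toy case the more dilute library contributes twice the volume, so both libraries add the same number of molecules to the pool.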
De novo genome assembly
PacBio CCS reads were generated using SMRT® LINK v7.0 (Sequel II system). The parameters used for CCS generation were a minimum of three full passes and a minimum predicted accuracy of 99%. The left adapter sequences (45 bp) were trimmed using seqtk (https://github.com/lh3/seqtk). Reads shorter than 1000 bp were filtered out using filtlong (--min_length 1000, --keep_percent 90) (https://github.com/rrwick/Filtlong). The de novo assembly was done using Canu v1.9 (https://github.com/marbl/canu)76 with the “pacbio-hifi” option18. The suggested circular chromosome was rendered using the following parameters: trim-assemble, genomeSize=1.5m, correctedErrorRate=0.001, cnsErrorRate=0.050, minReadLength=3000. The resulting contig was circularized by introducing a ‘break’ in the single contig using Amos v3.1.0 and Minimus2 (http://amos.sourceforge.net/wiki/index.php/Minimus2), which trimmed the duplicate sequences at the beginning and end of the chromosome to produce a circular genome. The origin of replication was adjusted using Circlator v1.5.577.
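The idea behind the circularization step, removing sequence that is duplicated at the two ends of a linear contig, can be illustrated with a toy example. The sketch below is not Minimus2: it assumes the terminal overlap is an exact string match, whereas real tools align the ends and tolerate mismatches. The function name and the `min_overlap` threshold are hypothetical.

```python
def trim_terminal_overlap(contig: str, min_overlap: int = 10) -> str:
    """Remove the duplicated end of a contig whose tail repeats its head.

    Looks for the longest suffix (at least `min_overlap` bases, at most half
    the contig) that is identical to the prefix of the same length and drops
    that suffix. Returns the contig unchanged when no such overlap exists.
    The scan compares whole substrings, so it is only suited to small inputs.
    """
    max_len = len(contig) // 2
    for k in range(max_len, min_overlap - 1, -1):
        if contig[:k] == contig[-k:]:
            return contig[:-k]
    return contig


# Toy contig: a 26-base "chromosome" followed by a copy of its first 12 bases.
unique = "ACGTTGCAAGGCTTAACCGGTATCGA"
linear = unique + unique[:12]
circular = trim_terminal_overlap(linear)
```

After trimming, the last base of `circular` is followed (conceptually) by its first base, which is what a circular chromosome record represents.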
Genome correction
Genomes assembled from PacBio reads alone can have a high probability of indel errors78. Therefore, the assembled genome was checked for potential errors using Illumina data obtained from the respective samples and the Pilon error-detection and correction tool79. Adapters and low-quality Illumina sequences were filtered using Trimmomatic v0.36 (ILLUMINACLIP:adapters.fasta:2:30:20 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:50)80. The quality of the trimmed reads was assessed using FastQC v0.11.781. After cleaning, the reads were mapped to the PacBio chromosome using bwa v0.7.1782 in paired-end mode. The indexed bam output file obtained from bwa was used for indel correction with Pilon v1.2279.
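To make the SLIDINGWINDOW:4:15 and MINLEN:50 settings concrete, the following sketch shows the general logic of window-based quality trimming. It is a simplified illustration under stated assumptions, not Trimmomatic's implementation (which differs in detail, for example in how it treats the bases at the cut point); the function names are hypothetical.

```python
def sliding_window_trim(quals: list, window: int = 4, min_mean: float = 15.0) -> int:
    """Return how many bases to keep from the 5' end of a read.

    Scans windows of `window` consecutive Phred scores from the start of the
    read and cuts at the start of the first window whose mean quality falls
    below `min_mean`. Reads shorter than one window are judged as a whole.
    """
    if not quals:
        return 0
    last_start = max(len(quals) - window, 0)
    for start in range(last_start + 1):
        chunk = quals[start:start + window]
        if sum(chunk) / len(chunk) < min_mean:
            return start
    return len(quals)


def passes_minlen(kept_bases: int, minlen: int = 50) -> bool:
    """Mimic a minimum-length filter applied after trimming."""
    return kept_bases >= minlen


# Toy read: 60 high-quality bases followed by a low-quality 10-base tail.
toy_quals = [30] * 60 + [2] * 10
kept = sliding_window_trim(toy_quals)
```

With these toy values the windows starting at positions 57 and 58 still average above 15, and the read is cut at position 59, where the window mean first drops below the threshold.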
Genome annotations and assessments
Genome annotation was done using the standard NCBI Prokaryotic Genome Annotation Pipeline (PGAP)83 and the MicroScope platform26. PGAP annotations are available at NCBI GenBank. The annotations from the MicroScope platform were used for some comparative analyses and are indicated where discussed (represented by GZ065_v1_n). The completeness of the genome was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO) v4 with the bacteria_odb10 database (creation date: 2019-06-26, number of species: 4085, number of BUSCOs: 124) and the rickettsiales_odb10 database (creation date: 2020-03-06, number of species: 34, number of BUSCOs: 364)84, and using CheckM85. The MicroScope platform was used for the CheckM completeness assessment, Clusters of Orthologous Groups (COG) classification of proteins, functional annotation of protein-coding genes using eggNOG-Mapper v1.0.386 and the eggNOG database v4.5.187, and encoded pathway analysis via Pathway Tools v2388 and the MicroCyc metabolic pathways database89. The map of the circular genome with gene feature information was generated using CGView90. The SiLiX software91, which is integrated in the MicroScope platform and uses the MicroScope gene families (MICFAM), was used to analyze the components (core genome, strain-specific sequences) of the complete and draft wDi genomes. MAUVE28 was used to align the complete and draft wDi genomes into locally collinear blocks. Gepard92 was used to create a dot plot between the complete and draft wDi genomes. The LinePlot tool implemented in the MicroScope platform was used to create a line plot for a global comparison, based on a minimum synton size of eight genes. Protein sequences from the MicroScope platform were used to identify Pfam domains with the pfam_scan.pl script v1.5 (last accessed March 10, 2020) and the Pfam database v31.093. The prophage regions were identified using the PHAge Search Tool Enhanced Release (PHASTER, https://phaster.ca/)94 (last accessed September 28, 2021). The ISsaga web server (http://issaga.biotoul.fr/issaga_index.php)95 (last accessed September 28, 2021) was used to find Insertion Sequence (IS) elements using the ISfinder database33. HHpred42 was used to detect protein domains and identify modules41 in order to categorize the possible cytoplasmic incompatibility genes. ORF7 of the phage WO-B genome, a molecular marker for Wolbachia strain typing96,97 with a possible role in inducing cytoplasmic incompatibility98, was identified from the Pfam annotations. The prophage sequences, IS elements, Ankyrin genes, T4SS genes, and ORF7 sequences in the wDi genome were represented in a circos plot using Circa (OMGenomics, http://omgenomics.com/circa/).
Comparative genomics of wDi genome with other Wolbachia genomes
Wolbachia metapangenome, ANI identity, and orthogroup analyses
The assembled wDi genome from this study was compared to various reference genomes: wPip99, wAlbB100, wAlbB_CP03122125, wMel44, wBm_CP03433367, wBm101, wMau67, wRi34, wDAna102, wHa22, wMel_I2370, wNo22 and wRec103. The previously published, non-circular wDi genome wDi_AMZJ.112 was also included in the comparison. The pangenome analyses were performed using anvi’o v5.5.057 (http://merenlab.org/software/anvio/). The taxonomy was assigned using Centrifuge v1.0.3104. COGs were assigned to the reference genomes using the program ‘anvi-run-ncbi-cogs’. The program ‘anvi-pan-genome’ was run on the wDi genome and reference genomes with the following flags and parameters: ‘--use-ncbi-blast’, ‘--minbit 0.5’, and ‘--mcl-inflation 5’. The similarity between the wDi and reference genomes was calculated using ‘anvi-compute-ani’, which utilizes PyANI56 in ‘ANIb’ mode to compute average nucleotide identity across the genomes. The orthogroups across the wDi and reference genomes were identified using Orthofinder v2.4.0105, and orthogroups shared across multiple genomes were visualized as an UpSet plot using Intervene (https://asntech.shinyapps.io/intervene/)106.
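An UpSet plot summarizes how many orthogroups are shared by each exact combination of genomes. The counting behind such a plot can be sketched as below; this is an illustration only, with a hypothetical function name and made-up orthogroup memberships (the strain names are taken from the comparison above), and it does not reproduce the Orthofinder or Intervene output formats.

```python
from collections import Counter


def exclusive_intersections(orthogroups: dict) -> Counter:
    """Count orthogroups by the exact set of genomes in which they occur.

    `orthogroups` maps an orthogroup ID to the collection of genome names
    that contain at least one member gene. Each orthogroup is counted once,
    under the full set of genomes that carry it.
    """
    counts = Counter()
    for genomes in orthogroups.values():
        counts[frozenset(genomes)] += 1
    return counts


# Made-up memberships for four orthogroups across three strains.
toy = {
    "OG0001": {"wDi", "wPip", "wMel"},
    "OG0002": {"wDi", "wPip", "wMel"},
    "OG0003": {"wDi", "wPip"},
    "OG0004": {"wDi"},
}
counts = exclusive_intersections(toy)
```

Each key of `counts` corresponds to one column of an UpSet plot, and orthogroups found in a single genome (here `OG0004`) are the strain-specific ones.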
Phylogenetic analysis
We constructed two maximum likelihood phylogenetic trees at different scales. The phylogenetic analysis was performed using protein sequence hits obtained via ‘anvi-get-sequences-for-hmm-hits’, which utilizes the hidden Markov model (hmm) source from Campbell et al.65 comprising 139 single-copy genes, including 48 ribosomal genes. A small-scale phylogenetic tree was constructed using seventeen complete Wolbachia chromosomes to study and visualize the abundance and variation of insertion and prophage sequences. For the large-scale phylogenetic tree, seventy-seven Wolbachia genomes (taxid: 953) were downloaded from the NCBI database using the ncbi-genome-download command and analyzed together with the wDi genome. The concatenated protein sequences of single-copy genes were aligned using MUSCLE107 and were subjected to ModelFinder108 for RAxML tree using the Bayesian Information Criterion (BIC). The best amino acid substitution model was used to construct the maximum likelihood phylogenetic tree with IQ-TREE v1.6.866 in ultrafast bootstrap mode with 5000 iterations. Branch support was estimated using the Shimodaira–Hasegawa (SH)-like approximate likelihood ratio test with 1,000 replicates. ModelFinder and IQ-TREE were run as integrated in the PhyloSuite v1.2.2 software109. The rerooting, labeling, and color coding of the phylogenetic tree were performed using iTOL v5.7 (https://itol.embl.de/)110.
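Concatenating single-copy genes produces one long sequence per genome, often called a supermatrix. The sketch below shows one common way to build it from per-gene alignments, padding with gaps when a genome lacks a gene. It is an assumption-laden illustration rather than the anvi’o or PhyloSuite procedure: the function name and the toy sequences are hypothetical.

```python
def concatenate_alignments(gene_alignments: list) -> dict:
    """Join per-gene alignments into one concatenated sequence per genome.

    Each element of `gene_alignments` maps genome name -> aligned sequence;
    all sequences within one gene must share the same length. A genome that
    lacks a gene receives a run of gap characters of that gene's length.
    """
    genomes = sorted({g for aln in gene_alignments for g in aln})
    parts = {g: [] for g in genomes}
    for aln in gene_alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within a gene alignment differ in length")
        width = lengths.pop()
        for g in genomes:
            parts[g].append(aln.get(g, "-" * width))
    return {g: "".join(chunks) for g, chunks in parts.items()}


# Two toy gene alignments; wPip is missing from the first gene.
toy_genes = [
    {"wDi": "MKV-", "wMel": "MKVL"},
    {"wDi": "AG", "wPip": "AG", "wMel": "A-"},
]
supermatrix = concatenate_alignments(toy_genes)
```

All rows of the resulting matrix have the same length, which is what tree-building programs expect as input.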
Acknowledgements
The authors acknowledge Albert Mangual for maintaining cell cultures and assisting with wDi isolation. We thank Paul Carr for maintaining D. citri cultures from which wDi cells were isolated. The authors acknowledge the team at the University of Florida Interdisciplinary Center for Biotechnology Research (UF-ICBR) NextGen DNA Sequencing Center, in particular David Moraga, Scientific Director, for his input. The University of Florida Research Computing Center provided computational resources and support that have contributed to the research results reported in this publication. Funding for this project was provided to K.S.P.-S. by the United States Defense Advanced Research Projects Agency (DARPA) (award D19AP00013).
Author contributions
S.N., S.I.B., and K.S.P. designed the study and wrote the main manuscript text. S.N. and S.I.B. analyzed data and prepared Figs. 1, 2, 3, 4, 5, 6, and 7. A.M.M. and S.N. collected the sequencing data. All authors reviewed the manuscript.
Data availability
The sequence reads have been deposited at the NCBI under accessions SRR10985324 and SRR11075881 (BioProject PRJNA603775, BioSample SAMN13940805). The assembled genome and annotations have been deposited at the NCBI GenBank database under the accession CP048819. All the supplemental materials have been uploaded to Figshare: 10.6084/m9.figshare.14397131. Figure S1. Venn diagram showing common and genome-specific genes between the complete wDi and draft wDi_AMZJ.1 genomes. Figure S2. BUSCO assessment of the completeness of wDi genomes with reference sequences. Figure S3. Circos plot representation of various features in the wDi genome. The wDi genome is represented by the outer circle. The first, second, third, fourth, and fifth inner circles represent the tracks for IS elements, Ankyrin genes, T4SS genes, prophage sequences, and ORF7 sequences, respectively, in the wDi genome. Table S1 lists genes specific to the complete and draft wDi genomes. Table S2 lists orthologs of the complete and draft wDi genomes using annotations from the MicroScope platform. Table S3 lists Wolbachia genomes sequenced and assembled using different technologies and assembly tools. Table S4 shows Insertion Sequences (ISs) in the wDi genome. Table S5 shows prophage statistics in the wDi genome. Table S6 lists Ankyrin genes in the wDi genome. Table S7 shows the COG automatic classification of protein-coding genes in the wDi genome. Table S8 shows eggNOG annotations of protein-coding genes in the wDi genome. Table S9 shows the metabolic pathway analysis of the wDi genome. Table S10 shows Pfam domain annotations for the proteins of the wDi genome. Table S11 lists genes related to the Type IV Secretion System in the wDi genome. Table S12 shows a summary of Wolbachia pangenome gene clusters. Table S13 shows the orthogroup analyses. Table S14 lists the Wolbachia genome assemblies downloaded from the NCBI database and includes the 139 single-copy genes (including 48 ribosomal genes) from Campbell et al.65 used as the hidden Markov model (hmm) source, the concatenated protein sequences, and the phylogenetic tree construction file.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Surendra Neupane and Sylvia I. Bonilla
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-021-03184-0.
References
- 1.Gottwald TR. Current epidemiological understanding of citrus huanglongbing. Annu. Rev. Phytopathol. 2010;48:119–139. doi: 10.1146/annurev-phyto-073009-114418.
- 2.Nakabachi A, et al. Defensive bacteriome symbiont with a drastically reduced genome. Curr. Biol. 2013;23:1478–1484. doi: 10.1016/j.cub.2013.06.027.
- 3.Pelz-Stelinski K, Killiny N. Better together: Association with ‘Candidatus Liberibacter asiaticus’ increases the reproductive fitness of its insect vector, Diaphorina citri (Hemiptera: Liviidae) Ann. Entomol. Soc. Am. 2016;109:371–376. doi: 10.1093/aesa/saw007.
- 4.Chu C-C, Gill TA, Hoffmann M, Pelz-Stelinski KS. Inter-population variability of endosymbiont densities in the Asian citrus psyllid (Diaphorina citri Kuwayama) Microb. Ecol. 2016;71:999–1007. doi: 10.1007/s00248-016-0733-9.
- 5.Jain M, Fleites LA, Gabriel DW. A small Wolbachia protein directly represses phage lytic cycle genes in “Candidatus Liberibacter asiaticus” within psyllids. MSphere. 2017;2:e00171–e1117. doi: 10.1128/mSphereDirect.00171-17.
- 6.Fagen JR, et al. Characterization of the relative abundance of the citrus pathogen Ca: Liberibacter asiaticus in the microbiome of its insect vector, Diaphorina citri, using high throughput 16S rRNA sequencing. Open Microbiol. J. 2012;6:29. doi: 10.2174/1874285801206010029.
- 7.Serbus LR, Casper-Lindley C, Landmann F, Sullivan W. The Genetics and Cell Biology of Wolbachia-Host Interactions. Annu. Rev. Genet. 2008;42:683–707. doi: 10.1146/annurev.genet.41.110306.130354.
- 8.Werren JH, Baldo L, Clark ME. Wolbachia: master manipulators of invertebrate biology. Nat. Rev. Genet. 2008;6:741–751. doi: 10.1038/nrmicro1969.
- 9.Kamtchum-Tatuene J, Makepeace BL, Benjamin L, Baylis M, Solomon T. The potential role of Wolbachia in controlling the transmission of emerging human arboviral infections. Curr. Opin. Infect. Dis. 2017;30:108–116. doi: 10.1097/QCO.0000000000000342.
- 10.Fraser, J. E. et al. Novel Wolbachia-transinfected Aedes aegypti mosquitoes possess diverse fitness and vector competence phenotypes. PLoS Pathog.13, e1006751 (2017).
- 11.Caragata, E. P. et al. Pathogen blocking in Wolbachia-infected Aedes aegypti is not affected by Zika and dengue virus co-infection. PLOS Negl. Trop. Dis. 13, e0007443 (2019).
- 12.Saha S, et al. Survey of endosymbionts in the Diaphorina citri metagenome and assembly of a Wolbachia wDi draft genome. PLoS ONE. 2012;7:1. doi: 10.1371/journal.pone.0050067.
- 13.Eid J, et al. Real-time DNA sequencing from single polymerase molecules. Science. 2009;323:133–138. doi: 10.1126/science.1162986.
- 14.Hotopp JCD, Klasson L. The complexities and nuances of analyzing the genome of Drosophila ananassae and its Wolbachia endosymbiont. G3-Genes Genom Genet. 2018;8:373–374. doi: 10.1534/g3.117.300164.
- 15.Goodwin S, McPherson JD, McCombie WR. Coming of age: ten years of next-generation sequencing technologies. Nat. Rev. Genet. 2016;17:333. doi: 10.1038/nrg.2016.49.
- 16.Kent BN, et al. Complete bacteriophage transfer in a bacterial endosymbiont (Wolbachia) determined by targeted genome capture. Genome Biol. Evol. 2011;3:209–218. doi: 10.1093/gbe/evr007.
- 17.Blaxter M. Symbiont Genes in Host Genomes: Fragments with a Future? Cell Host Microbe. 2007;2:211–213. doi: 10.1016/j.chom.2007.09.008.
- 18.Nurk, S. et al. HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads. Genome Res. (2020).
- 19.Faddeeva-Vakhrusheva A, et al. Coping with living in the soil: the genome of the parthenogenetic springtail Folsomia candida. BMC Genomics. 2017;18:493. doi: 10.1186/s12864-017-3852-x.
- 20.Wolfe TM, et al. Comparative genome sequencing reveals insights into the dynamics of Wolbachia in native and invasive cherry fruit flies. Mol. Ecol. 2021 doi: 10.1111/mec.15923.
- 21.Neupane S, Bonilla SI, Manalo AM, Pelz-Stelinski KS. Near-Complete Genome Sequences of a Wolbachia Strain Isolated from Diaphorina citri Kuwayama Microbiol. Resour. Announc. 2020;9:e00560–e1520. doi: 10.1128/MRA.00560-20.
- 22.Ellegaard KM, Klasson L, Näslund K, Bourtzis K, Andersson SG. Comparative genomics of Wolbachia and the bacterial species concept. PLoS Genet. 2013;9:1. doi: 10.1371/journal.pgen.1003381.
- 23.Travers KJ, Chin C-S, Rank DR, Eid JS, Turner SW. A flexible and efficient template format for circular consensus sequencing and SNP detection. Nucl. Acids Res. 2010;38:e159–e159. doi: 10.1093/nar/gkq543.
- 24.Loomis EW, et al. Sequencing the unsequenceable: expanded CGG-repeat alleles of the fragile X gene. Genome Res. 2013;23:121–128. doi: 10.1101/gr.141705.112.
- 25.Sinha A, Li Z, Sun L, Carlow CK. Complete Genome Sequence of the Wolbachia wAlbB Endosymbiont of Aedes albopictus. Genome Biol. Evol. 2019;11:706–720. doi: 10.1093/gbe/evz025.
- 26.Vallenet D, et al. MicroScope: an integrated platform for the annotation and exploration of microbial gene functions through genomic, pangenomic and metabolic comparative analysis. Nucl. Acids Res. 2020;48:D579–D589. doi: 10.1093/nar/gkz926.
- 27.Zhang J. Evolution by gene duplication: An update. Trends Ecol. Evol. 2003;18:292–298.
- 28.Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14:1394–1403. doi: 10.1101/gr.2289704.
- 29.Kampfraath AA, et al. Genome expansion of an obligate parthenogenesis-associated Wolbachia poses an exception to the symbiont reduction model. BMC Genom. 2019;20:106. doi: 10.1186/s12864-019-5492-9.
- 30.Brady A, Salzberg SL. Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models. Nat. Methods. 2009;6:673–676. doi: 10.1038/nmeth.1358.
- 31.Parks DH, MacDonald NJ, Beiko RG. Classifying short genomic fragments from novel lineages using composition and homology. BMC Bioinformatics. 2011;12:328. doi: 10.1186/1471-2105-12-328.
- 32.Chandler, M. & Mahillon, J. i Insertion sequences revisited, p. 305–366. In N. L. Craig, R. Craigie, M. Gellert, and A. Lambowitz (ed.), Mobile DNA II. American Society for Microbiology, Washington, D.C. (2002).
- 33.Siguier P, Pérochon J, Lestrade L, Mahillon J, Chandler M. ISfinder: the reference centre for bacterial insertion sequences. Nucl. Acids Res. 2006;34:D32–D36. doi: 10.1093/nar/gkj014.
- 34.Klasson L, et al. The mosaic genome structure of the WolbachiawRi strain infecting Drosophila simulans. Proc. Natl. Acad. Sci. 2009;106:5725–5730. doi: 10.1073/pnas.0810753106.
- 35.Canchaya C, Proux C, Fournous G, Bruttin A, Brüssow H. Prophage genomics. Microbiol. Mol. Biol. Rev. 2003;67:238–276. doi: 10.1128/MMBR.67.2.238-276.2003.
- 36.Masui S, et al. Bacteriophage WO and virus-like particles in Wolbachia, an endosymbiont of arthropods. Biochem. Biophys. Res. Commun. 2001;283:1099–1104. doi: 10.1006/bbrc.2001.4906.
- 37.Bordenstein SR, Wernegreen JJ. Bacteriophage Flux in Endosymbionts (Wolbachia): Infection Frequency, Lateral Transfer, and Recombination Rates. Mol. Biol. Evol. 2004;21:1981–1991. doi: 10.1093/molbev/msh211.
- 38.Saridaki A, et al. Wolbachia prophage DNA adenine methyltransferase genes in different Drosophila-Wolbachia associations. PLoS One. 2011;6:e19708. doi: 10.1371/journal.pone.0019708.
- 39.Miao, Y.-h., Xiao, J.-h. & Huang, D.-w. Distribution and evolution of the bacteriophage WO and Its antagonism with Wolbachia. Front. Microbial.11. 10.3389/fmicb.2020.595629 (2020).
- 40.Kent, B. N., Funkhouser, L. J., Setia, S. & Bordenstein, S. R. Evolutionary genomics of a temperate bacteriophage in an obligate intracellular bacteria (Wolbachia). PLoS One6, e24984 (2011).
- 41.Lindsey ARI, et al. Evolutionary genetics of cytoplasmic incompatibility genes cifA and cifB in prophage WO of Wolbachia. Genome Biol. Evol. 2018;10:434–451. doi: 10.1093/gbe/evy012.
- 42.Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 2005;33:W244–W248. doi: 10.1093/nar/gki408.
- 43.Klasson L, et al. Genome evolution of Wolbachia strain wPip from the Culex pipiens group. Mol. Biol. Evol. 2008;25:1877–1887. doi: 10.1093/molbev/msn133.
- 44.Wu, M. et al. Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol.2, e69 (2004).
- 45.McCutcheon JP, Moran NA. Extreme genome reduction in symbiotic bacteria. Nat. Rev. Microbiol. 2012;10(1):13–26. doi: 10.1038/nrmicro2670.
- 46.Comandatore F. Supergroup C Wolbachia, mutualist symbionts of filarial nematodes, have a distinct genome structure. Open Biol. 2015;5:150099. doi: 10.1098/rsob.150099.
- 47.Lo W-S, Huang Y-Y, Kuo C-H. Winding paths to simplicity: genome evolution in facultative insect symbionts. FEMS Microbiol. Rev. 2016;40:855–874. doi: 10.1093/femsre/fuw028.
- 48.Unterholzner SJ, Poppenberger B, Rozhon W. Toxin–antitoxin systems. Mob. Genet. Elements. 2013;3:e26219. doi: 10.4161/mge.26219.
- 49.Fallon AM. Computational evidence for antitoxins associated with RelE/ParE, RatA, Fic, and AbiEii-family toxins in Wolbachia genomes. Mol. Genet. Genomics. 2020;295:891–909. doi: 10.1007/s00438-020-01662-0.
- 50.Gonzalez-Rivera, C., Bhatty, M. & Christie, P. J. Mechanism and function of type IV secretion during infection of the human host. Microbiol. Spectr.4 (2016).
- 51.Rancès E, Voronin D, Tran-Van V, Mavingui P. Genetic and functional characterization of the type IV secretion system in Wolbachia. J. Bacteriol. 2008;190:5020–5030. doi: 10.1128/JB.00377-08.
- 52.Pichon S, et al. Conservation of the Type IV secretion system throughout Wolbachia evolution. Biochem. Biophys. Res. Commun. 2009;385:557–562. doi: 10.1016/j.bbrc.2009.05.118.
- 53.Bing X-L, Zhao D-S, Sun J-T, Zhang K-J, Hong X-Y. Genomic Analysis of Wolbachia from Laodelphax striatellus (Delphacidae, Hemiptera) Reveals Insights into Its “Jekyll and Hyde” Mode of Infection Pattern. Genom. Biol. Evol. 2020;12:3818–3831. doi: 10.1093/gbe/evaa006.
- 54.Ramírez-Puebla ST, et al. Genomes of Candidatus Wolbachia bourtzisiiwDacA and Candidatus Wolbachia pipientiswDacB from the Cochineal Insect Dactylopius coccus (Hemiptera: Dactylopiidae) G3-Genes Genom. Genet. 2016;6:3343–3349. doi: 10.1534/g3.116.031237.
- 55.Newton ILG, et al. Comparative genomics of two closely related Wolbachia with different reproductive effects on hosts. Genom. Biol. Evol. 2016;8:1526–1542. doi: 10.1093/gbe/evw096.
- 56.Pritchard L, Glover RH, Humphris S, Elphinstone JG, Toth IK. Genomics and taxonomy in diagnostics for food security: Soft-rotting enterobacterial plant pathogens. Anal. Methods. 2016;8:12–24.
- 57.Eren AM, et al. Anvi’o: An advanced analysis and visualization platform for ‘omics data. PeerJ. 2015;3:e1319. doi: 10.7717/peerj.1319.
- 58.Baldo L, et al. Multilocus sequence typing system for the endosymbiont Wolbachia pipientis. Appl. Environ. Microbiol. 2006;72:7098. doi: 10.1128/AEM.00731-06.
- 59.Bleidorn C, Gerth M. A critical re-evaluation of multilocus sequence typing (MLST) efforts in Wolbachia. FEMS Microbiol. Ecol. 2018;94:1. doi: 10.1093/femsec/fix163.
- 60.Wang X, et al. Phylogenomic analysis of Wolbachia strains reveals patterns of genome evolution and recombination. Genom. Biol. Evol. 2020;12:2508–2520. doi: 10.1093/gbe/evaa219.
- 61.Pérez-Losada M, Cabezas P, Castro-Nallar E, Crandall KA. Pathogen typing in the genomics era: MLST and the future of molecular epidemiology. Infect. Genet. Evol. 2013;16:38–53. doi: 10.1016/j.meegid.2013.01.009.
- 62.Glowska E, Dragun-Damian A, Dabert M, Gerth M. New Wolbachia supergroups detected in quill mites (Acari: Syringophilidae Infect. Genet. Evol. 2015;30:140–146. doi: 10.1016/j.meegid.2014.12.019.
- 63.Chung, M., Munro, J. B., Tettelin, H. & Dunning Hotopp, J. C. Using Core Genome Alignments To Assign Bacterial Species. mSystems3, e00236–00218. 10.1128/mSystems.00236-18 (2018). [DOI] [PMC free article] [PubMed]
- 64.Vasconcelos EJ, et al. Assessing cat flea microbiomes in Northern and Southern California by 16S rRNA next-generation sequencing. Vector Borne Zoonot. Dis. 2018;18:491–499. doi: 10.1089/vbz.2018.2282. [DOI] [PubMed] [Google Scholar]
- 65.Campbell JH, et al. UGA is an additional glycine codon in uncultured SR1 bacteria from the human microbiota. Proc. Natl. Acad. Sci. 2013;110:5540–5545. doi: 10.1073/pnas.1303090110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Nguyen L-T, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Lefoulon E, et al. Large Enriched Fragment Targeted Sequencing (LEFT-SEQ) Applied to Capture of Wolbachia Genomes. Sci. Rep. 2019;9:5939. doi: 10.1038/s41598-019-42454-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Geniez S, et al. Targeted genome enrichment for efficient purification of endosymbiont DNA from host DNA. Symbiosis. 2012;58:201–207. doi: 10.1007/s13199-012-0215-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Dunning Hotopp JC, Slatko BE, Foster JM. Targeted enrichment and sequencing of recent endosymbiont-host lateral gene transfers. Sci. Rep. 2017;7:857. doi: 10.1038/s41598-017-00814-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Basting PJ, Bergman CM. Complete genome assemblies for three variants of the Wolbachia endosymbiont of Drosophila melanogaster. Microbiol. Resour. Announc. 2019;8:1. doi: 10.1128/MRA.00956-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Wang X, et al. Genome assembly of the A-Group Wolbachia in Nasonia oneida using linked-reads technology. Genom. Biol. Evol. 2019;11:3008–3013. doi: 10.1093/gbe/evz223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Rasgon JL, Ren X, Petridis M. Can anopheles gambiae Be infected with Wolbachia pipientis? Insights from an in vitro system. Appl. Environ. Microbiol. 2006;72:7718. doi: 10.1128/AEM.01578-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Dobson SL, Marsland EJ, Veneti Z, Bourtzis K, O'Neill SL. Characterization of Wolbachia host cell range via the in vitro establishment of infections. Appl. Environ. Microbiol. 2002;68:656–660. doi: 10.1128/AEM.68.2.656-660.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Baum B, Cherbas L. Drosophila cell lines as model systems and as an experimental tool. In: Drosophila. Springer; 2008. pp. 391–424.
- 75. Rasgon JL, Gamston CE, Ren X. Survival of Wolbachia pipientis in cell-free medium. Appl. Environ. Microbiol. 2006;72:6934–6937. doi: 10.1128/AEM.01673-06.
- 76. Koren S, et al. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–736. doi: 10.1101/gr.215087.116.
- 77. Hunt M, et al. Circlator: automated circularization of genome assemblies using long sequencing reads. Genome Biol. 2015;16:294. doi: 10.1186/s13059-015-0849-0.
- 78. Watson M, Warr A. Errors in long-read assemblies can critically affect protein prediction. Nat. Biotechnol. 2019;37:124–126. doi: 10.1038/s41587-018-0004-z.
- 79. Walker BJ, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9.
- 80. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170.
- 81. Andrews S. FastQC: a quality control tool for high throughput sequence data. 2010. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
- 82. Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26:589–595. doi: 10.1093/bioinformatics/btp698.
- 83. Tatusova T, et al. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 2016;44:6614–6624. doi: 10.1093/nar/gkw569.
- 84. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351.
- 85. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. doi: 10.1101/gr.186072.114.
- 86. Huerta-Cepas J, et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol. Biol. Evol. 2017;34:2115–2122. doi: 10.1093/molbev/msx148.
- 87. Huerta-Cepas J, et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 2015;44:D286–D293.
- 88. Karp PD, et al. Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology. Brief. Bioinform. 2010;11:40–79. doi: 10.1093/bib/bbp043.
- 89. Vallenet D, et al. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes. Nucleic Acids Res. 2017;45:D517–D528. doi: 10.1093/nar/gkw1101.
- 90. Grant JR, Stothard P. The CGView Server: A comparative genomics tool for circular genomes. Nucleic Acids Res. 2008;36:W181–W184. doi: 10.1093/nar/gkn179.
- 91. Miele V, Penel S, Duret L. Ultra-fast sequence clustering from similarity networks with SiLiX. BMC Bioinformatics. 2011;12:116. doi: 10.1186/1471-2105-12-116.
- 92. Krumsiek J, Arnold R, Rattei T. Gepard: A rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics. 2007;23:1026–1028. doi: 10.1093/bioinformatics/btm039.
- 93. Finn RD, et al. The Pfam protein families database: Towards a more sustainable future. Nucleic Acids Res. 2015;44:D279–D285. doi: 10.1093/nar/gkv1344.
- 94. Arndt D, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016;44:W16–W21. doi: 10.1093/nar/gkw387.
- 95. Varani AM, Siguier P, Gourbeyre E, Charneau V, Chandler M. ISsaga is an ensemble of web-based methods for high throughput identification and semi-automatic annotation of insertion sequences in prokaryotic genomes. Genome Biol. 2011;12:R30. doi: 10.1186/gb-2011-12-3-r30.
- 96. Sanogo YO, Dobson SL. WO bacteriophage transcription in Wolbachia-infected Culex pipiens. Insect Biochem. Mol. Biol. 2006;36:80–85. doi: 10.1016/j.ibmb.2005.11.001.
- 97. Bordenstein SR, Marshall ML, Fry AJ, Kim U, Wernegreen JJ. The Tripartite Associations between Bacteriophage, Wolbachia, and Arthropods. PLoS Pathog. 2006;2:e43. doi: 10.1371/journal.ppat.0020043.
- 98. Sinkins SP, et al. Wolbachia variability and host effects on crossing type in Culex mosquitoes. Nature. 2005;436:257–260. doi: 10.1038/nature03629.
- 99. Klasson L, et al. Genome evolution of Wolbachia strain wPip from the Culex pipiens group. Mol. Biol. Evol. 2008;25:1877–1887. doi: 10.1093/molbev/msn133.
- 100. Gerth M, Bleidorn C. Comparative genomics provides a timeframe for Wolbachia evolution and exposes a recent biotin synthesis operon transfer. Nat. Microbiol. 2016;2:1–7. doi: 10.1038/nmicrobiol.2016.241.
- 101. Foster J, et al. The Wolbachia genome of Brugia malayi: endosymbiont evolution within a human pathogenic nematode. PLoS Biol. 2005;3.
- 102. Pichon S. Type IV secretion system and ankyrin domain-containing proteins in Wolbachia-arthropods interactions. Université de Poitiers; 2009.
- 103. Metcalf JA, Jo M, Bordenstein SR, Jaenike J, Bordenstein SR. Recent genome reduction of Wolbachia in Drosophila recens targets phage WO and narrows candidates for reproductive parasitism. PeerJ. 2014;2:e529.
- 104. Kim D, Song L, Breitwieser FP, Salzberg SL. Centrifuge: Rapid and sensitive classification of metagenomic sequences. Genome Res. 2016;26:1721–1729. doi: 10.1101/gr.210641.116.
- 105. Emms DM, Kelly S. OrthoFinder: Solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015;16:157. doi: 10.1186/s13059-015-0721-2.
- 106. Khan A, Mathelier A. Intervene: A tool for intersection and visualization of multiple gene or genomic region sets. BMC Bioinformatics. 2017;18:287. doi: 10.1186/s12859-017-1708-7.
- 107. Edgar RC. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340.
- 108. Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285.
- 109. Zhang D, et al. PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Resour. 2020;20:348–355. doi: 10.1111/1755-0998.13096.
- 110. Letunic I, Bork P. Interactive tree of life (iTOL) v3: An online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44:W242–W245. doi: 10.1093/nar/gkw290.
Data Availability Statement
The accessions SRR10985324 and SRR11075881 under BioProject PRJNA603775, associated with BioSample SAMN13940805, have been deposited at the NCBI. The assembled genome and annotations have been deposited in the NCBI GenBank database under accession CP048819. All supplemental materials have been uploaded to Figshare: 10.6084/m9.figshare.14397131.
Figure S1. Venn diagram showing common and genome-specific genes between the complete wDi and draft wDi_AMZJ.1 genomes. Figure S2. BUSCO assessment of the completeness of wDi genomes with reference sequences. Figure S3. Circos plot representation of various features in the wDi genome. The wDi genome is represented by the outer circle. The first, second, third, fourth, and fifth inner circles represent the tracks for IS elements, Ankyrin genes, T4SS genes, prophage sequences, and ORF7 sequences, respectively, in the wDi genome.
Table S1 lists the genes specific to the complete and draft wDi genomes. Table S2 lists orthologs of the complete and draft wDi genomes using annotation from the MicroScope platform. Table S3 lists Wolbachia genomes sequenced and assembled using different technologies and assembly tools. Table S4 lists Insertion Sequences (ISs) in the wDi genome. Table S5 gives prophage statistics for the wDi genome. Table S6 lists Ankyrin genes in the wDi genome. Table S7 gives the COG automatic classification of protein-coding genes in the wDi genome. Table S8 gives eggNOG annotations of protein-coding genes in the wDi genome. Table S9 gives the metabolic pathway analysis of the wDi genome. Table S10 gives Pfam domain annotations for the proteins of the wDi genome. Table S11 lists genes related to the Type IV Secretion System in the wDi genome. Table S12 summarizes the Wolbachia pan-gene clusters. Table S13 gives the orthogroup analyses. Table S14 lists the Wolbachia genome assemblies downloaded from the NCBI database, together with the 139 single-copy genes (including 48 ribosomal genes) from Campbell et al.65 used as the hidden Markov model (HMM) source, the concatenated protein sequences, and the phylogenetic tree construction file.