Skip to main content
Scientific Data logoLink to Scientific Data
. 2024 Jul 18;11:798. doi: 10.1038/s41597-024-03640-2

Chromosome-level genome assembly of the two-spotted spider mite Tetranychus urticae

Li-Jun Cao 1,2,#, Tian-Bo Guan 1,2,#, Jin-Cui Chen 2, Fangyuan Yang 2, Jing-Xian Liu 1, Feng-Liang Jin 1,, Shu-Jun Wei 2,
PMCID: PMC11258348  PMID: 39025916

Abstract

The two-spotted spider mite, Tetranychus urticae Koch (Acari: Tetranychidae), is a notorious pest in agriculture that has developed resistance to almost all chemical types used for its control. Here, we assembled a chromosome-level genome for the TSSM using Illumina, Nanopore, and Hi-C sequencing technologies. The assembled contigs had a total length of 103.94 Mb with an N50 of 3.46 Mb, with 87.7 Mb of 34 contigs anchored to three chromosomes. The chromosome-level genome assembly had a BUSCO completeness of 94.8%. We identified 15,604 protein-coding genes, with 11,435 genes that could be functionally annotated. The high-quality genome provides invaluable resources for the genetic and evolutionary study of TSSM.

Subject terms: Genome, Sequencing

Background & Summary

The two-spotted spider mite (TSSM), Tetranychus urticae Koch (Acari: Tetranychidae), is a notorious agricultural pest, with over 1,100 documented host plants1. It causes damage to a wide variety of vegetables, fruit trees, and flowers worldwide. Despite numerous control methods developed to control TSSM, it remains one of the major challenges to mitigating the damage of the TSSM in fields25. The TSSM has a high potential to adapt to environmental changes6,7. It has developed resistance to almost all types of pesticide used to its control8. A reference genome is essential for understanding the ecology and genetics of adaptation as well as for developing new control methods of TSSM. A TSSM genome was determined using Sanger sequencing, which is one of the early reported pest genomes7. The assembly has a size of 89.6 Mb with 640 scaffolds7. It has been widely used and significantly enhanced the studies of TSSM, especially in the fields of pesticide resistance, adaptation to host plants, and environmental changes914. To improve the continuity of the TSSM genome and correct misassembled scaffolds, Wybouw, et al.15 assembled the Sanger sequences into three pseudochromosomes by using population allele frequency data and de novo assemblies of seven strains from Illumina data. The number of chromosomes is consistent with previous cytological work16,17.This chromosome-level genome resolves discontinuities of allele frequencies and facilitates the genome-wide scanning of genes and mutations underlying the evolutionary adaptation of TSSM15,18,19.

In this study, we assembled a chromosome-level genome for the TSSM using a combination of Nanopore long-read and Illumina short-read sequencing, Hi-C technology, and RNA-sequencing (RNA-seq). We yielded a nuclear genome assembly of 87.7 Mb, with an N50 of 29.6 Mb and BUSCO (Benchmarking Universal Single-Copy Ortholog) completeness of 93.4%. This high-quality genome will provide invaluable resources for the study of the TSSM and its relative issues.

Methods

Materials and sequencing

The TSSM strain used for sequencing was collected from Xiaoshan City of Zhejiang province. To decrease the effect of heterozygosity on subsequent analysis, a lab population was reared on French bean Phaseolus vulgaris from a small population (about 200 individuals) for continuous generations (about 20 generations) before sequencing, under 25 ± 1 °C, 60 ± 5% relative humidity and L16: D8 photoperiod. Approximately 200 individuals were used for Illumina, 2000 for NanoPore, and 3000 for Hi-C proximity ligation library construction. About 200 larvae and adults were used for transcriptome sequencing for each of the three libraries. Genomic DNA was extracted using the DNeasy tissue kit (Qiagen, Hilden, Germany) for Illumina library construction and the MagAttract HMW DNA kit (Qiagen, Hilden, Germany) for NanoPore library construction. For the Hi-C library, the genome was digested by the restriction enzyme DpnII, and fragments were then sheared into ~400 bp. The Hi-C library was sequenced using the DNBSEQ-T7 platform. RNA-seq libraries were prepared using VAHTSTM mRNA-seq V2 Library Prep Kit (Vazyme, Nanjing, China) and sequenced on the Illumina NovaSeq platform. Sequencing data generated from each library are provide in Table 1.

Table 1.

Summary statistics of generated sequencing data for Tetranychus urticae genome assembly and annotation in this study.

Library Sequencing instrument Size (bp) Coverage Accession number
Illumina pair-end Illumina NovaSeq 19,573,148,100 223.20 SRR28000465
NanoPore Oxford NanoPore 29,214,889,073 333.14 SRR28000457
Hi-C DNBSEQ-T7 22,715,244,600 259.03 SRR28000066
TU_LabR1 Illumina NovaSeq 5,889,513,359 \ SRR28000928
TU_LabR2 Illumina NovaSeq 9,347,590,280 \ SRR28000929
TU_LabR3 Illumina NovaSeq 7,512,468,577 \ SRR28000930

Genome survey

Genome survey was performed using a k-mer based method. The k-mer coverage was counted from Illumina short reads using Jellyfish version 2.2.1020 with k-mers of 17, 21, 25, and 31. Genome size, heterozygosity, and duplication rate were estimated using GenomeScope version 2.021. The estimated size of the TSSM genome rangs from 87.25 Mb to 88.05 Mb, with a heterozygosity rate of 0.60% to 0.64%, and a duplication rate of 3.25% to 4.41% (Fig. 1a–d).

Fig. 1.

Fig. 1

Genome survey and assembly of the two-spotted spider mite (TSSM) Tetranychus urticae. Genome size, heterozygosity and rate of duplication were estimated using Genomescope when k-mer = 17 (a), 21 (b), 25 (c), and 31 (d). (e) The genome-wide all-by-all Hi-C matrix of TSSM. Three linkage groups were identified based on Hi-C contacts, indicated by blue boxes. Sequences anchored on chromosomes are shown in the plot. (f) Synteny blocks between our new assembly and two previously published genome assemblies of TSSM.

Genome assembly

Nanopore long-reads were corrected and assembled using Nextdenovo22 with default parameters. In order to remove possible secondary alleles, the assembled contigs were filtered using the pipeline Purge Haplotigs23, which produced 177 contigs with a total length of 103.35 Mb and a contig N50 of 3.46 Mb. Raw Illumina whole-genome short-reads were used to polish the long-read contig-level assembly using Pilon v1.2224. Hi-C Illumina short-reads were used to assemble contigs into a chromosome-level genome using Juicer v1.525 and 3D-DNA26. The final assembly contains three chromosomes composed of 34 contigs with a total length of 87.7 Mb (Fig. 1e). This newly assembled genome has greater continuity, with 33 gaps, compared to a previously reported pseudochromosome-level genome, which consisted of 42 scaffolds with over 800 gaps15,27.

Genome annotation

The repeat annotation was performed with RepeatModeler v2.0.428 and RepeatMasker v4.1.429 using a species-specific repeat library, a RepBase database, and a repeat element library for Arthropoda from the Dfam database. The protein-coding genes were annotated using RNA-seq-based, ab initio, and homolog-based methods in the MAKER v3.01.04 pipeline30. For the RNA-seq-based method, the RNA-seq reads of three libraries were mapped to our TSSM assembly with Hisat v2.2.031. The transcripts were then assembled using Stringtie v2.1.2. For ab initio annotation, SNAP v2013-02-1632 and Augustus v3.2.333 parameters were estimated or trained before using them to predict genes in MAKER30. The SNAP parameters were estimated from high-quality transcripts obtained by improvement and filtering using PASA v2.4.134. The gene model of Augustus was directly obtained from the above BUSCO analysis of the genome assembly. For the homolog-based method, we the used protein-coding genes of Drosophila melanogaster (dmel_r6.06) and the previously published genome of TSSM (Accession: GCF_000239435.1)7. Another homology-based method implemented in GeMoMa35 and transcript-based gene predictions utilized in the PASA pipeline v2.1.08734 were performed. Gene models from the three main sources were merged to produce consensus models by EvidenceModeler36. Finally, we identified 15,604 protein-coding genes, 11,232 of which were identical (>95%) to 10,725 protein sequences of the previous version15. Functions of the protein-coding genes were annotated using EggNOG-Mapper v2.1.737 against the database EggNOG v5.0.238, NR39, Swiss-Prot40, GO41, KEGG42, COG43 and PFAM44. In total, 11,435 genes could be functionally annotated. The gene count, Guanine-Cytosine(GC) content, and repeat sequence content were calculated in 100Kbp non-overlapping sliding windows using Bedtools v2.3045 and displayed in a Circos plot by TBtools v2.09346 (Fig. 2a).

Fig. 2.

Fig. 2

Circos plot of GC content, gene count, and repeat content of Tetranychus urticae genome.

Data Records

Illumina short-reads, Nanopore, Hi-C raw reads for T. urticae genome sequencing and Illumina transcriptome data can be accessed in the NCBI Sequence Read Archive under project accession number PRJNA78838547, with accession numbers SRR2800046548, SRR2800045748, SRR2800006648 and SRR28000928- SRR2800093048, respectively. The finally assembled genome has been deposited in the NCBI with an accession number of JALDPR010000001-JALDPR01000005149. The genome assembly and annotation files are available in Figshare (10.6084/m9.figshare.25241794)50.

Technical Validation

Completeness of the genome assembly was up to 94.8% (90.8% single-copied genes, 4.0% duplicated genes, 0.5% fragmented, and 4.7% missing genes) as assessed using BUSCO v3.0.251 with the ‘arachnida_odb10’ database (n = 2934), similar to the previously assembled pseudochromosome-level genome (95.0% completeness with 90.8% single-copied genes, 4.2% duplicated genes, 0.3% fragmented, and 4.7% missing genes). The completeness for annotated gene set was 93.4% (87.1 single-copied genes, 6.3% duplicated genes, 0.7% fragmented, and 5.9% missing genes). Synteny between our assembly and a previously published assembly (the London strain, Assembly accession: GCF_000239435.1) as well as the chromosome-level reassembly7,15 was analyzed using MCSCAN52. This genome showed high synteny to a previously assembled scaffold-level and pseudochromosome-level genome (Fig. 1e). As noted by previous studies15,27, errors on scaffolds 1, 2, 4, and 8 of the Sanger assembly were resolved by our new assembly.

Acknowledgements

This work was supported by National Key R&D Program of China (2023YFD1401200), Key Laboratory of Urban Agriculture (North China, Ministry of Agriculture and Rural Affairs of the People’s Republic of China), Key Laboratory of Environment Friendly Management on Fruit and Vegetable Pests in North China (Co-construction by Ministry of Agriculture and Rural Affairs of the People’s Republic of China and Province), and Program of Beijing Academy of Agriculture and Forestry Sciences (JKZX202208).

Author contributions

S.J.W. designed the study. J.C.C. contributed to the materials. L.J.C., T.B.G., and F.Y. analyzed the data. L.J.C., and F.Y.Y. wrote the manuscript. S.J.W., J.X.L., and F.L.J. revised the manuscript.

Code availability

No custom scripts or code were used in this study.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Li-Jun Cao, Tian-Bo Guan.

Contributor Information

Feng-Liang Jin, Email: jflbang@scau.edu.cn.

Shu-Jun Wei, Email: shujun268@163.com.

References

  • 1.Gerson U, Weintraub PG. Mites (Acari) as a factor in greenhouse management. Annu. Rev. Entomol. 2012;57:229–247. doi: 10.1146/annurev-ento-120710-100639. [DOI] [PubMed] [Google Scholar]
  • 2.Reichert MB, Schneider JR, Wurlitzer WB, Ferla NJ. Impacts of cultivar and management practices on the diversity and population dynamics of mites in soybean crops. Exp. Appl. Acarol. 2024;92:41–59. doi: 10.1007/s10493-023-00862-8. [DOI] [PubMed] [Google Scholar]
  • 3.Mérida-Torres NM, Cruz-López L, Malo EA, Cruz-Esteban S. Attraction of the two-spotted spider mite, Tetranychus urticae (Acari: Tetranychidae), to healthy and damaged strawberry plants mediated by volatile cues. Exp. Appl. Acarol. 2023;91:413–427. doi: 10.1007/s10493-023-00852-w. [DOI] [PubMed] [Google Scholar]
  • 4.Gong Y-J, et al. Efficacy of carbon dioxide treatments for the control of the two-spotted spider mite, Tetranychus urticae, and treatment impact on plant seedlings. Exp. Appl. Acarol. 2018;75:143–153. doi: 10.1007/s10493-018-0251-1. [DOI] [PubMed] [Google Scholar]
  • 5.Tanaka M, Yase J, Kanto T, Osakabe M. Combined nighttime ultraviolet B irradiation and phytoseiid mite application provide optimal control of the spider mite Tetranychus urticae on greenhouse strawberry plants. Pest Manage. Sci. 2024;80:698–707. doi: 10.1002/ps.7798. [DOI] [PubMed] [Google Scholar]
  • 6.Bajda, S. A., Wybouw, N., Nguyễn, V. H., Clercq, P. D. & Leeuwen, T. V. Adaptation of an arthropod predator to a challenging environment is associated with a loss of a genome‐wide plastic transcriptional response. Pest Manage. Sci., 10.1002/ps.7936 (2024). [DOI] [PubMed]
  • 7.Grbic M, et al. The genome of Tetranychus urticae reveals herbivorous pest adaptations. Nature. 2011;479:487–492. doi: 10.1038/nature10640. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sparks TC, Nauen R. IRAC: Mode of action classification and insecticide resistance management. Pestic. Biochem. Physiol. 2015;121:122–128. doi: 10.1016/j.pestbp.2014.11.014. [DOI] [PubMed] [Google Scholar]
  • 9.Vandenhole M, et al. Contrasting roles of cytochrome P450s in amitraz and chlorfenapyr resistance in the crop pest Tetranychus urticae. Insect Biochem. Mol. Biol. 2024;164:104039. doi: 10.1016/j.ibmb.2023.104039. [DOI] [PubMed] [Google Scholar]
  • 10.De Rouck S, İnak E, Dermauw W, Van Leeuwen T. A review of the molecular mechanisms of acaricide resistance in mites and ticks. Insect Biochem. Mol. Biol. 2023;159:103981. doi: 10.1016/j.ibmb.2023.103981. [DOI] [PubMed] [Google Scholar]
  • 11.Fotoukkiaii SM, et al. High-resolution genetic mapping reveals cis-regulatory and copy number variation in loci associated with cytochrome P450-mediated detoxification in a generalist arthropod pest. PLoS Genet. 2021;17:e1009422. doi: 10.1371/journal.pgen.1009422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Rouck SD, Mocchetti A, Dermauw W, Leeuwen TV. SYNCAS: Efficient CRISPR/Cas9 gene-editing in difficult to transform arthropods. Insect Biochem. Mol. Biol. 2024;165:104068. doi: 10.1016/j.ibmb.2023.104068. [DOI] [PubMed] [Google Scholar]
  • 13.Shi P, et al. Independently evolved and gene flow-accelerated pesticide resistance in two-spotted spider mites. Ecol. Evol. 2019;9:2206–2219. doi: 10.1002/ece3.4916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bruinsma K, et al. Host adaptation and specialization in Tetranychidae mites. Plant Physiol. 2023;193:2605–2621. doi: 10.1093/plphys/kiad412. [DOI] [PubMed] [Google Scholar]
  • 15.Wybouw N, et al. Long-term population studies uncover the genome structure and genetic basis of xenobiotic and host plant adaptation in the herbivore Tetranychus urticae. Genetics. 2019;211:1409–1427. doi: 10.1534/genetics.118.301803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Helle W, Bolland HR. Karyotypes and sex-determination in spider mites (Tetranychidae) Genetica. 1967;38:43–53. doi: 10.1007/BF01507446. [DOI] [Google Scholar]
  • 17.Grbic M, et al. Mity model: Tetranychus urticae, a candidate for chelicerate model organism. Bioessays. 2007;29:489–496. doi: 10.1002/bies.20564. [DOI] [PubMed] [Google Scholar]
  • 18.Ji M, et al. A nuclear receptor HR96-related gene underlies large trans-driven differences in detoxification gene expression in a generalist herbivore. Nat. Commun. 2023;14:4990. doi: 10.1038/s41467-023-40778-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sugimoto N, et al. QTL mapping using microsatellite linkage reveals target-site mutations associated with high levels of resistance against three mitochondrial complex II inhibitors in Tetranychus urticae. Insect Biochem. Mol. Biol. 2020;123:103410. doi: 10.1016/j.ibmb.2020.103410. [DOI] [PubMed] [Google Scholar]
  • 20.Marcais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27:764–770. doi: 10.1093/bioinformatics/btr011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Vurture GW, et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics. 2017;33:2202–2204. doi: 10.1093/bioinformatics/btx153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Hu, J. et al. An efficient error correction and accurate assembly tool for noisy long reads. bioRxiv, 2023.2003.2009.531669 10.1101/2023.03.09.531669 (2023). [DOI] [PMC free article] [PubMed]
  • 23.Roach MJ, Schmidt SA, Borneman AR. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinform. 2018;19:460. doi: 10.1186/s12859-018-2485-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Walker BJ, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Durand NC, et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 2016;3:95–98. doi: 10.1016/j.cels.2016.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Dudchenko O, et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017;356:92–95. doi: 10.1126/science.aal3327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bryon A, et al. Disruption of a horizontally transferred phytoene desaturase abolishes carotenoid accumulation and diapause in Tetranychus urticae. Proc. Natl. Acad. Sci. 2017;114:E5871–E5880. doi: 10.1073/pnas.1706865114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Flynn JM, et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. 2020;117:9451–9457. doi: 10.1073/pnas.1921046117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Tarailo-Graovac M, Chen N. Using RepeatMasker to Identify Repetitive Elements in Genomic Sequences. Curr. Protoc. Bioinformatics. 2009;25:4.10.11–14.10.14. doi: 10.1002/0471250953.bi0410s25. [DOI] [PubMed] [Google Scholar]
  • 30.Holt C, Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinform. 2011;12:491. doi: 10.1186/1471-2105-12-491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Korf I. Gene finding in novel genomes. BMC Bioinform. 2004;5:59. doi: 10.1186/1471-2105-5-59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Stanke M, Waack S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics. 2003;19(Suppl 2):ii215–225. doi: 10.1093/bioinformatics/btg1080. [DOI] [PubMed] [Google Scholar]
  • 34.Haas BJ, et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 2003;31:5654–5666. doi: 10.1093/nar/gkg770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Keilwagen, J., Hartung, F. & Grau, J. in Gene prediction: Methods and protocols Vol. 1962 Methods in Molecular Biology (ed M. Kollmar) 161-177 (2019). [DOI] [PubMed]
  • 36.Haas BJ, et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 2008;9:R7. doi: 10.1186/gb-2008-9-1-r7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Huerta-Cepas J, et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol. Biol. Evol. 2017;34:2115–2122. doi: 10.1093/molbev/msx148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Huerta-Cepas J, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47:D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Deng YY, et al. Integrated nr database in protein annotation system and its localization. Computer Engineering. 2006;32:71–72. doi: 10.3969/j.issn.1000-3428.2006.05.026. [DOI] [Google Scholar]
  • 40.Consortium TU. UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2022;51:D523–D531. doi: 10.1093/nar/gkac1052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Ashburner M, et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Tatusov RL, Galperin MY, Natale DA, Koonin EV. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000;28:33–36. doi: 10.1093/nar/28.1.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Finn RD, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42:D222–230. doi: 10.1093/nar/gkt1223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Quinlan AR. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr. Protoc. Bioinformatics. 2014;47:11.12.11–34. doi: 10.1002/0471250953.bi1112s47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Chen C, et al. TBtools-II: A “one for all, all for one” bioinformatics platform for biological big-data mining. Mol. Plant. 2023;16:1733–1742. doi: 10.1016/j.molp.2023.09.010. [DOI] [PubMed] [Google Scholar]
  • 47.2021. NCBI BioProject. https://www.ncbi.nlm.nih.gov/bioproject/PRJNA788385
  • 48.2024. NCBI Sequence Read Archive. SRP490166
  • 49.2024. Genbank. GCA_036877765.1
  • 50.Wei S-J, Cao L-J. 2024. Chromosome-level genome and annotation of the two-spotted spider mite Tetranychus urticae. figshare. [DOI] [PMC free article] [PubMed]
  • 51.Simao FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
  • 52.Wang Y, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40:e49. doi: 10.1093/nar/gkr1293. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. 2021. NCBI BioProject. https://www.ncbi.nlm.nih.gov/bioproject/PRJNA788385
  2. 2024. NCBI Sequence Read Archive. SRP490166
  3. 2024. Genbank. GCA_036877765.1
  4. Wei S-J, Cao L-J. 2024. Chromosome-level genome and annotation of the two-spotted spider mite Tetranychus urticae. figshare. [DOI] [PMC free article] [PubMed]

Data Availability Statement

No custom scripts or code were used in this study.


Articles from Scientific Data are provided here courtesy of Nature Publishing Group

RESOURCES