Abstract
The Green Tree Python (Morelia viridis) is a snake native to New Guinea, some islands in Indonesia, and the Cape York Peninsula in Australia. We present the whole genome sequence for this species. Illumina sequencing was performed on a genetic sample from a single individual. The reads were assembled using a de novo method followed by a series of references from related species for finishing. The raw and assembled data are publicly available via Genbank: Sequence Read Archive (SRR19167500) and genome (JANHOE000000000).
Keywords: genome, snake, python
Introduction
The Green Tree Python (Morelia viridis) is a snake native to New Guinea, some islands in Indonesia, and the Cape York Peninsula in Australia. This species is bright green and can reach a total length (including tail) of 2 m (6.6 ft) and a weight of 1.6 kg (3.5 lb), with females slightly larger and heavier than males. Living primarily in trees, the green tree python hunts and eats small reptiles and mammals. It is a popular pet, and numbers in the wild have suffered with large-scale smuggling of wild-caught green tree pythons in Indonesia. Despite this, the green tree python is rated as least concern on the IUCN Red List of endangered species. We present the whole genome sequence for this species.
Methods
The genetic sample was obtained from a single individual snake that was legally imported by S&J Reptiles in 2015 from the Aru islands and derived from a naturally shed skin.
DNA extraction was performed using the Qiagen DNAeasy genomic extraction kit using the standard process. A paired-end sequencing library was constructed using the Illumina TruSeq kit according to the manufacturer’s instructions. The library was sequenced on an Illumina Hi-Seq platform in paired-end, 2 × 150bp format. The resulting fastq files were trimmed of adapter/primer sequence and low-quality regions with Trimmomatic v0.33 (Bolger, Lohse, and Usadel 2014). The trimmed sequence was assembled by SPAdes v2.5 (Bankevich et al. 2012) followed by a finishing step using Zanfona (Kieras, O’Neill, and Pirro 2021).
Results
The genome assembly yielded a total sequence length of 1,351,159,366 bp over 110,412 scaffolds.
Funding
Funding was provided by Iridian Genomes, grant# IRGEN_RG_2021-1345 Genomic Studies of Eukaryotic Taxa.
Data availability
Raw and assembled data is publicly available via GenBank:
Raw genome data
https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR19167500
Assembled genome
REFERENCES
- Bankevich Anton, Nurk Sergey, Antipov Dmitry, Gurevich Alexey A., Dvorkin Mikhail, Kulikov Alexander S., Lesin Valery M., et al. 2012. “SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing.” Journal of Computational Biology 19 (5): 455–77. 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolger Anthony M., Lohse Marc, and Usadel Bjoern. 2014. “Trimmomatic: A Flexible Trimmer for Illumina Sequence Data.” Bioinformatics 30 (15): 2114–20. 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kieras M, O’Neill K, and Pirro S. 2021. “Zanfona, a Genome Assembly Finishing Tool for Paired-End Illumina Reads.” https://github.com/zanfona734/zanfona.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Raw and assembled data is publicly available via GenBank:
Raw genome data
https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR19167500
Assembled genome
