Abstract
Known for its valuable agarwood, Aquilaria malaccensis Lam. is an evergreen tropical forest tree species endemic to the Indo-malesian region. Indiscriminate damaging and harvesting of the trees in the wild have resulted in it being listed in the CITES Appendix II for controlled trade and in the IUCN Red List as ‘Vulnerable (VU)’. In this study, the complete chloroplast genome of A. malaccensis was assembled using data from high-throughput Illumina sequencing. The chloroplast genome was 174,832 bp in size, which included two inverted repeat regions of 42,091 bp each, separated by a large single copy region of 87,302 bp and a small single copy region of 3,348 bp. A total of 139 genes were predicted, including 39 tRNA, 8 rRNA, and 92 protein-coding genes. Phylogenetic analysis placed A. malaccensis within the family Thymelaeaceae. The chloroplast genome sequence of A. malaccensis offers a useful resource for future studies on the taxonomy and conservation of the threatened Aquilaria trees.
Keywords: Conservation, comparative genomics, phylogenomics, plant DNA barcoding, Thymelaeaceae
Aquilaria malaccensis Lam. (Thymelaeaceae), is native to the Indo-malesian region and is the most widely distributed species of its genus (Lee and Mohamed 2016). In response to pathogenic infection via external wounds, an Aquilaria tree secretes resin, which hardens and turns into agarwood (Mohamed et al. 2010), a valuable ingredient for traditional medicines, perfumery, and religious uses. As natural agarwood is rare, intentional wounding and indiscriminate felling of wild Aquilaria trees in search of agarwood have resulted in the decline of natural Aquilaria populations over the years, prompting conservationist and trade enforcers to swiftly decide on protecting these trees in the wild (Barden et al. 2000). Since 1998, A. malaccensis has been given the status of ‘Vulnerable (VU)’ by the IUCN Red List (IUCN 2017), and also became the first agarwood-producing species under the tribe Aquilarieae to make it into Appendix II of CITES in 2005 (UNEP-WCMC 2015). Due to commercial value, studies that focus on the agarwood often overshadow studies on the Aquilaria trees that produce them, allowing taxonomical issues within the genus to persist. In this study, we characterized the complete cp genome sequence of A. malaccensis as a resource for future studies on the taxonomy of Aquilaria.
Genomic DNA was extracted from fresh leaves of A. malaccensis grown in the greenhouse of the Faculty of Forestry, Universiti Putra Malaysia (UPM), originally obtained from a natural population in Pahang, Malaysia. A voucher specimen (FBL01001) is deposited at the Forest Biotechnology Laboratory in the Faculty of Forestry, UPM. A genomic library with an insert size of 300 bp was prepared using a TruSeq DNA Sample Prep Kit (Illumina, CA, USA) and sequenced on an Illumina HiSeq X Ten platform. Approximately 8 Gb of raw data was generated through pair-end 150 bp sequencing. After removal of adapter sequences, raw reads were fed into the assembly pipeline of NOVOPlasty (Dierckxsens et al. 2017) with the rbcL sequence of Aquilaria yunnanensis (GenBank accession KR528756) as the seed sequence. The assembled cp genome was annotated using the online annotation tool DOGMA (Wyman et al. 2004) and further checked manually by comparison against the A. yunnanensis cp genome (MG656407).
The complete cp genome of A. malaccensis (GenBank accession MH286934) was 174,832 bp in length, consisting of a pair of inverted repeat regions of 42,091 bp each, a large single copy region of 87,302 bp, and a small single copy region of 3,348 bp. A total of 139 genes were annotated, including 92 protein-coding genes, 39 transfer RNA genes, and 8 ribosomal RNA genes. The overall GC content of the cp genome was 36.71%.
Phylogenetic analysis was conducted on an alignment consisting of chloroplast genome sequences of six species in the order Malvales, including four species from the family Thymelaeaceae (A. malaccensis, A. sinensis, A. yunnanensis, and Daphne kiusiana), two species from the family Malvaceae (Gossypium arboretum, and Theobroma cacao), and one outgroup species, Eucalyptus grandis from the family Myrtaceae. A maximum likelihood tree constructed with RAxML (Stamatakis 2014), implemented in Geneious ver. 10.1 (http://www.geneious.com, Kearse et al. 2012) placed A. malaccensis together with other species of Aquilaria (Figure 1).
Disclosure statement
The authors report no conflict of interest.
References
- Barden A, Anak NA, Mulliken T, Song M. 2000. Heart of matter: agarwood use and trade and CITES implementation for Aquilaria malaccensis. Cambridge: Traffic International. [Google Scholar]
- Dierckxsens N, Mardulyn P, Smits G. 2017. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45:e18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- IUCN 2017. The IUCN Red List of Threatened Species. Version 2017-3. [cited 2018 May 1]. Available from: http://www.iucnredlist.org. [Internet]
- Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 28:1647–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee SY, Mohamed R. 2016. The origin and domestication of Aquilaria, an important agarwood-producing genus In:, editor. Agarwood. Singapore: Springer. [Google Scholar]
- Mohamed R, Jong PL, Zali MS. 2010. Fungal diversity in wounded stems of Aquilaria malaccensis. Fungal Divers. 43:67–74. [Google Scholar]
- Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 30:1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- UNEP-WCMC (Comps). 2015. The checklist of CITES species website. CITES Secretariat, Geneva, Switzerland. Cambridge: UNEP-WCMC. [cited 2018 May 1]. Available from: http://checklist.cites.org. [Internet].
- Wyman SK, Jansen RK, Boore JL. 2004. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 20:3252–3255. [DOI] [PubMed] [Google Scholar]