Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2021 Aug 24;16(8):e0256373. doi: 10.1371/journal.pone.0256373

The plastome sequence of Bactris gasipaes and evolutionary analysis in tribe Cocoseae (Arecaceae)

Raquel Santos da Silva 1, Charles Roland Clement 2, Eduardo Balsanelli 3,4, Valter Antonio de Baura 3, Emanuel Maltempi de Souza 3, Hugo Pacheco de Freitas Fraga 1, Leila do Nascimento Vieira 1,*
Editor: Berthold Heinze5
PMCID: PMC8384209  PMID: 34428237

Abstract

The family Arecaceae is distributed throughout tropical and subtropical regions of the world. Among the five subfamilies, Arecoideae is the most species-rich and still contains some ambiguous inter-generic relationships, such as those within subtribes Attaleinae and Bactridineae. The hypervariable regions of plastid genomes (plastomes) are interesting tools to clarify unresolved phylogenetic relationships. We sequenced and characterized the plastome of Bactris gasipaes (Bactridinae) and compared it with eight species from the three Cocoseae sub-tribes (Attaleinae, Bactridinae, and Elaeidinae) to perform comparative analysis and to identify hypervariable regions. The Bactris gasipaes plastome has 156,646 bp, with 113 unique genes. Among them, four genes have an alternative start codon (cemA, rps19, rpl2, and ndhD). Plastomes are highly conserved within tribe Cocoseae: 97.3% identity, length variation of ~2 kb, and a single ~4.5 kb inversion in Astrocaryum plastomes. The LSC/IR and IR/SSC junctions vary among the subtribes: in Bactridinae and Elaeidinae the rps19 gene is completely contained in the IR region; in the subtribe Attaleinae the rps19 gene is only partially contained in the IRs. The hypervariable regions selected according to sequence variation (SV%) and frequency of parsimony informative sites (PIS%) revealed plastome regions with great potential for molecular analysis. The ten regions with greatest SV% showed higher variation than the plastid molecular markers commonly used for phylogenetic analysis in palms. The phylogenetic trees based on the plastomes and the hypervariable regions (SV%) datasets had well-resolved relationships, with consistent topologies within tribe Cocoseae, and confirm the monophyly of the subtribes Bactridinae and Attaleinae.

Introduction

The family Arecaceae contains 181 genera and about 2,600 species distributed throughout tropical and subtropical regions of the world [1]. The most recent taxonomic review in Arecaceae, published by Baker and Dransfield [1], recognizes five subfamilies: Arecoideae, Calamoideae, Ceroxyloideae, Coryphoideae, and Nypoideae. Arecoideae is the largest subfamily, with 14 tribes and 108 genera. Several Arecoideae tribes have been extensively studied, but some inter-generic relationships remain ambiguous, such as those within subtribes Attaleinae and Bactridineae [1]. These subtribes are both within tribe Cocoseae, which includes Elaeidinae as its third subtribe.

Plastid genomes (plastomes) are a useful tool for phylogenetic and evolutionary studies [2]. Hypervariable regions in plastomes can provide information to elucidate phylogenetic relationships that are not yet well resolved [3, 4]. However, these highly variable regions vary between clades, and their identification may be necessary for each taxonomic level [3]. Angiosperms show structural rearrangements, loss of genes, introns, and heterogeneous nucleotide substitution rates in protein-coding genes among their plastomes [5, 6]. Also, the fact that palms have a low mutation rate in the plastome [7] makes the identification of clade-specific hypervariable regions singularly relevant for the group.

In addition to use in phylogenetic analysis, these hypervariable regions may also be powerful molecular markers for analysis of population genetic structure, including that among wild and domesticated populations. The family Arecaceae includes species of great ecological importance, either by their interaction with pollinators [8] or with frugivorous animals [9, 10]. Also, palms provide several high-value products for industry (e.g., fibers, construction materials, oil, medicinal compounds, heart-of-palm, fruits) and are especially important for tropical and subtropical indigenous and traditional communities [11, 12]. Among the most useful palms, the coconut (Cocos nucifera L.), the date palm (Phoenix dactylifera L.), and the oil palm (Elaeis guineensis Jacq.) stand out by their global economic importance. The peach palm (Bactris gasipaes Kunth) is the only fully domesticated palm in the Neotropics [11]; it was domesticated for its fruit and is becoming important as a source of heart-of-palm [13]. Many Cocoseae species included in our analysis have interesting domestication histories, such as Bactris gasipaes, Elaeis guineensis, Cocos nucifera [11, 14, 15]. Others are from genera with traditional use by indigenous and traditional communities in South America, such as Butia, Astrocaryum, Acrocomia, and Syagrus [1619].

We sequenced and characterized the plastome of Bactris gasipaes and compared it with eight species from the three sub-tribes (Attaleinae, Bactridinae, and Elaeidinae) of the tribe Cocoseae to perform comparative structural analysis and phylogenetic inference, and to identify hypervariable regions.

Material and methods

Taxon sampling

We collected fresh leaves from a wild individual of Bactris gasipaes, i.e., var. chichagui type 1 ([20]; accession number F0205/83) in the core collection of Peach palm Active Germplasm Bank [21] at the National Research Institute for Amazonia (INPA, Manaus, AM, Brazil).

Plastomes and nuclear marker sequences from seven genera of tribe Cocoseae were downloaded from GenBank, including individuals from its three subtribes: Attaleinae (3 genera; 3 species), Elaeidinae (1 species), and Bactridinae (3 genera; 4 species). The species names and the GenBank accession numbers for sequences used in the analysis are in S1 Table.

Bactris gasipaes plastome sequencing

The extraction of plastid-enriched DNA was based on the methodology described by Vieira et al. [22] and modified by Sakaguchi et al. [23], proportionally adjusting the buffer volumes for 8 g of fresh leaves. The DNA extraction was performed with CTAB buffer, as described by Shi et al. [24]. The DNA was purified with the Genomic DNA Clean & Concentrator-10 Kit (Zymo Research, Irvine, CA, USA). The purified DNA was quantified using Qubit dsDNA HS Assay kit (Thermo Fisher Scientific, Carlsbad, CA, USA) in Qubit Fluorometer (Thermo Fisher Scientific). Libraries were prepared with Nextera XT DNA Library Preparation Kit (Illumina, San Diego, CA, USA) and sequenced in Illumina MiSeq® (Illumina), obtaining 250 bp paired-end reads.

Plastome assembly was performed using CLC Genomics Workbench v.8.0 (Qiagen, Germantown, MD, USA) software with de novo strategy. The Acrocomia aculeata plastome was used as a reference for the ordering of contigs. Plastome annotation was performed using Geneious Prime® (Biomatters, Auckland, New Zealand). For all genes, manual verification was performed, adjusting the initial and terminal codons. The final plastome sequence was deposited in GenBank: MW054718.

Plastome structural analysis in tribe Cocoseae

The comparative analysis to identify structural rearrangements in the plastomes of the Cocoseae species was carried out using eight species (S1 Table), excluding one IR from all plastomes, and using the progressive alignment on Mauve software [25]. The IRScope software [26] was used to visualize and compare the plastome junctions (IRb/LSC; IRb/SSC, SSC/IRa; IRa/LSC).

Identification of hypervariable regions

We estimated the variability of the sequences with the formula proposed by Shaw et al. [27], adapted and used by Zavala-Páez et al. [28]. First, we individually aligned each collinear coding sequence (CDS), intergenic spacers (IGS), and introns of the plastomes (list of species in S1 Table) using MAFFT v.7 software [29]. Then, the alignments were imported into the software DNAsp v6.12.03 [30] to obtain the number of invariable sites (monomorphic), parsimony informative sites (PIS), number of substitutions, and number of InDel events. Sequence variability (SV) was calculated using the formula: SV% = [(number of substitutions + number of InDels) / (number of substitutions + number of InDels + invariable sites)] x 100. The frequency of PIS was calculated using the formula: PIS% = [(number of parsimony informative sites/ number of substitutions + number of InDels + invariable sites)] x 100.

The ten regions with the highest SV% and PIS% values were selected to carry out the subsequent analysis. The plastid markers matK, trnQ-rps16, rps16 intron, trnD-trnT, trnL-trnF [31, 32] and the nuclear markers PRK and RPB2 [32, 33], commonly used for phylogenetic analysis in Arecaceae, were used for comparative purposes and subjected to the same procedure to obtain the PIS% and SV% values.

Phylogenetic inferences

Phylogenetic inferences were made including the following species of the tribe Cocoseae: Bactris gasipaes, Acrocomia aculeata, Astrocaryum aculeatum, Astrocarym murumuru, Butia eriospatha, Cocos nucifera, Elaeis guineensis, Syagrus coronata, and two species as outgroup: Brahea brandegeei (Purpus) H. E. Moore (subfamily Coryphoideae) and Archontophoenix alexandrae (F.Muell.) H.Wendl. & Drude (subfamily Arecoideae). Three data sets were used: i) the plastome alignment (one IR excluded); ii) the ten regions with the greatest SV% value; iii) the ten regions with the greatest PIS% value.

Plastome alignment was performed using progressive alignment on Mauve [25] implemented in Geneious Prime® v.2020.1.2. The Locally Collinear Blocks (LCBs) identified by Mauve were individually extracted and concatenated. The alignment of the ten regions with the highest SV% and PIS% values was carried out using MAFFT v.7.450 [29] implemented in Geneious Prime® v.2020.1.2. Phylogenetic inferences were performed by Maximum Likelihood (ML) using W-IQ-tree [34], with 1,000 bootstrap repetitions. The choice of substitution models, including FreeRate heterogeneity model, was made according to Bayesian information criterion (BIC; Table 1). Branch support analysis was performed with 1,000 repetitions of bootstrap and single branch test SH (-aLTR, 1,000 replicates). The resulting trees were represented using Geneious Prime® v.2020.1.2.

Table 1. Substitution models selected for the phylogenetic inferences using Maximum Likelihood (ML).

Region Models for ML
Plastome K3Pu+F+R2
trnC-petN HKY+F
psbC-trnS JC+I
psaC-ndhE F81+F+G4
ccsA-ndhD F81+F+I
petN-psbM TPM2+F+I
accD-psaI F81+F+I
trnS-trnG F81+F+I
rps15-ycf1 HKY+F+I
ndhF-rpl32 F81+F+I
rpl16-intron F81+F+I
petD-rpoA F81+F+I
petA-psbJ F81+F+I
trnG-trnfM F81+F+I
rps8-rpl14 F81+F+I

Results

Bactris gasipaes plastome

The sequencing of plastid-enriched DNA resulted in 448,600 reads with an average length of 214 bp. Of these, 47,735 were plastome reads (~10%), resulting in an average depth of coverage of 67.64 (SD = 24.32). The assembled plastome has a 21 bp gap in the IGS trnT-UGU/trnL-UAA (position 46,800 to 46,820). This gap is in an AT-rich region (sequence 22 bp upstream to 119 bp downstream is only 7.8% of GC-content) and is, therefore, difficult to sequence [35]. We calculated this gap length using other species of tribe Cocoseae as reference.

Bactris gasipaes plastome has the quadripartite structure typically found in angiosperms [2], with a pair of inverted repeat (IRs), a large single-copy region (LSC), and a small single-copy region (SSC). The IRs are 27,038 bp in length (each), the LSC is 85,118 bp, and the SSC is 17,452 bp, resulting in a plastome with 156,646 bp.

Bactris gasipaes plastome has an average GC-content of 37.5%. When comparing the plastome regions, the SSC has the lowest GC-content, with 31.3%, followed by LSC with 35.5%. The IRs have the highest value, with 42.6% of GC-content. The rRNA and tRNA show high GC-content, with 55.3% and 53.4%, respectively. Protein-coding genes have an average GC-content of 37.9%, intergenic spacers (IGS) of 37.5%, and introns of 37.1%. The plastome GC-content among species from tribe Cocoseae is similar, ranging from 37.40% (Elaeis guineensis) to 37.53% (Acrocomia aculeata).

In the Bactris gasipaes plastome, we annotated 113 unique genes, 79 of which are protein-coding genes, 30 tRNA genes, and 4 rRNA genes (Table 2). Duplicate genes in IRs include 8 tRNA genes, 4 rRNA genes, and 7 protein-coding genes (Table 2). Among the 113 genes, 15 genes contain one intron (6 tRNA genes and 9 protein-coding genes) and 3 genes contain two introns (clpP, ycf3, and rps12; Table 2). Among intron-containing genes, 12 are located in LSC (trnK-UUU, rps16, trnG-UCC, atpF, rpoC1, ycf3, trnL-UAA, trnV-UAC, clpP, petB, petD, rpl16), 1 in SSC (ndhA), 4 in IRs (rpl2, ndhB, trnI-GAU, trnA-UGC), and rps12 is a trans-splicing gene with the first exon located in the LSC region and the second and third exons in the IRs.

Table 2. List of genes of Bactris gasipaes plastome organized according to their location.

Plastome region Name of genes
Large Single Copy (LSC) psbA, trnK-UUU*, matK, rps16*, trnQ-UUG, psbK, psbI, trnS-GCU, trnG-UCC*, trnR-UCU, atpA, atpF*, atpH, atpI, rps2, rpoC2, rpoC1*, rpoB, trnC-GCA, petN, psbM, trnD-GUC, trnY-GUA, trnE-UUC, trnT-GGU, psbD, psbC, trnS-UGA, psbZ, trnG-GCC, trnfM-CAU, rps14, psaB, psaA, ycf3*, trnS-GGA, rps4, trnT-UGU, trnL-UAA*, trnF-GAA, ndhJ, ndhK, ndhC, trnV-UAC*, trnM-CAU, atpE, atpB, rbcL, accD, psaI, ycf4, cemA, petA, psbJ, psbL, psbF, psbE, petL, petG, trnW-CCA, trnP-UGG, psaJ, rpl33, rps18, rpl20, rps12* (exon 1), clpP*, psbB, psbT, psbN, psbH, petB*, petD*, rpoA, rps11, rpl36, infA, rps8, rpl14, rpl16*, rps3, rpl22
Inverted Repeat (IR) rps19, trnH-GUG, rpl2*, rpl23, trnI-CAU, ycf2, trnL-CAA, ndhB*, rps7, rps12* (exons 2 and 3), trnV-GAC, rrn16, trnI-GAU*, trnA-UGC*, rrn23, rrn4.5, rrn5, trnR-ACG, trnN-GUU
IR / SSC junction ndhF, ycf1
Small Single Copy (SSC) rpl32, trnL-UAG, ccsA, ndhD, psaC, ndhE, ndhG, ndhI, ndhA*, ndhH, rps15

* Intron-containing genes.

Surprisingly, the cemA gene exhibited an alternative start codon, as reported in species of subtribes Attaleinae and Elaeidinae, contrasting with the Bactridinae species sequenced so far (Astrocaryum aculeatum, Astrocaryum murumuru, and Acrocomia aculeata). Three other genes have alternative initiation codons, rps19 (GTG), rpl2 (ACG), ndhD (ATC).

Comparative analysis in tribe Cocoseae

Plastomes are highly conserved (97.3% identity) within tribe Cocoseae. Species from subtribes Bactridinae and Elaeidinae have a plastome ~2 kb larger than the species from subtribe Attaleinae. This difference in length between the subtribes is mainly in the IRs and LSC regions (Table 3). The plastomes from Cocoseae species ranged from 154.048 bp (Butia eriospatha) to 156.937 bp (Elaeis guineenses).

Table 3. Plastomes of tribe Cocoseae.

Subtribe Species Plastome (bp) LSC (bp) IR (bp) SSC (bp)
Elaeidinae Elaeis guineensis 156,973 85,192 27,071 17,639
Bactridinae Astrocaryum aculeatum 156,804 85,037 27,081 17,605
Astrocaryum murumuru 156,801 85,017 27,081 17,622
Bactris gasipaes 156,646 85,118 27,038 17,452
Acrocomia aculeata 156,500 84,936 27,092 17,380
Attaleinae Syagrus coronata 155,053 84,535 26,522 17,474
Cocos nucifera 154,731 84,230 26,555 17,391
Butia eriospatha 154,048 83,805 26,437 17,369

LSC: large single copy region; IR: inverted repeat; SSC: small single copy region.

The progressive alignment among species from tribe Cocoseae shows evidence for three Locally Collinear Blocks (LCBs) (Fig 1). These three LCBs are a result of the 4.5 kb inversion present in the plastomes of Astrocaryum murumuru and Astrocaryum aculeatum (Fig 1). The set of genes that makes up this structural rearrangement is composed of ndhC, ndhK, ndhJ, trnF-GAA, and trnL-UAA.

Fig 1. Plastome rearrangement analysis within tribe Cocoseae.

Fig 1

Locally collinear blocks (LCBs) are identified by colors. The 4.5 kb inversion in Astrocaryum is in green.

In the LSC/IR and IR/SSC junctions of the plastomes, there are differences among the subtribes (Fig 2). In Bactridinae (Acrocomia aculeata, Astrocaryum aculeatum, Astrocaryum murumuru, and Bactris gasipaes) and Elaeidinae (Elaeis guineensis) the rps19 gene is completely contained in the IR region and, therefore, there are two copies of the complete gene. In contrast, in the subtribe Attaleinae (Butia eriospatha, Cocos nucifera, and Syagrus coronata) the rps19 gene is only partially contained in the IRs, resulting in a complete rps19 and a partial rps19: the complete rps19 gene starts at IRb and ends at LSC (LSC/IRb); the partial rps19 starts at IR, but does not contain the final portion of the gene (Fig 2).

Fig 2. Comparison of plastome junctions (IRb/LSC; IRb/SSC, SSC/IRa; IRa/LSC) among Cocoseae species.

Fig 2

The numbers indicate sequence length in base pairs.

Similarly, the ycf1 gene is partially contained in IRs, with a complete ycf1 at IRa/SSC and a partial (pseudo) ycf1 at IRb. The ndhF gene has both position and length conserved at the IRb/SSC junction in tribe Cocoseae, with the portion of the gene contained in the IRb overlapping the ycf1 gene (56 bp) (Fig 2).

Hypervariable regions

We carried out the SV% and PIS% estimates to identify the plastome regions with the greatest variation within tribe Cocoseae. All ten regions selected according to the highest SV% showed greater variation than the plastid molecular markers commonly used for phylogenetic analysis in palms (Fig 3A). As expected, they have SV% lower than the nuclear markers PRK and RPB2 (Fig 3A). Among the ten regions selected according to the highest PIS% values, all showed greater values than the plastid molecular markers commonly used for phylogenetic analysis in palms (Fig 3B) and two of them (trnC-petN and psbC-trnS) were more variable than the nuclear marker PRK (Fig 3B). The nuclear marker RPB2 showed the highest values for both SV% and PIS% estimates.

Fig 3. Hypervariable plastome regions compared with commonly used plastid and nuclear markers.

Fig 3

A—The ten plastome regions with greatest sequence variation (SV%); B—The ten plastome regions with greatest frequency of parsimony informative sites (PIS%).

We also calculated the frequency of substitutions and frequency of InDel events for each plastome region. The substitution frequency was ~5x higher than InDels in plastomes (Table 4). In the coding sequences (CDSs), substitutions are ~80x more common than InDels. In IGSs and introns, substitutions are ~4x and ~3x more frequent than InDels, respectively (Table 4). In general, these data show that in all regions of Cocoseae plastomes there is a higher frequency of substitutions than InDels.

Table 4. Frequency of substitutions/mutations and insertions/deletions (InDels) among the plastome sequence.

Plastome CDS IGS Introns
Mutations (%) 1.53 0.80 2.30 1.12
InDels (%) 0.30 0.01 0.58 0.34
Mutation / InDel ratio 5.02 79.90 3.99 3.26

CDS–coding sequence; IGS–intergenic spacer.

Phylogenomic analysis

The complete LCB matrix of the aligned plastomes consisted of 137,452 columns, of which 134,244 are constant and 815 parsimony-informative. The SV matrix contained a total of 9614 columns, with 133 parsimony-informative and 9144 constant sites, and the PIS matrix 7538 columns, with 129 parsimony-informative and 7200 constant sites. The phylogenomic trees inferred using Maximum Likelihood (ML) and based on the plastome and the ten selected regions with greatest SV% datasets showed identical topologies (Fig 4) and high bootstrap support (> 86). The phylogenetic tree inferred from the alignment of the ten regions with greatest PIS values showed a similar topology; it differed only by the presence of a polytomy in the Attaleinae clade, generated by the low bootstrap support in intergeneric relationships (S1 Fig). For the topologies generated by ML in the three datasets, the monophyly of the subtribes Bactridinae and Attaleinae was confirmed. Also, the subtribe Elaeidinae appears more closely related to Bactridinae than to Attaleinae. In Bactridinae, Bactris and Astrocaryum are closely related genera, and in Attaleinae, Cocos nucifera as sister to Syagrus coronata.

Fig 4. Phylogenetic trees based on maximum likelihood inference.

Fig 4

A–Phylogenetic inference using plastome sequences (one IR removed); B—Phylogenetic inference using the ten plastome regions with greatest sequence variability (SV%) values. The numbers above the branches are maximum likelihood bootstrap values (1000 replicates).

Discussion

Bactris gasipaes plastome

The plastome of Bactris gasipaes has the typical quadripartite structure and gene content of other described Cocoseae species [5, 3640]. The IRs are identical, which probably occurs due to mechanisms of replication and repair through recombination-dependent replication (RDR), as previously reported in plant plastomes [41]. The GC content in the plastomes of Bactris gasipaes and other species of Cocoseae corroborate the mean value described by Kwon et al. [42] for angiosperms (37.71; SD 1.10). The tendency for a higher GC content in the IRs than in the LSC and SSC was previously reported in bryophytes, ferns, lycophytes, and angiosperms [42, 43].

The gene content is conserved among species of Bactridinae (e.g., Acrocomia aculeata, Astrocaryum murumuru, Astrocaryum aculeatum, and Bactris gasipaes) and Elaeidinae (Elaeis guineensis) [36, 38, 39]. Attaleinae species present one pseudogenized rps19, and thus, one less CDS [40].

The gene cemA has an unconventional start codon in Bactris gasipaes, what was previously described in species of subtribes Attaleinae and Elaeidinae, in Podococcus barteri (NC_027276.1), Phoenix dactylifera ’Khalas’, and other monocots. However, it is still not clear if this gene, with the unconventional start codon, is translatable to a protein [44]. Although most of the genes encoding proteins have ATG initiation codons [7], some alternative initiation codons are found in plants [45], such as GTG in the rps19 gene, ACG in rpl2 and ATC in ndhD, which were reported in Lilium longiflorum, Phoenix dactylifera ’Khalas’ and Amomum compactum, respectively [44, 46, 47].

Comparative plastome analysis within Cocoseae

Comparative studies using plastomes of species at different taxonomic levels can bring insights into plastome evolution, phylogenetic relationships, and evolutionary rates [2]. Plastomes of the three subtribes analyzed (Attaleinae, Bactridinae, and Elaeidinae), represented here by eight species, provided information to compare sequence variations in tribe Cocoseae. We identified slight differences in plastome size (~2 kb) and an inversion of 4.5 kp that occurs in the ndh complex (LSC region) of the Astrocaryum plastome. Similarly, Barrett et al. [48] reported that Arecaceae plastomes are highly conserved structure, describing only one 1.9 kb inversion located between the rps16 and trnG-UUC genes in Tahina spectabilis.

Also, the variability among Attaleinae, Bactridinae, and Elaeidinae in the LSC/IR junctions, mainly involving the rps19 gene, was previously described in Acrocomia aculeata [38] and Butia eriosphata [40], as well as in Phoenix dactylifera [44, 49]. Similarly, the ndhF gene overlapping with ycf1 in ~25 bp is commonly observed in palms [38, 40, 44, 49]. Despite the differences in the IR junction, the IR structure and gene content is conserved among palms, corroborating the hypothesis that the IR regions offer an isolation mechanism that stabilizes the structure of the genome [50].

Hypervariable regions

Plastomes have several non-coding regions, but not all of them have been explored for phylogenetic studies [3, 4, 28]. Among the ten regions with the greatest SV% identified in our study, only five IGSs (e.g., accD-psaI, ndhF-rpl32, trnS-trnG, psaC-ndhE, rps15-ycf1) and one intron (rpl16) were previously used and/or highlighted in angiosperm studies [3, 38, 51]. Among the ten regions with the greatest PIS%, only three IGS (trnS-trnG, petA-psbJ, and psaC-ndhE) were identified in studies carried out by Shaw et al. [3, 27] and Lopes et al. [38]. Thus, in our study, we described four new promising regions based on both SV and PIS values (trnC-petN, psbC-trnS, ccsA-ndhD, petN-psbM) and three new regions based on PIS (petD-rpoA, trnG-trnfM, rps8-rpl14). Also, Scarcelli et al. [51] reported 100 primers for phylogeny in monocots. However, four of the hypervariable regions reported in our study were not contemplated (e.g., psaC-ndhE, petN-psbM, accD-psaI, trnS-trnG). Among them. trnS-trnG and accD-psaI primers designed by Scarcelli et al. [51] showed no amplification in Arecaceae, and petN-psbM and psaC-ndhE were not mentioned, probably due to gene rearrangements in monocots plastomes. This reinforces that the highly variable regions vary between clades, and their identification may be necessary for distinct taxonomic levels. As expected, the nuclear genes PRK and RPB2 showed greater variation than most plastidial regions. These nuclear markers are very informative and produce well-resolved topologies [33]. The combined use of the plastidial regions described here and the nuclear markers PRK and RPB2 have great potential for phylogenetic studies in tribe Cocoseae.

Phylogenomic analysis

The ten regions with the greatest SV% values are suitable for phylogenetic inferences and produce phylogenetic trees with well-resolved and the expected topologies. In the ML analysis, all datasets tested (plastome, ten SV regions, and ten PIS regions) result in subtribe Bactridinae as monophyletic. The monophyly Bactridinae was previouly described by Eiserhardt et al. [32], as well as the sister relationship between the subtribes Elaeidinae and Bactridinae [33, 38]. Our results are in contrast with those of Gunn [52], in which the sister relationship between Astrocaryum and Bactris is weakly supported. In all of our datasets this sister relationship is well-supported. In addition, the monophyly of subtribe Cocoseae was also verified in a plastid DNA analysis [53], in the super-tree method [54] and by the combined analysis with the PRK and RPB2 genes [33]. The sister relationship between Cocos nucifera and Syagrus coronata was also previously described [55], corroborating our results. Thus, both the plastome and the ten regions with greatest SV values were able to produce well-resolved phylogenetic trees and with consistent topologies within tribe Cocoseae.

Supporting information

S1 Table. GenBank accession numbers of the nucleotide sequences used in our analysis.

(DOC)

S1 Fig. Phylogenetic tree based on maximum likelihood inference using the ten plastome regions with greatest PIS% values.

The numbers above the branches are maximum likelihood bootstrap values (1000 replicates).

(TIF)

Acknowledgments

We thank Jeison W. de S. Magnabosco and Michelle E. Zavala-Páez for their help with the analyses.

Data Availability

The plastome sequence was deposited in GenBank: MW054718.

Funding Statement

This research was supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq - https://www.gov.br/cnpq) [grant numbers LNV 435200/2018-6, CRC 303477/2018-0]. This research was also partially supported by The Brazilian Program of National Institutes of Science and Technology-INCT/Brazilian Research Council-CNPq/MCT and Embrapa’s Portfólio de Recursos Genéticos (https://www.embrapa.br/), which helped to maintain the Peach palm Active Germplasm Bank at INPA. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Baker WJ, Dransfield J. Beyond Genera Palmarum: progress and prospects in palm systematics. Bot J Linn Soc 2016; 182:207–233. [Google Scholar]
  • 2.Rogalski M, Vieira LN, Fraga HP, Guerra MP. Plastid genomics in horticultural species: importance and applications for plant population genetics, evolution and biotechnology. Front Plant Sci 2015; 6:1–17. doi: 10.3389/fpls.2015.00001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Shaw J, Lickey EB, Schuilling EE, Small RL. Comparison of whole chloroplast genome sequences to choose noncoding regions for phylogenetic studies in angiosperms: the tortoise and the hare III. Am J Bot 2007; 94:275–288. doi: 10.3732/ajb.94.3.275 [DOI] [PubMed] [Google Scholar]
  • 4.Smidt EC, Zavala-Páez M, Vieira LN, Viruel J, Baura VA, Balsanelli E, et al. Characterization of sequence variability hotspots in Cranichideae plastomes (Orchidaceae, Orchidoideae). PLoS One 2020; 15:e0227991. doi: 10.1371/journal.pone.0227991 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wicke S, Schneeweiss GM, Depamphilis CW, Muller KF, Quandat D. The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Mol Biol 2011; 76:273–297. doi: 10.1007/s11103-011-9762-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ruhlman TA, Jansen RK. The plastid genomes of flowering plants. In: Maliga P, editor. Chloroplast biotechnology: methods and protocols. Totowa:Humana Press; 2014. pp. 3–38. [DOI] [PubMed] [Google Scholar]
  • 7.Wilson MA, Gaut B, Clegg MT. Chloroplast DNA evolves slowly in the palm family (Arecaceae). Mol Biol Evol 1990; 7:303–314. doi: 10.1093/oxfordjournals.molbev.a040605 [DOI] [PubMed] [Google Scholar]
  • 8.Barfod AS, Hagen M, Borchsenius F. Twenty-five years of progress in understanding pollination mechanisms in palms (Arecaceae). AoB Plants 2011; 108:503–1516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Onstein RE, Baker WJ, Couvreur TLP, Faurby S, Svenning JC, Kissling WD. Frugivory-related traits promote speciation of tropical palms. Nat Ecol Evol 2017; 12:1903–1911. doi: 10.1038/s41559-017-0348-7 [DOI] [PubMed] [Google Scholar]
  • 10.Muñoz G, Trøjelsgaard K, Kissling WD. A synthesis of animal-mediated seed dispersal of palms reveals distinct biogeographical differences in species interactions. J Biogeogr 2019; 46:466–484. [Google Scholar]
  • 11.Clement CR. Domesticated palms. Principes 1992; 36:70–78. [Google Scholar]
  • 12.Balslev H, Bernal R, Fay MF. Palms–emblems of tropical forests. Bot J Linn Soc 2016; 182:195–200. [Google Scholar]
  • 13.Clement CR. Peach palm (Bactris gasipaes). In: Janick J, Paull RE, editors. The encyclopedia of fruit and nuts. Wallingford: CABI Publishing; 2008. pp. 93–101. [Google Scholar]
  • 14.Clement CR, Cristo-Araújo M, Coppens d’Eeckenbrugge G, Reis VM, Lehnebach R, Picanço-Rodrigues D. Origin and dispersal of domesticated peach palm. Front Ecol Evol 2017; 5:1–19. [Google Scholar]
  • 15.Muñoz-Pérez JM, Cañas GP, López L, Arias T. Genome-wide diversity of northern South America cultivated Coconut (Cocus nucifera L.) uncovers diversification times and targets of domestication of coconut globally. bioRxiv 2019; 825398. [Google Scholar]
  • 16.Rivas M, Condón F. Plant domestication and utilization: The case of the Pampa biome. In: Al-Khayri JM, et al., editors. Advances in plant breeding strategies: breeding, biotechnology and molecular tools. Cham:Springer; 2015. pp. 3–24. [Google Scholar]
  • 17.Montoya SG, Motoike SY, Kuki KN, Couto AD. Fruit development, growth, and stored reserves in macauba palm (Acrocomia aculeata), an alternative bioenergy crop. Planta 2016; 244:927–938. doi: 10.1007/s00425-016-2558-7 [DOI] [PubMed] [Google Scholar]
  • 18.Levis C, Flores BM, Moreira PA, Luize BG, Alves RP, Franco-Moraes J, et al. How people domesticated Amazonian forests. Front Ecol Evol 2018; 5:171. [Google Scholar]
  • 19.Pereira Cruz A, Giehl ELH, Levis C, Machado JS, Bueno L, Peroni N. Pre-colonial Amerindian legacies in forest composition of southern Brazil. PLoS One 2020; 15:e0235819. doi: 10.1371/journal.pone.0235819 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Henderson A. Bactris (Palmae). New York: Flora Neotropica; 2000 [Google Scholar]
  • 21.Cristo-Araújo M, Picanço-Rodrigues D, Astolfi-Filho S, Clement CR. Peach palm core collection in Brazilian Amazonia. Crop Breed Appl Biotech 2015; 15:18–25. [Google Scholar]
  • 22.Vieira LN, Fraga HPF, Rogalski M, Souza EM, Pedrosa FO, Guerra MP. An improved protocol for intact chloroplasts and cpDNA isolation in conifers. PLoS One 2014; 9:e84792. doi: 10.1371/journal.pone.0084792 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sakaguchi S, Ueno S, Tsumura Y, Setoguchi H, Ito M, Hattori C, et al. Application of a simplified method of chloroplast enrichment to small amounts of tissue for chloroplast genome sequencing. Appl Plant Sci 2017; 5:1–8. doi: 10.3732/apps.1700002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Shi C, Hu N, Huang H, Gao J, Zhao YJ, Gao LZ. An improved chloroplast DNA extraction procedure for whole plastid genome sequencing. PLoS One 2012; 7:e31468. doi: 10.1371/journal.pone.0031468 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Darling AC, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 2004; 14:1394–1403. doi: 10.1101/gr.2289704 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Amiryousefi A, Hyvonen J, Poczai P. IRscope: An online program to visualize the junction sites of chloroplast genomes. Bioinformatics 2018; 34:3030–3031. doi: 10.1093/bioinformatics/bty220 [DOI] [PubMed] [Google Scholar]
  • 27.Shaw J, Shafer HL, Rayne LO, Kovach MJ, Schorr M, Morris AB. Chloroplast DNA sequence utility for the lowest phylogenetic and phylogeographic inferences in angiosperms: The tortoise and the hare IV. Am J Bot 2014; 101:1987–2004. doi: 10.3732/ajb.1400398 [DOI] [PubMed] [Google Scholar]
  • 28.Zavala-Páez M, Vieira LN, Baura VA, Balsanelli E, Souza EM, Cerna M, et al. Comparative plastid genomics of neotropical Bulbophyllum (Orchidaceae; Epidendroideae). Front Plant Sci 2020; 11:799. doi: 10.3389/fpls.2020.00799 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol 2013; 30:772–780. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Rozas J, Ferrer-Mata A, Sánchez-Delbarrio JC, Guirão-Rico S, Librado P, Ramos-Onsins S, et al. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Datasets. Mol Biol Evol 2017; 34:3299–3302. doi: 10.1093/molbev/msx248 [DOI] [PubMed] [Google Scholar]
  • 31.Asmussen CB, Dransfield J, Deickmann V, Barfod AS, Pintaud JC, Baker WJ. A new subfamily classification of the palm family (Arecaceae): evidence from plastid DNA phylogeny. Bot J Linn Soc 2006; 151:15–38. [Google Scholar]
  • 32.Eiserhardt WL, Pintaud JC, Asmussen-Lange C, Hahn WJ, Bernal R, Balslev H, et al. Phylogeny and divergence times of Bactridinae (Arecaceae, Palmae) based on plastid and nuclear DNA sequences. Taxon 2011; 60:485–498. [Google Scholar]
  • 33.Baker WJ, Norup MV, Clarkson JJ, Couvreur TLP, Dowe JL, Lewis CE, et al. Phylogenetic relationships among arecoid palms (Arecaceae: Arecoideae). AoB Plants 2011; 108:1417–1432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Trifinopoulos J, Nguyen LT, von Haeseler A, Minh BQ. W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res 2016; 44:W232–W235. doi: 10.1093/nar/gkw256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Benjamini Y, Speed TP. Summarizing and correcting the GC content bias in high-throughput sequencing. Nucleic Acids Res 2012; 40:e72–e72. doi: 10.1093/nar/gks001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Uthaipaisanwong P, Chanprasert J, Shearman JR, Sangsrakru D, Yoo- cha T, Jomchai N, et al. Characterization of the chloroplast genome sequence of oil palm (Elaeis guineensis Jacq.). Gene 2012; 500:172–180. doi: 10.1016/j.gene.2012.03.061 [DOI] [PubMed] [Google Scholar]
  • 37.Áquila R, Silva J, Silva JV, Almeida C. The first complete chloroplast genome sequence of Syagrus coronata: comparative analysis in Arecoideae (Arecaceae). Rev Bras Bot 2018. 41:393–401. [Google Scholar]
  • 38.Lopes AS, Pacheco TG, Nimz T, Vieira LN, Guerra MP, Nodari RO, et al. The complete plastome of macaw palm [Acrocomia aculeata (Jacq.) Lodd. ex Mart.] and extensive molecular analyses of the evolution of plastid genes in Arecaceae. Planta 2018; 247:1011–1030. doi: 10.1007/s00425-018-2841-x [DOI] [PubMed] [Google Scholar]
  • 39.Lopes AS, Pacheco TG, Silva ON, Cruz LM, Balsanelli E, Souza EM, et al. The plastomes of Astrocaryum aculeatum G. Mey. and A. murumuru Mart. show a flip-flop recombination between two short inverted repeats. Planta 2019; 250:1229–1246. doi: 10.1007/s00425-019-03217-z [DOI] [PubMed] [Google Scholar]
  • 40.Magnabosco JWS, Fraga HPF, Silva RS, Rogalski M, Souza EM, Guerra MP, et al. Characterization of the complete chloroplast genome of Butia eriospatha (Arecaceae). Genet Mol Biol 2020; 43:e20200023. doi: 10.1590/1678-4685-GMB-2020-0023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Gonçalves D.JP, et al. Under the rug: abandoning persistent misconceptions that obfuscate organelle evolution. Mol Phylogenet Evol 2020; 151:106903. doi: 10.1016/j.ympev.2020.106903 [DOI] [PubMed] [Google Scholar]
  • 42.Kwon EC, Kim JH, Kim NS. Comprehensive genomic analyses with 115 plastomes from algae to seed plants: structure, gene contents, GC contents, and introns. Genes Genomics 2020; 42:553–570. doi: 10.1007/s13258-020-00923-x [DOI] [PubMed] [Google Scholar]
  • 43.Li FW, Kuo LY, Pryer KM, Rothfels CJ. Genes translocated into the plastid inverted repeat show decelerated substitution rates and elevated GC content. Genome Biol Evol 2016; 8:2452–2458. doi: 10.1093/gbe/evw167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Yang M, Zhang X, Liu G, Yin Y, Chen K, Yun Q, et al. The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.). PLoS One 2010; 5:e12762. doi: 10.1371/journal.pone.0012762 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Huang YY, Matzke AJM, Matzke M. Complete sequence and comparative analysis of the chloroplast genome of coconut palm (Cocos nucifera). PLoS One 2013; 8:74736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Kim JS, Kim JH. Comparative genome analysis and phylogenetic relationship of order Liliales insight from the complete plastid genome sequences of two Lilies (Lilium longiflorum and Alstroemeria aurea). PLoS One 2013; 8:e68180. doi: 10.1371/journal.pone.0068180 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wu M-L, et al. Complete chloroplast genome of the medicinal plant Amomum compactum: gene organization, comparative analysis, and phylogenetic relationships within Zingiberales. Chin Med 2018; 13: 1–12. doi: 10.1186/s13020-017-0157-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Barrett CF, Baker WJ, Comer JR, Conran JG, Lahmeyer SC, Leebens-Mack JH, et al. Plastid genomes reveal support for deep phylogenetic relationships and extensive rate variation among palms and other commelinid monocots. New Phytol 2016; 209:855–870. doi: 10.1111/nph.13617 [DOI] [PubMed] [Google Scholar]
  • 49.Khan A, Khan IA, Heinze B, Azim MK. The chloroplast genome sequence of date palm (Phoenix dactylifera L. cv. ‘Aseel’). Plant Mol Biol Report 2012; 30:666–678. [Google Scholar]
  • 50.Yue F, Cui L, dePamphilis CW, Moret BME, Tang J. Gene rearrangement analysis and ancestral order inference from chloroplast genomes with inverted repeat. BMC Genomics 2008; 9:S25. doi: 10.1186/1471-2164-9-S1-S25 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Scarcelli N, Barnaud A, Eiserhardt W, Treier UA, Seveno M, d’Anfray A, et al. A set of 100 chloroplast DNA primer pairs to study population genetics and phylogeny in monocotyledons. PLoS One 2011; 6:e19954. doi: 10.1371/journal.pone.0019954 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Gunn BF. The phylogeny of the Cocoeae (Arecaceae) with emphasis on Cocos nucifera. Ann Mo Bot Gard 2004; 91:505–522. [Google Scholar]
  • 53.Hahn WJ. A phylogenetic analysis of the Arecoid line of palms based on plastid DNA sequence data. Mol Phylogenet Evol 2002; 23:189–204. doi: 10.1016/S1055-7903(02)00022-2 [DOI] [PubMed] [Google Scholar]
  • 54.Baker WJ, Savolainen V, Asmussen-Lange CB, Chase MW, Dransfield J, Forest F, et al. Complete generic-level phylogenetic analyses of palms (Arecaceae) with comparisons of supertree and supermatrix approaches. Syst Biol 2009; 58:240–256. doi: 10.1093/sysbio/syp021 [DOI] [PubMed] [Google Scholar]
  • 55.Meerow AW, et al. Phylogeny and historical biogeography of the cocosoid palms (Arecaceae, Arecoideae, Cocoseae) inferred from sequences of six WRKY gene family loci. Cladistics 2015; 31: 509–534. [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Berthold Heinze

17 Jun 2021

PONE-D-21-11791

The plastome sequence of Bactris gasipaes and evolutionary analysis in tribe Cocoseae (Arecaceae)

PLOS ONE

Dear Dr. do Nascimento Vieira,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Considering the reports of three reviewers, and my own assessment, I think a minor revision, which nevertheless answers to the suggestions in the reviews carefully, is the appropriate thing to do. Please check the details further below!

Please submit your revised manuscript by Aug 01 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Berthold Heinze

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found athttps://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments (if provided):

We have received three reviews for this manuscript. All authors emphasize that the study is straightforward and mostly well done. Normally, I am reluctant to recommend accepting manuscript which describes chloroplast sequencing, because this has become routine work, really (but be aware to rely too heavily on refernce-guided assembly! Have a look at a book chapter where I rather recommend to publish such data as "data reports": https://doi.org/10.1007/978-1-0716-0997-2_5).

So my recommendation is to include a Phoenix species, as some reviewers have done, and mention and discuss the study recommended (with the 100 primers for monocotyledon chloroplasts).

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is a straightforward but sound description of the newly sequenced plastome of the economically important palm, Bactris gasipaes. The properties of the plastome are analysed in a simple phylogenetic framework including seven other species from tribe Cocoseae plus a small outgroup. The analysis is fine, the manuscript is well written and the discussion makes sense. My only comment is that the authors have overlooked a paper on phylogenetically informative plastid regions in palms: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0019954 I think the discussion would benefit from a comparison of the variable regions identified here to those of Scarcelli et al.

Reviewer #2: In this paper, authors sequenced and characterized the plastome of Bactris gasipaes (Bactridinae) and compared it with eight species from the three Cocoseae sub-tribes to perform comparative analysis and to identify hypervariable regions.

Reviewer #3: The main study objective is the use of the plastidial regions and the nuclear markers PRK and RPB2 for a suitable approach for phylogenetic inferences and produces phylogenetic trees with well-resolved and the expected topologies in tribe Cocoseae. Without using another subfamily far from the Cocoseae subtribe, e.g. P dactylifera, the conclusion cannot be drawn with confidence.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: M. Kamran Azim

Reviewer #3: Yes: Ibrahim Al-Mssallem

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: The plastome sequence of Bactris gasipaes and evolutionary analysis in tribe 24052021.docx

PLoS One. 2021 Aug 24;16(8):e0256373. doi: 10.1371/journal.pone.0256373.r002

Author response to Decision Letter 0


31 Jul 2021

Curitiba, July 31, 2021

Dear Dr. Berthold Heinze

Academic Editor of PLOS ONE

Manuscript ID: PONE-D-21-11791

Manuscript resubmission Requirement

We received the results of evaluation of the refereed MS. We thank the corrections and recommendations, which improved the MS overall quality. All the queries, recommendations and suggestions were considered, as follow:

Academic editor

We have received three reviews for this manuscript. All authors emphasize that the study is straightforward and mostly well done. Normally, I am reluctant to recommend accepting manuscript which describes chloroplast sequencing, because this has become routine work, really (but be aware to rely too heavily on reference-guided assembly! Have a look at a book chapter where I rather recommend to publish such data as "data reports": https://doi.org/10.1007/978-1-0716-0997-2_5). So my recommendation is to include a Phoenix species, as some reviewers have done, and mention and discuss the study recommended (with the 100 primers for monocotyledon chloroplasts).

R: Thank you for the positively evaluation and the constructive comments. We assembled the plastome using a de novo strategy. The reference-guided strategy was only used to order the contigs. The final assemble was verified by read mapping to ensure no contigs were mistakenly united.

We have one species from subfamily Coryphoideae (the same subfamily of Phoenix) in the phylogenetic analysis. Also, Archontophoenix alexandrae (F.Muell.) H.Wendl. & Drude from subfamily Arecoideae, but Areceae subtribe.

We included the recommended study (Scarcelli et al. 2011) in the discussion.

#Reviewer 1

This is a straightforward but sound description of the newly sequenced plastome of the economically important palm, Bactris gasipaes. The properties of the plastome are analysed in a simple phylogenetic framework including seven other species from tribe Cocoseae plus a small outgroup. The analysis is fine, the manuscript is well written and the discussion makes sense. My only comment is that the authors have overlooked a paper on phylogenetically informative plastid regions in palms: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0019954 I think the discussion would benefit from a comparison of the variable regions identified here to those of Scarcelli et al.

R: Thank you for the positively evaluation and the constructive comments. The discussion were improved and the following sentences were included: “Also, Scarcelli et al. [51] reported 100 primers for phylogeny in monocots. However, four of the hypervariable regions reported in our study were not contemplated (e.g., psaC-ndhE, petN-psbM, accD-psaI, trnS-trnG). Among them. trnS-trnG and accD-psaI primers designed by Scarcelli et al . [51] showed no amplification in Arecaceae, and petN-psbM and psaC-ndhE were not mentioned, probably due to gene rearrangements in monocots plastomes. This reinforces that the highly variable regions vary between clades, and their identification may be necessary for distinct taxonomic levels.”.

#Reviewer 2

In this paper, authors sequenced and characterized the plastome of Bactris gasipaes (Bactridinae) and compared it with eight species from the three Cocoseae sub-tribes to perform comparative analysis and to identify hypervariable regions.

R: Thank you for the positively evaluation.

#Reviewer 3

The main study objective is the use of the plastidial regions and the nuclear markers PRK and RPB2 for a suitable approach for phylogenetic inferences and produces phylogenetic trees with well-resolved and the expected topologies in tribe Cocoseae. Without using another subfamily far from the Cocoseae subtribe, e.g. P dactylifera, the conclusion cannot be drawn with confidence.

R: In the phylogenetic analysis, we included Brahea brandegeei (Purpus) H. E. Moore from the subfamily Coryphoideae (the same subfamily of Phoenix) and Archontophoenix alexandrae (F.Muell.) H.Wendl. & Drude from subfamily Arecoideae, but Areceae subtribe. These species as outgroup contemplates the suggestion. Also, we included the average depth of coverage of 67.64 (SD = 24.32).

Best regards,

Leila do Nascimento Vieira

leilavieira@ufpr.br

Corresponding author

Attachment

Submitted filename: Response to Reviewers.doc

Decision Letter 1

Berthold Heinze

5 Aug 2021

The plastome sequence of Bactris gasipaes and evolutionary analysis in tribe Cocoseae (Arecaceae)

PONE-D-21-11791R1

Dear Dr. do Nascimento Vieira,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Berthold Heinze

Section Editor

PLOS ONE

Additional Editor Comments (optional):

Many thanks for submitting the revision. I see that all comments and recommendations were either followed up, or sufficiently discussed, in the letter and/or the manuscript itself. Also from a copyediting view, the manuscript seems ready for print.

Reviewers' comments:

Acceptance letter

Berthold Heinze

9 Aug 2021

PONE-D-21-11791R1

The plastome sequence of Bactris gasipaes and evolutionary analysis in tribe Cocoseae (Arecaceae)

Dear Dr. do Nascimento Vieira:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Berthold Heinze

Section Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. GenBank accession numbers of the nucleotide sequences used in our analysis.

    (DOC)

    S1 Fig. Phylogenetic tree based on maximum likelihood inference using the ten plastome regions with greatest PIS% values.

    The numbers above the branches are maximum likelihood bootstrap values (1000 replicates).

    (TIF)

    Attachment

    Submitted filename: The plastome sequence of Bactris gasipaes and evolutionary analysis in tribe 24052021.docx

    Attachment

    Submitted filename: Response to Reviewers.doc

    Data Availability Statement

    The plastome sequence was deposited in GenBank: MW054718.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES