Skip to main content
Frontiers in Plant Science logoLink to Frontiers in Plant Science
. 2019 Feb 28;10:218. doi: 10.3389/fpls.2019.00218

Sequencing the Plastid Genome of Giant Ragweed (Ambrosia trifida, Asteraceae) From a Herbarium Specimen

Gaurav Sablok 1,2, Ali Amiryousefi 1,2, Xiaolan He 1,2, Jaakko Hyvönen 1,2, Péter Poczai 1,2,*
PMCID: PMC6403193  PMID: 30873197

Abstract

We report the first plastome sequence of giant ragweed (Ambrosia trifida); with this new genome information, we assessed the phylogeny of Asteraceae and the transcriptional profiling against glyphosate resistance in giant ragweed. Assembly and genic features show a normal angiosperm quadripartite plastome structure with no signatures of deviation in gene directionality. Comparative analysis revealed large inversions across the plastome of giant ragweed and the previously sequenced members of the plant family. Asteraceae plastid genomes contain two inversions of 22.8 and 3.3 kb; the former is located between trnS-GCU and trnG-UCC genes, and the latter between trnE-UUC and trnT-GGU genes. The plastid genome sequences of A. trifida and the related species, Ambrosia artemisiifolia, are identical in gene content and arrangement, but they differ in length. The phylogeny is well-resolved and congruent with previous hypotheses about the phylogenetic relationship of Asteraceae. Transcriptomic analysis revealed divergence in the relative expressions at the exonic and intronic levels, providing hints toward the ecological adaptation of the genus. Giant ragweed shows various levels of glyphosate resistance, with introns displaying higher expression patterns at resistant time points after the assumed herbicide treatment.

Keywords: Ambrosia trifida, chloroplast genome, genome skimming, glyphosate, herbicide resistance, invasive plants, noxious weeds, ragweed

Introduction

Plant invasions are accelerating on a global scale and cannot be fully understood without analyzing the genetic background of the source and invading populations. Biological invasions may threaten both global and local biodiversity, ecosystem functions, agriculture, and public health (Vitousek et al., 1997). Human-induced global climate change might further complicate the effects of the invasions. In Europe, as in many other regions of the world, the number of invasive plant species has increased considerably in the past 200 years due to increased trade, tourism, and disturbance (Pyšek et al., 2009). Europe suffers from invasive species in many ways, and a crude estimate of monetary impact (diminished yield and control measures) suggests that these additional costs due to invasive species exceed €12 billion annually (Kettunen et al., 2009), although this may be an underestimate (Vilá et al., 2010).

The genus Ambrosia L. (ragweeds) of the Asteraceae (tribe Heliantheae, subtribe Ambrosiinae) includes 40–50 species (Payne, 1964; Martin et al., 2018). Most are dioecious desert shrubs, both annual and perennial, while some species are weedy pioneers that have become established as exotics outside of their original range (Martin et al., 2018). Based on their current diversity and distribution, Ambrosia most likely evolved in the SW USA and adjacent Mexico, then subsequently radiated to many areas of North America (Payne, 1964). Giant ragweed (Ambrosia trifida L.) has raised awareness as an invasive plant in Europe — together with its relatives, common (Ambrosia artemisiifolia L.) and perennial ragweed (A. psilostachya DC.), it represents an example of aggressive invasion all over the continent. Ragweeds are among the most economically destructive weeds, as they interfere with the growth and establishment of various crops (Cowborough et al., 2003; Kong et al., 2007). A. trifida was primarily a weed of floodplains and ditch banks, but in the past decades it has expanded its native range in North America, causing considerable economic loss in the Corn Belt (Regnier et al., 2016). It can dominate in common croplands due to its rapid growth and large leaf area. Notably, its ability to adjust its resource utilization responses and extend its germination period has allowed it to tolerate changing environments, adding to its success as an invasive species (Abul-fatih and Bazzaz, 1979). Ragweeds also produce large amounts of pollen that causes allergenic rhinitis, presenting a burden to public health (Kazinczi et al., 2008). Currently half of all hay fever cases in North America are caused by ragweeds (Taramarcaz et al., 2005), making it one of the most powerful aeroallergens. In fact, it has been estimated that areas in Europe affected by severe ragweed allergy are likely to increase substantially by the year 2100 (Rasmussen et al., 2017). Pollen production of these plants is enhanced by special anemophily-specialized floral organs, including the pistillodium (thought to be universal within Ambrosia), which is a vestigial pistil in staminate flowers that forces pollen away from the corolla (Payne, 1963; Martin et al., 2009; Martin et al., 2010).

Giant ragweed is often controlled with the broad-spectrum herbicide glyphosate [N-(phosphonomethyl) glycine], which has become a dominant herbicide worldwide since its introduction in 1974. Although glyphosate belongs to the herbicide group with the greatest increase in resistance cases, it is still the most widely used non-selective systemic herbicide worldwide (Bracamonte et al., 2018), despite its debated environmental effects (Vandenberg et al., 2017). Recently, it was shown that glyphosate affects the gut microbiome of honeybees, inducing susceptibility to infections and death from pathogenic bacteria (Motta et al., 2018). The continuous application of glyphosate, along with reduced use of other weed management practices has also caused many weeds, including A. trifida, to become glyphosate-resistant (Ganie et al., 2017). Since the introduction of transgenic glyphosate-resistant (GR) crops in 1996, 19 weeds have evolved resistance to glyphosate; about half evolved in GR crops (Heap, 2018). In 2016, the area of transgenic crops reached 185.1 million ha globally, and approximately 80% were planted solely with GR crops (International Service for the Acquisition of Agri-biotech Applications [ISAAA], 2016). This means that growers continually applied glyphosate alone over these vast areas to control genetically variable and prolific weeds. The high initial efficacy of glyphosate often leads to a decline in the use of other herbicide options and less investment by industry to discover new herbicide active ingredients (Green and Owen, 2011). No matter how effective the herbicide is, weed management programs cannot rely heavily on only one tactic otherwise weeds will ultimately adapt and survive in large numbers (Green and Owen, 2011).

Our knowledge about glyphosate resistance is still limited (Duke and Powles, 2008; Sammons and Gaines, 2014), and several mechanisms have been reported in various weeds (Norsworthy et al., 2011). A specific type of resistance, called target-site resistance, may occur when single or several mutations take place in the conserved region of the EPSPS gene and/or its duplications (Yu et al., 2015; Patterson et al., 2018; Sammons et al., 2018). This gene codes the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) enzyme that is inactivated by glyphosate, thus disrupting aromatic amino acid synthesis in plants. Such target-site resistance appears rapidly in weed populations. However, the evaluation of several glyphosate-resistant A. trifida accessions revealed no changes in the EPSPS gene sequence and copy number. Rather, a form of non-target-site resistance (NTSR) known as metabolic resistance has been suggested (Moretti et al., 2018; Van Horn et al., 2018). This type of metabolic resistance involves more changes than just the target site alone. Plants with NTSR rapidly metabolize or break down the herbicide before it is able to cause toxic effects (Sammons and Gaines, 2014), thus representing a major issue for the chemical control of weeds. NTSR is hard to manage because weeds have unpredictable resistance to herbicides with different chemical structures and/or target proteins (Délye, 2013). Furthermore, recent studies have demonstrated that weeds with NTSR can transmit cross-resistance to other herbicides with different modes of action, even to those not yet marketed (Petit et al., 2010). This is because NTSR is much more of a general adaptive response to herbicides (Yuan et al., 2007). According to the current view, herbicide application is a stress to the plant, which triggers response pathways in weeds irrespective of their sensitivity to the herbicide (Délye, 2013). These stress–response pathways are pre-existent in plant species, coded by multiple genes and alleles. The selection of these genes starts with the uneven application of the herbicide in the field. For example, some individuals may survive herbicide spraying because they receive less of the chemical, hence their genes are carried to future generations, providing resistance to some degree. Further NTSR evolution likely requires several generations of sexual recombination within a population until individuals accumulate enough alleles to possess a resistance level that would enable survival of the full dose of herbicide (Délye et al., 2013). Recurrent selection experiments have convincingly provided support for this theory (Neve and Powels, 2005; Busi et al., 2012). Studies of the genetic basis of NTSR are sparse, and only the key role of glutathione transferase genes have been investigated (Cummins et al., 2013). Research addressing NTSR in weeds has long been hampered by the absence of ‘omics’ resources for the vast majority of weed species. The increasing accessibility of genomics and transcriptomics supported by next-generation sequencing (NGS) technologies should rapidly enable the identification of genes governing NTSR (Délye et al., 2013). For such purposes, transcriptomes are already available for various weeds (Peng et al., 2010; Riggins et al., 2010; Zhang et al., 2012) including A. artemisiifolia (Taller et al., 2016b; Virág et al., 2016).

Genetic research of ragweeds has mostly been focused on A. artemisiifolia, with the use of microsatellite markers to determine the origins of invading populations (Genton et al., 2005a,b; Gaudeul et al., 2011). Population genetics of A. artemisiifolia has also been investigated (Cseh and Taller, 2008; Chun et al., 2010; Gladieux et al., 2011; Mátyás et al., 2012; Martin et al., 2016; von Boheemen et al., 2017), and genomic tools are under development for the analysis of historical specimens (Sánchez Barreiro et al., 2017) and fresh plant material (Cseh et al., 2009; Cseh, 2010; Mátyás et al., 2011; Taller et al., 2016a; Virág et al., 2016; Nagy et al., 2017). The history of common ragweed distribution was revealed based on comprehensive studies of herbarium specimens (Chauvel et al., 2006; Csontos et al., 2010; Martin et al., 2014). However, the genetics of other ragweed species in Europe have been less studied. As such, the relationships of ragweeds are poorly understood and thus a taxonomic update of Payne (1964), with newly described species and a solid phylogenetic framework of the genus is necessary. Toward this goal, herbarium specimens accumulated over the past 100 years that are readily available for sampling would be valuable material to assess ragweed diversity, and to perform evolutionary studies. Accordingly, a recent study by Martin et al. (2018) demonstrated the utility of herbarium specimens in phylogenetic analyses.

Until recently, the use of herbarium collections for obtaining molecular-level data has been difficult due to the generally poor condition of their DNA (Besnard et al., 2014). However, advances in NGS technologies and subsequent development of techniques for extracting historical DNA (Gutaker et al., 2017; Shepherd, 2017) have recently made herbarium specimens an attractive option. In fact, most NGS methods are designed for using short fragmented DNA molecules (100–400 bp) as templates, which suits herbarium specimens (Staats et al., 2013). Gutaker and Burbano (2017) demonstrated that NGS is better suited than PCR-based approaches for ancient DNA sequencing, because adapter ligation allows interrogation of the molecule ends so that longer molecules can be obtained for sequencing. As such, this approach has enabled access to genetic information on type or other important historical specimens, which are crucial for resolving taxonomic uncertainties. Moreover, NGS is both feasible and cost-effective (Bakker et al., 2015; Bakker, 2017). Staats et al. (2011) showed that it is also possible to use genome skimming to analyze DNA that is otherwise too degraded for PCR-based approaches, thus offering the possibility to include rare and extinct species from natural history collections in phylogenetic analyses. Several phylogenomic studies have explored the potential of herbarium specimens of different ages (Staats et al., 2013; Besnard et al., 2014; Aubriot et al., 2018; McManus et al., 2018; Zeng et al., 2018). For example, the entire nuclear genome of a 43-year-old Arabidopsis thaliana (L.) Heynh. (Brassicaceae) herbarium specimen with high and uniform sequence coverage has been reported using a reference-based assembly approach (Staats et al., 2013). NGS techniques can also be used to further acquire mitochondrial genomic data, which provides the possibility to address questions related to hybridization and introgression (e.g., Rydin et al., 2017; Vargas et al., 2017), as well as assess the phylogenetic congruence between topologies based on mitochondrial regions and plastomes (Van de Paer et al., 2018). For example, the evolutionary history of an extinct plant lineage Hesperelaea A. Gray, known from an 1875 collection from Guadalupe Island, has been revealed based on both the plastome (Zedane et al., 2016) and mitogenome (Van de Paer et al., 2016).

Herbarium specimens also offer a unique opportunity to study the plastid genome compartments preserved in specimens. Plastomes of angiosperms are small (typically approximately 120–190 kb in size) and have a highly conserved quadripartite structure containing two inverted repeats (IRa and IRb), which separate the large and small single-copy regions (LSC and SSC). The plastid genome includes 110 to 130 genes that participate primarily in photosynthesis, transcription, and translation (Palmer, 1985; Olmstead and Palmer, 1994; Daniell et al., 2016). Their conserved gene content, order, and organization make plastid genomes fairly well-suited for studies about gene loss, structural rearrangement, pseudogenes, or additional mutation events that might be characteristic of some lineages (Poczai and Hyvönen, 2013, 2017; Beck and Semple, 2015; Wicke and Schneeweiss, 2015). Herbarium specimens appear to yield enough reads for effective plastome assembly, similar to fresh specimens (Bakker et al., 2015). This is because plant cell walls are composed of cellulose microfibrils (Cosgrove, 1999) that provide additional protection for preservation of the genomic DNA (Bakker, 2017). The total assembly length of plastid genomes is comparable between fresh material and herbarium specimens. Those from herbarium specimens simply need slightly more editing and “scaffolding” because on average they yield shorter contigs (Bakker et al., 2015; Bakker, 2017; Twyford and Ness, 2017).

In the current study, we sought to develop further resources to facilitate genomic research of ragweeds. By using a herbarium specimen, we aimed to demonstrate the utility of such collections for weed genomics. We present the complete chloroplast genome sequence of giant ragweed (A. trifida) using high-throughput sequencing, and report the assembly, annotation, gene expression, and unique structure characterization of its plastome. Comparison of the gene order and inverted repeat (IR) length across Asteraceae is also presented, as well as transcriptional profiling against glyphosate resistance.

Materials and Methods

DNA Isolation and Plastome Sequencing

Giant ragweed leaves (0.5 g) were obtained from a herbarium specimen of the Finnish Museum of Natural History (H), University of Helsinki, Finland (H1645542; Mansfield, OH, United States, 1886). Leaf samples were rinsed with deionized water and 70% ethanol, and total genomic DNA was isolated using the NucleoSpin Plant II kit (Macherey-Nagel, Düren, Germany). All work was carried out in a dedicated laboratory with UV sterilized equipment; blank samples were processed during DNA extraction. DNA concentration was measured with a Qubit fluorometer (Invitrogen) and verified on an 0.8% agarose gel. A paired-end genomic library was constructed using the Nextera DNA library preparation kit (Illumina, San Diego, CA, United States). Fragment analysis was conducted with an Agilent Technologies 2100 Bioanalyzer using a DNA 1000 chip. Sequencing was performed on an Illumina MiSeq platform from both ends with 150 bp read length.

Genome Assembly and Annotation

To obtain high-quality clean data, raw reads were first filtered by removing low-quality reads with a sliding window quality cutoff of Q20 using Trimmomatic (Bolger et al., 2014). Plastid reads were obtained by reference mapping using A. artemisiifolia L. (MG019037; NC035875) plastid genome (Amiryousefi et al., 2017; Nagy et al., 2017) with Geneious R10 (Kearse et al., 2012). Clean reads were used for de novo assembly, which was performed with three different algorithms of the programs Velvet v1.2.10 (Zerbino and Birney, 2008), Geneious assembler, and NOVOPlasty (Dierckxsens et al., 2017). General genome structure and ambiguous nucleotide positions were evaluated by an additional assessment. Sequence inconsistencies were checked by mapping reads to the plastome as described in Wysocki et al. (2015). The final de novo assembly was used to map the reads and calculate the mean coverage of the plastid genome using Geneious R10. Sanger-based gap closure and IR junction verification was performed following Poczai and Hyvönen (2017). Annotation was performed using a two-step procedure. First, we transferred all annotations in Geneious from the reference A. artemisiifolia to the A. trifida genome using a similarity threshold of 80%. The genome sequence was also annotated using GeSeq (Tillich et al., 2017), since this program performed better than other algorithms in a comparative study (Amiryousefi et al., 2018b). In a second step, we inspected, compared, and curated all annotations manually. This included extracting all coding regions per genome, confirming start/stop codons and features for each gene, and aligning extracted regions across all plastid genomes to confirm approximate gene lengths based on their conserved gene order. Final curated annotations were transferred to the complete plastid genome sequence of A. trifida, which was deposited in GenBank (NC036810). The resulting genome map was drawn with OGDraw v.1.2 (Lohse et al., 2007).

Comparative Plastome Analysis of Sequenced Asteraceae With Ambrosia trifida

To highlight the significance of the sequenced A. trifida plastome compared to the previously available plastomes of Asteraceae, we first investigated the repeat proportion of the genome using MISA (Thiel et al., 2003; Beier et al., 2017) and REPUTER (Kurtz et al., 2001) to identify the stretches of the perfect, compound, and long-forward repeats. MISA was used to analyze the perfect microsatellites [often abbreviated as simple sequence repeats (SSRs)] with a defined length of n = 10 in mononucleotide repeats, n = 6 in dinucleotide repeats, and n = 3 in tri-, tetra-, penta-, and hexa-nucleotide repeats. For the compound repeats, two defined SSRs should be interrupted by 100 bp. For comparative analysis, a repeat profile was mined across the 10 species to observe any divergence in the occurrence of the repeat motifs. For the identification of the forward repeats, REPUTER was used with a defined length of 30 bp and a hamming distance of 3. For the identification of microstructural events, pairwise alignments of the Asteraceae plastomes with A. trifida were performed using LASTZ (Harris, 2007) and MuMMER version 3.1 (Kurtz et al., 2004). The show-snps feature was used to evaluate the identification of plastomic variations. Once microstructural events were identified, they were further plotted using Circos (Krzywinski et al., 2009) and ggbio (Yin et al., 2012).

Comparative Phylogenomics

We based our sampling on the results of Panéro and Funk (2002), updated in Panéro et al. (2014), considering the availability of currently deposited plastid genomes in the Organelle Genome Resources of NCBI (Wolfsberg et al., 2001) accessed on 12.12.2017. Of the currently accepted 13 subfamilies of Asteraceae, only three (Carduoideae [8], Cichorioideae [8], and Asteroideae [128]) had complete plastid genome sequences deposited with a prominent bias in the genomes of Asteroideae. Genomes within subfamilies were chosen to represent each (super)tribe from the available sequences. For Carduoideae, we included one species from all accessible genera that were sequenced and available at the time we conducted the phylogenetic analysis. In Cichorioideae, only two — Lactuca L. and Taraxacum F. H. Wigg — were available. From Asteroideae, we aimed to include at least one species from each of the 21 tribes. We also included genome sequences of Carum carvi L. and Foeniculum vulgare Mill. of Apiaceae from the campanulid clade (Angiosperm Phylogeny Group, 2016) as outgroups. Accession numbers are provided in Supplementary Table S1. For phylogenetic analysis, we used a matrix of 50 protein-coding genes representing 43 species, with a total concatenated matrix alignment of 31,356 bp. We estimated congruence between different sources of information by comparing the whole genome alignment matrix, coding region matrix, and non-coding region matrix (intergenic regions). We observed ambiguity in the repeat expansion, which might affect the gene composition. Since large microstructural variations such as single nucleotide polymorphism (SNPs), and insertions/deletions (INDELs) were observed, phylogenetic analyses were limited to the coding regions (Curci et al., 2015).

From all the sequenced Asteraceae and A. trifida, coding alignments were constructed for the following fifty protein-coding genes: cemA, infA, matK, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK, petA, petG, petL, psaA, psaB, psaC, psaI, psaJ, psbA, psbB, psbC, psbD, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, rbcL, rpl14, rpl20, rpl22, rpl32, rpl33, rpl36, rpoA, rpoB, rps2, rps3, rps4, rps8, rps11, rps14, rps18, and ycf4, using MACSE (Ranwez et al., 2011), which uses a frameshift alignment algorithm for aligning coding sequences. Following the MACSE alignments, each frameshift was masked and the subsequent alignment was trimmed using trimAL (Capella-Gutiérrez et al., 2009). Finally, prior to the construction of the super-matrix, terminal stop codons were identified and subsequently removed from the trimmed alignments. Concatenation of the phylogenetic matrix was performed using SequenceMatrix version 1.8 (Vaidya et al., 2011). The concatenated sequence matrix was analyzed by maximum likelihood (ML) and parsimony methods. ML analyses were performed with IQ-TREE (Nguyen et al., 2015). The best fitting model (GY+F+I+G4) was determined by ModelFinder (Kalyaanamoorthy et al., 2017) as implemented in IQ-TREE according to the Akaike information criterion (AIC), and Bayesian information criterion (BIC). To assess branch support, all IQ-TREE analyses used the ultrafast bootstrap approximation (UFBoot; Hoang et al., 2018) with 1,000 replicates and the SH-like approximate likelihood ratio test (SH-aLRT) also with 1,000 bootstrap replicates.

Parsimony analyses were performed using nona (Goloboff, 1994) within winclada (Nixon, 2002) shell. Prior to analyses, the command “mop uninformative characters” was used to exclude parsimony uninformative characters. This resulted in a matrix of 3,769 characters. Two separate searches were performed (using processor time as a seed to randomize the order of the terminals) with the following settings: hold 30,000 (holding defined maximum number of trees), 100 replications (search performed with multiple tree-bisection-reconnection algorithm multmax), hold/3 (keeping three starting trees for each replication). In addition, we also performed larger analyses by keeping 20 starting trees for each replication (hold/20) with 1,000 replications. To assess whether the longer genes have any compositional heterogeneity, we created a partition-specific matrix of the long genes ndhD, ndhH, psaA, psaB, psbA, psbB, psbC, and rbcL and evaluated the skewness and the compositional heterogeneity across the combined and partition-specific variations.

To reveal the placement of the genes near the inverted repeat junction sites of A. trifida, an IR plot of this species and nine other Asteraceae were obtained with IRscope (Amiryousefi et al., 2018a).

Transcriptome Profiling

Despite their small size, plastids represent a classical example of miniature genomes containing mono- and polycistronic transcripts. With the advent of plastome sequencing, there has been considerable interest in understanding the transcriptional activity of plastid genes, as well as other events related to RNA editing and post-transcriptional splicing. To understand the transcriptional divergence associated with glyphosate resistance, which is defined as the amount of the transcriptional plasticity between the sensitive and resistant treatment in the giant ragweed plastome, we mapped the RNA-seq reads previously deposited to the currently sequenced A. trifida plastid genome using ChloroSeq (Castandet et al., 2016), with the defined exonic and intronic localization in the annotated GFF3 (NCBI SRA: PRJNA267208; Supplementary Table S2). For mapping the RNA-seq reads, tophat2 (Kim et al., 2013) was used with -g 2 and -no-novel-junctions functions to minimize the identification of the novel splice sites. IR repeat diversity, mapping read estimation, and genome coverage was subsequently calculated using SAMtools version 1.15 (Li et al., 2009) and bedtools version 2.25 (Quinlan and Hall, 2010). Expression values in terms of the RPKM values were mapped to the genome features and were visualized using Circos (Krzywinski et al., 2009) and ggbio (Yin et al., 2012).

Results and Discussion

Genome Assembly and Plastome Features

Plastid DNA sequencing generated 145,207 paired-end reads, with an average fragment length of 267 bp. De novo assemblies of reads resulted a total of 38 contigs with an N50 of 13,231 bp. Ten contigs were alignable and covered 100% of the A. artemisiifolia reference genome. Mapping of the reads to the de novo assembled plastid genome resulted in a 180× mean coverage. The chloroplast genome of A. trifida was 152,040 bp and showed a quadripartite structure of long (83,966 bp) and small (17,894 bp) single-copy regions, separated by two inverted repeat regions of 25,090 bp (Figure 1). As in other species of Asteraceae, the A. trifida chloroplast genome contains 80 protein-coding, 28 tRNA, and four rRNA genes comprising a total of 112 unique genes (Supplementary Tables S3, S4). The distribution of the genes also exhibited similarity with other Asteraceae and angiosperms, with 13 genes found in the SSC, 19 genes in the IR, and 80 genes in the LSC. The overall GC content of the chloroplast genome was 37.2%. Only 21% of the whole genome is non-coding. There were 18 intron-containing genes in the giant ragweed plastome (Supplementary Table S4). From these, 16 (10 protein-coding and six tRNA) genes had a single intron, and two (ycf3, clpP) had two introns. 12 (eight protein-coding and four tRNA) genes are located in the LSC, one (protein-coding) gene in the SSC, and five (three protein-coding and two tRNA) genes in the IR region. The largest intron (2,565 bp) was located in the trnK-UUU gene, including the highly diverse matK gene. The trnK intron is of interest because it represents an unusual form of a group II intron derived from a mobile group of mitochondrial-like intron ORFs (Hausner et al., 2006). The rps12 gene was trans-spliced with the 5′ end exon located in the LSC region and the two remaining exons found in the IR regions. We also observed three cases of overlapping genes, namely psbD/psbC, atpE/atpB, and rps3/rpl22. In the ndhD and psbL genes, we observed that ACG is used as an alternative start codon instead of the common AUG in A. trifida and in most species of Asteraceae. It has been shown that this exceptional ACG start codon is RNA edited in all Solanaceae (Amiryousefi et al., 2018b) except Datura stramonium L., while the start codon of psbL is unedited. In Asteraceae, the canonical AUG form is found for both psbL and ndhD in Ageratina adenophora (Spreng.) King & H. Rob., Pericallis hybrida (Willd.) R. Nordenstam, Silybum marianum (L.) Gaertn. However, only ndhD possesses the AUG start codon in Centaurea diffusa Lam., Chrysanthemum indicum L., Jacobaea vulgaris Gaertn., Mikania micrantha Kunth and Saussurea involucrata Matsum. & Koidz.

FIGURE 1.

FIGURE 1

Plastome map of Ambrosia trifida. Gene features and distribution of the direction of genes with respect to their localization. Genes on the inner side of the outer circle are transcribed counterclockwise while those on the outer side are transcribed clockwise. Genes belonging to different functional groups are color-coded; the GC and AT content of the genome are plotted on the inner circle as dark and light gray, respectively. Inverted repeats (IR); large single copy (LSC) regions; small single copy (SSC) regions.

Genomic Repeats and Rearrangements

Microsatellites (or SSRs) are valuable molecular markers of high-degree variations within the same species. These markers have been previously used in population genetics and polymorphism investigation of ragweeds (Genton et al., 2005a,b; Gaudeul et al., 2011). We analyzed the distribution of SSRs according to the defined length criteria in the giant ragweed plastome. A total of 100 SSRs were observed, with eight present in compound form. To understand whether their distribution varies among Asteraceae species, we further compared 10 previously sequenced plastomes (Table 1). The most abundant motifs of the SSRs were poly-A/T stretches characteristic of angiosperm plastid genomes. These results are consistent with previous findings that the SRs are generally composed of short poly-A or poly-T repeats and rarely contain tandem G or C repeats h. In addition to mononucleotide stretches, we observed tetra- (AGAT/ATCT) and hexa- (AAGGAT/ATCCTT) nucleotide repeats, which could be of specific interest for future cpSSR marker development in cross-species amplifications or population genetic studies of ragweeds. We further identified 25 larger repeats (>30 bp) using the defined parameters in REPUTER (Kurtz et al., 2001). Most of these long repeats were present in the intergenic spacers (Table 2). Among these repeats, a large portion were found within ycf genes, which have been shown to have high divergence rates in most embryophyte lineages and have undergone pseudogenization (Michelangeli et al., 2003). The nucleotide sequence similarity among embryophyte ycf2 is extraordinarily low compared to other plastid-encoded genes: it is less than 50% across bryophytes, ferns, and seed plants (Wicke et al., 2011). This divergence is not surprising since ycf genes have experienced many insertions/deletions. For example, these deletions account for the reduction in the chloroplast genome size among members of the Graminid clade (Poczai and Hyvönen, 2017).

Table 1.

Distribution and summary of the shared SSRs across the Asteraceae plastomes.

Repeats Ambrosia trifida Helianthus annuus Jacobaea vulgaris Taraxacum officinale Echinacea purpurea Artemisia annua Soliva sessilis Westoniella kohkemperi Carthamus tinctorius Ambrosia artemisiifolia
A/T 30 39 34 19 38 39 16 49 19 36
AAC/GTT 4 4 4 4 4 4 4 4 3 4
AAG/CTT 28 24 24 20 25 21 23 22 20 28
AAT/ATT 16 18 26 20 19 26 34 40 24 19
ACC/GGT 2 3 NIL 2 3 3 3 3 3 2
ACG/CGT 1 1 1 1 1 NIL NIL 1 1 1
ACT/AGT 1 1 1 1 1 NIL 2 NIL NIL 1
AGC/CTG 7 7 6 7 7 7 7 7 6 7
AGG/CCT 3 3 4 2 2 4 3 3 3 3
ATC/ATG 3 3 3 3 3 5 4 3 3 3
AAAG/CTTT 1 1 1 3 1 1 1 NIL 1 1
AAAT/ATTT 2 2 5 1 2 7 3 9 3 4
AGAT/ATCT 1 1 NIL NIL 1 NIL NIL 2 NIL 1
AAGGAT/ATCCTT 1 NIL NIL NIL 1 NIL NIL NIL NIL 1

Table 2.

Summary of the identified forward repeat stretched across the Ambrosia trifida plastome.

No Type Location Region Repeat unit Period size (bp) Copy nr.
1 P ycf3/ndhA Intron LSC/SSC CAGAACCGTACATGAGATTTTCATCTCATACGGCTCCTC 41 2
2 P rps12-ycf16 IGS IR CAGAACCGTACATGAGATTTTCA[CT]CTCATACGGCTCCTC 39 2
3 P rps12-ycf15 IGS IR TATTAGATTAGTCTATTAATTCATATTAGATTAGTCT 37 2
4 F pdbE – petL IGS LSC ATTCATGAATTGATTAGAATATTGCCGCAATTG 34 2
5 F ycf2 Gene IR TGACGATATTGATGCTAGTGACGATAT 27 2
6 P psbT-psbN IGS LSC AATTGAAGTAATGAGCCTCCCAAT 24 2
7 F rps12-ycf15 IGS IR CTATTAGATTAGTCTATTAATTCA 23 2
8 F rpl32-ndhF IGS SSC ATAAAAATATTCAATAAGTATAA 23 2
9 F trnD – trnY IGS LSC TTCTCTCGTATCAGGTAT 18 3
10 F ycf1 Gene SSC AATGGAAATAGAAGAAG 18 3
11 F psbK -psbI IGS LSC ATACCTTATTAGC 13 4

IGS, intergenic spacer; IR, inverted repeat; LSC, large single copy region; SSC, small single copy region; F, forward repeat; P, palindromic repeat.

Asteraceae plastid genomes contain two inversions of 22.8 and 3.3 kb as compared with the outgroup terminals Carum carvi and Foeniculum vulgare of Apiaceae. The larger inversion is located between the trnS-GCU and trnG-UCC genes, and the smaller between the trnE-UUC and trnT-GGU genes. In addition to these large inversions, another smaller inversion of 3.3 kb is located within the larger inversion, between the trnC-GCA and rpoB genes. These rearrangements are assumed to have originated in the late Eocene (36–42 My BP) and are commonly reported for Asteraceae (Jansen and Palmer, 1987; Kim et al., 2005). They are absent in the species of the basal subfamily Barnadesioideae as assessed by restriction endonuclease digestions (Jansen and Palmer, 1987). This interesting finding should be confirmed by future research, since complete plastid genome data is currently missing for this group. To further understand the level of the microstructural events, we performed a pairwise comparison of the Asteraceae plastomes with the sequenced A. trifida, revealing a high number of substitutions compared to insertions and deletions. Interestingly, we observed a large number of inversions compared with Parthenium argentatum A. Gray and Ageratina adenophora (Spreng.) King & H. RobAs in other angiosperms, the coding regions of Asteraceae are more conservative than the non-coding regions; rpoC1 is the most divergent of all the genes. The invasive weed Ageratina adenophora rpoC1 contains two introns, while only one intron is found in the plastid genomes of other Asteraceae (Nie et al., 2012). Our LASTZ dot-plot comparison with the complete plastome of Saussurea involucrata Matsum. & Koidz. showed rearrangement patterns (Figure 2 and Supplementary Figure S1) caused by a large shift (approximately 93 kb) in residue numbering that caused problems during de-circularization of the genome. The published genome of this species (Xie et al., 2015) also contains numerous annotation errors with several key genes unannotated (for example, rpl16 exon 2, rps12, and truncated rps19). We found that SSC regions showed signatures of inversions (Figure 2), which is consistent with previous reports (Liu et al., 2013; Walker et al., 2014; Gruenstaeudl et al., 2017). These major spots of inversions have also been reported previously in the plastomes of several plant lineages (Schwarz et al., 2015; Hsu et al., 2016; Graham et al., 2017; Sinn et al., 2018), It is important to note that the SSC region occurs in two inversion isomers that exist in equimolar proportions in the same individual, which are identical in sequence but different in orientation (Palmer, 1983). Presently, whole plastid genome sequences are uploaded in GenBank without preference for the orientation of the SSC region, which appears as a genome structural inversion but is truly just chloroplast heteroplasmy (Walker et al., 2015). Many plastid genome studies use reference-guided mapping, where the orientation of the SSC is copied along novel genomes. For example, SSC regions occur in inverted orientations in Solanaceae compared to Asteraceae (Salih et al., 2017). Another good example in Asteraceae is the plastid genome of Lactuca sativa L., which has been published twice independently (NC007578 and DQ383816). Besides minor polymorphisms among these genomes, the major difference is the orientation of the SSC, which appears to be inverted. Due to the rapid increase in amount of de novo assembled plastid genomes that are often deposited independently and parallel to each other, this phenomenon should not be overlooked, and SSC orientations should not be regarded as diversity hot-spots.

FIGURE 2.

FIGURE 2

Segmental synteny and conservation of Ambrosia trifida across Asteraceae plastomes. LASTz alignments were filtered to keep only colored tracks showing > 70% alignment coverage, with the colored ribbons are species-specific. The inner tract displays the corresponding gene annotations of A. trifida, with the corresponding ribbons showing the inversions and segmental rearrangement across the gene regions.

Inverted Repeat (IR) Diversity

Most of the length variation in angiosperms is attributed to expansion or contraction of the IR regions. It is rare that this variation is due to gene losses, which could vary within a single genus (Sloan et al., 2014). To obtain comprehensive insight into the IR regions of the Asteraceae, a survey of over 40 species was performed using a recently published tool IRscope (Amiryousefi et al., 2018a). This revealed the placement of rps19 in the JLB (LSC/IRb) in almost all cases (Figure 3 and Supplementary Figure S2). The inversion of the SSC region, and hence its reversed gene annotation, is a main distinguishing factor in the visual representation of the species. The fixation of the extended ycf1 in the JSB (IRb/SSC) was confirmed for the majority of the species (approximately 80%). For the same set of species, we observed a pseudo ycf1 gene often tangental to the JSA (SSC/IRa) on the IRa side and the ndhF gene near the JSA on the SSC side. In some species, the SSC occurred in the reverse orientation and with ycf1 extending on the JSA site and a pseudo ycf1 and ndhF plotted on the JSB. The JLA has the rps19 and trnH genes in its vicinity, with psbA further in the LSC region in almost all cases. The pseudo rps19 gene was unannotated in the original files of the first four species in Figure 3. Hence, we obtained the corresponding annotations using GeSeq and inverted the SSC region with IRscope for the first five species to improve visual comparison. Unlike the others, the new annotation of A. artemisiifolia confirmed the existence of the rps19 and its corresponding pseudo fragment on the LSC adjacent to IRb and IRa, respectively (Figure 3). Non-identical IR features were observed for A. artemisiifolia (NC035875), where the IRb contains a single C base insertion near rpl2 (84,242 bp). The trnH gene was also assigned to the opposite DNA strand, which we have manually corrected for Figure 3. After these adjustments, we obtained a corrected size of 25,086 bp for the IRs (cf. Nagy et al., 2017).

FIGURE 3.

FIGURE 3

Inverted repeat plot of Ambrosia trifida. rps19 gene on the LSC/IRb regions with corresponding 184/95 bp. The positive strand synteny of the rpl2, rps19, and rpl22 on the IRb, JLB, and LSC, respectively, is also confirmed. In accordance with the other selected nine Asteraceae, the JSB of A. trifida is hosting the extended functional version of the ycf1 gene, while the pseudo version of this gene beside ndhF is placed near the JSA site. Another copy of the rps19 and trnH, along rpl2 are gathered near the JLA.

The conservation of the length of the IRs, SSC, and LSC is another interesting point. These regions were between 24 to 26, 18 to 19, and 82 to 84 kb, respectively. The maximum length of plastid genomes is 153,014 bp for Conyza bonariensis (L.) Cronquist, while the shortest length of 149, 510 bp was found in Aster spathulifolius Maxim. As in other Asteraceae, the inverted repeat plot of the A. trifida showed the rps19 in the LSC/IRb regions with a corresponding 184/95 bp. The synteny of the rpl2, rps19, and rpl22 (for IRb, JLB, and LSC, respectively) of the positive strand was also confirmed. As in the majority of other angiosperms, the JSB of A. trifida has the extended functional version of the ycf1 gene while the pseudo version of this gene beside ndhF is placed near the JSA site. Another copy of the genes rps19, trnH and rpl2 can be found near the JLA.

Comparative Phylogenomics

Asteraceae is one of the largest families of flowering plants, with approximately 1,500 genera and 23,000 species (Kumar et al., 2009). While several studies have been conducted to resolve the phylogeny (Denda et al., 1999; Panéro and Funk, 2002, 2008; Panéro et al., 2014), many questions remain open. Plastome sequences can now be easily acquired for phylogenomic analyses at relatively low costs, thus providing rich sources of phylogenetic information. To assess the phylogenetic position of A. trifida relative to the previously sequenced Asteraceae species, we performed maximum likelihood (Figure 4) and parsimony analysis. The latter resulted in six equally parsimonious trees with length of 7,297 steps, with consistency index (Kluge and Farris, 1969) CI 0.65 and retention index (Farris, 1989) RI 0.83 (Supplementary Figure S3). The tree is congruent with that obtained by using maximum likelihood as an optimality criterion, and no bias in compositional heterogeneity was detected in the sequences (Supplementary Figure S4).

FIGURE 4.

FIGURE 4

Phylogenetic tree (left in cladogram style, right with branch lengths) inferred from plastid genome data. Maximum likelihood phylogeny illustrating the phylogenetic relationships of Asteraceae based on fifty protein-coding genes. Species of the genus Amrbosia are highlighted in red. Branch lengths are proportional to the number of substitutions, while numbers represent bootstrap support values for each node. The given scale represents substitutions per site.

Historically, Asteraceae were divided into two large subfamilies (Asteroideae and Cichorioideae) and 13 tribes (Bentham, 1873). Major changes in classification have been made during the past decade, resulting in a better phylogenetic framework. Based on the analyses of chloroplast DNA markers (Panéro and Funk, 2002, 2008; Panéro et al., 2014), 13 major clades (subfamilies) were identified in Asteraceae. From these subfamilies, our study included representatives of Carduoideae, Cichorioideae and Asteroideae with complete plastid genome sequences available from public databases. The Asteroideae subfamily was sister to Cichorioideae, including Lactuca sativa L. and Taraxacum officinale (L.) Weber ex F. H. Wigg of the tribe Cichorieae. Taxa within the subfamily Carduoideae were represented by members of the tribe Cynareae, and formed two clades. The first was composed of Carthamus tinctorius L. and Centaurea diffusa Lam. The second included Cynara baetica (Spreng.) Pau and Silybum marianum (L.) Gaertn. as sisters, while Saussurea involucrata Matsum. & Koidz. were resolved in a basal position. This phylogeny obtained for the Cynareae is consistent with the results of a more detailed molecular analysis (Barres et al., 2013) including larger number of terminals. As expected, our tree resolved three supertribes Asteroideae, Helianthidae, and Senecionodae within the subfamily Asteroideae. Jacobea vulgaris Gaertn., together with Pericallis hybrida (Willd.) R. Nordenstam, grouped in the supertribe Senecionodae. In the Asterodae, all three sampled tribes of Anthemidae, Gnaphalieae, and Astereae were distinct with high support values compared to previous plastid studies where tribes of Asteroideae were not resolved (Curci et al., 2015). Helianthus annuus L, A. artemisiifolia, and A. trifida grouped in the Heliantheae alliance indicating a group with high support values within the supertribe Helianthodae, and separate of Millerideae (Galinsoga quadriradiata Ruiz & Pav., Guizotia abyssinica) and Eupatorieae (Ageratina adenophora [Spreng.] King & H. Rob.; Mikania micrantha Kunth). Deeper level relationships within the alliance Heliantheae were unresolved. Whole chloroplast genome sequences were unavailable for the cockleburs (Xanthium L.) thus we were unable to provide support for the previously reported sister genus relationship to Ambrosia (Miao et al., 1995). Both genera are monoecious with pistilate florets surrounded by woody involucres, a morphological trait that has been assumed to indicate their close relationship (Peterson and Payne, 1973).

Comparative Transcriptional Profiles Against Glyphosate Resistance

As a first contribution toward deciphering the complete genetic information of giant ragweed, we determined the complete sequence of its plastome. Since assembling plastid genomes from herbarium specimens is possible, the complete sequences can be used in further applications. The analysis presented here is an example of the use of herbarium genomics in other fields like weed research where, besides the biology of ragweeds, their control with herbicides is a major area of focus for research. Herbicides have a pivotal role in controlling weeds and sustaining food security by restricting weeds while being as harmless as possible for crops (Duhoux et al., 2017). Reaching this specific goal with various mechanisms of action, herbicides induce changes in gene expression to alter or terminate plant physiological pathways. Knowledge about the regulation of plastid gene expression in response to herbicide treatments is central to our understanding of photosynthesis and other plastid-localized metabolic pathways associated with herbicide resistance. To date, the molecular basis of herbicide resistance has been largely investigated by single-gene sequencing to identify single-point mutations in the target site of the herbicide. More recently, second-generation sequencing technologies have also enabled transcriptomic approaches (e.g., RNA-seq) to identify candidate genes underlying more complex NTSR mechanisms, such as herbicide metabolism and translocation (Ravet et al., 2018). Weed genomics offers the promise to go beyond transcriptomics and provide further novel insights into the biological processes such as NTSR.

Despite its small size, the transcriptional apparatus of plastids is relatively divergent at the expression level compared to the nuclear genome, which is the first line in transcriptional-mediated responses pertaining to abiotic and biotic stress including herbicide resistance. These transcriptional events, as defined by the variations observed in the transcriptional expression levels, play an important role in understanding plastid metabolism at the cellular and energetic levels by altering the transport of solutes across the membrane, or by regulating their sequestration through membrane trafficking (Sammons and Gaines, 2014). One such example is glyphosate, which is sequestered by a transport mechanism and inhibits the shikimate pathway in the chloroplast (Sammons and Gaines, 2014). Plastid gene expression and its regulation has been intensely studied in chloroplasts (see the review of Leister et al., 2017). By contrast, knowledge of gene expression under herbicide treatment in plastids is still very limited. Since a large fraction of plastid protein-coding genes is involved in photosynthesis, it is generally believed that plastid gene expression is affected in plant tissues treated with herbicides. Until now, limited knowledge has been accumulated to understand the transcriptional events related to glyphosate resistance, although it has been used globally to control noxious weeds such as A. trifida (Sammons and Gaines, 2014). As previously mentioned, since glyphosate sequestration and harvesting through the chloroplast mediated shikimate pathway play an important role in herbicide resistance, we compared the transcriptional events in the plastid genome of A. trifida. For this, we used RNA-seq reads previously deposited in the Sequence Read Archive (Supplementary Table S2) gathered from total plant cell transcriptomes, which capture both primary and processed mRNA sequences of the plastome. Therefore, we first isolated the plastid genome data from the total transcriptomes of glyphosate resistant and sensitive A. trifida plants sequenced across four different time points – 0, 3, 8, and 12 h after spraying with the herbicide. In particular, we observed that exonic divergence (change in the patterns of the expression levels of exons across the sensitive and the resistant time points) to be more pronounced compared to the intronic divergence (change in the patterns of the expression levels of introns across the sensitive and the resistant time points; Figure 5 and Supplementary Table S5). Considering the divergence of the glyphosate uptake, a higher expression of plastid genes was found in resistant as compared to the sensitive conditions. As a first line of defense against any abiotic stress, the primary focus of the gene expression alternation at the nuclear level is mainly pertaining to the genes involved in photosynthetic responses, heat shock or those involved in the membrane transport. We observed a higher expression of the tRNA genes across all of the time points, which is conserved across all of the embryophytes and suggests that the higher expression of the relative tRNAs is required for rapid transcription activity, which might be required to circumvent the herbicide-mediated changes in the membrane transport of the generated ATPs across the NADPH complex. Additionally, it might indicate that the expression of the SIG2, responsible for the transcription of tRNA genes, is pre-dominant as compared to the SIG6, which is mostly associated with the photosynthetic genes (Figure 6).

FIGURE 5.

FIGURE 5

Violin plots showing the exonic and intronic RPKM distribution across the sensitive and resistant glyphosate ragweeds. Time points indicate sample collection: 0, 3, 8, and 12 h after herbicide application.

FIGURE 6.

FIGURE 6

Circos plot visualization of the transcriptional mapping of Ambrosia trifida. Single-base resolution of the mapped RNA-seq reads across the A. trifida plastome with represented gene annotations. Track ribbons order according to the following SRA accession number: SRR1661420, SRR1661463, SRR1661464, SRR1661465, SRR1661466, SRR1661467, SRR1661468 and SRR1661469.

Furthermore, A. trifida had relatively low expression levels, which might hint toward the uptake of the glyphosate and the rate of disruption of the photosynthetic apparatus and the metabolic pathways associated with plastid genes. It has been widely demonstrated that glyphosate also affects growth by contaminating the environment and accumulating in plant organs, and can be a major limiting factor for agricultural productivity (Fartyal et al., 2018). According to our observations, plastid intronic expression is higher in glyphosate-resistant giant ragweed at all sequenced time points compared to the sensitive time points after the herbicide treatment (Supplementary Table S5). The expression of introns associated with ycf3 increased during the 8 and 12 h time points in sensitive A. trifida, which might present a defense mechanism to combat glyphosate (Figure 6). Relative expression divergence among the sensitive and resistant time points for the photosystem I (PSI) assembly protein ycf3 revealed an up-regulation in the resistant phenotype compared to the sensitive phenotype. This clearly reflects the role of glyphosate as an interfering herbicide in plant growth through the inhibition of the photosynthetic complex (Fartyal et al., 2018). Higher expression of ribosomal-associated genes has also been observed in sensitive time points compared to the glyphosate resistant time points, which might reflect the higher activity rates of the translation and ribosomal machinery required for the efficient translation of plastid genes during glyphosate resistance. We also observed down-regulation of atpF, which is an H+ATPase (CF0 subunit of the CF0CF1) and a critical component for energy production also associated with the down-regulation of ndhA and ndhB. Members of the NADH-like dehydrogenases play an important role in the PSI cyclic electron flow (Suorsa et al., 2009). Overall, these findings suggest that glyphosate has immediate effects on the photo-accumulation events and thus alters the transport and the energy production in plastid genes. Similar disruptions were reported in mitochondrial ATP influx-outflux during glyphosate resistance (Gomes et al., 2017).

Conclusion

The inclusion of historical museum specimens in phylogenomic analyses of biodiversity provides new possibilities for both fundamental and applied plant biology research. Our study is a good example of how herbarium specimens can be used to investigate phylogeny and genomic patterns of herbicide resistance. However, it should be kept in mind that museomics are limited by the amount of plant material, and disruptive sampling should be cautiously carried out with such specimens. Herbarium genomics of weeds could also improve our understanding of resistance to herbicides. As demonstrated in our study, plastid genomes reconstructed from herbarium specimens, coupled with transcriptomic resources can also be used to investigate herbicide resistance with potential for further applications in weed management.

Author Contributions

PP conceived the study, performed the sampling, laboratory experiments, genome assembly, annotation, and data curation, and prepared the original draft of the manuscript. GS conceived the methodologies and performed the further analyses. JH and AA performed parsimony and inverted repeat analyses, respectively. JH, XH, and PP carried out the project administration and provided resources for the work. PP and JH supervised the work. GS, AA, and PP performed the validation and visualization of the work. All authors reviewed and edited the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank the staff and colleagues of the Viikki Biocenter who kindly contributed reagents and materials for our study. We thank CSC – IT Center for Sciences, Espoo (Finland) for providing computational servers for the data analysis. We also thank Derek Ho and Jacquelin DeFaveri for critical reading of the manuscript and valuable comments on an earlier version.

Footnotes

Funding. This study was funded by the Systematics Research Fund (United Kingdom), the Finnish Museum of Natural History Luomus Trigger Funds (PP), and the Academy of Finland (Grant No. 295595 to JH and XH).

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2019.00218/full#supplementary-material

FIGURE S1

LASTZ dot-plot comparison among Asteraceae complete plastid genome sequences.

FIGURE S2

Inverted repeat plot of 30 Asteraceae chloroplast genomes.

FIGURE S3

Phylogenetic tree of Asteraceae based on the analysis of fifty protein coding genes with parsimony used as an optimality criterion.

FIGURE S4

Evaluation of the skewness and compositional heterogeneity across the combined and partition-specific variations of the long genes ndhD, ndhH, psaA, psaB, psbA, psbB, psbC, and rbcL.

TABLE S1

List of the sequenced plastomes used for the phylogenomic analysis.

TABLE S2

Summary of the aligned transcriptomics reads.

TABLE S3

List of genes in the chloroplast genome of Ambrosia trifida.

TABLE S4

Genes having an intron in the Ambrosia trifida plastid genome, and the length of the exons and introns.

TABLE S5

Intronic reads per kb per million mapped (RPKM) across the herbicide treatments in the plastid genome of giant ragweed (Ambrosia trifida).

References

  1. Abul-fatih H. A., Bazzaz F. A. (1979). Biology of Ambrosia trifida L. 2. Germination, emergence, growth and survival. New Phytol. 83 817–827. 10.1111/j.1469-8137.1979.tb02313.x [DOI] [Google Scholar]
  2. Amiryousefi A., Hyvönen J., Poczai P. (2017). The plastid genome sequence of the invasive plant common ragweed (Ambrosia artemisiifolia, Asteraceae). Mitochondrial DNA B 2 753–754. 10.1080/23802359.2017.1390423 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Amiryousefi A., Hyvönen J., Poczai P. (2018a). IRscope: an online program to visualize the junction sites of chloroplast genomes. Bioinformatics 34 3030–3031. 10.1093/bioinformatics/bty220 [DOI] [PubMed] [Google Scholar]
  4. Amiryousefi A., Hyvönen J., Poczai P. (2018b). The chloroplast genome sequence of bittersweet (Solanum dulcamara): plastid genome structure evolution in Solanaceae. PLoS One 13:e0196069. 10.1371/journal.pone.0196069 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Angiosperm Phylogeny Group (2016). An updated of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Bot. J. Linn. Soc. 181 1–20. 10.1016/j.jep.2015.05.035 [DOI] [PubMed] [Google Scholar]
  6. Aubriot X., Knapp S., Syfert M. M., Poczai P., Buerki S. (2018). Shedding new light on the origin and spread of the brinjal eggplant (Solanum melongena L.) and its wild relatives. Am. J. Bot. 105 1175–1187. 10.1002/ajb2.1133 [DOI] [PubMed] [Google Scholar]
  7. Bakker F. T. (2017). Herbarium genomics: skimming and plastomics from archival specimens. Webbia 72 35–45. 10.1080/00837792.2017.1313383 [DOI] [Google Scholar]
  8. Bakker F. T., Lei D., Yu J., Mohammadin S., Wei Z., van de Kerke S., et al. (2015). Herbarium genomics: plastome sequence assembly from a range of herbarium specimens using an iterative organelle genome assembly pipeline. Biol. J. Linn. Soc. 117 33–43. 10.1111/bij.12642 [DOI] [Google Scholar]
  9. Barres L., Sanmartín I., Anderson C. L., Susanna A., Buerki S., Galbany-Casals M., et al. (2013). Reconstructing the evolution and biogeographic history of tribe Cardueae (Compositae). Am. J. Bot. 100 867–882. 10.3732/ajb.1200058 [DOI] [PubMed] [Google Scholar]
  10. Beck J. B., Semple J. C. (2015). Next-generation sampling: pairing genomics with herbarium specimens provides species-level signal in Solidago (Asteraceae). Appl. Plant Sci. 3:1500014. 10.3732/apps.1500014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Beier S., Thiel T., Münch T., Scholz U., Mascher M. (2017). MISA-web: a web server for microsatellite prediction. Bioinformatics 33 2583–2585. 10.1093/bioinformatics/btx198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bentham G. (1873). “Compositae,” in Genera Plantarum Vol. 2 eds Bentham G., Hooker J. D. (London: Reeve; ), 163–533. [Google Scholar]
  13. Besnard G., Christin P.-A., Malé P.-J. G., Lhuillier E., Lauzeral C., Coissac E., et al. (2014). From museums to genomics: old herbarium specimens shed light on C3 to C4 transition. J. Exp. Bot. 65 6711–6721. 10.1093/jxb/eru395 [DOI] [PubMed] [Google Scholar]
  14. Bolger A. M., Lohse M., Usadel B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30 2114–2120. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Bracamonte E., da Silveira H. M., Alcántara-de la Cruz R., Domínguez-Valenzuela J. A., Cruz-Hipolito H. E., De Prado R. (2018). From tolerance to resistance: mechanisms governing the differential response to glyphosate in Chloris barbata. Pest Manag. Sci. 74 1118–1124. 10.1002/ps.4874 [DOI] [PubMed] [Google Scholar]
  16. Busi R., Gaines T. A., Walsh M. J., Powels S. B. (2012). Understanding the potential for resistance evolution to the new pyroxasulfone: field selection at high doses versus recurrent selection at low doses. Weed Res. 52 489–499. 10.1111/j.1365-3180.2012.00948.x [DOI] [Google Scholar]
  17. Capella-Gutiérrez S., Silla-Martínez J. M., Gabaldón T. (2009). trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25 1972–1973. 10.1093/bioinformatics/btp348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Castandet B., Hotto A. M., Strickler S. R., Stern D. B. (2016). ChloroSeq, an optimized chloroplast RNA-Seq bioinformatic pipeline, reveals remodeling of the organellar transcriptome under heat stress. G3. 6 2817–2827. 10.1534/g3.116.030783 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chauvel B., Dessaint F., Cardinal-Legrand C., Bretagnolle F. (2006). The historical spread of Ambrosia artemisiifolia L. in France from herbarium records. J. Biogeogr. 33 665–673. 10.1111/j.1365-2699.2005.01401.x [DOI] [Google Scholar]
  20. Chun Y. J., Fumanal B., Laitung B., Bretagnolle F. (2010). Gene flow and population admixture as the primary post-invasion process in common ragweed (Ambrosia artemisiifolia) populations in France. New Phytol. 185 1100–1107. 10.1111/j.1469-8137.2009.03129.x [DOI] [PubMed] [Google Scholar]
  21. Cosgrove D. J. (1999). Enzymes and other agents that enhance cell wall extensibility. Ann. Rev. Plant Phys. Plant. Mol. Biol. 50 391–417. 10.1146/annurev.arplant.50.1.391 [DOI] [PubMed] [Google Scholar]
  22. Cowborough M. J., Brown R. B., Tardif F. J. (2003). Impact of common ragweed (Ambrosia artemisiifolia) aggregation on economic thresholds in soybean. Weed Sci. 51 947–954. 10.1614/02-036 [DOI] [Google Scholar]
  23. Cseh A. (2010). The Molecular Genetic Analysis of the Most Important Weed Biological Characteristics and Health Effects of Common Ragweed. Ph.D. thesis, University of Pannonia, Hungary. [Google Scholar]
  24. Cseh A., Cernák I., Taller J. (2009). Molecular characterization of atrazine resistance in common ragweed (Ambrosia artemisiifolia L.). J. Appl. Genet. 50 321–327. 10.1007/BF03195690 [DOI] [PubMed] [Google Scholar]
  25. Cseh A., Taller J. (2008). Genetic diversity of ragweed (Ambrosia artemisiifolia L.) a comparison of maternally inherited cpDNA and mtDNA. J. Plant Dis. Prot. 21 389–394. [Google Scholar]
  26. Csontos P., Vitalos M., Barina Z., Kiss L. (2010). Early distribution and spread of Ambrosia artemisiifolia in Central and Eastern Europe. Bot. Helv. 120 75–78. 10.1007/s00035-010-0072-2 [DOI] [Google Scholar]
  27. Cummins I., Wortley D. J., Sabbadin F., He Z., Coxon C. R., Straker H. E., et al. (2013). Key role for glutathione transferase in multiple-herbicide resistance in grass weeds. Proc. Natl. Acad. Sci. U.S.A. 110 5812–5817. 10.1073/pnas.1221179110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Curci P. L., De Paola D., Danzi D., Vendramin G. G., Sonnante G. (2015). Complete chloroplast genome of the multifunctional crop globe artichoke and comparison with other Asteraceae. PLoS One 10:e0120589. 10.1371/journal.pone.0120589 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Daniell H., Lin C.-S., Yu M., Chang W.-J. (2016). Chloroplast genomes: diversity, evolution, and applications in genetic engineering. Genome Biol. 17:134. 10.1186/s13059-016-1004-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Délye C. (2013). Unravelling the genetic bases of non-target-site based resistance (NTSR) to herbicides: a major challenge for weed science in the forthcoming decade. Pest Manag. Sci. 69 176–187. 10.1002/ps.3318 [DOI] [PubMed] [Google Scholar]
  31. Délye C., Jasieniuk M., Le Corre V. (2013). Deciphering the evolution of herbicide resistance in weeds. Trend. Genet. 29 649–658. 10.1016/j.tig.2013.06.001 [DOI] [PubMed] [Google Scholar]
  32. Denda T., Watanabe K., Kosuge K., Yahara T., Ito M. (1999). Molecular phylogeny of Brachycome (Asteraceae). Plant Syst. Evol. 217 299–311. 10.1007/BF00984372 [DOI] [Google Scholar]
  33. Dierckxsens N., Mardulyn P., Smits G. (2017). NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucl. Acid Res. 45:e18. 10.1093/nar/gkw955 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Duhoux A., Pernin F., Desserre D., Délye C. (2017). Herbicide safeners decrease sensitivity to herbicides inhibiting acetolactate-synthase and likely activate non-target-site-based resistance pathways in the major grass weed Lolium sp. (Rye-grass). Front. Plant Sci. 8:1310. 10.3389/fpls.2017.01310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Duke S. O., Powles S. B. (2008). Glyphosate: a once-in-a-century herbicide. Pest Manag. Sci. 64 319–325. 10.1002/ps.1518 [DOI] [PubMed] [Google Scholar]
  36. Farris J. S. (1989). The retention index and homoplasy excess. Syst. Zool. 38 406–407. 10.1016/j.jtbi.2014.10.033 [DOI] [PubMed] [Google Scholar]
  37. Fartyal D., Agarwal A., James D., Borphukan B., Ram B., Sheri V., et al. (2018). Co-expression of P173S mutant rice EPSPS and igrA genes results in higher glyphosate tolerance in transgenic rice. Front. Plant Sci. 9:144. 10.3389/fpls.2018.00144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Ganie Z. A., Lindquist J. L., Jugulam M., Kruger G. R., Marx D. B., Jhala A. J. (2017). An integrated approach to control glyphosate-resistant Ambrosia trifida with tillage and herbicides in glyphosate-resistant maize. Weed Res. 57 112–122. 10.1111/wre.12244 [DOI] [Google Scholar]
  39. Gaudeul M., Giraud T., Kiss L., Shykoff J. A. (2011). Nuclear and chloroplast microsatellites show multiple introductions in the worldwide invasion history of common ragweed, Ambrosia aretemisiifolia. PLoS One 6:e17658. 10.1371/journal.pone.0017658 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Genton B. J., Jonot O., Thévenet D., Fournier E., Blatrix R., Vautrin D., et al. (2005a). Isolation of five polymorphic microsatellite loci in the invasive weed Ambrosia artemisiifolia (Asteraceae) using an enrichment protocol. Mol. Ecol. Res. 5 381–383. 10.1111/j.1471-8286.2005.00934.x [DOI] [Google Scholar]
  41. Genton B. J., Shykoff J. A., Giraud T. (2005b). High genetic diversity in French invasive populations of common ragweed, Ambrosia artemisiifolia, as a result of multiple sources of introduction. Mol. Ecol. 14 4275–4285. [DOI] [PubMed] [Google Scholar]
  42. Gladieux P., Giraud T., Kiss L., Genton B. J., Jonot O., Shykoff J. A. (2011). Distinct invasion sources of common ragweed (Ambrosia artemisiifolia) in Eastern and Western Europe. Biol. Inv. 13 933–944. 10.1007/s10530-010-9880-y [DOI] [Google Scholar]
  43. Goloboff P. A. (1994). NONA. Version 2.0. Tucumán, AR: Computer software distributed by the author. [Google Scholar]
  44. Gomes M. P., da Silva Cruz F. V., Bicalho E. M., Borges F. V., Fonseca M. B., Juneau P., et al. (2017). Effects of glyphosate acid and the glyphosate-commercial formulation (Roundup) on Dimorphandra wilsonii seed germination: interference of seed respiratory metabolism. Environ. Pollut. 220 452–459. 10.1016/j.envpol.2016.09.087 [DOI] [PubMed] [Google Scholar]
  45. Graham S. W., Lam V. K. Y., Merckx V. S. F. T. (2017). Plastomes on the edge: the evolutionary breakdown of mycoheterotroph plastid genomes. New Phytol. 214 48–55. 10.1111/nph.14398 [DOI] [PubMed] [Google Scholar]
  46. Green J. M., Owen M. D. K. (2011). Herbicide-resistant crops: utilities and limitations for herbicide-resistant weed management. J. Agric. Food Chem. 59 5819–5829. 10.1021/jf101286h [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Gruenstaeudl M., Nauheimer L., Borsch T. (2017). Plastid genome structure and phylogenomics of Nymphaeales: conserved gene order and new insights into relationships. Plant Syst. Evol. 303 1251–1270. 10.1007/s00606-017-1436-5 [DOI] [Google Scholar]
  48. Gutaker R. M., Burbano H. A. (2017). Reinforcing plant evolutionary genomics using ancient DNA. Curr. Opin. Plant Biol. 36 38–45. 10.1016/j.pbi.2017.01.002 [DOI] [PubMed] [Google Scholar]
  49. Gutaker R. M., Reiter E., Furtwängler A., Schuenemann V. J., Burbano H. A. (2017). Extraction of ultrashort DNA molecules from herbarium specimens. Biotechniques 62 76–79. 10.2144/000114517 [DOI] [PubMed] [Google Scholar]
  50. Harris R. S. (2007). Improved Pairwise Alignment of Genomic DNA. Ph.D. thesis, Pennsylvania State University, University Park, PA. [Google Scholar]
  51. Hausner G., Olson R., Simon D., Johnosn I., Sanders E. R., Karol K. G., et al. (2006). Origin and evolution of the chloroplast trnK (matK) intron: a model for evolution of group II intron RNA strucutres. Mol. Biol. Evol. 23 380–391. 10.1093/molbev/msj047 [DOI] [PubMed] [Google Scholar]
  52. Heap I. (2018). The International Survey of Herbicide Resistant Weeds. Available at: http://www.weedscience.org/Summary/home.aspx [accessed March 7, 2018]. [Google Scholar]
  53. Hoang D. T., Chernomor O., von Haeseler A., Minh B. Q., Vinh L. S. (2018). UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35 518–522. 10.1093/molbev/msx281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Hsu C. Y., Wu C. S., Chaw S. M. (2016). Birth of four chimeric plastid gene clusters in Japanese umbrella pine. Genome Biol. Evol. 8 1776–1784. 10.1093/gbe/evw109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. International Service for the Acquisition of Agri-biotech Applications [ISAAA] (2016). Global status of commercialized biotech/GM crops. ISAAA Breif No. 52. Ithaca, NY: ISAAA. [Google Scholar]
  56. Jansen R. K., Palmer J. D. (1987). A chloroplast DNA inversion marks an ancient evolutionary split in the sunflower family (Asteraceae). Proc. Natl. Acad. Sci. U.S.A. 84 5818–5822. 10.1073/pnas.84.16.5818 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Kalyaanamoorthy S., Minh B. Q., Wong T. K. F., von Haeseler A., Jermiin L. S. (2017). ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14 587–589. 10.1038/nmeth.4285 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Kazinczi G., Beres I., Pathy Z., Novak R. (2008). Common ragweed (Ambrosia artemisiifolia L.): a review with special regards to the results in Hungary: II. Importance and harmful effect, allergy, habitat, allelopathy and beneficial characteristics. Herbologia 9 93–118. [Google Scholar]
  59. Kearse M., Moir R., Wilson A., Stones-Havas S., Cheung M., Sturrock S., et al. (2012). Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28 1647–1649. 10.1093/bioinformatics/bts199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Kettunen M., Genovesi P., Gollasch S., Pagad S., Starfinger U., ten Brink P., et al. (2009). Technical support to EU strategy on invasive alien species (IAS) - Assessment of the impacts of IAS in Europe and the EU. Brussels: Institute for European Environmental Policy. [Google Scholar]
  61. Kim D., Pertea G., Trapnell C., Pimentel H., Kelley R., Salzberg S. L. (2013). TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 14:R36. 10.1186/gb-2013-14-4-r36 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Kim K.-L., Choi K.-S., Jansen R. K. (2005). Two chloroplast DNA inversions originated simultaneously during the early evolution of the sunflower family (Asteraceae). Mol. Biol. Evol. 22 1783–1792. 10.1093/molbev/msi174 [DOI] [PubMed] [Google Scholar]
  63. Kluge A. G., Farris J. S. (1969). Quantitative phyletics and the evolution of Anurans. Syst. Zool. 18 1–32. 10.2307/2412407 [DOI] [Google Scholar]
  64. Kong C.-H., Wang P., Xu X.-X. (2007). Allelopathic interference of Ambrosia trifida with wheat (Triticum aestivum). Agr. Ecosyst. Environ. 119 416–420. 10.1016/j.agee.2006.07.014 [DOI] [Google Scholar]
  65. Krzywinski M., Schein J., Birol I., Connors J., Gascoyne R., Horsman D., et al. (2009). Circos: an information aesthetic for comparative genomics. Genome Res. 19 1639–1645. 10.1101/gr.092759.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Kuang D. Y., Wu H., Wang Y. L., Gao L. M., Zhang S. Z., Lu L. (2011). Complete chloroplast genome sequence of Magnolia kwangsiensis (Magnoliaceae): implication for DNA barcoding and population genetics. Genome 54 663–673. 10.1139/G11-026 [DOI] [PubMed] [Google Scholar]
  67. Kumar S., Hahn F. H., McMahan C. M., Cornish K., Whalen M. C. (2009). Comparative analysis of the complete sequence of the plastid genome of Parthenium argentatum and identification of DNA barcodes to differentiate Parthenium species and lines. BMC Plant Biol. 9:131. 10.1186/1471-2229-9-131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Kurtz S., Choudhuri J. V., Ohlebusch E., Schleiermacher C., Stoye J., Giegerich R., et al. (2001). REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 29 4633–4642. 10.1093/nar/29.22.4633 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Kurtz S., Phillippy A., Delcher A. L., Smoot M., Shumway M., Antonescu C., et al. (2004). Versatile and open software for comparing large genomes. Genome Biol. 5:R12. 10.1186/gb-2004-5-2-r12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Leister D., Wang L., Kleine T. (2017). Organellar gene expression and acclimation of plants to environmental stress. Front. Plant. Sci. 8:387. 10.3389/fpls.2017.00387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., et al. (2009). The Sequence Alignment/Map format and SAMtools. Bioinformatics 25 2078–2079. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Liu Y., Huo N., Dong L., Wang Y., Zhang S., Young H. A., et al. (2013). Complete chloroplast genome sequences of Mongolia medicine Artemisia frigida and phylogenetic relationships with other plants. PLoS One 8:e57533. 10.1371/journal.pone.0057533 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Lohse M., Drechsel O., Bock R. (2007). OrganellarGenomeDRAW (OGDRAW) – a tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 52 267–274. 10.1007/s00294-007-0161-y [DOI] [PubMed] [Google Scholar]
  74. Martin M. D., Chamecki M., Brush G. S. (2010). Anthesis synchronization and floral morphology determine diurnal patterns of ragweed pollen dispersal. Agric. For. Meteorol. 150 1307–1317. 10.1016/j.agrformet.2010.06.001 [DOI] [Google Scholar]
  75. Martin M. D., Chamecki M., Brush G. S., Meneveau C., Parlange M. B. (2009). Pollen clumping and wind dispersal in an invasive angiosperm. Am. J. Bot. 96 1703–1711. 10.3732/ajb.0800407 [DOI] [PubMed] [Google Scholar]
  76. Martin M. D., Olsen M. T., Samaniego J. A., Zimmer E. A., Gilbert M. T. P. (2016). The population genomic basis of geographic differentiation in North American common ragweed (Ambrosia artemisiifolia L.). Ecol. Evol. 6 3760–3771. 10.1002/ece3.2143 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Martin M. D., Quiroz-Claros E., Brush G. S., Zimmer E. A. (2018). Herbarium collection-based phylogenetics of the ragweeds (Ambrosia. Asteraceae). Mol. Phylogenet. Evol. 120 335–341. 10.1016/j.ympev.2017.12.023 [DOI] [PubMed] [Google Scholar]
  78. Martin M. D., Zimmer E. A., Olsen M. T., Foote A. D., Gilbert M. T. P., Brush G. S. (2014). Herbarium specimens reveal a historical shift in phylogeographic structure of common ragweed during native range disturbance. Mol. Ecol. 23 1701–1716. 10.1111/mec.12675 [DOI] [PubMed] [Google Scholar]
  79. Mátyás K. K., Taller J., Cseh A., Poczai P., Cernák I. (2011). Development of a simple PCR-based assay for the identification of triazine resistance in the noxious plant common ragweed (Ambrosia artemisiifolia) and its applicability in higher plants. Biotechnol. Lett. 33 2509–2515. 10.1007/s10529-011-0714-5 [DOI] [PubMed] [Google Scholar]
  80. Mátyás K. K., Vignesh M., Taller J. (2012). Population genetic analysis of common ragweed (Ambrosia artemisiifolia L.) in the Carpathian-basin. Magy. Gyomkut. Tech. 13 21–36. [Google Scholar]
  81. McManus H. A., Fučíková K., Lewis P. O., Lewis L. A., Karol K. G. (2018). Organellar phylogenomics inform systematics in the green algal family Hydrodictyaceae (Chlorophyceae) and provide clues to the complex evolutionary history of plastid genomes in the green algal tree of life. Am. J. Bot. 105 315–329. 10.1002/ajb2.1066 [DOI] [PubMed] [Google Scholar]
  82. Miao B., Turner B., Simpson B., Mabry T. J. (1995). Chloroplast DNA study of the genera Ambrosia s.l. and Hymenoclea (Asteraceae): systematic implications. Plant Syst. Evol. 194 241–255. 10.1007/BF00982858 [DOI] [Google Scholar]
  83. Michelangeli F. A., Davis J. I, Stevenson D. W. (2003). Phylogenetic relationships among Poaceae and related families as inferred from morphology, inversions in the plastid genome, and sequence data from the mitochondrial and plastid genomes. Am. J. Bot. 90 93–106. 10.3732/ajb.90.1.93 [DOI] [PubMed] [Google Scholar]
  84. Moretti M. L., Van Horn C. R., Robertson R., Segobye K., Weller S. C., Young B. G., et al. (2018). Glyphosate resistance in Ambrosia trifida: Part 2. Rapid response physiology and non-target-site resistance. Pest Manag. Sci. 74 1079–1088. 10.1002/ps.4569 [DOI] [PubMed] [Google Scholar]
  85. Motta E. V. S., Raymann K., Moran N. A. (2018). Glyphosate perturbs the gut microbiota of honey bees. Proc. Natl. Acad. Sci. U.S.A. 115 10305–10310. 10.1073/pnas.1803880115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Nagy E., Hegedûs G., Taller J., Kutasy B., Virág E. (2017). Illumina sequencing of the chloroplast genome of common ragweed (Ambrosia artemisiifolia L.). Data Brief 15 606–611. 10.1016/j.dib.2017.10.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Neve P., Powels S. (2005). Recurrent selection with reduced herbicide rates results in the rapid evolution of herbicide resistance in Lolium rigidum. Theor. Appl. Genet. 110 1154–1166. 10.1007/s00122-005-1947-2 [DOI] [PubMed] [Google Scholar]
  88. Nguyen L. T., Schmidt H. A., von Haeseler A., Minh B. Q. (2015). IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32 268–274. 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Nie X., Lv S., Zhang Y., Du X., Wang L., Biradar S. S., et al. (2012). Complete chloroplast genome sequence of a major invasive species, crofton weed (Ageratina adenophora). PLoS One 7:e36869. 10.1371/journal.pone.0036869 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Nixon K. C. (2002). WinClada ver. 1.00. 08. Ithaca, NY: Published by the Author; Available at: http://www.diversityoflife.org/winclada/ [Google Scholar]
  91. Norsworthy J. K., Riar D., Jha P., Scott R. C. (2011). Confirmation, control, and physiology of glyphosate-resistant giant ragweed (Ambrosia trifida) in Arkansas. Weed Technol. 25 430–435. 10.1614/WT-D-10-00155.1 [DOI] [Google Scholar]
  92. Olmstead R. G., Palmer J. D. (1994). Chloroplast DNA systematic: a review of methods and data analysis. Am. J. Bot. 81 1205–1224. 10.1002/j.1537-2197.1994.tb15615.x [DOI] [Google Scholar]
  93. Palmer J. D. (1983). Chloroplast DNA exists in two orientations. Nature 301 92–93. 10.1038/301092a0 890739 [DOI] [Google Scholar]
  94. Palmer J. D. (1985). Comparative organization of chloroplast genomes. Ann. Rev. Genet. 19 325–354. 10.1146/annurev.ge.19.120185.001545 [DOI] [PubMed] [Google Scholar]
  95. Panéro J. L., Freire S. E., Ariza Espinar L., Crozier B. S., Barboza G. E., Cantero J. J. (2014). Resolution of deep nodes yields an improved backbone phylogeny and a new basal lineage to study early evolution of Asteraceae. Mol. Phylogenet. Evol. 80 43–53. 10.1016/j.ympev.2014.07.012 [DOI] [PubMed] [Google Scholar]
  96. Panéro J. L., Funk V. A. (2002). Toward a phylogenetic subfamilial classification for the Compositae (Asteraceae). Proc. Biol. Soc. Wash. 115 909–922. [Google Scholar]
  97. Panéro J. L., Funk V. A. (2008). The value of sampling anomalous taxa in phylogenetic studies: major clades of the Asteraceae revealed. Mol. Phylogenet. Evol. 47 757–782. 10.1016/j.ympev.2008.02.011 [DOI] [PubMed] [Google Scholar]
  98. Patterson E. L., Pettinga D. J., Ravet K., Neve P., Gaines T. A. (2018). Glyphosate resistance and EPSPS gene duplication: convergent evolution in multiple plant species. J. Hered. 109 117–125. 10.1093/jhered/esx087 [DOI] [PubMed] [Google Scholar]
  99. Payne W. W. (1963). The morphology of the inflorescence of ragweeds (Ambrosia-Franseria). Am. J. Bot. 50 872–880. 10.1002/j.1537-2197.1963.tb06566.x [DOI] [Google Scholar]
  100. Payne W. W. (1964). A re-evaluation of the genus Ambrosia (Compositae). J. Arnold Arboretum 45 401–430. [Google Scholar]
  101. Peng Y., Abercrombie L. L. G., Yuan J. S., Riggins C. W., Sammons R. D., Tranel P. J., et al. (2010). Characterization of the horseweed (Conyza canadensis) transcriptome using GS-FLX 454 pyrosequencing and its application for expression analysis of candidate non-target herbicide resistance genes. Pest. Manag. Sci. 66 1053–1062. 10.1002/ps.2004 [DOI] [PubMed] [Google Scholar]
  102. Peterson K., Payne W. W. (1973). The genus Hymenoclea (Compositae: Ambrosieae). Brittonia 25 243–256. 10.2307/2805586 [DOI] [Google Scholar]
  103. Petit C., Bay G., Pernin F., Délye C. (2010). Prevalence of cross- or multiple resistance to the acetyl-coenzyme A carboxylase inhibitors fenoxaprop, clodinafop and pinoxaden in black-grass (Alopecurus myosuroides Huds.) in France. Pest. Manag. Sci. 66 168–177. 10.1002/ps.1851 [DOI] [PubMed] [Google Scholar]
  104. Poczai P., Hyvönen J. (2013). Discovery of novel plastid phenylalanine (trnF) pseudogenes defines a distinctive clade in Solanaceae. Springerplus 2:459. 10.1186/2193-1801-2-459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Poczai P., Hyvönen J. (2017). The complete chloroplast genome sequence of the CAM epiphyte Spanish moss (Tillandsia usneoides, Bromeliaceae) and its comparative analysis. PLoS One 12:e0187199. 10.1371/journal.pone.0187199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Pyšek P., Jarošík V., Pergl J., Randall R., Chytrý M., Kühn I., et al. (2009). The global invasion success of Central European plants is related to distribution characterisitcs in their native range and species traits. Divers. Distrib. 15 891–903. 10.1111/j.1472-4642.2009.00602.x [DOI] [Google Scholar]
  107. Quinlan A. R., Hall I. M. (2010). BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26 841–842. 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Ranwez V., Harispe S., Delsuc F., Douzery E. J. P. (2011). MACSE: multiple alignment of coding SEquences Accounting for frameshifts and stop codons. PLoS One 6:e22594. 10.1371/journal.pone.0022594 [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Rasmussen K., Thyrring J., Muscarella R., Borchsenius F. (2017). Climate-change-induced range shifts of three allergenic ragweeds (Ambrosia L.) in Europe and their potential impact on human health. PeerJ 5:e3104. 10.7717/peerj.3104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Ravet K., Patterson E. L., Krähmer H., Hamouzová K., Fan L., Jasieniuk M., et al. (2018). The power and potential of genomics in weed biology and management. Pest. Manag. Sci. 74 2216-2225. 10.1002/ps.5048 [DOI] [PubMed] [Google Scholar]
  111. Regnier E. E., Harrison S. K., Loux M. M., Holloman C., Venkatesh R., Diekmann F., et al. (2016). Certified crop advisors’ perceptions of giant ragweed (Ambrosia trifida) distribution, herbicide resistance, and management in the Corn Belt. Weed Sci. 64 361–377. 10.1614/WS-D-15-00116.1 [DOI] [Google Scholar]
  112. Riggins C. W., Peng Y., Stewart C. N., Jr. (2010). Characterization of de novo transcriptome for waterhemp (Amaranthus tuberculatus) using GS-FLX 454 pyrosequencing and its application for studies of herbicide target-site genes. Pest Manag. Sci. 66 1042–1052. 10.1002/ps.2006 [DOI] [PubMed] [Google Scholar]
  113. Rydin C., Wikström N., Bremer B. (2017). Conflicting results from mitochondrial genomic data challenge current views of Rubiaceae phylogeny. Am. J. Bot. 104 1522–1532. 10.3732/ajb.1700255 [DOI] [PubMed] [Google Scholar]
  114. Salih M. R. H., Majeský L’, Schwarzacher T., Gornall R., Heslop-Harrison P. (2017). Complete chloroplast genomes from apomictic Taraxacum (Asteraceae): identity and variation between three microspecies. PLoS One 12:e0168008. 10.1371/journal.pone.0168008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Sammons R. D., Gaines T. A. (2014). Glyphosate resistance: state of knowledge. Pest Manag. Sci. 70 1367–1377. 10.1002/ps.3743 [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Sammons R. D., You J., Qi Y., Flasinski S., Kavanaugh C., Washam J., et al. (2018). Evaluation of glyphosate resistance in Arabidopsis thaliana expressing an altered target site EPSPS. Pest Manag. Sci. 74 1174–1183. 10.1002/ps.4654 [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Sánchez Barreiro F., Vieira F. G., Martin M. D., Haile J., Gilbert M. T. P., Wales N. (2017). Characterizing restriction enzyme-associated loci in historic ragweed (Ambrosia artemisiifolia) voucher specimens using custom-designed RNA probes. Mol. Ecol. Res. 17 209–220. 10.1111/1755-0998.12610 [DOI] [PubMed] [Google Scholar]
  118. Schwarz E. N., Ruhlman T. A., Sabir J. S. M., Hajrah N. H., Alharbi N. S., Al-Malki A. L., et al. (2015). Plastid genome sequences of legumes reveal parallel inversions and multiple losses of rps16 in papilionoids. J. Syst. Evol. 53 458–468. 10.1111/jse.12179 [DOI] [Google Scholar]
  119. Shepherd L. D. (2017). A non-destructive DNA sampling technique for herbarium specimens. PLoS One 12:e0183555. 10.1371/journal.pone.0183555 [DOI] [PMC free article] [PubMed] [Google Scholar]
  120. Sinn B. T., Sedmak D. D., Kelly L. M., Freudenstein J. V. (2018). Total duplication of the small single copy region in the angiosperm plastome: rearrangement and inverted repeat instability in Asarum. Am. J. Bot. 105 71–84. 10.1002/ajb2.1001 [DOI] [PubMed] [Google Scholar]
  121. Sloan D. B., Triant D. A., Forrester N. J., Bergner L. M., Wu M., Taylor D. R. (2014). A recurring syndrome of accelerated plastid genome evolution in the angiosperm tribe Sileneae (Caryophyllaceae). Mol. Phylogenet. Evol. 72 82–89. 10.1016/j.ympev.2013.12.004 [DOI] [PubMed] [Google Scholar]
  122. Staats M., Cuenca A., Richardson J. E., Vrielink-van Ginkel R., Petersen G., Seberg O., et al. (2011). DNA damage in plant herbarium tissue. PLoS One 6:e28448. 10.1371/journal.pone.0028448 [DOI] [PMC free article] [PubMed] [Google Scholar]
  123. Staats M., Erkens R. H. J., van de Vossenberg B., Wieringa J. J., Kraaijeveld K., Stielow B., et al. (2013). Genomic treasure troves: complete genome sequencing of herbarium and insect museum specimens. PLoS One 8:e6918. 10.1371/journal.pone.0069189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  124. Suorsa M., Sirpiö S., Aro E.-M. (2009). Towards characterization of the chloroplast NAD(P)H dehydrogenase complex. Mol. Plant 2 1127–1140. 10.1093/mp/ssp052 [DOI] [PubMed] [Google Scholar]
  125. Taller J., Decsi K., Farkas E., Nagy E., Mátyás K. K., Kolics B., et al. (2016a). De novo transcriptome sequencing based identification of Amb a 3-like pollen allergen in common ragweed (Ambrosia artemisiifolia). J. Bot. Sci. 5 12–16. [Google Scholar]
  126. Taller J., Nagy E., Decsi K., Kutasy B., Mátyás K., Farkas E., et al. (2016b). Utilization of transcriptomics in weed research. A case study on common ragweed (Ambrosia artemisiifolia L.) the most widespread weed in Hungary. Magy. Gyomkut. Technol. 17 5–15. [Google Scholar]
  127. Taramarcaz P., Lambelet C., Clot B., Kleimer C., Hauser C. (2005). Ragweed (Ambrosia) progression and its health risks: will Switzerland resist this invasion? Swiss Med. Wkly. 135 538–548. [DOI] [PubMed] [Google Scholar]
  128. Thiel T., Michalek W., Varshney R. K., Graner A. (2003). Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106 411–422. 10.1007/s00122-002-1031-0 [DOI] [PubMed] [Google Scholar]
  129. Tillich M., Lehwark P., Pellizzer T., Ulbricht-Jones E. S., Fischer A., Bock R., et al. (2017). GeSeq – versatile and accurate annotation of organelle genomes. Nucl. Acids Res. 45 W6–W11. 10.1093/nar/gkx391 [DOI] [PMC free article] [PubMed] [Google Scholar]
  130. Twyford A. D., Ness R. W. (2017). Strategies for complete plastid genome sequencing. Mol. Ecol. Res. 17 858–868. 10.1111/1755-0998.12626 [DOI] [PMC free article] [PubMed] [Google Scholar]
  131. Vaidya G., Lohman D. J., Meier R. (2011). SequenceMatrix: concatenation software for the fast assembly of multi-gene datasets with character set and codon information. Cladistics 27 171–180. 10.1111/j.1096-0031.2010.00329.x [DOI] [PubMed] [Google Scholar]
  132. Van de Paer C., Bouchez O., Besnard G. (2018). Prospects on the evolutionary mitogenomics of plants: a case study on the olive family (Oleaceae). Mol. Ecol. Res. 18 407–423. 10.1111/1755-0998.12742 [DOI] [PubMed] [Google Scholar]
  133. Van de Paer C., Hong-Wa C., Jeziorski C., Besnard G. (2016). Mitogenomics of Hesperelaea, an extinct genus of Oleaceae. Gene 594 197–202. 10.1016/j.gene.2016.09.007 [DOI] [PubMed] [Google Scholar]
  134. Van Horn C. P., Moretti M. L., Robertson R. R., Segobye K., Weller S. C., Young B. G., et al. (2018). Glyphosate resistance in Ambrosia trifida: part 1. Novel rapid cell death response to glyphosate. Pest Manag. Sci. 74 1071–1078. 10.1002/ps.4567 [DOI] [PubMed] [Google Scholar]
  135. Vandenberg L. N., Blumberg B., Antoniou M. N., Benbrook C. M., Carroll L., Colborn T., et al. (2017). Is it time to reassess current safety standards for glyphosate-based herbicides? J. Epidemiol. Comm. Health 71 613–618. 10.1136/jech-2016-208463 [DOI] [PMC free article] [PubMed] [Google Scholar]
  136. Vargas O. M., Ortiz E. M., Simpson B. B. (2017). Conflicting phylogenomic signals reveal a pattern of reticulate evolution in a recent high-Andean diversification (Asteraceae: Astereae: Diplostephium). New Phytol. 214 1736–1750. 10.1111/nph.14530 [DOI] [PubMed] [Google Scholar]
  137. Vilá M., Basnou C., Pyšek P., Josefsson M., Genovesi P., Gollasch S., et al. (2010). How well do we understand the impacts of alien species on ecosystem services? A pan-European, cross-taxa assessment. Front. Ecol. Environ. 8 135–144. 10.1890/080083 [DOI] [Google Scholar]
  138. Virág E., Hegedûs G., Barta E., Nagy E., Mátyás K., Kolics B., et al. (2016). Illumina sequencing of common (short) ragweed (Ambrosia artemisiifolia L.) reproductive organs and leaves. Front. Plant. Sci. 7:1506. 10.3389/fpls.2016.01506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  139. Vitousek P. M., D’Antonio C. M., Loope L. L., Rejmanek M., Westbrooks R. (1997). Introduced species: a significant component of human-caused global change. New. Zeal. J. Ecol. 21 1–16. 10.1007/s00114-014-1153-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  140. von Boheemen L. A., Lombaert E., Nurkowski K. A., Gauffre B., Rieseberg L. H., Hodgins K. A. (2017). Multiple introductions, admixture and bridgehead invasion characterize the introduction history of Ambrosia artemisiifolia in Europe and Australia. Mol. Ecol. 26 5421–5434. 10.1111/mec.14293 [DOI] [PubMed] [Google Scholar]
  141. Walker J. F., Jansen R. K., Zanis M. J., Emery N. C. (2015). Sources of inversion variation in the small single copy (SSC) region of chloroplast genomes. Am. J. Bot. 102 1751–1752. 10.3732/ajb.1500299 [DOI] [PubMed] [Google Scholar]
  142. Walker J. F., Zanis M. J., Emery N. C. (2014). Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae). Am. J. Bot. 101 722–729. 10.3732/ajb.1400049 [DOI] [PubMed] [Google Scholar]
  143. Wicke S., Schneeweiss G. M. (2015). “Next-generation organellar genomics: potentials and pitfalls of high-throughput technologies for molecular evolutionary studies and plant systematics,” in Nextgeneration Sequencing in Plant Systematics, eds Hörandl E., Appelhans M. S. (Bratislava: International Association for Plant Taxonomy (IAPT)), 9–50. [Google Scholar]
  144. Wicke S., Schneeweiss G. M., dePamphilis C. W., Müller K. F., Quandt D. (2011). The evolution of the plastid chromosome in land plants: gene content, gene order, gene function. Plant. Mol. Biol. 76 273–297. 10.1007/s11103-011-9762-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  145. Wolfsberg T. G., Schafer S. S., Tatusov R. L., Tatusov T. A. (2001). Organelle genome resouces at NCBI. Tends Biochem. Sci. 26 199–203. 10.1016/S0968-0004(00)01773-4 [DOI] [PubMed] [Google Scholar]
  146. Wysocki W. P., Clark L. G., Attigala L., Ruiz-Sanchez E., Duvall M. R. (2015). Evolution of the bamboos (Bambusoideae; Poaceae): a full plastome phylogenomic analysis. BMC Evol. Biol. 15:50. 10.1186/s12862-015-0321-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  147. Xie Q., Shen K.-N., Hao X., Nam P. N., Hieu B. T. N., Chen C.-H., et al. (2015). The complete chloroplast genome of Tianshan snow lotus (Saussurea involucrata), a famous traditional Chinese medicinal plant of the family Asteraceae. Mitochondrial DNA B Resour. 28 294–295. 10.3109/19401736.2015.1118086 [DOI] [PubMed] [Google Scholar]
  148. Yin T., Cook D., Lawrence M. (2012). ggbio: an R package for extending the grammar of graphics for genomic data. Genome Biol. 13:R77. 10.1186/gb-2012-13-8-r77 [DOI] [PMC free article] [PubMed] [Google Scholar]
  149. Yu Q., Jalaludin A., Han H., Chen M., Sammons R. D., Powels S. B. (2015). Evolution of a double amino acid substitution in the 5-enolpyruvylshikimate-3-phosphate synthases in Eleusine indica conferring high-level glyphosate resistance. Plant Physiol. 167 1440–1447. 10.1104/pp.15.00146 [DOI] [PMC free article] [PubMed] [Google Scholar]
  150. Yuan J. S., Tranel P. J., Stewart C. N., Jr. (2007). Non-target-site herbicide resistance: a family business. Trends Plant Sci. 12 6–13. 10.1016/j.tplants.2006.11.001 [DOI] [PubMed] [Google Scholar]
  151. Zedane L., Hong-Wa C., Murienne J., Jeziorski C., Baldwin B. G., Besnard G. (2016). Museuomics illuminate the history of an extinct, paleoendemic plant lineage (Hesperelaea, Oleaceae) known from an 1875 collection from Guadalupe Island, Mexico. Biol. J. Linn. Soc. 117 44–57. 10.1111/bij.12509 [DOI] [Google Scholar]
  152. Zeng C.-X., Hollingsworth P. M., Yang J., He Z.-S., Zhang Z.-R., Li D.-Z., et al. (2018). Genome skimming herbarium specimens for DNA barcoding and phylogenomics. Plant Methods 14:43. 10.1186/s13007-018-0300-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  153. Zerbino D. R., Birney E. (2008). Velvet: algorithms for de novo short read assembly using Bruijin graphs. Genome Res. 18 821–829. 10.1101/gr.074492.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  154. Zhang Y., Pei X., Lu Z., Wang Z., Jia S., Li W. (2012). De novo foliar transcriptome of Chenopodium amaranticolor and analysis of its gene expression during virus-induced hypersensitive response. PLoS One 7:e45953. 10.1371/journal.pone.0045953 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

FIGURE S1

LASTZ dot-plot comparison among Asteraceae complete plastid genome sequences.

FIGURE S2

Inverted repeat plot of 30 Asteraceae chloroplast genomes.

FIGURE S3

Phylogenetic tree of Asteraceae based on the analysis of fifty protein coding genes with parsimony used as an optimality criterion.

FIGURE S4

Evaluation of the skewness and compositional heterogeneity across the combined and partition-specific variations of the long genes ndhD, ndhH, psaA, psaB, psbA, psbB, psbC, and rbcL.

TABLE S1

List of the sequenced plastomes used for the phylogenomic analysis.

TABLE S2

Summary of the aligned transcriptomics reads.

TABLE S3

List of genes in the chloroplast genome of Ambrosia trifida.

TABLE S4

Genes having an intron in the Ambrosia trifida plastid genome, and the length of the exons and introns.

TABLE S5

Intronic reads per kb per million mapped (RPKM) across the herbicide treatments in the plastid genome of giant ragweed (Ambrosia trifida).


Articles from Frontiers in Plant Science are provided here courtesy of Frontiers Media SA

RESOURCES