Abstract
Capitate glandular trichomes (CGT), one type of glandular trichomes, are most common in Asteraceae species. CGT can produce various secondary metabolites such as sesquiterpene lactones (STLs) and provide durable resistance to insect pests. In sunflower, CGT-based host resistance is effective to combat the specialist pest, sunflower moth. However, the genetic basis of CGT density is not well understood in sunflower. In this study, we identified two major QTL controlling CGT density in sunflower florets by using a F4 mapping population derived from the cross HA 300 × RHA 464 with a genetic linkage map constructed from genotyping-by-sequencing data and composed of 2121 SNP markers. One major QTL is located on chromosome 5, which explained 11.61% of the observed phenotypic variation, and the second QTL is located on chromosome 6, which explained 14.06% of the observed phenotypic variation. The QTL effects and the association between CGT density and QTL support interval were confirmed in a validation population which included 39 sunflower inbred lines with diverse genetic backgrounds. We also identified two strong candidate genes in the QTL support intervals, and the functions of their orthologs in other plant species suggested their potential roles in regulating capitate glandular trichome density in sunflower. Our results provide valuable information to sunflower breeding community for developing host resistance to sunflower insect pests.
Keywords: sesquiterpenes, capitate glandular trichome, glandular trichome, sunflower, Helianthus annuus L., heat shock transcription factor, WRKY transcription factor
Introduction
Plant trichomes, the hair-like structures on above-ground plant surfaces, are key features governing interaction with the environment, including biotic and abiotic factors. Plant trichomes vary greatly in morphology and function, with at least 300 known types of plant trichomes (Wagner, 1991; Spring, 2000; Werker, 2000). Based on their metabolic activity, plant trichomes are classified into two broad groups: glandular trichomes (GTs) and non-glandular trichomes (non-GTs), which co-exist on plant surfaces like leaves, flowers, stems, and bracts (Hare and Elle, 2002; Rautio et al., 2002). A well-studied example of non-GTs is from the model plant Arabidopsis thaliana, which possesses trichomes that are unicellular, unbranched or with two to five branches (Oppenheimer et al., 1991; Larkin, 1994; Szymanski et al., 2000). In contrast, GTs are usually multicellular and consist of differentiated basal, stalk, and secretory cells. GTs have been found on ~30% of all vascular plants, particularly in dicots.
Numerous studies have shown plant trichomes are involved in water protection, absorbing UV radiation, and attracting pollinators by releasing specific chemicals (Skaltsa et al., 1994; Moraes et al., 1998; Paré and Tumlinson, 1999; Benz and Martin, 2006; Lusa et al., 2014). However, the most notable function of plant trichomes is their role in plant defense. Non-GTs serve as physical obstacles to create an unfavorable microenvironment for herbivores, parasites, and pathogens. Some non-GTs form spine- or thorn-like structures that are able to injure and trap insects and other pests during their searching and feeding activities (Pillemer and Tingey, 1976; Riddick and Wu, 2011). On the other hand, GTs are metabolic factories and mainly function in chemical defense in plants (Peiffer et al., 2009; Tian et al., 2012). The biochemical pathways and metabolic profiles in GTs have been studied in great detail in plants (Tissier, 2012). GTs can synthesize and release a wide range of chemicals, including flavonoids, terpenes, alkaloids, acyl-sugars, methyl-ketones and surface proteins (Glas et al., 2012). Recent molecular studies confirm, in different plant species, that trichome development is under the control of a conserved regulatory network, and have identified the function of several key genes in the network (Serna and Martin, 2006; Yang and Ye, 2012; Pattanaik et al., 2014). Compared with leaf tissues, more than 200 genes involved in secondary metabolism and defense-related genes are differentially expressed in trichomes in Nicotiana tabacum, and some of these were only expressed in trichomes (Cui et al., 2011). These secondary metabolites are either effective toxins against the pests or attractors to the natural enemies of the pests (Howe and Jander, 2008; War et al., 2012). In some cases, GTs actively release resinous or sticky chemicals on plant surfaces to limit insect movement and function as a physical defense (Simmons and Gurr, 2005; Glas et al., 2012). Studies on wild species of Medicago have shown that erect GTs are effective against stem-, leaf-, and fruit-eating pests (Kitch et al., 1985; Danielson et al., 1990), demonstrating their wide role in defense of many types of tissues.
In Asteraceae, GTs have been the subject of many studies for their ability to synthesize various secondary metabolites of ecological and medical importance (Wang et al., 2009; Chadwick et al., 2013; Lv et al., 2017). GTs on tarweed (Madia elegans) entrap insects by releasing sticky chemicals and provide carrion as food resource for predators. This indirect defense efficiently decreases the herbivore's activities and increases plant fitness under natural field conditions (Krimmel and Pearse, 2013). Among all the secondary metabolites produced by GTs, sesquiterpene lactones (STLs) are the most prevalent within Asteraceae species. Chemically, STLs derive from the isoprenoid biosynthetic pathway and contain a basic backbone of 15 carbon atoms (Zidorn, 2008). STLs show great structural variety in the arrangement of the basic skeleton and in the composition of the side chain. More than 7,000 distinct molecules of STL have been reported so far (Fraga, 2005). Besides of their ecological role in nature, STLs provide benefits to human health (Chadwick et al., 2013). Artemisinin, a unique STL molecule isolated from sweet wormwood (Artemisia annua), is the most effective treatment for malaria (Covello et al., 2007). Other studies also showed STLs have anti-carcinogenic and anti-inflammatory effects (Li-Weber et al., 2002; Ghantous et al., 2010; Nasim et al., 2011). Consequently, there is great interest in understanding the anatomy, development, biochemistry and molecular biology of GTs in Asteraceae, and the biosynthetic pathways of artemisinin and other STLs have been well studied (Spring and Bienert, 1987; Covello et al., 2007; Zidorn, 2008; Wang et al., 2009; Duarte and Empinotti, 2012; Menin et al., 2012; Michalska et al., 2013; Eljounaidi et al., 2014; Aschenbrenner et al., 2016; Bombo et al., 2016; Lv et al., 2017).
In sunflower (Helianthus annuus), there are three main types of trichomes: non-GTs, linear glandular trichomes (LGTs), and capitate glandular trichomes (CGTs). As early as 60–70 h post-germination, these three types of trichomes form on the primordia of the first true leaves, and LGTs and CGTs begin active biosynthesis and secretion of secondary metabolites such as flavonoids, terpenoids, and sesquiterpenes at 72–96 h post-germination (Aschenbrenner et al., 2014). The LGTs are often found among non-GTs and are present on the most parts of sunflower plant, while CGTs only occur on the leaf surfaces and the anther appendages (distal ends) of sunflower florets (Aschenbrenner et al., 2013). In fact, CGTs are the only type of trichome found on anther appendages, where several important sunflower pests feed on pollen and florets, especially the most damaging insect pest of North American sunflower, the sunflower moth (Homoeosoma electellum Hulst.). As seen in other Asteraceae, sunflower CGTs synthesize STLs as the primary chemical defense component against herbivores (Göpfert et al., 2005; Rowe et al., 2012).
Since STL are phytotoxic, they are mainly synthesized in the stalk cells of CGTs and secreted and accumulated in the cuticle globes (Göpfert et al., 2005; Amrehn et al., 2013, 2015). Three sesquiterpene synthases (HaGAS1, HaGAS2, and HaTPS2) and one monooxynase (HaGAO) involved in the STL biosynthesis pathway have been identified in sunflower plants (Göpfert et al., 2009; Amrehn et al., 2015). The expression pattern of these genes and the STL biosynthesis activity accompanied anther development, and STL levels in CGTs reached a peak before disk flower opening and remained more or less constant for several days (Göpfert et al., 2009; Amrehn et al., 2015). While feeding or visiting, the insects cause the rupture of the CGTs on anther florets and lead to direct contact with STL. In vitro studies have shown that both polar and non-polar extracts of CGTs caused high mortality and retarded growth for the larvae of sunflower moth, and the purified STL species also showed similar effects (Rossiter et al., 1986; Rogers et al., 1987; Prasifka, 2015; Prasifka et al., 2015). Interestingly, the extracts of CGTs are also effective against other floret-feeding insects like yellowstriped armyworm (Spodoptera ornithogalli) and migratory grasshopper (Melanoplus sanguinipes), indicating a broader spectrum of CGT-mediated defense (Mabry et al., 1977; Isman, 1985).
Three observations have lead us to perform this study: (1) the number of plant trichomes is heritable. In plant taxonomy, trichomes are used as a grouping character since it is a heritable trait across different plant species (Spring, 2000; Reis et al., 2002). Studies in other crop plants have also shown GT density is a quantitatively-inherited trait (Kitch et al., 1985; Agren and Schemske, 1992; Maliepaard et al., 1995; Andrade et al., 2017). However, such information is not available in sunflower. (2) Studies in sunflower have suggested the possibility of developing inbreds or hybrids with high CGT density. Previously, Prasifka (2015) has shown that public sunflower inbred lines (both male and female) present a large range in mean CGT number (>50-fold differences), but wild sunflower and commercial hybrids showed much less variation in the mean of CGT number (~5-fold differences). Thus, breeding a sunflower line with high CGT density is plausible. (3) A positive correlation between CGT number and STL levels in sunflower florets has been documented (Prasifka et al., 2015). Increasing CGT density in sunflower florets could be a strategy for reducing the fitness of insect pests like sunflower moth, thereby reducing damage from insect pests. Therefore, the objective of this study is to identify the QTL responsible for CGT density of anther florets in sunflower. We believe this study will advance our understanding of CGT-mediated resistance to insect pests in sunflower and also benefit sunflower breeding programs by providing the first marker-assisted resource for insect resistance.
Materials and methods
Plant materials
Plants of H. annuus genotype HA 300 (PI 552938) and RHA 464 (PI 655015; Hulke et al., 2010) were grown in the one-gallon plastic pots under greenhouse conditions with a photoperiod of 16-h light and 8-h dark. High CGT number per floret was observed in maintainer inbred HA300, and low CGT number per floret was observed in restorer inbred RHA 464 (Prasifka, 2015). To map the genetic factors which control CGT number per floret in sunflower, crosses were made between the parents HA 300 (female) and RHA 464 (male) in the 2013 winter greenhouse, and F1 plants were grown in a field environment near Fargo, ND, in the summer of 2013. The 300 selfed F2 plants and 280 F3 plants were grown in the 2014 winter greenhouse and 2014 Fargo field, respectively. A total of 239 F4 plants were grown in the 2015 winter greenhouse in the conditions described above and subjected to CGT counting and DNA sampling.
To validate the QTL mapping results of the F4 population, a separate diversity panel of 39 inbred lines was selected, which provided a distinct genetic background based on breeding history and pedigree information. These lines were grown in 2016 under greenhouse conditions for CGT counting.
CGTs were sampled and counted in single environments because of the very high heritability of the trait and high cost of evaluation. Prasifka (2015) assessed CGT number from public inbred lines and sampled three replicate plants with three subsamples (florets) from each plant in field environments in 2012 and 2013, finding a very high correlation between the two years (environments; R2 = 0.98). Similarly, Spring and Bienert (1987) found that while compounds within glands on leaves were affected by lighting, the number of glands per unit area were not. These published data suggest this trait does not possess genotype-by-environment variation generally typical of quantitative traits.
CGT imaging and counting
Unopened sunflower florets from 239 F4 plants were removed with forceps from the outermost 1 cm of the capitulum 1 day before anthesis. After storage at −20°C, one floret per plant was dissected by removing the corolla and making a latitudinal cut through the fused anther tube, after which the unfurled anther tube was attached to an aluminum mount with double-sided carbon adhesive tape (Ted Pella Inc., Redding CA, USA). Mounted groups of 15 florets were sputter-coated with a conductive layer of gold/palladium (Balzers SCD 030, Balzers Union Ltd., Liechtenstein). Scanning electron micrographs (SEM) were obtained using a JEOL JSM-6490LV scanning electron microscope (JEOL USA, Inc., Peabody MA, USA) operating at an accelerating voltage of 15 kV and a magnification of 45 ×. Since the floret area of HA 300 (mean = 14.5, SE = 0.75, n = 25) and RHA 464 (mean = 13.3, SE = 0.2, n = 25) are similar, micrographs subsequently were used to quantify the total number of CGT per floret. To estimate the counting error between florets on the same plant (natural variation or trichome loss due to handling), a second floret of each of 30 plants in the sampled F4 population was analyzed by SEM and compared to the initial result.
DNA sampling and sequencing
Leaf tissues were collected from each F4 plant, and genomic DNA (gDNA) was extracted from lyophilized leaf material according to the Qiagen DNeasy 96 Plant Kit protocol. Subsequently, library prep was performed using a modified Genotyping by Sequencing (GBS) protocol (Meyer and Kircher, 2010). This protocol used restriction enzymes Mse1 and EcoRI to first digest and fragment the gDNA. The result was a pool of fragments with sticky-ends from restriction cut sites that provide the template for adaptor ligation. Illumina adaptors and barcodes were ligated to the digested fragments. A subset of fragments were then amplified using Illumina PCR primers. The samples were normalized and pooled following PCR, using SequalPrep normalization kit (Thermo Fisher Scientific, USA). Next, the pooled, normalized samples were run out on a 2.5% agarose gel. Using a 1,000 bp ladder, we selected 300–400 bp fragments by cutting out this segment from the gel. The gel segment was then purified using Qiagen gel purification kit according to accompanying protocol. Samples were sent to the University of Texas, Austin, for sequencing on an Illumina HiSeq 2500 sequencer. The results were multiplexed 150 bp single-end barcoded GBS reads.
For HA 300, RHA 464, and a portion of the validation panel, genomic DNA was extracted from lyophilized leaf, and whole genome sequencing was performed. Genomic libraries were prepared using Nextera® XT DNA library prep kits (Illumina®) according to the protocol. Each gDNA sample was diluted to the appropriate concentration using a Qubit 3.0 fluorometer (Thermo Fisher Scientific, USA). Each sample was barcoded by the unique dual index adapters Nextera® i5 and i7. Resulting libraries were cleaned using solid-phase reversible immobilization (SPRI) to remove fragment sizes less than 300 base pairs via an epMotion 5075TMX automated liquid handling system (Eppendorf North America). Sample quality control (QC) was conducted to ensure appropriate sample concentration and fragment size using a Qubit 3.0 fluorometer and an Agilent 2100 Bioanalyzer prior to normalizing the loading concentration to 1.8–2.1 pM with 1% PhiX control v3 added (Illumina®). Samples that passed QC were processed for paired-end 150 base pair reads on the Illumina NextSeq® sequencer. Whole genome sequencing was conducted at the BioFrontiers Institute Next-Generation Sequencing Facility at University of Colorado. The remainder of the validation panel was sequenced as part of the sunflower SAM population by the Genome Quebec Innovation Centre at McGill University, Montreal, QC, Canada (Mandel et al., 2013; Burke, pers. comm.).
SNP marker calling
All GBS reads were demultiplexed using process_radtags (v1.45) command from the Stacks software suite (Catchen et al., 2013). Default parameters were used with the exception of adding the “r” and “disable_rad_check” flags. After demultiplexing, each sample was checked for overrepresented sequences with FastQC (v0.11.5; Andrews, 2010). Trimmomatic (v0.35; Bolger et al., 2014) was used with default parameters to trim and clean the demultiplexed sequences. Reads with overrepresented sequences as identified by FastQC were also trimmed by appending them to the “TruSeq3-SE.fa” adapters file distributed with Trimmomatic. Reads were aligned to the sunflower reference genome (HA 412HO_v1.1) using the BWA mem (v0.7.12-r1039; Li and Durbin, 2009) algorithm with default parameters and the duplicates in alignments were marked with Picard (v2.8.1; Broad Institute). A realignment and base call recalibration were performed with GATK (Genome Analysis Toolkit) and followed the GATK Best Practices. Finally, the alignment quality was checked with Qualimap (v2.2.1; Okonechnikov et al., 2015), which rendered the data ready for variant calling. The variant calling was performed with freebayes (v1.1.0-1-gf15e66e; Garrison and Marth, 2012), and WGS data from the parental lines (HA 300 and RHA 464) were jointly genotyped to improve the variant calling quality. The variant sites were filtered using vcftools (v0.1.13; Danecek et al., 2011). We kept only bi-allelic single nucleotide polymorphic (SNP) sites, and required a minimum site quality of 30 and a minor allele frequency greater than 0.05.
In addition to filtering sites with vcftools, we applied a protocol to filter sites that are likely to be misassembled multi-copy portions of the genome. Repetitive sites are susceptible to misassembly with respect to the reference sequence and may result in several reads erroneously mapping to the same locus. To reduce the number of these sites in our variant calls, we assessed sequencing depth of whole genome shotgun sequence reads from four modern sunflower cultivars and one landrace (Table S1) aligned to the reference genome. We used the samtools depth command to produce the number of mapped sequences per site for each individual. We summed the depths at each position, and calculated the frequency of summed depth values across samples. We then plotted the summed depth values against their frequencies, providing a visual means to choose a range of summed depth values that are most likely to correspond to well-assembled single copy sites of the genome. We then filtered our vcf table to only include the identified single-copy sites. Out of 3,200,466 diagnostic sites differing between the parental genomes, we identified 1,138,669 single-copy diagnostic SNPs. After filtering the vcf table, missing genotypic information from the GBS data of the F4 population was trio imputed using our own custom phaser2.pl and genotyper7.pl software (Kane, 2017), and leveraging the whole genome sequence of the parent lines HA 300 and RHA 464. These programs are designed to accurately impute for high-quality single-copy, diagnostic SNPs. Because of our extensive filtering, and high quality data, these assumptions were met.
QTL analysis
QTL analyses were carried out using the R/qtl package in R version 3.2.3 (Broman et al., 2003). The genetic map was constructed with the est.map function (assuming a genotyping error rate of 0.001). Single QTL analysis and LOD scores were calculated by a single QTL genome scan (scanone function) with standard interval mapping (0.1 cM steps, assuming a genotyping error rate of 0.001). Pairwise QTL interactions were calculated using a two-dimensional QTL scan (scantwo function) via standard interval mapping (0.1 cM steps, assuming a genotyping error rate of 0.001). LOD significance thresholds for type I error rates of α < 0.05 were determined by running 1,000 permutations on the single- and two-dimensional QTL scan. In addition, the stepwiseqtl function (max.qtl = 4, additive.only = FALSE) was used to perform forward/backward stepwise search, and find the QTL model which has the highest LOD score. The fitqtl function was used to create a QTL project to fit the phenotypic data with the selected model, and one QTL was omitted at a time to obtain an ANOVA table.
Validation of the mapped QTL
To validate the QTL results, the SNP markers were called from the validation population together with HA 300 and RHA 464, as described above. The SNP markers derived from the QTL support interval were extracted. Monomorphic SNPs between the two parent lines were removed, and the heterozygous SNPs and markers with missing data in one of the parent lines were also filtered out. The SNP markers were treated as fixed effects and all possible marker pairs were tested by fitting a regression function in R as y ~ SNPm + SNPn, where y is the CGT number, SNPm is the effect of the mth marker from chromosome 5, and SNPn is the effect of the nth marker from chromosome 6. The significance threshold for multiple comparisons was determined by the adjusted p-values with the false discovery rate at 0.05. (Benjamini and Yekutieli, 2001).
Identification and phylogenetic analysis of candidate genes within the QTL support interval
To identify the candidate genes with the highest likelihood of influence on the phenotype, we first manually checked through the QTL support interval region in the sunflower reference genome (INRA Sunflower Bioinformatics Resources, 2014) and listed all the annotated genes (Table S5). We then filtered the list based on two criteria: (1) significant association between a co-located SNP and CGT number in the validation population; and (2) putative gene function. The sequences of selected candidate genes were further characterized by two approaches, NCBI (National Center for Biotechnology Information) ORFfinder search and BLAST homology search. All ORFs within these sequences were identified using NCBI ORFfinder with the following parameters: a minimal ORF length of 150 nucleotides, the standard genetic code, ATG and alternative start codons, and ignore nested ORFs. BLASTp searches were carried out with the given ORFs to obtain the most similar protein sequences in Asteraceae, Solanaceae, Malvaceae and Brassicaceae. The sequences with an E-value less than 10−5 were selected, combined, and searched for the presence of conserved protein domains using ScanProsite (Sigrist et al., 2012). Comparisons were made among HA 412HO (reference sequence), HA 300, and RHA 464 for genomic sequence differences, alternative splice sites, and changes in protein domains at the selected loci. For phylogenetic analysis, the conserved protein domain sequences with 10–15 additional amino acids on the N-terminal and C-terminal ends were used for alignment. The alignment was done with the L-INS-i strategy implemented in MAFFT (version 7.310; Katoh, 2002). The BLOSUM62 scoring matrix and a gap opening penalty of 1.5 were selected to assess the phylogenetic relationship among the protein sequences. After the alignment, a neighbor-joining tree was constructed with PhyML including the Smart Model Selection approach (version 3; Guindon et al., 2010; Lefort et al., 2017), bootstrap reassembling with 1,000 bootstrap samples. The phylogenetic tree was colorized according to protein families in Archaeopteryx (version 0.9920 beta; Han and Zmasek, 2009).
Results
Capitate glandular trichome density in mapping population
The two parents presented a significant difference in CGT density, with the male parent RHA 464 having ~2 CGT per floret and the female parent HA 300 having ~300 CGT per floret (Figure 1). The CGT numbers per floret were counted for 239 F4 plants, and 179 plants with good genotypic data quality were selected as the final mapping population. Based on frequency distribution, 179 F4 plants were classified into three groups (Figure 1). Of these 179 F4 plants, 25 plants (13.9%) had high CGT density which is more than 150 CGT per floret, 109 plants (60.9%) had medium CGT density which is 25–150 CGT per floret and 45 plants (25.2%) had low CGT density which is less than 25 CGT per floret. Shapiro-Wilk normality test (Shapiro and Wilk, 1965) indicated that the CGT number in the F4 population was not normally distributed (W = 0.915, p < 1.1 × 10−8), and the distribution of CGT number was moderately skewed toward the lower CGT number (Skewness = 0.98). The original mapping population with 239 individuals showed a similar distribution of CGT number (Skewness = 0.97; W = 0.916, p < 1.1 × 10−10).
To estimate the potential error in CGT counting (natural variation or trichome loss due to handling), 30 plants from the F4 population were randomly selected for recounting. A second floret from each of these 30 plants was counted, and the CGT numbers from first floret and second floret were highly correlated (r = 0.98, p < 2.2 × 10−16) (Figure S1).
Genetic map construction
A total of 443.2 M raw GBS reads were generated from 239 F4 plants with the Illumina HiSeq 2500 sequencing system. After trimming the adapters and overrepresented sequences with Trimmomatic (v0.35; Bolger et al., 2014), the number of remaining reads was 210.1 M. From these 210.1 M reads, 97.9% (205.7 M) of the reads were successfully aligned to the sunflower reference genome HA 412HO_v1.1., and the mean length of reads was 74.3 bp. Before performing variant calling, five individuals were removed due to poor genotyping quality. A total of 16,028,511 SNPs was produced in the initial variant calling, and 1,138,669 single-copy diagnostic SNP were retained after applying several filters (minQ > 30, MAF > 0.05, biallelic only). To further control genotypic data quality, we also checked each individual in the F4 population, and the individuals with low coverage on one or more chromosomes (less than 10 SNPs per chromosome) were dropped. A total of 179 individuals with acceptable sequence quality were kept to conduct imputation. A total of 1.09 million SNPs was generated from the imputation, and monomorphic markers and markers with LD above 0.9 were filtered out first, followed by removing SNPs with high missing rate and significant distortion from expected Mendelian ratio. Finally, 2,121 high quality SNPs were kept and used for constructing the genetic linkage map (Figure 2).
The 2121 SNP markers were distributed in all 17 chromosomes, and the number of markers per chromosome varies from 32 on chromosome 7 to 238 on chromosome 10 (Table 1). The total length of the map was 1,519.52 cM, and the size of chromosomes varies from 13.48 cM (chromosome 7) to 125.58 cM (chromosome 17). Chromosome 7 had the fewest SNP markers and shortest map distance. The average density per marker was 0.72 cM, and the largest gap between two adjacent loci was 27.32 cM on chromosome 6.
Table 1.
Linkage groups | Number of SNPs | Map length (cM) | Density (cM/marker) | Largest gap |
---|---|---|---|---|
1 | 129 | 54.56 | 0.43 | 2.58 |
2 | 149 | 96.23 | 0.65 | 9.71 |
3 | 134 | 83.62 | 0.63 | 5.36 |
4 | 86 | 74.24 | 0.87 | 11.91 |
5 | 147 | 107.71 | 0.74 | 7.58 |
6 | 73 | 103.31 | 1.43 | 27.32 |
7 | 32 | 13.48 | 0.43 | 0.85 |
8 | 130 | 95.07 | 0.74 | 5.01 |
9 | 129 | 100.54 | 0.79 | 4.43 |
10 | 238 | 118.06 | 0.50 | 6.30 |
11 | 141 | 114.65 | 0.82 | 9.63 |
12 | 129 | 81.91 | 0.64 | 3.49 |
13 | 123 | 97.05 | 0.80 | 8.63 |
14 | 95 | 87.95 | 0.94 | 12.24 |
15 | 112 | 70.77 | 0.64 | 2.29 |
16 | 86 | 94.79 | 1.12 | 9.51 |
17 | 188 | 125.58 | 0.67 | 5.94 |
Total | 2121 | 1519.52 | 0.72 | / |
QTL analysis
After performing a single QTL scan on the data collected from the F4 population, two putative QTL were revealed. One was mapped to chromosome Ha5 at the 14.64 cM position, and the other one was mapped to chromosome Ha6 at the 60.72 cM position (Figure 3). Both QTL had LOD scores greater than the LOD threshold (4.70, α = 0.05), which was 95 percentile of the distribution of genome-wide max LOD obtained by 1,000 permutations (Table S2). Therefore, a two-QTL model scan was performed with the scantwo function, and the same pair of QTL on Ha5 and Ha6 were identified (Table S3). In this two-QTL model, the additive effect of these two QTL was strongly supported by the data, but no epistasis interaction was detected. The two-QTL additive model (Ha5@14.6, Ha6@60.5) explained 24.58% of CGT number variation for the F4 mapping population, while the single QTL model explained 11.61 and 14.06% of the CGT number variation, respectively (Table 2). To confirm the QTL model, we also performed a forward/backward stepwise search, allowing four QTL at maximum, and found the two-QTL model had the best LOD score. The closest SNPs to each QTL were identified, and the phenotypes were plotted against genotypes at the putative QTL (Figure S2, Table S4). The 1.5—LOD support interval for SNP marker Ha5_11356218 was 12.1–15.7 cM, which extended from 8.07 to 12.65 Mbp on the physical map, and the support interval of SNP marker Ha6_8364901 was 41.3–76.3 cM, which covered from 7.42 to 8.58 Mbp on the physical map (Table 2, Figure S3).
Table 2.
QTL | Closest SNP | LOD | Explained variance (%) | p-value | 1.5-LOD support interval |
---|---|---|---|---|---|
Ha5@14.6 | Ha5_11356218 | 10.06 | 11.61 | 1.85 × 10−5*** | 12.1–15.7 cM |
Ha6@60.5 | Ha6_8364901 | 12.19 | 14.06 | 2.18 × 10−6*** | 41.3–76.3 cM |
Ha5@14.6 + Ha6@60.5a | / | 10.97 | 24.58 | 4.89 × 10−10*** | / |
Ha5@14.6*Ha6@60.5b | / | 1.62 | 0.935 | 0.45 | / |
Significant at p < 0.001 as determined by permutation testing.
Additive effect of two QTL.
Interaction effect of two QTL.
Validation of the mapped QTL
To validate the QTL analysis, we selected 39 inbred lines which were whole genome sequenced with 10 × coverage, and had phenotypic data from a previous study (Prasifka, 2015). With the WGS reads, 609914 SNPs were produced from variant calling and 485516 good quality SNPs were kept after applying the additional filters (minQ > 30, MAF > 0.05, biallelic only, max missing rate <10%). A phylogenetic tree was constructed from these 485516 SNPs using the SNPhylo pipeline (Lee et al., 2014; Figure S4). As shown in Figure S4, the two parent lines, RHA 464 and HA 300, were clustered into two distinct subgroups, and the other 37 lines in the validation population presented additional genetic backgrounds.
To test the two-QTL model, which was suggested by the QTL mapping results, we retrieved all the SNP markers from the QTL support intervals. A total of 990 SNP markers were extracted from the QTL support interval on chromosome 5, with a marker density of 0.22 SNP per kb, and 363 SNP markers were extracted from the QTL support interval on chromosome 6, with a marker density of 0.31 SNP per kb. After filtering these SNP markers, as described above, we kept 385 SNP markers from the QTL support interval on chromosome 5 and 33 SNP markers from the QTL support interval on chromosome 6. Next, we tested pairs of SNP markers, one from each QTL, with a linear regression function in R [lm(y ~ SNPm + SNPn), where y is the CGT number, SNPm is the effect of the mth marker on chromosome 5 and SNPn is the effect of the nth marker on chromosome 6]. In total, 12705 two-QTL models were tested, and 414 of these models were detected as significant with the adjusted p < 0.05 (p = 1.6 × 10−4) (Figures 4A,B). The best two-QTL model explained 59.9% of total phenotypic variation.
Identification of putative genes controlling CGT number
To identify putative genes which control CGT number in sunflower, we manually checked through the physical regions, which included the Ha5_11356218 marker support interval and the Ha6_8364901 marker support interval, with sunflower genome JBrowse (INRA Sunflower Bioinformatics Resources, 2014). A total of 105 genes were predicted within the Ha5_11356218 marker support interval on chromosome 5, and 35 genes were predicted within the Ha6_8364901 marker support interval on chromosome 6. A list of these genes with SNP marker data and expression profile in flower tissues is shown in Table S5.
Since some of the SNP markers in the QTL support intervals were within these genes, we found 16 genes on chromosome 5 and five genes on chromosome 6 with variation that had significant associations with CGT number (Figures 4A,B). Based on gene annotations that suggested plausible functional association, the genes co-localized with Ha5_10149906 and Ha6_7633946 were selected for further characterization. This pair of markers had strong association with CGT number (adjusted p < 0.05, p = 3.6 × 10−5) and the two-QTL model explained 47.2% of phenotypic variation in the validation data. The genotypic data at these loci together with phenotypic data are shown in Table 3. A single factor ANOVA was also used to test the association between each SNP marker and CGT number. The marker Ha5_10149906 showed strong association with CGT number (p = 0.004), explaining 20.3% of the phenotypic variation, and the single SNP marker Ha6_7633946 also showed significant association with CGT number (p = 0.0004), explaining 31.1% of the phenotypic variation (Figures 4C,D).
Table 3.
Inbred lines | CGT number | Ha5_10149906 | Ha6_7633946 |
---|---|---|---|
RHA 464 (male parent) | 2 | CC | CC |
HA 300 (female parent) | 309 | TT | TT |
HA 124 | 28 | CC | CC |
HA 133 | 188 | TT | TT |
HA 207 | 195 | TT | CC |
HA 232 | 92 | / | / |
HA 248 | 334 | / | / |
HA 291 | 60 | CC | CC |
HA 292 | 108 | CC | CC |
HA 301 | 30 | / | / |
HA 305 | 235 | CC | CC |
HA 321 | 174 | CC | TT |
HA 349 | 133 | CC | CC |
HA 371 | 109 | TT | CC |
HA 383 | 75 | TT | / |
HA 404 | 318 | CC | TT |
HA 441 | 169 | TT | CC |
HA 445 | 136 | / | / |
HA 456 | 88 | CC | CC |
HA 467 | 140 | / | / |
HA 821 | 320 | TT | TT |
HA 851 | 117 | CC | CC |
HA 853 | 111 | CC | TT |
HA 89 | 71 | CC | CC |
ND-NONOIL M3 | 70 | CC | TT |
RHA 270 | 196 | CC | CC |
RHA 272 | 49 | CC | CC |
RHA 273 | 0 | CC | CC |
RHA 274 | 91 | CC | CC |
RHA 278 | 227 | CC | TT |
RHA 331 | 108 | CC | CC |
RHA 374 | 101 | CC | TT |
RHA 377 | 45 | CC | CC |
RHA 386 | 55 | CC | CC |
RHA 388 | 65 | CC | CC |
RHA 396 | 0 | CC | CC |
RHA 400 | 160 | CC | TT |
RHA 408 | 55 | CC | CC |
RHA 468 | 118 | / | / |
RHA 855 | 13 | CC | CC |
The SNP marker Ha5_10149906 is located within the second exon of the gene Ha5g003120, which is annotated as a member of heat shock transcription factor (HSF) family (Table S5). The HSF proteins are not only involved in heat stress responses but also participate in many other developmental/physiological activities such as cell division and root growth (Westerheide et al., 2012). As shown in the phylogenetic tree (Figure S5), sunflower Ha5g003120 gene is orthologous to Arabidopsis thaliana gene HSF2A. Furthermore, we obtained the genomic sequence of gene Ha5g003120 from the two parents, HA 300 and RHA 464, and the reference genome HA 412HO, and performed alignments. Interestingly, the genomic sequence in RHA 464 has a 326 bp deletion in the promoter region of Ha5g003120, but this deletion is not present in HA 300 and HA 412HO (Figure 5A).
The SNP marker Ha6_7633946 is located in the 3′-UTR (untranslated region) of the gene Ha6g003560, which is annotated as a member of the WRKY transcription factor family (Table S5). WRKY proteins are key regulators of many biotic, abiotic and physiological responses in plants (Phukan et al., 2016). Similarly, the ORFs within the gene sequence of Ha6g003560 were confirmed with NCBI ORFfinder. The confirmed ORF sequence was used for BLASTp searches to obtain the most similar protein sequences in other phyla (Figure S6). As shown in the phylogenetic tree, Ha6g003560 closely resembles Arabidopsis thaliana gene WRKY44/TTG2. It has been shown that the WRKY44/TTG2 gene regulates trichome development in Arabidopsis (Ishida et al., 2007). In addition, we also performed alignments with the genomic sequence of gene Ha6_7633946 from HA 300, RHA 273 and reference genome XRQ. Since the sequence quality was poor in this region in RHA 464 we used the sequence from RHA 273 instead because these two inbred lines are phylogenetically close, they share the same phenotype for CGT, and they share the same haplotype of chromosome 6 (Table 3, Figure S4). As a result, we found a 51 bp deletion in the intron of gene Ha6_7633946 in RHA 273, and this deletion results in an alternative splicing (Figures 5B, 6). These results make the putative HSF gene Ha5g003120 and WRKY gene Ha6g003560 good candidates as the regulators of CGT density in sunflower florets.
Discussion
Trichomes, especially glandular trichomes, play an important role in plant defense. CGTs are most common in Asteraceae species (Bombo et al., 2016). CGTs are well-known for their ability to produce various secondary metabolites such as STLs. In sunflower, however, the genetics of CGT density is not well understood. In this study, we reported for the first time that CGT density in sunflower florets is a quantitative trait, and two major QTL were identified in a biparental mapping population. The F4 mapping population derived from high CGT density parent HA 300 and low CGT density parent RHA 464, showed segregation in CGT number per floret (Figure 1). Of 179 F4 plants, only 25 plants (13.9%) showed high CGT density (greater than 150 CGT per floret) and 45 plants (25.2%) showed low CGT density (less than 50 CGT per floret). The variation of CGT density from CGT counting errors was marginal since the CGT numbers from first floret and second floret were highly correlated (Figure S1). As shown in Figure 1, the frequency of CGT density in the F4 mapping population showed a moderately skewed distribution (Skewness = 0.98) toward the low CGT density parent. For this reason, we attempted non-parametric interval mapping, which is an alternative method for analyzing non-normal phenotypic data, but it exhibited less power to detect QTL in our population (Broman, 2003; Fernandes et al., 2007; data not shown). In general, the simple interval mapping method can give reasonable results if the phenotypic data are not highly skewed (Broman and Sen, 2009). We also determined statistical significance on the basis of a genome-wide permutation test, which utilizes the same empirical distribution (Table S3). For these reasons, we used the original CGT numbers to perform QTL analysis without any data transformation.
The linkage map in the QTL analysis was constructed with SNP markers generated through GBS. GBS is a high-throughput and highly cost-effective genotyping and SNP discovery approach, and has been applied to many plant species (Elshire et al., 2011; Kumar et al., 2012; Melo et al., 2016; Torkamaneh et al., 2016). To date, two GBS-SNP marker maps have been reported in sunflower (Celik et al., 2016; Talukder et al., 2016). In our study, we successfully mapped 2121 SNP markers, which is more than twice the number of unique SNP markers mapped in previous studies (Table 1). There are two possible reasons to explain why we were able to discover more unique SNP markers by GBS. First, we used a customized bioinformatic pipeline to process the GBS data. As described in our methods, we followed the GATK Best Practices to recalibrate the alignment, and used FreeBayes, a haplotype-based variant caller, to detect SNP markers (Garrison and Marth, 2012). The pipeline we used in our study could be valuable for discovering SNP markers, in addition to the TASSEL-GBS pipeline which was used in previous studies (Glaubitz et al., 2014; Celik et al., 2016; Talukder et al., 2016). Second, for our biparental mapping population, we carried out whole genome sequencing for the parent lines (HA 300 and RHA 464). These were included in the SNP marker calling and imputation steps together with their offspring, and this greatly helped to discover more reliable SNP markers. It is also a good practice to extract DNA from fresh young leaves or seedlings to obtain high quality DNA for GBS. With these modifications, we think GBS is still a good option for genotyping and SNP discovery in sunflower, especially considering the advantage of high-throughput and cost-effectiveness. The length of the genetic linkage map in this study was 1,519.52 cM, which is comparable with a sunflower consensus linkage map of 1,443.84 cM developed from three F2 mapping populations (Talukder et al., 2014), and also a recent SNP-based linkage map of 1,401.36 cM (Talukder et al., 2016). The marker density in this study (0.72 cM/SNP) is higher than the other two SNP marker maps (1.33 cM/SNP; Talukder et al., 2016; 3.03 cM/SNP; Celik et al., 2016). Despite good whole genome coverage, several regions with gaps greater than 10 cM were observed on chromosome 4, 6, and 14 (Table 1). Notably, the largest gap is 27.32 cM on chromosome 6, which is likely due to chromosome structure differences between the parents. Consequently, we were unable to detect SNP markers from these regions.
Two major QTL controlling CGT density in sunflower were identified in a biparental mapping population (Figure 3). One QTL was mapped to chromosome 5 (Ha5@14.6), which explained 11.61% of the CGT number variation. The second QTL was mapped to chromosome 6 (Ha6@60.5), which explained 14.06% of the CGT number variation (Table 2). No epistasis interaction was detected, and this suggests that these two QTL mediate CGT density in an additive manner. Similarly, two QTL associated with type VI glandular trichome density were identified in cultivated tomato, and seven QTL associated with trichome density were detected in soybean (Maliepaard et al., 1995; Du et al., 2009). These findings suggest that the trichome density in plants is regulated by multiple genes. A large portion of the CGT number variation is still unaccounted for in the current study, and this could be due to some error in genotyping, or a confluence of several genes of minor effect.
To validate the QTL results, we selected 39 sunflower inbred lines with diverse genetic backgrounds as the validation population (Figure S4). We used multiple linear regression analysis to validate the QTL-trait association and also estimate the QTL effects. A total of 385 SNP markers were selected from the 1.5-LOD support interval on chromosome 5 and 33 SNP markers were selected from the QTL support interval on chromosome 6. Based on annotated gene functions, one SNP marker pair, Ha5_10149906 and Ha6_7633946, was identified as the best two-QTL model and corresponded to genes with a plausible effect on the phenotype. This two-QTL model explained 47.2% of phenotypic variation in the validation population. Moreover, the significant associations between single SNP markers (Ha5_10149906 and Ha6_7633946) and CGT number were also supported by single factor ANOVA (Figures 4C,D). Thus, the QTL effects and the association between CGT density and QTL support intervals were confirmed in validation.
The SNP marker Ha5_10149906 is located within the second exon of the gene Ha5g003120. A 326 bp deletion was observed in the promoter region of the gene Ha5g003120 in RHA 464, and this deletion might lead to substantial change in gene expression pattern (Figure 5A). The gene Ha5g003120 is annotated as a member of heat shock transcription factor (HSF) family. Although no experimental results have shown that HSF proteins are involved in trichome development, some studies indicate these proteins are required for cell division and root growth (Westerheide et al., 2012). The SNP marker Ha6_7633946 is located in the 3′-UTR of the gene Ha6g003560 and is associated with an alternative splice site in RHA 273, due to a 51 bp deletion in the second intron. This alternative splicing alters the WRKY domain, which determines DNA-binding specificity (Llorca et al., 2014). The phylogenetic tree showed that Ha6g003560 is grouped together with Arabidopsis thaliana gene WRKY44/TTG2. WRKY44/TTG2 is a key regulatory gene for trichome development, and a mutation in WRKY44/TTG2 causes significantly reduced trichome number and unbranched trichomes in Arabidopsis (Johnson, 2002; Ishida et al., 2007; Pesch et al., 2014). Taken together, the HSF gene Ha5g003120 and WRKY gene Ha6g003560 are strong candidates for regulating CGT density in sunflower florets. It is possible, however, that other adjacent genes are contributing to the phenotype. Further studies are required to characterize gene functions in these regions.
In summary, we successfully detected SNP markers by GBS and constructed a genetic linkage map with these SNP markers. We also identified two major QTL controlling CGT density in sunflower florets by using the F4 population derived from the cross HA 300 × RHA 464. In addition, we found two plausible candidate genes in the QTL support intervals. Future work will focus on optimizing STL chemical composition in CGT for enhancing host resistance to sunflower insect pests.
Author note
Mention of trade names or commercial products in this report is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the US Department of Agriculture. The USDA is an equal opportunity provider and employer.
Author contributions
Q-MG analyzed the data and wrote the paper. NK, BH, and JP designed and led the genomics analyses, population development and genetics analyses, and trait evaluations, respectively. SR conducted phylogenetic analyses of the candidate genes. CP and ST conducted GBS and whole genome shotgun sequencing and bioinformatic analyses. All authors have read and agree with the contents of the manuscript.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors would like to acknowledge Jayma Moore and Scott Payne at NDSU Electron Microscopy Center for their help in obtaining SEM images of florets, and Jamie Miller-Dunbar for help in quantifying CGT. We sincerely thank Brady Koehler, Mike Grove and Brian Smart for collecting DNA samples, greenhouse operations and support in the laboratory. We thank Brian Smart and Dr. Gerald Seiler for critically reviewing the manuscript. We also thank Jamie Prior Kershner at the BioFrontiers NGS Core Facility for assistance with Illumina sequencing.
Footnotes
Funding. National Sunflower Association, award # 14-E03.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2017.02227/full#supplementary-material
References
- Agren J., Schemske D. (1992). Artificial selection on trichome number in Brassica rapa. Theor. Appl. Genet. 83, 673–678. 10.1007/BF00226683 [DOI] [PubMed] [Google Scholar]
- Amrehn E., Aschenbrenner A. K., Heller A., Spring O. (2015). Localization of sesquiterpene lactone biosynthesis in cells of capitate glandular trichomes of Helianthus annuus (Asteraceae). Protoplasma 253, 447–455. 10.1007/s00709-015-0823-4 [DOI] [PubMed] [Google Scholar]
- Amrehn E., Heller A., Spring O. (2013). Capitate glandular trichomes of Helianthus annuus (Asteraceae): ultrastructure and cytological development. Protoplasma 251, 161–167. 10.1007/s00709-013-0534-7 [DOI] [PubMed] [Google Scholar]
- Andrade M. C., Silva A. A. D., Neiva I. P., Oliveira I. R. C., Castro E. M. D., Francis D. M., et al. (2017). Inheritance of type IV glandular trichome density and its association with whitefly resistance from Solanum galapagense accession LA1401. Euphytica 213:52 10.1007/s10681-016-1792-1 [DOI] [Google Scholar]
- Andrews S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- Aschenbrenner A. K., Amrehn E., Bechtel L., Spring O. (2014). Trichome differentiation on leaf primordia of Helianthus annuus (Asteraceae): morphology, gene expression and metabolite profile. Planta 241, 837–846. 10.1007/s00425-014-2223-y [DOI] [PubMed] [Google Scholar]
- Aschenbrenner A. K., Horakh S., Spring O. (2013). Linear glandular trichomes of Helianthus (Asteraceae): morphology, localization, metabolite activity and occurrence. AoB Plants 5:plt028 10.1093/aobpla/plt028 [DOI] [Google Scholar]
- Aschenbrenner A.-K., Kwon M., Conrad J., Ro D.-K., Spring O. (2016). Identification and characterization of two bisabolene synthases from linear glandular trichomes of sunflower (Helianthus annuus L., Asteraceae). Phytochemistry 124, 29–37. 10.1016/j.phytochem.2016.01.009 [DOI] [PubMed] [Google Scholar]
- Benjamini Y., Yekutieli D. (2001). The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29, 1165–1188. 10.1214/aos/1013699998 [DOI] [Google Scholar]
- Benz B. W., Martin C. E. (2006). Foliar trichomes, boundary layers, and gas exchange in 12 species of epiphytic Tillandsia (Bromeliaceae). J. Plant Physiol. 163, 648–656. 10.1016/j.jplph.2005.05.008 [DOI] [PubMed] [Google Scholar]
- Bolger A. M., Lohse M., Usadel B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bombo A. B., Appezzato-Da-Glória B., Aschenbrenner A. K., Spring O. (2016). Capitate glandular trichomes in Aldama discolor (Heliantheae - Asteraceae): morphology, metabolite profile and sesquiterpene biosynthesis. Plant Biol. 18, 455–462. 10.1111/plb.12423 [DOI] [PubMed] [Google Scholar]
- Broman K. W. (2003). Mapping quantitative trait loci in the case of a spike in the phenotype distribution. Genetics 163, 1169–1175. Available online at: http://www.genetics.org/content/163/3/1169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Broman K. W., Sen S. (2009). A Guide to QTL Mapping with R/qtl. Dordrecht: Springer. [Google Scholar]
- Broman K. W., Wu H., Sen S., Churchill G. A. (2003). R/qtl: QTL mapping in experimental crosses. Bioinformatics 19, 889–890. 10.1093/bioinformatics/btg112 [DOI] [PubMed] [Google Scholar]
- Catchen J., Hohenlohe P. A., Bassham S., Amores A., Cresko W. A. (2013). Stacks: an analysis tool set for population genomics. Mol. Ecol. 22, 3124–3140. 10.1111/mec.12354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Celik I., Bodur S., Frary A., Doganlar S. (2016). Genome-wide SNP discovery and genetic linkage map construction in sunflower (Helianthus annuus L.) using a genotyping by sequencing (GBS) approach. Mol. Breed. 36, 133–142. 10.1007/s11032-016-0558-8 [DOI] [Google Scholar]
- Chadwick M., Trewin H., Gawthrop F., Wagstaff C. (2013). Sesquiterpenoids lactones: Benefits to plants and people. Int. J. Mol. Sci. 14, 12780–12805. 10.3390/ijms140612780 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Covello P. S., Teoh K. H., Polichuk D. R., Reed D. W., Nowak G. (2007). Functional genomics and the biosynthesis of artemisinin. Phytochemistry 68, 1864–1871. 10.1016/j.phytochem.2007.02.016 [DOI] [PubMed] [Google Scholar]
- Cui H., Zhang S. T., Yang H. J., Ji H., Wang X. J. (2011). Gene expression profile analysis of tobacco leaf trichomes. BMC Plant Biol. 11:76. 10.1186/1471-2229-11-76 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Danecek P., Auton A., Abecasis G., Albers C. A., Banks E., Depristo M. A., et al. (2011). The variant call format and VCFtools. Bioinformatics 27, 2156–2158. 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Danielson S. D., Manglitz G. R., Sorensen E. L. (1990). Field resistance of perennial glandular-haired Medicago strains and alfalfa cultivars to the alfalfa weevil. Euphytica 49, 177–182. 10.1007/BF00027268 [DOI] [Google Scholar]
- Du W. J., Yu D. Y., Fu S. X. (2009). Analysis of QTLs for the trichome density on the upper and downer surface of leaf blade in soybean [Glycine max (L.) Merr.]. Agric. Sci. China 8, 529–537. 10.1016/S1671-2927(08)60243-6 [DOI] [Google Scholar]
- Duarte M. d. R., Empinotti C. B. (2012). Leaf and stem microscopic identification of Tithonia diversifolia (Hemsl.) A. Gray (Asteraceae). Brazilian J. Pharm. Sci. 48, 109–116. 10.1590/S1984-82502012000100013 [DOI] [Google Scholar]
- Eljounaidi K., Cankar K., Comino C., Moglia A., Hehn A., Bourgaud F., et al. (2014). Cytochrome P450s from Cynara cardunculus L. CYP71AV9 and CYP71BL5, catalyze distinct hydroxylations in the sesquiterpene lactone biosynthetic pathway. Plant Sci. 223, 59–68. 10.1016/j.plantsci.2014.03.007 [DOI] [PubMed] [Google Scholar]
- Elshire R. J., Glaubitz J. C., Sun Q., Poland J. A., Kawamoto K., Buckler E. S., et al. (2011). A robust, simple Genotyping-by-Sequencing (GBS) approach for high diversity species. PLoS ONE 6:e19379. 10.1371/journal.pone.0019379 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fernandes E., Pacheco A., Penha-Gonçalves C. (2007). Mapping of quantitative trait loci using the skew-normal distribution. J. Zhejiang Univ. Sci. B 8, 792–801. 10.1631/jzus.2007.B0792 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fraga B. M. (2005). Natural sesquiterpenoids. Nat. Prod. Rep. 22, 465. 10.1039/b501837b [DOI] [PubMed] [Google Scholar]
- Garrison E., Marth G. (2012). Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv:1207.3907 [q-bio.GN] [Google Scholar]
- Ghantous A., Gali-Muhtasib H., Vuorela H., Saliba N. A., Darwiche N. (2010). What made sesquiterpene lactones reach cancer clinical trials? Drug Discov. Today 15, 668–678. 10.1016/j.drudis.2010.06.002 [DOI] [PubMed] [Google Scholar]
- Glas J., Schimmel B., Alba J., Escobar-Bravo R., Schuurink R., Kant M. (2012). Plant glandular trichomes as targets for breeding or engineering of resistance to herbivores. Int. J. Mol. Sci. 13, 17077–17103. 10.3390/ijms131217077 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glaubitz J. C., Casstevens T. M., Lu F., Harriman J., Elshire R. J., Sun Q., et al. (2014). TASSEL-GBS: A high capacity genotyping by sequencing analysis pipeline. PLoS ONE 9:e90346. 10.1371/journal.pone.0090346 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Göpfert J. C., Heil N., Conrad J., Spring O. (2005). Cytological development and sesquiterpene lactone secretion in capitate glandular trichomes of Sunflower. Plant Biol. 7, 148–155. 10.1055/s-2005-837575 [DOI] [PubMed] [Google Scholar]
- Göpfert J. C., Macnevin G., Ro D.-K., Spring O. (2009). Identification, functional characterization and developmental regulation of sesquiterpene synthases from sunflower capitate glandular trichomes. BMC Plant Biol. 9:86. 10.1186/1471-2229-9-86 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guindon S., Dufayard J. F., Lefort V., Anisimova M., Hordijk W., Gascuel O. (2010). New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321. 10.1093/sysbio/syq010 [DOI] [PubMed] [Google Scholar]
- Han M. V., Zmasek C. M. (2009). phyloXML: XML for evolutionary biology and comparative genomics. BMC Bioinformatics 10:356. 10.1186/1471-2105-10-356 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hare J. D., Elle E. (2002). Variable impact of diverse insect herbivores on dimorphic Datura wrightii. Ecology 83, 2711 10.1890/0012-9658(2002)083[2711:VIODIH]2.0.CO;2 [DOI] [Google Scholar]
- Howe G. A., Jander G. (2008). Plant immunity to insect herbivores. Annu. Rev. Plant Biol. 59, 41–66. 10.1146/annurev.arplant.59.032607.092825 [DOI] [PubMed] [Google Scholar]
- Hulke B. S., Miller J. F., Gulya T. J. (2010). Registration of the restorer oilseed sunflower germplasm RHA 464 possessing genes for resistance to downy mildew and sunflower rust. J. Plant Reg. 4, 249–254. 10.3198/jpr2010.02.0114crg [DOI] [Google Scholar]
- INRA Sunflower Bioinformatics Resources (2014). HA412-HO Genome Portal (Bronze version). Available online at: https://www.heliagene.org/HA412.v1.1.bronze.20141015/
- Ishida T., Hattori S., Sano R., Inoue K., Shirano Y., Hayashi H., et al. (2007). Arabidopsis TRANSPARENT TESTA GLABRA2 is directly regulated by R2R3 MYB transcription factors and is involved in regulation of GLABRA2 transcription in epidermal differentiation. Plant Cell Online 19, 2531–2543. 10.1105/tpc.107.052274 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Isman M. B. (1985). Toxicity and tolerance of sesquiterpene lactones in the migratory grasshopper, Melanoplus sanguinipes (Acrididae). Pestic. Biochem. Physiol. 24, 348–354. 10.1016/0048-3575(85)90146-4 [DOI] [Google Scholar]
- Johnson C. S. (2002). TRANSPARENT TESTA GLABRA2, a trichome and seed coat development gene of arabidopsis, encodes a WRKY transcription factor. Plant Cell Online 14, 1359–1375. 10.1105/tpc.001404 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kane N. (2017). Tri Imputation Tools for GBS Data: Phaser2.pl and genotyper7.pl. Available online at: https://github.com/KaneLab/Ref_guided_genotyping
- Katoh K. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30, 3059–3066. 10.1093/nar/gkf436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kitch L. W., Shade R. E., Nyquist W. E., Axtell J. D. (1985). Inheritance of density of erect glandular trichomes in the Genus Medicago. Crop Sci. 25:607 10.2135/cropsci1985.0011183X002500040007x [DOI] [Google Scholar]
- Krimmel B. A., Pearse I. S. (2013). Sticky plant traps insects to enhance indirect defence. Ecol. Lett. 16, 219–224. 10.1111/ele.12032 [DOI] [PubMed] [Google Scholar]
- Kumar S., Banks T. W., Cloutier S. (2012). SNP discovery through Next-Generation Sequencing and its applications. Int. J. Plant Genomics 2012, 1–15. 10.1155/2012/831460 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Larkin J. C. (1994). Roles of the GLABROUS1 and TRANSPARENT TESTA GLABRA genes in Arabidopsis trichome development. Plant Cell 6, 1065–1076. 10.1105/tpc.6.8.1065 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee T. H., Guo H., Wang X., Kim C., Paterson A. H. (2014). SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data. BMC Genomics 15:162. 10.1186/1471-2164-15-162 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lefort V., Longueville J. E., Gascuel O. (2017). SMS: smart model selection in PhyML. Mol. Biol. Evol. 34, 2422–2424. 10.1093/molbev/msx149 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H., Durbin R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li-Weber M., Giaisi M., Treiber M. K., Krammer P. H. (2002). The anti-inflammatory sesquiterpene lactone parthenolide suppresses CD95 gene expression in peripheral blood T cells. Cell Death Differ. 9, 1256–1265. 10.1038/sj.cdd.4401102 [DOI] [PubMed] [Google Scholar]
- Llorca C. M., Potschin M., Zentgraf U. (2014). bZIPs and WRKYs: two large transcription factor families executing two different functional strategies. Front. Plant Sci. 5:169. 10.3389/fpls.2014.00169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lusa M. G., Cardoso E. C., Machado S. R., Appezzato-Da-Gloria B. (2014). Trichomes related to an unusual method of water retention and protection of the stem apex in an arid zone perennial species. AoB Plants 7:plu088. 10.1093/aobpla/plu088 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lv Z., Zhang L., Tang K. (2017). New insights into artemisinin regulation. Plant Signal. Behav. 12:e1366398. 10.1080/15592324.2017.1366398 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mabry T. J., Gill J. E., Burnett W. C., Jones S. B. (1977). Antifeedant sesquiterpene lactones in the Compositae, in ACS Symposium Series Host Plant Resistance to Pests (Washington, DC: American Chemical Society; ), 179–184. [Google Scholar]
- Maliepaard C., Bas N., Heusden S. V., Kos J., Pet G., Verkerk R., et al. (1995). Mapping of QTLs for glandular trichome densities and Trialeurodes vaporariorum (greenhouse whitefly) resistance in an F2 from Lycopersicon esculentum × Lycopersicon hirsutum f. glabratum. Heredity 75, 425–433. 10.1038/hdy.1995.155 [DOI] [Google Scholar]
- Mandel J. R., Nambeesan S., Bowers J. E., Marek L. F., Ebert D., Rieseberg L. H., et al. (2013). Association mapping and the genomic consequences of selection in sunflower. PLoS Genet. 9:e1003378. 10.1371/journal.pgen.1003378 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Melo A. T. O., Bartaula R., Hale I. (2016). GBS-SNP-CROP: a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data. BMC Bioinformatics 17:29. 10.1186/s12859-016-0879-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Menin B., Comino C., Portis E., Moglia A., Cankar K., Bouwmeester H. J., et al. (2012). Genetic mapping and characterization of the globe artichoke (+)-germacrene A synthase gene, encoding the first dedicated enzyme for biosynthesis of the bitter sesquiterpene lactone cynaropicrin. Plant Sci. 190, 1–8. 10.1016/j.plantsci.2012.03.006 [DOI] [PubMed] [Google Scholar]
- Meyer M., Kircher M. (2010). Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 2010:pdb.prot5448. 10.1101/pdb.prot5448 [DOI] [PubMed] [Google Scholar]
- Michalska K., Kisiel W., Zidorn C. (2013). Sesquiterpene lactones from Crepis aurea (Asteraceae, Cichorieae). Biochem. Syst. Ecol. 46, 1–3. 10.1016/j.bse.2012.08.005 [DOI] [Google Scholar]
- Moraes C. M. D., Lewis W. J., Paré P. W., Alborn H. T., Tumlinson J. H. (1998). Herbivore-infested plants selectively attract parasitoids. Nature 393, 570–573. 10.1038/31219 [DOI] [Google Scholar]
- Nasim S., Pei S., Hagen F. K., Jordan C. T., Crooks P. A. (2011). Melampomagnolide B: a new antileukemic sesquiterpene. Bioorganic Med. Chem. 19, 1515–1519. 10.1016/j.bmc.2010.12.045 [DOI] [PubMed] [Google Scholar]
- Okonechnikov K., Conesa A., García-Alcalde F. (2015). Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics 32, 292–294. 10.1093/bioinformatics/btv566 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oppenheimer D. G., Herman P. L., Sivakumaran S., Esch J., Marks M. (1991). A myb gene required for leaf trichome differentiation in Arabidopsis is expressed in stipules. Cell 67, 483–493. 10.1016/0092-8674(91)90523-2 [DOI] [PubMed] [Google Scholar]
- Paré P. W., Tumlinson J. H. (1999). Plant volatiles as a defense against insect herbivores. Plant Physiol. 121, 325–332. 10.1104/pp.121.2.325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pattanaik S., Patra B., Singh S. K., Yuan L. (2014). An overview of the gene regulatory network controlling trichome development in the model plant, Arabidopsis. Front. Plant Sci. 5:259. 10.3389/fpls.2014.00259 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peiffer M., Tooker J. F., Luthe D. S., Felton G. W. (2009). Plants on early alert: glandular trichomes as sensors for insect herbivores. New Phytologist 184, 644–656. 10.1111/j.1469-8137.2009.03002.x [DOI] [PubMed] [Google Scholar]
- Pesch M., Dartan B., Birkenbihl R., Somssich I. E., Hulskamp M. (2014). Arabidopsis TTG2 regulates TRY expression through enhancement of activator complex-triggered activation. Plant Cell 26, 4067–4083. 10.1105/tpc.114.129379 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Phukan U. J., Jeena G. S., Shukla R. K. (2016). WRKY transcription factors: molecular regulation and stress responses in plants. Front. Plant Sci. 7:760. 10.3389/fpls.2016.00760 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pillemer E. A., Tingey W. M. (1976). Hooked trichomes: a physical plant barrier to a major agricultural pest. Science 193, 482–484. 10.1126/science.193.4252.482 [DOI] [PubMed] [Google Scholar]
- Prasifka J. R. (2015). Variation in the number of capitate glandular trichomes in wild and cultivated sunflower germplasm and its potential for use in host plant resistance. Plant Genet. Resour. 13, 68–74. 10.1017/S1479262114000653 [DOI] [Google Scholar]
- Prasifka J. R., Spring O., Conrad J., Cook L. W., Palmquist D. E., Foley M. E. (2015). Sesquiterpene lactone composition of wild and cultivated sunflowers and biological activity against an insect pest. J. Agric. Food Chem. 63, 4042–4049. 10.1021/acs.jafc.5b00362 [DOI] [PubMed] [Google Scholar]
- Rautio P., Markkola A., Martel J., Tuomi J., Harma E., Kuikka K., et al. (2002). Developmental plasticity in birch leaves: defoliation causes a shift from glandular to nonglandular trichomes. Oikos 98, 437–446. 10.1034/j.1600-0706.2002.980308.x [DOI] [Google Scholar]
- Reis C. D., Sajo M. D. G., Stehmann J. R. (2002). Leaf structure and taxonomy of Petunia and Calibrachoa (Solanaceae). Braz. Archi. Biol. Technol. 45, 59–66. 10.1590/S1516-89132002000100010 [DOI] [Google Scholar]
- Riddick E. W., Wu Z. (2011). Lima bean–lady beetle interactions: hooked trichomes affect survival of Stethorus punctillum larvae. BioControl 56, 55–63. 10.1007/s10526-010-9309-7 [DOI] [Google Scholar]
- Rogers C. E., Gershenzon J., Ohno N., Mabry T. J., Stipanovic R. D., Kreitner G. L. (1987). Terpenes of wild sunflowers (Helianthus): an effective mechanism against seed predation by larvae of the sunflower moth, Homoeosoma electellum (Lepidoptera: Pyralidae). Environ. Entomol. 16, 586–592. 10.1093/ee/16.3.586 [DOI] [Google Scholar]
- Rossiter M., Gershenzon J., Mabry T. J. (1986). Behavioral and growth responses of specialist herbivore, Homoeosoma electellum, to major terpenoid of its host, Helianthus SPP. J. Chem. Ecol. 12, 1505–1521. 10.1007/BF01012368 [DOI] [PubMed] [Google Scholar]
- Rowe H. C., Ro D. K., Rieseberg L. H. (2012). Response of sunflower (Helianthus annuus L.) leaf surface defenses to exogenous methyl jasmonate. PLoS ONE 7:e37191. 10.1371/journal.pone.0037191 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Serna L., Martin C. (2006). Trichomes: different regulatory networks lead to convergent structures. Trends Plant Sci. 11, 274–280. 10.1016/j.tplants.2006.04.008 [DOI] [PubMed] [Google Scholar]
- Shapiro S. S., Wilk M. B. (1965). An analysis of variance test for normality (complete samples). Biometrika 52, 591–611. 10.1093/biomet/52.3-4.591 [DOI] [Google Scholar]
- Sigrist C. J. A., Castro E. D., Cerutti L., Cuche B. A., Hulo N., Bridge A., et al. (2012). New and continuing developments at PROSITE. Nucleic Acids Res. 41, D344–D347. 10.1093/nar/gks1067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simmons A. T., Gurr G. M. (2005). Trichomes of lycopersicon species and their hybrids: effects on pests and natural enemies. Agric. For. Entomol. 7, 265–276. 10.1111/j.1461-9555.2005.00271.x [DOI] [Google Scholar]
- Skaltsa H., Verykokidou E., Harvala C., Karabourniotis G., Manetasi Y. (1994). UV-B protective potential and flavonoid content of leaf hairs of Quercus ilex. Phytochemistry 37, 987–990. 10.1016/S0031-9422(00)89514-X [DOI] [Google Scholar]
- Spring O. (2000). Chemotaxonomy based on metabolites from glandular trichomes. Adv. Bot. Res. 31, 153–174. 10.1016/S0065-2296(00)31009-6 [DOI] [Google Scholar]
- Spring O., Bienert U. (1987). Capitate glandular hairs from sunflower leaves: development, distribution and sesquiterpene lactone content. J. Plant Physiol. 130, 441–448. 10.1016/S0176-1617(87)80209-2 [DOI] [Google Scholar]
- Szymanski D. B., Lloyd A. M., Marks M. (2000). Progress in the molecular genetic analysis of trichome initiation and morphogenesis in Arabidopsis. Trends Plant Sci. 5, 214–219. 10.1016/S1360-1385(00)01597-1 [DOI] [PubMed] [Google Scholar]
- Talukder Z. I., Gong L., Hulke B. S., Pegadaraju V., Song Q., Schultz Q., et al. (2014). A high-density SNP map of sunflower derived from RAD-sequencing facilitating fine-mapping of the rust resistance gene R12. PLoS ONE 9:e98628. 10.1371/journal.pone.0098628 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Talukder Z. I., Seiler G. J., Song Q., Ma G., Qi L. (2016). SNP discovery and QTL mapping of Sclerotinia basal stalk rot resistance in Sunflower using Genotyping-by-Sequencing. Plant Genome 9, 1–16. 10.3835/plantgenome2016.03.0035 [DOI] [PubMed] [Google Scholar]
- Tian D., Tooker J., Peiffer M., Chung S. H., Felton G. W. (2012). Role of trichomes in defense against herbivores: comparison of herbivore response to woolly and hairless trichome mutants in tomato (Solanum lycopersicum). Planta 236, 1053–1066. 10.1007/s00425-012-1651-9 [DOI] [PubMed] [Google Scholar]
- Tissier A. (2012). Glandular trichomes: what comes after expressed sequence tags? Plant J. 70, 51–68. 10.1111/j.1365-313x.2012.04913.x [DOI] [PubMed] [Google Scholar]
- Torkamaneh D., Laroche J., Belzile F. (2016). Genome-Wide SNP Calling from genotyping by sequencing (GBS) data: a comparison of seven pipelines and two sequencing technologies. PLoS ONE 11:e0161333. 10.1371/journal.pone.0161333 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wagner G. J. (1991). Secreting glandular trichomes: more than just hairs. Plant Physiol. 96, 675–679. 10.1104/pp.96.3.675 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang W., Wang Y., Zhang Q., Qi Y., Guo D. (2009). Global characterization of Artemisia annua glandular trichome transcriptome using 454 pyrosequencing. BMC Genomics 10:465. 10.1186/1471-2164-10-465 [DOI] [PMC free article] [PubMed] [Google Scholar]
- War A. R., Paulraj M. G., Ahmad T., Buhroo A. A., Hussain B., Ignacimuthu S., et al. (2012). Mechanisms of plant defense against insect herbivores. Plant Signal. Behav. 7, 1306–1320. 10.4161/psb.21663 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Werker E. (2000). Trichome diversity and development. Adv. Bot. Res. 31, 1–35. 10.1016/S0065-2296(00)31005-9 [DOI] [Google Scholar]
- Westerheide S. D., Raynes R., Powell C., Xue B., Uversky V. N. (2012). HSF transcription factor family, heat shock response, and protein intrinsic disorder. Curr. Protein Pept. Sci. 13, 86–103. 10.2174/138920312799277956 [DOI] [PubMed] [Google Scholar]
- Yang C., Ye Z. (2012). Trichomes as models for studying plant cell differentiation. Cell. Mol. Life Sci. 70, 1937–1948. 10.1007/s00018-012-1147-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zidorn C. (2008). Sesquiterpene lactones and their precursors as chemosystematic markers in the tribe Cichorieae of the Asteraceae. Phytochemistry 69, 2270–2296. 10.1016/j.phytochem.2008.06.013 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.