Skip to main content
Cannabis and Cannabinoid Research logoLink to Cannabis and Cannabinoid Research
. 2020 Feb 27;5(1):105–116. doi: 10.1089/can.2018.0015

Cannabinoid Inheritance Relies on Complex Genetic Architecture

Lesley G Campbell 1,*, Jaimie Dufresne 1, Sarah A Sabatinos 1
PMCID: PMC7173683  PMID: 32322682

Abstract

Introduction: Understanding the inheritance of cannabinoid compounds in Cannabis sativa will facilitate effective crop breeding and careful regulation of controlled substances. The production of two key cannabinoids, Δ-9-tetrahydrocannabinol (THC) and cannabidiol (CBD), is partially controlled by two additive loci. Here, we present the first study to search for evidence of alternate genetic models describing the inheritance and expression of cannabinoids.

Materials and Methods: Using an information-theoretic approach, we estimated composite genetic effects (CGEs) of four cultivars with pure CBD or pure THC chemotypes, their F1 and F2 hybrid progeny, to identify genetic models that explain cannabinoid inheritance patterns. We also estimated the effective number of genetic factors that control differences in cannabinoid concentration (THC, CBD, and cannabichromene [CBC]).

Results: Unlike previous research, we note nonadditive components of cannabinoid inheritance. Concentration of THC is a polygenic trait (three to four genetic factors). Both additive and dominance CGEs best explained THC expression patterns. In contrast, cytoplasmic genomes and additive genes may influence CBD concentration. Maternal additive effects and additive genetic effects apparently influence CBC expression.

Conclusions: Cannabinoid inheritance is more complex than previously appreciated; among other genetic effects, cytogenetic and maternal contributions may be undervalued influences on cannabinoid ratios and concentrations. Further research on the environmental sensitivity of cannabinoid production is advised.

Keywords: composite genetic effects, genetic architecture, hemp, joint-scaling test, line cross analysis, marijuana

Introduction

Selection on chemical traits plays a strong role in crop domestication, which is often characterized by recessive alleles at few loci1 (e.g., Cannabis sativa,2–5 Capsicum annuum,6 Coffea,7 Nicotiana tabacum,8 and Papaver somniferum9,10). Yet, understanding the genetic basis of chemical differences between crop and wild relatives is in its infancy. Cannabis sativa L. is a fascinating example of adaptive chemical divergence, producing more than 100 terpenophenolic secondary metabolites called cannabinoids.11–14 Although populations vary in cannabinoid composition, the most common cannabinoids include cannabidiol (CBD15,16), Δ-9-tetrahydrocannabinol (THC17), cannabigerol (CBG18), and cannabichromene (CBC19). Breeding efforts have largely focused on modifying the production of THC and CBD.20–23

Molecular data suggest that the stunning chemical diversification in cannabis cultivars is the result of recent breeding efforts.5,24,25 Cultivars of C. sativa are typically characterized based on both the ratio of THC:CBD and the overall abundance of these cannabinoids. Generally, psychoactive cultivars have high THC:CBD ratios and produce high quantities of cannabinoids. Fiber or oilseed cultivars (hemp) are characterized as low THC:CBD, and generally produce lower quantities of cannabinoids. Here, we ask: what genetic models describe cannabinoid expression within plants, and how many genes underlie cannabinoid differences between C. sativa cultivars?

In 2003, de Meijer et al.2 proposed a genetic model to explain THC and CBD production in C. sativa populations. Cannabinoid yield per crop area was described as a complex trait that was the product of total above-ground biomass, the fraction of biomass composed of inflorescence, total cannabinoid content, and cannabinoid purity (equation 1 in Ref.2). To start acquiring data to test this model, de Meijer et al.2 elucidated the inheritance of THC:CBD ratios. Since THC and CBD production is influenced by the presence of enzymes that catalyze deacidification, the enzymes themselves seem to be controlled by simple Mendelian additivity at a synthase locus.2 Individuals possessing two BT alleles result in THC synthase (and THC) production. Individuals possessing two BD alleles result in CBD synthase (and CBD) production. Individuals with one BT and BD allele produce both THC and CBD. This hypothesis was supported by sequencing of THC synthase and CBD synthase genes, which exhibit 89% genetic similarity.26,27 Furthermore, de Meijer et al.28 proposed that a homozygous genotype possessing two loss-of-function alleles, B0, at the same synthase locus controlled the accumulation of the precursor CBG in adult plants. Finally, the expression of CBC was found to be environmentally sensitive to light and its expression did not fit an additive or dominance genetic model.4 Thus, de Meijer et al.4,28 described the inheritance of THC:CBD ratios and CBC production, and proposed a mathematical model describing cannabinoid abundance as a potentially more complex quantitative trait. Although de Meijer et al.4,28 examined the inheritance patterns of cannabinoids, the only genetic model tested was a simple additive model, whereas more complex genetic effects remained unexamined. Moreover, the de Meijer et al. model remained largely unexamined until the recent research by Weiblen et al.5 which identified two genes controlling the ratio of THC:CBD and a third gene controlling the abundance of cannabinoids, still relying on a relatively simple model of cannabinoid inheritance.

Although cannabinoid inheritance has only been tested against relatively simple additive models,2,4,5,28,29 both published data and anecdotal evidence suggest that more complex genetic models are necessary. Growers repeatedly describe differences among clones (suggesting environmental effects) and mothers (suggesting maternal effects). Furthermore, genetically uniform F1 offspring of parents homozygous for THC and CBD synthase alleles exhibited significant amounts of phenotypic variation (e.g., THC:CBD range: 0.91–1.48, figure 2 in Ref.2), suggesting a strong influence of environmental variation on cannabinoid expression profiles. Furthermore, THC and CBD expression patterns are reliant on multiple linked loci on chromosome 6 and gene duplication to produce tetrahydrocannabinolic acid (THCA) and cannabidiolic acid (CBDA) synthase sequences.5 In addition, selection on hemp cultivars for low THC concentrations, to meet government regulations, has failed to completely remove the expression of THC in most hemp cultivars.30 All this evidence suggests more complex inheritance patterns of several cannabinoids than could be described by simple additive or dominance genetic models.

FIG. 2.

FIG. 2.

Results from replicate analyses of the genetic architecture of CBD concentration in Cannabis sativa. (A, C) Traditional line cross analysis plot of CBD concentration with respect to proportion of the genome inherited from THC-rich parent (P1). If CBD concentration was inherited in a Mendelian manner, the points should fall on the dotted line and the F1 and F2 data points should fall directly on top of each other. (B, D) Model-weighted parameter estimates for the genetic architecture underlying CBD concentration in C. sativa. Points are sized based on vi scores and indicate the magnitude of the genetic effects. Whiskers indicate the unconditional SEs. Only parameter estimate >0, where vi > 0.01 and wi > 0.001, was included. The genetic effects axis describes the weighted parameter estimates calculated by SAGA in R.31 Data presented in (A) and (B) are from de Meijer et al.,2 whereas data presented in (C) and (D) are from Staginnus et al.29

To predict the pace and direction of cannabinoid evolution in response to artificial selection, the genetic architecture that underlies variation in cannabinoid production must be understood. Two original studies2,29 compared chemotype data to either a single- or two-locus model with multiple alleles using chi-square tests to assess model fit, but did not consider alternative models. We use an information-theoretic (I-T) approach developed by Blackmon and Demuth31 to estimate the composite genetic effects (CGEs) contributing to variation in cannabinoid production among divergent psychoactive and fiber/oilseed cultivars, and their F1 and F2 offspring. In addition, to estimate the number of genes that drive the expression of THC:CBD chemotype and variation in the absolute production of CBD, THC, and CBC, we used the Castle–Wright estimator. Given that the studied lineages were a product of inbreeding and selection,32 we expected to confirm the previous findings that the difference in cannabinoid production among the cultivars examined was governed by few loci with mostly additive effects. However, the use of an I-T approach31 revealed several cannabinoids to be the product of multigene complexes and illuminated additional genetic effects contributing to cultivar differences that were undetected using previous approaches.

Materials and Methods

Parental and offspring lineages

We used two independent data sets with known pedigrees to more thoroughly assess the genetic architecture of cannabinoid inheritance (raw data sets were provided by the authors of Refs.2,29). Detailed methods for establishing the parental and offspring lineages are described elsewhere2,29 and summarized here (summarized data in Appendix Table A1). We encourage readers interested in reanalysis to contact the original authors2,29 who very generously shared the raw data.

In 2003,2 four isolated crosses between inbred female and sex-reversed male plants with contrasting cannabinoid profiles were established from genotypes used at HortaPharm in 1998–2000. Resulting F1 offspring were grown to maturity. A subset of F1 females were sex reversed to act as pollen parents and produce the F2 generation. Resulting F2 plants were grown to maturity. Each generation was raised under similar greenhouse conditions and strict isolation.

In 2014,29 one chemically produced male (from a THC-predominant clone) pollinated two inbred female plants (from two CBD-predominant clones). To produce F2 populations, F1 female plants were crossed with sex-reversed F1 females. Then, 230 F2 seedlings were grown in a greenhouse and flowering was induced through photoperiod adjustments.

Chemical assessment

Sample extraction from mature inflorescences of every plant for gas chromatography (GC) analysis was conducted.2,29 Both studies used the same equipment and the results are directly comparable; however, there were differences in how cannabinoids were identified and quantified (described in those publications). Briefly, the two studies used compared GC sample peaks with pure standard peaks to identify compounds. The GC peak areas were converted into dry weight concentrations using a linear calibration curve from CBD standards. The cannabinoid composition of THC and CBD fractions was reported as the weight proportion (%) of the individual cannabinoids in the total cannabinoid fraction.2 The cannabinoid composition was expressed as the proportion of the weight of individual cannabinoids in the total cannabinoid fraction (%). We also analyzed the inheritance patterns of the ratio of THC:CBD. CBC concentrations were also measured in a similar manner.2

Analysis

With these data, we estimated the expected mean phenotype of each parental lineage (PTHC, PCBD), along with F1 and F2 populations and used this information to distinguish among seven CGEs (Appendix Table A2).33 One strategy, although not used in any previous research on cannabinoid inheritance,2,4,5,28,29 to distinguish among genetic models is line cross analysis (LCA) where genetic models are compared using estimates of mean and standard deviation from a few line crosses (e.g., parental, F1, F2 populations). However, joint scaling tests (one method of performing LCA) favor simpler models of genetic architecture and do not allow for uncertainty in genetic model selection.31 Simple genetic models may suffer from bias against more complex genetic architecture and may overlook alternative explanations for patterns of phenotypic expression.34 When assessing the importance of epistatic or environmental effects, the Akaike information criterion (AIC) approach to conducting LCA appears to reduce bias.

Thus, we estimated the relative contribution of genetic effects (Appendix Table A2) on the inheritance of cannabinoid production. Using an I-T approach, we compared expected and observed mean cannabinoid concentration and standard error (SE) of the parental populations (P1, P2) and two subsequent generations (F1, F2). All analyses were computed using the software package SAGA v.2.0 (R 3.2.3; R Foundation for Statistical Computing 2015; https://github.com/coleoguy/SAGA; Appendix 3).

We first developed C-matrices for each trait to describe the potential contribution of CGEs to F0, F1, and F2 cohort means. We used a C-matrix that was scaled to the midparent mean and included 7 of the 23 CGEs available in SAGA (Appendix Table A2), thus presenting the first study of cannabinoid inheritance with lowered risk that results are biased by experimenter selection of the CGEs tested. For each CGE, we have calculated coefficients for four potential crosses, considering only mixed sex cohorts.

SAGA reduced the C-matrix to include only the CGEs that can be partitioned with the four available cohorts (P1, P2, F1, F2). Then, SAGA generated all possible models (maximum number of models=number of cohorts minus 2df) and performed a weighted least-squares regression.31 SAGA produced parameter (±SE) estimates along with AICC scores to adjust for model complexity, goodness of fit, and sample size.35 This approach (formerly termed the joint-scaling test) allowed us to distinguish among the seven CGEs. Once all models were evaluated, SAGA calculated ΔAICC, comparing each model AICC score to the minimum AICC score calculated across all possible models. Models were ranked according to ΔAICC, and an Akaike weight (wi) was generated.

With quantitative tools, namely the Castle–Wright estimator,36,37 one can estimate the minimum number of genes involved in a quantitative trait before investing significant resources in mapping studies of those quantitative trait loci (QTL). When the likely CGEs included an additive model, we applied the Castle–Wright estimator33 to estimate the effective number of loci involved in THC, CBD, and CBC trait expression (n^e), underlying differences in cannabinoid profiles between cultivars, based on the parental means (P1, P2) and variances (σP12, σP22) as well as the variances of the F1 and F2 hybrid offspring (σF22, σF12) using the following equation:

e=P1P22σP12σP228σF22σF12

This method assumes: (1) genes have an equal and additive effect on phenotype; and (2) these alleles are at unlinked loci and are stable and dependent on DNA sequence (i.e., not epigenetic). When assumptions are violated, the Castle–Wright method generally underestimates the effective number of loci involved.38,39 The estimated number of genetic factors controlling CBD and THC was greater than 20 in the 2014 data set. Since the Castle–Wright estimator performs poorly when the number of genetic factors is large,40 we did not report these values.

Results

There was a moderately high amount of model selection uncertainty for the inheritance of THC concentration in both data sets. The best model in both data sets, with high Akaike weight, considered autosomal additive effects (Aa, wi=0.29, wi=0.442, respectively; Table 1; Fig. 1). In 2003, the second-best model, with approximately half the Akaike weight of the Aa model (wi=0.26) considered both additive and autosomal dominant genetic effects (Aa+Ad), and the third-best model included cytotype additive genetic effects (Ca; wi=0.146). Moreover, in 2014, there were three second-best models (Ca+AaAa, Ca+Aa, Aa+AaAa) that included Aa, Ca, and autosomal additive by additive epistatic (AaAa) effects. The wi scores and vi scores suggest that Aa and Ca are both important contributors to mean THC concentration in 2014 lineages. AaAa CGEs may also be an important but small contributor to line means, since AaAa appeared in the top three models identified by wi scores in both data sets and had a relatively high vi score in 2014.

Table 1.

Parameter Estimates (±Unconditional Standard Error) for Models of the Genetic Architecture of Cannabinoid Inheritance: Δ-9-Tetrahydrocannabinol

Model
AICC weight (wi)
Estimate (SE)
(1) de Meijer et al.2 Aa Ad Ca Med AaAa AdAd Mea
Aa 0.285 2.43 (1.27)          
Aa, Ad 0.255 3.28 (0.17) −1.25 (0.11)          
Ca 0.146     1.12 (0.78)        
Med 0.095       0.68 (0.61)      
AaAa 0.076         −1.71 (1.77)    
AdAd 0.045           −0.47 (0.93)  
Ad 0.038   −0.36 (1.33)          
Mea 0.036             −0.11 (0.68)
Variable importance -> 0.551 0.293 0.146 0.095 0.076 0.065 0.052
Model
AICC weight (wi)
Estimate (SE)
       
(2) Staginnus et al.29 Aa Ca AaAa        
Aa
0.442
44.57 (0.53)
 
 
       
Ca, AaAa
0.176
 
44.50 (0.08)
53.37 (0.92)
       
Ca, Aa
0.176
53.37 (0.92)
−8.86 (0.92)
 
       
Aa, AaAa
0.176
44.50 (0.08)
8.86 (0.92)
 
       
Variable importance -> 0.822 0.352 0.351        

Cannabinoid inheritance is described for THC based on data from either (1) de Meijer et al.2 or (2) Staginnus et al.29 and an assessment of variable importance within the model and the Akaike weight associated with each genetic effect. AIC corrected for finite sample size (AICC) weights were used to select the models for presentation when the minimum number of models whose weights sum to >95%.

AIC, Akaike information criterion; CBC, cannabichromene; CBD, cannabidiol; SE, standard error; THC, Δ-9-tetrahydrocannabinol.

FIG. 1.

FIG. 1.

Results from replicate analyses of the genetic architecture of THC concentration in Cannabis sativa. (A, C) Traditional line cross analysis plot of THC concentration with respect to proportion of the genome inherited from CBD-rich parent (P1). If THC concentration was inherited in a Mendelian manner, the points should fall on the dotted line and the F1 and F2 data points should fall directly on top of each other. (B, D) Model-weighted parameter estimates for the genetic architecture underlying THC concentration in C. sativa. Points are sized based on vi scores and indicate the magnitude of the genetic effects. Whiskers indicate the unconditional SEs. Only parameter estimate >0, where vi > 0.01 and wi > 0.001, was included. The genetic effects axis describes the weighted parameter estimates calculated by SAGA in R.31 Data presented in (A) and (B) are from de Meijer et al.2; data presented in (C) and (D) are from Staginnus et al.29 Model acronyms include autosomal additive (Aa), autosomal dominant (Ad), complex additive (Ca), maternal effect additive (Mea), maternal effect dominance (Med), autosomal additive by additive epistasis (AaAa), and autosomal dominance by dominance epistasis (AdAd); same acronyms are explained in Appendix Table A2. CBD, cannabidiol; SE, standard error; THC, Δ-9-tetrahydrocannabinol.

There was a moderate amount of model selection uncertainty in both data sets in the inheritance of CBD concentration, after considering three models to describe the 2003 data set and four models to describe the 2014 data set (Table 2). The best and second-best models flip-flopped in rank order among the two data sets. Whereas the best model in 2014 was additive (wi=0.82), the best model in the 20032 data set was a model that considered cytotype additive effects (Ca, wi=0.56; Table 2; Fig. 2). The second-best model in 2003 was Aa (wi=0.37), whereas the second-best model, with a relatively low Akaike weight, in 2014 was the Ca model (wi=0.075).

Table 2.

Parameter Estimates (±Unconditional Standard Error) for Models of the Genetic Architecture of Cannabinoid Inheritance: Cannabidiol

Model
AICC weight (wi)
Estimate (SE)
(1) de Meijer et al.2 Ca Aa AaAa
Ca 0.558 1.40 (0.07)    
Aa 0.374   2.77 (0.16)  
AaAa 0.040     −2.79 (0.29)
Variable importance -> 0.560 0.391 0.040
Model
AICC weight (wi)
Estimate (SE)
(2) Staginnus et al.29 Aa Ca Ad Mea
Aa
0.817
48.79 (3.67)
 
 
 
Ca
0.075
 
28.63 (3.96)
 
 
Aa, Ad
0.043
42.95 (1.98)
 
11.37 (1.98)
 
Mea
0.026
 
 
 
31.63 (5.77)
Variable importance -> 0.884 0.098 0.044 0.027

Cannabinoid inheritance is described for CBD based on data from either (1) de Meijer et al.2 or (2) Staginnus et al.29 and an assessment of variable importance within the model and the Akaike weight associated with each genetic effect. AIC corrected for finite sample size (AICC) weights were used to select the models for presentation when the minimum number of models whose weights sum to >95%.

There was a small amount of model selection uncertainty in both data sets in the inheritance of THC:CBD ratio, after considering seven models to describe each of the 2003 and 2004 data sets. The best model in the 2003 data set was a model that considered additive and additive maternal effects (wi=0.98) and the distant second-best model considered dominant epistatic effects (AdAd) and additive maternal effects (wi=0.10). In the 2004 data set, the best model considered only additive effects (wi=0.63) and the second-best model considered additive and dominant effects (Aa, Ad; wi=0.094). These results pair closely with the conclusions of de Meijer et al.2

There was a moderate amount of model selection uncertainty in 2003 and no model selection uncertainty in 2014 for the inheritance of CBC concentration (Table 3). In 2003, the best model was additive (wi=0.62), and the single variable in the model was an important variable (vi=0.63) and had a relatively large parameter estimate (Table 3; Fig. 3). The second-best model included both autosomal dominant and additive maternal effects (Ad, Mea; wi=0.25), which were both moderately important variables (for both, vi=0.25) but with low parameter estimates. The third-best model involved epistasis among autosomal additive genetic factors (AaAa, wi=0.12), and the single variable in the model had relatively low importance (vi=0.12) but a relatively large parameter estimate. In 2014, the only model that explained chemical variation among parental, F1 and F2 populations was one that included both autosomal additive and dominance effects (Aa, Ad; wi=0.99), which were both important variables in the only model (for both, vi=0.99) and large parameter estimates.

Table 3.

Parameter Estimates (±Unconditional Standard Error) for Models of the Genetic Architecture of Cannabinoid Inheritance: Cannabichromene

Model
AICC weight (wi)
Estimate (SE)
(1) de Meijer et al.2 Aa Ad Mea AaAa
Aa 0.625 −0.26 (0.02)      
Ad, Mea 0.247   −0.004 (0.05) −0.15 (0.004)  
AaAa 0.118       −0.26 (0.03)
Variable importance -> 0.628 0.250 0.248 0.121
Model
AICC weight (wi)
Estimate (SE)
   
(2) Staginnus et al.29 Aa Ad    
Aa, Ad
0.999
−1.15 (0.002)
0.79 (0.004)
   
Variable importance -> 0.999 0.999    

Cannabinoid inheritance is described for CBC based on data from either (1) de Meijer et al.2 or (2) Staginnus et al.29 and an assessment of variable importance within the model and the Akaike weight associated with each genetic effect. AIC corrected for finite sample size (AICC) weights were used to select the models for presentation when the minimum number of models whose weights sum to >95%.

FIG. 3.

FIG. 3.

Results from replicate analyses of the genetic architecture of CBC concentration in Cannabis sativa. (A, C) Traditional line cross analysis plot of CBC concentration with respect to proportion of the genome inherited from CBC-depauperate parent (P1). If CBC concentration was inherited in a Mendelian manner, the points should fall on the dotted line and the F1 and F2 data points should fall directly on top of each other. (B, D) Model-weighted parameter estimates for the genetic architecture underlying CBC concentration in C. sativa. Points are sized based on vi scores and indicate the magnitude of the genetic effects (listed on the x-axis). Whiskers indicate the unconditional SE (note that in D, SE is obscured by the point size). Only parameter estimate >0, where vi > 0.01 and wi > 0.001, was included. The genetic effects axis describes the weighted parameter estimates calculated by SAGA in R.31 Data presented in (A) and (B) are from de Meijer et al.,2 whereas data presented in (C) and (D) are from Staginnus et al.29 CBC, cannabichromene.

Differences in THC, CBD, and CBC concentrations were predicted to be controlled by 3.56, 1.22, and 1.68 factors, respectively, in 2003. Thus, one to two genes controlled the expression of CBD and CBC concentration and three to four genes controlled the expression of THC concentration. Difference in CBC expression was predicted to be controlled by 0.78 factors for the 2014 population, and therefore, this trait is likely controlled by a single gene.

Discussion

Here, we show that inheritance of cannabinoids is far more complex than previously appreciated.2,4,5,28,29 In fact, not only are cannabinoid concentrations polygenic but we have also documented that they are influenced by maternal and cytoplasmic genetic effects. These complex genetic models alter the variance components of the one-locus, two-allele model in fundamental and enlightening ways, as detailed below. By using an I-T approach to identify the best genetic models, rather than a typical LCA approach to elucidate cannabinoid inheritance, we have had more power to predict variation in cannabinoids. However, it is difficult to disentangle the differences in these results because the genetic algorithms of SAGA are unclear in the original article and R documentation. The role of genetic variation on the inheritance of THC:CBD ratio is well described (and our modeling efforts confirmed the general consensus developed in the literature that the ratio is largely an additive inheritance process in C. sativa).2,5 Importantly, it appears as though there is a trade-off between the production of THCA and CBDA, which influences the relative cannabinoid abundance. Nevertheless, how absolute cannabinoid abundance is controlled by genetic factors still requires significant research. Our model predicts that the absolute abundance of cannabinoids, especially THC and CBD, is controlled by different genetic architectures. This discrepancy between the models of relative and absolute abundance hinders careful and predictable breeding efforts of cannabis cultivars, and experimentation is required to reconcile these two conclusions. Because we did not have access to data from unselected parental populations, reciprocal crosses for F1 generations, or backcrossed offspring,2,33 the available data were less likely to yield complex genetic effects. Yet, despite the reduced probability of detecting complex genetic effects and nonadditivity, we found evidence for complexity in the inheritance of all three cannabinoids. Therefore, the elucidated patterns are likely important and understudied aspects of cannabinoid inheritance. For example, altered CBD and THC concentration appears to result from cytoplasmic effects; maternal effects may explain CBC concentrations; and allelic dominance likely influences THC and CBC concentrations. Earlier studies, using traditional analytical approaches, did not explore phenotypic variance explained by the polygenic nature of these traits and nonadditive genetic effects and so their conclusions were limited to additive genetic effects.2–4,28 By identifying genetic effects on cannabinoid concentrations, we expect our results will make it easier to predict and control cannabinoid production.

Population-level studies involve limited parental germplasm and thus provide information of CGEs acting on that population alone (i.e., results are not globally generalizable). Our use of two complementary data sets with unique parental genotypes, and the comparison of our results with a third study using unique parental genotypes,5 highlights those CGEs common to multiple breeding populations. This confirms that cannabinoid abundance is, in part, controlled by a nonfunctional CBDA synthase, in contrast to a single-locus additive gene, as originally proposed.2,29 Furthermore, we detected the presence of multiple genes acting on the concentration of THC in both 20032 and 2014,29 consistent with de Meijer's original quantitative model2 and the mapped “cannabinoid quantity” QTL.5 However, we present only summarized data because the raw data set was a gift from de Meijer et al.2,29 We encourage readers who want to explore our approach to contact the authors of these studies directly.

CBD is likely controlled by at least one major gene and a series of smaller genetic effects. In contrast to previous research, cytogenetic effects appear to contribute to the inheritance of CBD in both populations, suggesting a role of mitochondrial or plastid genomes in its expression (Fig. 3; Appendix Table A2). Comparative sequencing of chloroplast and mitochondrial DNA noted the utility of extranuclear genomes in differentiating among fiber and drug cultivars,41 which are differentiated by THC concentrations but often also differ in CBD concentration.23 However, this remains understudied because other studies have removed “sequences obviously originating from organelles or ribosomal RNA.”42 Finally, CBC concentrations are likely the product of two major genes, one additive and one dominant, given the consistent role of these two CGEs across both data sets.

Our analyses also detected differences in the role of CGEs among interbreeding populations, suggesting that convergent evolution is responsible for some of the chemical diversity within C. sativa populations. For instance, where cytotype and epistatic additive effects appear to play a role in the expression of THC concentration in the 2014 populations,29 they play more minor roles in the 2003 population.2 Maternal effects influenced CBC and chemotype expression in the 2003 data set,2 whereas they were not included in the model in the 2014 data set.29 These differences in genetic effects may help explain the broad difficulties for hemp breeders when attempting to limit the production of THC, and for cannabis growers in the production of consistent concentrations of cannabinoids between and within crop varieties. Furthermore, more general epistatic effects have not been considered and may contribute to fluctuating chemotype expression.

The Castle–Wright estimator is commonly used to explore the genetic basis of trait evolution and has been commonly used to evaluate crop traits that have likely involved adaptation through altered expression of second metabolites (e.g., resistance to fungal infections, insect resistance, and heat tolerance43–45). Previous QTL and modeling approaches suggest that the Castle–Wright estimator may be robust for traits with few loci, even if assumptions are not met.40,46,47 Our Castle–Wright estimator results for THC, CBD, and CBC in 20032 are consistent with the independent CGE analyses run in SAGA, in terms of our expectations of the number of genes involved in a trait. However, the results differ dramatically between the two data sets, suggesting that populations may differ dramatically in the mechanisms of inheritance and the number of genes involved.

Our results suggest that variation in cannabinoid expression was not previously fully described. We hypothesize that nonadditive genetic effects play prominent roles in cannabinoid genetics. A formal test of integration, whether phenotypic,48 genetic,49,50 or developmental,51 would make significant advances in our understanding of cannabinoid inheritance. Thus, new crosses that map loci relative to psychoactive chemicals and pathways in the F1 and F2 generations of diverse chemotypes52 will complement ongoing efforts to characterize the genome of this plant.53,54

Acknowledgments

The authors gratefully acknowledge the Natural Sciences and Engineering Research Council (NSERC) Discovery Grants program (Nos. 402305-2011 to L.C., and 2015-04405 to S.A.S.) and the Ryerson University for funding; the previous authors for their original work and shared data.

Abbreviations Used

AIC

Akaike information criterion

CBC

cannabichromene

CBD

cannabidiol

CBDA

cannabidiolic acid

CBG

cannabigerol

CGEs

composite genetic effects

GC

gas chromatography

I-T

information-theoretic

LCA

line cross analysis

SE

standard error

THC

Δ-9-tetrahydrocannabinol

THCA

tetrahydrocannabinolic acid

Appendix

APPENDIX TABLE A1.

Summary of the Data Used for the Analysis

  De Meijer et al.A1
Staginnus et al.A2
n Mean (SE) n Mean (SE)
THC
 P1:daughters 9 7.59 (1.27) 1 92.91 (1.00×10−7)
 P2:sons 11 0.56 (0.36) 2 3.9 (0.2)
 F1 (P2×P1):mixed 546 2.56 (0.08) 12 38.91 (1.43)
 F2 (F1×F1):mixed 101 3.16 (0.10) 230 40.89 (2.09)
CBD
 P1:daughters 11 3.97 (0.77) 2 85, 2.9
 P2:sons 12 0 (1.00×10−7) 1 0 (1.00×10−7)
 F1 (P2×P1):mixed 101 2.97 (0.10) 12 53.90 (1.40)
 F2 (F1×F1):mixed 546 2.61 (0.10) 230 50.47 (2.08)
CBC
 P1:daughters 9 0 (1.00×10−7) 1 1.29 (1.00×10−7)
 P2:sons 10 0.32 (0.07) 2 3.6 (0.1)
 F1 (P2×P1):mixed 101 0.26 (0.01) 12 3.24 (0.07)
 F2 (F1×F1):mixed 118 0.28 (0.01) 230 2.84 (0.07)

Data gathered from de Meijer et al.A1 and Staginnus et al.A2 Data are presented with two decimal places. Note: From de Meijer et al.,A1 the P1 group included the following genotypes (and their selfed offspring): 94.4.12, 94.5.2, 55.28.1, 94.4.12. The P2 group included the following genotypes (and their selfed offspring): 55.22.7, 55.24.4, 55.28.1. From Staginnus et al.,A2 the P1 group included the following genotypes (and their selfed offspring): 2005.40.69, whereas the P2 group included 99.2.21.30.16.15.3 and 2001.22.6.20.14.1.

CBC, cannabichromene; CBD, cannabidiol; SE, standard error; THC, Δ-9-tetrahydrocannabinol.

APPENDIX TABLE A2.

The Composite Genetic Effects, Their Abbreviation, and a Brief Definition That Were Compared Using an Information Theory Approach with the Analysis of Line Cross Data

Name Abbreviation Definition
Autosomal additive Aa When the combined effects of alleles at different loci are equal to the sum of their individual effects. Autosomal genes are located on one of the numbered, or non-sex, chromosomes.A3,A4
Autosomal dominance Ad In dominant genetic models, a single allele copy of the mutation is enough to cause expression of the trait.A5
Cytotype additive Ca Genetic variation in cytoplasmic genomes (i.e., the combined mitochondrial and plastid genomes) can influence trait expression.A3,A6
Maternal effect additive Mea Maternal effects arise when the genetic and environmental characteristics of a mother influence the phenotype of her offspring, beyond the direct inheritance of alleles. In this case, transgenerational expression of induced genes with additive genetic effects.A3,A7
Maternal effect dominance Med In this case, transgenerational expression of induced genes with dominance genetic effects.A3
Autosomal additive by additive epistasis AaAa The single locus additive value for a given locus (e.g., A) changes depending on the genotype at a second locus (e.g., B) and vice versa. When an alternate allele at locus A is present, it will result in smaller phenotype values when the B-locus is homozygous for the primary genotype and will result in large phenotype values when the B-locus is homozygous for the alternate genotype.A8
Autosomal dominance by dominance epistasis AdAd The single locus dominance genotype value for a given locus (e.g., A) changes depending on the genotype at a second locus (e.g., B) and vice versa. The single locus dominance genotypic value is underdominant when the alternate locus genotype is homozygous and overdominant when the alternate locus genotype is heterozygous.A8

For quantitative traits such as cannabinoid concentrations, their expression may be determined by genetic factors, environmental factors, and parental environment. Thus, when assessing the inheritance of chemotypes, a diversity of genetic models must be considered.A3

APPENDIX 3. Sample Code Used to Analyze the Data. This Particular Code was Used to Analyze CBC Data from de Meijer et al.A1

#open the SAGA R library

library(SAGA)

#create a data matrix called cbc that is 4 rows by 3 columns

cbc<- matrix(,4,3)

#give the columns names

colnames(cbc) <- c(“Cohort.ID,” “Mean,” “SE”)

#populate the cells of the matrix with data

cbc[1,] <- c(1, 0, 0.0000001)

cbc[2,] <- c(5, 0.321014118, 0.065581287)

cbc[3,] <- c(9, 0.257499841, 0.009045537)

cbc[4,] <- c(15, 0.27703782, 0.012610792)

#show the matrix to confirm its organized correctly

cbc

#Perform the analysis of the available crosses and produce a graph to describe some of the results

results3 <- AnalyzeCrossesMM(cbc, graph=T, cex.names=.8)

#In this example, there were three models for which their summed weights accounted for 95% of the ΔAICC calculated the program. The value 3 will change depending on the results above. Next, I rank their model weights (so my table is arranged nicely and I know for which models I should produce parameter estimates).

order(results3[[3]])[1:3]

#Returns parameter estimates conditional on a single model (where 1, 15, and 6 represent the three best models identified in the previous step for this sample data set).

EvaluateModel(results3, 1, cex.names=.7, cex.main=.7)

EvaluateModel(results3, 15, cex.names=.7, cex.main=.7)

EvaluateModel(results3, 6, cex.names=.7, cex.main=.7)

#Examine the information returned from AnalyzeCrossesMM - first look at the names of the elements in this list

names(results3)

#Access variable importance in element 4

results3[[4]]

#Convert a vector of ΔAIC or ΔAICC values to model weights

AICtoMW(results3[[3]])

Appendix References

  • A1.de Meijer EPM, Bagatta M, Carboni A, et al. The inheritance of chemical phenotype in Cannabis sativa L. Genetics. 2003;163:335–346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • A2.Staginnus C, Zörntlein S, de Meijer EPM. A PCR marker linked to a THCA synthase polymorphism is a reliable tool to discriminate potentially THC-rich plants of Cannabis sativa L. J Forensic Sci. 2014;59:919–926 [DOI] [PubMed] [Google Scholar]
  • A3.Wade MJ. Epistasis, complex traits, and mapping genes. Genetica. 2001;112:59–69 [PubMed] [Google Scholar]
  • A4.Kempthorne O. An introduction to genetic statistics. John Wiley & Sons: New York, 1957 [Google Scholar]
  • A5.Lynch M, Walsh B. Genetics and analysis of quantitative traits. Sinauer Associates: Sunderland, MA, 1998 [Google Scholar]
  • A6.Dobler R, Rogell B, Budar F, Dowling DK. A meta-analysis of the strength and nature of cytoplasmic genetic effects. J Evol Biol. 2014;27:2021–2034 [DOI] [PubMed] [Google Scholar]
  • A7.Mousseau TA, Fox CW, eds. 1998. Maternal Effects as Adaptations. New York, USA: Oxford University Press [Google Scholar]
  • A8.Cheverud JM. Detecting epistasis among quantitative trait loci. In: Wolf JB, Brodie ED, Wade MJ (eds). Epistasis and the evolutionary process. Oxford University Press: Oxford, UK, pp. 58–81, 2000 [Google Scholar]

Author Disclosure Statement

No competing financial interests exist.

Cite this article as: Campbell LG, Dufresne J, Sabatinos SA (2020) Cannabinoid inheritance relies on complex genetic architecture, Cannabis and Cannabinoid Research 5:1, 105–116, DOI: 10.1089/can.2018.0015.

References

  • 1. Gepts P. Origins of plant agriculture and major crop plants. In: Tolba MK (ed). Our fragile world: challenges and opportunities for sustainable development, EOLSS Publishers: Oxford, UK, pp. 627–637, 2001 [Google Scholar]
  • 2. de Meijer EPM, Bagatta M, Carboni A, et al. The inheritance of chemical phenotype in Cannabis sativa L. Genetics. 2003;163:335–346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. de Meijer EPM, Hammond KM. The inheritance of chemical phenotype in Cannabis sativa L. (II): cannabigerol predominant plants. Euphytica. 2005;145:189–198 [Google Scholar]
  • 4. de Meijer EPM, Hammond KM, Micheler M. The inheritance of chemical phenotype in Cannabis sativa L. (III): variation in cannabichromene proportion. Euphytica. 2009;165:293–311 [Google Scholar]
  • 5. Weiblen GD, Wenger JP, Craft KJ, et al. Gene duplication and divergence affecting drug content in Cannabis sativa. New Phytol. 2015;208:1241–1250 [DOI] [PubMed] [Google Scholar]
  • 6. Kang BC, Nahm SH, Huh JH, et al. An interspecific (Capsicum annuum x C. chinense) F2 linkage map in pepper using RFLP and AFLP markers. Theor Appl Genet. 2001;102:531–539 [Google Scholar]
  • 7. Mondego JMC, Vidal RO, Carazzolle MF, et al. An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora. BMC Plant Biol. 2011;11:30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lewis RS, Bowen SW, Keogh MR, et al. Three nicotine demethylase genes mediate nornicotine biosynthesis in Nicotiana tabacum L.: functional characterization of the CYP82E10 gene. Phytochemistry. 2010;71:1988–1998 [DOI] [PubMed] [Google Scholar]
  • 9. Yadav HK, Shukla S, Singh SP. Genetic divergence in parental genotypes and its relation with heterosis, F1 performance and general combining ability (GCA) in opium poppy (Papaver somniferum L.). Euphytica. 2007;157:123–130 [Google Scholar]
  • 10. Ziegler J, Diaz-Chávez ML, Kramell R, et al. Comparative microarray analysis of morphine containing Papaver somniferum and eight morphine free Papaver species identifies an O-methyltransferase involved in benzylisoquinoline biosynthesis. Planta. 2005;222:458–471 [DOI] [PubMed] [Google Scholar]
  • 11. Mechoulam R. Marihuana chemistry. Science. 1970;168:1159–1166 [DOI] [PubMed] [Google Scholar]
  • 12. Mechoulam R. Plant cannabinoids: a neglected pharmacological treasure trove. Br J Pharmacol. 2005;146:913–915 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Pertwee RG. Cannabinoid pharmacology: the first 66 years. Br J Pharmacol. 2006;147:S163–S171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Turner CE, Bouwsma OJ, Billets S, et al. Constituents of Cannabis sativa L. XVIII–Electron voltage selected ion monitoring study of cannabinoids. Biomed Mass Spectrom. 1980;7:247–256 [DOI] [PubMed] [Google Scholar]
  • 15. Adams R, Hunt M, Clark JH. Structure of cannabidiol, a product isolated from the marihuana extract of Minnesota wild hemp. J Am Chem Soc. 1940;62:196–200 [Google Scholar]
  • 16. Mechoulam R, Shvo Y. Hashish-I, the structure of cannabidiol. Tetrahedron. 1963;19:2073–2078 [DOI] [PubMed] [Google Scholar]
  • 17. Gaoni Y, Mechoulam R. Isolation, structure and partial synthesis of an active constituent of hashish. J Am Chem Soc. 1964;86:1646–1647 [Google Scholar]
  • 18. Gaoni Y Mechoulam R. The structure and synthesis of cannabigerol a new hashish constituent. Proc Chem Soc. 1964;March:82 [Google Scholar]
  • 19. Gaoni Y, Mechoulam R. Cannabichromene, a new active principle in hashish. Chem Comm. 1966;1:20–21 [Google Scholar]
  • 20. de Meijer EPM, Van Der Kamp HJ Van Eeuwijk FA. Characterisation of Cannabis accessions with regard to cannabinoid content in relation to other plant characters. Euphytica. 1992;62:187–200 [Google Scholar]
  • 21. Fournier G. Les chemiotypes du chanvre (Cannabis sativa L.). Intérêt pour un programme de selection. Agronomie. 1981;1:679–688 [Google Scholar]
  • 22. Fournier G, Paris MR. Le chanvre papetier (Cannabis sativa L.) cultivé en France: le point sur ses constituants (Paper hemp cultivated in France: a point on the constituents). Plantes Médicin Phytothérap (Phytotherapeutic Medicinal Plants). 1979;13:116–121 [Google Scholar]
  • 23. Small E, Beckstead HD. Common cannabinoid phenotypes in 350 stocks of Cannabis. Lloydia. 1973;36:144–165 [PubMed] [Google Scholar]
  • 24. Akhmetova G, Sarsenbayev KN, Boribay E, et al. DNA heterogenity in the Cannabis population of Chui valley. J Biotechnol. 2014;185:S62–S63 [Google Scholar]
  • 25. Gao C, Xin P, Cheng C, et al. Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers. PLoS One. 2014;9:e110638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Taura F, Morimoto S, Shoyama Y. First direct evidence for the mechanisms of delta-1-tetrahydrocannabinolic acid biosynthesis. J Am Chem Soc. 1995;38:9766–9767 [Google Scholar]
  • 27. Taura F, Morimoto S, Shoyama Y. Purification and characterization of cannabidiolic-acid synthase from Cannabis sativa L. J Biol Chem. 1996;271:17411–17416 [DOI] [PubMed] [Google Scholar]
  • 28. de Meijer EPM, Hammond KM, Sutton A. The inheritance of chemical phenotype in Cannabis sativa L. (IV): cannabinoid-free plants. Euphytica. 2009;168:95–112 [Google Scholar]
  • 29. Staginnus C, Zörntlein S, de Meijer EPM. A PCR marker linked to a THCA synthase polymorphism is a reliable tool to discriminate potentially THC-rich plants of Cannabis sativa L. J Forensic Sci. 2014;59:919–926 [DOI] [PubMed] [Google Scholar]
  • 30. Mandolino G, Carboni A. Potential of marker-assisted selection in hemp genetic improvement. Euphytica. 2004;140:107–120 [Google Scholar]
  • 31. Blackmon H, Demuth JP. An information-theoretic approach to estimating the composite genetic effects contributing to variation among generation means: moving beyond the joint-scaling test for line cross analysis. Evolution. 2016;70:420–432 [DOI] [PubMed] [Google Scholar]
  • 32. Lynch M, Walsh B. Genetics and analysis of quantitative traits. Sinauer Associates: Sunderland, MA, 1998 [Google Scholar]
  • 33. Wade MJ. Epistasis, complex traits, and mapping genes. Genetica. 2001;112:59–69 [PubMed] [Google Scholar]
  • 34. Demuth JP, Wade MJ. Experimental methods for measuring gene interactions. Annu Rev Ecol Evol Syst. 2006;37:289–316 [Google Scholar]
  • 35. McQuarrie AD, Tsai CL. Regression and time series model selection. World Scientific Publishing Company: Singapore, 1998 [Google Scholar]
  • 36. Castle WE. An improved method of estimating the number of genetic factors concerned in cases of blending inheritance. Proc Nat Acad Sci USA. 1921;81:6904–6907 [DOI] [PubMed] [Google Scholar]
  • 37. Wright S. Evolution and the genetics of populations. University of Chicago Press: Chicago, IL, 1968 [Google Scholar]
  • 38. Zeng ZB. Correcting the bias of Wright's estimates of the number of genes affecting a quantitative character: a further improved method. Genetics. 1992;131:987–1001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Zeng ZB, Houle D, Cockerham CC. How informative is Wright's estimator of the number of genes affecting a quantitative character? Genetics. 1990;126:235–247 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Otto SP, Jones CD. Detecting the undetected: estimating the total number of loci underlying a quantitative trait. Genetics. 2000;156:2093–2107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Gilmore S, Peakall R, Robertson J. Organelle DNA haplotypes reflect crop-use characteristics and geographic origins of Cannabis sativa. For Sci Int. 2007;172:179–190 [DOI] [PubMed] [Google Scholar]
  • 42. Marks MD, Tian L, Wenger JP, et al. Identification of candidate genes affecting delta 9-tetrahydrocannabinol biosynthesis in Cannabis sativa. J Exp Bot. 2009;60:3715–3726 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Chen X, Line RF. Gene number and heritability of wheat cultivars with durable, high-temperature, adult-plant (HTAP) resistance and interaction of HTAP and race-specific seedling resistance to Puccinia striiformis. Phytopathology. 1995;85:573–578 [Google Scholar]
  • 44. Adamczyk JJ, Meredith WR. Genetic basis for variability of Cry1Ac expression among commercial transgenic Bacillus thuringiensis (Bt) cotton cultivars in the United States. J Cotton Sci. 2004;8:17–23 [Google Scholar]
  • 45. Jones IT. Inheritance of adult plant resistance to mildew in oats. Ann Appl Biol. 2008;109:187–192 [Google Scholar]
  • 46. Peichel CL, Nereng KS, Ohgi KA, et al. The genetic architecture of divergence between three-spine stickleback species. Nature. 2001;414:901–905 [DOI] [PubMed] [Google Scholar]
  • 47. Westerbergh A, Doebley J. Morphological traits defining species differences in wild relatives of maize are controlled by multiple quantitative trait loci. Evolution. 2002;56:273–283 [DOI] [PubMed] [Google Scholar]
  • 48. Herrera CM, Cerdá X, García MB, et al. Floral integration, phenotypic covariance structure and pollinator variation in bumblebee-pollinated Helleborus foetidus. J Evol Biol. 2002;15:108–121 [Google Scholar]
  • 49. Cheverud JM. Phenotypic, genetic, and environmental morphological integration in the cranium. Evolution. 1982;36:499–516 [DOI] [PubMed] [Google Scholar]
  • 50. Delph LF. The genetic integration of sexually dimorphic traits in the dioecious plant, Silene latifolia. In: Fairbairn DJ, Blanckenhorn WU, Szekely T (eds). Sex, size and gender roles. Evolutionary studies of sexual size dimorphism. Oxford University Press: Oxford, UK, pp. 115–123, 2007 [Google Scholar]
  • 51. Schlichting CD, Pigliucci M. Phenotypic evolution: a reaction norm perspective. Sinauer Associates, Inc.: Sunderland, MA, 1998 [Google Scholar]
  • 52. Beavis WD. QTL analysis: power, precision, and accuracy. In: Paterson AH (ed). Molecular dissection of complex traits. CRC Press: Boca Raton, FL, pp. 145–162, 1997 [Google Scholar]
  • 53. Lynch RC, Vergara D, Tittes S, et al. Genomic and chemical diversity in Cannabis. Critical Reviews in Plant Sciences. 2016;35:349–363 [Google Scholar]
  • 54. Van Bakel H, Stout JM, Cote AG, et al. The draft genome and transcriptome of Cannabis sativa. Genome Biol. 2011;12:R102. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Cannabis and Cannabinoid Research are provided here courtesy of Mary Ann Liebert, Inc.

RESOURCES