Skip to main content
Plant Biotechnology Journal logoLink to Plant Biotechnology Journal
. 2019 Jun 12;18(1):96–105. doi: 10.1111/pbi.13175

Whole‐genome resequencing‐based QTL‐seq identified AhTc1 gene encoding a R2R3‐MYB transcription factor controlling peanut purple testa colour

Yuhan Zhao 1,2, Junjie Ma 1,3, Ming Li 1, Li Deng 4, Guanghui Li 1, Han Xia 1,2, Shuzhen Zhao 1,2, Lei Hou 1, Pengcheng Li 1,2, Changle Ma 2, Mei Yuan 5, Li Ren 4, Jianzhong Gu 4, Baozhu Guo 6,7, Chuanzhi Zhao 1,2,6,7,, Xingjun Wang 1,2,
PMCID: PMC6920131  PMID: 31131506

Summary

Peanut (Arachis hypogaea. L) is an important oil crop worldwide. The common testa colours of peanut varieties are pink or red. But the peanut varieties with dark purple testa have been focused in recent years due to the potential high levels of anthocyanin, an added nutritional value of antioxidant. However, the genetic mechanism regulating testa colour of peanut is unknown. In this study, we found that the purple testa was decided by the female parent and controlled by a single major gene named AhTc1. To identify the candidate gene controlling peanut purple testa, whole‐genome resequencing‐based approach (QTL‐seq) was applied, and a total of 260.9 Gb of data were generated from the parental and bulked lines. SNP index analysis indicated that AhTc1 located in a 4.7 Mb region in chromosome A10, which was confirmed by bulked segregant RNA sequencing (BSR) analysis in three segregation populations derived from the crosses between pink and purple testa varieties. Allele‐specific markers were developed and demonstrated that the marker pTesta1089 was closely linked with purple testa. Further, AhTc1 encoding a R2R3‐MYB gene was positional cloned. The expression of AhTc1 was significantly up‐regulated in the purple testa parent YH29. Overexpression of AhTc1 in transgenic tobacco plants led to purple colour of leaves, flowers, pods and seeds. In conclusion, AhTc1, encoding a R2R3‐MYB transcription factor and conferring peanut purple testa, was identified, which will be useful for peanut molecular breeding selection for cultivars with purple testa colour for potential increased nutritional value to consumers.

Keywords: peanut, testa colour, AhTc1, QTL‐seq, BSA, antioxidant

Introduction

Peanut (or groundnut), widely cultivated in more than 100 countries, with the annual production of 43.98 million tons, is one of the most important oil crops in the world (FAOSTAT 2016, www.faostat.fao.org). Peanut seeds are rich in oil (40%–60%), protein (20%–40%), carbohydrate (10%–20%) and many other nutritional values, such as vitamin B1, B3, B9 and E, biotin, resveratrol, isoflavones and phytic acid (Pandey et al., 2012; Zhao et al., 2012). The testa colours of most peanut varieties are with pink or red. Recently, the peanut varieties with purple testa have attracted increasing attention as a potential source of nutraceuticals, owing to their higher anthocyanin content and microelements, which are beneficial to human health (Attree et al., 2015; Kuang et al., 2017). Peanut varieties with purple testa satisfy the needs of the customers. However, the commercial fine varieties with black or purple testa are scarce which limits the production of purple peanuts.

In the last decade, molecular breeding approaches such as marker‐assisted selection (MAS) have been utilized in peanut and other crops, which significantly improved the efficiency of breeding. Despite the agronomic importance of testa colour, the genetic and the molecular mechanism that controls testa colour in peanut are not clear. Besides pink and purple testa, germplasms with white, red and tan testa are also available. It is reported that the purple testa is completely dominant to white and incompletely dominant to the basic tan or pink testa colour, and purple testa of peanut was controlled by a single dominant gene (Branch, 1985, 2001). A study showed that the purple testa of peanut was controlled by an incomplete dominant major gene, which linked with SSR marker PM93 (Hong et al., 2007). Sequence alignment showed that SSR marker PM93 was same as Seq4H11, located at 6.664 cM of linkage group TA10 in the integrated consensus map of cultivated peanut and wild relatives (Shirasawa et al., 2013).

Cultivated peanut (Arachis hypogaea. L) is an allotetraploid (AABB, 2n = 4× = 40). The large size of the genome (2800 Mb), the ploidy level and high content of repetitive DNA (Dhillon et al., 1980) have obstructed genetic and genomic studies in peanut, and make positional cloning of a gene very difficult. During the past 5 years, significant progress has been made in peanut genomic sequencing and QTL mapping. In 2016, the genome sequences of two diploid wild ancestors of cultivated peanut, Arachis duranensis and Arachis ipaensis, were determined (Bertioli et al., 2016; Chen et al., 2016). In 2017, two groups reported the completion of genome sequences of cultivated peanut, Tifrunner (Bertioli et al., 2019) and Shitouqi (Zhuang et al., 2019). In addition, the draft sequences of Arachis monticola, the only allotetraploid wild peanut in the Arachis genus, were reported (Yin et al., 2018). These efforts provided new opportunities for peanut genetic analysis, gene cloning, QTL mapping and molecular marker development (Luo et al., 2019; Pandey et al., 2017a,b; Wang et al., 2017; Zhao et al., 2017).

Traditional QTL analysis has been known as time‐consuming and labour‐intensive work. In comparison, bulked segregant analysis (BSA) was first reported in 1991 and is an elegant method for rapidly identifying markers linked to any specific gene or genomic region using two bulked DNA pools (Michelmore et al., 1991). Recent development of next‐generation sequencing technologies (NGS) reduced the cost and shortened the cycles of sequencing, promoting the use of sequence‐based trait mapping approaches. Based on the integration of BSA, NGS and bioinformatics analysis, a series of more efficient approaches were developed, including QTL‐seq, MutMup, MutMup+ and BSR‐seq. These approaches have been successfully employed in QTL mapping and candidate gene identification in several crops (Abe et al., 2012; Fekih et al., 2013; Steuernagel et al., 2016; Takagi et al., 2013). QTL‐seq is a whole‐genome resequencing (WGRS)‐based approach, which firstly to resequence two bulked DNA of progenies (each with 20–50 individuals) showing extreme phenotypic values and then identifying the candidate region or genes by counting and comparing the index SNPs between two bulks (Takagi et al., 2013). It is remarkable that QTL‐seq has been successfully used in peanut in mapping QTLs of rust resistance, late leaf spot resistance, and genomic regions and candidate genes controlling shelling percentage (Clevenger et al., 2018; Luo et al., 2019; Pandey et al., 2017a).

In this study, the inheritance of purple testa of peanut was analysed using F2, F2:3 segregation population derived from a cross between peanut varieties with pink and purple testa. A major location controlling the purple testa was mapped using QTL‐seq method. Furthermore, a candidate gene, AhTc1, encoding a R2R3‐MYB transcription factor, was identified using map‐based cloning method together with transcriptome analysis and gene expression profiling. Functional studies indicated that AhTc1 played important roles in regulating anthocyanin biosynthesis. Our results laid the foundation for breeding purple testa peanut varieties using peanut MAS program.

Results

Genetic analysis of purple testa in YH29

YH29 and ZH9 are varieties with purple testa, while WH10, GT‐C20 and ZH8 are varieties with pink testa. In addition, among the parental lines, there was also variation in the leaflets, flower, vasculature and petiole colour where lines with darker testa also showed more purple coloration in these tissues (Figure 1). The peanut varieties with purple testa were used as male parents for crossing with the varieties with pink testa (Table 1). All F1 plants showed the intermediate phenotype in leaflets, vasculature, petiole and flower colour comparison with their parental lines (Figure 1). However, all F1 seeds were with pink testa, same as the female parents. For F2 and F3 generation, the seeds harvested from a single plant were also with same testa colour, and the testa colour was in consistence with the colour of female parents. Our results were in consistence with studies in other plants (Lambrides et al., 2004). So, we predicted that testa was developed by the integument of the ovule, and hence, the testa has the same genotype with the maternal plant, which lead to the segregate of testa colour in the next generation. We determined the genotype of a seed through the phenotype of seeds from KF1 (WH10♀ × YH29♂) and KF2 (GT‐C20♀ × YH29♂) populations. Statistical results corresponded to a single locus segregation ratio (Table 1). Combined with phenotypes of F1 plants and F2 seeds, we suggested that purple testa of YH29 was controlled by an incomplete dominant gene and this gene was named as AhTc1.

Figure 1.

Figure 1

Variation in colour and content of pigments between parental lines and offsprings. (a) F1 seeds derived from the cross of WH10 and YH29. (b) The flower colour difference between WH10 and YH29. (c) The contents of anthocyanin in testa of parents and F2. (d) Leaves of parental lines and F1. (e) Seeds of parental lines and F2. (f) Seedlings of parental lines and F2.

Table 1.

Genetic analysis of purple testa colour in peanut

Name Generation Total plants Purple Light purple Pink Expected ratio χ2 P‐value

KF1

WH10♀ × YH29♂

F1 20 0 0 20
F2 355 0 355 0
F3 355 83 178 94 1 : 2 : 1 0.231 0.05

KF2

GT‐C20♀ × YH29♂

F1 10 0 10
F2 107 107
F3 107 26 52 29 1 : 2 : 1 0.252 0.05

ZH

ZH8♀ × ZH9♂

F1 8 0 8
F2 72 72
F7(RIL) 72 44 28

QTL‐seq predicted candidate genomic region controlling purple testa

To map AhTc1, KF1 population was used for QTL‐seq analysis. Two parents and two extreme bulks (purple‐pool and pink‐pool) were sequenced using Illumina HiSeq 2500 platform. In total, 116 631 629, 109 298 108, 350 931 558 and 292 791 543 read pairs (150 bp × 2) were generated from pink testa parent, purple testa parent, pink‐pool and purple‐pool, respectively. Among which, 88 487 214 (75.87%), 83 752 350 (76.63%), 266 400 406 (75.91%) and 222 527 818 (76.00%) read pairs were uniquely and confidently mapped to the reference genome, respectively (Table 2).

Table 2.

Data generated by QTL‐seq and BSR‐seq

Populations Samples Raw data (Gb) High‐quality reads (150 bp × 2) Uniquely mapped reads (150 bp × 2) Alignment (%) Average depth
For QTL‐seq
KF1 WH10 34.99 116 623 263 88 487 214 75.87 12.50
YH29 32.79 350 904 687 266 400 406 75.91 11.71
Purple‐pool 87.84 109 290 025 83 752 350 76.63 31.37
Pink‐pool 105.28 292 769 375 222 527 818 76.00 37.60
For BSR‐seq
KF1 WH10 4.34 14 462 145 11 341 989 78.43
YH29 4.74 15 812 681 12 285 282 77.69
Purple‐pool 7.93 26 442 982 20 577 921 77.82
Pink‐pool 8.97 29 907 000 23 786 858 79.54
KF2 GT‐C20 3.86 12 851 990 10 415 745 81.04
YH29 4.74 15 812 681 12 285 282 77.69
Purple‐pool 8.80 29 315 586 23 489 773 80.13
Pink‐pool 8.30 27 654 103 21 936 469 79.32
ZH ZH8 3.89 12 951 240 10 319 990 79.68
ZH9 4.55 15 162 715 12 275 279 80.96
Purple‐pool 11.06 36 875 909 29 091 397 78.89
Pink‐pool 9.18 30 611 925 24 335 624 79.50

By comparing the two extreme bulks, a total of 640 757 high‐quality genome‐wide SNPs were called (Table S1).With a filtration criterion of allele frequency difference (AFD) >0.5 and Fisher exact test P‐value < 1e−5, a total of 1797 SNPs were putatively associated with the purple testa phenotype, and 1317 (73.29%) of them were located on chromosome A10 (Figure 2a), suggesting that the purple testa controlling gene was located on A10. SNP index analysis showed that the region on A10 from 108.0 Mb to 112.7 Mb exhibited significant unequal contributions. In this 4.7 Mb region, the average AFD value was 0.782, and 214 SNPs with AFD > 0.8, which was the highest in peanut genome, suggesting that the region might contain candidate gene controlling purple testa (Figure 2b).

Figure 2.

Figure 2

Mapping of genomic region controlling purple testa. (a) Distribution of candidate SNPs per chromosomes (AFD > 0.5, P‐value < 1e−5), (b) AFD plot of chromosome A10. The significant region identified for purple is shaded (108.0–112.7 Mb).

Confirmation of the candidate region using BSR‐seq in three populations

Since the phenotype variation can be observed through the colour of the leaflets between parental lines and individual of F2, the candidate gene could be an expressing gene. Therefore, RNA bulks for extreme phenotypes were constructed using KF1 population (F2), KF2 population (F2) and ZH (ZH8♀ × ZH9♂) RIL population. BSR sequencing generated a total 25.99, 25.69 and 28.68 Gb raw data from KF1, KF2 and ZH population, respectively (Table 2). A total of 135, 51 and 172 SNPs putatively associated with the purple testa phenotype were identified, and 71.11%, 33.34% and 65.70% of these SNPs from KF1, KF2 and ZH population were in chromosome A10, respectively (Figure S1). BSR‐seq results also showed that the SNPs were enriched in the same candidate region (Chr. A10:108.0…112.7 Mb), which confirmed the results of QTL‐seq (Figure S1).

Development of STS markers, validation and narrowing the candidate genes

To narrow the candidate SNPs, 10 STS markers were developed in the candidate region on chromosome A10 range from Arahy.10:103268353 to 113621464 (Figure 3a). Seven out of these markers showed good amplification and polymorphism between parents (Table 3). Linkage of these SNPs to AhTc1 was confirmed by testing eight pink testa and eight purple F2 lines. Genotyping result demonstrated that the marker pTesta1089 (Arahy.10:108900285) was closely linked with AhTc1 (Figure 3c). Comparative genomic analysis showed that there are 258 genes in the 4.7 Mb region (Table S2). According to functional annotation, nine candidate genes including three MYB transcription factor genes, a bHLH gene, two MADX‐box transcription factor genes, an ABC transporter gene, a F‐box transcription factor gene and a cytochrome P450 gene were highlighted (Figure 3a, Table S2). The SNP marker pTesta1089 located in the 3′ of the MYB transcription factor (gene name: J3K16K), and the distance between them is only 2724 bp (Figure 3b). Interestingly, RNA‐seq result showed that gene (J3K16K) was highly expressed in the purple testa varieties (YH29 and ZH9) to compare with that in the pink testa varieties (WH10 and ZH8) (Figure 3d). These results implied that J3K16L was the candidate gene of AhTc1.

Figure 3.

Figure 3

Cloning and expression analysis of AhTc1. (a) SNP markers and genes in candidate region of Arachis hypogaea chromosome A10. (b) The location and gene structure of J3K16K. (c) Genotyping results of seven SNP markers in F2 population. (d) The expression level of J3K16K in leaves of purple and pink testa varieties.

Table 3.

Polymorphic markers linked to AhTc1

Primers SNP location AFD Forward primer Product length (bp)
pTesta1032 Arahy.10:103268353 0.65

F: GAATAGATCATCATATATTGAAA

R: TTCAAACTCGTATACCCAGTAG

696
pTesta1089 Arahy.10:108900285 0.74

F: TTGTAAAGAGAATTAAGACGACGGA

R: GCAGGTTTGTTCGCAGGA

637
pTesta1106 Arahy.10:110603758 0.83

F: AAAAGGGATTGGTTCGTC

R: CCCTATTAAATCTACCTGAT

759
pTesta1109 Arahy.10:110962253 0.77

F: CATTTGTGCACTTGAATAC

R: CTTTCTAATGGTCCTCAAT

791
pTesta1112 Arahy.10:111238108 0.82

F: TTGACAATAAAAAATGAGG

R: GCCTTTAGCTCAATGAT

827
pTesta1131 Arahy.10:113126877 0.60

F: CAGATTTTGATAATGATTTGTGGTA

R: TTTTTTATTTTACATTTATGTTTGA

765
pTesta1136 Arahy.10:113621464 0.57

F: ATAGTAAAAATACTGGAAC

R: ATTCACGCGACTATCTCAA

652

Function validation of the candidate gene

To confirm that the purple testa was caused by the expression of the candidate gene J3K16L, we cloned this gene (Table S3). The coding sequence of this gene, driven by CaMV 35S promoter, was introduced into the tobacco. All transgenic lines expressing J3K16L exhibited different degrees of purple colour in leaves and flowers, while the fruits and testa were all showed dark purple colour (Figure 4). In addition, the seed coats of the transgenic lines exhibited also showed more purple (Figure 4). These results confirmed that J3K16L promoted anthocyanin accumulation in transgenic tobacco plants. In peanut, the expression level of J3K16L was also positive correlated with the purple testa and the accumulation of anthocyanin (Figure 1e, Figure 3d). Taken together, our work demonstrated that J3K16L was AhTc1, and the high‐level expression of AhTc1 confers to the phenotype of purple testa in peanut.

Figure 4.

Figure 4

Phenotypic analysis of transgenic tobacco. (a) The transgenic seedling overexpressing J3K16K gene. (b) Nontransgenic tobacco plant. (c) Flowers of transgenic plant and wild type. (d) Fruits of transgenic plant and wild type. (e) Seeds of transgenic plant and wild type.

Identification of differentially expressed genes (DEGs) between pink and purple testa cultivars

To further reveal the roles of AhTc1 and gene expression regulation in testa colour of peanut, we employed RNA‐seq technology to analyse the genome‐wide gene expression profiles for the pink (WH10) and purple (YH29) testa cultivars. RNA‐seq generated a total of 64.97 and 65.14 Gb clean reads from WH10 and YH29, and 82.43% and 82.90% of them could be mapped with the genome of Tifrunner, respectively. Between the WH10 and YH29, a total of 2814 DEGs were identified under the criterion of log2Ratio ≥ 1 (Figure 5a). Compared with WH10, the expression level of 1538 genes were up‐regulated, and 1275 genes were down‐regulated in YH29 (Figure 5a). In order to get a better understanding of the transcriptome differences between two peanut varieties, KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis was performed. We found that the many pathways were significantly enriched including ‘anthocyanin biosynthesis’, an important pathway that might contribute to regulating the synthesis of pigment and testa colour (Figure 5b).

Figure 5.

Figure 5

RNA‐seq analysis between WH10 and YH29. (a) Number of differentially expressed genes (DEGs). (b) KEGG enrichment analysis.

Discussion

AhTc1, encoding a MYB transcript factor, regulated purple testa in peanut

In this study, the evidences of forward genetics and reverse genetics all supported that AhTc1 was the key gene in controlling purple testa in peanut. Functional annotation and sequencing alignment showed that AhTc1 encodes a member of R2R3‐MYB transcript factor. R2R3‐MYB represents one of the largest transcript factor gene families in plants. R2R3‐MYB transcription factors were involved plant development, metabolism and responses to biotic and abiotic stresses (Carre and Kim, 2002; Chen et al., 2017; Dubos et al., 2010). Previous studies showed that a distinct clade of R2R3‐MYB transcription factors played key regulatory roles in anthocyanin biosynthesis (Chen et al., 2017; Lin‐Wang et al., 2010; Liu et al., 2015; Nguyen and Lee, 2016). Anthocyanins, belong to the flavonoid family, are one of the most important pigments which can increase red, blue and purple colours in a range of flowers, fruits, foliage, seeds and roots (Lin‐Wang et al., 2010; Tanaka et al., 2008). In plant, the biosynthesis of anthocyanin was regulated through a complex MYB, basic helix‐loop‐helix (bHLH) and WD‐repeat proteins (MYB‐bHLH‐WD40, MBW) complex (Baudry et al., 2004). In the complex, the activity of R2R3‐MYB might determine the patterning and spatial localization of anthocyanins, different family members of R2R3‐MYB contributing to the accumulation of anthocyanin in tissue or developmental specific patterns. For instance, in apple, MdMYB1, MdMYBA and MdMYB3 regulate the synthesis of red pigmented anthocyanins in the peel, MdMYB10 regulates the synthesis of anthocyanins in the peel, flesh and foliage, and MdMYB110a controls the accumulation of red in the fruit cortex during the later phase of fruit maturity (Chagne et al., 2013; Espley et al., 2007; Liu et al., 2015; Takos et al., 2006; Vimolmangkang et al., 2013). Despite the important roles of R2R3‐MYB and MBW complex in regulation of anthocyanins synthesis have been well elucidated in many other plants, however, functional information of R2R3‐MYB in peanut is scarce, especially in regulation of testa colour of peanut.

Different KEGG pathways involved testa colour in peanut

In this study, we identified a total of 2814 DEGs between the pink and purple testa colour peanut varieties (Figure 5a). Through KEGG analysis, we found that many of the pathways related to synthesis of other metabolites were also enriched, such as ‘vitamin B6 metabolism’, ‘Sesquiterpenoid and triterpenoid biosynthesis’, ‘phenylpropanoid biosynthesis’, ‘flavonoid biosynthesis’, ‘flavone and flavanol biosynthesis’ and ‘isoflavonoid biosynthesis’, these results implied that there might be the differences in flavour and other nutritional ingredient between purple and pink peanut varieties. Moreover, ‘plant–pathogen interaction’ was also identified as enriched between two peanut varieties (Figure 5b). Past studies showed that the accumulation and localization of anthocyanins and flavonoids were involved in stress resistance (Chalker‐Scott, 1999; Treutter, 2005). For example, in barley, the content of proanthocyanidins in testa layer was involved in defence against Fusarium species (Skadhauge et al., 1997). However, there is no evidence currently to show the resistance different between the purple and pink testa peanut varieties. In addition, GO (Gene Ontology) analysis also showed that many stress‐related GO terms were enriched, such as ‘response to wounding’, ‘regulation of jasmonic acid mediated signalling pathway’ and ‘regulation of defence response’, which supported the results of KEGG analysis (Table S4).

BSA, BSR and QTL‐seq technologies for QTLs/genes identification

BSA was first reported in mapping the disease resistance genes in lettuce (Michelmore et al., 1991). During the past decades, BSA has been used to map the quantitative trait locus (QTL) controlled by single gene or major gene. BSA can be used for mapping the traits, which is opposite or with significant differences between parents, and having enough individuals with an extreme phenotype in the segregation populations. BSA can quickly figure out the molecular markers closely linked to the target gene (trait) without genetic linkage map (Poulsen et al., 1995; Salunkhe et al., 2011; Yuan et al., 2013). Recently, a series of NGS‐based BSA approaches were developed, including MutMup, MutMup+, BSR‐seq and QTL‐seq (Abe et al., 2012; Fekih et al., 2013; Steuernagel et al., 2016; Takagi et al., 2013). QTL‐seq is a whole‐genome resequencing (WGRS)‐based approach and has strong advantages in the species with reference genome (Takagi et al., 2013; Zhang et al., 2018). In this study, AhTc1 was mapped using QTL‐seq in peanut. BSR was used to verify the QTL‐seq result. QTL‐seq and BSR had the same result, suggested that BSR was also an effective method for primary mapping of QTLs. In comparison with BSR, QTL‐seq could generate more polymorphisms SNPs, randomly distributed in the genome, which is important for narrowing the candidate regions. For BSR, more SNPs could be generated from the highly expressed genes. So, many SNPs in the noncoding sequences and noncoding region of genes could not be indexed. The cost of BSR was lower than that of QTL‐seq. QTL‐seq and BSR will be very useful in further peanut QTL mapping studies.

Applications of AhTc1 gene in peanut breeding

Testa colour is an important trait for peanut breeding programme. In addition to appearance quality, the purple testa peanuts contain higher levels of anthocyanin and many other microelements than pink testa peanuts (Attree et al., 2015; Kuang et al., 2017). The unknown genetic and molecular mechanisms regulating testa colour limited the molecular breeding of purple testa peanut. Several attempts have been made to analyse the genetics of purple testa formation and identify the potential markers for purple testa (Branch, 1985, 2001). However, most of the studies failed to provide tightly linked markers and useful information on the candidate genes controlling the purple testa. Since the testa colour is a maternal gene determined trait, the segregation occurs in the following next generation. The traditional breeding faces more difficulties for purple testa peanut selecting and breeding. In this study, the purple testa controlling gene AhTc1 was firstly mapped in 4.7 Mb region of chromosome A10, and a series of closely linked markers were developed (Table 3). The utility of these markers will accelerate the process of purple testa peanut breeding. In addition, we identified the candidate gene, AhTc1, a R2R3‐MYB transcription factor coding gene. The expression level of this gene was related to the colour of the plants, providing important target gene for testa colour modification using transgenic or gene editing methods.

Conclusion

The major gene regulating purple testa of peanut, AhTC1, was mapped in a 4.7 Mb region in chromosome A10 using QTL‐seq approach. A series of SNP markers were developed and genotyped for narrowing the candidate region, and a R2R3‐MYB transcription factor gene was identified. The evidences of forward genetics and reverse genetics all supported that R2R3‐MYB transcription factor gene was AhTc1, conferring purple testa phenotype. This work lays the foundation for the further understanding of the regulation mechanisms of peanut purple testa formation and molecular breeding of new varieties with purple testa.

Materials and methods

Plant materials and anthocyanin content determination

Weihua 10 (WH10), Zhonghua 8 (ZH8) and Yueyou 20 (GT‐C20) are peanut varieties with pink testa. YH29 and Zhonghua 9 (ZH9) are peanut varieties with purple testa. WH10 and GT‐C20 with pink testa were crossed with the purple testa variety YH29, respectively. In addition, ZH8 with pink testa was crossed with a purple testa peanut variety ZH9 to produce F7 recombinant inbred lines (RILs). All F1 hybrids were evaluated using SSR and MITE transposon markers (Qiu et al., 2018). Anthocyanin content of peanut testa was determined using the method reported in previous studies (Mancinelli et al., 1991; Serrano et al., 2012).

Nucleic acid isolation and bulked libraries construction

Peanuts were planted in the field of Shandong Academy of Agricultural Science (SAAS) in Jiyang. To prepare the bulk DNA, leaves deriving from 30 pink testa and 30 purple testa F2 lines were collected for DNA extraction, respectively. Total DNA was extracted using Plant Genomic DNA Extraction Kit (TIANGEN, Beijing, China) according to the instructions of the manufacturer (http://www.tiangen.com/en/?productShow/t1/4/id/10.html). Then, pink‐pool and purple‐pool were constructed by mixing the equal amount of DNA from 25 pink and 25 purple F2 individuals, respectively. For constructing the BSR‐seq libraries, leaflet (unfolding young leaves) from 30 purple and 30 green F2 individual lines of KF1 population, 10 purple and 10 green F2 individual lines of KF2 population, and 19 purple and 19 green F7 individual lines of ZH population were collected, respectively. RNA was extracted using RNAiso Plus (Takara, Dalian, China) according to the instructions of the manufacturer (http://www.takarabiomed.com.cn). The pink‐pool and purple‐pool for each population were constructed by mixing the equal amount of total RNAs. The integrity and purity of the DNA and RNA samples were determined by 1% agarose gel electrophoresis, and the concentration of DNA and RNA was measured in NanoDrop. High‐throughput sequencing for RNA and DNA libraries was performed by Novogene (http://www.novogene.com/) using an Illumina HiSeq 2500 system with 150 bp*2 paired‐end sequencing.

QTL‐seq, BSR‐seq and bioinformatic analysis

To obtain clean reads, the low‐quality reads ( 20), adapter sequences, > 10% reads and too short reads (<20 bp after trimming the adapters) were removed by tool Trimmomatic. Then, all clean reads were mapped with the genome sequences of Arachis hypogaea (PeanutBase version 1.0, https://peanutbase.org/) using mem module of BWA software with the harder filters of minimum number of identical bases 40 (Li and Durbin, 2009) for QTL‐seq data, or using mapper STAR for BSR‐seq data. The alignments were further filtered with criteria that require minimum 50 bp identical bases to the reference sequence, maximum 5% mismatch rate, maximum 5% clipped ends, maximum 50 bp gap, maximum 2 alleles and minimum 50 mapping score. The germline SNP was called using HaplotypeCaller module of GATK tools software following the best practice protocol. The raw SNPs/Indels were further filtered using requirements that only two genotypes exist, total sequencing depth >20 and <10 000, mutant allele sequencing depth >5, proportion of either allele >5%. The allele frequency was calculated based on the filtered read counts (discarded inner low‐quality genotypes) of both alleles at the SNP/Indel sites. Then, AFD (allele frequency difference) was the absolute value after subtraction of the allele frequency of the purple‐pool from that of the pink‐pool. Fisher exact test was applied to evaluate the allele frequency difference as well using read counts. If P‐value < 1e−5 and AFD > 0.5, the SNP was regarded as candidates linked to the trait.

STS markers development and candidate SNP validation

To develop the markers for the validation of QTL‐seq results and narrow the candidate region, the sequences containing 1500 bp upstream and 1500 bp downstream of SNPs were downloaded from PeanutBase (https://peanutbase.org). Due to the high similarity of A and B subgenomes, it is difficult to design the primers for only amplifying the A or B allele. Thus, the 3001 bp sequences were used to blast with the genome sequences of cultivated species Tifrunner, donor ancestor species A. duranensis (A genome) and A. ipaensis (B genome). According to the alignment results, we designed the specific primers only for A using primer premier 5.0 software (primer premier 5.0 software). PCR product direct sequencing was used for genotyping of these STS markers.

Gene cloning, vector construction and gene transformation

The ORF region of AhTc1 was amplified using EX Taq HS (Takara, Dalian, China) from ZH9 and cloned into pMD19‐T Vector (Takara) according to the instruction of manufacturer. The following primers were used: TestaORF‐F (5′‐TGCTCTAGAATGGAGGGATCCATAGGCCT‐3′) and TestaORF‐R: 5′‐AACTGCAGTTATTGTGGATCCCACAAAT‐3), in which the unique Xba I and Pst I sites were introduced at the 5′ and 3′ ends of the ORF of AhTc1, respectively. The plant expression vector containing 35S:AhTc1 was constructed by cleaving the ORF was from T vector and recombining into pCAMBIA2300‐35S‐OCS vector, and using CaMV 35S promoter to direct the expression of AhTc1. The insertion of the AhTc1 construct was confirmed by PCR as well as enzymatic activity assays. The recombined vector was transferred into Agrobacterium tumefaciens LBA4404 and then used to transform Nicotiana tabacum cv Nc89 using leaf disc method (Horsch et al., 1986). Transformed seeds were selected for kanamycin resistance, and the transgenic lines were confirmed by PCR method (Figure S2).

Conflict of interest

The authors declare that they have no conflict of interest.

Author contributions

XW and CZ conceived and designed the experiments. YZ, JM, ML, LD, GL, HX, SZ, LH, PL, MY, LR and JG performed the experiments. CZ, CM, PL and YZ analysed the data. CZ, XW, and BZ wrote and revised the manuscript.

Supporting information

Figure S1 Confirm the QTL‐seq results using BSR in three populations.

PBI-18-96-s005.pdf (158.1KB, pdf)

Figure S2 Schematic map of the transgene construction. A: schematic diagram for construction of pCAMBIA2300‐35S‐OCS‐AhTc1. B: Transformation, regeneration and transplant. C: PCR detection of transgenic lines.

PBI-18-96-s004.pdf (73.8KB, pdf)

Table S1 Detail information of SNPs identified using whole genome resequencing.

PBI-18-96-s003.xlsx (66.7MB, xlsx)

Table S2 Gene information in the candidate region on chromosome A10.

PBI-18-96-s006.xlsx (22.3KB, xlsx)

Table S3 Genomic and cDNA sequence information of candidate gene J3K16K.

PBI-18-96-s002.docx (17KB, docx)

Table S4 Top 30 enriched GO terms between pink and purple testa colour peanut varieties.

PBI-18-96-s001.xlsx (11.6KB, xlsx)

Acknowledgements

This research is supported by the Shandong Province Natural Science Foundation (ZR2017MC005), Natural Science Foundation of China (31861143009), Shandong Province Germplasm Innovation and Utilization Project (2016LZGC025) and Shandong Key Research Projects (2018GSF121032), Agricultural Scientific and Technological Innovation Project of SAAS (CXGC2018E13, CXGC2016B02) and Young Talents Training Program of SAAS.

Contributor Information

Chuanzhi Zhao, Email: chuanzhiz@126.com.

Xingjun Wang, Email: xingjunw@hotmail.com.

References

  1. Abe, A. , Kosugi, S. , Yoshida, K. , Natsume, S. , Takagi, H. , Kanzaki, H. , Matsumura, H. et al. (2012) Genome sequencing reveals agronomically important loci in rice using MutMap. Nat. Biotechnol. 30, 174–178. [DOI] [PubMed] [Google Scholar]
  2. Attree, R. , Du, B. and Xu, B. (2015) Distribution of phenolic compounds in seed coat and cotyledon, and their contribution to antioxidant capacities of red and black seed coat peanuts (Arachis hypogaea L.). J. Ind. Crops, 67, 448–456. [Google Scholar]
  3. Baudry, A. , Heim, M.A. , Dubreucq, B. , Caboche, M. , Weisshaar, B. and Lepiniec, L. (2004) TT2, TT8, and TTG1 synergistically specify the expression of BANYULS and proanthocyanidin biosynthesis in Arabidopsis thaliana . Plant J. 39, 366–380. [DOI] [PubMed] [Google Scholar]
  4. Bertioli, D.J. , Cannon, S.B. , Froenicke, L. , Huang, G. , Farmer, A.D. , Cannon, E.K. , Liu, X. et al. (2016) The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut. Nat. Genet. 48, 438–446. [DOI] [PubMed] [Google Scholar]
  5. Bertioli, D.J. , Jenkins, J. , Clevenger, J. , Dudchenko, O. , Gao, D. , Seijo, G. and Leal‐Bertioli, S.C.M. (2019) `The genome sequence of segmental allotetraploid peanut Arachis hypogaea . Nat. Genet. 51, 877–884. [DOI] [PubMed] [Google Scholar]
  6. Branch, W. (1985) Inheritance of purple and purple‐stripe testa colors in the peanut. J. Hered. 76, 225–226. [Google Scholar]
  7. Branch, W. (2001) Genetic relationship between purple and wine testa color in peanut. J. Peanut Sci. 28, 19–20. [Google Scholar]
  8. Carre, I.A. and Kim, J.Y. (2002) MYB transcription factors in the Arabidopsis circadian clock. J. Exp. Bot. 53, 1551–1557. [DOI] [PubMed] [Google Scholar]
  9. Chagne, D. , Lin‐Wang, K. , Espley, R.V. , Volz, R.K. , How, N.M. , Rouse, S. , Brendolise, C. et al. (2013) An ancient duplication of apple MYB transcription factors is responsible for novel red fruit‐flesh phenotypes. Plant Physiol. 161, 225–239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chalker‐Scott, L. (1999) Environmental significance of anthocyanins in plant stress responses. Photochemistry, 70, 1–9. [Google Scholar]
  11. Chen, X. , Li, H. , Pandey, M.K. , Yang, Q. , Wang, X. , Garg, V. , Li, H. et al. (2016) Draft genome of the peanut A‐genome progenitor (Arachis duranensis) provides insights into geocarpy, oil biosynthesis, and allergens. Proc. Natl Acad. Sci. USA, 113, 6785–6790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chen, Y.S. , Chao, Y.C. , Tseng, T.W. , Huang, C.K. , Lo, P.C. and Lu, C.A. (2017) Two MYB‐related transcription factors play opposite roles in sugar signaling in Arabidopsis. Plant Mol. Biol. 93, 299–311. [DOI] [PubMed] [Google Scholar]
  13. Clevenger, J. , Chu, Y. , Chavarro, C. , Botton, S. , Culbreath, A. , Isleib, T.G. , Holbrook, C.C. et al. (2018) Mapping late leaf spot resistance in peanut (Arachis hypogaea) using QTL‐seq reveals markers for marker‐assisted selection. Front. Plant Sci. 9, 83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dhillon, S.S. , Rake, A.V. and Miksche, J. (1980) Reassociation kinetics and cytophotometric characterization of peanut (Arachis hypogaea L.) DNA. J. Plant Physiol. 65, 1121–1127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Dubos, C. , Stracke, R. , Grotewold, E. , Weisshaar, B. , Martin, C. and Lepiniec, L. (2010) MYB transcription factors in Arabidopsis. Trends Plant Sci. 15, 573–581. [DOI] [PubMed] [Google Scholar]
  16. Espley, R.V. , Hellens, R.P. , Putterill, J. , Stevenson, D.E. , Kutty‐Amma, S. and Allan, A.C. (2007) Red colouration in apple fruit is due to the activity of the MYB transcription factor, MdMYB10. Plant J. 49, 414–427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fekih, R. , Takagi, H. , Tamiru, M. , Abe, A. , Natsume, S. , Yaegashi, H. , Sharma, S. et al. (2013) MutMap+: genetic mapping and mutant identification without crossing in rice. PLoS ONE, 8, e68529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hong, Y. , Lin, K. , Zhou, G. , Li, S.‐X. , Li, Y. and Liang, X.‐Q. (2007) Genetic linkage analysis of SSR markers and the gene for dark purple testa color in peanut (Arachis hypogaea L.). Chin. J. Oil Crop Sci. 29, 35. [Google Scholar]
  19. Horsch, R.B. , Klee, H.J. , Stachel, S. , Winans, S.C. , Nester, E.W. , Rogers, S.G. and Fraley, R.T. (1986) Analysis of Agrobacterium tumefaciens virulence mutants in leaf discs. Proc. Natl Acad. Sci. USA, 83, 2571–2575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Kuang, Q. , Yu, Y. , Attree, R. and Xu, B. (2017) A comparative study on anthocyanin, saponin, and oil profiles of black and red seed coat peanut (Arachis hypogacea) grown in China. J. Ind. Crops, 20, S131–S140. [Google Scholar]
  21. Lambrides, C.J. , Godwin, I.D. , Lawn, R.J. and Imrie, B.C. (2004) Segregation distortion for seed testa color in Mungbean (Vigna radiata L. Wilcek). J. Hered. 95, 532–535. [DOI] [PubMed] [Google Scholar]
  22. Li, H. and Durbin, R. (2009) Fast and accurate short read alignment with Burrows‐Wheeler transform. Bioinformatics, 25, 1754–1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lin‐Wang, K. , Bolitho, K. , Grafton, K. , Kortstee, A. , Karunairetnam, S. , McGhie, T.K. , Espley, R.V. et al. (2010) An R2R3 MYB transcription factor associated with regulation of the anthocyanin biosynthetic pathway in Rosaceae. BMC Plant Biol. 10, 50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Liu, J. , Osbourn, A. and Ma, P. (2015) MYB transcription factors as regulators of phenylpropanoid metabolism in plants. Mol. Plant, 8, 689–708. [DOI] [PubMed] [Google Scholar]
  25. Luo, H. , Pandey, M.K. , Khan, A.W. , Guo, J. , Wu, B. , Cai, Y. , Huang, L. et al. (2019) Discovery of genomic regions and candidate genes controlling shelling percentage using QTL‐seq approach in cultivated peanut (Arachis hypogaea L.). Plant Biotechnol. J. 1015013, 10.1111/pbi.13050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Mancinelli, A.L. , Rossi, F. and Moroni, A. (1991) Cryptochrome, phytochrome, and anthocyanin production. Plant Physiol. 96, 1079–1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Michelmore, R.W. , Paran, I. and Kesseli, R. (1991) Identification of markers linked to disease‐resistance genes by bulked segregant analysis: a rapid method to detect markers in specific genomic regions by using segregating populations. Proc. Natl Acad. Sci. USA, 88, 9828–9832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Nguyen, N.H. and Lee, H. (2016) MYB‐related transcription factors function as regulators of the circadian clock and anthocyanin biosynthesis in Arabidopsis. Plant Signal. Behav. 11, e1139278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Pandey, M.K. , Monyo, E. , Ozias‐Akins, P. , Liang, X. , Guimaraes, P. , Nigam, S.N. , Upadhyaya, H.D. et al. (2012) Advances in Arachis genomics for peanut improvement. Biotechnol. Adv. 30, 639–651. [DOI] [PubMed] [Google Scholar]
  30. Pandey, M.K. , Khan, A.W. , Singh, V.K. , Vishwakarma, M.K. , Shasidhar, Y. , Kumar, V. , Garg, V. et al. (2017a) QTL‐seq approach identified genomic regions and diagnostic markers for rust and late leaf spot resistance in groundnut (Arachis hypogaea L.). Plant Biotechnol. J. 15, 927–941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Pandey, M.K. , Wang, H. , Khera, P. , Vishwakarma, M.K. , Kale, S.M. , Culbreath, A.K. , Holbrook, C.C. et al. (2017b) Genetic dissection of novel QTLs for resistance to leaf spots and tomato spotted wilt virus in peanut (Arachis hypogaea L.). Front. Plant Sci. 8, 25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Poulsen, D.M. , Henry, R.J. , Johnston, R.P. , Irwin, J.A. and Rees, R.G. (1995) The use of bulk segregant analysis to identify a RAPD marker linked to leaf rust resistance in barley. Theor. Appl. Genet. 91, 270–273. [DOI] [PubMed] [Google Scholar]
  33. Qiu, J. , Zhao, Y. , Deng, L. , Ren, L. , Xia, H. , Fu, C. , Zhao, C. et al. (2018) Molecular identification on ture F1 hybrid of peanut. Shandong Agric. Sci. 50, 13–18. [Google Scholar]
  34. Salunkhe, A.S. , Poornima, R. , Prince, K.S. , Kanagaraj, P. , Sheeba, J.A. , Amudha, K. , Suji, K.K. et al. (2011) Fine mapping QTL for drought resistance traits in rice (Oryza sativa L.) using bulk segregant analysis. Mol. Biotechnol. 49, 90–95. [DOI] [PubMed] [Google Scholar]
  35. Serrano, M. , Kanehara, K. , Torres, M. , Yamada, K. , Tintor, N. , Kombrink, E. , Schulze‐Lefert, P. et al. (2012) Repression of sucrose/ultraviolet B light‐induced flavonoid accumulation in microbe‐associated molecular pattern‐triggered immunity in Arabidopsis. Plant Physiol. 158, 408–422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Shirasawa, K. , Bertioli, D.J. , Varshney, R.K. , Moretzsohn, M.C. , Leal‐Bertioli, S.C. , Thudi, M. , Pandey, M.K. et al. (2013) Integrated consensus map of cultivated peanut and wild relatives reveals structures of the A and B genomes of Arachis and divergence of the legume genomes. DNA Res. 20, 173–184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Skadhauge, B. , Thomsen, K.K. and Von Wettstein, D. (1997) The role of the barley testa layer and its flavonoid content in resistance to Fusarium infections. Hereditas, 126, 147–160. [Google Scholar]
  38. Steuernagel, B. , Periyannan, S.K. , Hernandez‐Pinzon, I. , Witek, K. , Rouse, M.N. , Yu, G. , Hatta, A. et al. (2016) Rapid cloning of disease‐resistance genes in plants using mutagenesis and sequence capture. Nat. Biotechnol. 34, 652–655. [DOI] [PubMed] [Google Scholar]
  39. Takagi, H. , Abe, A. , Yoshida, K. , Kosugi, S. , Natsume, S. , Mitsuoka, C. , Uemura, A. et al. (2013) QTL‐seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. Plant J. 74, 174–183. [DOI] [PubMed] [Google Scholar]
  40. Takos, A.M. , Jaffe, F.W. , Jacob, S.R. , Bogs, J. , Robinson, S.P. and Walker, A.R. (2006) Light‐induced expression of a MYB gene regulates anthocyanin biosynthesis in red apples. Plant Physiol. 142, 1216–1232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Tanaka, Y. , Sasaki, N. and Ohmiya, A. (2008) Biosynthesis of plant pigments: anthocyanins, betalains and carotenoids. Plant J. 54, 733–749. [DOI] [PubMed] [Google Scholar]
  42. Treutter, D. (2005) Significance of flavonoids in plant resistance and enhancement of their biosynthesis. Plant Biol. 7, 581–591. [DOI] [PubMed] [Google Scholar]
  43. Vimolmangkang, S. , Han, Y. , Wei, G. and Korban, S.S. (2013) An apple MYB transcription factor, MdMYB3, is involved in regulation of anthocyanin biosynthesis and flower development. BMC Plant Biol. 13, 176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Wang, P. , Song, H. , Li, C. , Li, P. , Li, A. , Guan, H. , Hou, L. et al. (2017) Genome‐Wide Dissection of the Heat Shock Transcription Factor Family Genes in Arachis. Front. Plant Sci. 8, 106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Yin, D. , Ji, C. , Ma, X. , Li, H. , Zhang, W. , Li, S. , Liu, F. et al. (2018) Genome of an allotetraploid wild peanut Arachis monticola: a de novo assembly. Gigascience, 7, 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Yuan, Y.W. , Sagawa, J.M. , Di Stilio, V.S. and Bradshaw, H.D. Jr . (2013) Bulk segregant analysis of an induced floral mutant identifies a MIXTA‐like R2R3 MYB controlling nectar guide formation in Mimulus lewisii . Genetics, 194, 523–528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Zhang, H. , Wang, X. , Pan, Q. , Li, P. , Liu, Y. , Lu, X. , Zhong, W. et al. (2018) QTG‐seq accelerates QTL fine mapping through QTL partitioning and whole‐genome sequencing on bulked segregant samples. Mol. Plant, 12, 426–437. [DOI] [PubMed] [Google Scholar]
  48. Zhao, X. , Chen, J. and Du, F. (2012) Potential use of peanut by‐products in food processing: a review. J. Food Sci. Technol. 49, 521–529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Zhao, C. , Qiu, J. , Agarwal, G. , Wang, J. , Ren, X. , Xia, H. , Guo, B. et al. (2017) Genome‐wide discovery of microsatellite markers from diploid progenitor species, Arachis duranensis and A. ipaensis, and their application in cultivated peanut (A. hypogaea). Front. Plant Sci. 8, 1209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Zhuang, W. , Chen, H. , Yang, M. , Wang, J. , Pandey, M.K. , Zhang, C. , Chang, W.C. et al. (2019) The genome of cultivated peanut provides insight into legume karyotypes, polyploid evolution and crop domestication. Nat. Genet. 51, 865–876. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1 Confirm the QTL‐seq results using BSR in three populations.

PBI-18-96-s005.pdf (158.1KB, pdf)

Figure S2 Schematic map of the transgene construction. A: schematic diagram for construction of pCAMBIA2300‐35S‐OCS‐AhTc1. B: Transformation, regeneration and transplant. C: PCR detection of transgenic lines.

PBI-18-96-s004.pdf (73.8KB, pdf)

Table S1 Detail information of SNPs identified using whole genome resequencing.

PBI-18-96-s003.xlsx (66.7MB, xlsx)

Table S2 Gene information in the candidate region on chromosome A10.

PBI-18-96-s006.xlsx (22.3KB, xlsx)

Table S3 Genomic and cDNA sequence information of candidate gene J3K16K.

PBI-18-96-s002.docx (17KB, docx)

Table S4 Top 30 enriched GO terms between pink and purple testa colour peanut varieties.

PBI-18-96-s001.xlsx (11.6KB, xlsx)

Articles from Plant Biotechnology Journal are provided here courtesy of Society for Experimental Biology (SEB) and the Association of Applied Biologists (AAB) and John Wiley and Sons, Ltd

RESOURCES