Abstract
Snap beans are a significant source of micronutrients in the human diet. Among the micronutrients present in snap beans are phenolic compounds with known beneficial effects on human health, potentially via their metabolism by the gut-associated microbiome. The genetic pathways leading to the production of phenolics in snap bean pods remain uncertain. In this study, we quantified the level of total phenolic content (TPC) in the Bean Coordinated Agriculture Program (CAP) snap bean diversity panel of 149 accessions. The panel was characterized spectrophotometrically for phenolic content with a Folin–Ciocalteu colorimetric assay. Flower, seed and pod color were also quantified, as red, purple, yellow and brown colors are associated with anthocyanins and flavonols in common bean. Genotyping was performed through an Illumina Infinium Genechip BARCBEAN6K_3 single nucleotide polymorphism (SNP) array. Genome-Wide Association Studies (GWAS) analysis identified 11 quantitative trait nucleotides (QTN) associated with TPC. An SNP was identified for TPC on Pv07 located near the P gene, which is a major switch in the flavonoid biosynthetic pathway. Candidate genes were identified for seven of the 11 TPC QTN. Five regulatory genes were identified and represent novel sources of variation for exploitation in developing snap beans with higher phenolic levels for greater health benefits to the consumer.
Keywords: common bean, Phaseolus vulgaris, flavonoids, association mapping, total phenolic compounds, genome wide association mapping
1. Introduction
The need for transdisciplinary efforts to bridge the gap between production agriculture and public health nutrition is clear [1]. A particularly useful channel through which this can occur is among scientists working in horticulture and in human health since horticulture is a science familiar to most consumers and is practiced by many. In particular, it is critical for nutritionists to recognize the many factors that are involved in providing consumers with health-beneficial cultivated varieties (cultivars) of a crop and that potential value-added health benefits may result if cultivar-specific food labeling becomes a reality [2,3]. From a horticultural perspective, this creates the opportunity for niche markets and for the direct marketing of specific cultivars to the consumer [4,5]. One crop that has received limited attention relative to its health benefits, particularly those related to phenolic compounds, is snap bean (Phaseolus vulgaris L.), which is the topic of this investigation.
Snap beans (also known as garden, green, string, or French beans) are a form of common bean. As opposed to dry bean, in which the mature seeds are eaten, the immature pods and seeds of snap beans are consumed as a vegetable. Dry beans are considered a good source of protein, complex carbohydrates, soluble fiber, folic acid and vitamin B. In contrast, because of their higher water content (~90% water compared to ~12%), snap beans have lower protein and total carbohydrate levels, but they possess certain vitamins and carotenoids that dry beans do not have or have only in trace amounts. These include vitamin C, β-carotene, lutein and zeaxanthin, α-tocopherol, and phylloquinone [6]. About 1.5 million Mg of snap beans are produced annually on a global basis [7]. The US accounts for 71% of world production of snap beans, and Americans consume about 3.0 kg per person per year [8], and as such, snap beans are a significant source of vitamins, carotenoids and other secondary plant compounds in the human diet.
Among secondary plant compounds are phenolics and related flavonoids. Phenolics constitute a large class of phytochemicals with over 8000 individual compounds having been identified. Structurally, they are characterized by an aromatic ring with one or more hydroxyl groups [9,10,11]. Phenolics and flavonoids are further divided into flavonols, flavones, flavan-3-ols, anthocyanidins, flavanones, isoflavones, and other minor subclasses, and nonflavonoids, which include phenolic acids, hydroxycinnamates, and stilbenes [9].
In general, phenolics are associated with various human health benefits including acting as anticarcinogens, antimicrobial agents, anti-inflammatories and improving vascular health, possibly by their metabolism via the gut-associated microbiome [12]. Total phenolic content has a strong antioxidant activity in legumes [13,14,15]. Dry bean consumption is associated with a reduced risk of diabetes, heart disease, colon cancer, and prostate cancer, and the mitigation of obesity [16,17,18,19]. These benefits may accrue in part from the phenolic content of seeds. We know less about the health benefits of snap beans, and to our knowledge, no human health studies similar to those conducted for dry beans have been reported.
Several research groups have characterized snap beans for various phenolics. Abu-Reidah et al. [20] found more than 72 compounds including 10 phenolic acids and 59 flavonoids. Predominant among these were the flavonol glucosides quercetin and kaempferol along with the phenolic acids (+) catechin and (−) epicatechin, chlorogenic and protocatechuic acids, as well as procyanidins [21], [22]. Flavonoid levels in processed fresh snap beans ranged from 20 to 51 mg kg−1 [23]. In a study of six wax and green bean cultivars, flavonols levels ranged from 19 to 184 µg g−1 fresh weight (FW) for quercetin derivatives and 6 to 15 µg g−1 for kaempferol derivatives [24]. Kleintop et al. [25] examined the total phenolic content (TPC) in fresh and frozen pods of a snap bean diversity panel of 149 accessions. They found more than a four-fold difference among cultivars, which ranged from 0.29 to 1.31 mg g−1 gallic acid equivalents (GAE). Higher levels were generally associated with cultivars that had colored seeds and a purple or pink flower color.
Snap beans grown for processing vs. the fresh market have different quality characteristics that may affect TPC. Processors require that cultivars be white-seeded to ensure that no water-soluble flavonoids are produced in any plant parts including the pods as they will leach into the liquor in canned beans and present as a colored ring in the testa in cut frozen beans, thereby reducing quality [26]. The standards for fresh market beans are less stringent, and while colored-seeded cultivars are acceptable, they are generally brown-seeded (produced by flavonols) rather than black or purple (a result of anthocyanins) because the former are less noticeable in the pod. Brown-seeded cultivars will generally have pods that are green in appearance, while purple- or black-seeded cultivars may have solid purple or striped purple and green pods.
In common bean, several gene combinations may give a white seed color, but in the commercial trade, essentially all white-seeded snap beans possess the recessive p gene. This gene is a major switch in the flavonoid biosynthetic pathway, with the dominant form allowing the expression of other color genes while the recessive allele blocks expression. P is located on Pv07 (Table 1) and appears to be a basic helix loop helix (bHLH) MYB transcription factor [27]. Other effects of p are a reduced lignin content in seeds and greater sensitivity to imbibitional injury and reduced germination and emergence if the seed is not protected with fungicides [28,29]. The types of flavonols and anthocyanins produced and their expression are controlled by more than 10 additional genes [30]. These can be divided into those that act in the flavonoid biosynthetic pathway and those that affect spatial and temporal expression [31]. Most but not all of these factors have been mapped with varying degrees of accuracy [27,32,33] (Table 1). Little is known about the genetic control of phenolic acids in common bean. Kleintop et al. [25] observed a substantial difference in TPC in colored-seeded vs. white-seeded accessions in a snap bean diversity panel, but within white-seeded types, variation in TPC was also noted, suggesting that there is a genetic variation for colorless phenolic compounds.
Table 1.
Gene 2 | Chrom. 3 | Marker/Gene Model | Linkage 4 | Start | End | Length 5 |
---|---|---|---|---|---|---|
cM | bp | |||||
B | 2 | ss715645998 | 0.0 | 48,634,623 | 48,634,743 | 121 |
Z | 3 | OAM10560 | 1.4 | 31,467,887 | 31,467,909 | 22 |
G | 4 | OAP7850 | 0.0 | 30,171,480 | 30,172,322 | 842 |
G | 4 | OAP31400 | 0.0 | 30,172,301 | 30,172,322 | 21 |
G | 4 | OU14900 | 0.0 | 31,093,347 | 31,093,369 | 22 |
V | 6 | OD12800 | 0.0 | 9,288,398 | 9,288,419 | 21 |
P | 7 | Phvul.007G171333 + Phvul.007G171466 | − | 28,752,132 | 28,774,743 | 22,611 |
Gy | 8 | OW17600 | 1.6 | 3,230,983 | 3,231,005 | 22 |
C | 8 | OAP2700 | 4.9 | 9,694,328 | 9,695,007 | 679 |
T | 9 | OM19400 | 1.4 | 11,731,567 | 11,731,944 | 377 |
Ana | 10 | OM9200 | 5.4 | 11,563,621 | 11,563,800 | 179 |
L | 10 | OL4525 | 1.2 | 41,443,673 | 41,443,694 | 21 |
1 Original linkage map and primer sequence data from [27,32,33]. 2 Color and patterning genes of common bean. Ana: modifies expression of T, B: greenish-brown seed testa, C: complex locus controlling seed testa patterning, G: yellow-brown seed testa, Gy: greenish-yellow seed testa, L: inhibitor of expression of partial coloring of t, P: recessive form conditions white seed testa and flowers, T: recessive form conditions partially colored testa, V: conditions violet seed testa, Z: modifier of expression of T. See [30] for additional detail. 3 Chromosome. 4 Linkage distance between the color/patterning gene (determined phenotypically) and marker as determined in [32,33]. 5 Length in base pairs refers to the length of the candidate gene (as in the case of p) or the length of a marker sequence linked to a color/patterning gene. 6 Not shown is Rk (recessive red seed testa), whose location is controversial; it has been located to both Pv01 [34] and Pv02 [35].
Genome-Wide Association Studies (GWAS) are a means of identifying and mapping quantitative trait loci (QTL) associated with phenotypic traits in plant populations. GWAS relies on linkage disequilibrium in individuals in regions in which QTL reside. A number of association-mapping studies have been conducted on common bean. These have analyzed traits associated with phenological variation [36,37,38], agronomic performance [39,40], biological nitrogen fixation [41], plant architecture [42], seed quality [43,44], and abiotic [45,46] and biotic [47,48,49], stress variation. None have focused on traits specific to snap bean. Association studies of phenolics in non-legume crops have focused primarily on cereals such as barley [50,51], whole rice [52,53,54], dehulled rice [55], wheat [56], sorghum [57], and maize [58], with single studies of tomato [59], rapeseed [60], and olive [61].
Considering the fairly substantial intake of snap beans in the United States, improving the phenolic content of snap beans could have a meaningful impact on human health. Therefore, the objective of this study was to conduct GWAS on a snap bean diversity panel that had been evaluated for total phenolic content. We wanted to 1) identify genomic regions associated with variation in TPC, 2) determine whether known color genes co-segregate with these regions, and 3) identify potentially novel genes that regulate TPC, especially in white-seeded cultivars, as a basis for further improving snap bean phenolic content.
2. Materials and Methods
2.1. Snap Bean Diversity Panel
One-hundred and forty-nine snap bean cultivars comprising the Bean CAP Snap Bean Diversity Panel were grown at Oregon State University (OSU) at Corvallis, OR during the 2010 growing season. The cultivars were planted into three field replicates using a randomized complete block design. Seeds were planted in 76 cm rows with a within row spacing averaging 5 cm in plots of 4 m in length. For fertility, 168 kg·ha−1 of 10N-4.4P-8.3K was banded in a row at planting. Weeds were controlled with application of 0.9 kg·ha−1 of Dual (Syngenta, Basel, Switzerland) S-metolachlor pre-emergent herbicide supplemented by mechanical cultivation. Western twelve-spotted cucumber beetles (Diabrotica undecimpunctata undecimpunctata) were controlled in young seedlings by the application of 1.0 L·ha−1 of Sevin (Bayer, Research Triangle Park, NC) 1-naphthyl N-methylcarbamate insecticide. Plots received approximately 2.5 cm of water per week from planting to harvest via solid set overhead sprinklers. Fresh pods of the predominant sieve size [25] for each cultivar were handpicked in the field at OSU, brought to the field laboratory, evaluated for fresh pod color and frozen at −20 °C for TPC analysis. Harvest maturity was determined by observing seed development and initiating harvest when seeds in the most mature pods had reached a length of approximately 10 mm. Approximately 10 pods per plot (50–100 g) were randomly selected from the plants within a plot. Frozen samples were shipped on dry ice to Colorado State University where they were stored at −80 °C until TPC analysis.
2.2. Total Phenolics Analysis
The TPC was analyzed spectrophotometrically using the Folin–Ciocalteu colorimetric method [62] adapted to a microplate format. The assay is based on the color reaction of phenolic compounds with Folin–Ciocalteu phenol reagent that absorbs light at 765 nm. The TPC is a measure of the reducing capacity of a sample through electron transfer reactions, expressed as gallic acid equivalents (GAE). All TPC values are expressed as milligrams GAE per gram of FW of tissue. Pod sample extracts were prepared according to the methods by Xu and Chang [14], and details of the procedure can be found in [25].
2.3. Phenotyping
Phenotypic traits used in the GWAS included flower and fresh pod color. Flowers were classified as white (vv) pink (vlaevlae) and purple (VV) and were coded to a three-point scale where 1 = white, 2 = pink and 3 = purple. The external color of fresh pods was quantified using a colorimeter (Minolta BC-10; Konica Minolta Sensing Americas, Ramsey, NJ) with data reported using the CIE (Commission Internationale de l’Eclairage) L*, a*, and b* scale [63]. L* is a measure of the brightness and ranges from 0 (black) to 100 (white). The a* vector represents colors from green (−) to red (+), and the b* vector measures blue (−) to yellow (+) in relative numbers [63].
2.4. Genotyping
A modified cetyl trimethyl ammonium bromide (CTAB) procedure was used to extract genomic DNA, and the resulting DNA samples were analyzed on an Illumina Infinium Genechip as described by [64]. The single nucleotide polymorphism (SNP) array utilized was composed of 10,546 allele-specific probes. The raw data was initially processed on GenomeStudio (v2.0.4) software (Illumina, San Diego, CA, USA). Two marker SNP positions contained greater than 20% missing data and were removed from the study. All missing data for the remaining SNPs was imputed using fastPHASE software (v1.4), including heterozygous SNPs which were treated as missing data. SNPs not assigned to a genomic position in Phytozome 12 (Phaseolus vulgaris, version 2.1) were removed from the study, resulting in 10,073 remaining SNPs.
2.5. Genome-Wide Association Mapping
GWAS was performed in R statistical software, version 3.3.2, (https://www.r-project.org/) using FarmCPU, version 1.02 (http://www.zzlab.net/FarmCPU/) and GAPIT, version 3.0, (https://www.maizegenetics.net/gapit) R packages, with data formatted in Microsoft Excel 2016 (https://office.microsoft.com/excel/) or TextPad version 7.5.1 (www.textpad.com). The principal component analysis utilized in the GWAS was generated with Tassel, version 5.2.24, (https://www.maizegenetics.net/tassel).
The iterative fixed and random model of FarmCPU was chosen as the primary model for GWAS with an added covariate of 1 PC. Analysis was performed in R using the FarmCPU source code provided by [65]. Data sets were also analyzed using an Efficient Mixed Model Association (EMMA) in GAPIT to only control for individual relatedness. For both models, a minor allele frequency (MAF) of 0.05 was used, which reduced the SNP number to 8008. The negative log P-value cutoff on the Manhattan plots was 4.912, representing a Bonferroni cutoff threshold using α = 0.05 to denote SNPs as significant. For the FarmCPU analysis, an additional line of code not shown in the manual was used to generate a complete list of SNPs in the results document: threshold. output = 1.
2.6. Candidate Gene Search
Literature on the known color genes of common bean were compiled to identify which portion of the phenolic biosynthetic pathway the gene might be involved in, and to identify known markers and their map positions (Table 1). For the sequence tagged site (STS) markers of McClean et al. [33], forward and reverse primer sequence matches were identified in Phytozome12 (Phaseolus vulgaris, version 2.1), and the chromosome and physical position obtained. A key word search for regulatory [66,67,68,69,70] and structural [35,71] genes in the phenolic biosynthetic pathway was conducted in supplemental tables obtained from [72] (updated to P. vulgaris genome sequence ver. 2.1) containing a list of Phaseolus vulgaris gene identifiers and their Arabidopsis thaliana gene homologues, gene symbols and annotations. The physical position of these candidate genes was compared to the physical position of SNPs identified by GWAS. A distance of approximately 350 kb upstream and downstream of the associated SNPs was used when considering the proximity of identified genes. This distance was established based on 1) the observed distance between SNP markers associated with the white- vs. colored-seed phenotype and the location of the candidate gene for p [27], and 2) the slow decay of linkage disequilibrium that is characteristic of highly self-pollinated species [73], and specifically common bean [74,75,76].
3. Results
3.1. Phenotypic Data
Detailed analysis of the phenotypic data can be found in Kleintop et al. [25] for TPC, pod color and flower color. TPC ranged from 0.29 to 1.31 mg g−1 GAE, with the highest levels being associated with colored flowers and pigmented pods. However, a two-fold variation (0.29–0.64 mg·g−1 GAE) was observed in TPC among white-flowered and -seeded accessions, implying variation in colorless phenolic compounds. The proportion of individuals with colored flowers and/or seeds is 16%, leading to relatively few individuals having a major influence on phenolics profiles. Pod color is also affected by non-phenolic compounds that affect the degree of greenness. Accessions with the lightest colored pods possessed the wax pod trait conditioned by y or were Refugee or Romano types [77].
3.2. Genome-Wide Association Study
A total of 30 quantitative trait nucleotides (QTN) significantly associated with all traits using a Bonferroni cut-off of 4.9 (using α = 0.05) were identified by both GAPIT and FarmCPU analyses (Table 2, Table 3 and Table 4). QTN were found on all chromosomes except Pv06 and Pv08. Two-thirds of the QTN were identified by FarmCPU and the remainder by GAPIT. QTN could be placed into about 21 groups with intervals of up to ~290 kb. Eleven SNPs were associated with TPC, six with pod color and 13 with flower color. The same SNP was associated with b* (B*3.2) and TPC (TPC3.1) on Pv03, with an effect of a negative shift (−8.05) in the yellow–blue axis being associated with increasing (0.27 mg·g−1 GAE) TPC (Table 2 and Table 3). TPC and flower color had two co-localized peaks found on Pv09 (Table 2 and Table 4). Both FarmCPU and GAPIT identified the same SNP on Pv07 for P, but the SNP identified by GAPIT did not meet the Bonferroni cutoff.
Table 2.
QTN Designation 1 | Method | ss ID No. 2 | Chrom. 3 | Position (bp) | Prob. Value | -Log10 P | MAF 4 | SNP 5 |
---|---|---|---|---|---|---|---|---|
TPC1.1 | FarmCPU | 715648077 | 1 | 33,470,635 | 7.952E-09 | 8.10 | 0.05 | C-T |
TPC1.2 | FarmCPU | 715646871 | 1 | 42,698,218 | 1.357E-10 | 9.87 | 0.07 | T-C |
TPC3.1 | GAPIT | 715646623 | 3 | 40,370,083 | 2.448E-07 | 6.61 | 0.05 | T-C |
TPC4.1 | FarmCPU | 715647815 | 4 | 2,190,413 | 2.912E-11 | 10.54 | 0.07 | G-A |
TPC7.1 | FarmCPU | 715647649 | 7 | 28,427,257 | 6.783E-08 | 7.17 | 0.29 | T-C |
TPC9.1 | GAPIT | 715647263 | 9 | 12,159,056 | 2.614E-06 | 5.58 | 0.09 | C-T |
TPC9.2 | GAPIT | 715646560 | 9 | 12,776,010 | 2.872E-07 | 6.54 | 0.08 | A-G |
TPC9.3 | FarmCPU | 715646559 | 9 | 13,053,019 | 7.182E-23 | 22.14 | 0.08 | G-A |
TPC10.1 | FarmCPU | 715639693 | 10 | 11,130,696 | 5.660E-12 | 11.25 | 0.06 | T-C |
TPC10.2 | FarmCPU | 715643755 | 10 | 23,719,802 | 2.746E-08 | 7.56 | 0.06 | C-T |
TPC11.1 | FarmCPU | 715650328 | 11 | 52,968,750 | 3.222E-07 | 6.49 | 0.05 | T-C |
1 Quantitative trait nucleotide. 2 National Center for Biotechnology Information (NCBI) Assay SNP ID. 3 Chromosome. 4 Minor allele frequency. 5 Allelic substitution at single nucleotide polymorphism (SNP) locus.
Table 3.
QTN Designation 1 | Method | ss ID No. 2 | Chrom. 3 | Position (bp) | Prob. Value | -Log10 P | MAF 4 | SNP 5 |
---|---|---|---|---|---|---|---|---|
A*2.1 | GAPIT | 715646673 | 2 | 542,087 | 2.684E-06 | 5.57 | 0.11 | G-T |
L*2.1 | GAPIT | 715639371 | 2 | 729,615 | 1.324E-06 | 5.88 | 0.18 | C-A |
L*2.1 | GAPIT | 715639372 | 2 | 766,293 | 1.324E-06 | 5.88 | 0.18 | C-T |
B*3.1 | FarmCPU | 715649460 | 3 | 3,463,964 | 2.601E-09 | 8.58 | 0.07 | T-C |
B*3.2 | GAPIT | 715646623 | 3 | 40,370,083 | 1.790E-07 | 6.75 | 0.05 | T-C |
B*5.1 | FarmCPU | 715645117 | 5 | 887,360 | 3.322E-06 | 5.48 | 0.14 | A-G |
1 L*, A* and B* are vectors for the CIE L*a*b* color scale; numbers following the color parameter indicate chromosome number and QTN number on that chromosome. 2 National Center for Biotechnology Information (NCBI) Assay SNP ID. 3 Chromosome. 4 Minor allele frequency. 5 Allelic substitution at SNP locus.
Table 4.
QTN Designation 1 | Method | ss ID No.2 | Chrom. 3 | Position (bp) | Prob. Value | -Log10 P | MAF 4 | SNP 5 |
---|---|---|---|---|---|---|---|---|
FC1.1 | FarmCPU | 715648402 | 1 | 8,955,838 | 3.088E-21 | 20.51 | 0.06 | A-C |
FC1.2 | FarmCPU | 715645902 | 1 | 48,430,158 | 3.215E-10 | 9.49 | 0.09 | T-C |
FC3.1 | FarmCPU | 715646620 | 3 | 40,621,263 | 1.087E-08 | 7.96 | 0.07 | T-C |
FC3.1 | FarmCPU | 715646619 | 3 | 40,634,185 | 1.087E-08 | 7.96 | 0.07 | A-G |
FC3.2 | FarmCPU | 715650182 | 3 | 40,901,620 | 1.087E-08 | 7.96 | 0.07 | T-C |
FC3.2 | FarmCPU | 715650183 | 3 | 40,922,819 | 1.087E-08 | 7.96 | 0.07 | A-C |
FC3.3 | FarmCPU | 715639426 | 3 | 41,081,362 | 1.087E-08 | 7.96 | 0.07 | C-T |
FC3.3 | FarmCPU | 715639423 | 3 | 41,169,500 | 1.087E-08 | 7.96 | 0.07 | T-G |
FC3.4 | FarmCPU | 715640920 | 3 | 43,174,196 | 1.087E-08 | 7.96 | 0.07 | A-C |
FC9.1 | GAPIT | 715647263 | 9 | 12,159,056 | 2.326E-07 | 6.63 | 0.09 | C-T |
FC9.2 | GAPIT | 715646560 | 9 | 12,776,010 | 7.518E-08 | 7.12 | 0.08 | A-G |
FC9.3 | FarmCPU | 715646559 | 9 | 13,053,019 | 8.124E-21 | 20.09 | 0.08 | G-A |
FC9.3 | GAPIT | 715648638 | 9 | 13,339,943 | 1.028E-06 | 5.99 | 0.19 | G-A |
1 Quantitative trait nucleotide. 2 National Center for Biotechnology Information (NCBI) Assay SNP ID. 3 Chromosome. 4 Minor allele frequency. 5 Allelic substitution at SNP locus.
3.3. Total Phenolic Content QTN
Eleven SNPs were associated with QTN for TPC (Table 2, Figure 1). Two separate QTN were observed on each of Pv01 and Pv10, and Pv09 had three separate QTN; the remainder had one QTN per chromosome. TPC7.1 appears to be associated with the P locus, which has been located to a region from 28,752,132–28,774,743 for the gene models Phvul.007G171333 and Phvul.007G171466 (Table 1). The SNP identified by FarmCPU was located about 320 kb from the candidate gene. A second SNP (ss715647650, P = 2.51 × 10−05) identified by GAPIT was located about 6 kb from the SNP identified by FarmCPU but did not meet the statistical threshold for significance. Two other color genes were located in the same general regions as significant QTN. TPC 9.1 and T were in proximity on Pv09 as were TPC10.1 and Ana. However, the distances between TPC SNPs and gene markers were about 427 and 433 kb, respectively, and were outside of our designated window for linkage disequilibrium decay. No other color genes for which the physical position could be estimated appear to be associated with TPC QTN. With the exception of TPC7.1 (MAF = 0.29), most QTNs had low minor allele frequencies ranging from 0.05–0.09 (Table 2).
3.4. Pod Color QTN
Pod color SNPs were associated with all three parameters of the L*a*b* scale (Table 3, Figures S1–S4). One SNP was associated with a* (red–green axis), two with L* (light–dark), and three with b* (blue–yellow axis) on three chromosomes. Three significant QTN identified by GAPIT on Pv02 form a 224kb interval spanning a region from 542,087–766,293 bp that includes SNPs for both L* and a*. The same SNP for A*2.1 was also associated with L*, but did not meet the Bonferroni cutoff. We have previously mapped the wax pod (y) locus to Pv02, which is 23 kb from the A*2.1 QTN. A possible candidate gene for this trait is Phvul.002G004400, located at 516,568–518,904kb [77]. It has been annotated in Arabidopsis as a pentatricopeptide repeat-containing protein, which is targeted to organelles and has been shown to cause cytoplasmic male sterility when it interacts with mitochondria and albinism when targeted to chloroplasts [78]. Our conclusion is that this pod color trait does not have any relationship to the genetic control of the phenolics biosynthetic pathway.
Some of the other pod color SNPs appear not to be associated with phenolics. For example, neither B*3.1 or B*5.1 show a relationship to TPC. However, as mentioned above, the SNP underlying B*3.2 is identical to TPC3.1, suggesting that this QTN is associated with a shift in pod color. None of the regions associated with pod color appeared to be associated with regions where classic bean color genes have mapped.
3.5. Flower Color SNPs
Thirteen SNPs on three chromosomes were associated with flower color (Table 4, Figures S5 and S6). Most were identified by FarmCPU, with three found by GAPIT. Two QTN were located on Pv01, four on Pv03 and three on Pv09. The first three groups on Pv03 spanned 12.9, 21.2 and 88.1 kb, respectively. FC3.1 was located about 255 kb from TPC3.1 and B*3.2. There were no significant QTN on Pv06 or Pv07 where genes are known to reside that qualitatively control flower color.
3.6. Candidate Genes in Proximity to Traits
A total of 545 gene models were found within 350 kb of the 11 TPC QTN (Table 5). Four-hundred and sixty-four of these had annotations for Arabidopsis homologues, while the function of 81 was unknown. Among all gene models, 19 appeared to be related to the phenolics biosynthetic pathway-based annotation, mainly as regulatory genes. The number of gene models within most intervals ranged from 52 to 67, but intervals on Pv01 and Pv10 were more sparsely populated, with 13–19 gene models observed. The greatest number of potentially phenolic-related gene models was found on Pv04, resulting from a cluster of 10 Cytochrome P450 genes arranged in tandem (Table 6).
Table 5.
QTN Designation | Gene Models ± 350 kb of QTN | Annotated Gene Models | Putative Phenolics Related Gene Models | Unknown Gene Models |
---|---|---|---|---|
no. | ||||
TPC1.1 | 19 | 18 | 1 | 1 |
TPC1.2 | 61 | 55 | 0 | 6 |
TPC3.1 | 67 | 61 | 0 | 6 |
TPC4.1 | 67 | 61 | 10 | 6 |
TPC7.1 | 56 | 48 | 2 | 8 |
TPC9.1 | 65 | 55 | 2 | 10 |
TPC9.2 | 61 | 45 | 1 | 16 |
TPC9.3 | 65 | 50 | 1 | 15 |
TPC10.1 | 13 | 13 | 0 | 0 |
TPC10.2 | 19 | 18 | 0 | 1 |
TPC11.1 | 52 | 40 | 2 | 12 |
Table 6.
QTN 1 | SNP Position | Distance 2 | P. vulgaris Gene Model | Start | End | A. thaliana Gene Model | Gene Abbreviation | Gene Function |
---|---|---|---|---|---|---|---|---|
bp | bp | |||||||
TPC1.1 | 33,470,635 | 83,056 | Phvul.001G121200 | 33,553,691 | 33,555,719 | AT2G41130.1 | BHLH106, STC8 | Basic helix-loop-helix (bHLH) DNA-binding superfamily protein |
TPC1.2 | 42,698,218 | -- | No candidates | -- | -- | -- | -- | -- |
TPC3.1 | 40,370,083 | -- | No candidates | -- | -- | -- | -- | -- |
TPC4.1 | 2,190,413 | 269,553 | Phvul.004G021100 | 2,459,966 | 2,462,674 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 |
275,490 | Phvul.004G021200 | 2,465,903 | 2,467,865 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 | ||
283,486 | Phvul.004G021300 | 2,473,899 | 2,476,387 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 | ||
289,437 | Phvul.004G021400 | 2,479,850 | 2,482,645 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 | ||
298,067 | Phvul.004G021500 | 2,488,480 | 2,491,886 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 | ||
304,827 | Phvul.004G021600 | 2,495,240 | 2,497,544 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 | ||
312,938 | Phvul.004G021700 | 2,503,351 | 2,505,988 | AT4G31970.1 | CYP82C2 | Cytochrome P450, family 82, subfamily C, polypeptide 2 | ||
321,346 | Phvul.004G021800 | 2,511,759 | 2,513,935 | AT4G31970.1 | CYP82C2 | Cytochrome P450, family 82, subfamily C, polypeptide 2 | ||
325,444 | Phvul.004G021900 | 2,515,857 | 2,518,683 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 | ||
338,085 | Phvul.004G022000 | 2,528,498 | 2,531,156 | AT4G31940.1 | CYP82C4 | Cytochrome P450, family 82, subfamily C, polypeptide 4 | ||
TPC7.1 | 28,427,257 | 324,875 | Phvul.007G171333 | 28,752,132 | 28,766,155 | AT4G09820.1 | BHLH42, TT8 | Basic helix-loop-helix (bHLH) DNA-binding superfamily protein |
345,156 | Phvul.007G171466 | 28,772,413 | 28,774,743 | -- | -- | Basic helix-loop-helix (bHLH) DNA-binding superfamily protein | ||
TPC7.1 | 28,433,796 | 318,336 | Phvul.007G171333 | 28,752,132 | 28,766,155 | AT4G09820.1 | BHLH42, TT8 | Basic helix-loop-helix (bHLH) DNA-binding superfamily protein |
338,617 | Phvul.007G171466 | 28,772,413 | 28,774,743 | -- | -- | Basic helix-loop-helix (bHLH) DNA-binding superfamily protein | ||
TPC9.1 | 12,159,056 | 30,719 | Phvul.009G069401 | 12,122,701 | 12,128,337 | AT5G19440.1 | -- | NAD(P)-binding Rossmann-fold superfamily protein |
28,423 | Phvul.009G069500 | 12,129,317 | 12,130,633 | AT1G51410.1 | -- | NAD(P)-binding Rossmann-fold superfamily protein | ||
TPC9.2 | 12,776,010 | 242,880 | Phvul.009G077300 | 13,018,890 | 13,020,479 | AT5G56840.1 | -- | myb-like transcription factor family protein |
TPC9.3 | 13,053,019 | 32,540 | Phvul.009G077300 | 13,018,890 | 13,020,479 | AT5G56840.1 | -- | myb-like transcription factor family protein |
TPC10.1 | 11,130,696 | -- | No candidates | -- | -- | -- | -- | -- |
TPC10.2 | 23,719,802 | -- | No candidates | -- | -- | -- | -- | -- |
TPC11.1 | 52,968,750 | 287,108 | Phvul.011G212600 | 53,255,858 | 53,257,674 | AT2G45400.1 | BEN1 | NAD(P)-binding Rossmann-fold superfamily protein |
291,448 | Phvul.011G212800 | 53,260,198 | 53,262,370 | AT2G45400.1 | BEN1 | NAD(P)-binding Rossmann-fold superfamily protein |
1 Quantitative trait nucleotide with letters designating trait (total phenolic content) and numbers following indicate chromosome number and QTN number on that chromosome. 2 Distance calculated from SNP to nearest end of gene model.
The potential phenolics pathway gene models included bHLH DNA-binding superfamily proteins, cytochrome P450 (various families and subfamilies), myb-like transcription factor family proteins, NAD(P)-binding Rossmann-fold superfamily protein, and Transducin/WD40 repeat-like superfamily protein (Table 6).
No candidate genes were found in proximity to TPC1.2, TPC3.1, TPC10.1 and TPC10.2 (Table 6). For TPC1.1, a single candidate gene was identified as Phvul.001G121200, located 83,056 bp distant from the SNP. The Arabidopsis homologue for Phvul.001G121200 is AT2G41130.1 which encodes the bHLH factor BHLH106. In Arabidopsis, it is also designated STC8 (salt tolerant callus 8) which has been shown to regulate a hyperosmotic salinity response in Arabidopsis [79], but in Vitus vinifera, BHLH106 is associated with the light-induced negative regulation of the anthocyanin pathway [80].
For TPC4.1, a cluster of 10 cytochrome P450, family 82, subfamily C, polypeptide 4 gene models (Phvul.004G021100 - Phvul.004G0220000 were found located from 269 to ~338 kb from the SNP. The corresponding Arabidopsis homologue is AT4G31940.1 (CYP82C4) (Table 6). In Arabidopsis, this gene is involved in iron uptake and transport, but in basil (Ocimum basilicum), it catalyzes the 7-O-methylation of flavones [81].
TPC7.1 is represented by two gene models that reside about 6.5 kb from each other (Table 6). TPC7.1 is associated with Phvul.007G171333 and Phvul.007G171466, which appear to be part of the same gene model and are ~318–345 kb from the SNP. These gene models have been shown to be homologous to the Arabidopsis gene model AT4G09820.1 which encodes for the basic helix loop helix DNA binding super family protein BHLH42 [27]. The Arabidopsis gene is transparent testa 8 (TT8), and in P. vulgaris, this controls flavonoid expression via the p locus.
For TPC9.1, two P. vulgaris genes of similar function are located within 28–30 kb of the SNP (Table 6). These are Phvul.009G069401 and Phvul.009G069500 which correspond to AT5G19440.1 and AT1G51410.1, respectively, which are uncharacterized NAD(P)-binding Rossmann-fold superfamily proteins. Among other functions, these types of proteins have been found to be involved in the flavonoid pathway [82].
TPC9.2 and TPC9.3 are located at 242,880 and 32,540, respectively, from the same candidate gene (Phvul.009G077300), which corresponds to the Arabidopsis homologue AT5G56840.1 (Table 6). This gene encodes an myb-like transcription factor family protein but remains uncharacterized in Arabidopsis. However, in tea (Camellia sinensis), an myb homologous to AT5G56840.1 is involved with epigallocatechin-3-gallate biosynthesis [83].
Two candidate genes of similar function were found for TPC11.1. These were Phvul.011G212600 and Phvul.011G212800, located within about 3 kb of each other and homologous to the same Arabidopsis gene model AT2G45400.1 (Table 6). The Arabidopsis gene has been named BEN1 (bri1-5 enhanced 1) and has been characterized as an NAD(P)-binding Rossmann-fold superfamily protein that encodes a dihydroflavonol 4-reductase (DFR)-like protein [82]. In Arabidopsis, BEN1 has been shown to be involved in the brassinosteroid metabolic pathway; however, BEN1 is a member of a small gene family that also includes dihydroflavonol 4-reductase and anthocyanidin reductase (BAN). In Brassica rapa, the homologous gene model (Bra004907.1) is involved in the flavonoid biosynthetic pathway [84].
4. Discussion
Because anthocyanins are well known for their red-to-blue color, and flavonols are associated with yellow-to-brown pigments, we classified snap bean accessions for flower and seed color as well as obtaining a color profile for pods, in addition to measuring TPC. However, other phenolic compounds may be colorless and may account for the two-fold variation in TPC that was observed apart from that attributed to flavonoids. Kleintop et al. [25] found moderate significant correlations between TPC and a* (0.40) and between TPC and b* (−0.34). There was essentially no correlation between TPC and L*. These results indicate that, of the parameters evaluated here, a* and b* are more likely to be associated with QTN for TPC than L*.
In this association study, we found 11 QTN for TPC located on seven chromosomes of common bean, but only one QTN (TPC7.1) was associated with a known major gene that has a large phenotypic effect. In addition, the QTN was located 324–245kb from the gene models associated with this trait. In a dry x snap bean mapping population, Hagerty et al. [32] found SNP markers associated with p that were located ~17.2Mb from the candidate genes identified by [27]. While the slow rate of LD decay in common bean may influence marker-gene associations, p is not located in a low recombination region of Pv07, and so there may be other explanations for why distances between this locus and markers are rather large.
Compared to the other phenotypic categories, there were relatively few pod color SNPs, and most were not likely to be associated with pigmented flavonoids. Based on the moderate correlations between pod color and total TPC identified in [25], we would expect that a* and b* would be most likely to be associated with phenolics; in this case, however, a* (and L*) are associated with the wax bean trait, which is marked by a deficit of chlorophyll and not variation in TPC. Three b* QTN may be related to phenolic-based pod color, which is supported by B*3.2-associated SNP also underlying TPC 3.1. In addition, FC 3.1 was located within the LD decay window at 255 kb from B*3.2 and TPC 3.1. These results all support the idea that a QTN affecting flower and pod color and phenolic content resides in this region of Pv03. This QTN does not appear to be related to any known color genes. All b* QTN may represent novel genes affecting pod color that have not been identified in classical genetic studies.
The findings for flower color QTN were unexpected. We predicted that we would find two or three QTN associated with P and V, and possibly T, because these genes are known to modify flower color. As described above, P is located on Pv07, whereas V has previously been placed on Pv06 (Table 1). Dominant P is required for observing segregation at V and T, which means that the number of genotypes that could be classified for these genes was small, and any associations that might have been found in GWAS would be discarded because they fell below a 5% MAF cut-off. The 13 QTN associated with flower color appeared to be novel and it is difficult to see how these were identified during GWAS, since the classification of genotypes was based on a three-point scale. This may point to the need for the better quantification of flower color. The pink class in particular can sometimes be difficult to classify as it varies from very similar to white on one hand to purple on the other hand. One QTN (FC 9.1) had the same underlying SNP as a TPC QTN (TPC 9.1), which suggests that the pigment difference in that case was real.
The data used in this study represent a single environment. In our previous study [25], we examined TPC over two years in a subset of pole and bush bean accessions. We found no genotype by the year of interaction for the indeterminate pole bean group, and a weak genotype by the year of interaction for determinant bush accessions. TPC levels were significantly higher in the second year, and the interaction term for the bush bean group was driven by changes in magnitude—particularly by one accession—rather than by cross-over interaction. Based on these results, we concluded that the genotype by environment effect on TPC in this population was relatively small, and that we could proceed with a GWAS analysis based on the single environment. In some cases, the magnitude of the TPC’s effects from GWAS in the present situation was small and would need to be verified in more environments to provide validation. With additional resources, it would be useful to conduct a more extensive analysis of environmental effects as well as other factors, such as pod maturity, on TPC.
While most structural genes in the phenolic biosynthetic pathway have been identified, the regulatory control of the pathway is in the early stages of analysis. Some regulatory factors have been identified, with the majority of these characterized in Arabidopsis [66,67,68,69,70,71] and grape [80]. One difficulty is that a function in one species is not necessarily the same in other species, especially where particular biosynthetic pathways may be missing (such as the glucosinolate biosynthetic pathway found in Brassicaceae but not in most other plant families). This works both ways, in that regulatory genes identified as being in the phenolics biosynthetic pathway in Arabidopsis may be involved in different pathways in another organism and vice versa. The lack of functional studies for regulatory genes in legumes in general and common bean in particular hampers the association of putative phenolics biosynthetic pathway regulatory genes with gene function. It may be that for several QTN (TPC 1.2, TPC 3.1, TPC 10.1 and TPC 10.2), novel genes may be involved. In the present case, these may take the form of unrecognized regulatory or structural genes, or genes that have yet to receive annotation in either model systems, such as Arabidopsis, or in the species of interest. The same may be true for those QTN for which we did identify potential genes. There may be another unrecognized candidate gene in the vicinity of the QTN. The ultimate proof will be research that demonstrates gene function.
The identification of QTN for health-promoting phenolics will be an asset to plant breeders. High-throughput molecular markers can readily be derived from SNP positions that have been identified as associated to a trait. Marker-assisted selection, along with simple colorimetric assays as described by [25], will result in a powerful set of tools for selecting for these traits. This will not only benefit fresh market beans but also processing beans because the white-seeded varieties required by processors can be bred for higher TPC. As previous research [25] has shown, the cultivar McCaslan No. 42 may be highly valuable with regard to developing white-seeded processing beans with increased TPC levels, and molecular markers will greatly assist in moving these traits into other snap bean backgrounds. Considering the fairly substantial intake of snap beans in the United States, improving the phenolic content of snap beans could have a meaningful impact on human health.
Acknowledgments
We wish to acknowledge the many commercial snap bean breeding programs that contributed cultivars to the snap bean diversity panel. JRM gratefully acknowledges the technical assistance of Joel Davis and Shinji Kawai in carrying out the field and greenhouse work at OSU on this project.
Supplementary Materials
The following are available online at https://www.mdpi.com/2072-6643/11/10/2509/s1, Figure S1. Manhattan and QQ plots for the CIE color parameter L* of pods from a Genome-Wide Association Mapping study of the Bean CAP snap bean diversity panel. Output from GAPIT using the EMMA model with one PCA. The horizontal line represents the Bonferroni cut-off (α = 0.05). Figure S2. Manhattan and QQ plots for CIE color parameter a* of pods from a Genome-Wide Association Mapping study of the Bean CAP snap bean diversity panel. Output from GAPIT using the EMMA model with one PCA. The horizontal line represents the Bonferroni cut-off (α = 0.05). Figure S3. Manhattan and QQ plots for b* of pods from a Genome-Wide Association Mapping study of the Bean CAP snap bean diversity panel. Output from GAPIT using the EMMA model with one PCA. The horizontal line represents the Bonferroni cut-off (α = 0.05). Figure S4. Manhattan and QQ plots for the CIE color parameter b* of pods from a Genome-Wide Association Mapping study of the Bean CAP snap bean diversity panel. Output from FarmCPU with one PCA. The horizontal line represents the Bonferroni cut-off (α = 0.05). Figure S5. Manhattan and QQ plots for flower color from a Genome-Wide Association study of the Bean CAP snap bean diversity panel. Output from FarmCPU with one PCA. The horizontal line represents the Bonferroni cut-off (α = 0.05). Figure S6. Manhattan and QQ plots for flower color from a Genome-Wide Association study of the Bean CAP snap bean diversity panel. Output from GAPIT using the EMMA model with one PCA. The horizontal line represents the Bonferroni cut-off (α = 0.05). The original SNP data used in this study has been deposited in the ScholarsArchive@ OSU and can be retrieved at: https://ir.library.oregonstate.edu/concern/datasets/m900p1589.
Author Contributions
Conceptualization, J.R.M., H.J.T., M.A.B. and P.E.M.; Data curation, J.R.M.; Formal analysis, J.R.M., L.T.W., S.M.M., A.E.K., R.L. and D.E.; Funding acquisition, J.R.M. and P.E.M.; Investigation, J.R.M., L.T.W., S.M.M., A.E.K., H.J.T., R.L. and M.A.B.; Methodology, J.R.M., L.T.W., S.M.M., A.E.K., D.E., H.J.T. and M.A.B.; Project administration, M.A.B. and P.E.M.; Resources, H.J.T., M.A.B. and P.E.M.; Visualization, L.T.W. and S.M.M.; Writing—original draft, J.R.M.; Writing—review & editing, J.R.M., L.T.W., S.M.M., A.E.K., H.J.T., M.A.B., R.L. and P.E.M.
Funding
This research was funded by USDA-NIFA Bean Coordinated Agricultural Project: USDA-NIFA-2009-85606-05964.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Thompson M.D., Thompson H.J. Biomedical agriculture: A systematic approach to food crop improvement for chronic disease prevention. Adv. Agron. 2009;102:1–54. [Google Scholar]
- 2.Hefferon K. Nutritionally enhanced food crops; progress and perspectives. Int. J. Mol. Sci. 2015;16:3895–3914. doi: 10.3390/ijms16023895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.FDA Guidance for Industry: Guide for Developing and Using Data Bases for Nutrition Labeling. [(accessed on 16 September 2019)]; Available online: http://www.fda.gov/regulatory-information/search-fda-guidance-documents/guidance-industry-guide-developing-and-using-data-bases-nutrition-labeling.
- 4.Bond C.A., Thilmany D.D., Bond J.K. What to choose? The value of label claims to fresh produce consumers. J. Agric. Resour. Econ. 2008;33:402–427. [Google Scholar]
- 5.Laborde L., McGee W. Food for Profit: Food Labels. Penn State Ext. 2019 Code UK153 04/19pod. [Google Scholar]
- 6.USDA USDA National Nutrient Database for Standard Reference. [(accessed on 12 July 2019)]; Available online: https://www.ars.usda.gov/northeast-area/beltsville-md-bhnrc/beltsville-human-nutrition-research-center/nutrient-data-laboratory/docs/usda-national-nutrient-database-for-standard-reference/
- 7.Myers J.R., Kmiecik K. The Common Bean Genome. Springer; Berlin, Germany: 2017. Common Bean: Economic importance and relevance to biological science research; pp. 1–20. [Google Scholar]
- 8.USDA Food Availability and Consumption. [(accessed on 12 July 2019)]; Available online: https://www.ers.usda.gov/data-products/ag-and-food-statistics-charting-the-essentials/food-availability-and-consumption/
- 9.Crozier A.B., Jaganath I.N., Clifford M. Dietary phenolics: Chemistry, bioavailability and effects on health. Nat. Prod. Rep. 2009;26:1001–1043. doi: 10.1039/b802662a. [DOI] [PubMed] [Google Scholar]
- 10.Hounsome N., Hounsome B., Tomos D., Edwards-Jones G. Plant metabolites and nutritional quality of vegetables. J. Food Sci. 2008;73:R48–R65. doi: 10.1111/j.1750-3841.2008.00716.x. [DOI] [PubMed] [Google Scholar]
- 11.Parr A.J., Bolwell G.P. Phenols in the plant and in man. The potential for possible nutritional enhancement of the diet by modifying the phenols content or profile. J. Sci. Food Agric. 2000;80:985–1012. doi: 10.1002/(SICI)1097-0010(20000515)80:7<985::AID-JSFA572>3.0.CO;2-7. [DOI] [Google Scholar]
- 12.Shahidi F., Ambigaipalan P. Phenolics and polyphenolics in foods, beverages and spices: Antioxidant activity and health effects—A review. J. Funct. Foods. 2015;18:820–897. doi: 10.1016/j.jff.2015.06.018. [DOI] [Google Scholar]
- 13.Amarowicz R., Pegg R.B. Legumes as a source of natural antioxidants. Eur. J. Lipid Sci. Technol. 2008;110:865–878. doi: 10.1002/ejlt.200800114. [DOI] [Google Scholar]
- 14.Xu B.J., Chang S.K.C. A comparative study on phenolic profiles and antioxidant activities of legumes as affected by extraction solvents. J. Food Sci. 2007;72:S159–S166. doi: 10.1111/j.1750-3841.2006.00260.x. [DOI] [PubMed] [Google Scholar]
- 15.Yang Q.-Q., Gan R.-Y., Ge Y.-Y., Zhang D., Corke H. Polyphenols in common beans (Phaseolus vulgaris L.): Chemistry, analysis, and factors affecting composition. Compr. Rev. Food Sci. Food Saf. 2018;17:1518–1539. doi: 10.1111/1541-4337.12391. [DOI] [PubMed] [Google Scholar]
- 16.Bazzano L.A., He J., Ogden L.G., Loria C., Vupputuri S., Myers L., Whelton P.K. Legume consumption and risk of coronary heart disease in US men and women: NHANES I Epidemiologic Follow-up Study. Arch. Intern. Med. 2001;161:2573–2578. doi: 10.1001/archinte.161.21.2573. [DOI] [PubMed] [Google Scholar]
- 17.Câmara C., Urrea C., Schlegel V. Pinto beans (Phaseolus vulgaris L.) as a functional food: Implications on human health. Agriculture. 2013;3:90–111. [Google Scholar]
- 18.Thompson M.D., Brick M.A., McGinley J.N., Thompson H.J. Chemical composition and mammary cancer inhibitory activity of dry bean. Crop. Sci. 2009;49:179–186. doi: 10.2135/cropsci2008.04.0218. [DOI] [Google Scholar]
- 19.Ganesan K., Xu B. Polyphenol-rich dry common beans (Phaseolus vulgaris L.) and their health benefits. Int. J. Mol. Sci. 2017;18:2331. doi: 10.3390/ijms18112331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Abu-Reidah I.M., Arráez-Román D., Lozano-Sánchez J., Segura-Carretero A., Fernández-Gutiérrez A. Phytochemical characterisation of green beans (Phaseolus vulgaris L.) by using high-performance liquid chromatography coupled with time-of-flight mass spectrometry. Phytochem. Anal. Pca. 2013;24:105–116. doi: 10.1002/pca.2385. [DOI] [PubMed] [Google Scholar]
- 21.Escarpa A., González M.C. Identification and quantitation of phenolics from green beans by high-performance liquid chromatography. Chromatographia. 2000;52:33–38. doi: 10.1007/BF02490789. [DOI] [Google Scholar]
- 22.Mattila P., Hellström J. Phenolic acids in potatoes, vegetables, and some of their products. J. Food Compos. Anal. 2007;20:152–160. doi: 10.1016/j.jfca.2006.05.007. [DOI] [Google Scholar]
- 23.Hertog M.G.L., Hollman P.C.H., Katan M.B. Content of potentially anticarcinogenic flavonoids of 28 vegetables and fruits commonly consumed in The Netherlands. J. Agric. Food Chem. 1992;40:2379–2383. doi: 10.1021/jf00024a011. [DOI] [Google Scholar]
- 24.Hempel J., Böhm H. Quality and quantity of prevailing flavonoid glycosides of yellow and green French beans (Phaseolus vulgaris L.) J. Agric. Food Chem. 1996;44:2114–2116. doi: 10.1021/jf9507001. [DOI] [Google Scholar]
- 25.Kleintop A.E., Myers J.R., Echeverria D., Thompson H.J., Brick M.A. Total phenolic content and associated phenotypic traits in a diverse collection of snap bean cultivars. J. Am. Soc. Hortic. Sci. 2016;141:3–11. doi: 10.21273/JASHS.141.1.3. [DOI] [Google Scholar]
- 26.Myers J.R., Baggett J.R. Improvement of Snap Bean. In: Singh S.P., editor. Common Bean Improvement in the Twenty-First Century. Kluwer; Dordrecht, The Netherlands: 1999. pp. 289–329. [Google Scholar]
- 27.McClean P.E., Bett K.E., Stonehouse R., Lee R., Pflieger S., Moghaddam S.M., Geffroy V., Miklas P., Mamidi S. White seed color in common bean (Phaseolus vulgaris) results from convergent evolution in the P (pigment) gene. New Phytol. 2018;219:1112–1123. doi: 10.1111/nph.15259. [DOI] [PubMed] [Google Scholar]
- 28.Dickson M.H. Breeding beans, Phaseolus vulgaris L., for improved germination under unfavorable low temperature conditions. Crop. Sci. 1971;11:848–850. [Google Scholar]
- 29.Powell A.A., Oliveira M.d.A., Matthews S. Seed vigour in cultivars of dwarf French bean (Phaseolus vulgaris) in relation to the colour of the testa. J. Agric. Sci. 1986;106:419–425. doi: 10.1017/S0021859600064030. [DOI] [Google Scholar]
- 30.Bassett M.J. Plant Breeding Reviews. Volume 28. John Wiley & Sons; New York, NY, USA: 2007. Genetics of seed coat color and pattern in common bean; pp. 239–315. [Google Scholar]
- 31.Bassett M.J., McClean P.E. A brief review of the genetics of partly colored seed coats in common bean. Annu. Rep. Bean Improv. Coop. 2000;43:99–101. [Google Scholar]
- 32.Hagerty C.H., Cuesta-Marcos A., Cregan P., Song Q., McClean P., Myers J.R. Mapping snap bean pod and color traits, in a dry bean x snap bean rcombinant inbred population. J. Am. Soc. Hortic. Sci. 2016;141:131–138. doi: 10.21273/JASHS.141.2.131. [DOI] [Google Scholar]
- 33.McClean P.E., Lee R.K., Otto C., Gepts P., Bassett M.J. Molecular and phenotypic mapping of genes controlling seed coat pattern and color in common bean (Phaseolus vulgaris L.) J. Hered. 2002;93:148–152. doi: 10.1093/jhered/93.2.148. [DOI] [PubMed] [Google Scholar]
- 34.Bassett M.J., Miklas P.N., Caldas G.V., Blair M.W. A dominant gene for garnet brown seed coats at the Rk locus in ‘Dorado’common bean and mapping Rk to linkage group 1. Euphytica. 2010;176:281–290. doi: 10.1007/s10681-010-0247-3. [DOI] [Google Scholar]
- 35.Reinprecht Y., Yadegari Z., Perry G.E., Siddiqua M., Wright L.C., McClean P.E., Pauls P.K. In silico comparison of genomic regions containing genes coding for enzymes and transcription factors for the phenylpropanoid pathway in Phaseolus vulgaris L. and Glycine max L. Merr. Front. Plant Sci. 2013;4:317. doi: 10.3389/fpls.2013.00317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Ates D., Asciogul T.K., Nemli S., Erdogmus S., Esiyok D., Tanyolac M.B. Association mapping of days to flowering in common bean (Phaseolus vulgaris L.) revealed by DArT markers. Mol. Breed. 2018;38:113. doi: 10.1007/s11032-018-0868-0. [DOI] [Google Scholar]
- 37.Cichy K.A., Porch T.G., Beaver J.S., Cregan P., Fourie D., Glahn R.P., Grusak M.A., Kamfwa K., Katuuramu D.N., McClean P., et al. A Phaseolus vulgaris diversity panel for Andean bean improvement. Crop. Sci. 2015;55:2149–2160. doi: 10.2135/cropsci2014.09.0653. [DOI] [Google Scholar]
- 38.Nascimento M., Nascimento A.C.C., Silva F.F.E., Barili L.D., Vale N.M.D., Carneiro J.E., Cruz C.D., Carneiro P.C.S., Serão N.V.L. Quantile regression for genome-wide association study of flowering time-related traits in common bean. PLoS ONE. 2018;13:e0190303. doi: 10.1371/journal.pone.0190303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kamfwa K., Cichy K.A., Kelly J.D. Genome-wide association study of agronomic traits in common bean. Plant. Genome. 2015;8 doi: 10.3835/plantgenome2014.09.0059. [DOI] [PubMed] [Google Scholar]
- 40.Moghaddam S.M., Mamidi S., Osorno J.M., Lee R., Brick M., Kelly J., Miklas P., Urrea C., Song Q., Cregan P., et al. Genome-wide association study identifies candidate loci underlying agronomic traits in a Middle American diversity panel of common bean. Plant. Genome. 2016;9 doi: 10.3835/plantgenome2016.02.0012. [DOI] [PubMed] [Google Scholar]
- 41.Kamfwa K., Cichy K.A., Kelly J.D. Genome-wide association analysis of symbiotic nitrogen fixation in common bean. Appl. Genet. 2015;128:1999–2017. doi: 10.1007/s00122-015-2562-5. [DOI] [PubMed] [Google Scholar]
- 42.Resende R.T., de Resende M.D.V., Azevedo C.F., E Silva F.F., Melo L.C., Pereira H.S., Souza T.L.P., Valdisser P.A.M., Brondani C., Vianello R.P. Genome-wide association and regional heritability mapping of plant architecture, lodging and productivity in Phaseolus vulgaris. G3 Genes Genomes Genet. 2018;8:2841–2854. doi: 10.1534/g3.118.200493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Cichy K.A., Wiesinger J.A., Mendoza F.A. Genetic diversity and genome-wide association analysis of cooking time in dry bean (Phaseolus vulgaris L.) Appl. Genet. 2015;128:1555–1567. doi: 10.1007/s00122-015-2531-z. [DOI] [PubMed] [Google Scholar]
- 44.Katuuramu D.N., Hart J.P., Porch T.G., Grusak M.A., Glahn R.P., Cichy K.A. Genome-wide association analysis of nutritional composition-related traits and iron bioavailability in cooked dry beans (Phaseolus vulgaris L.) Mol. Breed. 2018;38:44. doi: 10.1007/s11032-018-0798-x. [DOI] [Google Scholar]
- 45.Soltani A., MafiMoghaddam S., Walter K., Restrepo-Montoya D., Mamidi S., Schroder S., Lee R., McClean P.E., Osorno J.M. Genetic architecture of flooding tolerance in the dry bean Middle-American diversity panel. Front. Plant. Sci. 2017;8 doi: 10.3389/fpls.2017.01183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Soltani A., MafiMoghaddam S., Oladzad-Abbasabadi A., Walter K., Kearns P.J., Vasquez-Guzman J., Mamidi S., Lee R., Shade A.L., Jacobs J.L., et al. Genetic analysis of flooding tolerance in an Andean diversity panel of dry bean (Phaseolus vulgaris L.) Front. Plant. Sci. 2018;9 doi: 10.3389/fpls.2018.00767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Perseguini J.M.K.C., Oblessuc P.R., Rosa J.R.B.F., Gomes K.A., Chiorato A.F., Carbonell S.A.M., Garcia A.A.F., Vianello R.P., Benchimol-Reis L.L. Genome-wide association studies of anthracnose and angular leaf spot resistance in common bean (Phaseolus vulgaris L.) PLoS ONE. 2016;11:e0150506. doi: 10.1371/journal.pone.0150506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Tock A.J., Fourie D., Walley P.G., Holub E.B., Soler A., Cichy K.A., Pastor-Corrales M.A., Song Q., Porch T.G., Hart J.P. Genome-wide linkage and association mapping of halo blight resistance in common bean to race 6 of the globally important bacterial pathogen. Front. Plant. Sci. 2017;8:1170. doi: 10.3389/fpls.2017.01170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Zuiderveen G.H., Padder B.A., Kamfwa K., Song Q., Kelly J.D. Genome-wide association study of anthracnose resistance in Andean beans (Phaseolus vulgaris) PLoS ONE. 2016;11:e0156391. doi: 10.1371/journal.pone.0156391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Cai S., Han Z., Huang Y., Chen Z.-H., Zhang G., Dai F. Genetic diversity of individual phenolic acids in barley and their correlation with barley malt quality. J. Agric. Food Chem. 2015;63:7051–7057. doi: 10.1021/acs.jafc.5b02960. [DOI] [PubMed] [Google Scholar]
- 51.Mohammadi M., Endelman J.B., Nair S., Chao S., Jones S.S., Muehlbauer G.J., Ullrich S.E., Baik B.-K., Wise M.L., Smith K.P. Association mapping of grain hardness, polyphenol oxidase, total phenolics, amylose content, and β-glucan in US barley breeding germplasm. Mol. Breed. 2014;34:1229–1243. doi: 10.1007/s11032-014-0112-5. [DOI] [Google Scholar]
- 52.Jin L., Lu Y., Xiao P., Sun M., Corke H., Bao J. Genetic diversity and population structure of a diverse set of rice germplasm for association mapping. Appl. Genet. 2010;121:475–487. doi: 10.1007/s00122-010-1324-7. [DOI] [PubMed] [Google Scholar]
- 53.Shao Y., Tang F., Huang Y., Xu F., Chen Y., Tong C., Chen H., Bao J. Analysis of genotype x environment interactions for polyphenols and antioxidant capacity of rice by association mapping. J. Agric. Food Chem. 2014;62:5361–5368. doi: 10.1021/jf500951e. [DOI] [PubMed] [Google Scholar]
- 54.Xu F., Bao J., Kim T.-S., Park Y.-J. Genome-wide association mapping of polyphenol contents and antioxidant capacity in whole-grain rice. J. Agric. Food Chem. 2016;64:4695–4703. doi: 10.1021/acs.jafc.6b01289. [DOI] [PubMed] [Google Scholar]
- 55.Shao Y., Jin L., Zhang G., Lu Y., Shen Y., Bao J. Association mapping of grain color, phenolic content, flavonoid content and antioxidant capacity in dehulled rice. Appl. Genet. 2011;122:1005–1016. doi: 10.1007/s00122-010-1505-4. [DOI] [PubMed] [Google Scholar]
- 56.Nigro D., Laddomada B., Mita G., Blanco E., Colasuonno P., Simeone R., Gadaleta A., Pasqualone A., Blanco A. Genome-wide association mapping of phenolic acids in tetraploid wheats. J. Cereal Sci. 2017;75:25–34. doi: 10.1016/j.jcs.2017.01.022. [DOI] [Google Scholar]
- 57.Rhodes D.H., Hoffmann L., Jr., Rooney W.L., Ramu P., Morris G.P., Kresovich S. Genome-wide association study of grain polyphenol concentrations in global sorghum (Sorghum bicolor (L.) Moench) germplasm. J. Agric. Food Chem. 2014;62:10916–10927. doi: 10.1021/jf503651t. [DOI] [PubMed] [Google Scholar]
- 58.Wen W., Li D., Li X., Gao Y., Li W., Li H., Liu J., Liu H., Chen W., Luo J. Metabolome-based genome-wide association study of maize kernel leads to novel biochemical insights. Nat. Commun. 2014;5:3438. doi: 10.1038/ncomms4438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Ruggieri V., Francese G., Sacco A., D’Alessandro A., Rigano M.M., Parisi M., Milone M., Cardi T., Mennella G., Barone A. An association mapping approach to identify favourable alleles for tomato fruit quality breeding. BMC Plant. Biol. 2014;14:337. doi: 10.1186/s12870-014-0337-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Rezaeizad A., Wittkop B., Snowdon R., Hasan M., Mohammadi V., Zali A., Friedt W. Identification of QTLs for phenolic compounds in oilseed rape (Brassica napus L.) by association mapping using SSR markers. Euphytica. 2011;177:335–342. doi: 10.1007/s10681-010-0231-y. [DOI] [Google Scholar]
- 61.Kaya H.B., Cetin O., Kaya H.S., Sahin M., Sefer F., Tanyolac B. Association mapping in Turkish olive cultivars revealed significant markers related to some important agronomic traits. Biochem. Genet. 2016;54:506–533. doi: 10.1007/s10528-016-9738-9. [DOI] [PubMed] [Google Scholar]
- 62.Singleton V.L., Rossi J.A. Colorimetry of total phenolics with phosphomolybdic-phosphotungstic acid reagents. Am. J. Enol. Vitic. 1965;16:144–158. [Google Scholar]
- 63.CIE CIE Colorimetry—Part 4: 1976 L*a*b* Colour Space. [(accessed on 1 December 2018)]; Available online: http://cie.co.at/index.php?i_ca_id=485.
- 64.Wallace L., Arkwazee H., Vining K., Myers J. Genetic diversity within snap beans and their relation to dry beans. Genes. 2018;9:587. doi: 10.3390/genes9120587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Liu X., Huang M., Fan B., Buckler E.S., Zhang Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 2016;12:e1005767. doi: 10.1371/journal.pgen.1005767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Li S. Transcriptional control of flavonoid biosynthesis. Plant. Signal. Behav. 2014;9:e27522. doi: 10.4161/psb.27522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Xu W., Dubos C., Lepiniec L. Transcriptional control of flavonoid biosynthesis by MYB–bHLH–WDR complexes. Trends Plant. Sci. 2015;20:176–185. doi: 10.1016/j.tplants.2014.12.001. [DOI] [PubMed] [Google Scholar]
- 68.Chen L., Hu B., Qin Y., Hu G., Zhao J. Advance of the negative regulation of anthocyanin biosynthesis by MYB transcription factors. Plant. Physiol. Biochem. 2019;136:178–187. doi: 10.1016/j.plaphy.2019.01.024. [DOI] [PubMed] [Google Scholar]
- 69.Allan A.C., Espley R.V. MYBs drive novel consumer traits in fruits and vegetables. Trends Plant. Sci. 2018;23:693–705. doi: 10.1016/j.tplants.2018.06.001. [DOI] [PubMed] [Google Scholar]
- 70.Ma D., Constabel C.P. MYB repressors as regulators of phenylpropanoid metabolism in plants. Trends Plant. Sci. 2019 doi: 10.1016/j.tplants.2018.12.003. [DOI] [PubMed] [Google Scholar]
- 71.Cheynier V., Comte G., Davies K.M., Lattanzio V., Martens S. Plant phenolics: Recent advances on their biosynthesis, genetics, and ecophysiology. Plant. Physiol. Biochem. 2013;72:1–20. doi: 10.1016/j.plaphy.2013.05.009. [DOI] [PubMed] [Google Scholar]
- 72.Schmutz J., McClean P.E., Mamidi S., Wu G.A., Cannon S.B., Grimwood J., Jenkins J., Shu S., Song Q., Chavarro C. A reference genome for common bean and genome-wide analysis of dual domestications. Nat. Genet. 2014;46:707. doi: 10.1038/ng.3008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Flint-Garcia S.A., Thornsberry J.M., Buckler E.S. Structure of linkage disequilibrium in plants. Annu. Rev. Plant. Biol. 2003;54:357–374. doi: 10.1146/annurev.arplant.54.031902.134907. [DOI] [PubMed] [Google Scholar]
- 74.Diniz A.L., Giordani W., Costa Z.P., Margarido G.R., Perseguini J.M.K., Benchimol-Reis L.L., Chiorato A.F., Garcia A.A.F., Vieira M.L.C. Evidence for strong kinship influence on the extent of linkage disequilibrium in cultivated common beans. Genes. 2019;10:5. doi: 10.3390/genes10010005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Blair M.W., Cortés A.J., Farmer A.D., Huang W., Ambachew D., Penmetsa R.V., Carrasquilla-Garcia N., Assefa T., Cannon S.B. Uneven recombination rate and linkage disequilibrium across a reference SNP map for common bean (Phaseolus vulgaris L.) PLoS ONE. 2018;13:e0189597. doi: 10.1371/journal.pone.0189597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Valdisser P.A., Pereira W.J., Almeida Filho J.E., Müller B.S., Coelho G.R., de Menezes I.P., Vianna J.P., Zucchi M.I., Lanna A.C., Coelho A.S. In-depth genome characterization of a Brazilian common bean core collection using DArTseq high-density SNP genotyping. BMC Genom. 2017;18:423. doi: 10.1186/s12864-017-3805-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Myers J.R., Davis J., Arkwazee H., Wallace L., Lee R., Moghaddam S.M., McClean P. Why wax beans lack carotenoids. Annu. Rep. Bean Improv. Coop. 2018;61:29–30. [Google Scholar]
- 78.Barkan A., Small I. Pentatricopeptide repeat proteins in plants. Annu. Rev. Plant. Biol. 2014;65:415–442. doi: 10.1146/annurev-arplant-050213-040159. [DOI] [PubMed] [Google Scholar]
- 79.Ahmad A., Niwa Y., Goto S., Ogawa T., Shimizu M., Suzuki A., Kobayashi K., Kobayashi H. bHLH106 integrates functions of multiple genes through their G-box to confer salt tolerance on Arabidopsis. PLoS ONE. 2015;10:e0126872. doi: 10.1371/journal.pone.0126872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Ma Z.-H., Li W.-F., Mao J., Li W., Zuo C.-W., Zhao X., Dawuda M.M., Shi X.-Y., Chen B.-H. Synthesis of light-inducible and light-independent anthocyanins regulated by specific genes in grape ‘Marselan’ (V. vinifera L.) PeerJ. 2019;7:e6521. doi: 10.7717/peerj.6521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Berim A., Gang D.R. The roles of a flavone-6-hydroxylase and 7-O-demethylation in the flavone biosynthetic network of sweet basil. J. Biol. Chem. 2013;288:1795–1805. doi: 10.1074/jbc.M112.420448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Yuan T., Fujioka S., Takatsuto S., Matsumoto S., Gou X., He K., Russell S.D., Li J. BEN1, a gene encoding a dihydroflavonol 4-reductase (DFR)-like protein, regulates the levels of brassinosteroids in Arabidopsis thaliana. Plant. J. 2007;51:220–233. doi: 10.1111/j.1365-313X.2007.03129.x. [DOI] [PubMed] [Google Scholar]
- 83.Zheng S., Lin Y., Lai Z., Lin J. Isolation and characterisation of a MYB transcription factor associated with epigallocatechin-3-gallate biosynthesis in Camellia sinensis L. J. Hortic. Sci. Biotechnol. 2019;94:41–48. doi: 10.1080/14620316.2018.1454863. [DOI] [Google Scholar]
- 84.String String Database. [(accessed on 12 July 2019)]; Available online: https://string-db.org/cgi/network.pl?taskId=RHIDbolr9GEA.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.