Skip to main content
Frontiers in Plant Science logoLink to Frontiers in Plant Science
. 2013 Apr 15;4:95. doi: 10.3389/fpls.2013.00095

Association Studies and Legume Synteny Reveal Haplotypes Determining Seed Size in Vigna unguiculata

Mitchell R Lucas 1,*, Bao-Lam Huynh 2, Patricia da Silva Vinholes 2, Ndiaga Cisse 3, Issa Drabo 4, Jeffrey D Ehlers 1, Philip A Roberts 2, Timothy J Close 1
PMCID: PMC3625832  PMID: 23596454

Abstract

Highly specific seed market classes for cowpea and other grain legumes exist because grain is most commonly cooked and consumed whole. Size, shape, color, and texture are critical features of these market classes and breeders target development of cultivars for market acceptance. Resistance to biotic and abiotic stresses that are absent from elite breeding material are often introgressed through crosses to landraces or wild relatives. When crosses are made between parents with different grain quality characteristics, recovery of progeny with acceptable or enhanced grain quality is problematic. Thus genetic markers for grain quality traits can help in pyramiding genes needed for specific market classes. Allelic variation dictating the inheritance of seed size can be tagged and used to assist the selection of large seeded lines. In this work we applied 1,536-plex SNP genotyping and knowledge of legume synteny to characterize regions of the cowpea genome associated with seed size. These marker-trait associations will enable breeders to use marker-based selection approaches to increase the frequency of progeny with large seed. For 804 individuals derived from eight bi-parental populations, QTL analysis was used to identify markers linked to 10 trait determinants. In addition, the population structure of 171 samples from the USDA core collection was identified and incorporated into a genome-wide association study which supported more than half of the trait-associated regions important in the bi-parental populations. Seven of the total 10 QTLs were supported based on synteny to seed size associated regions identified in the related legume soybean. In addition to delivering markers linked to major trait determinants in the context of modern breeding, we provide an analysis of the diversity of the USDA core collection of cowpea to identify genepools, migrants, admixture, and duplicates.

Keywords: genome-wide association study, QTL analysis, single nucleotide polymorphism, cowpea, seed size, comparative genomics

Introduction

Cowpea is a warm-season legume grown throughout the tropics and several areas of the subtropics. West African countries led by Nigeria and Niger produce 70% of the world’s crop on 10 million ha (FAOSTAT, 2013). The North-Eastern region of Brazil is the second largest region of production, followed by Eastern and Southern Africa, South Asia, and North America. As is the case for other grain legumes, farmers’ and market acceptance of cowpea are driven by the visual appearance of the grain. In most markets, large seed size is desirable and this is reflected in price premiums for large cowpea grain. Across Africa a diversity of grain sizes and colors exist which have varying importance in local or regional contexts. In West Africa the two most important grain types are large white or brown with rough seed coat texture, while in East and Southern regions of Africa relatively smaller seeds with smooth texture and brown to red color predominate in markets. In the Western United States, Southern Europe, and the Middle-East the “blackeyed pea” cowpea predominates. This type of cowpea is characterized by a large grain and white seed coat with a pigmented “eye” around the hilum. Figure 1 displays a diversity of cowpea seed types. “Fresh-shell” varieties are also desired which are harvested before maturity for their large seed that can be easily removed from green pods. Consumer preference primarily demands large seed when grown for grain; however, small seed is preferred when seed is sold by volume for use as a fodder or cover crop.

Figure 1.

Figure 1

Popular cowpea seed types include “blackeyed” and “buff” represented by (A) California Blackeye 27 and (B) IT82E-18. However, a diversity of cowpea seed types exist (C).

Seed size has several agronomically important impacts. Large seeded cowpea have enhanced emergence when planted deep (up to 5 cm), tend to emerge earlier, and produce larger plants during early development (Lush and Wien, 1980). In contrast, while large seeds typically have advantages over small seeded competitors (Wulff, 1986), small seeds are desirable for early drought conditions because they are able to transpire less water relative to their ability to reach water supplies (Hendrix et al., 1992). This may be particularly important for semi-arid rainfed growing regions.

Seed size is a very stable component of grain yield with high heritability for many crop plants including wheat (Giura and Saulescu, 1996), soybean (Cober et al., 1997), cowpea (Drabo et al., 1984), and mung bean (Fery, 1980). Several genes are known to impact the inheritance of seed size in cowpea. Drabo et al. (1984) proposed that at least eight loci contribute to the quantitative inheritance of seed size and Fatokun et al. (1992) identified two major, unlinked genomic regions, one of which is orthologous to a seed size QTL in mung bean. The orthology of this locus was later confirmed by its identification and association to seed size in soybean (Maughan et al., 1996). Exploration of legume synteny for cowpea trait characterization continues to be a rewarding approach that has also been used to better describe resistance to fungal pathogens (Muchero et al., 2011; Pottorff et al., 2012a), tolerance to heat during reproductive development (Lucas et al., 2013a), and leaf morphology (Pottorff et al., 2012b).

The introgression of novel traits from diverse collections typically compromises seed size among progeny. Because of the importance of grain size in market appeal, recovery of adequate grain size is an important objective following elite × exotic crosses. Wide crosses are commonly pursued to help deliver new varieties with enhanced resistance to biotic and abiotic stress. Several cycles of backcrossing help recover elite characteristics including seed size; however, this process can be cumbersome and inefficient due to possible linkage drag and the polygenic nature of the trait. To help improve the selection of desirable lines we developed associations between genic SNP markers and seed size using experimental populations, a diversity collection, and knowledge of legume synteny.

Materials and Methods

Phenotype data

Seed size was calculated as weight per 100 seed. The seeds we measured were harvested from plants grown under favorable conditions whether in the field or in the greenhouse. This means that plants were well watered and treated with pesticides as needed. The populations which were used are presented in Table 1 and are among those used to develop the consensus genetic map of cowpea (Lucas et al., 2011). All populations were at least at the F8, except the IT84S-2246 × Mouride population which was phenotyped and genotyped at the F4 generation. All eight populations were grown in the greenhouse, while the CB27 × IT82E-18 and CB46 × IT93K-503 populations were also grown in field trials. The CB27 × IT82E-18 population was grown during the summers of 2010 and 2011 at the University of California Riverside Citrus Experiment Station in Riverside, Riverside, CA, USA. The CB46 × IT93K-503 population was grown during the summer of 2008 at two field stations led by (1) the Senegalese Institute of Agricultural Research (ISRA/CNRA) in Bambey, Senegal and (2) the Institut de l’Environnement et de Recherches Agricoles (INERA) at Kamboinse, Burkina Faso. In field trials ∼100 seeds per 6-m plot were planted in four replicates for each sample. In all greenhouse and field trials mature pods were harvested and dried for storage (<15% moisture). Seeds were subsequently cleaned from the pods, counted, and weighed to determine the weight of a random sample of 100 seeds. Seed size data provided online by Germplasm Resources Information Network (USDA-ARS and National Genetic Resources Program, 2013) was used for genome-wide association mapping.

Table 1.

Characteristics of eight bi-parental populations of cowpea used to associate loci with seed size.

Parents Population size* Number of polymorphic SNPs Population 100 seed weight (g)
Range Average
CB27 IT82E-18 160 430 8.53–30.96 17.58
CB27 UCR 779 56 560 9.57–29.81 18.28
CB27 24-125-B-1 87 329 9.98–28.50 17.54
CB46 IT93K-503 114 374 7.99–34.95 17.00
Dan Ila TVu-7778 79 288 3.26–19.50 12.77
524B IT84S-2049 85 438 12.55–24.15 17.29
TVu-14676 IT84S-2246 136 345 5.61–30.51 15.59
IT84S-2246 Mouride 87 347 12.55–22.35 16.70

*Number of samples used for QTL analysis after eliminating rogues.

Number of polymorphic SNPs out of 1,536 genotyped SNPs which could be mapped.

Genotype data

The 1,536-plex EST-derived SNP genotype data used to build the consensus genetic map of cowpea (Lucas et al., 2011) was also used to perform QTL analyses of the eight bi-parental populations. Genotype data for 171 individuals of the USDA core collection of cowpea (USDA Core) (Gillaspie et al., 1996) were also obtained using the 1,536-plex genotyping platform developed by our group (Muchero et al., 2009). SNP calls were exported for further processing from the Illumina GenomeStudio software (Illumina, 2010). Rogue individuals among the bi-parental populations which were described in Lucas et al. (2013b) were removed prior to QTL analysis. Similarly, genotype data for the USDA Core were used to identify and remove duplicate individuals using ParentChecker (Hu et al., 2012). ParentChecker was also helpful for formatting files for downstream analyses. SNPs were filtered on the basis of minor allele frequency (>0.20 for QTL and >0.10 for GWAS and analysis of population structure) to develop a set of polymorphic markers appropriate for analyses. The genotype data for the USDA core are provided in the Section “USDA Core Genotypes” in Supplementary Material.

Marker-trait associations

QTL IciMapping (Li et al., 2008) was used to perform inclusive composite interval mapping for seed size based on 100 seed weight data from eight bi-parental mapping populations. In a method similar to Lucas et al. (2013a), the genetic map used for QTL analyses was a composition of population specific map marker orders and distances, and consensus linkage groups assignments. Regions of the genome contributing major QTL were identified after considering (1) regions with LOD scores >3.0; (2) effect size >15% of phenotypic variance explained; (3) marker density; (4) span of the trait-associated region; (5) discovery in multiple populations or via GWAS; (6) haplotype consistency when QTL were discovered in multiple populations; and (7) homology with trait-associated regions in soybean. The potential effect of stacking favorable alleles for multiple QTL was also investigated by grouping lines based on their QTL composition. This was done for populations in which multiple QTLs were discovered. Individuals with no, one, or several favorable alleles underlying the seed size QTLs we report were grouped and the average seed size was determined for that group and compared to the population average. A single factor analysis of variance was performed to determine if differences in seed size were due to QTL content. The ICIM-EPI function within QTL IciMapping (Li et al., 2008) was used to search for QTL interactions.

Six-hundred and sixty-five EST-derived SNP markers with minor allele frequency >0.10 that were located among unique bins (one marker per bin) of the cowpea consensus genetic map were used to identify population structure of the subset of the USDA core. STRUCTURE (Pritchard et al., 2000) was used with BURNIN = 10,000 and NUMREPS = 50,000, with five runs of K = 1–15. The Evanno method (Evanno et al., 2005) facilitated by STRUCTURE HARVESTER (Earl and vonHoldt, 2012) was used in addition to CLUMPP (Jakobsson and Rosenberg, 2007) and DISTRUCT (Rosenberg, 2004) to reconcile genepools on the basis of geographic collection information provided online by GRIN (USDA-ARS and National Genetic Resources Program, 2013). Whole genome ancestry estimates (Q-matrix) computed from multiple STRUCTURE runs by CLUMPP were used as a covariate in the generalized linear model of association mapping provided by TASSEL 3.0 (Bradbury et al., 2007). Markers that showed −Log(P-Values) >3.0 that were also identified using the bi-parental populations were considered significantly associated.

Synteny analysis

Regions of the soybean genome syntenic with the cowpea seed size QTLs reported here were searched for seed size QTLs. HarvEST: cowpea (Wanamaker and Close, 2011) was used to identify synteny based on BLASTX scores (<10−10) between cowpea unigenes containing mapped SNPs and translated gene models from soybean (Schmutz et al., 2010). Soybean genomic locations homeologous to cowpea seed size QTL were reconciled with an abundance of soybean seed size QTL inventoried and integrated with the physical genome by SoyBase (Grant et al., 2010). Only soybean QTL that were within or tightly linked to the syntenic region (<3 million base pairs) were considered orthologous.

Results

Field and greenhouse trials

The two field trials using the CB27 × IT82E-18 population produced seed size data that were strongly correlated to each other (Pearson’s r = 0.83) and similar to that of the greenhouse trial (r = 0.59 and 0.63 for 2010 and 2011 respectively). This was also the case for the multiple trials of the CB46 × IT93K-503 population where field trials were correlated to each other (r = 0.30) and more so to the greenhouse trial (r = 0.52 and 0.31 for ISRA and INERA trials respectively). Figure 2 provides the phenotypic distribution of seed size among all eight bi-parental populations. The smallest seeded line had a 100 seed weight of 3.26 g and was produced by the parents Dan Ila and TVu-7778. The largest seeded line was produced by CB46 and IT93K-503 and had a 100 seed weight of 34.06 g. The average seed of an individual from the eight bi-parental populations had a 100 seed weight of 15.50 g. Phenotypic distributions of seed size for each trial are provided in Section “Population Seed Sizes” in Supplementary Material. Seed sizes for the parents of the mapping populations are provided in Section “Parent Seed Sizes” in Supplementary Material which ranged from 11.60 to 26.41 g per 100 seed. Phenotypic and genotypic characteristics of the mapping populations are provided in Table 1.

Figure 2.

Figure 2

Phenotypic distribution of seed size among eight bi-parental populations of cowpea.

Association studies

Ten QTL for seed size, representing ∼10% of the mapped cowpea genome were identified among the eight bi-parental populations (Table 2). Most had narrow spans (<5 cM), accounted for a substantial proportion of the phenotypic variance (average of 30%), and were associated with multiple SNP markers (average LOD > 8.5). LOD score traces for each QTL discovery are included in the Supplementary Material (“RIL QTL LOD”). Haplotypes associated with large and small seed were consistent among discovery populations when QTL were detected in multiple populations (“Alleles” in Supplementary Material). This is the situation for Css1 where markers 1_0974 and 1_0078 were detected among different experiments. Allelic variation important for seed size can be found among all parents of the bi-parental populations except for Dan Ila and IT84S-2049. The additive allelic effect of Css1 was similar (1.77 and 2.18 g) between multiple trials of the CB27 × IT82E-18 population. This is also true for the multi-trial detection of Css2 using the CB46 × IT93K-503 population (1.97 g for both experiments).

Table 2.

Ten seed size QTL identified among eight bi-parental populations of cowpea.

QTL name Discovery population(s) Large seed allele donor(s) LOD %Phe Additive (g) Linkage group QTL location (cM) Number of markers
Css – 1* CB27 × IT82E-18 IT82E-18 3.82–34.43 24.6–45.0 1.77–3.34 5 53.23–57.30 5
CB27 × UCR 779 UCR 779
TVu-14676 × IT84S-2246 TVu-14676
Css – 2§ CB27 × IT82E-18 CB27 4.64–7.94 4.2–17.2 1.03–1.97 7 18.70–22.68 6
CB46 × IT93K-503 CB46
TVu-14676 × IT84S-2246 TVu-14676
Css – 3 524B × IT84S-2049 524B 2.94–6.63 5.2–25.2 0.69–1.30 2 18.92–32.75 6
IT84S-2246 × Mouride IT84S-2246
TVu-14676 × IT84S-2246 TVu-14676
Css – 4§ CB27 × IT82E-18 CB27 5.37–5.42 9.9–26.8 1.12–1.16 6 31.28–57.41 4
IT84S-2246 × Mouride Mouride
Css – 5§ CB46 × IT93K-503 IT93K-503 5.30 41.1 1.70 8 55.20–61.60 2
Css – 6§ Dan Ila × TVu-7778 TVu-7778 3.38–3.69 20.3–40.8 1.24–1.38 10 0.00–4.90 3
IT84S-2246 × Mouride Mouride
Css – 7§ CB46 × IT93K-503 CB46 4.07 15.0 1.95 2 56.95–62.06 2
Css – 8§ CB27 × 24-125B-1 24-125B-1 5.18 26.7 1.80 6 3.82–4.55 2
Css – 9 CB27 × 24-125B-1 24-125B-1 3.24 16.6 1.42 5 18.50–21.57 2
Css – 10§ CB46 × IT93K-503 CB46 10.4 46.7 1.66 7 31.40–32.20 2

Statistical tests used to identify QTL from the discovery experiments are reported for the Log of Odds score (LOD), percent of phenotypic variation explained by the QTL (%Phe), and the absolute value of the additive effects (Additive). Linkage group and centi-Morgan (cM) positions were identified from the consensus genetic map of cowpea. The number of markers used to tag a QTL among the discovery experiments is also included. Underlined QTL were also identified in the genome-wide association study.

*Indicates the QTL was discovered by analyzing data from both field and greenhouse experiments.

Indicates the QTL was discovered by analyzing data from greenhouse experiments.

Indicates the QTL was discovered by analyzing data from field experiments.

§ Indicates the QTL is supported based on synteny to soybean seed size associated loci.

“Multi-QTL Effect” in Supplementary Material displays the potential for genetic gain by combining favorable alleles for multiple QTLs. QTL content has the most significant effect on seed size for the CB27 × IT82E-18 population F(3, 149) = 28.51, p = 1.25E-14, η2 = 0.36). This is also true for all other populations except for the CB46 × IT93K-503 population where groups based on QTL content are mainly different due to chance F(4, 86) = 1.29, p = 0.28, η2 = 0.06. See Section “Multi-QTL Effect” in Supplementary Material for test statistics for all populations. No significant QTL interactions were found among the discovery experiments.

Six-hundred and sixty-five SNPs which were polymorphic among 171 accessions of the USDA core were used to identify 27 duplicated accessions (“USDA Core Duplicates” in Supplementary Material). An additional 10 accessions were excluded from further analysis due to a lack of geographic collection information. This filter yielded 134 accessions appropriate for population structure and association analyses. Geographic collection information and the Evanno method (“Evanno Method” in Supplementary Material) supported four subpopulations which accounted for a substantial proportion of population structure underlying the USDA core (Figure 3). Genepool 1 was the most prolific and was comprised of a majority of the samples collected in Eastern and Southern Africa. Samples collected in Asia were categorized primarily in genepool 2, and West Africa and Turkey were identified as genepool 3 and genepool 4, respectively. The genomes of 46 samples were primarily derived from genepool 1 and up to 87 samples contained a substantial proportion originating from genepool 1 (“Merged Q-Matrix” in Supplementary Material). While only 8 samples could be attributed entirely to genepool 3, 43 samples were admixed with genepool 3. Samples collected in South America were almost always an admixture of genepools 1 and 3. Most of the migrants were collected in West Africa and Asia.

Figure 3.

Figure 3

Population structure underlying a subset of the USDA core collection of cowpea. Samples are first sorted based on their geographic location of collection and then sorted based on a coancestry matrix with K = 4.

Thirty-six SNP loci used in the GWAS of the USDA core surpassed −Log(P-value) thresholds and confirmed six of the ten QTL proposed by the bi-parental populations (Figure 4). This information is incorporated into Table 2 and is more comprehensively provided in Section “GLM” in Supplementary Material.

Figure 4.

Figure 4

Genome-wide association analysis of seed size using the USDA core collection of cowpea. Loci surpassing significance thresholds that were also associated with seed size among the bi-parental populations are boxed.

Synteny

Based on the syntenic relationships described by Lucas et al. (2011), 7 out of the 10 QTL identified in the bi-parental populations were supported by knowledge of seed size in soybean (Tables 2 and 3). A total of 19 associations between markers and seed size developed in soybean (Orf et al., 1999; Csanadi et al., 2001; Specht et al., 2001; Hoeck et al., 2003; Hyten et al., 2004; Zhang et al., 2004; Panthee et al., 2005; Reinprecht et al., 2006; Chen et al., 2007; Gai et al., 2007) were in regions homoeologs to the cowpea seed size QTL reported here. More details of the synteny analysis are presented in Section “Synteny Analysis” in Supplementary Material.

Table 3.

Seven QTL controlling the inheritance of seed size in cowpea are syntenic to regions with known association to seed size in soybean.

Cowpea Soybean
Soybean seed size associations
QTL Chr Start End QTL1 QTL2 QTL3
Css–2 1 1484033 3411009 Sdwt 18-1.2
9 39678719 40435370 Sdwt 10-10 Sdwt 15-6
Css–4 19 48637729 50058893 Sdwt 12-3 Sdwt 13-9
9 39678719 40435370 Sdwt 10-10 Sdwt 15-6
Css–5 4 1776391 1787115 Sdwt 13-4
Css–6 7 185839 2305626 Sdwt 10-11 Sdwt 7-6
8 15355701 17919999 Sdwt 22-1
Css–7 10 45955504 47689454 Sdwt 25-4
20 35530502 38505007 Sdwt 15-5 Sdwt 24-3
Css–8 8 14338579 15148040 Sdwt 22-1
Css–10 11 5421236 5680908 Sdwt 21-1 Sdwt 22-2 Sdwt 25-2
1 49378273 49834904 Sdwt 15-2 Sdwt 18-1.1 Sdwt 7-4

Chromosome (Chr) and base pair starting (Start) and ending (End) positions in the soybean genome are indicated for each syntenic relationship. Up to three soybean QTL can be found within or tightly linked (<3 Mbp) to the syntenic span; except Sdwt 18-1.1, 21-1, 22-2, and 25-2 which are <5 Mbp.

Discussion

Cowpea with specific seed size can be predicted on the basis of marker-trait associations. These associations provide a foundation for marker-assisted breeding and can be developed through QTL analysis and association mapping which couple phenotypes, genotypes, and a genetic map (Figure 5). DNA markers tagging allelic variation underlying seed size QTL can be used to track trait determinants among breeding cycles. This approach facilitates the simultaneous improvement of a variety for different traits of interest.

Figure 5.

Figure 5

General pathway of marker-assisted breeding strategies which rely heavily on the development of marker-trait associations.

From the standpoint of breeding, the most applicable association studies assess broad pedigrees and tag associated genomic regions with dense markers. Marker-trait associations identified in one population may not segregate or contribute to the inheritance of the trait in a different population. To support new marker-trait associations we used multiple populations, two popular methodologies (QTL and GWAS), and knowledge of seed size in soybean. The intent of this study was to assess allelic variation important for the inheritance of seed size in cowpea primarily in the context of marker-assisted breeding and comparative genomics. The associations developed in this study would be best validated after years of using them in breeding; however, we feel this work provides an important framework for future breeding initiatives and explores the potential of genomics to help deliver new varieties of cowpea. The accuracy of these marker-trait associations could be assessed by comparing the estimated additive allelic effects reported here with realized gains after using these markers for selection.

Based on our analyses there is a large potential to produce larger-seeded lines by combining favorable alleles for multiple QTLs. However, our analysis of this potential is limited because our study lacked recombinants for all possible QTL combinations. A more complete view could be provided by studying the behavior of QTLs outside of their discovery pedigree. This could be accomplished by pursuing a mating scheme which used lines from different pedigrees and with different QTL content.

Breeders interested in using marker-trait associations would benefit from knowledge of linkage between trait determinants. The locations of the QTLs reported here are mainly unlinked or distantly linked to other traits characterized using the consensus genetic map of cowpea, including heat tolerance during reproductive development (Lucas et al., 2013a), leaf morphology (Pottorff et al., 2012b), and resistance to Fusarium oxysporum f.sp. tracheiphilum race 3 (Pottorff et al., 2012a). One (Thr1) of the three QTL known to impact resistance to feeding damage caused by foliar thrips (Thrips tabaci and Frankliniella schultzei) (Lucas et al., 2012) overlaps with a seed size QTL (Css3) reported in the current work. The markers within this overlapping region include 1_0164 and 1_0589 where genotypes homozygous for AA at these were associated with large seed and thrips resistance. This means it would require a rare recombination to break the linkage between resistance to foliar thrips conferred by Thr1 and a small seed conferred by Css3. Other overlaps can be found between seed size QTL and regions associated with Macrophomina phaseolina resistance (Muchero et al., 2011) (Css9 with Mac6, and Css4 with Mac8). Therefore using markers linked to these overlapping regions may simultaneously affect seed size and resistance to Macrophomina. In such regions a higher density of markers would be useful for marker-assisted breeding.

SoyBase (Grant et al., 2010) is an excellent resource for legume researchers. The integration of QTL studies with the physical map made it possible for us to survey commonalities among association studies performed in plants of different genera. Such knowledge may provide paths for mechanistic studies aiming to pinpoint trait determinants. From the standpoint of this study, the co-localization of seed size QTL in soybean and cowpea provides a level of validation for new marker-trait associations. Knowledge of legume synteny and trait determinants would be enhanced by developing resources similar in density to the soybean community for other legumes (i.e., common bean, cowpea, mung bean, peanut, chickpea, etc.). An agricultural project that would be coordinated among groups with expertise in different legumes could greatly enhance comparative resources and the efficiency of new initiatives.

The fact that our study uses approaches capable of clarifying the domestication history and dispersal of modern cowpea does not escape our attention; however, due to sample size we advocate a conservative interpretation of the diversity analysis using the USDA core as presented here. Rather than focusing on potential insight concerning cowpea domestication or proposing new marker-trait associations, we present the results of the genome-wide association study only to provide a modest assessment of collection diversity and to help support QTL identified among the bi-parental populations. The International Institute of Tropical Agriculture maintains a diverse collection which has been previously characterized on the basis of geographic, agronomic, and botanical descriptors (Mahalakshmi et al., 2007), but no collection of cowpea has been viewed in light of dense genotype data. The financial costs required for genotyping the rest of the USDA core is inexpensive relative to the value of the insight that can be gained. From our analysis of a small subset (171 samples) of the entire USDA core (720 samples) we were able to identify many duplicated accessions (∼17%), overrepresentation of the South/East African genepool, and we were able to perform an association study which mainly agreed with the QTL studies stemming from the bi-parental recombinant-inbred populations. SNP data from the entire core collection could be used to improve the diversity collection and its impact on the cowpea community. Phenotypic data for a number of traits are available on GRIN and could be combined with genotype data, similar to this work, to facilitate the discovery of numerous marker-trait associations. The use of historical data would be a cost effective approach to improve knowledge of cowpea genetic diversity and allelic variation contributing to the inheritance of agronomically important traits. The feasibility of this approach was recently supported within the barley community (Wang et al., 2011). That work helped demonstrate the utility of historical data after careful consideration of population size and experimental design. Furthermore, a comparative analysis of the diversity among core collections (i.e., IITA, USDA, and UCR) would be valuable for identifying instances of ascertainment bias and duplicated accessions possibly known by different names. This is a documented issue for U.S. collections (Vigna Crop Germplasm Committee, 1996), and continued application of genotype data to identify duplicates would be particularly helpful in cutting costs associated with the maintenance of collections and for designing new experiments.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/Plant_Genetics_and_Genomics/10.3389/fpls.2013.00095/abstract

Acknowledgments

This work was supported by the CGIAR Generation Challenge Program through a grant from the Bill and Melinda Gates Foundation, U.S. Agency for International Development Collaborative Research Support Program Grants GDG-G-00-02-00012-00 and EDH-A-00-07-00005, by the United States Department of Agriculture Project Number: 6607-21000-010-20, and by the Genetics, Genomics, and Bioinformatics program at UC Riverside.

References

  1. Bradbury P. J., Zhang Z., Kroon D. E., Casstevens T. M., Ramdoss Y., Buckler E. S. (2007). TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635 10.1093/bioinformatics/btm308 [DOI] [PubMed] [Google Scholar]
  2. Chen Q., Zhang Z., Liu C., Xin D., Qiu H., Shan D., et al. (2007). QTL analysis of major agronomic traits in soybean. Agric. Sci. China 6, 399–405 10.1016/S1671-2927(07)60159-X [DOI] [Google Scholar]
  3. Cober E. R., Voldeng H. D., Fregeau-Reid J. A. (1997). Heritability of seed shape and seed size in soybean. Crop Sci. 37, 1767–1769 10.2135/cropsci1997.0011183X003700060017x [DOI] [Google Scholar]
  4. Csanadi G., Vollmann J., Stift G., Lelley T. (2001). Seed quality QTL identified in a molecular map of early maturing soybean. Theor. Appl. Genet. 103, 912–919 10.1007/s001220100621 [DOI] [Google Scholar]
  5. Drabo I., Redden R., Smithson J. B., Aggarwal V. D. (1984). Inheritance of seed size in cowpea (Vigna unguiculata (L.) Walp.). Euphytica 33, 929–934 10.1007/BF00021923 [DOI] [Google Scholar]
  6. Earl D. A., vonHoldt B. M. (2012). STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 10.1007/s12686-011-9548-7 [DOI] [Google Scholar]
  7. Evanno G., Regnaut S., Goudet J. (2005). Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 10.1111/j.1365-294X.2005.02449.x [DOI] [PubMed] [Google Scholar]
  8. FAOSTAT (2013). Agriculture organization of the United Nations. Available at: http://faostat.fao.org/ (accessed on April 3, 2013).
  9. Fatokun C. A., Menancio-Hautea D. I., Danesh D., Young N. D. (1992). Evidence for orthologous seed weight genes in cowpea and mung bean based on RFLP mapping. Genetics 132, 841–846 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Fery R. L. (1980). “Genetics of vigna,” in Horticultural Reviews, ed. Janick J. (Westport: AVI Publishing; ), 311–394 [Google Scholar]
  11. Gai J., Wang Y., Wu X., Chen S. (2007). A comparative study on segregation analysis and QTL mapping of quantitative traits in plants-with a case in soybean. Front. Agric. China 1, 1–7 10.1007/s11703-007-0022-y [DOI] [Google Scholar]
  12. Gillaspie A. G., Chambliss O. L., Fery R. L., Hall A. E., Miller J. C., Jr., Morelock T. E. (1996). A core subset established for the USDA cowpea [Vigna unguiculata (L.) Walp.] germplasm collection. Hortic. Sci. 31, 762 [Google Scholar]
  13. Giura A., Saulescu N. N. (1996). Chromosomal location of genes controlling grain size in a large-grained selection of wheat (Triticum aestivum L.). Euphytica 89, 77–80 10.1007/BF00015722 [DOI] [Google Scholar]
  14. Grant D., Nelson R. T., Cannon S. B., Shoemaker R. C. (2010). SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Res. 38, D843–D846 10.1093/nar/gkp798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Hendrix S. D., Nielsen E., Nielsen T., Schutt M. (1992). Are seedlings from small seeds always inferior to seedlings from large seeds? New Phytol. 119, 299–305 10.1111/j.1469-8137.1991.tb01034.x [DOI] [PubMed] [Google Scholar]
  16. Hoeck J. A., Fehr W. R., Shoemaker R. C., Welke G. A., Johnson S. L., Cianzio S. R. (2003). Molecular marker analysis of seed size in soybean. Crop Sci. 43, 68–74 10.2135/cropsci2003.0068 [DOI] [Google Scholar]
  17. Hu Z., Ehlers J. D., Roberts P. A., Close T. J., Lucas M. R., Wanamaker S., et al. (2012). ParentChecker: a computer program for automated inference of missing parental genotype calls and linkage phase correction. BMC Genet. 13:9. 10.1186/1471-2156-13-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hyten D. L., Pantalone V. R., Sams C. E., Saxton A. M., Landau-Ellis D., Stefaniak T. R., et al. (2004). Seed quality QTL in a prominent soybean population. Theor. Appl. Genet. 109, 552–561 10.1007/s00122-004-1661-5 [DOI] [PubMed] [Google Scholar]
  19. Illumina (2010). GenomeStudio Genotyping Module v.2010.3. San Diego, CA: Illumina, Inc [Google Scholar]
  20. Jakobsson M., Rosenberg N. A. (2007). CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 10.1093/bioinformatics/btm233 [DOI] [PubMed] [Google Scholar]
  21. Li H., Ribaut J. M., Li Z., Wang J. (2008). Inclusive composite interval mapping (ICIM) for digenic epistasis of quantitative traits in biparental populations. Theor. Appl. Genet. 116, 243–260 10.1007/s00122-007-0663-5 [DOI] [PubMed] [Google Scholar]
  22. Lucas M. R., Diop N. N., Wanamaker S., Ehlers J. D., Roberts P. A., Close T. J. (2011). Cowpea-soybean synteny clarified through an improved genetic map. Plant Genome 4, 218–225 10.3835/plantgenome2011.06.0019 [DOI] [Google Scholar]
  23. Lucas M. R., Ehlers J. D., Roberts P. A., Close T. J. (2012). Markers for quantitative inheritance of resistance to foliar thrips in cowpea. Crop Sci. 52, 2075–2081 10.2135/cropsci2011.12.0684 [DOI] [Google Scholar]
  24. Lucas M. R., Ehlers J. D., Huynh B. L., Diop N. N., Roberts P. A., Close T. J. (2013a). Markers for breeding heat tolerant cowpea. Mol. Breed. 31, 529–536 10.1007/s11032-012-9810-z [DOI] [Google Scholar]
  25. Lucas M. R., Hunyh B. L., Ehlers J. D., Roberts P. A., Close T. J. (2013b). High-resolution single nucleotide polymorphism genotyping reveals a significant problem among breeder resources. Plant Genome 6. 10.3835/plantgenome2012.08.0020 [DOI] [Google Scholar]
  26. Lush W. M., Wien H. C. (1980). The importance of seed size in early growth of wild and domesticated cowpeas. J. Agric. Sci. 94, 177–182 10.1017/S0021859600028033 [DOI] [Google Scholar]
  27. Mahalakshmi V., Ng Q., Lawson M., Ortiz R. (2007). Cowpea [Vigna unguiculata (L.) Walp.] core collection defined by geographical, agronomical, and botanical descriptors. Plant Genet. Resour. 5, 113–119 10.1017/S1479262107837166 [DOI] [Google Scholar]
  28. Maughan P. J., Saghai-Maroof M. A., Buss G. R. (1996). Molecular-marker analysis of seed-weight: genomic locations, gene action, and evidence for ortholgous evolution among three legume species. Theor. Appl. Genet. 93, 574–579 10.1007/BF00417950 [DOI] [PubMed] [Google Scholar]
  29. Muchero W., Diop N. N., Bhat P. R., Fenton R. D., Wanamaker S., Pottorff M., et al. (2009). A consensus genetic map of cowpea [Vigna unguiculata (L) Walp.] and synteny based on EST-derived SNPs. Proc. Natl. Acad. Sci. U.S.A. 106, 18159–18164 10.1073/pnas.0905886106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Muchero W., Ehlers J. D., Close T. J., Roberts P. A. (2011). Genic SNP markers and legume synteny reveal candidate genes underlying QTL for Macrophomina phaseolina resistance and maturity in cowpea [Vigna unguiculata (L) Walp.]. BMC Genomics 12:8. 10.1186/1471-2164-12-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Orf J. H., Chase K., Jarvik T., Mansur L. M., Cregan P. B., Adler F. R., et al. (1999). Genetics of soybean agronomic traits: I. Comparison of three related recombinant inbred populations. Crop Sci. 39, 1642–1651 10.2135/cropsci1999.0011183X003900020063x [DOI] [Google Scholar]
  32. Panthee D. R., Pantalone V. R., West D. R., Saxton A. M., Sams C. E. (2005). Quantitative trait loci for seed protein and oil concentration, and seed size in soybean. Crop Sci. 45, 2015–2022 10.2135/cropsci2004.0720 [DOI] [Google Scholar]
  33. Pottorff M., Wanamaker S., Ma Y. Q., Ehlers J. D., Roberts P. A., Close T. J. (2012a). Genetic and physical mapping of candidate genes for resistance to Fusarium oxysporum f.sp. tracheiphilum race 3 in cowpea [Vigna unguiculata (L.) Walp]. PLoS ONE 7:e41600. 10.1371/journal.pone.0041600 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Pottorff M., Ehlers J. D., Fatokun C., Roberts P. A., Close T. J. (2012b). Leaf morphology in cowpea [Vigna unguiculata (L.) Walp]: QTL analysis, physical mapping and identifying a candidate gene using synteny with model legume species. BMC Genomics 13:234. 10.1186/1471-2164-13-234 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Pritchard J. K., Stephens M., Donnelly P. (2000). Inference of population structure using multilocus genotype data. Genetics 155, 945–959 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Reinprecht Y., Poysa V., Yu K., Rajcan I., Ablett G., Pauls K. (2006). Seed and agronomic QTL in low linolenic acid, lipoxygenase-free soybean (Glycine max (L.) Merrill) germplasm. Genome 4, 1510–1527 [DOI] [PubMed] [Google Scholar]
  37. Rosenberg N. A. (2004). DISTRUCT: a program for the graphical display of population structure. Mol. Ecol. Notes 4, 137–138 10.1046/j.1471-8286.2003.00566.x [DOI] [Google Scholar]
  38. Schmutz J., Cannon S. B., Schlueter J., Ma J., Mitros T., Nelson W., et al. (2010). Genome sequence of the palaeopolyploid soybean. Nature 463, 178–183 10.1038/nature08670 [DOI] [PubMed] [Google Scholar]
  39. Specht J. E., Chase K., Macrander M., Graef G. L., Chung J., Markwell J. P., et al. (2001). Soybean response to water: a QTL analysis of drought tolerance. Crop Sci. 41, 493–509 10.2135/cropsci2001.412493x [DOI] [Google Scholar]
  40. USDA-ARS and National Genetic Resources Program (2013). Germplasm Resources Information Network – (GRIN). Beltsville, MD: National Germplasm Resources Laboratory [Google Scholar]
  41. Vigna Crop Germplasm Committee (1996). Vigna Germplasm Current Status and Future Needs. Beltsville, MD: National Germplasm Resources Laboratory [Google Scholar]
  42. Wanamaker S., Close T. J. (2011). HarvEST. Cowpea Version 1.27. Riverside: HarvEST [Google Scholar]
  43. Wang H., Smith K. P., Combs E., Blake T., Horsley R. D., Muehlbauer G. J. (2011). Effect of population size and unbalanced data sets on QTL detection using genome-wide association mapping in barley breeding germplasm. Theor. Appl. Genet. 124, 111–124 10.1007/s00122-011-1691-8 [DOI] [PubMed] [Google Scholar]
  44. Wulff R. D. (1986). Seed size variation in Desmodium paniculatum II. Effects on seedling growth and physiological performance. J. Ecol. 74, 99–114 10.2307/2260351 [DOI] [Google Scholar]
  45. Zhang W., Wang Y., Luo G., Zhang J., He C., Wu X., et al. (2004). QTL mapping of ten agronomic traits on the soybean (Glycine max L. Merr) genetic map and their association with EST markers. Theor. Appl. Genet. 108, 1131–1139 10.1007/s00122-003-1558-8 [DOI] [PubMed] [Google Scholar]

Articles from Frontiers in Plant Science are provided here courtesy of Frontiers Media SA

RESOURCES