Skip to main content
PLOS One logoLink to PLOS One
. 2020 Oct 8;15(10):e0239122. doi: 10.1371/journal.pone.0239122

DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms

Selamawit Ketema 1,2, Bizuayehu Tesfaye 2, Gemechu Keneni 3, Berhanu Amsalu Fenta 1, Ermias Assefa 4, Nicolas Greliche 5, Eunice Machuka 6, Nasser Yao 6,*
Editor: Tzen-Yuh Chiang7
PMCID: PMC7544073  PMID: 33031381

Abstract

Cowpea [Vigna unguiculata (L.) Walp] is one of the important climate-resilient legume crops for food and nutrition security in sub-Saharan Africa. Ethiopia is believed to harbor high cowpea genetic diversity, but this has not yet been efficiently characterized and exploited in breeding. The objective of this study was to evaluate the extent and pattern of genetic diversity in 357 cowpea accestions comprising landraces (87%), breeding lines (11%) and released varieties (2%), using single nucleotide polymorphism markers. The overall gene diversity and heterozygosity were 0.28 and 0.12, respectively. The genetic diversity indices indicated substantial diversity in Ethiopian cowpea landraces. Analysis of molecular variance showed that most of the variation was within in the population (46%) and 44% between individuals, with only 10% of the variation being among populations. Model-based ancestry analysis, the phylogenetic tree, discriminant analysis of principal components and principal coordinate analysis classified the 357 genotypes into three well-differentiated genetic populations. Genotypes from the same region grouped into different clusters, while others from different regions fell into the same cluster. This indicates that differences in regions of origin may not be the main driver determining the genetic diversity in cowpea in Ethiopia. Therefore, differences in sources of origin, as currently distributed in Ethiopia, should not necessarily be used as indices of genetic diversity. Choice of parental lines should rather be based on a systematic assessment of genetic diversity in a specific population. The study also suggested 94 accesstions as core collection which retained 100% of the genetic diversity from the entire collection. This core set represents 26% of the entire collection pinpointing a wide distribution of the diversity within the ethiopian landraces. The outcome of this study provided new insights into the genetic diversity and population structure in Ethiopian cowpea genetic resources for designing effective collection and conservation strategies for efficient utilization in breeding.

Introduction

Cowpea [Vigna unguiculata (L.) Walp., 2n = 2x = 22] originated and was domesticated in Africa though the exact location of origin of domestication is still a matter of speculation and different authors suggest different areas in Africa, Northeastern Africa including Ethiopia [14], Central Africa [5], Southern Africa [6], and West Africa [5,7,8]. There are five known subspecies of cowpea, of which three are cultivated (unguiculata, cylindrical and sesquipedalis) and two are wild (dekindtiana and mensensis) [9,10]. In Ethiopia, all five subspecies are known to exist, and are of particular significance, being landraces of subspecies unguiculata and cylindrical, particularly in the drought-prone areas of eastern Ethiopia [11,12]. These subspecies are also grown in the northern, southwestern and southern parts of Ethiopia [12]. Thulin (1989) [13] reported that the subspecies sesquipedalis and dekindtiana are also cultivated in northern Ethiopia [14].

Cowpea is an important legume crop estimated to be grown on more than 11 million hectares of land, with an annual worldwide production of over 6 million tons, of which 96% is grown in Africa [15]. Cowpea is grown for different purposes, mainly as a source of staple food and nutritional security for farmers in sub-Saharan Africa. Cowpea is also useful for sustaining the farming system in Africa through its ability to fix atmospheric nitrogen and its tolerance to a wide number of abiotic stresses, including drought, heat, low soil pH and soil nutrient deficiency stress [16]. Cowpea, like other legumes, plays an important role when used in rotation with cereals by breaking the life cycles of pathogens of cereals [17,18]. In terms of nutrition security, cowpea is an affordable source of carbohydrate, protein, essential minerals, vitamins and folates, particularly to poor people who cannot afford animal-based diets [19,20].

Conventional cowpea breeding, particularly in African countries such as Nigeria, Senegal, Uganda and Tanzania, dates back to the 1960s [21]. Evidence shows that despite its many merits, cowpea breeding has suffered at least four main challenges. Firstly, the cowpea gene pool may be narrow, due partly to a genetic bottleneck during domestication. Secondly, genetic variation may be restricted by the ‘founder effects’ and limited germplasm exchange [22]. Thirdly, the crop has been awarded low priority and remained rather more orphaned in a number of countries, such as Ethiopia, where only limited efforts have been made to improve its productivity and utilization [23]. Fourthly, national cowpea improvement efforts in countries such as Ethiopia relied excessively on exotic genetic materials, particularly those from the International Institute of Tropical Agriculture (IITA). While broadening the genetic basis of breeding materials through incorporation of the exotic gene pool itself is the right way, landraces also have considerable breeding value, particularly under marginal conditions, as they contain valuable adaptive genes to different circumstances [24,25].

Knowledge of the extent and pattern of genetic diversity in a given gene pool provides plant breeders with an opportunity to develop new varieties with desirable traits [26]. Firstly, genetic gain from direct selection depends on the magnitude of genetic variability among the germplasms, heritability of a given trait in a given environment and the level of selection intensity applied [27,28]. Secondly, it is believed that crosses between genetically diverse parents are likely to produce higher heterosis, desirable genetic recombination and segregation in their progeny [29]. Apart from yield trials that have been conducted mostly on exotic genetic materials in Ethiopia, scientific evidence on the extent and pattern of genetic diversity in local cowpea landraces is limited. Among the few studies conducted, Belayneh et al. [30] assessed the genetic diversity in Ethiopian landraces using simple sequence repeats (SSR) markers and detected three-well differentiated ancestral populations. Genetic resource management, including building core collection for efficient space monitoring, is one of the common practice used worldwide to drive germplasm enhancement for future breeding. Core collections are subsamples of larger genetic resources collections which are created in order to include a minimum number of accessions representing the maximum if not the whole diversity of the original collection [31].

Molecular markers have been utilized in several crop species to ascertain the existence of an adequate amount of genetic diversity in a given gene pool. Over the last few decades, several marker technologies have been developed for cowpea, starting from the early days of the isozyme [32,33] to the relatively recent time of randomly amplified DNA fingerprinting and random amplified polymorphic DNA [3438], amplified fragment length polymorphism [3,22], randomly amplified microsatellite fingerprinting and microsatellite [16,30,3943] markers. With the more recent developments in molecular genetics, however, the single nucleotide polymorphism (SNP) method has emerged as a more precise, cost-effective and faster method that offered a lot of comparative advantages to the aforementioned molecular markers [44]. SNP markers are also common and are found throughout the genome; they are stable and readily assayed using high-throughput genotyping protocols with automated data analysis. Although SNP markers can be observed through various experimental protocols, at present, genotype-by-sequencing (GBS) is the most popular approach to their identification in plants [45].

GBS technologies produced robust marker genotypes and tens to thousands of them, in contrast to previous SNP arrays [46]. GBS has been used to build core collection as well as investigating the genetic diversity and population structure of many crop species, including cowpea [47,48]. The objectives of this study were, therefore, to determine the extent of genetic diversity comprehensively, to estimate the levels of population structure in Ethiopian cowpea germplasm collections using high-throughput GBS-derived SNP markers and to identify the minimum number of accessions capturing the maximum diversity for conservation.

Materials and methods

Plant materials

Three hundred and sixty-one cowpea genotypes were used for this study, of which 314 landraces collected from different regions in Ethiopia were kindly provided by Melkasssa Agricultural Research Center of the Ethiopian Institute of Agricultural Research. The original collection areas of the cowpea landraces in Ethiopia are given in Fig 1, along with a description of all sets of the test genotypes provided in S1 Table. Fig 1 was constructed using the software DIVA-GIS as described by Hijmans et al. (2012) [49] using the GPS coordinates of the collection sites (S1 File). The 314 collected cowpea landraces comprised 70 genotypes collected from Amhara, 94 from Southern Nations, Nationalities and Peoples Regional State (SNNPRS), 59 from Gambella, 49 from Oromia and 42 from Tigray. About 40 breeding lines previously introduced from the IITA and six released varieties were also included in the study and considered improved varieties. All sets of the cowpea test materials used in this study, hereafter called genotypes, were regarded as a population and each grouping, based on geographic regions and breeding status (landraces and improved varieties), was regarded as a subpopulation.

Fig 1. Map of Ethiopia showing the collection sites for the different cowpea landraces from different eco-geographical regions.

Fig 1

The map was constructed using the DIVA-GIS software as described by Hijmans et al., (2012) [49].

DNA extraction and sequencing

The genotypes were grown in a seed germination chamber (Conviron) at the Biosciences eastern and central Africa-International Livestock Research Institute (BecA-ILRI) hub using cell trays. Three seeds of each genotype were sown per tray. Ten-day-old leaf material was collected from the three seedlings and the pooled leaf samples were frozen in liquid nitrogen and stored at −80°C for later use. Genomic DNA (gDNA) was extract from the frozen tissue according to the CTAB protocol, with some modifications [50]. The quantity of extracted DNA was checked using a Thermo Scientific NanoDrop Spectrophotometer 2000c. The quality of the DNA was confirmed on 0.8% agarose gel run in 1% TAE buffer at 70 V for 45 minutes. After the quality had been checked, 40 μl of a 50 ng/μl gDNA of each sample of 359 cowpea genotypes was sent for whole genome scanning using Genotyping by sequencing technology as described by Elshire et al. (2011) [45], using DArTseqTM technology (https://www.diversityarrays.com/) of the Integrated Genotype Service and Support platform in Nairobi, Kenya. GBS was performed by using a combination of DArT complexity reduction methods and next generation sequencing following protocols described in [5153]. The complexity reduction method used involves digestion with the methylation-sensitive restriction enzyme, PstI. In conjunction with digestion using this relatively rarely-cutting restriction enzyme (six bp recognition site plus methylation sensitivity; Gruenbaum et al. 1981 [54]), an enzyme with frequent cutting capabilities. In this study, the frequently-cutting enzymes AluI, BstNI, TaqI or MseI were used. PCR adapters were ligated to the PstI fragment ends, and the PCR-amplification was performed using primers complementary to the PstI adapters. Only those fragments with PstI adapters at both ends were amplified.

SNPs calling and data filtering

The data were previously trimmed following the DArTSeqTM technology based on the following filter criteria; markers/SNPs with call rate > 97% and allele-calling equal or greater than 98% were selected. Genotypes with read depth less than the threshold were coded as missing. SNP markers with high proportion of missing data (>10%), individuals who have high rates of genotype missingness (>10%) and rare SNPs with <5% minor allele frequency (MAF) were discarded for further analysis using R software (version 2.8.3). The most informative SNPs were selected based on a threshold PIC value equal or higher than 0.2. Finally 357 cowpea genotypes and 6,498 (32%) of SNP markers were maintained for further analysis (S2 File).

Genetic diversity and phylogenic analysis

The population’s genetic structure was analyzed by conducting an analysis of molecular variance [55] using poppr package in the R version 2.8.3 [56,57]. The phylogenetic relationships of the subpopulations were generated based on pair-wise fixation indexes using the StAMPP package [58] and neighbor joining trees were constructed using the dartR package in R.

Pair-wise genetic frequency-based dissimilarity or distance matrix between individuals was calculated according to Euclidean distance as implemented in the R environment. The resulting dissimilarity matrix was subjected to tree construction using the unweighted pair group method analysis (UPGMA) employing the same software with the ggdendro and ggplot2 packages. Phylogenetic trees were constructed in R implementing the hclust algorithm, with the UPGMA relevant agglomeration method.

Population structure analysis

To infer the population structure of Ethiopian cowpea landraces, three complementary methods were used: 1) Bayesian model-based clustering algorism (STRUCTURE software) [59], 2) discriminant analysis of principal components (DAPC) and 3) principal coordinate analysis (PCoA). The structure analysis was run five times for each K value (K = 1 to 10) using a burn-in period of 50 000 with 100 000 Markov Chain Monte Carlo iterations, assuming an admixture model and uncorrelated allele frequencies. The most probable value of K for each test was detected by ΔK [60], using the web-based program Structure Harvester [61]. CLUMPP v.1.1.2 [62] was used to align cluster assignment from independent runs using the in-files generated by structure Harvest. Bar plots were generated with average results of runs for the most probable K value, using DISTRUCT v.1.1 [63]. A genotype was considered to belong to a group if its membership coefficient was ≥ 0.70. Genotypes with membership coefficient lower than 0.70 at each assigned K were regarded as admixed.

To cross-check the results from the model-based population structure from STRUCTURE with a model-free other method, DAPC was used. DAPC is a multivariate method designed to identify and describe clusters of genetically related individuals [64]. In the absence of a known grouping pattern, DAPC uses sequential K-means and model selection to build genetic clusters based on information from genetic data. The Bayesian information criterion (BIC) was used to identify an optimal number of genetic clusters (K) to describe the data. Based on the calculation of the α-score, the optimal number of principal components was retained. DAPC also provides membership probabilities for each individual to each identified group (or subpopulation), which can be equated to admixture proportions provided by STRUCTURE [59].

PCoA is a distance-based approach to dissect and display dissimilarities between individulas. The number of clusters obtained from STRUCTURE and DAPC was compared with those from PCoA without any assumption about the underlying population genetic model and it was performed using the dartR-R package [65].

Construction of core collection

DARwin version 6.0.010 was used to build the diversity trees [66]. Dissimilarities were calculated and transformed into Euclidean distances. Un-Weighted Neighbor-Joining (N-J) method was applied to the Euclidean distances to build a tree with all genotypes. Then, ‘maximum length sub tree function’ was used to draw the core collection. Maximum length sub-tree implemented is a stepwise procedure that successively prunes redundant individuals. This procedure allows the choice of the sample size which retains the largest diversity and is visualized by the tree as built on the initial set of accessions (357 genotypes). The size of the core collection and efficiency of the strategy was assessed by comparing and keeping the total number of alleles captured for each run using the same software. The size of the core collection was expressed as a proportion of the number of individuals selected for the core collection to the number of individuals in the entire collection.

Results

SNP variations

Of the total of 20 276 SNP markers, 6 498 (32%) SNP markers and 357 cowpea genotypes were retained after filtering. These 6 498 SNP markers were spread over the 11 chromosomes with an average of 591 markers per chromosome (S2 Table). Among the 11 chromosomes, the overall polymorphic information content values ranged from 0.25 (chromosome 9) to 0.30 (chromosome 6), with an average of 0.28. Gene diversity (Ho) values varied from 0.30 on chromosome 9 to 0.36 on chromosome 6, with an average of 0.33. The expected heterozygosity (He) values ranged from 0.11 (chromosome 9) to 0.13 (chromosome 4, 6 and 11), with an average of 0.12. For all chromosomes, the expected heterozygosity values (He) were higher than the observed heterozygosity values (Ho).

In the collected cowpea genomes, more transition-type SNPs (57%) were observed than transversion-type SNPs (43%), with a transition/transversion (Ts/Tv) SNP ratio of 1.33:1 (3714/2784). More A/G and C/T transitions were observed than G/A and T/C transitions. On the other hand, more G/T, A/T, A/C and C/G transversions were observed than T/G, T/A, C/A and G/C transversions.

Genetic diversity and relationship

The genetic parameter estimate of the pre-defined subpopulations is presented in Table 1. Landraces collected from Tigray, Amhara and Oromia had a higher Shannon diversity index (H’ = 0.44–0.45) and higher gene diversity (He = 0.29–0.30) than those from Gambella and SNNPRS. The population from Tigray had a higher heterozygosity value (Ho = 0.21) than those from other regions, whereas those from Gambella and SNNPRS had the lowest observed and expected heterozygosity values (Ho = 0.11). Landraces collected from the Oromia region had the highest inbreeding coefficient (FIS = 0.55) and the population from Tigray a relatively low inbreeding coefficient (FIS = 0.28), suggesting that 62% of the alleles were not fixed in the latter. Improved cultivars showed lower diversity (0.24) compared to the whole landraces population (0.30), with lower inbreeding in landraces (0.55) than in the improved varieties (0.58). The same trend was observed whie comparing the improved cultivars to the subpopulations of Oromia and Tigray that have similar size.

Table 1. Genetic parameter estimates based on 6498 SNPs among cowpea subpopulations.

Populations Genetic Parameters
N H’ Ho He FIS
Based on Geographical Region
Amhara 70 0.44 0.13 0.29 0.51
Gambella 59 0.41 0.12 0.27 0.47
Oromia 49 0.44 0.12 0.29 0.55
SNNPRS 92 0.41 0.11 0.27 0.52
Tigray 41 0.45 0.21 0.30 0.28
Mean 0.43 0.14 0.28 0.47
SE 0.008 0.018 0.006 0.048
Based on Breeding Status
Improved 47 0.37 0.06 0.24 0.58
Landraces 311 0.47 0.13 0.30 0.55
Mean 0.42 0.10 0.27 0.57
SE 0.05 0.05 0.03 0.02
Overall mean 0.43 0.12 0.28 0.52
SE 0.004 0.02 0.004 0.04

N, Number of genotypes; H’, Shannon diversity index; Ho, heterozygosity; He, gene diversity; FIS, inbreeding coefficient.

Genetic distance among cowpea genotypes varied from the lowest of 0.00 to the highest of 0.69 based on Euclidean distances. Twenty-nine pairs of genotypes had a genetic distance of 0, suggesting that the members of these pairs may in fact have been separately collected, but had an identical genetic background. Six pairs of genotypes, with an average genetic distance of 0.69, were found to be highly divergent. Cowpea landrace CP20, which was collected from Tigray, was found to diverge highly from other landraces, including CP5 (SNNPRS), CP259 (Amhara), CP277 (Oromia) and CP352 (SNNPRS). Another landrace, CP23, from the Amhara region was found to diverge from CP5 (SNNPRS) and CP259 (Amhara). The resulting distance matrix was used to construct an UPGMA dissimilarity dendrogram that classified the 357 cowpea genotypes into three distinct clusters (Fig 2). The first cluster (C-1) contained 31% of the genotypes (88 landraces and 22 improved varieties), the second (C-2) contained 24% of the genotypes, all of them being landraces with a major contribution from Amhara and Tigray, and the third cluster (C-3) contained 45% of the genotypes, including 25 improved varieties and 136 landraces mostly from Gambella, SNNPRS and Oromia. The improved varieties showed a clear tendency to be clustered together into only two clusters (C-1 and C-3).

Fig 2. UPGMA dendrogram showing the genetic relationships among cowpea collections grouped into three distinct clusters (cluster 1 = red, cluster 2 = green and cluster 3 = blue).

Fig 2

The neighbor joining (NJ) tree constructed from the pairwise FST values also grouped the regions of origin into three distinct clusters (Fig 3). The adjacent regions showed tendencies to be grouped together. Gambella and SNNPRS are in the first group (P-I); Oromia, Amhara and Tigray in the second group (P-II) and the introductions are in the third group (P-III) (Fig 3).

Fig 3. Neighbor joining tree among five different regions of Ethiopia and introduced improved cultivars based on pairwise FST.

Fig 3

Analysis of molecular variance

The analysis of molecular variance of the 357 cowpea genotypes as pre-defined, based on geographical regions and breeding status, is presented in Table 2. The results indicated high variance within populations of 46% based on geographical regions and 53% based on breeding status. Likewise, variation within individuals of 44% was recorded based on geographical regions and 39% based on breeding status. Variance among populations based on geographical regions accounted for only 10% of the variation and 8% based on breeding status.

Table 2. Analysis of molecular variance among and within cowpea subpopulations.

Source of variation Df SS MS EV PV F-Statistic
Based on Geographical Origin
Among populations 4 90397 22599 167 10 FST = 0.01
Within populations 306 700158 2288 775 46 FIS = 0.01
Within individuals 311 229322 737 737 44 FIT = 0.01
Total variations 621 1019877 1642 1679 100
Based on Breeding Status
Among populations 1 24591 24591 135 8 FST = 0.01
Within populations 356 883411 2481 904 53 FIS = 0.01
Within individuals 358 241095 673 673 39 FIT = 0.01
Total variations 715 1149097 1607 1713 100

Df = degrees of freedom; SS = sum of squares; EV = estimated variance, PV = percentage variance; FST = genetic differentiation, FIS = fixation index or inbreeding coefficient and FIT = Overall fixation index.

A high inbreeding coefficient (FIS), overall fixation index (FIT) and genetic differentiation (FST) values were observed. The FIT values for all the SNP loci ranged between 0.641 and 0.720, and FIS values between 0.602 and 0.698 based on geographical regions and breeding status, respectively. The pairwise FST values for all the SNP loci showed significant differentiation, ranging from -0.011 to 0.410 and -0.012 to 0.718 among the subpopulations, based on geographic regions and breeding status, respectively. The analysis showed low to moderate differentiation in allele frequencies among the populations (FST), 0.10 and 0.07, based on geographical area and breeding status, respectively. Gene flow between and within geographical regions was also moderate (Nm = 2.409).

Population structure

The population structure of the 357 cowpea genotypes was determined using STRUCTURE software. The most probable number of subpopulations in the collected cowpea genotypes was K = 3. Based on the probable likelihood of each genotype to be grouped into any of the three distinct groups, 57 (16%) fell into the first cluster (C-I), 55 (13%) into the second cluster (C-II), and 124 (34%) into the third cluster (C-III) (Fig 4). The remaining 121 of the 357 accessions (36%) were placed in the admixture group (Table 3). All five subpopulations of landraces based on the geographical regions had three structured populations, whereas the improved cultivars had shown only two (C-I and C-III).

Fig 4. Population structure of 357 cowpea genotypes, in K = 3; each color represents one cluster.

Fig 4

Table 3. Proportion of membership of each predefined population in each of the clusters obtained at the best K (K = 3).

Population Number of accessions Admixed individual Proportion of membership in each cluster (%)
Cluster I Cluster II Cluster III
STRUCTURE
Amhara 70 23 6 53 19
Gambella 59 29 22 3 46
Improved 47 53 9 0 38
Oromia 49 27 8 12 53
SNNPRS 92 29 30 2 38
Tigray 40 58 5 25 13
357 36.4 13.3 15.9 34.4
DAPC
Amhara 70 15 10 59 16
Gambella 59 32 25 2 41
Improved 47 13 40 0 47
Oromia 49 31 10 10 49
SNNPRS 92 17 41 3 39
Tigray 40 30 12 38 20
357 23.0 23.0 35.3

To confirm the true value of K, another model-free method, DAPC, was used. The optimum number of clusters was obtained with K = 3 using the BIC, which again divided the genotypes into three sub-populations. Membership clustering using DAPC also grouped the genotypes into three clusters (Fig 5). The first cluster had 109 (31%) genotypes, of which 22 were improved varieties and 87 landraces. The second cluster had 86 genotypes, which were all landraces and the third cluster had 162 genotypes, including 25 improved varieties and 137 landraces (Table 3).

Fig 5. Scattered plot of DAPC, each color represents one cluster.

Fig 5

PCoA showed that the first three principal component vectors explained a total of 37.8% of the genotypic variability, of which 32.9% was contributed by the first two principal components (PC1 and PC2). A bi-plot of the first two principal components (PC1 and PC2) also revealed a more or less consistent population structure, as presented earlier (Fig 6).

Fig 6. A bi-plot of the first two principal components (PC1 and PC2) of 357 cowpea genotypes, using 6 498 SNP markers.

Fig 6

Each color corresponds to population structuring and grouping.

Core collection. A total core set of 94 individuals out of 357 genotypes were sufficient to retain 100% of SNP diversity and captured all the alleles revealed by the 6498 SNPs. The core collection comprised 71 landraces collected from different regions of Ethiopia and 23 genotypes drawn from the improved cowpea cultivars representing 76% and 24% of the core collection respectively (Table 4). The core collection named as CC-94 represents 26% of the entire collections. In the core collection all the geographical regions were represented by 20% to 33% of genotypes from the total genotypes collected (Table 4). Fourteen [14], 15, 23, 11, 18 and 13 accessions were drawn from Ambara, Gambella, Improved cultivars, Oromia, ANNPRS and Tigray respectively. These accessions represent 20%, 25%, 49%, 22%, 20% and 33% respectively of the total individuals investigated from each region (Table 4).

Table 4. Number and proportion of accessions in the core collection along with each collection regions.

Collection regions Number of accessions by region Number of accessions from the core collection by region % of the core collection along region
Amhara 70 14 20
Gambella 59 15 25
Improved 47 23 49
Oromia 49 11 22
SNNPRS 92 18 20
Tigray 40 13 33
Total 357 94 26

Discussion

SNP variation

The genetic diversity in a set of 357 Ethiopian cowpea genotypes as revealed by 6 498 SNP markers showed all possible types of SNPs in these genotypes, A/G and C/T, being most prevalent. Observation of a transition/transversion (Ts/Tv) ratio of 1.33:1 may reflect high frequencies of A to G and C to T mutations following methylation. This result is in agreement with what was found in a global collection of 422 cowpea landraces and African ancestral wild cowpea genotypes previously genotyped with 1 536 SNPs markers [67] and another study of 768 cultivated cowpea genotypes from the USDA GRIN cowpea collections, genotyped with 1048 SNP markers [47].

Genetic diversity and relationship

The landraces collected from different parts of Ethiopia showed 13% allele heterozygosity compared to only 6% in improved varieties, indicating that most of the alleles in improved cowpea varieties were almost fixed to homozygosity. The Shannon-Weaver index is highly correlated, with evenness, i.e. the number of genotypes per population and/or number of unique genotypes represented in the population [68,69]. Despite the relatively small number, landraces collected from the Tigray region [41] showed the highest values of gene diversity (0.30) and heterozygosity (0.21). On the contrary, landraces collected from SNNPRS had the largest relative sample size [92] and showed the lowest gene diversity (0.27) and heterozygosity (0.11). Our result is different from that reflected in a previous report based on 81 Ethiopian cowpea accessions analyzed using SSR markers, which stated that accessions from the SNNPRS region had higher genetic diversity than those from Tigray [30]. This discrepancy might be due to a very small number of samples from the Tigray region [6] in the previous report, or the difference in the type of markers used, or both. It is believed that the differentiation of genotypes into different clusters is independent of the type of germplasm [70], the type of marker used [71,72], the primers selected within markers [72,73] and the sampling strategy [73].

The gene diversity values obtained for the entire population are similar to those reported from the world’s largest cowpea collections using SNP markers [47,48]. A study from Africa of Senegalese cowpea varieties using SSR markers showed similar results as those found in the present study [16], but higher values were observed compared to Chinese cowpea collections genotyped using SSR markers [74] and Iberian Peninsula collections with SNP markers [75]. Results from the present study, however, showed the existence of lower genetic diversity compared to similar studies in many other self-pollinated crops such as common bean [76], and chickpea and lentil [77]. This could be corroborated by the fact that, in general, cowpea has a narrow genetic base due to the initial bottleneck during domestication [3,4,36,41], and strict self-pollinated nature of the crop [7880].

It is worth noting that the cowpea landraces used in this study were collected from farmers’ fields and local markets; most of the cowpea growers in Ethiopia use farm-saved seeds of their own [81] and this type of seed system limits the movement of germplasm from farmer to farmer and among localities. Cowpea breeding is also still in its early infancy in Ethiopia and all these considerations together limit the development and provision of varietal options to the farmers, which, in turn, may inhibit the integration of new genotypes from other sources and result in limited genetic diversity of the crop.

Analysis of molecular variance

Variations within populations and within individuals accounted for the largest proportion of the total variation. Coupled with the lower fixation index (FST) estimates (0.10 and 0.07) and the small percentage of variation among populations (10% and 8% based on collection region and breeding status, respectively), this may suggest a low to moderate level of differentiation among populations with an increased level of admixtures. In practice, an FST of 0.00–0.05 indicates low differentiation, 0.05–0.15 indicates moderate differentiation and 0.15–0.25 high levels of differentiation, while an FST > 0.25 indicates a very high level of differentiation [8284]. As stated earlier, this lower level of variation among populations might be attributed to germplasm exchange among regions, limited introduction of new varieties to the farming system in each region and wider agro-ecological adaptation of the crop. This result is in agreement with many diversity studies in cowpea collections using different markers [30,48,67,74,85,86].

In the Ethiopian cowpea collection, we found pairwise FST values ranging from 0.022 to 0.122, indicating low to moderate levels of genetic differentiation among regions. Wright indicated that if Nm > 1 [87], there is enough gene flow. The gene flow (Nm) among regions in the current study is 2.409, indicating the existence of germplasm exchange among cowpea accessions collected from different geographic regions and the introduced improved cowpea cultivars. Similar results were reported from a previous study on Ethiopian cowpea germplasm collections [30].

Population structure

Information about the structure of germplasm collections is of great importance for both conservation and utilization of genetic resources. Different approaches were used to infer the population structure of Ethiopian cowpea germplasm; the Bayesian model-based clustering algorithm using STRUCTURE and DAPC and the optimal K value were compared with those from principal coordinate analysis and the UPGMA tree. These methods showed the existence of three main ancestral populations. DAPC is a clustering multivariate method that uses sequential K-means and model selection [57] for genetic clustering in the absence of a prior grouping pattern. It provides an interesting alternative to STRUCTURE software, as it does not require that populations are in Hardy-Weiber equilibrium and can handle large sets of data without using parallel processing software. Nevertheless, our results showed good consistency between STRUCTURE and DAPC analyses when no admixed individuals were considered. The result is in agreement with a previous structuration of Ethiopian cowpea genotypes using the SSR marker [30], as indicated by previous works from the world collection [47,48].

Despite the availability of newer approaches, traditional hierarchical clustering analysis such as UPGMA provides easy and effective determination of genetic diversity in plants [88]. Furthermore, multivariate relationships among accessions were revealed through PCoA. Both the UPGMA tree and PCoA confirm the result from STRUCTURE and DAPC.

The clustering of the genotypes presented in this study may give interesting clues for increasing diversity in breeding programs and germplasm collections. Landraces were spread in all three clusters, whereas most of the improved cultivars were included in only two clusters (C-2 and C-3). Hence, the use of landraces different from clusters 2 and 3, as founding clones, may increase the genetic diversity of new cultivars. Deep knowledge of the population structure and understanding of the clustering pattern would assist the efficient choice of parental lines in current breeding programs. This will maximize genetic diversity, enhance the potential gain from selection and would help to increase the breeding programs’ efficiency to face new demands from producer, consumers, and emerging ecological constraints (i.e. adaptation to climate change, resistance against pests).

Core collection

Establishement of core collection is important to have manageable and representative sample size that can represent the diversity of the entire collection [89]. Brown believes that a core collection sample size from 5% to 10% of the original germplasm resources can represent more than 70% of the genetic variations of the whole germplasm [31]. However, Yonezawa et al. [90] assumed 20%–30% of the sampling percentage was needed to well conserve the genetic diversity of the entire collection. This trend was observed in our study, where 26% of genotypes were sufficient to retain 100% of SNP diversity of the whole population. This also indicated the existence of remarkable genetic diversity in the Ethiopian cowpea collection. Similar result (27%) was reported in faba bean [91]. Conversely, lower proportion of core collection was observed in lupin [92] at 16% while higher proportion of 36% as core collection size was reported in common bean [93]. The present result demonstrated the potential of highly informative and selective DArTSeq SNP markers to construct core collection and to enhance proper utilization and conservation of Ethiopian cowpea accesstions. This core collection will serve as a primary source for SNP mining and further associational analysis for novel genes in cowpea.

Conclusion

Three well-differentiated genetic populations or clusters were postulated from this study in the 310 Ethiopian cowpea landraces and 47 improved cultivars based on genome-wide SNPs scanning. This population structure will inform a genomic selection-based approach to introgress genomic regions associated to ion content both in the leaf and grain of cowpea. Although different reports indicated that East Africa, including Ethiopia, is one of the centers of origin, center of diversity or secondary center of diversity, there was no cowpea collection neither any characterization to such extent involving Ethiopia cowpea landraces. Though this collection is only limited to Ethiopia, the result of this study shed light on the existence of genetic diversity in the landraces more than the cowpea collection used worldwide and it’s expected that these landraces might have unharnessed potential for future breeding owing different traits for cowpea improvement. Thus, these germplasms can be used globally for cowpea future breeding.

The limited genetic distances among the pre-defined populations suggest the existence of a large number of duplications of accessions in the Ethiopian cowpea germplasm collections. Therefore, a core collection was built to avoid genotype duplication in germplasm. In this study, we propose the first core collection of 94 accessions capturing all the diversity from the Ethiopian landrace collections representing five administrative regions and 47 improved cultivars from international research centers, based on GBS derived SNP markers. The establishment of this core collection will also enhance the proper conservation and utilization of cowpea genetic resources for crop improvment. The present study also demonstrated the potential of highly informative and selective DArTSeq-derived SNP markers for genetic diversity and population structure studies. Therefore, given the proven high variability level of the Ethiopia germplam, a core set of diagnostic marker for cowpea genetic diversity study can be built for use worldwide and in Ethiopia. Similarly, Kompetitive Allele Specific PCR markers, derived from the GBS (DArTSeq)-SNPs, can be designed, validated and used for marker trait association of the crop such as calcium, iron, magnesium and zinc content for traits that are important for the economy and nutrition.

Supporting information

S1 Table. Collection region, number of accession from each region and altitudinal ranges of the test cowpea genetic resources.

(DOCX)

S2 Table. Distribution and genetic diversity parameters of 6498 SNPs measured in a set of 357 cowpea genotypes.

(DOCX)

S1 File. GPS coordinates data used to construct the map showing the collection sites (Fig 1).

(XLSX)

S2 File. Genotyping supporting data after filtering used for the study.

(TXT)

Acknowledgments

The study was part of PhD research work of the first author (SK) and she is grateful to the Ethiopian Institute of Agricultural Research for giving her study leave. The authors are grateful to the Collaborative Crop Research Project of McKnight Foundation for the collection and maintenance of the accession and germplasm respectively; and the Ethiopian Bio-diversity Institute for provision of some of the accessions.

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

The laboratory work was supported by the BecA-ILRI Hub through the Africa Biosciences Challenge Fund (ABCF) program. The ABCF Program is funded by the Australian Department for Foreign Affairs and Trade (DFAT) through the BecA-CSIRO partnership; the Syngenta Foundation for Sustainable Agriculture (SFSA); the Bill & Melinda Gates Foundation (BMGF); the UK Department for International Development (DFID) and; the Swedish International Development Cooperation Agency (Sida). The authors are also grateful to the Collaborative Crop Research Project of McKnight Foundation for the financial support for the collection of the landraces and maintenance of the germplasm. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Vavilov NI. The origin, variation, immunity and breeding of cultivated plants. Chron Bot. 1951;13(1):366. [Google Scholar]
  • 2.Steele WM. Cowpeas: Vigna unguiculata (Leguminosae-Papilionatae) In: Simmonds NW, editor. Evolution of Crop Plants. London: Longman,; 1976. p. 183–5. [Google Scholar]
  • 3.Coulibaly S, Pasquet S, Papa R, Gepts P. AFLP analysis of the phenetic organization and genetic diversity of Vigna unguiculata L. Walp. reveals extensive gene flow between wild and domesticated types. Theor Appl Genet. 2002;104:358–66. 10.1007/s001220100740 [DOI] [PubMed] [Google Scholar]
  • 4.Pasquet R. Allozyme diversity of cultivated cowpea Vigna unguiculata (L.) Walp. Theor Appl Genet. 2000;101(July 2000):211–9. [Google Scholar]
  • 5.Faris DG. The origin and evolution of the cultivated forms of vigna sinensis. Can J Genet Cytol. 1965;7:433–52. [Google Scholar]
  • 6.Zhukovashii PM. Cultivated plants and their wild relatives, Common wealth Bureau of Plant Breeding. UK: Cambridge; 1962. [Google Scholar]
  • 7.Vaillancourt RE, Weeden NF. Chloroplast DNA polymorphism suggests Nigerian center of domestication for the cowpea, Vigna ungiculata (Leguminosae). Am J Bot. 1992;79(10):1194–9. 10.1002/j.1537-2197.1992.tb13716.x [DOI] [PubMed] [Google Scholar]
  • 8.Ng N, Marechal R. Cowpea taxonomy, origin and germplasm In: Singh S, Rachie K, editors. Cowpea research, production and utilization. New York: Wiley; 1985. p. 11–21. [Google Scholar]
  • 9.Pasquet RS. Genetic relationships among subspecies of Vigna unguiculata (L.) Walp. based on allozyme variation. Theor Appl Genet. 1999;98:1104–5. [Google Scholar]
  • 10.Verdcourt B. Studies in the leguminosae-Papilionoideae for the ‘flora of tropical East Africa’: IV”, Kew Bulletin, Vol. 24, No. 3,. Vol. 24, Kew Bulletin. 1970. p. 507–69.
  • 11.Edwards SB. Crops with wild relatives found in Ethiopia In: Engels JMM, Hawkes JG, Worede M, editors. Plant genetic resources of Ethiopia. NY, USA: University of Cambridge; 1991. p. 58–74. [Google Scholar]
  • 12.FAO. Ethiopia: Country report to the FAO International Technical Conference. Plant Genetic Resources, Leipzig, Germany, 17–23 June 1996. 1996.
  • 13.Thulin M. Fabaceae (Leguminosae) In: Hedberg I, Edwards S, editors. Flora of Ethiopia Volume 3. Addis Ababa and Asmara, Ethiopia, Uppsala, Sweden; 1989. p. 49–174. [Google Scholar]
  • 14.Thulisn M. Fabaceae (Leguminosae). Pittosporaceae to Araliaceae. In: S. HI and E, editor. Flora of Ethiopia Vol 3. Addis Ababa and Asmara, Ethiopia, Uppsala, Sweden.; 1989. p. 49–174.
  • 15.FAOSTAT. FAOSTAT. Food and Agriculture Organization of the United Nations. Statistics Division. Rome, Italy; 2018.
  • 16.Badiane FA, Gowda BS, Cissé N, Diouf D, Sadio O, Timko MP. Genetic relationship of cowpea (Vigna unguiculata) varieties from Senegal based on SSR markers. Genet Mol Res. 2012;11(1):292–304. 10.4238/2012.February.8.4 [DOI] [PubMed] [Google Scholar]
  • 17.Malhotra RS, Singh KB. Classification of chickpea growing environments to control genotype by environment interaction. Euphytica. 2004;58:5–12. [Google Scholar]
  • 18.Kirkegaard J, Christen O, Krupinsky J, Layzell D. Break crop benefits in temperate wheat production. F Crop Res. 2008;107:185–195. [Google Scholar]
  • 19.Mduma I. Modified Traditional Cowpea Leaf Vegetable Preparation Methods for Enhancing Carotenoid Retenstion and Bioavailability. MSc Thesis. Sokoine University of Agriculture. Morogoro, Tanzania.; 2010.
  • 20.USDA. Nutrient data for: 11202, Cowpeas, leafy tips, cooked, boiled, drained, without salt. USDA National Nutrient Database for Standard Reference 27. 2015.
  • 21.Gomes AMF, Nhantumbo N, Ferreira-Pinto, Manuela, Rafael Massinga JCR, Ribeiro-Barros and A. Breeding elite cowpea [Vigna unguiculata (L.) Walp] varieties for improved food security and income in Africa: Opportunities and challenges. In: Plant Breeding. 2019. p. 1–14.
  • 22.Fang J, Chao C-CT, Roberts PA and Ehlers JD. Genetic diversity of cowpea [Vigna unguiculata (L.) Walp.] in four West African and USA breeding programs as determined by AFLP analysis. Genet Resour Evol. 2007;54:1197–209. [Google Scholar]
  • 23.NRC. Lost Crops of Africa: volume II Vegetables. Vol. II, National Academies Press; The National Academies Press; 2006. 378 p. [Google Scholar]
  • 24.Xu Y, Li P, Zou C, Lu Y, Xie C, Zhang X, et al. Enhancing genetic gain in the era of molecular breeding. J Exp Bot. 2017;68(11):2641–66. 10.1093/jxb/erx135 [DOI] [PubMed] [Google Scholar]
  • 25.Carolina T, Villa C, Maxted N, Scholten M, Ford-lloyd B. Defining and identifying crop landraces. Plant Genet Resour. 2006;3(3):373–84. [Google Scholar]
  • 26.Govindaraj M, Vetriventhan M, Srinivasan M. Importance of genetic diversity assessment in crop plants and its recent advances: An overview of its analytical perspectives. Genet Res Int. 2015;(December):14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Falconer DS. Introduction to Quantitative Genetics. 3rd ed Longman Scientific and Technical; 1989. 438 p. [Google Scholar]
  • 28.Singh BB. Recent genetics studies in cowpea: Challenges and opportunites. In: Fatokun CA, Tarawali SA, Singh BB, Kormawa PM, Tamò and M, editors. Challenges and opportunities for enhancing sustainable cowpea production Proceedings of the World Cowpea Conference III held at the International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria, 4–8 September 2000 IITA,. Ibadan, Nigeria.; 2002. p. 3–13.
  • 29.Chahal GS, Gosal SS. Principles and procedures of plant breeding: Biotechnological and conventional approaches Oxford, United Kingdom: Alpha Science International Ltd; 2002. [Google Scholar]
  • 30.Belayneh AD, Mohammed S, Dagne K, Timko MP. Assessment of genetic diversity in Ethiopian cowpea [Vigna unguiculata (L.) Walp.] germplasm using simple sequence repeat markers. Plant Mol Biol Rep. 2016;3:978. [Google Scholar]
  • 31.Brown AHD. Core collections: A practical approach to genetic resource management. Genome. 1989;31(2):818–24. [Google Scholar]
  • 32.Reis CM, Frederico AM. Genetic diversity in cowpea (Vigna unguiculata) using isozyme electrophoresis. Acta Hortic. 2001;546:497–501. [Google Scholar]
  • 33.Vaillancourt RE, Weeden NF, Barnard J. Isozyme diversity in the cowpea species complex. Crop Sci. 1993;33:606–13. [Google Scholar]
  • 34.Zannou A, Kossou DK, Ahanchédé A, Zoundjihékpon J, Agbicodo E, Struik PC, et al. Genetic variability of cultivated cowpea in Benin assessed by random amplified polymorphic DNA. African J Biotechnol. 2008;7(24):4407–14. [Google Scholar]
  • 35.Malviya N, Manoj BKS, Yadav K. Analysis of genetic diversity in cowpea (Vigna unguiculata L. Walp.) cultivars with random amplified polymorphic DNA markers. Plant Syst Evol. 2012;298:523–6. [Google Scholar]
  • 36.Ba FS, Pasquet RS, Gepts P. Genetic diversity in cowpea [Vigna unguiculata (L.) Walp.] as revealed by RAPD markers. Genet Resour Crop Evol. 2004;51(5):539–50. [Google Scholar]
  • 37.Anatala TJ, Gajera HP, Savaliya DD, Domadiya RK, Patel S V., Golakiya B a. Molecular diversity analysis of cowpea (Vigna unguiculata L.) genotypes determined by ISSR and RAPD markers. Int J Agric Environ Biotechnol. 2014;7(2):269. [Google Scholar]
  • 38.Nkongolo KK. Genetic characterization of Malawian cowpea (Vigna unguiculata (L.) Walp) landraces: diversity and gene flow among accessions. Euphytica. 2003;129:219–20. [Google Scholar]
  • 39.Li C, Fatokun CA, Ubi B, Singh BB, Scoles GJ, Vigna C, et al. Determining genetic similarities and relationships among cowpea breeding lines and cultivars by microsatellite markers. Crop Sci. 2001;41:189–97. [Google Scholar]
  • 40.Ogunkanmi LA, Ogundipe OT, Ng NQ. Genetic diversity in wild relatives of cowpea (Vigna unguiculata) as revealed by simple sequence repeats (SSR) markers. J Food, Agric Environ. 2008;6(3&4):263–8. [Google Scholar]
  • 41.Asare AT, Galyuon IKA, Timko M. Assessment of the genetic diversity in cowpea (Vigna unguiculata L. Walp.) germplasm from Ghana using simple sequence repeat markers. Plant Genet Resour. 2010;8(2):142–50. [Google Scholar]
  • 42.Olasupo FO, Ilori CO, Stanley EA, Owoeye TE, Igwe DO. Genetic analysis of selected mutants of cowpea (Vigna unguiculata [L.] Walp) using simple sequence repeat and rcb L markers. Am J Plant Sci. 2018;9:2728–56. [Google Scholar]
  • 43.Xu P, Wu X, Wang B, Liu Y, Qin D, Ehlers JD, et al. Development and polymorphism of Vigna unguiculata ssp. unguiculata microsatellite markers used for phylogenetic analysis in asparagus bean (Vigna unguiculata ssp. sesquipedialis (L.) Verdc.). Mol Breed. 2010;25(4):675–84. [Google Scholar]
  • 44.Rafalski A. Applications of single nucleotide polymorphisms in crop genetics. Curr Opin Plant Biol. 2002;5(2):94–100. 10.1016/s1369-5266(02)00240-6 [DOI] [PubMed] [Google Scholar]
  • 45.Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One. 2011;6(5):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Bajgain P, Rouse MN, Anderson JA. Comparing genotyping-by-sequencing and single nucleotide polymorphism chip genotyping for quantitative trait loci mapping in wheat. Crop Sci. 2016;56:1–17. [Google Scholar]
  • 47.Xiong H, Shi A, Mou B, Qin J, Motes D, Lu W. Genetic diversity and population structure of cowpea (Vigna unguiculata L. Walp). PLoS One. 2016;11(8):1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Fatokun C, Girma G, Abberton M, Gedil M, Unachukwu N, Oyatomi O, et al. Genetic diversity and population structure of a mini-core subset from the world cowpea (Vigna unguiculata (L.) Walp.) germplasm collection. Sci Rep. 2018;8(October):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Hijmans RJ, Guarino L, Mathur P. DIVA-GIS. Version 7.5. 2012. p. 77.
  • 50.Rogers S, Bendich A. Extraction of DNA from milligram amounts of fresh, herbarium and mummified plant tissues. Plant Mol Biol. 1985;5(2):69–76. 10.1007/BF00020088 [DOI] [PubMed] [Google Scholar]
  • 51.Kilian A, Wenzl P, Huttner E, Carling J, Xia L, Blois H, et al. Diversity arrays technology: a generic genome profiling technology on open platforms. Data Production and Analysis in Population Genomics: Methods and Protocols. 2012. 67–89 p. [DOI] [PubMed] [Google Scholar]
  • 52.Altshuler D, Pollara V, Cowles C, Van Etten W, Baldwin J, Linton L, et al. An SNP map of the human genome generated by reduced representation shotgun sequencing. Nature. 2000;407(6803):513–6. 10.1038/35035083 [DOI] [PubMed] [Google Scholar]
  • 53.Wells SJ, Dale J. Contrasting gene flow at different spatial scales revealed by genotyping- by-sequencing in Isocladus armatus, a massively colour polymorphic New Zealand marine isopod. PeerJ. 2018;6:e5462: 1–27. 10.7717/peerj.5462 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Gruenbaum Y, Stein R, Cedar H, Razin A. Methylation of CpG sequences in eukaryotic DNA. FEBS Lett. 1981;124(1):67–71. 10.1016/0014-5793(81)80055-5 [DOI] [PubMed] [Google Scholar]
  • 55.Excoffier L, Smouse PE, Quattro JM. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application. Genetics. 1992;131:479–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Kamvar Z, Tabima J, Grünwald N. Poppr: an R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ 2:e281 2014; 10.7717/peerj.281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kamvar Z, Brooks J, Grünwald N. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality. Front Genet. 2015;6(208). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Pembleton LW, Cogan N, Forster J. StAMPP: An R package for calculation of genetic differentiation and structure of mixed-ploidy level populations. Mol Ecol Resour. 2013;13(5):946–52. 10.1111/1755-0998.12129 [DOI] [PubMed] [Google Scholar]
  • 59.Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol Ecol. 2005;14:2611–20. 10.1111/j.1365-294X.2005.02553.x [DOI] [PubMed] [Google Scholar]
  • 61.Earl D. Structure Harvester: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012;4(2):359–61. [Google Scholar]
  • 62.Jakobsson M, Rosenberg NA. CLUMPP: A cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23(14):1801–6. 10.1093/bioinformatics/btm233 [DOI] [PubMed] [Google Scholar]
  • 63.Rosenberg NA. DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes. 2004;4:137–8. [Google Scholar]
  • 64.Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11(94):1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Gruber B, Unmack PJ, Georges OF, Arthur B. dartR: an R package to facilitate analysis of SNP data generated from reduced representation genome sequencing. Mol Ecol Resour. 2018;18(3):691–9. 10.1111/1755-0998.12745 [DOI] [PubMed] [Google Scholar]
  • 66.Perrier X, Jacquemoud-Collet JP. DARwin software.http://darwin.cirad.fr/. 2006.
  • 67.Huynh B, Close TJ, Roberts PA, Hu Z, Wanamaker S, Lucas MR, et al. Gene pools and the genetic architecture of domesticated cowpea. Plant Genome. 2013;6(2):1–8. [Google Scholar]
  • 68.Magurran AE. Measuring biological diversity. Blackwell Publishing; 2004. 215 p. [Google Scholar]
  • 69.Magurran AE. Ecological diversity and its measurement. Princeton, New Jersey: Princeton University Press; 1988. 179 p. [Google Scholar]
  • 70.He Q, Li X, Liang G, Ji K, Guo Q, Yuan W, et al. Genetic diversity and identity of Chinese loquat cultivars/accessions (Eriobotrya japonica) using apple SSR markers. Plant Mol Biol Rep. 2011;29:197–208. [Google Scholar]
  • 71.Baraket G, Chatti K, Saddoud O, Abdelkarim A Ben, Mars M, Trifi M, et al. Comparative assessment of SSR and AFLP markers for evaluation of genetic comparative assessment of SSR and AFLP markers for evaluation of genetic diversity and conservation of Fig, Ficus carica L., Genetic Resources in Tunisia. Plant Mol Biol Rep. 2011;29:171–84. [Google Scholar]
  • 72.Sharma S, Negi M, Sinha P, Kumar K, Tripathi S. Sharma SS, et al. (2011) Assessment of genetic diversity of biodiesel species Pongamia pinnata accessions using AFLP and three endonuclease-AFLP. Plant Mol Biol Rep 29:12–18. 10.1007/s11105-010-0204-2 [DOI] [Google Scholar]
  • 73.Kong Q, Li X, Xiang C, Wang H, Song J, Zhi H. Genetic Diversity of Radish (Raphanus sativus L.) Germplasm Resources Genetic Diversity of Radish (Raphanus sativus L.) Germplasm Resources Revealed by AFLP and RAPD Markers. Plant Mol Biol Rep. 2011;29:217–23. [Google Scholar]
  • 74.Chen H, Chen H, Hu L, Wang L, Wang S, Li M, et al. Genetic diversity and a population structure analysis of accessions in the Chinese cowpea [Vigna unguiculata (L.) Walp.] germplasm collection. Crop J. 2017;5:363–72. [Google Scholar]
  • 75.Carvalho M, Muñoz-amatriaín M, Castro I, Lino-neto T, Matos M, Egea-cortines M, et al. Genetic diversity and structure of Iberian Peninsula cowpeas compared to world- wide cowpea accessions using high density SNP markers. BMC Genet. 2017;18(891):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Asfaw A, Blair MW, Almekinders C. Genetic diversity and population structure of common bean (Phaseolus vulgaris L.) landraces from the East African highlands. Theor Appl Genet. 2009;120:1–12. 10.1007/s00122-009-1154-7 [DOI] [PubMed] [Google Scholar]
  • 77.Mekonnen F, Mekbib F, Kumar S, Ahmed S, Sharma TR. Molecular diversity and population structure of the Ethiopian lentil (Lens Culinaris Medikuss) genotype assessment using SSR markers. J Crop Sci Biotech. 2015;19(1):1–15. [Google Scholar]
  • 78.Venter HM. Difficulties with cross-pollinating five cowpea lines: Technique development The Biodiversity of African Plants. 1996. p. 656–60. [Google Scholar]
  • 79.Kouam EB, Pasquet RS, Campagne P, Tignegre JB, Thoen K, Gaudin R, et al. Genetic structure and mating system of wild cowpea populations in West Africa. BMC Plant Biol. 2012;12(113):1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Blambert L, Mallet B, Humeau L, Pailler T. Reproductive patterns, genetic diversity and inbreeding depression in two closely related Jumellea species with contrasting patterns of commonness and distribution. Ann Bot. 2016;118:93–103. 10.1093/aob/mcw014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Bedru B, Berhanu A, Mulugeta T, Dagmawit T, Bezawit Y, Selamawit K. Cowpea: Production, marketing and utilization in Ethiopia. Research Report 121. Addis Ababa, Ethiopia; 2019.
  • 82.Hartl DL, Clark AG. Principles of population genetics. 4th ed Sunderland, Massachusetts: Sinauer Associates, Inc.; 1997. [Google Scholar]
  • 83.De Vicente MC, Lopez C, Fulton T. Genetic diversity analysis with molecular marker data: Learning module. International Plant Genetic Resources Institute (IPGRI). Cornell University. 2004.
  • 84.Wright S. Evolution and the Genetics of Populations, Volume 3 Experimental reseults and Evolutionnary Dedudctions. London: The University of chicago Press; 1977. 622 p. [Google Scholar]
  • 85.Belayneh AD, Dagne K, Melaku G, Ousmane B, A FC. Efficiency of SNP and SSR-based analysis of genetic diversity, population structure, and relationships among cowpea (Vigna unguiculata (L.) Walp.) germplasm from east Africa and IITA Inbred Lines. J Crop Sci Biotech. 2017;20(2):107–28. [Google Scholar]
  • 86.Xiong H, Qin J, Shi A, Mou B, Wu D. Genetic differentiation and diversity upon genotype and phenotype in cowpea (Vigna unguiculata L. Walp.). Euphytica. 2018;214(4):1–15. [Google Scholar]
  • 87.Wright S. Evolution in Mendelian population. Genetics. 1931;16:97–159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Odong TL, Van Heerwaarden J, Jansen J, Van Hintum TJL, Van Eeuwijk FA. Determination of genetic structure of germplasm collections: Are traditional hierarchical clustering methods appropriate for molecular marker data? Theor Appl Genet. 2011;123:195–205. 10.1007/s00122-011-1576-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Van Hintum TJL, Brown AHD, Spillane C, Hodgkin T. Core collections of plant genetic resources IPGRI technical bulletin No. 3. Rome, Italy: International Plant Genetic Resources Institute (IPGRI); 2000. p. 51. [Google Scholar]
  • 90.Yonezawa K, Nomura T, Morishima H. Sampling strategies for use in stratified germplasm collections In: Hodgkin T, Brown AHD, van Hintum TJ., Morales EAV, editors. Core Collections of Plant Genetic Resources. UK: John Wiley and Sons; 1995. p. 35–54. [Google Scholar]
  • 91.Liu Y, Hou W. Genetic diversity of faba bean germplasms in qinghai and core germplasm identified based on AFLP analysis. Legum Genomics Genet. 2010;1(1):1–6. [Google Scholar]
  • 92.Atnaf M, Yao N, Martina K, Dagne K, Wegary D, Tesfaye K. Molecular genetic diversity and population structure of Ethiopian white lupin landraces: Implications for breeding and conservation. PLoS One. 2017;12(11):1–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Perseguini JMKC, Silva GMB, Feijó Rosa JRB, Gazaffi R, Marçal JF, et al. Developing a Common Bean Core Collection Suitable For Association Mapping. Genet Mol Biol. 2015;38(1):67–78. 10.1590/S1415-475738120140126 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Tzen-Yuh Chiang

7 Apr 2020

PONE-D-19-32934

DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms

PLOS ONE

Dear Dr. Yao,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by May 22 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Tzen-Yuh Chiang

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We noticed you have some minor occurrence of overlapping text with the following previous publication(s), which needs to be addressed:

https://bmcplantbiol.biomedcentral.com/articles/10.1186/s12870-016-0712-9

In your revision ensure you cite all your sources (including your own works), and quote or rephrase any duplicated text outside the methods section. Further consideration is dependent on these concerns being addressed.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Dear Editor and Author:

In the article “DArTSeq SNP-based markers revealed high genetic diversity and structured

population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms”, the authors evaluated the genetic diversity in 357 Ethiopian cowpea germplasm, using 6498 GBS based single nucleotide polymorphism (SNPs) markers. The population structure, genetic diversity, and phylogeny were analyzed in this research.

Generally, this manuscript has a large amount of data, a clear idea, and a relatively intelligent data analysis. We can see the author's research ability and efforts. Personally, I like this article very much. However, I think this manuscript is not appropriate to be published right now, until the below problems are solved:

1. This manuscript is to describe the diversity of cowpea in Ethiopian, why the author added breeding line and varieties which is not collected from Ethiopian? I agree and recommend adding a certain amount of breeding lines for diversity analysis, however, the source of the materials does not associate with the main idea of the article, it will cause confusion in the analysis. If the author implied or thought those breeding lines were all from Ethiopia, why compare breeding lines with all Ethiopian landraces as two independent groups in Table 1 and 2?

2. How did the authors pick up the 314 landraces from whole germplasms? Are these materials representative of the entire germplasm pool? Theoretically, diversity analysis must collect all local landraces, but considering the research cost. The researchers will select some highly diverse materials from all the collections to represent the whole germplasm pool. Please clarify whether these materials have such qualifications. My suggestion is that if the 314 landraces are part of germplasm, the author should provide the phenotype of all germplasm resources and prove the representativeness of the selected material; if this material is the entire germplasm of Ethiopia, it is also desirable to conduct analysis and statistics on the phenotype of this material.

3. How did the authors screen the 6498 markers from the 20276 SNPs? The main problem of GBS is missing and false data, please clarify the quality control of genotype. In my suggestion, it is also better to prove the distribution of the 6498 SNPs in each chromosome, because of the character of GBS sequencing, the average distance of markers is too much different from the real situation.

4. Please re-make Fig 2, 5 and 6 to improve the resolution, if the authors could.

Except for the flaw of the scientific part, my main concerns are:

1 The germplasm in this manuscript was only from one country, which makes it of limited reference value to researchers worldwide. Please elaborate on the implications of this study for researchers worldwide.

2 Overall, although the diversity analysis of this study was relatively integrated, I did not see any innovation, depth understanding, or thinking about diversity research from the conclusions and perspectives.

To sum up, I am conservative and cautious about the publication of this manuscript immediately, unless the authors can find new research ideas or impressive conclusion with a more thorough discussion and understanding from current results. My suggestion is that, since the author has mentioned the establishment of a core collection, it is better to build the “core germplasm” for Ethiopia based on the existing data combining with the phenotype of whole the germplasm. This project is more important than an only diversity study, and its results are more valuable for reference over the world.

Sincerely

Reviewer #2: In this manuscript the authors explored the genetic diversity and population structure of Ethiopian cowpea using DArTSeq markers. However, the paper needs improvement. The authors forgot to include line numbers in the manuscript which make it difficult to mention typographical errors. A correction of the text is recommended.

In the abstract replace 357 cowpea “germplasms” by 357 cowpea accessions

In the introduction the 1st sentence needs correction, also there is a lack of consensus on where in Africa cowpea was domesticated. Authors should take that in account.

Methods section: a brief description of the sequencing method is recommended.

Authors should provide detailed information on the SNP calling approach, SNP filtering thresholds as well as the filtering of the accessions.

Population structure analysis: I would recommend a membership coefficient of ≥ 0.70.

Authors should also replace the figures with high resolution figures.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Oct 8;15(10):e0239122. doi: 10.1371/journal.pone.0239122.r003

Author response to Decision Letter 0


14 Aug 2020

Rebuttal letter in response to the comments forwarded by the reviewers on manuscript number, PONE-D-19-32934 [DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms].

Dear Editor and Reviewers

We are grateful for the reviewers who have been reviewing our manuscript during this COVID -19 pandemic time. We are also thankful for the valuable comments given to improve our manuscript. We would also like to seek apology for the delayed response for this paper, this was due to communication gap among the authors worsened by the pandemic situation and the internet restriction by the Ethiopian gouvernement. The reviewer’s comments have helped us to improve the manuscript and we have tried our best to address all the comments given by the reviewers. The changes made in the manuscript are shown in track in the manuscript. The response for each comment, along with the query of the reviewer is summarize as below.

Sincerely,

The Authors

Revision notes

Reviewer 1 Comments:

Comment 1. This manuscript is to describe the diversity of cowpea in Ethiopian, why the author added breeding line and varieties which is not collected from Ethiopian? I agree and recommend adding a certain amount of breeding lines for diversity analysis, however, the source of the materials does not associate with the main idea of the article, it will cause confusion in the analysis. If the author implied or thought those breeding lines were all from Ethiopia, why compare breeding lines with all Ethiopian landraces as two independent groups in Table 1 and 2?

Response 1.). One of our objectives is to see if the landraces collected from different regions of Ethiopia maintained and used by the local farmers are different from the cowpea genotypes collected from IITA or developed by breeding program in Ethiopia. The inclusion of the breeding lines is driven by the fact that the cowpea-breeding program, being dependent on the introduced cultivars imported and/or from the International Institute of Tropical Agriculture (IITA), would represent exotic/external germplasm outside Ethiopia. Our goal in comparing breeding lines/improved variety from IITA with Ethiopian landraces is to see the allelic richness of these two groups and hence develop a better germplasm management strategy and for use for breeding in Ethiopia and worldwide. In addition, to mitigate the bias that can be introduced by the difference in number of samples, we have compared the improved varieties with each of the subpopulation that constitute the whole population for landraces.

Comment 2. How did the authors pick up the 314 landraces from whole germplasms? Are these materials representative of the entire germplasm pool? Theoretically, diversity analysis must collect all local landraces, but considering the research cost. The researchers will select some highly diverse materials from all the collections to represent the whole germplasm pool. Please clarify whether these materials have such qualifications. My suggestion is that if the 314 landraces are part of germplasm, the author should provide the phenotype of all germplasm resources and prove the representativeness of the selected material; if this material is the entire germplasm of Ethiopia, it is also desirable to conduct analysis and statistics on the phenotype of this material.

Response 2. The 314 landraces represent the entire collection of cowpea germplasm in Ethiopia. From this study we planned to only phenotype the core collection extensively for several key and economically important traits to cut on cost usually associated with efficient broad genetic stock/germplasm characterization.

Comment 3. How did the authors screen the 6498 markers from the 20276 SNPs? The main problem of GBS is missing and false data, please clarify the quality control of genotype. In my suggestion, it is also better to prove the distribution of the 6498 SNPs in each chromosome, because of the character of GBS sequencing, the average distance of markers is too much different from the real situation.

Response 3. The comment is accepted, and one paragraph was inserted in material and methods under the section “SNPs calling and data filtering” regarding data filtering to respond to the request query. Data filtering conditions were included as follow: SNP markers with high proportion of missing data (>10%), individuals who have high rates of genotype missingness (>10%) and rare SNPs with <5% minor allele frequency (MAF) were discarded from further analysis using R software (version 2.8.3). The most informative SNPs were selected based on a threshold PIC value equal or higher than 0.2. Finally 357 cowpea genotypes and 6498 (32%) of SNP markers were maintained for further analysis.

In addition, the distribution of SNPs per chromosomes was provided in S2 Table. The number of SNPs ranges from 463 on chromosome 2 to 849 on chromosome 3 with a mean of 591 SNPs per chromosome

Comment 4. Please re-make Fig 2, 5 and 6 to improve the resolution, if the authors could

Response 4. The comment is accepted, and new captures were done to improve the figure quality subsequently. The figures were also validated through the PLOS one PACE requirements.

Comment 5. Except for the flaw of the scientific part, my main concerns are:

The germplasm in this manuscript was only from one country, which makes it of limited reference value to researchers worldwide. Please elaborate on the implications of this study for researchers worldwide. Overall, although the diversity analysis of this study was relatively integrated, I did not see any innovation, depth understanding, or thinking about diversity research from the conclusions and perspectives.

To sum up, I am conservative and cautious about the publication of this manuscript immediately, unless the authors can find new research ideas or impressive conclusion with a more thorough discussion and understanding from current results. My suggestion is that, since the author has mentioned the establishment of a core collection, it is better to build the “core germplasm” for Ethiopia based on the existing data combining with the phenotype of whole the germplasm. This project is more important than an only diversity study, and its results are more valuable for reference over the world.

Response 5. The queries and suggestion of comment 5 has been addressed as follows and included in the conclusion to show the importance of the current study and its potential use worldwide

� Although different reports indicated that East Africa including Ethiopia is one of the centers of origin, center of diversity or secondary center of diversity there was no cowpea collection neither any characterization to such extent in Ethiopia cowpea landraces. Though this collection was only done in Ethiopia, the result of this study has shown the existence of genetic diversity in the landraces and it expected that these germplasms might have unharnessed potential for future breeding owing different trait for cowpea improvement. Thus, these germplasms can be used globally for cowpea future breeding.

Innovation, depth understanding, or thinking about diversity research from the conclusions and perspectives.

� Worldwide, germplasm collection has been done for most crops for a long period, and it is assumed as we already had enough collection. However, this study could show, as we have not still collected the germplasm resource of the world. Thus, this study sheds light on how we could still have several germplasm in most crops that needs collection, preservation, and use for future breeding. In this study in the future, the collected germplasm will be thoroughly phenotype, and used for cowpea improvement for the target traits. A core set of diagnostic makers can be developed to serve worldwide purpose

My suggestion is that, since the author has mentioned the establishment of a core collection, it is better to build the “core germplasm” for Ethiopia based on the existing data combining with the phenotype of whole the germplasm,

• The suggestion for establishing core collection is well taken and included in the manuscript as shown in the introduction, material method, result, discussion and conclusion. However, we wouldn’t be able to phenotype the whole collection due to budget limitation, hence we will be focusing on phenotyping extensively the core collection for several traits

Reviewer 2 Comments:

Comment 1. The authors forgot to include line numbers in the manuscript, which make it difficult to mention typographical errors. A correction of the text is recommended.

Response 1 Comment accepted and line numbers included

Comment 2. In the abstract replace 357 cowpea “germplasms” by 357 cowpea accessions

Response 2. Comment accepted and correction made according to the suggestion

Comment 3. In the introduction, the first sentence needs correction; also, there is a lack of consensus on where in Africa cowpea was domesticated. Authors should take that in account.

Response 3. Comment accepted and the sentence was rephrased as “Cowpea [Vigna unguiculata (L.) Walp., 2n = 2x = 22] originated and was domesticated in Africa though the exact location of origin of domestication is still a matter of speculation and different authors suggest different areas in Africa, Northeastern Africa including Ethiopia (1–4), Central Africa (5), South Africa (6), and West Africa (5,7,8). (Line 45 to line 48)

Comment 4. Methods section: a brief description of the sequencing method is recommended.

Response 4. A brief description of the GBS protocol as developed and implemented by DArT was added to the Material and Method under the section “DNA extraction and sequencing”

Comment 5. Authors should provide detailed information on the SNP calling approach, SNP filtering thresholds as well as the filtering of the accessions.

Response 5. One paragraph was inserted in Material and Methods section with respect to data filtering under a new title “SNP calling and data filtering”

Comment 6. Population structure analysis: I would recommend a membership coefficient of ≥ 0.70.

Response 6. The comment accepted and membership coefficient of ≥ 0.70 were used and the results were corrected accordingly.

Comment 7. Authors should also replace the figures with high resolution figures.

Response 7. The comment is accepted, and the figures have been replaced with figures with higher resolution quality.

Response to reviewer comments, August 05

The reviewer comments are shown below in black and the response by the authors is provided as response to reviewers in italics

Dear Dr Yao,

Thank you for submitting your manuscript entitled "DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms" to PLOS ONE. Your manuscript files have been checked in-house but before we can proceed we need you to address the following issues:

1) Thank you for updating your data availability statement. You note that your data are available within the Supporting Information files, but no such files have been included with your submission. At this time we ask that you please upload your minimal data set as a Supporting Information file, or to a public repository such as Figshare or Dryad.

Please also ensure that when you upload your file you include separate captions for your supplementary files at the end of your manuscript.

As soon as you confirm the location of the data underlying your findings, we will be able to proceed with the review of your submission.

Response 1 to reviewers: The genotyping data set was included as supporting information and a caption was added at the end of the manuscript under the title: S3: Genotyping supporting data obtained after filtering used for the study

2) Please amend your list of authors on the manuscript to ensure that each author is linked to an affiliation.

We note that you have included affiliation numbers 1-6 and *, ¶ however only affiliations 1-6 and * have authors linked to them. Please amend affiliation ¶ to link an author to it or remove if added in error.

Response 2 to reviewers: The affiliation “¶” was added by mistake and was removed from the author list

3) Please include a copy of Table 4 which you refer to in your text on page 10.

Response 3 to reviewers: Table 4 and Table 6 should read the same. The numbering as Table 6 came from the fact that S1 and S2 tables were previously included in the manuscript body as Table 4 and Table 5 before been moved as supplementary files. Then the actual Table 6 was amended as Table 4 in the table caption and in the text body.

Attachment

Submitted filename: Combined response to Reviewers_August 13.docx

Decision Letter 1

Tzen-Yuh Chiang

1 Sep 2020

DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms

PONE-D-19-32934R1

Dear Dr. Yao,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Tzen-Yuh Chiang

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: Authors have addressed the comments. I recommend to italicize the scientific name of the species in the references section.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Acceptance letter

Tzen-Yuh Chiang

28 Sep 2020

PONE-D-19-32934R1

DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms

Dear Dr. Yao:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Tzen-Yuh Chiang

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Collection region, number of accession from each region and altitudinal ranges of the test cowpea genetic resources.

    (DOCX)

    S2 Table. Distribution and genetic diversity parameters of 6498 SNPs measured in a set of 357 cowpea genotypes.

    (DOCX)

    S1 File. GPS coordinates data used to construct the map showing the collection sites (Fig 1).

    (XLSX)

    S2 File. Genotyping supporting data after filtering used for the study.

    (TXT)

    Attachment

    Submitted filename: Declaration_ SK.pdf

    Attachment

    Submitted filename: Combined response to Reviewers_August 13.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES