Skip to main content
Frontiers in Plant Science logoLink to Frontiers in Plant Science
. 2014 May 13;5:185. doi: 10.3389/fpls.2014.00185

Developing market class specific InDel markers from next generation sequence data in Phaseolus vulgaris L.

Samira Mafi Moghaddam 1,2, Qijian Song 3, Sujan Mamidi 1,2, Jeremy Schmutz 4, Rian Lee 2, Perry Cregan 3, Juan M Osorno 2, Phillip E McClean 1,2,*
PMCID: PMC4026720  PMID: 24860578

Abstract

Next generation sequence data provides valuable information and tools for genetic and genomic research and offers new insights useful for marker development. This data is useful for the design of accurate and user-friendly molecular tools. Common bean (Phaseolus vulgaris L.) is a diverse crop in which separate domestication events happened in each gene pool followed by race and market class diversification that has resulted in different morphological characteristics in each commercial market class. This has led to essentially independent breeding programs within each market class which in turn has resulted in limited within market class sequence variation. Sequence data from selected genotypes of five bean market classes (pinto, black, navy, and light and dark red kidney) were used to develop InDel-based markers specific to each market class. Design of the InDel markers was conducted through a combination of assembly, alignment and primer design software using 1.6× to 5.1× coverage of Illumina GAII sequence data for each of the selected genotypes. The procedure we developed for primer design is fast, accurate, less error prone, and higher throughput than when they are designed manually. All InDel markers are easy to run and score with no need for PCR optimization. A total of 2687 InDel markers distributed across the genome were developed. To highlight their usefulness, they were employed to construct a phylogenetic tree and a genetic map, showing that InDel markers are reliable, simple, and accurate.

Keywords: InDel marker, next generation sequencing, market class, phylogenetics, genetic map, common bean

Introduction

Plant breeding embraces both art and science for crop improvement. Marker assisted selection (MAS) can boost the efficiency of breeding when markers linked to genes of interest are discovered (Yang et al., 2012). Marker development requires the comparison of genetic material of two or more genotypes to find the polymorphic regions that segregate in a breeding population. Thus, it is important to have adequate genetic variation among the genotypes of interest to develop markers that can be used for MAS and other genetic studies. In fact, using MAS to improve a trait of interest in a self-pollinating species like common bean is becoming more challenging today in the United States because of the narrow genetic diversity of this species (McClean et al., 1993; Sonnante et al., 1994).

Common bean is a diploid legume species with 11 chromosomes, a genome size of approximately 520 megabase pairs (Mbp), a few duplicated loci (Vallejos et al., 1992; Freyre et al., 1998) and 49% transposable elements (Schlueter et al., 2008). A reference genome sequence of genotype G19833 was recently released in August (Schmutz et al., in press). Common bean has two distinct gene pools, Middle American (from northern Mexico to Colombia) and Andean (from southern Peru to northwestern Argentina). Each gene pool underwent separate domestication events (Gepts and Bliss, 1986; Koenig and Gepts, 1989; Khairallah et al., 1990, 1992; Koinange and Gepts, 1992; Freyre et al., 1996) followed by the creation of ecogeographic races in each of the two gene pools due to further selection under domestication (Singh et al., 1991; Beebe et al., 2000; Diaz and Blair, 2006; Mamidi et al., 2011). Mamidi et al. (2011) estimated the duration and time of the single domestication event in each gene pool and suggested reciprocal migration between wild and landrace genotypes. There is a strong genetic differentiation between Middle American and Andean gene pools, and the Middle American gene pool is more diverse compared to the Andean gene pool (Mamidi et al., 2013). According to Singh et al. (1991), the Middle American gene pool with the center of domestication in Central and North America consist of three races, Durango, Jalisco, and Mesoamerican. The typical commercial market classes in the United States from this gene pool are pinto, great northern (GN), small red and pink beans from race Durango, and navy, small white and black beans from race Mesoamerican. The Andean gene pool with its center of domestication in South America includes three races: Nueva Granada, Peru, and Chile. The commercial market classes of this gene pool in the United States are light red kidney (LRK), dark red kidney (DRK), white kidney, and cranberry beans which are all from the Nueva Granada race (Mensack et al., 2010).

Breeding for commercial varieties in common bean usually occurs within each market class in order to retain their preferred seed size, shape, color, and pattern. The narrow genetic diversity within a market class is due to the small size of bottleneck populations (Gepts and Bliss, 1986), the rigid quality required by processors and consumers (Ghaderi et al., 1984; Wang et al., 1988; Hosfield et al., 2000; Myers, 2000), the finite use of exotic germplasm (Silbernagel and Hannan, 1988, 1992; Miklas, 2000), and the restricted breeding strategies to meet consumer satisfaction regarding seed size, shape, and color (Singh, 1992). Although incorporation of exotic and unadapted germplasm is helpful in enhancement of genetic diversity, maintenance of the necessary phenotypic characteristics of each market class is challenging when using novel sources of variation due to linkage drag. Indeed, it has been documented in multiple plant species that quantitative traits are affected by genetic background (Tanksley and Hewitt, 1988; Doebley et al., 1995; Lark et al., 1995; Cockerham and Zeng, 1996; Li et al., 1997, 1998). This indicates the need for market class specific marker development to facilitate bean improvement by monitoring the variation that exists in each market class.

Most of the currently available markers for common bean are Sequence Characterized Amplified Region (SCAR) markers developed from Random Amplification of Polymorphic DNA (RAPD) markers through a slow and difficult process (Kelly et al., 2003). Other types of marker systems have been developed and used in different studies in common bean such as Inter Simple Sequence Repeats (ISSR) (Gonzalez et al., 2005), Simple Sequence Repeats (SSR) (Blair et al., 2003; Gomez et al., 2004; Buso et al., 2006; Galeano et al., 2009; Cordoba et al., 2010). Recently a high-throughput Golden Gate SNP assay was released by Hyten et al. (2010). However, most of the markers are based on polymorphism among a few genotypes from different market classes or even gene pools. Thus, the development and application of high throughput, user-friendly, market class-specific markers are indispensable.

Insertion-deletions (InDel) are one of the common sources of variation that are distributed widely throughout the genome. Mechanisms such as transposable elements, slippage in simple sequence replication, and unequal crossover can result in the formation of InDels (Britten et al., 2003). They can be converted to user-friendly markers that can be distinguished easily based on their size (Vali et al., 2008) with minimum laboratory equipment. Many genetic studies in plants and animals have successfully utilized InDels (Hayashi et al., 2006; Vali et al., 2008; Vasemagi et al., 2010; Ollitrault et al., 2012). InDels and SNPs are now the most widely used marker systems in Arabidopsis because they are abundant, PCR-based, and informative due to their co-dominant nature (Pacurar et al., 2012).

Next generation sequencing (NGS) provides inexpensive sequence data needed to develop genetic markers to be used in plant breeding and genetic studies. NGS technologies are efficient and offer new genomic information for minor crops for which a reference genome sequence is not available (Varshney et al., 2009) and accelerates the development of genomic resources for crops with a reference genome. The objective of this study was to use Illumina sequence data from multiple genotypes within five bean market classes, which were selected from both the Andean and the Middle American gene pools, to develop user-friendly InDel markers that have wide applications for MAS and genomic studies.

Materials and methods

Plant materials

Three diverse genotypes from pinto, navy, black, and LRK and two genotypes from DRK market classes where sequenced with the Illumina Genome Analyzer (GAII). Genotypes were chosen based on their divergence in a neighbor joining (NJ) tree that was created for 192 genotypes from nine different market classes using 1159 high quality SNP markers (Hyten et al., 2010). The sequenced genotypes in each market class were as follows: Stampede, Buckskin, and Sierra from the pinto market class; C20, Michelite, and Laker from the navy market class; Cornell 49242, T-39, and UI 906 from the black market class, California Early, Lark and, Kardinal from the LRK market class, and Red Hawk and Fiero from the DRK market class.

Marker development

The first step of marker design was a within genotype de novo assembly of the Illumina GAII DNA sequence data into contigs using Velvet 1.0 (Zerbino and Birney, 2008) with the default settings. BLAST+ (Camacho et al., 2009) was used to discover potential InDels. Using three genotypes within a market class, InDels were discovered by aligning contigs from one genotype as the query against a database consisting of the contigs of the two other genotypes. In addition, a pairwise blastn alignment was performed among all pairs of genotypes within a market class. An e-value cutoff of 1E-50 and a maximum hit of one and two were used in BLAST to obtain the best hit for pair-wise and three-way alignments, respectively. InDels were discovered based on the size differences between the query and the database subject. Several filters were applied to potential InDels: A minimum InDel size of 8 bp was used to ensure an appropriate resolution using agarose gel electrophoresis; unique InDel fragments were assured by blasting InDel fragments against the Phaseolus vulgaris L. scaffold assembly ARRA-V0.9. ARRA-V0.9 was an intermediate scaffold assembly in the whole genome sequencing project. The e-value and maximum hit were set to 1E-50 and 20, respectively. Queries with multiple hits were excluded to decrease the probability of designing markers from repetitive regions. Contigs that contained more than four consecutive Ns were excluded because this could lead to false InDel discovery or false InDel size.

The contigs that contained InDels were aligned using Multalin (Corpet, 1988) to obtain the consensus region around the InDels for primer design. Primers were designed from the consensus sequence in BatchPrimer3 (You et al., 2008). The primer size parameters were set to 22, 26, and 32 bp as the minimum, optimum, and maximum size, respectively, and GC content was set to 35, 50, and 60% as the minimum, optimum, and maximum, respectively. Primer annealing temperature was set to a very narrow range of 67, 68, and 69°C as the minimum, optimum, and maximum temperature, respectively. Finally, the maximum Tm difference between forward and reverse primers was set to 2°C only. The PCR product length was approximatly10 times the InDel size to ensure the PCR products could be adequately separated on a 3% agarose gel for efficient scoring. The product size varied between InDel size (bp) × 10 and [InDel size (bp) × 10] + 10 (bp) for optimum and maximum values, respectively. The minimum product sizes were as listed in Table 1. These stringent criteria were intentional to avoid the need for PCR optimization for each primer set. All primer sets were optimized to amplify with a 55°C annealing temperature.

Table 1.

InDel size and the corresponding minimum product size that was used by BatchPrimer3 for primer design.

InDel size (bp) Minimum product size (bp)
8–9 70
10–11 80
12–14 90
15–17 100
18–22 110
23–26 120
27–29 130
30–36 150

Nomenclature

Including information on marker position on the reference genome in the marker names, provides the user with valuable information on marker distribution. Thus, the markers were named in the format NDSU-IND-NN-XX.XXXX, where NDSU stands for North Dakota State University, IND for InDels and NN for the chromosome number. The Xs represent the physical position on the reference genome (G19833- V1.0) in megabase pairs up to four decimals. For example InDel marker NDSU_IND_07_02.6485 is located on chromosome Pv07 at position 2.6485 Mbp in common bean V1.0 reference genome.

PCR amplification

The PCR protocol used to amplify all InDel markers was: 3 min at 95°C for one cycle, 20 s at 95°C, 30 s at 55°C, and 1 min at 72°C for 45 cycles, and 10 min at 72°C for one cycle. Each PCR reaction consisted of a final concentration of 1× PCR buffer including 0.15 mM MgCl2, 0.5 mM dNTP mix, 0.25 mM forward/reverse primers and 1 unit of Taq polymerase with a 20 μl final volume.

Alignment of sequence data with the reference genome

The sequence data from 14 genotypes were mapped to the reference genome (V1.0) when the complete assembly became available. Burrows-Wheeler Aligner (BWA) (Li and Durbin, 2009) with default settings was used to map the reads with the reference genome. SAMtools (Li et al., 2009) was used to convert the BWA output to a sorted bam file and obtain the mpileup file. The “pileup2indel” command with minimum coverage of five reads was used in VarScan (Koboldt et al., 2009) to find the number of InDels for each genotype based on the reference genome (G19833). The VarScan output was also filtered based on the frequency of the variant allele by read count. Only InDels with variant allele frequency of 80% and higher were considered.

Marker performance and application

To evaluate the performance of the designed markers, 219 pinto markers were screened on Stampede, Sierra, Buckskin, and G19833. Moreover, six markers were tested on a few random genotypes from nine market classes (Table 2) to evaluate the performance of InDel markers in the market classes other than the one from which they were originally designed.

Table 2.

Genotypes used to test the performance of six markers in other market classes.

Order* Genotype Market class
1 Domino Black
2 Raven Black
3 T-39 Black
4 Cornell 49242 Black
5 Shania Black
6 Black Knight Black
7 BelMiNeb-RMR-3 Great northern
8 Matterhorn Great northern
9 Tara Great northern
10 Coyne Great northern
11 JM-24 Great northern
12 Gemini Great northern
13 Michelite Navy
14 Sanilac Navy
15 Seafarer Navy
16 Bunsi Navy
17 C20 Navy
18 Laker Navy
19 Pink Floyd Pink
20 Victor Pink
21 Viva Pink
22 Roza Pink
23 Gloria Pink
24 PK915 Pink
25 AC Redbond Small red
26 AC Earlired Small red
27 Sapphire Small red
28 Ember Small red
29 UI-3 Small red
30 NW-63 Small red
31 Sierra Pinto
32 Buckskin Pinto
33 Durango Pinto
34 PT7-2 Pinto
35 Lariat Pinto
36 Hatton Pinto
37 Chinook 2000 Light red kidney
38 K-42 Light red kidney
39 Blush Light red kidney
40 VA-19 Light red kidney
41 Montcalm Dark red kidney
42 USDK-CBB-15 Dark red kidney
43 Fiero Dark red kidney
44 CDRK Dark red kidney
45 Benton Snap bean
46 91-G Snap bean
47 Harvester Snap bean
48 Cantar Snap bean

Six random genotypes were selected from pinto, great northern, navy, black, pink, and small red market classes and four random genotypes were selected from dark and light red kidney and snap bean marker classes.

*

Indicates the order of the genotypes from left to right on the agarose gel in Figure 3.

To assess the InDel markers for applied genetic studies, 196 markers were used to screen 24 diverse pinto genotypes. The 24 pinto genotypes were as follow: Sierra, Aztec, Santa Fe, La Paz, Stampede, ND-307, Medicine Hat, Lariat, BelDakMe-RR-5, Sequoia, Remington, Durango, Max, PT7-2, Ouray, JM-126, Olathe, Hatton, Apache, UI-114, Nodak, Buckskin, Flint, and UI-196. These genotypes were chosen based on their diversity in a NJ tree which was constructed in ClustalX 2.1 (Larkin et al., 2007) using 1300 SNP markers (Hyten et al., 2010). The number of bootstraps used in ClustalX was 1000 and the genotypes were chosen from clusters that were diverse and had high bootstrap values in the tree. The 24 pinto genotypes were screened using the 196 markers and the markers that showed polymorphism were used to evaluate the performance of the InDel markers for distinguishing the 24 genotype and to construct a NJ tree. PowerMarker version 3.25 (Liu and Muse, 2005) was used to construct the NJ tree with 1000 bootstraps. The Fst value was calculated in PowerMarker as well to evaluate the overall genetic divergence among this collection of genotypes.

In addition, an F2 population was used to evaluate the InDel markers for mapping purposes. The F2 population, NDZ-11002 was derived from a cross between Lariat × Medicine Hat and consisted of 87 F2 genotypes. Eighty two pinto markers that were polymorphic between the two parents were employed to screen the F2 population, and CarthaGène (De Givry et al., 2005) was used to build the genetic map. In CarthaGène, the “group” command was used with a distance and LD threshold of 20 cM and 3.00, respectively to group markers into linkage groups. The “Buildfw 2 2 {} 1” command was used to order the markers on each linkage group and to obtain the map with the highest likelihood value. Qgene (Nelson, 1997) was then used to visualize the map images.

Multiplexing

Multiplexing of InDel markers was conducted using two and four markers in the same reaction mix. A total of one duplex and six fourplex sets were tested on 96 Middle American genotypes. The protocol for four markers in a 20 μl reaction mix was as follow: 1× PCR Buffer including 0.15 mM MgCl2, 0.8 mM dNTP, 0.125 mM of each forward and reverse primer [total of 0.5 mM (0.125 × 4) forward and reverse primers] and 2 units of Taq polymerase. The protocol for multiplexing two markers was the same as above with an exception that 0.25 mM of each forward and reverse primer [total of 0.5 mM (0.25 × 2)] was used. The PCR amplification cycle was the same as the cycle used to amply a single marker. The resulting amplification products were visualized on a 3% agarose gel.

Results

Illumina sequencing, de novo assembly and primer design

The DNA sequence data consisted of 19.6 billion bases in the form of 114 bp paired-end reads from 250 to 400 bp size selected fragments of 14 genotypes. The 114 bp paired-end reads did not include Illumina adaptor sequences. The sequencing coverage ranged from 1.6× to 5.1× with an average of 3.7×. Laker and Kardinal had the lowest and highest number of raw GAII reads, 3,711,450 and 11,748,671 reads, respectively. Cornell 49242 with 261,313 and Stampede with 752,015 contigs had the smallest and largest number of assembled contigs, respectively. Only contigs 120 bp or greater were used for the assembly statistics. The mean contig length across all 14 genotypes was 322 bp. The N50 varied from 222 to 651 bp, with an average N50 value of 394 bp. The GC content ranged from 32.5 to 40.9% with an average of 34.4%. The minimum GC value of 35% was used to design primers in BatchPrimer3. The Illumina reads and assembly information are summarized in Table 3.

Table 3.

Illumina paired-end reads information and contig information after the de novo assembly.

Illumina GAII reads Assembly
Genotype Market class Number of reads (in millions) Genome coverage (x)* Total length of assembled contigs (Mbp)+ Number of assembled contigs N50 (bp) GC%
Cornell 49242 Black 7.20 3.2 66.97 261,313 300 40.9
T-39 Black 8.42 3.7 178.16 527,965 422 32.6
UI 906 Black 6.44 2.8 107.99 400,652 320 33.3
Red Hawk DRK 7.20 3.2 128.33 427,260 360 35.0
Fiero DRK 7.61 3.3 146.58 470,693 373 34.0
California Early LRK 7.52 3.3 145.26 461,881 387 34.2
Lark LRK 10.30 4.5 199.12 531,149 501 33.8
Kardinal LRK 11.74 5.1 238.71 534,298 651 32.5
C20 Navy 7.84 3.4 100.52 355,778 338 37.6
Michelite Navy 9.47 4.1 170.02 504,441 426 34.3
Laker Navy 3.71 1.6 81.10 381,399 222 33.8
Stampede Pinto 10.20 4.5 214.71 752,015 330 33.2
Sierra Pinto 9.58 4.2 178.36 532,937 409 32.7
Buckskin Pinto 10.45 4.6 190.93 529,875 477 33.7
*

The genome coverage was calculated based on a 521 Mbp genome size and 114 bp paired-end Illumina reads.

+

Contigs 120 bp or greater were used for the assembly statistics.

Alignment of the reads with the V1.0 reference genome for 14 genotypes indicated a range of 2330 to 45,770 InDels of 1 bp or greater across the genome. The minimum and maximum number of InDels belonged to the Laker and Buckskin genotypes, respectively (Table 4).

Table 4.

Number and distribution of InDels in each genotype when aligned with G19833.

Genotype Market class Genome coverage (x) Number of InDels in alignment with G19833 InDel frequency per Mbp Number of InDels in 1× coverage
Cornell 49242 Black 3.2 9226 17.7 2901
T-39 Black 3.7 33,662 64.6 9049
UI 906 Black 2.8 13,681 26.3 4817
Red Hawk DRK 3.2 6899 13.2 2169
Fiero DRK 3.3 8961 17.2 2667
California Early LRK 3.3 7017 13.5 2114
Lark LRK 4.5 17,056 32.7 3749
Kardinal LRK 5.1 27,122 52.1 5226
C20 Navy 3.4 11,205 21.5 3238
Michelite Navy 4.1 31,455 60.4 7525
Laker Navy 1.6 2330 4.5 1421
Stampede Pinto 4.5 37,902 72.8 8404
Sierra Pinto 4.2 32,199 61.8 7612
Buckskin Pinto 4.6 45,770 87.9 9907

The numbers are based on InDel size of 1 bp and greater, and 521 Mbp genome size. VarScan was used to discover the InDel polymorphisms.

The number of aligned contigs (BLAST output) within a market class varied between 296,852 and 859,350 in the three-way alignments. The aligned contigs were filtered based on the InDel size, and those with a tentative size of 8 bp or greater were retained. This significantly reduced the number of sequences to analyze in the next step. For example, the number of contigs in the DRK market class dropped from 296,852 to 1378, and the reduction was from 859,350 to 9634 for pinto market class. Filtering for uniqueness of hits in the common bean reference scaffolds reduced the range of contigs from 450 in DRK to 6010 in pinto, with an average value of 2114 contigs across all market classes. The total number of consensus sequences submitted to BatchPrimer3 across all alignments (three-way and all pair-wise alignments) varied from 11,406 in the pinto to 324 in the DRK market class when fragments containing four consecutive Ns in their sequence were removed (Table 5). The basic properties of 11,406 pinto contigs submitted to BatchPrimer3 are summarized in Table 6.

Table 5.

Filtering criteria for contigs used for primer design.

Number of contigs with InDels in three-way alignment Numbers across all alignments
Market classes Blast hits InDel size ≥8 Unique scaffold hit/InDel ≥8 Contigs submitted to Batch primer3 Number of primer pairs
Pinto 859,350 9634 6010 11,406 1343
Black 342,085 1955 708 1913 292
Navy 507,147 1515 650 1867 323
LRK 834,726 5524 2754 5456 669
DRK 296,852 (pair-wise) 1378 450 324 60

The numbers are provided only for three-way alignment in all market classes except the dark red kidney market class.

Table 6.

Pre-analysis of 11,406 pinto contigs submitted to BatchPrimer3.

Item Mean Std. deviation Min Max Coe. of variation (%)
Sequence length (bp) 390.49 174.45 104 2680 44.67
GC contents (%) 29.04 5.92 4.45 53.21 20.38

The final number of primer pairs ranged from 1343 in the pinto market class (highest) to 60 in the DRK market class (lowest) (Table 5). There were a total of 2687 primer pairs designed across all market classes (Supplementary Table), all having single hits on the common bean reference genome V1.0 and only six homologs to common bean transposable elements (Scott Jackson, personal communication). The distribution of InDel sizes in each market class is illustrated in Figure 1. The average distribution of InDel markers varied from one per132 Kb on chromosome Pv05 to one per 314 Kb on chromosome Pv03 with an average of one InDel marker every 200 Kb across the genome. Although the euchromatic region forms less than half of the bean genome (44.1%), the marker density was higher in this region (65.8%) (Figure 2). Markers located in the pericentromeric region are highlighted in orange color in the Supplementary Table. The number of markers on the chromosomes varied from 144 to 333 on chromosomes Pv10 and Pv02, respectively, with an average value of 238 markers per chromosome.

Figure 1.

Figure 1

Distribution of 2687 InDle sizes in five market classes.

Figure 2.

Figure 2

Physical distribution of 2687 InDel markers across 11 chromosomes of common bean. The x axis shows the chromosome length in Mbp and the y axis represents the frequency of InDel markers. The green rectangle indicates the pericentromeric region in each chromosome.

Marker performance

To evaluate the performance of the InDel markers, a total of 219 markers from the pinto market class were tested with Stampede, Sierra, Buckskin, and G19833. A total of 196 markers showed polymorphism (89.5%), and only 23 markers (10.5%) were either monomorphic or difficult to score among four genotypes. Six markers from four market classes were tested on 48 random genotypes from nine different market classes as well. Although the primers were originally designed for a specific market class, we observed polymorphism among the genotypes from other market classes. Based on a sample of genotypes, markers from the Middle American gene pool did not show polymorphism in light and DRK market classes and the marker from the LRK market class showed polymorphism only among Andean genotypes (Figure 3).

Figure 3.

Figure 3

Six InDel markers tested on random genotypes from nine different market classes. The names of the genotypes, from left to right, are listed in Table 2. (A) Marker NDSU_IND_07_02.6485 from pinto market class. (B) Marker NDSU_IND_10_42.1355 from pinto market class. (C) Marker NDSU_IND_06_12.3324 from light red kidney market class. (D) Marker NDSU_IND_05_01.7405 from pinto market class. (E) Marker NDSU_IND_08_36.2119 from navy market class. (F) Marker NDSU_IND_09_07.6278 from black market class. First lane from right in all panels is the DNA Ladder.

Marker application

Phylogenetic analysis

A set of 196 InDel markers were used to screen 24 diverse pinto genotypes (Table 7) and 172 (87.7%) were polymorphic and used in a phylogenetic analysis. The NJ tree and the Fst value indicated two distinct clusters among the 24 pinto genotypes (Figure 4).

Table 7.

Specifications of 24 pinto genotypes.

Genotype Growth habit Application date Source
PT7-2 II to III Not released USDA-ARS-Washington
Sequoia IIb NA ISB
Max III NA ISB
Santa Fe IIa 2010 MSU
Stampede IIa 2008 NDSU
La Paz IIb 2008 ProVita, Inc.
ND-307 IIb 2008 NDSU
Lariat IIb 2008 NDSU
Medicine Hat IIa 2007 Seminis
Durango IIb 2007 ProVita, Inc.
Remington IIb 1996 RogersSeedCo
Hatton IIIa 1996 NDSU
Buckskin IIIb 1995 Novartis Seed Inc.
Apache IIIa 1995 ISB
Aztec IIb 1993 MSU
BelDakMi-RR-5 II 1993 USDA-ARS-Beltsville-MD
Sierra IIb 1990 MSU
UI-196 IIIb 1990 UI
Flint II to IIIb 1989 RogersSeedCo
JM-126 III to IIIa 1986 USDA-ARS/WSU
Nodak III 1985 USDA-ARS/NDSU
Olathe III 1980 CSU
Ouray III to IIIa 1975 CSU
UI-114 III 1967 UI

NA, Information not available; ISB, Idaho Seed Bean Company; MSU, Michigan State University; NDSU, North Dakota State University; UI, University of Idaho; WSU, Washington State University; CSU, Colorado State University.

Figure 4.

Figure 4

Neighbor joining tree of 24 pinto genotypes that cluster into two distinct groups (i) newer varieties with type II growth habit and (ii) older varieties with type III growth habit. The Fst value of 0.16 indicates the degree of variation between the two groups. Bootstrap values greater than 50% are shown on the nodes.

Genetic map

Eighty two polymorphic InDel markers where used to construct a genetic map. A total of nine linkage groups that correspond to nine chromosomes (all chromosomes except one and four) were built from the F2 population with 87 genotypes. Five pairs of markers co-segregated in CarthaGène analysis. Among the 77 remaining markers, 18 were excluded from the genetic map when the “Buildfw” function of CarthaGène with a LOD threshold of 2.2 was used. The genetic and physical order was consistent for 54 of the 59 marker loci (Figure 5).

Figure 5.

Figure 5

Correspondence between genetic and physical positions. The pink bars are linkage groups and the blue bars are the chromosomes with the physical positions of the InDel markers on their right side. The sizes of the chromosomes are proportional to their actual size.

Multiplexing

Multiplexing of tested InDel markers showed clear and scorable bands on the 3% agarose gel when one set of duplex and six sets of fourplex were used. Figure 6 illustrates the results of two fourplex sets on 48 Middle American genotypes on a 3% agarose gel as an example. The marker information is provided in Table 8.

Figure 6.

Figure 6

Multiplexing of markers on 48 Middle American bean genotypes showed distinct bands on the 3% agarose gel electrophoresis. (A) Amplification products using InDel markers NDSU_IND_05_37.2272, NDSU_IND_06_16.5002, NDSU_IND_11_30.9655, and NDSU_IND_06_31.8021 on 48 bean genotypes. All four markers showed polymorphism. (B) Amplification products using InDel markers NDSU_IND_11_33.0572, NDSU_IND_07_42.1709, NDSU_IND_07_25.1928, and NDSU_IND_10_19.1957 on the same 48 bean genotypes. Marker NDSU_IND_07_25.1928 was monomorphic and the other three were polymorphic. The first lane from the right in (A,B) are the DNA Ladders.

Table 8.

Specifications of InDel markers used for multiplexing (two sets of fourplex).

graphic file with name fpls-05-00185-i0001.jpg

Marker shaded in gray was monomorphic.

*

One fourplex marker set mixed in one PCR reaction.

+

The second fourplex marker set mixed in one PCR reaction.

Discussion

Common bean is a diverse crop species with much variation in the seed color, shape, and many other phenotypic characteristics. The species includes wild types, landraces which are the domesticated forms, ecogeographical races which are the result of selection, and market classes within each of the ecogeographical races. Plant breeding is generally restricted to market classes to retain the specific characteristics of the market class. However, as indicated by the high polymorphism rate (87.7%) based upon our analysis of 24 genotypes of the pinto market class, InDel markers appear to be polymorphic even within a market class. InDel markers are easy to use co-dominant markers and are present throughout the genome. With the availability of abundant next generation sequence data, identification of InDels has become a simple process.

Marker development

We selected diverse genotypes in each market class based on the most comprehensive SNP dataset available. The Illumina GA II was used to generate paired end reads. The Illumina technology results in short reads but high coverage (Vera et al., 2008) as well as high quality data where 70% of base calls in 2 × 75 bp paired-end sequences have a quality score of Q30 or higher. The standard paired-end libraries of Illumina with a length between 200 and 500 bp can provide a platform to identify large and small InDels, inversions and other rearrangements. Paired-end reads boost the robustness of de novo assembly, SNP identification, and InDel discovery.

In this study we developed a genome wide collection of 2687 InDel markers that can be amplified without any PCR optimization and with minimum lab equipment. One of the filtering criteria that dramatically decreased the number of markers was the InDel size. There was a 215-fold decrease in the number of potential InDels in DRK when the contigs from the BLAST output were filtered for a minimum InDel size of 8 bp, and this reduction was about 89-fold for the pinto market class in the three-way alignment. However, filtering for uniqueness of hits to the common bean reference scaffolds (ARRA-V0.9) did not cause a dramatic reduction. For example, the number of contigs in the DRK market class dropped to one third, and this reduction was only 1.6-fold for the pinto market class in the three-way alignment. Generally there were less InDels in the DRK market class possibly due to the presence of only two sequenced genotypes in this market class. Moreover, Andean types are reported to have narrower genetic diversity compared to the Mesoamerican genotypes (Beebe et al., 2001). The stringent primer design criteria also resulted in another huge drop in the number of primer pairs that were selected. These stringent criteria were necessary because the development of markers should be precise and cost effective with proper throughput (Jander et al., 2002).

Several factors affect the discovery of functional InDel markers. As observed in Arabidopsis, decreasing InDel size from 25 to 6 bp increased the number of markers from 277 to 1073 (Salathia et al., 2007; Hou et al., 2010). The phylogenetic relationship between the genotypes used for InDel discovery is important. Kardinal, an Andean genotype, is more closely related to the reference genome (G19833), another Andean genotype, than Buckskin, a Middle American genotype. Less Kardinal InDels were observed than Buckskin InDels (27,122 vs. 45,770) even thought it had greater read coverage (5.1× vs. 4.6×). This trend was observed for all genotypes: more InDels were discovered among Middle American genotypes than the Andean genotypes because the reference genome is of Andean origin (Table 4).

In total we discovered 2687 InDel markers for an average of one per 200 Kb. The fact that they are preferentially distributed in the highly recombinogenic region of the genome increases their utility for multiple genetic analyses.

Marker application

In our study, 87.7% of the 196 markers that were used in the phylogenetic analysis of 24 pinto genotypes were polymorphic. In the NJ tree, the pintos were separated based on plant architecture and the application/release date of the variety. Indeed, newer, upright pintos clearly clustered separately from older, prostrate pintos with fixation index (Fst) of 0.16 which indicates a great degree of genetic divergence among subpopulations (Hartl and Clark, 1997). This might be due to selection for growth habit in bean breeding programs where the newer breeding programs prefer the upright beans since this trait offers several advantages such as ease of management, higher grain yield, and reduced disease issues (Cunha et al., 2005). InDel markers have been used for phylogenetic studies. Steele et al. (2008) used InDel polymorphisms in rice to separate Basmati genotypes from other genotypes. Ollitrault et al. (2012) showed that citrus diversity and phylogenetics based on InDel data are consistent with those based on SSR markers.

The observation that InDel markers developed from one market class showed polymorphism in the other market classes indicates their broad utility. This denotes that although these InDel markers were designed to capture the variation within each market class, their performance and application can be expanded to the entire bean germplasm.

The InDel markers should be useful for genetic map construction because there are on average about 200 markers on each chromosome. We used a relatively small mapping population to illustrate their application for linkage analysis. Generally, recombination occurs more frequently in regions distal to the centromeric region (Curtis and Lukaszewski, 1991; Tanksley et al., 1992; Werner et al., 1992). Because of low marker density and a small number of recombination events, our map did not cover the centromeric blocks in the F2 mapping study, resulting in mapping only a portion of the chromosome or of two clusters of markers, one from each arm. There were five discrepancies in our genetic map relative to the physical map. Marker order differences between the genetic and physical map or genetic maps from different populations or marker systems has been observed in other studies as well (Snelling et al., 2007; Wei et al., 2007; Xia et al., 2007). These differences could be a result of sequence assembly errors, inversions, and segregation distortion.

The possibility of conducting multiplex PCR is another indicator of the broad utility of InDel markers. With multiplexing, genotyping is even more cost effective due to reduced amount of reagents and DNA quantity needed for PCR amplification. Moreover, this method saves time when hundreds of markers are screened and broadens the coverage when DNA availability is limited (Edwards and Gibbs, 1994; Karaiskou and Primmer, 2008). Our InDel markers meet many of the criteria that Henegariu et al. (1997) mentioned as critical parameters in a multiplex PCR. According to their study, some of the basic principles include the appropriate primer length which should be 18–34 bp or higher and all of our primers are designed with a length of 26–32 bp. Henegariu et al. (1997) also reported that by increasing the primer length up to 28–30 bp, the annealing temperature could be increased resulting in a reduction of non-specific PCR products. GC content of 35–60% and annealing temperature between 55 and 58°C are other basic principles of multiplex PCR. The primers we designed have a minimum GC content of 35%, and all amplify at 55°C. Henegariu et al. (1997) indicated 54°C as the optimum temperature for co-amplification of loci in the multiplex PCR. Although the probability of non-specific product amplification increases at this temperature, simultaneous amplification of many specific loci greatly suppresses the yield of non-specific amplification products due to limited enzyme and nucleotide resources.

In conclusion, this study shows the usefulness of DNA sequence data as the raw material for primer development in the presence or absence of a reference genome. We show that contigs obtained from the de novo assembly of sequence data are sufficient for polymorphism discovery. However, without a completely assembled reference genome or a set of primary scaffolds, contigs cannot be filtered to eliminate the duplicate loci or transposable elements. The reference sequence could reduce the development of redundant markers and allow the determination of the exact physical position and order of the markers. The availability of high density markers affects the success of genetic map construction, map-based cloning (Pacurar et al., 2012) and diversity studies. The availability of 2687 InDel primers will enhance MAS and diversity studies in common bean.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This project was supported by Agriculture and Food Research Initiative Competitive Grants no. 2009-01929 and 2010-03612 from the USDA National Institute of Food and Agriculture.

Supplementary material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/journal/10.3389/fpls.2014.00185/abstract

References

  1. Beebe S., Rengifo J., Gaitan E., Duque M. C., Tohme J. (2001). Diversity and origin of Andean landraces of common bean. Crop Sci. 41, 854–862 10.2135/cropsci2001.413854x [DOI] [Google Scholar]
  2. Beebe S., Skroch P. W., Tohme J., Duque M. C., Pedraza F., Nienhuis J. (2000). Structure of genetic diversity among common bean landraces of Middle American origin based on correspondence analysis of RAPD. Crop Sci. 40, 264–273 10.2135/cropsci2000.401264x [DOI] [Google Scholar]
  3. Blair M. W., Pedraza F., Buendia H. F., Gaitan-Solis E., Beebe S. E., Gepts P., et al. (2003). Development of a genome-wide anchored microsatellite map for common bean (Phaseolus vulgaris L.). Theor. Appl. Genet. 107, 1362–1374 10.1007/s00122-003-1398-6 [DOI] [PubMed] [Google Scholar]
  4. Britten R. J., Rowen L., Williams J., Cameron R. A. (2003). Majority of divergence between closely related DNA samples is due to indels. Proc. Natl. Acad. Sci. U.S.A. 100, 4661–4665 10.1073/pnas.0330964100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Buso G. S. C., Amaral Z. P. S., Brondani R. P. V., Ferreira M. E. (2006). Microsatellite markers for the common bean Phaseolus vulgaris. Mol. Ecol. Notes 6, 252–254 10.1111/j.1471-8286.2006.01210.x18050090 [DOI] [Google Scholar]
  6. Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., et al. (2009). BLAST+: architecture and applications. BMC Bioinformatics 10:421 10.1186/1471-2105-10-421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Cockerham C. C., Zeng Z. B. (1996). Design III with marker loci. Genetics 143, 1437–1456 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cordoba J. M., Chavarro C., Schlueter J. A., Jackson S. A., Blair M. W. (2010). Integration of physical and genetic maps of common bean through BAC-derived microsatellite markers. BMC Genomics 11:436 10.1186/1471-2164-11-436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Corpet F. (1988). Multiple sequence alignment with hierarchical-clustering. Nucleic Acids Res. 16, 10881–10890 10.1093/nar/16.22.10881 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cunha W. G. D., Ramalho M. A. P., Abreu Â. D. F. B. (2005). Selection aiming at upright growth habit common bean with carioca type grains. Crop Breed. Appl. Biotechnol. 5, 379–386 10.12702/1984-7033.v05n04a02 [DOI] [Google Scholar]
  11. Curtis C. A., Lukaszewski A. J. (1991). Genetic-linkage between C-bands and storage protein genes in chromosome-1b of tetraploid wheat. Theor. Appl. Genet. 81, 245–252 10.1007/BF00215730 [DOI] [PubMed] [Google Scholar]
  12. De Givry S., Bouchez M., Chabrier P., Milan D., Schiex T. (2005). CAR(H)(T)AGene: multipopulation integrated genetic and radiation hybrid mapping. Bioinformatics 21, 1703–1704 10.1093/bioinformatics/bti222 [DOI] [PubMed] [Google Scholar]
  13. Diaz L. M., Blair M. W. (2006). Race structure within the Mesoamerican gene pool of common bean (Phaseolus vulgaris L.) as determined by microsatellite markers. Theor. Appl. Genet. 114, 143–154 10.1007/s00122-006-0417-9 [DOI] [PubMed] [Google Scholar]
  14. Doebley J., Stec A., Gustus C. (1995). Teosinte branched1 and the origin of maize: evidence for epistasis and the evolution of dominance. Genetics 141, 333–346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Edwards M. C., Gibbs R. A. (1994). Multiplex PCR: advantages, development, and applications. PCR Methods Appl. 3, S65–S75 10.1101/gr.3.4.S65 [DOI] [PubMed] [Google Scholar]
  16. Freyre R., Rios R., Guzman L., Debouck D. G., Gepts P. (1996). Ecogeographic distribution of Phaseolus spp (Fabaceae) in Bolivia. Econ. Bot. 50, 195–215 10.1007/BF02861451 [DOI] [Google Scholar]
  17. Freyre R., Skroch P. W., Geffroy V., Adam-Blondon A. F., Shirmohamadali A., Johnson W. C., et al. (1998). Towards an integrated linkage map of common bean. 4. Development of a core linkage map and alignment of RFLP maps. Theor. Appl. Genet. 97, 847–856 10.1007/s001220050964 [DOI] [Google Scholar]
  18. Galeano C. H., Fernandez A. C., Gomez M., Blair M. W. (2009). Single strand conformation polymorphism based SNP and Indel markers for genetic mapping and synteny analysis of common bean (Phaseolus vulgaris L.). BMC Genomics 10:629 10.1186/1471-2164-10-629 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gepts P., Bliss F. A. (1986). Phaseolin variability among wild and cultivated common beans (Phaseolus vulgaris) from Colombia. Econ. Bot. 40, 469–478 10.1007/BF02859660 [DOI] [Google Scholar]
  20. Ghaderi A., Hosfield G. L., Adams M. W., Uebersax M. A. (1984). Variability in culinary quality, component interrelationships, and breeding implications in navy and pinto beans. J. Am. Soc. Horticult. Sci. 109, 85–90 [Google Scholar]
  21. Gomez O. J., Blair M. W., Frankow-Lindberg B. E., Gullberg U. (2004). Molecular and phenotypic diversity of common bean landraces from Nicaragua. Crop Sci. 44, 1412–1418 10.2135/cropsci2004.1412 [DOI] [Google Scholar]
  22. Gonzalez A., Wong A., Delgado-Salinas A., Papa R., Gepts P. (2005). Assessment of inter simple sequence repeat markers to differentiate sympatric wild and domesticated populations of common bean. Crop Sci. 45, 606–615 10.2135/cropsci2005.0606 [DOI] [Google Scholar]
  23. Hartl D. L., Clark A. G. (1997). Principles of Population Genetics. Sunderland, MA: Sinauer Associates [Google Scholar]
  24. Hayashi K., Yoshida H., Ashikawa I. (2006). Development of PCR-based allele-specific and InDel marker sets for nine rice blast resistance genes. Theor. Appl. Genet. 113, 251–260 10.1007/s00122-006-0290-6 [DOI] [PubMed] [Google Scholar]
  25. Henegariu O., Heerema N. A., Dlouhy S. R., Vance G. H., Vogt P. H. (1997). Multiplex PCR: Critical parameters and step-by-step protocol. Biotechniques 23, 504–511 [DOI] [PubMed] [Google Scholar]
  26. Hosfield G. L., Uebersax M. A., Occena L. G. (2000). Technological and genetic improvements in dry bean quality and utilization, in Bean Reseach, Production and Utilization. Proceeding of Idaho Bean Workshop, ed Singh S. P. (Moscow: University of Idaho; ), 135–152 [Google Scholar]
  27. Hou X. H., Li L. C., Peng Z. Y., Wei B. Y., Tang S. J., Ding M. Y., et al. (2010). A platform of high-density INDEL/CAPS markers for map-based cloning in Arabidopsis. Plant J. 63, 880–888 10.1111/j.1365-313X.2010.04277.x [DOI] [PubMed] [Google Scholar]
  28. Hyten D. L., Song Q. J., Fickus E. W., Quigley C. V., Lim J. S., Choi I. Y., et al. (2010). High-throughput SNP discovery and assay development in common bean. BMC Genomics 11:475 10.1186/1471-2164-11-475 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Jander G., Norris S. R., Rounsley S. D., Bush D. F., Levin I. M., Last R. L. (2002). Arabidopsis map-based cloning in the post-genome era. Plant Physiol. 129, 440–450 10.1104/pp.003533 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Karaiskou N., Primmer C. (2008). PCR multiplexing for maximising genetic analyses with limited DNA samples: an example in the collared flycatcher, Ficedula albicollis. Ann. Zool. Fenn. 45, 478–482 10.5735/086.045.0602 [DOI] [Google Scholar]
  31. Kelly J. D., Gepts P., Miklas P. N., Coyne D. P. (2003). Tagging and mapping of genes and QTL and molecular marker-assisted selection for traits of economic importance in bean and cowpea. Field Crops Res. 82, 135–154 10.1016/S0378-4290(03)00034-0 [DOI] [Google Scholar]
  32. Khairallah M. M., Adams M. W., Sears B. B. (1990). Mitochondrial DNA polymorphisms of Malawian bean lines: further evidence for 2 major gene pools. Theor. Appl. Genet. 80, 753–761 10.1007/BF00224188 [DOI] [PubMed] [Google Scholar]
  33. Khairallah M. M., Sears B. B., Adams M. W. (1992). Mitochondrial restriction fragment length polymorphisms in wild Phaseolus vulgaris L: insights on the domestication of the common bean. Theor. Appl. Genet. 84, 915–922 [DOI] [PubMed] [Google Scholar]
  34. Koboldt D. C., Chen K., Wylie T., Larson D. E., McLellan M. D., Mardis E. R., et al. (2009). VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 25, 2283–2285 10.1093/bioinformatics/btp373 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Koenig R., Gepts P. (1989). Allozyme diversity in wild Phaseolus vulgaris: further evidence for 2 Major centers of genetic diversity. Theor. Appl. Genet. 78, 809–817 10.1007/BF00266663 [DOI] [PubMed] [Google Scholar]
  36. Koinange E. M. K., Gepts P. (1992). Hybrid weakness in wild Phaseolus vulgaris L. J. Hered. 83, 135–139 [Google Scholar]
  37. Lark K. G., Chase K., Adler F., Mansur L. M., Orf J. H. (1995). Interactions between quantitative trait loci in soybean in which trait variation at one locus is conditional upon a specific allele at another. Proc. Natl. Acad. Sci. U.S.A. 92, 4656–4660 10.1073/pnas.92.10.4656 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Larkin M. A., Blackshields G., Brown N. P., Chenna R., McGettigan P. A., McWilliam H., et al. (2007). Clustal W and clustal X version 2.0. Bioinformatics 23, 2947–2948 10.1093/bioinformatics/btm404 [DOI] [PubMed] [Google Scholar]
  39. Li H., Durbin R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Li Z. K., Pinson S. R. M., Park W. D., Paterson A. H., Stansel J. W. (1997). Epistasis for three grain yield components in rice (Oryza sativa L). Genetics 145, 453–465 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Li Z. K., Pinson S. R. M., Stansel J. W., Paterson A. H. (1998). Genetic dissection of the source-sink relationship affecting fecundity and yield in rice (Oryza sativa L.). Mol. Breed. 4, 419–426 10.1023/A:1009608128785 [DOI] [Google Scholar]
  43. Liu K. J., Muse S. V. (2005). PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 21, 2128–2129 10.1093/bioinformatics/bti282 [DOI] [PubMed] [Google Scholar]
  44. Mamidi S., Rossi M., Annam D., Moghaddam S., Lee R., Papa R., et al. (2011). Investigation of the domestication of common bean (Phaseolus vulgaris) using multilocus sequence data. Funct. Plant Biol. 38, 953–967 10.1071/FP11124 [DOI] [PubMed] [Google Scholar]
  45. Mamidi S., Rossi M., Moghaddam S. M., Annam D., Lee R., Papa R., et al. (2013). Demographic factors shaped diversity in the two gene pools of wild common bean Phaseolus vulgaris L. Heredity 110, 267–276 10.1038/hdy.2012.82 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. McClean P. E., Myers J. R., Hammond J. J. (1993). Coefficient of parentage and cluster analysis of North American dry bean cultivars. Crop Sci. 33, 190–197 10.2135/cropsci1993.0011183X003300010034x [DOI] [Google Scholar]
  47. Mensack M. M., Fitzgerald V. K., Ryan E. P., Lewis M. R., Thompson H. J., Brick M. A. (2010). Evaluation of diversity among common beans (Phaseolus vulgaris L.) from two centers of domestication using ‘omics’ technologies. BMC Genomics 11:686 10.1186/1471-2164-11-686 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Miklas P. N. (2000). Use of Phaseolus germplasm in breeding pinto, great northern, pink, and red bean for the Pacific Northwest and intermountain region, in Bean Research, Production and Utilization. Proceeding of the Idaho Bean Workshop, ed Singh S. P. (Moscow: University of Idaho; ), 13–29 [Google Scholar]
  49. Myers J. R. (2000). Tomorrow's snap bean cultivars, in Bean Reseach, Production and Utilization. Proceeding Idaho Bean Workshop, ed Singh S. P. (Moscow: University of Idaho; ), 39–51 [Google Scholar]
  50. Nelson J. C. (1997). QGENE: software for marker-based genomic analysis and breeding. Mol. Breed. 3, 239–245 10.1023/A:1009604312050 [DOI] [Google Scholar]
  51. Ollitrault F., Terol J., Martin A. A., Pina J. A., Navarro L., Talon M., et al. (2012). Development of indel markers from Citrus clementina (Rutaceae) BAC-end sequences and interspecific transferability in Citrus. Am. J. Bot. 99, E268–E273 10.3732/ajb.1100569 [DOI] [PubMed] [Google Scholar]
  52. Pacurar D. I., Pacurar M. L., Street N., Bussell J. D., Pop T. I., Gutierrez L., et al. (2012). A collection of INDEL markers for map-based cloning in seven Arabidopsis accessions. J. Exp. Bot. 63, 2491–2501 10.1093/jxb/err422 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Salathia N., Lee H. N., Sangster T. A., Morneau K., Landry C. R., Schellenberg K., et al. (2007). Indel arrays: an affordable alternative for genotyping. Plant J. 51, 727–737 10.1111/j.1365-313X.2007.03194.x [DOI] [PubMed] [Google Scholar]
  54. Schlueter J. A., Goicoechea J. L., Collura K., Gill N., Lin J. Y., Yu Y., et al. (2008). BAC-end sequence analysis and a draft physical map of the common bean (Phaseolus vulgaris L.) genome. Trop. Plant Biol. 1, 40–48 10.1007/s12042-007-9003-9 [DOI] [Google Scholar]
  55. Schmutz J., McClean P., Mamidi S., Wu G. A., Cannon S. B., Grimwood J., et al. (in press). A reference genome for common bean and genome-wide analysis of dual domestications. Nat. Genet. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Silbernagel M. J., Hannan R. M. (1988). Utilization of genetic resources in the development of commercial bean cultivars in the USA, in Genetic Resources of Phaseolus Beans, ed Gepts P. (Dordrecht: Kluwer; ), 561–596 10.1007/978-94-009-2786-5_23 [DOI] [Google Scholar]
  57. Silbernagel M. J., Hannan R. M. (1992). Use of plant introductions to develop U.S. bean cultivars, in Use of Plant Introductions in Cultivar Development, Part 2, eds Shands H., Weisner L. E. (Madison, WI: CSSA Special Publication; ), 1–8 [Google Scholar]
  58. Singh S. P. (1992). Common bean improvement in the tropics, in Plant Breeding Reviews, ed Janick J. (John Wiley and Sons, Inc.), 199–269 [Google Scholar]
  59. Singh S. P., Gepts P., Debouck D. G. (1991). Races of common bean (Phaseolus vulgaris, Fabaceae). Econ. Bot. 45, 379–396 10.1007/BF02887079 [DOI] [Google Scholar]
  60. Snelling W. M., Chiu R., Schein J. E., Hobbs M., Abbey C. A., Adelson D. L., et al. (2007). A physical map of the bovine genome. Genome Biol. 8:R165 10.1186/gb-2007-8-8-r165 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Sonnante G., Stockton T., Nodari R. O., Velasquez V. L. B., Gepts P. (1994). Evolution of genetic diversity during the domestication of common bean (Phaseolus vulgaris L). Theor. Appl. Genet. 89, 629–635 10.1007/BF00222458 [DOI] [PubMed] [Google Scholar]
  62. Steele K. A., Ogden R., McEwing R., Briggs H., Gorham J. (2008). InDel markers distinguish Basmatis from other fragrant rice varieties. Field Crops Res. 105, 81–87 10.1016/j.fcr.2007.08.001 [DOI] [Google Scholar]
  63. Tanksley S. D., Ganal M. W., Prince J. P., Devicente M. C., Bonierbale M. W., Broun P., et al. (1992). High density molecular linkage maps of the tomato and potato genomes. Genetics 132, 1141–1160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Tanksley S. D., Hewitt J. (1988). Use of molecular markers in breeding for soluble solids content in tomato - a re-examination. Theor. Appl. Genet. 75, 811–823 10.1007/BF00265610 [DOI] [Google Scholar]
  65. Vali U., Brandstrom M., Johansson M., Ellegren H. (2008). Insertion-deletion polymorphisms (indels) as genetic markers in natural populations. BMC Genet. 9:8 10.1186/1471-2156-9-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Vallejos C. E., Sakiyama N. S., Chase C. D. (1992). A molecular marker-based linkage map of Phaseolus Vulgaris L. Genetics 131, 733–740 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Varshney R. K., Close T. J., Singh N. K., Hoisington D. A., Cook D. R. (2009). Orphan legume crops enter the genomics era! Curr. Opin. Plant Biol. 12, 202–210 10.1016/j.pbi.2008.12.004 [DOI] [PubMed] [Google Scholar]
  68. Vasemagi A., Gross R., Palm D., Paaver T., Primmer C. R. (2010). Discovery and application of insertion-deletion (INDEL) polymorphisms for QTL mapping of early life-history traits in Atlantic salmon. BMC Genomics 11:156 10.1186/1471-2164-11-156 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Vera J. C., Wheat C. W., Fescemyer H. W., Frilander M. J., Crawford D. L., Hanski I., et al. (2008). Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol. Ecol. 17, 1636–1647 10.1111/j.1365-294X.2008.03666.x [DOI] [PubMed] [Google Scholar]
  70. Wang C. R., Chang K. C., Grafton K. (1988). Canning quality evaluation of pinto and navy beans. J. Food Sci. 53, 772–776 10.1111/j.1365-2621.1988.tb08953.x [DOI] [Google Scholar]
  71. Wei F., Coe E., Nelson W., Bharti A. K., Engler F., Butler E., et al. (2007). Physical and genetic structure of the maize genome reflects its complex evolutionary history. PLoS Genet. 3:e123 10.1371/journal.pgen.0030123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Werner J. E., Endo T. R., Gill B. S. (1992). Toward a cytogenetically based physical map of the wheat genome. Proc. Natl. Acad. Sci. U.S.A. 89, 11307–11311 10.1073/pnas.89.23.11307 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Xia Z., Tsubokura Y., Hoshi M., Hanawa M., Yano C., Okamura K., et al. (2007). An integrated high-density linkage map of soybean with RFLP, SSR, STS, and AFLP markers using a single F-2 population. DNA Res. 14, 257–269 10.1093/dnares/dsm027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Yang H. A., Tao Y., Zheng Z. Q., Li C. D., Sweetingham M. W., Howieson J. G. (2012). Application of next-generation sequencing for rapid marker development in molecular plant breeding: a case study on anthracnose disease resistance in Lupinus angustifolius L. BMC Genomics 13:318 10.1186/1471-2164-13-318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. You F. M., Huo N. X., Gu Y. Q., Luo M. C., Ma Y. Q., Hane D., et al. (2008). BatchPrimer3: a high throughput web application for PCR and sequencing primer design. BMC Bioinformatics 9:253 10.1186/1471-2105-9-253 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Zerbino D. R., Birney E. (2008). Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829 10.1101/gr.074492.107 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Frontiers in Plant Science are provided here courtesy of Frontiers Media SA

RESOURCES