Abstract
The Arabidopsis (Arabidopsis thaliana) FLOWERING LOCUS T (FT) gene and its orthologs in other plant species (e.g. rice [Oryza sativa] OsFTL2/Hd3a) have an established role in the photoperiodic induction of flowering response. The genomic and phenotypic variations associated with the perennial ryegrass (Lolium perenne) ortholog of FT, designated LpFT3, was assessed in a diverse collection of nine European germplasm populations, which together constituted an association panel of 864 plants. Sequencing and genotyping of a series of amplicons derived from the nine populations, containing the complete exon and intron sequences as well as 5′ and 3′ noncoding sequences of LpFT3, identified a total of seven haplotypes. Genotyping assays designed to detect the genomic variation showed that three haplotypes were present in approximately equal proportions and represented 84% of the total, with a fourth representing a further 11%. Of the three major haplotypes, two were predicted to code for identical protein products and the third contained two amino acid substitutions. Association analysis using either a mixed model with a relationship matrix to correct for population structure and relatedness or structured association with further correction using genomic control indicated significant associations between LpFT3 and variation in flowering time. These associations were corroborated in a validation population segregating for the same major alleles. The most “diagnostic” region of genomic variation was situated 5′ of the coding sequence. Analysis of this region identified that the interhaplotype variation was closely associated with sequence motifs that were apparently conserved in the 5′ region of orthologs of LpFT3 from other plant species. These may represent cis-regulatory elements involved in influencing the expression of this gene.
Perennial ryegrass (Lolium perenne) is the most widely used pasture grass species in temperate areas of the world and is also a major constituent of the amenity grass mixtures used in lawns and sports turfs. Taxonomically, it is classified in the Pooideae subfamily, along with the Triticeae cereals and oats (Avena sativa) and in the same clade as rice (Oryza sativa). For all of these grasses, the control of flowering is not only fundamental to their biology but also of crucial importance to their adaptation and exploitation in an agricultural context. For the cereals, the timing and coordination of flowering is obviously intimately linked with grain production; for forage grasses, the significance is less intuitive, but flowering has a profound effect on resource utilization and allocation within the plant. This has an impact on field performance in terms of overall biomass, quality, and seed production, the major route of ryegrass propagation. Consequently, understanding the genetic control of flowering within the ryegrasses and related species is not only of great interest per se but is also agronomically a very important trait in forage and turf grasses.
A number of quantitative trait loci (QTLs) for flowering time have been identified in ryegrass, some of which seem to be conserved across different studies (Armstead et al., 2008, 2009). These represent targets for gene and marker identification that may have wide applications in ryegrass breeding. The identification of candidate genes in species such as perennial ryegrass, for which there are limited molecular biological tools, can be problematic, but comparative genetic approaches have indicated that key genes from established flowering induction pathways may be involved (Armstead et al., 2005; Jensen et al., 2005; Skøt et al., 2007). One such QTL is located on a region of chromosome 7 (C7) that is syntenic with the region of rice C6, which is associated with the Heading date3 (Hd3) QTL (Monna et al., 2002; Armstead et al., 2004, 2009). This QTL is determined by variation at the underlying Hd3a/OsFTL2 gene, an ortholog of Arabidopsis (Arabidopsis thaliana) FLOWERING LOCUS T (AtFT; Kojima et al., 2002) and of perennial ryegrass FT3 (LpFT3).
The AtFT gene codes for a phosphoethanolamine-binding protein (PEBP), a family of proteins that are known to be of key importance in the floral induction process in plants (Yamaguchi et al., 2005; Ahn et al., 2006; Yan et al., 2006; Jaeger and Wigge, 2007; Tamaki et al., 2007; Igasaki et al., 2008; Komiya et al., 2008; Turck et al., 2008). In Arabidopsis, AtFT is believed to function as an integrator of different flowering induction pathways (Komeda, 2004; Wu et al., 2008; Michaels, 2009), and the effects of modulating FT expression on the flowering phenotype have been extensively studied (Komeda, 2004; Moon et al., 2005; Wigge et al., 2005; Hisamatsu and King, 2008; King et al., 2008; Turck et al., 2008). There is also strong evidence that the FT protein itself can act as a transmissible “florigen,” the signal molecule that moves from the leaf to the shoot apex in floral meristem induction (Corbesier et al., 2007; Tamaki et al., 2007). The direct experimental manipulation of FT orthologs has also demonstrated the potentially pivotal role of FT in monocot systems (Hayama et al., 2003; Hayama and Coupland, 2004; Yan et al., 2006) and has been associated with quantitative variation in flowering time in a number of monocot genetic and physiological studies (King et al., 2006; Bonnin et al., 2008; Hemming et al., 2008; Li et al., 2008; Kikuchi et al., 2009; Sasani et al., 2009; Stracke et al., 2009), including ryegrass (Armstead et al., 2004, 2008).
Linkage disequilibrium (LD)-based association genetics provide means for statistically validating the coincidence of allelic and phenotypic variations. These approaches have been widely used in animal systems and in human genetics, but more recently, a number of suitable populations have been developed for plant species (Skøt et al., 2005, 2007; Nordborg and Weigel, 2008; Stich et al., 2008; van Berloo et al., 2008; McMullen et al., 2009; Smith et al., 2009; Stracke et al., 2009; Waugh et al., 2009; Yao et al., 2009; Zhang et al., 2009). Skøt et al. (2007) described the development of a perennial ryegrass population consisting of nine populations of diverse European origin suitable for candidate gene evaluation of flowering phenotype through association analysis (the LD family). This study confirmed the role of LpHd1 (an ortholog of rice OsHd1 and Arabidopsis CONSTANS) in affecting time of flowering. In this work, we describe the range and molecular nature of allelic variation for LpFT3 in the same population and demonstrate that there are significant associations between this variation and flowering time. Furthermore, these are confirmed in a validation population segregating for these polymorphisms.
RESULTS
Allelic Variation in LpFT3
A total of 3,668 bp containing the target gene LpFT3 was sequenced from the perennial ryegrass bacterial artificial chromosome (BAC) clone (accession no. FN993916), which was used to design internal primers. Initial sequencing of a nested 1,500-bp segment, containing the complete LpFT3 coding sequence, from 288 clones isolated from a representative sample of the association mapping panel (LD family) revealed the presence of three major haplotypes representing 36% (A) 30% (B), and 25% (C) of the total. Subsequent genotyping and sequencing identified four further haplotypes present at lower frequencies (D–G; Supplemental Fig. S1). From these sequences, the four genotyping primer pairs FT-LD1 to -4 were designed, which spanned insertion/deletion (indel)-type variation that distinguished the haplotypes (Supplemental Fig. S2).
The protein predicted from haplotypes B, C, D, and F consisted of 177 amino acids and was predicted to have three exons, identical to GenBank accession ABC33722 (LpFT3, perennial ryegrass). The protein derived from haplotypes A and G could be distinguished from those derived from B, C, D, and F by a Val/Ile substitution at position 23 and an Ala/Pro substitution at position 156. The haplotype E protein could be distinguished from the B-type protein just by the Val substitution at position 23 (Supplemental Fig. S3).
Genotyping of the Nine Perennial Ryegrass Populations
Assaying of the nine populations constituting the LD family with markers FT-LD1 to FT-LD4 allowed for the discrimination of six alleles with FT-LD1, three with FT-LD2, and four with FT-LD3/4 (Supplemental Table S1). A further haplotype, G, was identified with FT-LD4 (identical to haplotype A, except for a single base deletion at position 1,635). However, except for the FT-LD4 E allele confirmation, none of the FT-LD4 data were included, since FT-LD4 could not discriminate all of the other haplotypes. Overall, approximately 84% of the population consisted of combinations of the A, B, and C haplotypes, with a further 11% contributed by the E haplotype. However, there was considerable variation between the nine populations in the frequency of different genotype classes, varying from two in Ba11304 to 15 in Ba10158 and 16 in Ba10113 (Supplemental Tables S2 and S3).
Genotype/Phenotype Associations
Flowering time of the LD family showed a bimodal distribution in both years, and there were generally earlier flowering times in 2005 compared with 2004 (Fig. 1). To what extent this was due to yearly environmental variation or to the difference in cultivation conditions is difficult to ascertain. A more detailed statistical analysis of the associations between populations, years, and flowering time has been presented (Skøt et al., 2007). The variance component estimate was 73% for the effect of subpopulation within the LD family. Thus, when considering the three major alleles, a, b, and c, the general observation that genotypes with a and/or b alleles tended to be later flowering than genotypes with c alleles (Supplemental Tables S4 and S5; Supplemental Figs. S4 and S5) is confounded by the considerable population structure within the LD family (Supplemental Table S6; Supplemental Fig. S6).
Figure 1.
Flowering times for the LD family in 2004 and 2005.
To establish the most suitable analytical model for identifying significant population-wide genotype/phenotype associations in the presence of such population structure and relatedness, comparisons were made between a simple ANOVA, a general linear model incorporating structured associations (SA), SA plus genomic control (GC), and a mixed linear model incorporating a relationship matrix as a random effect. The expected P values plotted against observed genome-wide “neutral” amplified fragment length polymorphism (AFLP) markers are shown in Supplemental Figure S7. While the simple and SA models gave inflated P values, both the SA+GC model and the mixed linear model indicated good correction for population structure and relatedness. The relationship matrix corrected for both structure between populations and individual relatedness, while GC corrected for relatedness not accounted for by the SA model alone. The P values for the three FT-LD markers increased substantially from the simple model to SA+GC but were still remarkably low for the mixed model (Table I). Part of the explanation may be found in the composition of the relationship matrix. Experimenting with a matrix diagonal lower than 2 gave no restricted maximum likelihood (REML) values, perhaps indicating a nonpositive semidefinite matrix, which is a requirement (Kang et al., 2008). It was possible to reduce the diagonal to 1 + FST (=1.1184) using a kinship matrix and still get a REML output, but it had little effect on the P values. Changing the matrix diagonals altered the balance between Vg and Ve values. However, using the sum of Vg and Ve in the calculation of the percentage of the phenotypic variance accounted for by the marker made these values relatively insensitive to matrix changes. As for the P values, the genetic effect was estimated based on a relationship matrix with 2 as the diagonal, which was slightly more conservative than other estimates.
Table I. Association analysis of flowering times in 2004 and 2005 with the major alleles at the three loci in the LpFT3 gene.
Probabilities for four models are shown: simple, ANOVA without any correction for population structure; SA, structured association; SA+GC, structured association and genomic control; EMMA MLM, mixed linear model as executed with the program EMMA with relationship matrix included as a random effect. Percentage marker effect estimates the percentage of phenotypic variance accounted for by the marker and was obtained as described in “Materials and Methods.” NS, Not significant.
| Locus/Allele | Year |
P |
Percentage Marker Effect | |||
| Simple | SA | SA+GC | EMMA MLM | |||
| FT-LD1a | 2004 | <10−17 | 4 × 10−12 | 3 × 10−10 | 4 × 10−29 | 12.4 |
| FT-LD1b | 2004 | NS | NS | NS | 8 × 10−3 | 0.7 |
| FT-LD1c | 2004 | <10−17 | 6 × 10−12 | 5 × 10−10 | 3 × 10−30 | 13.4 |
| FT-LD1e | 2004 | 5 × 10−8 | NS | NS | 1 × 10−3 | 1.0 |
| FT-LD2a | 2004 | <10−17 | 1 × 10−8 | 2 × 10−7 | 2 × 10−21 | 9.0 |
| FT-LD2b | 2004 | 6 × 10−15 | 1 × 10−2 | 2 × 10−2 | 7 × 10−5 | 1.6 |
| FT-LD2e | 2004 | 3 × 10−11 | 7 × 10−3 | 1 × 10−2 | 3 × 10−5 | 1.8 |
| FT-LD3a | 2004 | <10−17 | 4 × 10–9 | 1 × 10−7 | 5 × 10−20 | 8.5 |
| FT-LD3b | 2004 | 3 × 10−15 | 2 × 10−3 | 6 × 10−3 | 1 × 10−5 | 1.9 |
| FT-LD3e | 2004 | 3 × 10−8 | 4 × 10−2 | NS | 1 × 10−3 | 1.0 |
| FT-LD1a | 2005 | <10−17 | < 10−17 | 2 × 10−10 | 8 × 10−44 | 20.2 |
| FT-LD1b | 2005 | 3 × 10−5 | 2 × 10−3 | NS | 1 × 10−7 | 3.0 |
| FT-LD1c | 2005 | <10−17 | <10−17 | <10−17 | 2 × 10−78 | 34.0 |
| FT-LD1e | 2005 | NS | NS | NS | NS | 0.2 |
| FT-LD2a | 2005 | <10−17 | <10−17 | 4 × 10−8 | 1 × 10−34 | 16.1 |
| FT-LD2b | 2005 | <10−17 | 4 × 10−11 | 4 × 10−5 | 1 × 10−10 | 5.0 |
| FT-LD2e | 2005 | 6 × 10−3 | 2 × 10−3 | 5 × 10−2 | 4 × 10−2 | 0.4 |
| FT-LD3a | 2005 | <10−17 | <10−17 | 2 × 10−8 | 5 × 1034 | 15.7 |
| FT-LD3b | 2005 | <10−17 | 2 × 10−11 | 3 × 10−5 | 6 × 10−11 | 4.9 |
| FT-LD3e | 2005 | NS | 1 × 10−3 | 4 × 10−2 | 5 × 10−2 | 0.4 |
Whichever model was used, the FT-LD1 a and c alleles both had highly significant effects. At the FT-LD2 locus, all three alleles had significant effects, as did the a and b alleles at FT-LD3. Four alleles (FT-LD1 a and c, FT_LD2 a, and FT_LD3 a) accounted for most of the phenotypic variation (Table I). The lack of effect of the FT-LD1 b allele can partly be explained by its recessive nature.
These LD family-wide significant results were corroborated using within-population ANOVA, which was not influenced by population structure to the same degree (Supplemental Fig. S8). This indicated a range of significant LpFT3 genotype/phenotype associations, similar to those identified using the mixed model within-relationship matrices (Supplemental Table S7), and served as a further indication of a role for LpFT3 variation in influencing the flowering time phenotype.
The second experiment was carried out to confirm the effect of genotypic variation associated with the FT-LD1 marker on flowering time without the confounding effect of population structure present in the LD family. We analyzed a population of 563 genotypes derived from a polycross (see “Materials and Methods”) segregating for the a, b, c, and e alleles, termed the validation population. This population contained all the expected 10 genotype classes, although progeny containing the b or e allele were relatively underrepresented (Supplemental Table S8A). A comparison of the 10 resultant genotype classes in relation to flowering time (ANOVA) indicated highly significant effects of allelic substitution (Supplemental Table S8B). Variance component analysis showed that genotypes accounted for 13.8% of the total variance. It also indicated the apparent dominance of the c allele, as the four genotypic classes containing the c allele were the earliest to flower (Fig. 2). The overall effects of allelic substitution on flowering time were similar to those observed in the LD family (compare Fig. 2 with Supplemental Fig. S9). Two sets of genotypes, bb, be, and ee on the one hand and cc, ca, and aa on the other, are segregants from the two different pairs of grandparents. Population structure can definitely be discounted within these, as they are progeny from full sib matings. The bb genotypes had significantly later flowering time than ee and be genotypes (Fig. 2). Similarly, aa genotypes had significantly later flowering time than cc and ca genotypes (Fig. 2). These findings confirm the results of the association analysis in the LD family.
Figure 2.
Flowering times of the 10 genotype groups in the validation population. The error bars represent the lsd (P ≤ 0.05) between genotype group mean values.
DISCUSSION
The outbreeding, heterozygous natures of perennial ryegrass and other forage grasses of importance in temperate agriculture and the broad, but still relatively well characterized, gene pool lend themselves to association genetics approaches for linking genotype with phenotype. While working exclusively with natural variation limits the degree to which the fine control of phenotype can be analyzed in terms of laboratory-based molecular biology, there is the potential for a high degree of biological and agricultural relevance to the findings. Previous work in rice has established the role of FT as a key gene in the induction to flowering, with a similar, although not completely analogous, role to that observed for AtFT in Arabidopsis (Hayama and Coupland, 2004). Comparative genetics and genomics have also indicated that FT is a candidate gene underlying both flowering and vernalization QTLs in a number of Lolium and Hordeum species studies as well as showing an expression profile compatible with its role as a flowering promoter (King et al., 2006; Armstead et al., 2008; Hemming et al., 2008; Stracke et al., 2009). Fine genetic mapping in Arabidopsis has also indicated that variation at FT (specifically, its promoter region) underlies flowering QTLs in natural populations (Schwartz et al., 2009). In this study, a direct link between genotype and flowering phenotype has been established for LpFT3 across different perennial ryegrass populations using association genetics and a catalogue of the underlying allelic variation assembled. This link between LpFT3 genotypic variation and annual flowering time in perennial ryegrass is not, in itself, unexpected. What is of greater interest, however, is that variation was identified both in the predicted protein sequence and in noncoding regions, most diagnostically within the region 5′ of the coding sequence and so possibly in candidate cis-regulatory regions.
Predicted Protein Variation within the Haplotypes
Comparative analysis of FT family and related proteins indicates a high degree of sequence conservation (Faure et al., 2007; Danilevskaya et al., 2008), and the FT proteins predicted for the perennial ryegrass A and B haplotypes are consistent with these. Secondary structure predictions (data not shown) do not indicate a marked effect of the amino acid substitutions that distinguish the A and B haplotype proteins, and neither of these substitutions is associated directly with regions of FT with proposed or defined functions (Hanzawa et al., 2005; Ahn et al., 2006; Faure et al., 2007; Danilevskaya et al., 2008). The amino acid residue at the equivalent to position 23 in LpFT3 is relatively unconserved in FT family proteins (Faure et al., 2007); by contrast, an alignment of 248 FT, FT-like, and other closely related plant PEBP proteins (TERMINAL FLOWER1 [TFL1], TFL1-like, MOTHER OF FT [MFT], MFT-like, and maize [Zea mays] CENTRORADIALIS [ZCN]) indicates that the Pro at the equivalent of position 156 (perennial ryegrass B haplotype protein) is a highly conserved residue and the Ala substitution is unique to perennial ryegrass haplotype A (Supplemental Fig. S10). In this study, allele h was identified as a haplotype A/B recombinant, with the recombination occurring at or near the beginning of the third exon. Therefore, this haplotype would be predicted to code for a protein derived from A-type exons 1 and 2 and B-type exon 3. While only a limited set of comparisons is available for the h allele, the results from this study suggest that the A haplotype-associated phenotype was conferred by variation 5′ of the A/B recombination in allele h and not directly by the Ala/Pro substitution. However, a contrary observation is that in exon 3 of haplotype E, six of the seven single-nucleotide polymorphisms (SNPs) that could distinguish the A from the B/C haplotypes were A-type SNPs with one B/C-type SNP. The latter conferred the retention of (reversion to?) Pro in E haplotype proteins (Supplemental Fig. S1). Either this latter base change occurred only in the A haplotype, subsequent to the divergence of the A and E haplotypes, or there has been reversion to the “wild-type” protein in the E haplotypes. If the latter is true, there may be some functional significance of the Ala/Pro substitution, at least in some circumstances.
FT Genotype and Flowering Phenotype
While strong population structure existed within the LD family, the association analysis implemented was able to show that different LpFT3 alleles and genotype classes were significantly associated with the flowering time variation. Additionally, while it is still relatively rare that such associations are confirmed in other populations, we were able to do this in a validation population in which population structure could be discounted. This is particularly useful in that QTLs have been evaluated for flowering time in a number of different perennial ryegrass crosses, and markers based directly on LpFT3 have been mapped in at least two of these mapping populations (Armstead et al., 2008). Thus, the association we have established between LpFT3 genotype and flowering phenotype in this study allows us to reevaluate this previous work. The WSC F2 mapping population was a cross between early- and late-flowering genotypes that segregated markedly for heading date. The major and highly significant QTL was associated with the LpFT3 region. Subsequent sequencing of LpFT3 haplotypes from this population identified FT-LD1 bb, bc, and cc segregating genotype classes (Armstead et al., 2004, 2008), which were associated with average flowering times of 61.1 d (bb), 42.7 d (bc), and 39.2 d (cc) after April 1. This is in line with the observation from the LD family that the FT-LD1 c allele is generally associated with earlier flowering times than the FT-LD1 b allele and is consistent with the results obtained for the validation family (Fig. 2). In the F2 mapping family, there is an overall dominant effect of the c allele over the b allele. This relationship is also evident in the validation population but less clear in the LD family: in Ba10284, the bc average values are closer to the bb rather than the cc average values; in Ba10278, the heterozygote is approximately midrange. In two further mapping families, the ILGI (Armstead et al., 2008) and VRN (Jensen et al., 2005; Studer et al., 2008) families, b and e alleles were segregating as FT-LD1 be and ee genotype classes. QTLs for flowering time were present at or near the LpFT3 genomic location on C7 in both families, but in neither family was the associated effect large, which is not inconsistent with the comparison of be and ee genotype classes in the validation family (Fig. 2). In the LD family, the be/ee genotype comparison was only present in the Ba10158 population, in which no LpFT3 genotype comparisons were significant, so it is not possible to draw any parallels.
Comparative Genomic Analysis
The sequence differences in the 5′ region were particularly interesting because of their potential to represent variation in cis-regulatory elements that can influence gene expression. Comparison of the 5′ regions of the perennial ryegrass haplotypes indicates that they are, in general, strongly conserved. When the 5′ regions from Lolium are compared with the 500-bp 5′ region of rice OsFT (LOC_Os06g06320.1, 12006.m05358|1000 bp_UpstreamSeq|2938157..2939159; http://rice.plantbiology.msu.edu/index.shtml), three particular motifs are conserved across the two species in terms of relative positions and distances from the ATG initiation codon (motifs 1–3; Fig. 3). Similar motifs are also present in relatively conserved positions in the Triticeae and Brachypodium distachyon FT 5′ regions, although the homology extends beyond the motifs between these species and perennial ryegrass due to their closer evolutionary distances. Motifs 2 and 3 are also present in similar positions in maize ZCN15 (GenBank accession no. EU241906), a PEBP protein with strong homology to FT (Supplemental Fig. S3), and motif 1 is also possibly represented in ZCN15, although reduced in size and at approximately 410 bp upstream of the ATG as opposed to 179, 187, 238, and 239 bp upstream of the ATG for wheat (Triticum aestivum), perennial ryegrass, rice, and B. distachyon, respectively. Searches in PLACE and Consite confirmed that the sequence of motif 2 was consistent with that of a TATA box but also with the binding consensus sequence (plus strand) for the MADS box transcription factor AGL3 from Arabidopsis. Similar searches did not identify any obvious plant-derived cis-regulatory sequence models for motif 3, although the conserved sequence CTAGACATGG in the perennial ryegrass haplotypes, which overlaps motif 1 (GACATGG), is also consistent with the binding consensus sequence (minus strand) for AGL3 (Consite prediction). Clearly, there is a high probability of obtaining matches to short transcription factor-binding site target sequences by chance, and biological relevance is not indicated from this evidence alone. However, it is interesting that both motifs 1 and 3 are juxtaposed to regions within the perennial ryegrass 5′ sequences that show indel-type variation, which could affect binding affinities (Fig. 3; Supplemental Fig. S1), with motif 1 positioned just upstream of the only region that distinguishes the A, B, and C haplotypes.
Figure 3.
Representation of approximately 500 bp 5′ of the ATG start codon of FT3 from perennial ryegrass (haplotypes A–F), F. pratensis (Fp), Italian ryegrass (Lolium multiflorum; Lm), darnel (Lolium temulentum; Lt), wheat (Ta), B. distachyon (Bd), rice (Os), and Z. mays (Zm). For perennial ryegrass A to F, Fp, Lm, and Lt, thick black horizontal bars represent aligned sequences, where breaks represent gaps in the sequence alignments. Vertical dotted lines illustrate the positions of the conserved motifs 1, 2, and 3. For Ta, Bd, Os, and Zm, thin black horizontal bars represent the 500 bp 5′ of the ATG start codon (sequence nonaligned), and thick black horizontal bars illustrate the relative positions of the conserved motifs 1, 2, and 3. The sequence alignments of motifs 1, 2, and 3 are given below.
In animals and yeast, the transcriptional control of PEBPs has been associated with CCAAT transcription binding domains, targeted by Nuclear Factor Y (NFY)/Heme Activator Protein (HAP) protein complexes, although in plants, the identification of conserved CCAAT domains in gene promoters is less clear (Siefers et al., 2009). A model for the regulation of AtFT in Arabidopsis involves the interaction of the 5′ promoter region with a trimeric complex of the proteins CONSTANS, NFY-B (also known as HAP3), and NFY-C (HAP5; Ben-Naim et al., 2006; Kumimoto et al., 2008; Turck et al., 2008; Siefers et al., 2009). Kumimoto et al. (2008) demonstrated that a trimeric complex of Arabidopsis-derived NFY-B proteins (closely related to yeast HAP3) in combination with yeast HAP2 and HAP5 could significantly bind to a 44-bp sequence starting 890 bp upstream of the ATG initiation codon in the Arabidopsis FT gene. This 44-bp sequence was identified as it contained the closest candidate CCAAT sequence (minus strand) to the ATG (Fig. 4). Interestingly, this 44-bp sequence also contains an overlapping GGACAT on the plus strand, which is the core conserved sequence in motif 1 of the monocot FT upstream sequences illustrated in Figure 3. Kumimoto et al. (2008) also reported that introducing a mutation into the 44-bp sequence (minus strand, CCAAT to AAAAT; Fig. 4) reduced the binding of the HAP/NFY-B complexes tested; this introduced mutation still leaves the GGACAT motif intact and would suggest that the GGACAT motif is not an effective binding domain, at least in this system. However, it is also possible that (1) sequence variation at the borders of the core motif may influence binding activity; (2) the use of one plant NFY-B protein and two yeast-derived HAP proteins (known to be specific for CCAAT) in the trimeric complex introduces a “bias” toward CCAAT recognition; and (3) there has been divergence of sequence and/or specificity between dicots and monocots. If the first is the case, then the proximity of allelic, indel-type variation next to the GGACAT motif might be the mechanism that confers the different phenotypes associated with the perennial ryegrass A, B, and C LpFT3 haplotypes. In a related flowering induction study in perennial ryegrass, Skøt et al. (2007) noted the association between flowering time and the LpHD1 (CONSTANS) gene that was also clearest in the 5′ untranslated region of that gene. In summary, while it is far from demonstrated that motif 1, as identified in this study, is significant in the control of gene expression, the close association of this conserved motif with a diagnostically variable sequence region in the LpFT3 5′ region deserves further examination.
Figure 4.
The 44-bp sequence used by Kumimoto et al. (2008) to study the binding affinity of a NFY-B/HAP2/HAP5 protein complex to the 5′ region of the Arabidopsis FT gene. The first row illustrates the position of the CCAAT domain (shaded black; minus strand), the second row the GGACAT motif (shaded black), and the third row the mutation introduced into the CCAAT domain (shaded gray).
Recent work in animal systems suggests that, due to the theoretical and apparent experimental multiple redundancy of potential transcription factor-binding sites throughout eukaryotic genomes, transcription factor-mediated control of gene expression may be modulated more by quantitative binding variation than by direct, discriminatory sequence specificity (MacArthur et al., 2009; Wunderlich and Mirny, 2009). The implication of this is that a hierarchical assessment of potential DNA-binding sites will depend on developing whole-genome, statistically validated assays for such binding sites. Such analyses are beyond the tools and resources available for perennial ryegrass at this point. However, as our understanding of the control and function of cis-regulatory elements increases and the resources for perennial ryegrass and the related model B. distachyon are developed, it will be interesting to see if it can be established whether the conserved motifs identified in this study play a role in gene regulation, possibly analogous to that established for the CCAAT-binding domains known to be involved in PEBP regulation in animal and yeast biology.
CONCLUSION
In conclusion, this work has demonstrated an association between allelic variation in the LpFT3 gene and flowering time in perennial ryegrass. Analyses of an association mapping panel, a genetic mapping family, and a validation population all indicate that haplotype C associates with early flowering and that the A and B haplotypes associate with later flowering. The additive and dominance characteristics of the different haplotypes are less certain, probably influenced by other genes/QTLs associated with flowering time. This work also illustrates the complementary properties of association and mapping populations. In the former, there are more alleles and haplotypes, but when they are very rare, their effect is difficult to assess because of the lack of statistical power. In mapping populations, fewer alleles segregate, but their effect can be assessed more accurately, as was the case for the e- and b-containing genotypes in the validation population.
Molecular characterization of LpFT3 in this cross-section of European perennial ryegrass germplasm identified three predominant haplotypes that code for two predicted different LpFT3 proteins, distinguished by two amino acid substitutions. While the protein sequence differences may influence flowering phenotype, the clearest association between phenotype and genotype is in the 5′ noncoding region, where there is indel variation in close proximity to a sequence motif that is conserved across perennial ryegrass, Festuca pratensis, wheat, rice, B. distachyon, and, possibly, maize and Arabidopsis. Thus, this region may influence gene expression (and so flowering time) by modulating transcription binding affinities, and there may be parallels with the control of other PEBPs via CCAAT-binding domains.
MATERIALS AND METHODS
Nomenclature
FT Gene/Protein Names
There is no consistent naming scheme for FT-like gene families based upon conserved syntenic relationships in monocots. In this study, we have designated the clone under study as LpFT3, in line with the name of an identical protein sequence derived from perennial ryegrass (Lolium perenne) already submitted to public databases (ABC33722). However, this is likely to be orthologous to barley (Hordeum vulgare) HvFT1 and rice (Oryza sativa) OsFTL2 (Faure et al., 2007; Kikuchi et al., 2009) as well as to rice Hd3a (Kojima et al., 2002) and wheat (Triticum aestivum) and barley VRN3/-H3 genes (Yan et al., 2006).
Alleles and Haplotypes
Amplification products of primer pair LpFT.F and LpFT-LD4.R containing LpFT3 are referred to as haplotypes and capitalized (e.g. haplotypes A and B). Size-evaluated amplification products of markers LpFT.1-4 are referred to as alleles and italicized in lowercase (e.g. alleles a and b). Supplemental Table S1 shows the relationship between haplotypes and alleles.
Identification of the Perennial Ryegrass BACs Containing LpFT3
The perennial ryegrass (approximately five-genome coverage) BAC library has been described previously (Farrar et al., 2007; Langdon et al., 2009). Identification of LpFT3 from the perennial ryegrass BAC library was achieved by PCR screening with the primers Hd3a.1f and Hd3a.3r (for primer sequences and thermal cycling profiles, see Supplemental Table S9). The identity of LpFT3 was confirmed by direct sequencing of the Hd3a.1f/3r-positive BACs with coding sequence and protein prediction generated using FGENESH+ (http://www.softberry.com/berry.phtml) and by direct comparison with other monocot FT sequences obtained from GenBank (http://www.ncbi.nlm.nih.gov/genbank/GenbankSearch.html). The FpFT sequence was similarly identified in a Festuca pratensis BAC library (Donnison et al., 2005).
Derivation of Perennial Ryegrass Populations for Association Analysis and Validation
Association Analysis
The perennial ryegrass LD family (association mapping panel) and the phenotypic evaluation for flowering time have been described in detail (Skøt et al., 2007). Briefly, the LD family consisted of nine populations, of which seven were natural or seminatural populations of European origin and two were varieties. Each population was represented by 96 genotypes. The entire population (9 × 96 genotypes) was evaluated for flowering time over two consecutive years (2004 and 2005), the first year as individual genotypes grown in 6-inch-diameter pots and the second year as two spaced-plant replicates in the field. DNA was extracted from each of the 864 genotypes as described previously (Skøt et al., 2005) and stored at −20°C.
Validation Population
The 16 parental genotypes for this population were obtained from progeny of two crosses. The first was a genotype of the late-flowering variety Bargold with a genotype of Ba10732 (genotype no. 22) from the LD family. The genotypes at the LD-FT1 locus were (aa) × (ce). The second cross was a genotype of the late-flowering variety Hugo with another genotype of population Ba10732 (genotype no. 48) from the LD family. The LD-FT1 genotypes were (bb) × (ae). The two varieties acted as mother plants, with the Ba10732 genotypes as pollen donors. Progeny from these two crosses were genotyped with the LD-FT1 marker to identify ac and be genotypes. These two genotypes were used in a 1:1 mixture as parents in a polycross (i.e. one in which random mating is allowed in an isolation greenhouse). The self-incompatibility in perennial ryegrass is almost absolute, so the progeny can be assumed to be derived from crossing between the parental genotypes. The 563 progeny were obtained from 36 randomly chosen mother plants. This mating design ensured a panmictic population in which we have assumed no population structure to be present.
LpFT3 Allele Amplification
The LpFT3 alleles from each genotype within the LD family were initially amplified in an approximately 1,500-bp fragment using primers Hd3a.8F and FT-LD.3R with HiFiTaq DNA polymerase (Roche) with the supplied PCR buffer. Equal volumes of the amplification reactions from each of the 96 genotypes within each of the nine populations were pooled, and the products were cloned into pCR4-TOPO (Invitrogen). Approximately 50 colonies from each of the populations were grown as minipreps, and the insert was sequenced. The resulting sequences were compared after alignment to identify the allelic variants for LpFT3 present in the screen. In order to avoid misscoring sequencing errors as low-level SNP/indel variations, sequence variants present in only one LpFT3 cloning product were excluded from the evaluation.
After sequencing of the LpFT3-containing perennial ryegrass BAC clone, a further primer pair, LpFT.F and LpFT.4R, was developed that amplified the LpFT3 gene as an approximately 1,900-bp fragment. This primer pair was also used in the amplification of FT from single genotypes of Italian ryegrass (Lolium multiflorum) and darnel (Lolium temulentum).
LD Marker Development and Analysis
Four primer pairs (FT-LD.1F/1R, FT-LD.2F/2R, FT-LD.3F/3R, and FT-LD.4F/4R; Supplemental Table S9) were designed that spanned size polymorphisms reflecting the allelic variation within the LpFT3 haplotypes present in the LD family (for haplotype sequences and relative primer and coding sequence positions, see Supplemental Figs. S1 and S2). The reverse primer from each pair was fluorescently labeled for the detection of size polymorphisms on an ABI3130XL Genetic Analyzer (Applied Biosystems), and each genotype was scored for LpFT3 allelic complement. Subsequently, the correspondence between predicted genotype and DNA sequence was checked in 20 individuals from across the nine populations using LpFT.F and LpFT.4.R.
The validation population had been genotypically assessed previously using primer pair Hd3a(LD)F/R (Armstead et al., 2009), which gave the same allelic discrimination as primer pair FT-LD.1F/1R, although with the reverse primer anchored in the 5′ untranslated region rather than in the first exon. This equivalence was checked by regenotyping a subset of the validation population with FT-LD.1F/1R.
Association Analysis and ANOVA
Statistical analysis of the association mapping experiment was carried out using a mixed linear model. We implemented this with the efficient mixed model association (EMMA) program in R (Kang et al., 2008). The association test was performed via a t test after estimating variance components using REML. Due to the multiallelic nature of the markers used, genotypes containing a particular allele were scored dominantly as a binary marker. For example, the FT-LD1 a marker was scored as 1 if the genotype was either homozygous or heterozygous for the a allele. The genetic relatedness was incorporated as a random effect using the relationship coefficient obtained from the software program SPAGeDi version 1.3 (Hardy and Vekemans, 2002) with the genome-wide AFLP data set described previously for this family as input (Skøt et al., 2007). One way ANOVA was carried out using GENSTAT 11.1 (www.vsni.co.uk). Within-population significance was evaluated using lsd (P ≤ 0.05). For population Ba11304, which contained only two genotypic groups, within-population significance was evaluated using a t test. Within-population associations were also carried out using REML with relationship matrices to account for any remaining relatedness within populations.
The proportion of the phenotypic variance accounted for by an individual marker was calculated as 1 – (Vg + Ve (with marker))/(Vg + Ve (without marker)). The variances for this were obtained using the REMLE option in EMMA, by either incorporating markers in the model or not.
The simple ANOVA, SA, and GC+SA were performed in R. The GC method (Devlin and Roeder, 1999) also uses genome-wide markers but has been mostly used in human genetics. As shown here, it is remarkably good at accounting for relatedness existing after correcting for the gross effect of population structure with SA. We used the genomic AFLP markers described above to obtain test statistics. This was compared with the expected value of 1 for a 1-degree of freedom χ2 test or F test, in our case with 1 numerator degree of freedom. An observed average test statistic above 1 is an indication of population structure. The test statistic of the candidate markers after GC represents the observed test statistic divided by the average test statistic of the genome-wide markers.
In Silico DNA Sequence Analyses
DNA and protein sequences were aligned using ClustalX 2.0.10 (Larkin et al., 2007) and Macaw version 2.0.5 (Schuler et al., 1991) with manual adjustment in GeneDoc (Nicholas and Nicholas, 1997).
Primary protein structure prediction was implemented using FGENESH (http://linux1.softberry.com/berry.phtml). Predicted secondary structures for the proteins derived from the perennial ryegrass A and B haplotypes were obtained using PredictProtein (Rost et al., 2004) and Jpred (Cole et al., 2008).
Similarities of potential 5′ cis-regulatory elements in LpFT3 and previously identified motifs were evaluated using PLACE (http://www.dna.affrc.go.jp/PLACE/signalscan.html) and Consite (http://asp.ii.uib.no:8090/cgi-bin/CONSITE/consite/).
Accession numbers for nucleotide sequences (Supplemental Sequences S1) are as follows: F. pratensis FpFT3 (BAC), FN993915; perennial ryegrass LpFT3 (BAC), FN993916; LpFT3 haplotype A, FN993917; LpFT3 haplotype B, FN993918; LpFT3 haplotype C, FN993919; LpFT3 haplotype D, FN993920; LpFT3 haplotype E, FN993921; LpFT3 haplotype F, FN993922; LpFT3 haplotype G, FN993923; Italian ryegrass LmFT3, FN993924; darnel LtFT3, FN993925; VRNA family LpFT3 haplotype E, FN993926; LpFT3 haplotype E/B, FN993927; LpFT3 haplotype B, FN993928.
Supplemental Data
The following materials are available in the online version of this article.
Supplemental Figure S1. DNA sequence alignments for LpFT3 haplotypes A to G.
Supplemental Figure S2. FT-LD marker distribution within the LpFT3 amplicons.
Supplemental Figure S3. Comparison of FT protein sequences.
Supplemental Figure S4. FT-LD1 genotype frequencies in relation to flowering time.
Supplemental Figure S5. FT-LD1 allele frequencies in relation to flowering time.
Supplemental Figure S6. Distribution of flowering times across the nine populations.
Supplemental Figure S7. Comparison of the cumulative probabilities with five different models applied for population structure correction.
Supplemental Figure S8. ANOVA of within-family FT-LD1 to -4 genotype/phenotype comparisons.
Supplemental Figure S9. The phenotypic effect of within-population FT-LD-1 allele substitution.
Supplemental Figure S10. Alignment of a segment of 248 LpFT3-related PEBP proteins.
Supplemental Figure S11. The composition of the nine populations that constitute the LD family in terms of FT-LD1 genotype and allele frequency.
Supplemental Table S1. FT-LD1 to -4 allele sizes in relation to haplotype.
Supplemental Table S2. FT-LD1 to -3 genotype frequencies in relation to population.
Supplemental Table S3. FT-LD1 to -3 allele frequencies in relation to population.
Supplemental Table S4. FT-LD1 genotype frequencies in relation to flowering time.
Supplemental Table S5. FT-LD1 to -3 allele frequencies in relation to flowering time.
Supplemental Table S6. Distribution of flowering times across the nine populations.
Supplemental Table S7. Mixed model analysis of the relationship between FT-LD1, -2, and -3 genotypes and flowering time within each of the nine populations.
Supplemental Table S8. Genotypic segregation and association with flowering time in the validation population.
Supplemental Table S9. Primer sequences and PCR conditions.
Supplemental Sequences S1. DNA sequences of genomic regions containing Fp/LpFT3 from the F. pratensis and perennial ryegrass BACs, amplification products of perennial ryegrass haplotypes A to H from the LD family, and amplification products from darnel, Italian ryegrass, and the perennial ryegrass VRNA mapping family haplotypes.
Supplementary Material
Acknowledgments
We thank Dr. Ian Mackay (National Institute of Agricultural Botany) for his advice concerning the statistical analysis.
References
- Ahn JH, Miller D, Winter VJ, Banfield MJ, Lee JH, Yoo SY, Henz SR, Brady RL, Weigel D. (2006) A divergent external loop confers antagonistic activity on floral regulators FT and TFL1. EMBO J 25: 605–614 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Armstead I, Yang BC, Barth S, Turner L, Skot L, Marshall A, Humphreys M, King I, Thorogood D. (2009) Understanding the genetic basis of flowering and fertility in the ryegrasses (Lolium spp.). In Yamada T, Spangenburg G, eds, Molecular Breeding of Forage and Turf. Springer Science+Business Media, New York, pp 185–192 [Google Scholar]
- Armstead IP, Skøt L, Turner LB, Skøt K, Donnison IS, Humphreys MO, King IP. (2005) Identification of perennial ryegrass (Lolium perenne (L.)) and meadow fescue (Festuca pratensis (Huds.)) candidate orthologous sequences to the rice Hd1(Se1) and barley HvCO1 CONSTANS-like genes through comparative mapping and microsynteny. New Phytol 167: 239–247 [DOI] [PubMed] [Google Scholar]
- Armstead IP, Turner LB, Farrell M, Skøt L, Gomez P, Montoya T, Donnison IS, King IP, Humphreys MO. (2004) Synteny between a major heading-date QTL in perennial ryegrass (Lolium perenne L.) and the Hd3 heading-date locus in rice. Theor Appl Genet 108: 822–828 [DOI] [PubMed] [Google Scholar]
- Armstead IP, Turner LB, Marshall AH, Humphreys MO, King IP, Thorogood D. (2008) Identifying genetic components controlling fertility in the outcrossing grass species perennial ryegrass (Lolium perenne) by quantitative trait loci analysis and comparative genetics. New Phytol 178: 559–571 [DOI] [PubMed] [Google Scholar]
- Ben-Naim O, Eshed R, Parnis A, Teper-Bamnolker P, Shalit A, Coupland G, Samach A, Lifschitz E. (2006) The CCAAT binding factor can mediate interactions between CONSTANS-like proteins and DNA. Plant J 46: 462–476 [DOI] [PubMed] [Google Scholar]
- Bonnin I, Rousset M, Madur D, Sourdille P, Dupuits C, Brunel D, Goldringer I. (2008) FT genome A and D polymorphisms are associated with the variation of earliness components in hexaploid wheat. Theor Appl Genet 116: 383–394 [DOI] [PubMed] [Google Scholar]
- Cole C, Barber JD, Barton GJ. (2008) The Jpred 3 secondary structure prediction server. Nucleic Acids Res 36: W197–W201 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbesier L, Vincent C, Jang SH, Fornara F, Fan QZ, Searle I, Giakountis A, Farrona S, Gissot L, Turnbull C, et al. (2007) FT protein movement contributes to long-distance signaling in floral induction of Arabidopsis. Science 316: 1030–1033 [DOI] [PubMed] [Google Scholar]
- Danilevskaya ON, Meng X, Hou ZL, Ananiev EV, Simmons CR. (2008) A genomic and expression compendium of the expanded PEBP gene family from maize. Plant Physiol 146: 250–264 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Devlin B, Roeder K. (1999) Genomic control for association studies. Biometrics 55: 997–1004 [DOI] [PubMed] [Google Scholar]
- Donnison IS, O'Sullivan DM, Thomas A, Canter P, Moore B, Armstead I, Thomas H, Edwards KJ, King IP. (2005) Construction of a Festuca pratensis BAC library for map-based cloning in Festulolium substitution lines. Theor Appl Genet 110: 846–851 [DOI] [PubMed] [Google Scholar]
- Farrar K, Asp T, Lubberstedt T, Xu M, Thomas A, Christiansen C, Humphreys M, Donnison I. (2007) Construction of two Lolium perenne BAC libraries and identification of BACs containing candidate genes for disease resistance and forage quality. Mol Breed 19: 15–23 [Google Scholar]
- Faure S, Higgins J, Turner A, Laurie DA. (2007) The FLOWERING LOCUS T-like gene family in barley (Hordeum vulgare). Genetics 176: 599–609 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hanzawa Y, Money T, Bradley D. (2005) A single amino acid converts a repressor to an activator of flowering. Proc Natl Acad Sci USA 102: 7748–7753 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hardy OJ, Vekemans X. (2002) SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes 2: 618–620 [Google Scholar]
- Hayama R, Coupland G. (2004) The molecular basis of diversity in the photoperiodic flowering responses of Arabidopsis and rice. Plant Physiol 135: 677–684 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hayama R, Yokoi S, Tamaki S, Yano M, Shimamoto K. (2003) Adaptation of photoperiodic control pathways produces short-day flowering in rice. Nature 422: 719–722 [DOI] [PubMed] [Google Scholar]
- Hemming MN, Peacock WJ, Dennis ES, Trevaskis B. (2008) Low-temperature and daylength cues are integrated to regulate FLOWERING LOCUS T in barley. Plant Physiol 147: 355–366 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hisamatsu T, King RW. (2008) The nature of floral signals in Arabidopsis. II. Roles for FLOWERING LOCUS T (FT) and gibberellin. J Exp Bot 59: 3821–3829 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Igasaki T, Watanabe Y, Nishiguchi M, Kotoda N. (2008) The FLOWERING LOCUS T/TERMINAL FLOWER 1 family in Lombardy poplar. Plant Cell Physiol 49: 291–300 [DOI] [PubMed] [Google Scholar]
- Jaeger KE, Wigge PA. (2007) FT protein acts as a long-range signal in Arabidopsis. Curr Biol 17: 1050–1054 [DOI] [PubMed] [Google Scholar]
- Jensen LB, Andersen JR, Frei U, Xing YZ, Taylor C, Holm PB, Lübberstedt TL. (2005) QTL mapping of vernalization response in perennial ryegrass (Lolium perenne L.) reveals co-location with an orthologue of wheat VRN1. Theor Appl Genet 110: 527–536 [DOI] [PubMed] [Google Scholar]
- Kang HM, Zaitlen NA, Wade CM, Kirby A, Heckerman D, Daly MJ, Eskin E. (2008) Efficient control of population structure in model organism association mapping. Genetics 178: 1709–1723 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kikuchi R, Kawahigashi H, Ando T, Tonooka T, Handa H. (2009) Molecular and functional characterization of PEBP genes in barley reveal the diversification of their roles in flowering. Plant Physiol 149: 1341–1353 [DOI] [PMC free article] [PubMed] [Google Scholar]
- King RW, Hisamatsu T, Goldschmidt EE, Blundell C. (2008) The nature of floral signals in Arabidopsis. I. Photosynthesis and a far-red photoresponse independently regulate flowering by increasing expression of FLOWERING LOCUS T (FT). J Exp Bot 59: 3811–3820 [DOI] [PMC free article] [PubMed] [Google Scholar]
- King RW, Moritz T, Evans LT, Martin J, Andersen CH, Blundell C, Kardailsky I, Chandler PM. (2006) Regulation of flowering in the long-day grass Lolium temulentum by gibberellins and the FLOWERING LOCUS T gene. Plant Physiol 141: 498–507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kojima S, Takahashi Y, Kobayashi Y, Monna L, Sasaki T, Araki T, Yano M. (2002) Hd3a, a rice ortholog of the Arabidopsis FT gene, promotes transition to flowering downstream of Hd1 under short-day conditions. Plant Cell Physiol 43: 1096–1105 [DOI] [PubMed] [Google Scholar]
- Komeda Y. (2004) Genetic regulation of time to flower in Arabidopsis thaliana. Annu Rev Plant Biol 55: 521–535 [DOI] [PubMed] [Google Scholar]
- Komiya R, Ikegami A, Tamaki S, Yokoi S, Shimamoto K. (2008) Hd3a and RFT1 are essential for flowering in rice. Development 135: 767–774 [DOI] [PubMed] [Google Scholar]
- Kumimoto RW, Adam L, Hymus GJ, Repetti PP, Reuber TL, Marion CM, Hempel FD, Ratcliffe OJ. (2008) The Nuclear Factor Y subunits NF-YB2 and NF-YB3 play additive roles in the promotion of flowering by inductive long-day photoperiods in Arabidopsis. Planta 228: 709–723 [DOI] [PubMed] [Google Scholar]
- Langdon T, Thomas A, Huang L, Farrar K, King J, Armstead I. (2009) Fragments of the key flowering gene GIGANTEA are associated with helitron-type sequences in the Pooideae grass Lolium perenne. BMC Plant Biol 9: 70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23: 2947–2948 [DOI] [PubMed] [Google Scholar]
- Li D, Liu C, Shen L, Wu Y, Chen H, Robertson M, Helliwell CA, Ito T, Meyerowitz E, Yu H. (2008) A repressor complex governs the integration of flowering signals in Arabidopsis. Dev Cell 15: 110–120 [DOI] [PubMed] [Google Scholar]
- MacArthur S, Li XY, Li JY, Brown JB, Chu HC, Zeng L, Grondona BP, Hechmer A, Simirenko L, Keränen SVE, et al. (2009) Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions. Genome Biol 10: R80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McMullen MD, Kresovich S, Villeda HS, Bradbury P, Li HH, Sun Q, Flint-Garcia S, Thornsberry J, Acharya C, Bottoms C, et al. (2009) Genetic properties of the maize nested association mapping population. Science 325: 737–740 [DOI] [PubMed] [Google Scholar]
- Michaels SD. (2009) Flowering time regulation produces much fruit. Curr Opin Plant Biol 12: 75–80 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monna L, Lin X, Kojima S, Sasaki T, Yano M. (2002) Genetic dissection of a genomic region for a quantitative trait locus, Hd3, into two loci, Hd3a and Hd3b, controlling heading date in rice. Theor Appl Genet 104: 772–778 [DOI] [PubMed] [Google Scholar]
- Moon J, Lee H, Kim M, Lee I. (2005) Analysis of flowering pathway integrators in Arabidopsis. Plant Cell Physiol 46: 292–299 [DOI] [PubMed] [Google Scholar]
- Nicholas KB, Nicholas HB. (1997) GeneDoc: a tool for editing and annotating multiple sequence alignments. www.nrbsc.org/ (December 15, 2010)
- Nordborg M, Weigel D. (2008) Next-generation genetics in plants. Nature 456: 720–723 [DOI] [PubMed] [Google Scholar]
- Rost B, Yachdav G, Liu J. (2004) The PredictProtein server. Nucleic Acids Res 32: W321–W326 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sasani S, Hemming MN, Oliver SN, Greenup A, Tavakkol-Afshari R, Mahfoozi S, Poustini K, Sharifi HR, Dennis ES, Peacock WJ, et al. (2009) The influence of vernalization and daylength on expression of flowering-time genes in the shoot apex and leaves of barley (Hordeum vulgare). J Exp Bot 60: 2169–2178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schuler G, Altschul S, Lipman D. (1991) A workbench for multiple alignment construction and analysis. Proteins: Structure, Function, and Bioinformatics 9: 180–190 [DOI] [PubMed] [Google Scholar]
- Schwartz C, Balasubramanian S, Warthmann N, Michael TP, Lempe J, Sureshkumar S, Kobayashi Y, Maloof JN, Borevitz JO, Chory J, et al. (2009) Cis-regulatory changes at FLOWERING LOCUS T mediate natural variation in flowering responses of Arabidopsis thaliana. Genetics 183: 723–732 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siefers N, Dang KK, Kumimoto RW, Bynum WE, IV, Tayrose G, Holt BF., III (2009) Tissue-specific expression patterns of Arabidopsis NF-Y transcription factors suggest potential for extensive combinatorial complexity. Plant Physiol 149: 625–641 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skøt L, Humphreys J, Humphreys MO, Thorogood D, Gallagher J, Sanderson R, Armstead IP, Thomas ID. (2007) Association of candidate genes with flowering time and water-soluble carbohydrate content in Lolium perenne (L.). Genetics 177: 535–547 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skøt L, Humphreys MO, Armstead I, Heywood S, Skøt KP, Sanderson R, Thomas ID, Chorlton KH, Hamilton NRS. (2005) An association mapping approach to identify flowering time genes in natural populations of Lolium perenne (L.). Mol Breed 15: 233–245 [Google Scholar]
- Smith KF, Dobrowolski MP, Cogan NOI, Spangenberg GC, Forster JW. (2009) Utilizing linkage disequilibrium and association mapping to implement candidate gene based markers in perennial ryegrass breeding. In Yamada T, Spangenburg G, eds, Molecular Breeding of Forage and Turf. Springer Science+Business Media, New York, pp 259–274 [Google Scholar]
- Stich B, Piepho HP, Schulz B, Melchinger AE. (2008) Multi-trait association mapping in sugar beet (Beta vulgaris L.). Theor Appl Genet 117: 947–954 [DOI] [PubMed] [Google Scholar]
- Stracke S, Haseneyer G, Veyrieras JB, Geiger HH, Sauer S, Graner A, Piepho HP. (2009) Association mapping reveals gene action and interactions in the determination of flowering time in barley. Theor Appl Genet 118: 259–273 [DOI] [PubMed] [Google Scholar]
- Studer B, Jensen LB, Hentrup S, Brazauskas G, Kölliker R, Lübberstedt T. (2008) Genetic characterisation of seed yield and fertility traits in perennial ryegrass (Lolium perenne L.). Theor Appl Genet 117: 781–791 [DOI] [PubMed] [Google Scholar]
- Tamaki S, Matsuo S, Wong HL, Yokoi S, Shimamoto K. (2007) Hd3a protein is a mobile flowering signal in rice. Science 316: 1033–1036 [DOI] [PubMed] [Google Scholar]
- Turck F, Fornara F, Coupland G. (2008) Regulation and identity of florigen: FLOWERING LOCUS T moves center stage. Annu Rev Plant Biol 59: 573–594 [DOI] [PubMed] [Google Scholar]
- van Berloo R, Zhu AG, Ursem R, Verbakel H, Gort G, van Eeuwijk FA. (2008) Diversity and linkage disequilibrium analysis within a selected set of cultivated tomatoes. Theor Appl Genet 117: 89–101 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waugh R, Jannink JL, Muehlbauer GJ, Ramsay L. (2009) The emergence of whole genome association scans in barley. Curr Opin Plant Biol 12: 218–222 [DOI] [PubMed] [Google Scholar]
- Wigge PA, Kim MC, Jaeger KE, Busch W, Schmid M, Lohmann JU, Weigel D. (2005) Integration of spatial and temporal information during floral induction in Arabidopsis. Science 309: 1056–1059 [DOI] [PubMed] [Google Scholar]
- Wu CY, You CJ, Li CS, Long T, Chen GX, Byrne ME, Zhang QF. (2008) RID1, encoding a Cys2/His2-type zinc finger transcription factor, acts as a master switch from vegetative to floral development in rice. Proc Natl Acad Sci USA 105: 12915–12920 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wunderlich Z, Mirny LA. (2009) Different gene regulation strategies revealed by analysis of binding motifs. Trends Genet 25: 434–440 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yamaguchi A, Kobayashi Y, Goto K, Abe M, Araki T. (2005) TWIN SISTER OF FT (TSF) acts as a floral pathway integrator redundantly with FT. Plant Cell Physiol 46: 1175–1189 [DOI] [PubMed] [Google Scholar]
- Yan L, Fu D, Li C, Blechl A, Tranquilli G, Bonafede M, Sanchez A, Valarik M, Yasuda S, Dubcovsky J. (2006) The wheat and barley vernalization gene VRN3 is an orthologue of FT. Proc Natl Acad Sci USA 103: 19581–19586 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yao J, Wang L, Liu L, Zhao C, Zheng Y. (2009) Association mapping of agronomic traits on chromosome 2A of wheat. Genetica 137: 67–75 [DOI] [PubMed] [Google Scholar]
- Zhang LY, Marchand S, Tinker NA, Belzile F. (2009) Population structure and linkage disequilibrium in barley assessed by DArT markers. Theor Appl Genet 119: 43–52 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




