Abstract
Early vigour (EV) and precocity are important traits for orchard establishment and profitability in macadamia. EV determines tree growth and adaptation, while precocity facilitates early yield, offering economic benefits. Although, a positive relationship between these traits has been observed in other tree crops, their association in macadamia remains unclear. This study aimed to identify genetic links between EV and precocity by assessing genetic variability, heritability, and correlations in a 5-year-old macadamia breeding population. The population comprised 904 progenies planted across six sites in Queensland, Australia. Genome-wide association studies (GWAS) were conducted on a subset of 220 accessions genotyped with 7401 SNP markers. A linear mixed model incorporating a kinship matrix and principal components to account for population structure was used to perform association analysis in TASSEL. Phenotypic analyses in ASReml-R revealed that precocity had higher broad- (H2 = 0.25–0.84) and narrow-sense (h2 = 0.10–0.77) heritability compared to EV (H2 = 0–0.61, h2 = 0–0.49). EV and precocity showed positive phenotypic (0.25–0.42) and genetic (0.21–0.31) correlations. GWAS identified 11 significant markers (false discovery rate < 0.05), including two pleiotropic markers (Mint10079 and Mint4004) associated with both EV and precocity. Putative genes linked to these markers were involved in cell wall modelling, pathogen defence, abiotic stress tolerance, flowering, overall growth, and development in other tree species. These significant markers, postvalidation, hold substantial promise for utilization in marker-assisted selection (MAS). Integrating putative pleiotropic markers into MAS can enhance genetic gain by reducing the selection time for and enabling simultaneous selection for EV and precocity.
Introduction
Macadamia (Macadamia integrifolia, Macadamia ternifolia, and hybrids) is a native Australian tree crop cultivated globally for its high-value kernels. Australia is one of the leading producers with cultivation expanding into subtropical and tropical regions around the world, including China, South Africa, Hawaii, New Zealand, South America, and Southeast Asia [1]. Despite its popularity, the genetic improvement of macadamia has been slow due to obstacles such as high heterozygosity, protracted juvenility, and long breeding cycle ([2–4]). Genetic gains are further impeded by vigourous vegetative growth at maturity, which restrict orchards to low-density planting with high management costs [5, 6]. Traditional breeding approaches involve laborious phenotyping over many years to identify elite candidate cultivars, hence it takes at least 24 years to release new varieties [1]. Industry may thus benefit from genomic approaches that reduce selection cycles and increase genetic gain.
Macadamia trees remain juvenile for at least 5–7 years until maturing at 10–15 years [1]. Long juvenility extends the breeding cycle, slowing the development of new cultivars, while growers face the challenge of delayed orchard productivity, requiring substantial financial investments without returns for several years. Precocity refers to the ability of a tree crop to bear fruits or flowers early in its life cycle [7]. It is associated with a short juvenile period and high productivity at maturity in wood crops (Zimmerman, 1972 [127]). In macadamia, precocity is particularly valuable as it accelerates breeding progress by reducing selection cycles and increasing genetic gain [8]. Maximizing early yields and shortening the time to first commercial harvest are top priorities for growers when establishing new macadamia orchards [9]. Precocity offers the benefit of early productivity and faster economic returns for growers. In the absence of genetically precocious cultivars, methods such as trunk girdling and application of plant growth regulators have shown to increase early fruiting [10]. Developing naturally precocious cultivars would optimize both breeding efficiency and orchard productivity, reducing reliance on management interventions.
Early vigour (EV) or growth of young trees is another important trait with clear advantages in tree crops, although vigour at maturity is considered detrimental for productivity [11]. EV determines crop’s ability to cope with biotic and abiotic stresses with significant implications on climate change adaptation such as drought and heat [12]. For instance, increased early biomass promotes rapid canopy closure, reducing soil evaporation and improving water availability [13]. High early shoot growth also facilitates better photosynthate transport to roots, ensuring proper establishment and resilience against stress. This has been observed in crops like cotton, wheat, rice, Arabidopsis, and some forest trees [12, 14, 15]. Furthermore, stem diameter in young plants correlates with traits associated with water absorption and transport, supporting survival during water stress [16]. In macadamia, EV is particularly important for establishing a healthy and uniform orchard canopy, which enhances a plant’s ability to compete for resources like sunlight, water, nutrients, and contributes to a greater crop output.
Strong EV is also linked to precocity because it is essential to reach an appropriate growth size in order to enter the reproductive phase [17]. EV expedites the transition from juvenility to the adult phase, a process marked by flowering. The positive relationship between EV and precocity has been well established in several tree crops such as apple, pear, pecan, cocoa, and olive [17–20]. Increased light interception had been suggested as a contributing factor to this relationship [21]. Since EV accelerates canopy closure, it optimizes light interception and supports precocity [22]. For instance, growth of 1- to 4-year-old pecan seedlings showed positive correlations with precocity, with the highest correlations observed in later growth stages [17]. In addition, establishment of this EV and precocity relationship has allowed preselection of precocious cultivars on the basis of early vegetative growth in tree crops [18–20, 23]. However, in macadamia, the only study reporting positive influence of growth on yield in 4 to 6-year-old trees was limited to 15 genotypes [21], with no explorations into the genetic basis of this relationship. Exploring such genetic links would facilitate the selection of one trait based on the other, thereby reducing evaluation time and enhancing genetic gain.
Correlated traits often share genes or quantitative trait loci (QTL/s) that are located in proximate genomic regions [24]. Understanding the genetic basis of traits also facilitates the selection of new candidate cultivars. One of the most useful methods of dissecting the genetic components of complex quantitative traits is through genome-wide association studies (GWAS). GWAS is a genomic approach to identify candidate gene/QTL associated with target trait [25]. Significant marker–trait associations from GWAS facilitate marker-assisted selection (MAS), expediting breeding by enabling the selection for desirable traits during the juvenile stage, hence eliminating the necessity for field evaluations to maturity [25].
However, individual GWAS studies on the same trait often yield inconsistent results due to factors such as population-specific allele frequencies, inadequate control of population structure, and environmental influences [26]. Additionally, GWAS findings require validation across multiple environments to ensure their reliability [27]. A promising approach to address these limitations is meta-analysis of GWAS (metaGWAS), which integrates summary statistics (e.g. marker effects and their standard errors) from multiple independent studies to increase statistical power and reduce false-positive associations [28]. By combining data from different populations and environments, metaGWAS provides more robust and generalizable insights into the genetic architecture of traits. Despite its advantages, meta-analysis of QTLs has been extensively applied in grain crops [29–31], but remains rare in fruit and nut crops. The limited number of GWAS in these species poses a challenge for conducting metaGWAS, as this approach relies on multiple independent studies to combine data effectively. In macadamia, metaGWAS is currently not feasible due to the scarcity of GWAS studies. While recent efforts have identified marker–trait associations for yield and nut-related traits [32–34], traits such as EV and precocity remain unexplored.
To address this gap, this study aims to investigate the genetic basis of EV and precocity in macadamia through GWAS. Specifically, this study measured the genetic variance and heritability of EV and precocity in seedling progenies and estimated phenotypic and genetic correlations among examined traits. A GWAS was performed to identify candidate genes associated with EV and precocity traits and common genetic mechanism of traits were investigated. Results of this study provide the genetic links of EV and precocity and highlight the potential of early and rapid selection for these traits in future breeding programmes.
Results
Phenotypic variation
A summary of the mean and standard errors (SEs) of the raw data revealed a range of phenotypic variation across sites for all traits (Table 1). The highest means for EV traits were observed at Dunoon, where height (HT), canopy width (CW), and canopy volume (CV) were 4.94 m, 3.77 m, and 85.42 m3, respectively. The lowest means for HT (3.27 m) and CV (31.28 m3) were recorded at Newrybar, while the lowest mean for CW (2.67 m) was found at Baffle Creek_S1. Similarly, precocity traits, as represented by total nut-in-shell mass (TNM) and cumulative nut-in-shell mass (CNM), were lowest at Baffle Creek_S1, with mean values of 474.8 and 478.4 g, respectively. In contrast, the highest means for TNM (4181 g) and CNM (6563 g) were observed at Bundaberg.
Table 1.
Mean, maximum (max), and minimum (min) of EV and precocity traits across sites
| Traits | Description | Alloway | Ammamoor_S1 | Ammamoor_S2 | Baffle Creek_S1 | Baffle Creek_S2 | Bundaberg | Dunoon | Newrybar |
|---|---|---|---|---|---|---|---|---|---|
| Height (m) | Mean (SE) | 3.77 (0.04) | 3.93 (0.05) | 4.25 (0.06) | 3.88 (0.1) | 4.8 (0.06) | 4 (0.09) | 4.94 (0.13) | 3.27 (0.06) |
| Max | 5.02 | 5.47 | 5.80 | 6.01 | 6.86 | 6.23 | 7.8 | 4.88 | |
| Min | 1.16 | 1.52 | 2.6 | 1.5 | 1.3 | 1.99 | 1.9 | 1.28 | |
| Canopy width (m) | Mean (SE) | 3.43 (0.05) | 2.71 (0.05) | 3.25 (0.06) | 2.67 (0.1) | 3.8 (0.07) | 3.21 (0.09) | 3.77 (0.11) | 2.83 (0.06) |
| Max | 4.53 | 4.27 | 5.00 | 4.45 | 5.90 | 5.20 | 5.80 | 4.85 | |
| Min | 0.79 | 0.84 | 1.4 | 0.73 | 0.8 | 1.2 | 1 | 0.84 | |
| Canopy volume (m3) | Mean (SE) | 49.12 (1.38) | 33.55 (1.34) | 50.54 (2.23) | 34.31 (2.68) | 81.51 (3.31) | 48.18 (3.27) | 85.42 (5.42) | 31.28 (1.66) |
| Max | 94.98 | 99.71 | 120.74 | 114.80 | 220.77 | 133.71 | 247.18 | 105.72 | |
| Min | 0.76 | 1.44 | 7.60 | 0.86 | 1.36 | 4.04 | 1.99 | 0.95 | |
| Total nut in shell mass (g) | Mean (SE) | 2297 (175) | 1170 (135) | 705.2 (84) | 474.8 (129) | 2892 (243) | 4181 (358) | 1535 (273) | 888.9 (128) |
| Max | 8997 | 8388 | 4666 | 4960 | 17 984 | 12 159 | 11 400 | 6714 | |
| Min | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Cumulative nut in shell mass (g) | Mean (SE) | 2297 (175) | 1288 (155) | 1401 (170) | 478.4 (129) | 3564 (310) | 6563 (578) | 1734 (326) | 940.6 (136) |
| Max | 8997 | 9600 | 8541 | 4960 | 22 114 | 18 661 | 15 630 | 7428 | |
| Min | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Genetic variance and heritability
The multisite analysis revealed significant differences (P < 0.05, Walds test, Supplementary Table 1) among sites for all traits. Substantial differences in additive genetic and residual variances for each trait across sites were observed (Table 2, Supplementary Table 2). Heritability estimates varied widely among sites and traits. Broad-sense (H2) and narrow-sense heritability (h2) values were generally higher for precocity traits compared to EV traits. For precocity traits, the average h2 and H2 were 0.27 and 0.48 for CNM, and 0.28 and 0.50 for TNM, respectively. Among EV traits, HT showed the highest heritability, with h2 averaging 0.22 and H2 at 0.41. In contrast, CW had the lowest heritability estimates, with an average h2 of 0.09 and H2 of 0.21. CV exhibited moderate heritability, with h2 averaging 0.12 and H2 at 0.23.
Table 2.
Range of variance components, heritability, and average heritability across sites for EV and precocity traits. Detailed numeric data on variance components and heritability of each trait at each site is presented in Supplementary Table 2
| Trait | Va (% of Vp) |
Vf (% of Vp) |
Ve (% of Vp) |
h2 | H2 | Average h2 | Average H2 |
|---|---|---|---|---|---|---|---|
| HT | 0%–49% | 0%–44% | 39%–72% | 0–0.49 | 0.28–0.61 | 0.22 | 0.41 |
| CW | 0%–36% | 0%–28% | 57%–100% | 0–0.36 | 0–0.42 | 0.09 | 0.21 |
| CV | 0%–35% | 0%–18% | 54%–100% | 0–0.34 | 0–0.46 | 0.12 | 0.23 |
| TNM | 10%–77% | 0%–62% | 16%–75% | 0.10–0.77 | 0.25–0.84 | 0.28 | 0.50 |
| CNM | 10%–70% | 9%–62% | 16%–75% | 0.12–0.70 | 0.25–0.84 | 0.27 | 0.48 |
Va, additive genetic variance; Vp, total phenotypic variance; Vf, family variance; Ve, residual variance; h2, narrow-sense heritability; H2, broad-sense heritability
Phenotypic and genetic correlations
Phenotypic correlations among EV and precocity traits were positive and significant (rp = 0.25–0.42, P < 0.001; Table 3). There was also some degree of correlation between Best Linear Unbiased Predictions (BLUPs) (rg = 0.21–0.31) among these traits. Highest rp and rg for EV and precocity were observed for the pair CW-TNM and the lowest for HT-CNM.
Table 3.
Phenotypic correlations (top right, rp) and correlation between BLUPs (bottom left, rg) between EV and precocity traits
| rp rg |
HT | CW | CV | TNM | CNM |
|---|---|---|---|---|---|
| HT | 0.66*** | 0.78*** | 0.28*** | 0.27*** | |
| CW | 0.64*** | 0.91*** | 0.42*** | 0.39*** | |
| CV | 0.78*** | 0.94*** | 0.39*** | 0.36*** | |
| TNM | 0.29*** | 0.28*** | 0.31*** | 0.96*** | |
| CNM | 0.21*** | 0.22*** | 0.23*** | 0.94*** |
‘*’ in the superscript represents the level of significance of the correlation component (***P < 0.001).
Genome-wide associations
The LDKNNi imputation in TASSEL imputed the genotypic data with 89.7% accuracy, masking a random 1% of markers. Quantile-quantile (QQ) plots demonstrated that the linear mixed model (LMM) effectively accounted for population structure, as the expected and observed P-values aligned closely without early deviations (Fig. 1). On the basis of a threshold of P < 1e-04, four single nucleotide polymorphism markers (SNPs) were associated with HT, nine with CW, four with CV, eight with TNM, and six with CNM (Supplementary Table 3). After adjusting for false discovery rate (FDR < 0.05), the number of significant SNPs decreased to two for HT, five for CW, four for CV, two for TNM, and five for CNM, explaining 11%–31% of the phenotypic variance for the associated traits (Fig. 2; Table 4). Among the significant SNPs, Mint10079 was associated with HT, CV, TNM, and CNM, suggesting pleiotropic effects. Similarly, Mint4004 was significantly associated with both HT and CNM, while Mint11317 and Mint9112 showed overlap in associations with CV/CW and TNM/CNM, respectively.
Figure 1.
QQ plots showing expected versus observed significant levels for EV and precocity traits. Each black dot represents one of the total SNP markers in the study. Red diagonal lines indicate the expected P-values. SNP markers above the dashed horizontal lines (FDR = 0.05) are significantly associated with the trait.
Figure 2.
Manhattan plots showing distribution of SNPs across the macadamia genome and significance (−log10 Observed p) of marker–trait associations for EV and precocity traits. SNP markers above the yellow horizontal lines (FDR = 0.05) were significantly associated with the trait. Markers in unknown regions are shown on chromosome 15, unanchored scaffolds on chromosome 16, and unassigned markers on chromosome 0.
Table 4.
Summary of significant SNPs associated with EV and precocity traits identified from GWAS
| Trait | SNP ID | SNP position (bp) | FDR | SNP | MAF | Ma/Mi allele | R2 (%) |
|---|---|---|---|---|---|---|---|
| CV | Mint11317 | Chr 2: 4672424 | 0.049 | C/A | 0.309 | C/A | 17 |
| Mint10079 | Chr 3: 5338694 | 0.045 | C/A | 0.007 | A/C | 11 | |
| Mint7820 | Unknown | 0.045 | A/G | 0.179 | G/A | 15 | |
| HT | Mint4004 | Chr 5: 22548014 | 0.044 | C/G | 0.495 | G/C | 18 |
| Mint10079 | Chr 3: 5338694 | 0.009 | C/A | 0.007 | A/C | 13 | |
| CW | Mint11317 | Chr 2: 4672424 | 0.0004 | C/A | 0.309 | C/A | 31 |
| Mint7123 | Chr 5: 10381062 | 0.028 | T/C | 0.037 | T/C | 12 | |
| Mint6881 | Chr 9: 27124204 | 0.040 | A/G | 0.076 | A/G | 11 | |
| Mint433 | Unknown | 0.040 | G/T | 0.213 | T/G | 14 | |
| Mint8813 | Unknown | 0.003 | A/G | 0.036 | A/G | 15 | |
| TNM | Mint10079 | Chr 3: 5338694 | 0.038 | C/A | 0.007 | A/C | 11 |
| Mint9112 | Chr 4: 18401039 | 0.023 | C/G | 0.069 | G/C | 23 | |
| CNM | Mint4004 | Chr 5: 22548014 | 0.007 | C/G | 0.495 | G/C | 20 |
| Mint10079 | Chr 3: 5338694 | 0.002 | C/A | 0.007 | A/C | 15 | |
| Mint1617 | Chr 1: 26883847 | 0.007 | C/T | 0.033 | C/T | 13 | |
| Mint9112 | Chr 4: 18401039 | 0.049 | C/G | 0.069 | G/C | 18 | |
| Mint3645 | Unknown | 0.010 | G/A | 0.212 | A/G | 12 |
Ma, major allele; Mi, minor allele.
The effects of different allelic states (homozygous reference, homozygous alternate, heterozygous) of significant SNPs were analysed and compared with genotypes lacking that particular SNP (NN). For Mint10079, the homozygous alternate allele (CC) had a highly negative effect on both EV and precocity traits, though these differences were not statistically significant (P > 0.05) due to rarity (present in only one out of 220 accessions) (Fig. 3A). The heterozygous state (AC) exhibited a highly positive effect on precocity but was also a rare variant. Similarly, Mint4004 exhibited significant positive effects for CNM (P < 0.001 for CC vs NN and P = 0.05 for GG vs NN) but had negative effects for HT when in the heterozygous state (CG, n = 1) (Fig. 3B). The heterozygote CG of Mint4004 also had a highly positive effect on CNM although the difference was not significant due to the rarity of the allele. Other notable SNPs included Mint11317, which negatively affected CW and CV, with significant differences between CC and NN (P < 0.001) (Fig. 4A), and Mint9112, which positively influenced TNM and CNM, with CC showing the highest averages significantly different from GG (P < 0.001) and NN (P < 0.001) (Fig. 4B). Favourable SNPs such as Mint7820 (for CV) (Fig. 4C) and Mint1617 (for CNM) (Fig. 4D) also exhibited significant allele effects, while four markers associated with CW (Mint8813, Mint7123, Mint6881, and Mint433) and two markers associated with precocity (Mint1617 and Mint3645) showed no significant allelic differences (data not shown).
Figure 3.
Allele effect of pleiotropic markers (A) Mint10079 and (B) Mint4004 associated with EV and precocity traits. Genotypic states of each marker are shown below the respective boxplots, with the numbers in parentheses indicating the number of accessions for each genotype. Mean trait values for each genotypic state are displayed next to the red dots, and significant differences (P < 0.05) are indicated above the corresponding boxplots. NN represents accessions lacking the significant SNP.
Figure 4.
Allele effects of significant markers (A) Mint11317 and (B) Min9112 (C) Mint7820 and (D) Mint1617 associated with EV and precocity traits. Genotypic states of each marker are shown below the respective boxplots, with the numbers in parentheses indicating the number of accessions for each genotype. Mean trait values for each genotypic state are displayed next to the red dots, and significant differences (P < 0.05) are indicated above the corresponding boxplots. NN represents accessions lacking the significant SNP.
Significant SNPs were mapped to seven of the 14 chromosomes in M. integrifolia (chromosomes 1, 2, 3, 4, 5, 7, and 9), while a few markers had unknown positions (Table 4). Genes closely associated with these significant SNPs (located within 5000 bp upstream and downstream of the SNP positions) were predominantly involved in regulating plant growth and development, pathogen defence, disease tolerance, and cell division (Table 5).
Table 5.
Significant SNPs, closely associated genes, and their annotations from the latest whole-genome assembly of M. integrifolia along with probable gene functions in plant species
| Traits | SNP ID | Closely associated genes | Gene annotation | Probable function/s | References |
|---|---|---|---|---|---|
| CV, CW |
Mint11317 | LOC122072123 | Probable carotenoid cleavage dioxygenase 4 | Controls lateral branching | [35] |
| CV, HT, TNM, CNM | Mint10079 | LOC122073194 | Glucan endo-1,3-beta-D-glucosidase-like | Cell division, pathogen defence, and pollen development | [36, 37] |
| CV | Mint7820 | LOC122089925 | 1-Aminocyclopropane-1-carboxylate oxidase homolog 4-like | Regulates ethylene production affecting plant growth, development, and survival | [38] |
| HT, CNM | Mint4004 | LOC122079354 | Probable tRNA (guanine(26)-N(2))-dimethyl transferase 2 | tRNA modification, plant protein synthesis, growth, development, and flowering time regulation. | [39, 40] |
| CW | Mint7123 | LOC122079597 | Transcription initiation factor TFIID subunit 15 | Regulates gene expression and transcription pathways | [41] |
| CW | Mint6881 | LOC122088113 | DNA-directed RNA polymerase V subunit 5A-like | RNA production and reproductive development | [42] |
| CW | Mint433 | LOC122074227 | L-type lectin-domain containing receptor kinase IV.1-like | Bacterial pathogen defence, positive regulator of plant height | [43, 44] |
| CW | Mint8813 | LOC122094272 | Uncharacterized | ||
| CNM, TNM | Mint9112 | LOC122076299 | Probable disease resistance RPP8-like protein 2 | Pathogen resistance | [45] |
| CNM | Mint1617 | LOC122074502 | Uncharacterized | ||
| CNM | Mint3645 |
Discussion
Large plant size and a lengthy life cycle, particularly an extended juvenile phase, pose significant challenges to the cost and time efficiency of fruit and nut breeding programmes. GWAS provides a powerful tool for overcoming these limitations by identifying genetic variants associated with key traits, facilitating MAS, and ultimately reducing selection cycles while increasing genetic gain. GWAS has successfully identified candidate trait-associated genes in several fruit and nut crops, including apple, pear, mango, kiwifruit, walnut, and almond ([46–49]; Pérez de los Cobos et al., 2023; [50]). The integration of GWAS with MAS has been shown to accelerate breeding cycles and improve genetic gain in tree fruit crops [51–53]. In macadamia breeding, GWAS has the potential to overcome traditional constraints by enabling early seedling preselection based on validated markers. This approach can significantly reduce costs and time by minimizing the number of trees required for phenotyping [3, 8]. EV and precocity are critical traits for macadamia selection, yet previous GWAS efforts for these traits have been limited. This study is the first to identify genetic links between EV and precocity traits in macadamia.
Variability and heritability of traits in breeding population
Detection of a marker–trait association in GWAS is highly dependent on the extent of variation for the trait and a portion of this variation must be explained by genetic effects [54]. In this study, the mean values of traits varied across sites, with the highest EV observed at the Dunoon site and the lowest at Newrybar. Similarly, precocity had the highest mean value at the Bundaberg site and the lowest at Baffle Creek_S1. These differences in raw phenotypes suggest that growing location strongly influences EV and precocity traits in macadamia, a conclusion further supported by the high range of genetic variances across sites for these traits. Residual variances (Ve) for EV traits were higher than additive genetic variances (Va) at all sites, indicating a large proportion of unexplained variation. The lack of replicates in the study prevented the estimation of nonadditive genetic variances, which may have contributed to the unexplained variance. In contrast, precocity traits exhibited higher Va than Ve at most sites, suggesting stronger genetic control. The variation identified in this breeding population underscores its suitability for GWAS.
Heritability estimates for EV traits in this study demonstrate the complexity of these traits, with low to moderate H2 and h2 values. For instance, traits like CV (H2 = 0.23, h2 = 0.12) and CW (H2 = 0.21, h2 = 0.09) showed substantial environmental influence, suggesting that these traits may be challenging to improve genetically. The low heritabilities observed for these traits indicate that a significant portion of their expression is determined by environmental factors, making selection for these traits less effective unless extensive testing across different environments is conducted. In comparison, higher heritability estimates for vigour traits in other tree crops, such as cashew (H2 > 0.58, h2 > 0.51 for CW), walnut (h2 = 0.8 for HT), and pecan (H2 = 0.78 for HT) ([55–58]), suggest that these traits are more responsive to genetic selection in those species. This contrast underscores the importance of understanding species-specific genetic architectures. Similar to the finding in this study, Toft et al. [59] reported a very low H2 of 0.07 for CV of 4 to 5-year-old macadamia trees. Conversely, Hardner et al. [2] reported a higher heritability of CW at age five (H2 = 0.51–0.65 and h2 = 0.13–0.40). However, discrepancies between studies may stem from methodological differences, including the absence of pedigree information in earlier studies, which can result in underestimation of genetic effects [60]. In contrast, HT exhibited the highest heritability (H2 = 0.48, h2 = 0.27) among EV traits, reinforcing its potential as a practical target for breeding programmes aiming to enhance EV. This aligns with findings from macadamia and other tree crops like cashew and peach, where HT tends to show higher heritability compared to other vigour traits ([11, 55, 56, 59, 61]).
Precocity traits showed higher heritability than EV traits, suggesting a greater influence of genetic factors and reduced environmental variation. Comparable heritability estimates of early nut yield were reported in cashew (H2 = 0.38), kiwifruit (H2 = 0.35), and pecan (h2 = 0.35) [62–64]. In macadamia, similar estimates of TNM at age five (h2 = 0.19–0.37, H2 = 0.37–0.72) were reported by Hardner et al. (2019). These estimates contrasts with the low H2 of TNM (0.06) and CNM (0.13) reported by Hardner et al. [65] as well as TNM per tree (H2 = 0.13) by Toft et al. [59]. Differences in heritability estimates across studies underline the need for breeding programmes to carefully consider the population structure and selection strategies, as studies with selected progenies, such as that of Toft et al. [59] and Hardner et al. [65], may underestimate heritability due to reduced genetic variation [2]. These findings highlight the importance of accounting for environmental variation and leveraging appropriate statistical models, including those incorporating pedigree or genomic information, to more accurately estimate genetic parameters. Nonetheless, the higher genetic control over precocity traits makes them promising targets for genetic improvement, particularly in breeding programmes focused on early nut production.
Correlation between EV and precocity traits
Positive correlations between EV and precocity have been previously established in tree crops including apple, pear, pecan, cocoa, and olive [17–20]. This study identified low to moderate phenotypic (rp) and genetic (correlations between BLUPs, rg) correlations between EV and precocity in macadamia, with the strongest associations observed for CW-TNM (rp = 0.42; rg = 0.29), followed by CV-TNM (rp = 0.39; rg = 0.28) and HT-TNM (rp = 0.28; rg = 0.26). Toft et al. [59], reported a higher rg of 0.48 between early yield and CV. The correlations observed in this study were weaker than those found in other tree crops, such as walnut (rp = 0.74) and cashew (rp = 0.49) [66, 67]. The low to moderate rg indicates that while EV can predict precocity to some extent, its utility as a selection criterion is limited. Selection strategies focused solely on increasing EV may result in marginal improvements in precocity, but the rate of genetic gain is likely to be slow.
The physiological significance of these correlations lies in the potential trade-off between early tree vigour and precocity. Increased EV may accelerate canopy development, allowing trees to intercept more light and accumulate carbohydrate reserves necessary for early flowering and fruiting. Rapid early development of above-ground biomass also ensures shading of the soil surface, reducing soil water evaporation and increasing the crop’s water use efficiency [68, 69]. However, if EV is strongly correlated with mature vigour, such selection for EV could have negative consequences, as high vigour at maturity is often associated with increased management costs and reduced yield efficiency [3]. Given these potential drawbacks, correlations between early-stage and mature vigour have been examined in other tree crops. Some studies have reported negligible correlations, providing opportunities to select early-bearing cultivars with low mature vigour [17, 70]. To optimize breeding strategies for macadamia, comprehensive data on vigour across developmental stages, particularly juvenile–mature correlations, is essential. Such information would enable the identification of genotypes that balance precocity and manageable vigour, optimizing both early performance and long-term productivity. Additionally, the observed correlations between EV and precocity may be influenced by hormonal signalling pathways. Brassinosteroids, in coordination with other plant growth hormones such as gibberellins and auxins, play a key role in regulating vegetative growth and flowering, particularly in the transition from juvenile to reproductive phases [71]. Their involvement in cell elongation, carbohydrate metabolism, and meristem activity suggests a potential molecular basis for the observed relationship between EV and precocity. Further genomic and physiological studies are needed to elucidate the mechanisms governing these correlations, particularly in identifying candidate genes and pathways underlying EV and precocity in macadamia.
Genetic data and marker–trait associations
The genetic data used in this study comprised 7401 SNP markers obtained after the imputation and filtering of an initial dataset of 12 732 SNPs. Missing markers were imputed with high accuracy before applying quality control measures, including a call rate >50% and a minor allele frequency (MAF) >0.005. This contrasts with previous GWAS studies in macadamia, which filtered genetic data before imputation [32, 34], potentially reducing the number of usable markers. Pre-imputation filtering can impair imputation quality, as it may exclude markers that would otherwise meet quality thresholds after imputation [72].
A conventional GWAS approach filters SNPs with a MAF >0.05 to improve detection power, as rare alleles generally lack statistical power [73]. However, recent studies have extended this threshold, including markers with MAF as low as 0.03 [50], and even 0.01 [74], arguing that rare alleles, often arising from purifying selection, may carry causal variants critical for trait variation. Reflecting this, this study included markers with MAF >0.005, leading to the discovery of a significant SNP (Mint11079) with a very low MAF (0.007) associated with most traits under investigation. Although other significant SNPs had MAFs >0.03, this highlights the value of retaining low-frequency markers to capture rare but potentially important alleles. The low MAF of significant markers could shift with sample size, potentially affecting the proportion of explained variance [34].
Previous GWAS in macadamia excluded markers with MAF <0.025, although they identified many significant markers with MAF below this threshold due to a pre-imputation filtering of the SNP data [32, 34]. Excluding low MAF SNPs could prevent the identification of rare causal variants, as demonstrated in other crops: the Brachytic2 locus (MAF <0.01) associated with plant height in maize and the GS3 locus (MAF <0.01) linked to yield traits in rice [75, 76]. While rare alleles offer opportunities for genetic improvement, their utility in MAS must be approached cautiously. Statistical models designed for rare variant analysis in human genetics, such as burden tests or SKAT, may provide a useful framework for plants [77]. In the absence of such models for plant species, increasing marker density remains a viable strategy to enhance detection power [54]. Nevertheless, postvalidation, these rare alleles could be particularly valuable for gene editing approaches, such as target site mutation and CRISPR-Cas systems [78], to directly modify key loci associated with EV and precocity traits, enabling rapid genetic improvement and precision breeding in macadamia.
Despite the advantages of GWAS, false-positive associations remain a concern due to factors such as population structure, low-frequency causal alleles, and the multiple-testing burden [79]. The number of SNPs used in this study (7401) was relatively smaller compared to recent GWAS in other fruit and nut crops such as mango (100 000), apple (43 000), kiwifruit (9 million), walnut (364 275), and almond (60 581), which have leveraged higher SNP densities for improved detection power ([47, 49][80, 81]; Pérez de los Cobos et al., 2023; Y. Wang et al., 2024). Similarly, studies in grain crops typically employ larger SNP datasets, facilitating the identification of a greater number of significant associations [82–84]. Although this study utilized highly informative markers with stringent quality control, increasing SNP density in future studies could enhance genome-wide coverage and improve the detection of additional alleles linked to the traits of interest. While the SNP dataset in this study is not as extensive, it serves as a preliminary effort in applying genomic approaches to macadamia breeding. These findings provide an essential foundation for future research and breeding strategies, demonstrating the potential of GWAS to identify key genetic factors influencing important traits in macadamia.
Although this study effectively accounted for population structure and kinship using an LMM, as evidenced by QQ plots showing no early deviation from the null expectation, low MAF of a few SNPs is still a concern. Each marker–trait association must be validated before incorporating into MAS. Two common approaches are (1) independent validation in larger populations and (2) functional validation via transcriptomics, transgenesis, gene editing, or gene silencing [79]. While cross-validation is labour- and time-intensive, functional validation, particularly expression analysis of candidate genes, offers a more direct and efficient pathway to confirm GWAS results [85]. The step of validation through transcriptomics therefore provides better confidence in utilizing candidate genes/markers in MAS. By providing functional evidence, transcriptomics strengthens confidence in utilizing candidate genes and markers for MAS, bridging the gap between association studies and practical breeding applications.
Allelic effects on phenotypic variation
Significant SNPs identified through GWAS explained 11%–31% of phenotypic variance, suggesting the presence of major QTLs controlling these traits. SNPs explaining >10% variance are considered effective for breeding applications [86]. Higher variance explanation also indicates stronger marker–trait correlations [87].
Visualizing the phenotypic effects of SNP alleles provides insights for MAS in breeding programme. In this study, favourable and unfavourable alleles were determined based on the phenotypic differences among different allelic states of each marker. For Mint10079, the rare allele AC (n = 1) showed a favourable effect on precocity with negligible impact on EV, while the rare CC allele (n = 1) reduced both EV and precocity, though these effects lacked statistical significance due to their rarity. Selecting for the AC allele, as such, could enhance precocity without compromising EV. Similarly, at another pleiotropic locus, Mint4004, the homozygous CC allele was favourable for HT, and the rare heterozygous CG allele positively influenced precocity while reducing EV. Selecting accessions for this rare heterozygous allele might be beneficial in identifying cultivars suitable for high-density planting systems, promoting small tree size and high precocity. Given the known importance of EV in stress tolerance and root growth [15, 88], further investigation into the effect of this SNP allele, especially on yield stability until maturity, is warranted. Considering that the favourable alleles in these instances are rare, thorough further investigation, as previously emphasized, is imperative.
For EV, individuals with homozygous CC allele at Mint11317 displayed unfavourable effects, whereas those with AA or AG at Mint7820 exhibited favourable impacts. Similarly, homozygous CC at Mint9112 and Mint1617 demonstrated highly favourable effects on precocity. Implementing MAS based on these findings involves genotyping progenies at the seedling stage, followed by selecting the most favourable alleles of these markers, postvalidation. This approach streamlines the selection process for improved EV and precocity, eliminating the need for manual phenotyping and population evaluation.
Partially shared genetic mechanism between EV and precocity
The observed correlations between EV and precocity may be driven by shared molecular pathways controlling these traits. Pleiotropy, where a single gene affects multiple phenotypic traits, is a common mechanism underlying such correlations [46, 89, 90]. This study identified two SNPs, Mint10079 and Mint4004, significantly associated with both EV and precocity. Mint10079 was linked to HT, CV, TNM, and CNM, while Mint4004 was associated with HT and CNM. Although the genetic correlation between EV and precocity traits was modest, the identification of shared loci highlights the potential for genetic co-regulation.
Pleiotropic genes influencing vegetative and reproductive development have been identified in other perennial crops. For example, FLOWERING LOCUS T (FT) is known to coordinate vegetative growth and flowering time in poplar, integrating environmental signals to regulate growth transitions [91, 92]. Similarly, significant loci associated with precocity have been linked to EV in pistachio [93]. Interestingly, pleiotropic loci have been observed even in cases where traits lack significant genetic correlations, as seen in wheat, where vigour and yield-related traits shared common genetic loci [94]. This suggests that pleiotropy may play a role in coordinating traits through shared pathways or regulatory mechanisms. From a breeding perspective, SNP markers with pleiotropic effects on EV and precocity enable simultaneous selection for favourable alleles across traits, enhancing MAS. However, thorough validation through independent testing and functional analyses, as mentioned before, is essential to ensure their reliability and practical application.
Earlier studies have shown that functional genes are often located near or within the regions of associated SNPs [95, 96]. With the availability of a well-annotated M. integrifolia genome [95], candidate genes were defined as those within 5000 bp of the associated SNPs. Among the eight putative candidate genes identified, two were found to exhibit pleiotropic effects, being associated with both EV and precocity traits: one encoding glucan endo-1,3-β-D-glucosidase-like and the other encoding probable tRNA (guanine(26)-N(2))-dimethyl transferase 2.
The glucan endo-1,3-β-D-glucosidase-like gene, officially classified as endo β-1,3-glucanase (β-1,3-G) [97], plays a critical role in various physiological and developmental processes in plants. β-1,3-G enzymes facilitate callose degradation at plasmodesmata, influencing cell-to-cell communication, axillary bud outgrowth, and shoot meristem differentiation [98, 99]. This function is critical for canopy expansion and biomass accumulation, which contribute to EV. Additionally, β-1,3-G has been linked to stomatal development [99], potentially enhancing gas exchange efficiency and supporting rapid growth. Beyond its role in vegetative processes, β-1,3-G is also implicated in floral induction by regulating callose turnover in meristematic tissues, thereby facilitating the transition from vegetative to reproductive growth [98]. The involvement of this gene in macadamia suggests its role in promoting vegetative growth while also enabling the transition to reproductive development.
The probable tRNA (guanine(26)-N(2))-dimethyl transferase 2 gene is a key post-transcriptional regulator that stabilizes tRNA, enhances the accuracy of codon–anticodon pairing, and ensures proper tRNA folding under high temperature or oxidative stress [39]. tRNA modifications can enhance translational accuracy and efficiency, which are critical for optimal protein synthesis in plants [39]. This optimization of translation efficiency reduces energy wastage, potentially allowing plants to allocate resources more effectively to growth and defence. In Arabidopsis, mutants of this gene exhibited slower growth, reduced biomass, and delayed flowering, emphasizing its role in developmental regulation [40]. The association of this gene with both EV and precocity in macadamia suggests that enhanced protein synthesis efficiency may support robust early vegetative growth while ensuring timely floral transition. Efficient translation could facilitate the rapid accumulation of structural and regulatory proteins required for canopy expansion while simultaneously promoting the timely onset of reproductive development.
Other candidate genes associated with EV traits primarily regulate plant growth, reproductive development, pathogen defence, and transcriptional pathways in plant species. A probable carotenoid cleavage dioxygenase (CCD) gene associated with CV and CW showed a negative effect on these traits. CCDs regulate the synthesis of apocarotenoid hormones, which control lateral shoot growth and branching [100]. Reduced branching and compact growth due to CCD activity may influence overall tree architecture in macadamia, potentially affecting light interception and biomass allocation. Another candidate gene, 1-aminocyclopropane-1-carboxylate oxidase homolog 4-like (ACO), linked to CV, is involved in ethylene synthesis [101]. The positive effect of ACO on EV traits in this study may be related to enhanced root development and improved cambial meristem activity, as ethylene signalling has been shown to regulate cell division and vascular differentiation [102]. Additionally, a locus annotated as probable disease resistance RPP8-like protein 2, known for its involvement in pathogen defence in Arabidopsis, was associated with precocity in this study [45]. While the precise function of this gene in precocity and yield remains unclear, enhanced pathogen resistance may reduce stress-induced delays in flowering and fruiting, thereby promoting early reproductive development. Future work should focus on functional validation of these candidate genes to confirm their roles and explore their regulatory networks in macadamia. Identified candidate genes, particularly the pleiotropic genes, are particularly promising.
Conclusions
Macadamia is an ideal candidate for genomic-assisted breeding due to their long juvenile period, large size, and the costs associated with phenotyping, evaluation, and management. Despite these challenges, the application of MAS in macadamia breeding programmes has only recently gained attention. The present study, being the first to explore common genetic link between EV and precocity, thus has important implications for macadamia breeding. The findings offer significant insights into the variability of these traits in a diverse breeding population across different sites, emphasizing the influence of growing locations. The heritability estimates suggest that traits related to precocity are more heritable than EV. Marker–trait associations identified eight SNPs significantly associated with EV and five SNPs with precocity, including two pleiotropic SNPs linked with both traits. These pleiotropic markers present a unique opportunity for MAS in breeding, enabling the simultaneous enhancement of multiple desirable traits. Putative candidate genes associated with these markers, shedding light on their potential roles in physiological and developmental processes were investigated. Genes related to β-1,3-glucanases and guanine-methyl transferase influenced both EV and precocity traits. Additional candidate genes involved in plant growth, reproductive development, and defence mechanisms were also identified. The study also acknowledges the common challenges associated with GWAS, including false-positive associations, population structure concerns, and the importance of validating associations, particularly with low MAF markers. Overall, this study enhances our understanding of the genetic mechanisms underlying EV and precocity in macadamia, offering a pathway for the use of significant markers in MAS to accelerate the selection process in macadamia breeding.
Methodology
Plant material and phenotypic data
This study involves a subset of seedling progenies from the Australian macadamia breeding programme’s B1.2 population [103]. The entire progeny population consisted of 2345 biparental progenies from 204 families planted between 2001 and 2003 across nine locations in southeast Queensland and northeast New South Wales [104]. The subset in this study includes 904 progenies from 111 families planted from 2001 to 2003 across six sites in southeast Queensland: Alloway, Amamoor, Baffle Creek, Bundaberg, Dunoon, and Newrybar. Amamoor and Baffle Creek had two separate trials planted in 2002 and 2003, which were treated as distinct sites and are denoted as Amamoor_S1, Amamoor_S2, Baffle Creek_S1, and Baffle Creek_S2, respectively. These six sites were selected for analysis because phenotypic data for early vigour and precocity at age five were available only for these sites, while the remaining three locations lacked phenotypic records for these traits. Trees were planted in single tree plots using an incomplete block design with replication of families. Detailed information on trial layouts, block structure, and site-specific management is provided in Hardner [103] and Supplementary Table 3. Spacing was set at 4 m within rows and 8 m between rows [104]. Standard horticultural practices, including pruning, fertilization, irrigation, and pest management, were applied to maintain tree health [105].
Historical data on growth and precocity traits were utilized for this study. EV traits were represented by HT, CW, and CV measured in 5-year-old trees. HT was measured from the ground to the tallest point of the tress, using a retractable measuring pole. CW was measured from one end of the canopy to the other along the rows. CV was derived from the measured traits using the formulae: 
Precocity was estimated using TNM and CNM measured at age five. Traits were assessed on an individual tree basis, following methods described by O’Connor et al. [106].
Phenotypic data analysis
LMMs were fitted for each trait across sites using the asreml function in the ASREML-R package [107] in R v 4.3.2 [108]. Structured variance models were used to estimate genetic and residual variance components for each trait across sites, allowing for heterogeneous variances across different sites. Site was fitted as a fixed effect, while individual genotype, family, block, row, and column effects were fitted as random effects. The additive genetic relationship matrix was calculated from the available pedigree information, and its inverse was computed in ASReml-R using the algorithm of Luo [109]. Residuals were modelled with site-specific structures to account for spatial variability or independence within each site. For sites with spatial correlations, a separable 2D (row and column) first-order autoregressive (AR1 × AR1) structure was applied, as described by Gilmour et al. [110] At other sites, residuals were treated as independent (ID) in one or both dimensions, depending on the experimental conditions and site-specific patterns.
The data can be represented by ypi(jklm), denoting the observation for the pth genotype (g) in the ith site (a), with associated family (f), block (b), row (r), and column (c), and the model can be defined as:
ypi(jklm) = μ + ai + gpi + fij + bik + ril + cim + epi(jklm) Model (1)
where μ is the overall mean, ai is the fixed effect of the ith site, gp is the random additive effect of the pth genotype across years (with distribution g ~ N(0,
) where A is the known additive relationship matrix based on pedigree and
is the additive genetic variance across ith site), fij is the random effect of jth family at ith site, bik is the random effect of kth block at ith site, ril is the random effect of lth row at ith site, cim is the random effect of mth column (tree position within rows) at ith site and epi(jklm) is the residual error e ~ N (0, Ei) where Ei is a block diagonal matrix specific to site i, modelled with residual variance–covariance structures as follows: E =
for Amamoor_S1, Amamoor_S2, Baffle Creek_S2, and Bundaberg,; E=
for Alloway; and E=
for Baffle Creek_S1 and Newrybar, where
and
are first-order autoregressive (ar1) spatial correlation matrices in the row and column dimensions, respectively, and I is the identity matrix. Wald tests of fixed effects were performed to identify the level of significance of differences among sites.
The variance components from Model 1 were used to estimate the narrow and broad sense heritabilities of each trait for each site using the following equations [111]:
where
is the estimated additive genetic variance,
is the estimated family variance, and
is the estimated residual variance. Variance components for each site used to estimate heritabilities are presented in Supplementary Table 2.
Predictions of total genetic effects (BLUPs) for each genotype and for each trait were made using an LMM based on Model 1 but with factor analytic (FA) genetic covariance models [112, 113]. In Model 1, the genetic effects across years were modelled using a diagonal structure, allowing for heterogeneous variances but assuming zero covariances between years [114]. The modified model (Model 2) replaced the diagonal structure with an FA model [113], which gives an approximation to an unstructured covariance matrix allowing for the direct estimation of genetic variances for each site and covariances between all pairs of sites.
All the phenotypic correlations and correlations between BLUPs presented in this study were calculated with the function ggpairs of the GGally package in R [115].
Genotypic data analysis
Genotyping and SNP filtering
For the GWAS analysis, a subset of 220 genotypes was selected from the original 904 progenies. This subset was chosen based on yield performance of genotypes. Five high-yielding and five low-yielding progenies per family were selected [116]. These genotypes had previously been genotyped for 12 732 SNP markers by Diversity Array Technology (DArT) Pty Ltd. The markers used in this study were selected from an initial pool of ~1 million SNPs, which were assessed for reproducibility, and only those with >99% reproducibility were retained for analysis [117]. To address missing genotypes, the markers were imputed using the LD KNNi method in TASSEL v 5.2.89 [118]. Markers were filtered for various quality control measures including >50% call rate, >0.5% MAF, and 5% minimum heterozygosity. After filtering, 7401 SNP markers were retained for the analysis.
Association analysis
To adjust for the population stratification by accounting for family structure and cryptic relatedness [119], a principal component analysis (PCA) based on multidimensional scaling (MDS) of the genotypic dataset was performed in TASSEL. This was followed by obtaining the kinship (K) between genotypes, using the centred identity by state (centred-IBS) method as explained by Endelman and Jannink [120]. An LMM accounting for both of these population structure and relatedness (PCA + K model) was used to perform association analysis between the genotype and the phenotype in TASSEL [118]. The statistical method used for LMM is:
y = Xβ + Zu + e
where y is the vector of observations; β is an unknown vector containing fixed effects including genetic marker and population structure (Q); u is an unknown vector of random additive genetic effects from multiple background QTL for individuals or lines; X and Z are the known design matrices; and e is the unobserved vector of random residuals. The u and e vectors are assumed to be normally distributed with null mean and variance of
, where G =
with
as the unknown additive genetic variance and K as the kinship matrix.
To visualize SNP markers associated with specific traits and verify that population structure was adequately accounted for, QQ and Manhattan plots were generated based on the association analysis. The QQ plot displayed the calculated SNP P-values on the y-axis against the expected uniform distribution of P-values on the x-axis, providing a visual assessment of deviations from the null hypothesis. The Manhattan plot highlighted markers significantly associated with the trait. The initial threshold for significant association was considered as P < 1e−04. To identify true-positive association of the markers and the phenotype, an FDR for each marker was calculated with the Benjamini and Hochberg approach [121, 122]. In this approach, all P-values from the GWAS were ranked in ascending order, with each P-value assigned a rank (R, e.g. 1, 2, 3, . . ., K, where K is the total number of markers). The FDR-adjusted P-value for each marker was calculated as: P-value (FDR) =
[123], where pR refers to the rank of marker P-value and K is the total number of markers. The FDR-adjusted P-values were computed using the p.adjust function in R [124], and SNPs with FDR <0.05 were considered statistically significant. This threshold is widely used in plant GWAS studies aiming to identify candidate loci for further genetic and molecular analyses [125]. The allele effects of each significant SNP on the phenotype were visualized using the ggbetweenstats function from the ggstasplot package in R [126]. Pairwise tests were conducted to identify the level of significance between the different alleles.
Candidate gene detection
Putative candidate genes were identified using the latest whole-genome sequence of M. integrifolia [95] available on the NCBI database (www.ncbi.nlm.nih.gov/datasets/genome/?taxon=60698). Since the locations of most SNP markers were known, candidate genes within 5000 bp upstream and downstream of the SNP positions were detected. For SNPs with unknown positions, the allele sequences of these SNPs were blasted (https://blast.ncbi.nlm.nih.gov/Blast.cgi) against the NCBI nucleotide database for macadamia (SCU_Mint_v3 GenBank assembly [GCF_013358625.1]) to determine putative genes associated with the markers.
Consent for publication
All authors approve the manuscript and consent to publication of the work.
Acknowledgement
This work has been supported by the National Macadamia Breeding and Evaluation Program (MC19000, MC14000), funded by Hort Innovation Australia, using the Macadamia research and development levy and contributions from the Australian Government. Hort Innovation is the grower-owned, not-for-profit research and development corporation for Australian horticulture. Research funding was also provided by Queensland Government-supported Advance Queensland Industry Research Mid-Career Fellowship grant (AQIRF073-2022RD5; RM 2022002724). The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript. The funders had no role in the conceptualization, design, data collection, analysis, decision to publish, or preparation of the manuscript. The University of Queensland provided a higher degree research scholarship to P.D.P.
Supplementary Material
Contributor Information
Pragya Dhakal Poudel, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD 4072, Australia.
Joanne-De Faveri, AV Data Analytics, Adelaide, SA 5153, Australia.
Bruce Topp, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD 4072, Australia.
Mobashwer Alam, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD 4072, Australia.
Author contributions
B.T. and M.A. acquired the funding and resources. P.D.P. wrote the paper, performed analyses, and made final edits. P.D.P., J.D.F., and M.A. developed the analytical models. J.D.F., B.T., and M.A. assisted in interpretation of results and revised the manuscript. All authors read and approved the final manuscript.
Data availability
The datasets used and/or analysed during the current study are available from the corresponding author/s on reasonable request.
Conflict of interest statement
The authors declare that they have no competing interests.
Supplementary data
Supplementary data is available at Hortre Journal online.
References
- 1. Topp BL, Nock CJ, Hardner CM. et al. Macadamia (Macadamia spp.) breeding. In: Al-Khayri JM, Jain SM, Johnson DV, eds. Advances in Plant Breeding Strategies: Nut and Beverage Crops. Springer: Cham, 2019,221–51 [Google Scholar]
- 2. Hardner C, Costa e Silva J, Williams E. et al. Breeding new cultivars for the Australian macadamia industry. HortScience. 2019;54:621–8 [Google Scholar]
- 3. Poudel P, Cowan M, Shaw L. et al. Macadamia breeding for reduced plant vigor: progress and prospects for profitable and sustainable orchard systems. Sustainability. 2023;15:14506 [Google Scholar]
- 4. Topp B, Hardner C, Kelly A. Strategies for breeding macadamias in Australia. Acta Hortic. 2010;935:47–53 [Google Scholar]
- 5. Huett D. Macadamia physiology review: a canopy light response study and literature review. Aust J Agric Res. 2004;55:609–24 [Google Scholar]
- 6. McFadyen L, Morris S, McConchie C. et al. Effect of hedging and tree removal on productivity of crowding macadamia orchards. Aust J Exp Agric. 2005;45:725–30 [Google Scholar]
- 7. Alam M, Howell E, Hardner C. et al. Variation in precocity in a macadamia breeding population. In: International Symposia on Tropical and Temperate Horticulture-ISTTH2016 1205. Cairns, Australia, 2016, [Google Scholar]
- 8. O’Connor K, Hayes B, Topp B. Prospects for increasing yield in macadamia using component traits and genomics. Tree Genet Genomes. 2018;14:1–14 [Google Scholar]
- 9. Thorp G. Macadamia - Propagation and Precocity (MC13014). 2019:
- 10. Thorp G, Smith A. Advancing the timing of first commercial harvest using trunk girdling. AMS News Bulletin, Spring. 2018;2018:75–7 [Google Scholar]
- 11. Poudel PD, Cowan M, De Faveri J. et al. Genetic variability in macadamia rootstocks and their effects on early vigor traits of ‘HAES 741′. Sci Hortic. 2024;338:113842 [Google Scholar]
- 12. Vukasovic S, Alahmad S, Christopher J. et al. Dissecting the genetics of early vigour to design drought-adapted wheat. Front Plant Sci. 2021;12:754439 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Condon AG, Richards R, Rebetzke G. et al. Breeding for high water-use efficiency. J Exp Bot. 2004;55:2447–60 [DOI] [PubMed] [Google Scholar]
- 14. Mahmood T, Iqbal MS, Li H. et al. Differential seedling growth and tolerance indices reflect drought tolerance in cotton. BMC Plant Biol. 2022;22:1–11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Srivastava R, Kobayashi Y, Koyama H. et al. Overexpression of cowpea NAC transcription factors promoted growth and stress tolerance by boosting photosynthetic activity in Arabidopsis. Plant Sci. 2022;319:111251 [DOI] [PubMed] [Google Scholar]
- 16. McDowell N, Pockman WT, Allen CD. et al. Mechanisms of plant survival and mortality during drought: why do some plants survive while others succumb to drought? New Phytol. 2008;178:719–39 [DOI] [PubMed] [Google Scholar]
- 17. Thompson T, Grauke L. Pecan tree growth and precocity. J Am Soc Hortic Sci. 2003;128:63–6 [Google Scholar]
- 18. Padi FK, Opoku SY, Adomako B. et al. Effectiveness of juvenile tree growth rate as an index for selecting high yielding cocoa families. Sci Hortic. 2012;139:14–20 [Google Scholar]
- 19. Pritsa T, Voyiatzis D, Voyiatzi C. et al. Evaluation of vegetative growth traits and their relation to time to first flowering of olive seedlings. Aust J Agric Res. 2003;54:371–6 [Google Scholar]
- 20. Visser T. Juvenile phase and growth of apple and pear seedlings. Euphytica. 1964;13:119–29 [Google Scholar]
- 21. Toft B, Alam MM, Wilkie JD. et al. Phenotypic association of multi-scale architectural traits with canopy volume and yield: moving toward high-density systems for macadamia. HortScience. 2019;54:596–602 [Google Scholar]
- 22. Lawes G, Spence C, Tustin D. et al. Tree quality and canopy management effects on the growth and floral precocity of young ‘Doyenne du Cornice’ pear trees. N Z J Crop Hortic Sci. 1997;25:177–84 [Google Scholar]
- 23. Hammami SB, León L, Rapoport HF. et al. A new approach for early selection of short juvenile period in olive progenies. Sci Hortic. 2021;281:109993 [Google Scholar]
- 24. Xiao J, Li J, Yuan L. et al. Identification of QTLs affecting traits of agronomic importance in a recombinant inbred population derived from a subspecific rice cross. Theor Appl Genet. 1996;92:230–44 [DOI] [PubMed] [Google Scholar]
- 25. Iwata H, Minamikawa MF, Kajiya-Kanegae H. et al. Genomics-assisted breeding in fruit trees. Breed Sci. 2016;66:100–15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Shook JM, Zhang J, Jones SE. et al. Meta-GWAS for quantitative trait loci identification in soybean. G3. 2021;11:jkab117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Coser SM, Chowda Reddy R, Zhang J. et al. Genetic architecture of charcoal rot (Macrophomina phaseolina) resistance in soybean revealed using a diverse panel. Front Plant Sci. 2017;8:1626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Evangelou E, Ioannidis JP. Meta-analysis methods for genome-wide association studies and beyond. Nat Rev Genet. 2013;14:379–89 [DOI] [PubMed] [Google Scholar]
- 29. Joukhadar R, Thistlethwaite R, Trethowan R. et al. Meta-analysis of genome-wide association studies reveal common loci controlling agronomic and quality traits in a wide range of normal and heat stressed environments. Theor Appl Genet. 2021;134:2113–27 [DOI] [PubMed] [Google Scholar]
- 30. Shariatipour N, Heidari B, Richards CM. Meta-analysis of QTLome for grain zinc and iron contents in wheat (Triticum aestivum L.). Euphytica. 2021a;217:86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Shariatipour N, Heidari B, Tahmasebi A. et al. Comparative genomic analysis of quantitative trait loci associated with micronutrient contents, grain quality, and agronomic traits in wheat (Triticum aestivum L.). Front Plant Sci. 2021b;12:709817 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Nunn J, De Faveri J, O’Connor K. et al. Genome-wide association study for abscission failure of fruit pericarps (stick-tights) in wild macadamia germplasm. Agronomy. 2022;12:1913 [Google Scholar]
- 33. O’Connor K, Hayes B, Hardner C. et al. Selecting for nut characteristics in macadamia using a genome-wide association study. HortScience. 2019a;54:629–32 [Google Scholar]
- 34. O’Connor K, Hayes B, Hardner C. et al. Genome-wide association studies for yield component traits in a macadamia breeding population. BMC Genomics. 2020;21:1–12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Booker J, Sieberer T, Wright W. et al. MAX1 encodes a cytochrome P450 family member that acts downstream of MAX3/4 to produce a carotenoid-derived branch-inhibiting hormone. Dev Cell. 2005;8:443–9 [DOI] [PubMed] [Google Scholar]
- 36. Balasubramanian V, Vashisht D, Cletus J. et al. Plant β-1, 3-glucanases: their biological functions and transgenic expression against phytopathogenic fungi. Biotechnol Lett. 2012;34:1983–90 [DOI] [PubMed] [Google Scholar]
- 37. Doxey AC, Yaish MW, Moffatt BA. et al. Functional divergence in the Arabidopsis β-1, 3-glucanase gene family inferred by phylogenetic reconstruction of expression states. Mol Biol Evol. 2007;24:1045–55 [DOI] [PubMed] [Google Scholar]
- 38. Ruduś I, Sasiak M, Kępczyński J. Regulation of ethylene biosynthesis at the level of 1-aminocyclopropane-1-carboxylate oxidase (ACO) gene. Acta Physiol Plant. 2013;35:295–307 [Google Scholar]
- 39. Chen P, Jäger G, Zheng B. Transfer RNA modifications and genes for modifying enzymes in Arabidopsis thaliana. BMC Plant Biol. 2010;10:1–19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Guo Q, Ng PQ, Shi S. et al. Arabidopsis TRM5 encodes a nuclear-localised bifunctional tRNA guanine and inosine-N1-methyltransferase that is important for growth. PLoS One. 2019;14:e0225064 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Lawit SJ, O’Grady K, Gurley WB. et al. Yeast two-hybrid map of Arabidopsis TFIID. Plant Mol Biol. 2007;64:73–87 [DOI] [PubMed] [Google Scholar]
- 42. Zheng K, Wang L, Zeng L. et al. The effect of RNA polymerase V on 24-nt siRNA accumulation depends on DNA methylation contexts and histone modifications in rice. Proc Natl Acad Sci. 2021;118:e2100709118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Saidou M, Zhang Z. The L-type lectin-like receptor kinase gene TaLecRK-IV. 1 regulates the plant height in wheat. Int J Mol Sci. 2022;23:8208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Singh P, Kuo Y-C, Mishra S. et al. The lectin receptor kinase-VI. 2 is required for priming and positively regulates Arabidopsis pattern-triggered immunity. Plant Cell. 2012;24:1256–70 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Mohr TJ, Mammarella ND, Hoff T. et al. The Arabidopsis downy mildew resistance gene RPP8 is induced by pathogens and salicylic acid and is regulated by W box cis elements. Mol Plant-Microbe Interact. 2010;23:1303–15 [DOI] [PubMed] [Google Scholar]
- 46. Bernard A, Marrano A, Donkpegan A. et al. Association and linkage mapping to unravel genetic architecture of phenological traits and lateral bearing in Persian walnut (Juglans regia L.). BMC Genomics. 2020;21:1–25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Eltaher S, Li J, Freeman B. et al. A genome-wide association study identified SNP markers and candidate genes associated with morphometric fruit quality traits in mangoes. BMC Genomics. 2025;26:120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Minamikawa MF, Takada N, Terakami S. et al. Genome-wide association study and genomic prediction using parental and breeding populations of Japanese pear (Pyrus pyrifolia Nakai). Sci Rep. 2018;8:1–12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Noh J, Do YS, Kim GH. et al. A genome-wide association study for the detection of genes related to apple Marssonina blotch disease resistance in apples. Sci Hortic. 2020;262:108986 [Google Scholar]
- 50. Zhang M-Y, Xue C, Hu H. et al. Genome-wide association studies provide insights into the genetic determination of fruit traits of pear. Nat Commun. 2021;12:1144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Luby JJ, Shaw DV. Does marker-assisted selection make dollars and sense in a fruit breeding program? HortScience. 2001;36:872–9 [Google Scholar]
- 52. Ru S, Main D, Evans K. et al. Current applications, challenges, and perspectives of marker-assisted seedling selection in Rosaceae tree fruit breeding. Tree Genet Genomes. 2015;11:1–12 [Google Scholar]
- 53. Wannemuehler SD, Luby JJ, Yue C. et al. A cost–benefit analysis of DNA informed apple breeding. HortScience. 2019;54:1998–2004 [Google Scholar]
- 54. Korte A, Farlow A. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods. 2013;9:29–9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Cavalcanti JJV, de Resende MDV, Crisóstomo JR. et al. Genetic control of quantitative traits and hybrid breeding strategies for cashew improvement. Crop Breeding and Applied Biotechnology. 2007;7:186–95 [Google Scholar]
- 56. Ramteke V, Paikra M, Netam R. et al. Genetic variability, trait association, and path analysis studies for nut yield and yield-related traits in cashew (Anacardium occidentale L.). Journal of Agricultural Sci Technol. 2024;26:403–14 [Google Scholar]
- 57. Rezaee R, Vahdati K, Valizadeh M. Variability of seedling vigour in Persian walnut as influenced by the vigour and bearing habit of the mother tree. J Hortic Sci Biotechnol. 2009;84:228–32 [Google Scholar]
- 58. Wang X, Stein L, Black M. et al. Phenotypic diversity and population structure of pecan (Carya illinoinensis) collections reveals geographic patterns. Sci Rep. 2024a;14:18592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Toft BD, Alam M, Topp B. Estimating genetic parameters of architectural and reproductive traits in young macadamia cultivars. Tree Genet Genomes. 2018;14:1–10 [Google Scholar]
- 60. Falconer DS. Introduction to Quantitative Genetics. Noida, Utter Pradesh, India: Pearson Education India, 1996. [Google Scholar]
- 61. da Costa e Silva JO, Bruckner CH, Carneiro PCS. et al. Estimates of genetic parameters, genetic variability, and selection in the S 1 generation of peach. Ciência Rural. 2020;50:e20190976 [Google Scholar]
- 62. Adu-Gyamfi PK, Dadzie MA, Barnor M. et al. Genetic variability and trait association studies in cashew (Anacardium occidentale L.). Sci Hortic. 2019;255:108–14 [Google Scholar]
- 63. Marsh H, Paterson T, Seal A. et al. Heritability estimates in kiwifruit. Acta Hortic. 2002;62:221–29 [Google Scholar]
- 64. Thompson T, Baker J. Heritability and phenotypic correlations of six pecan nut characteristics. J Am Soc Hortic Sci. 1993;118:415–8 [Google Scholar]
- 65. Hardner CM, Winks CW, Stephenson RA. et al. Genetic parameters for yield in macadamia. Euphytica. 2002;125:255–64 [Google Scholar]
- 66. Adu-Gyamfi PK, Barnnor M, Akpertey A. et al. Genetic variability and combining abilities for earliness to nut yield and nut weight in selected cashew (Anacardium occidentale L.) clones. International Journal of Fruit Science. 2022;22:539–50 [Google Scholar]
- 67. Dadzie A, Adu-Gyamfi P, Akpertey A. et al. Assessment of juvenile growth and yield relationship among dwarf cashew types in Ghana. J Agric Sci. 2020;12:116 [Google Scholar]
- 68. Siddique K, Tennant D, Perry M. et al. Water use and water use efficiency of old and modern wheat cultivars in a Mediterranean-type environment. Aust J Agric Res. 1990;41:431–47 [Google Scholar]
- 69. ter Steege MW, den Ouden FM, Lambers H. et al. Genetic and physiological architecture of early vigor in Aegilops tauschii, the D-genome donor of hexaploid wheat. A quantitative trait loci analysis. Plant Physiol. 2005;139:1078–94 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. León L, de la Rosa R, Barranco D. et al. Breeding for early bearing in olive. HortScience. 2007;42:499–502 [Google Scholar]
- 71. Baghel M, Nagaraja A, Srivastav M. et al. Pleiotropic influences of brassinosteroids on fruit crops: a review. Plant Growth Regul. 2019;87:375–88 [Google Scholar]
- 72. Roshyara NR, Kirsten H, Horn K. et al. Impact of pre-imputation SNP-filtering on genotype imputation results. BMC Genet. 2014;15:1–11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Donkpegan AS, Bernard A, Barreneche T. et al. Genome-wide association mapping in a sweet cherry germplasm collection (Prunus avium L.) reveals candidate genes for fruit quality traits. Horticulture research. 2023;10:191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Holušová K, Čmejlová J, Suran P. et al. High-resolution genome-wide association study of a large Czech collection of sweet cherry (Prunus avium L.) on fruit maturity and quality traits. Horticulture Research. 2023;10:uhac233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Fan C, Xing Y, Mao H. et al. GS3, a major QTL for grain length and weight and minor QTL for grain width and thickness in rice, encodes a putative transmembrane protein. Theor Appl Genet. 2006;112:1164–71 [DOI] [PubMed] [Google Scholar]
- 76. Xing A, Gao Y, Ye L. et al. A rare SNP mutation in Brachytic2 moderately reduces plant height and increases yield potential in maize. J Exp Bot. 2015;66:3791–802 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Chen W, Coombes BJ, Larson NB. Recent advances and challenges of rare variant association analysis in the biobank sequencing era. Front Genet. 2022;13:1014947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Thomson MJ, Biswas S, Tsakirpaloglou N. et al. Functional allele validation by gene editing to leverage the wealth of genetic resources for crop improvement. Int J Mol Sci. 2022;23:6565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Alseekh S, Kostova D, Bulut M. et al. Genome-wide association studies: assessing trait characteristics in model and crop plants. Cell Mol Life Sci. 2021;78:5743–54 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. de los Cobos FP, Coindre E, Dlalah N. et al. Almond population genomics and non-additive GWAS reveal new insights into almond dissemination history and candidate genes for nut traits and blooming time. Horticulture research. 2023;10:uhad193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81. Wang Y, Wu Y, Wang X. et al. Genome wide association analysis identifies candidate genes for fruit quality and yield in Actinidia eriantha. J Integr Agric. 2024b;23:1929–39 [Google Scholar]
- 82. Hong Y, Zhang M, Zhu J. et al. Genome-wide association studies reveal novel loci for grain size in two-rowed barley (Hordeum vulgare L.). Theor Appl Genet. 2024;137:58. [DOI] [PubMed] [Google Scholar]
- 83. Salami M, Heidari B, Alizadeh B. et al. Dissection of quantitative trait nucleotides and candidate genes associated with agronomic and yield-related traits under drought stress in rapeseed varieties: integration of genome-wide association study and transcriptomic analysis. Front Plant Sci. 2024a;15:1342359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. Salami M, Heidari B, Batley J. et al. Integration of genome-wide association studies, metabolomics, and transcriptomics reveals phenolic acid-and flavonoid-associated genes and their regulatory elements under drought stress in rapeseed flowers. Front Plant Sci. 2024b;14:1249142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85. Albert E, Sauvage C. Identification and validation of candidate genes from genome-wide association studies. In: Torkamaneh D, Belzile F (eds), Genome-Wide Association Studies. Springer: New York, NY, 2022,249–72 [DOI] [PubMed] [Google Scholar]
- 86. Lee H-Y, Ro N-Y, Patil A. et al. Uncovering candidate genes controlling major fruit-related traits in pepper via genotype-by-sequencing based QTL mapping and genome-wide association study. Front Plant Sci. 2020;11:1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87. Wen J, Chen J, Sun Y. et al. Association mapping of major economic traits and exploration of elite alleles in Prunus sibirica. Euphytica. 2023;219:39 [Google Scholar]
- 88. Grossnickle SC. Importance of root growth in overcoming planting stress. New For. 2005;30:273–94 [Google Scholar]
- 89. Bükücü ŞB, Sütyemez M, Kefayati S. et al. Major QTL with pleiotropic effects controlling time of leaf budburst and flowering-related traits in walnut (Juglans regia L.). Sci Rep. 2020;10:15207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Wagner GP, Kenney-Hunt JP, Pavlicev M. et al. Pleiotropic scaling of gene effects and the ‘cost of complexity’. Nature. 2008;452:470–2 [DOI] [PubMed] [Google Scholar]
- 91. André D, Marcon A, Lee KC. et al. FLOWERING LOCUS T paralogs control the annual growth cycle in Populus trees. Curr Biol. 2022;32:2988-2996. e2984. [DOI] [PubMed] [Google Scholar]
- 92. Hsu C-Y, Adams JP, Kim H. et al. FLOWERING LOCUS T duplication coordinates reproductive and vegetative growth in perennial poplar. Proc Natl Acad Sci. 2011;108:10756–61 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93. Palmer W, Jacygrad E, Sagayaradj S. et al. Genome assembly and association tests identify interacting loci associated with vigor, precocity, and sex in interspecific pistachio rootstocks. G3. 2023;13:jkac317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94. Parent B, Shahinnia F, Maphosa L. et al. Combining field performance with controlled environment plant imaging to identify the genetic control of growth and transpiration underlying yield response to water-deficit stress in wheat. J Exp Bot. 2015;66:5481–92 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95. Nock CJ, Baten A, Mauleon R. et al. Chromosome-scale assembly and annotation of the macadamia genome (Macadamia integrifolia HAES 741). G3: genes, genomes. Genetics. 2020;10:3497–504 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96. Zhang X, Mi Y, Mao H. et al. Genetic variation in ZmTIP1 contributes to root hair elongation and drought tolerance in maize. Plant Biotechnol J. 2020;18:1271–83 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97. Ramos O, Malcata F. 3.48—food-grade enzymes. In: Comprehensive Biotechnology, 2nd ed.; Moo-Young, M., Ed. 2011,555–69
- 98. Gisel A, Hempel FD, Barella S. et al. Leaf-to-shoot apex movement of symplastic tracer is restricted coincident with flowering in Arabidopsis. Proc Natl Acad Sci. 2002;99:1713–7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99. Guseman JM, Lee JS, Bogenschutz NL. et al. Dysregulation of cell-to-cell connectivity and stomatal patterning by loss-of-function mutation in Arabidopsis chorus (glucan synthase-like 8). Development. 2010;137:1731–41 [DOI] [PubMed] [Google Scholar]
- 100. Wei H, Movahedi A, Liu G. et al. Comprehensive analysis of carotenoid cleavage dioxygenases gene family and its expression in response to abiotic stress in poplar. Int J Mol Sci. 2022;23:1418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101. Houben M, Van de Poel B. 1-Aminocyclopropane-1-carboxylic acid oxidase (ACO): the enzyme that makes the plant hormone ethylene. Front Plant Sci. 2019;10:695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102. Wei H, Xue Y, Chen P. et al. Genome-wide identification and functional investigation of 1-aminocyclopropane-1-carboxylic acid oxidase (ACO) genes in cotton. Plants. 2021;10:1699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103. Hardner C. Macadamia Improvement and Conservation (MC02054). North Sydney, Australia: Horticulture Innovation, 2008 [Google Scholar]
- 104. Topp B, Hardner C, Neal J. et al. Overview of the Australian macadamia industry breeding program. Acta Hortic 2016;1127:45–50 [Google Scholar]
- 105. O'Hare P, Quinlan K, Stephenson R. et al. Macadamia information kit. Agrilink, your growing guide to better farming guide. In: In. Queensland Horticulture Institute. Agrilink Series Q103052. Department of Primary Industries: Brisbane, Queensland, 2004, [Google Scholar]
- 106. O’Connor KM, Hayes BJ, Hardner CM. et al. Genomic selection and genetic gain for nut yield in an Australian macadamia breeding population. BMC Genomics. 2021;22:1–12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107. Butler D, Cullis B, Gilmour A. et al. ASReml-R reference manual version 4. VSN International Ltd, Hemel Hempstead, HP1 1ES, UK. 2017.
- 108. Team , R. C. (2013). A Language and Environment for Statistical Computing. Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
- 109. Luo Z. Computing inbreeding coefficients in large populations. Genet Sel Evol. 1992;24:305–13 [Google Scholar]
- 110. Gilmour AR, Cullis BR, Verbyla AP. et al. Accounting for natural and extraneous variation in the analysis of field experiments. J Agric Biol Environ Stat. 1997;2:269–93 [Google Scholar]
- 111. Hardner CM, Bally IS, Wright CL. Prediction of breeding values for average fruit weight in mango using a multivariate individual mixed model. Euphytica. 2012;186:463–77 [Google Scholar]
- 112. Meyer K. Factor-analytic models for genotype× environment type problems and structured covariance matrices. Genet Sel Evol. 2009;41:1–11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113. Smith A, Cullis B, Thompson R. Analyzing variety by environment data using multiplicative mixed models and adjustments for spatial field trend. Biometrics. 2001;57:1138–47 [DOI] [PubMed] [Google Scholar]
- 114. Butler D, Cullis B, Gilmour A. et al. ASReml-R Reference Manual: ASReml Estimates Variance Components under a General Linear Mixed Model by Residual Maximum Likelihood (REML). Australia: Queensland Department of Primary Industries; 2009 [Google Scholar]
- 115. Schloerke B, Cook D, Larmarange J. et al. GGally: extension to'ggplot2'. R package version 2.1. 2. Jamovi project. 2021
- 116. O’Connor K, Kilian A, Hayes B. et al. Population structure, genetic diversity and linkage disequilibrium in a macadamia breeding population using SNP and silicoDArT markers. Tree Genet Genomes. 2019b;15:1–1630546292 [Google Scholar]
- 117. Alam M, Neal J, O’Connor K. et al. Ultra-high-throughput DArTseq-based silicoDArT and SNP markers for genomic studies in macadamia. PLoS One. 2018;13:e0203465 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118. Bradbury PJ, Zhang Z, Kroon DE. et al. TASSEL: software for association mapping of complex traits in diverse samples. bioinformatics. 2007;23:2633–5 [DOI] [PubMed] [Google Scholar]
- 119. Liu L, Zhang D, Liu H. et al. Robust methods for population stratification in genome wide association studies. BMC bioinformatics. 2013;14:1–12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120. Endelman JB, Jannink J-L. Shrinkage estimation of the realized relationship matrix. G3: Genes| genomes| genetics. 2012;2:1405–13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57:289–300 [Google Scholar]
- 122. Brzyski D, Peterson CB, Sobczyk P. et al. Controlling the rate of GWAS false discoveries. Genetics. 2017;205:61–75 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123. Alqudah AM, Sallam A, Baenziger PS. et al. GWAS: fast-forwarding gene identification and characterization in temperate cereals: lessons from barley–a review. J Adv Res. 2020;22:119–35 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124. Gogarten SM, Bhangale T, Conomos MP. et al. GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies. bioinformatics. 2012;28:3329–31 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125. Alqudah AM, Youssef HM, Graner A. et al. Natural variation and genetic make-up of leaf blade area in spring barley. Theor Appl Genet. 2018;131:873–86 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126. Patil I. Visualizations with statistical details: the'ggstatsplot'approach. Journal of Open Source Software. 2021;6:3167 [Google Scholar]
- 127. Zimmerman R. H.. Juvenility and flowering in woody plants: a review. Hortsci. 1972;7:447–445 [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets used and/or analysed during the current study are available from the corresponding author/s on reasonable request.




