Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Jun 22;15(6):e0234458. doi: 10.1371/journal.pone.0234458

The genetic profile of elite youth soccer players and its association with power and speed depends on maturity status

Conall F Murtagh 1, Thomas E Brownlee 1, Edgardo Rienzi 2, Sebastian Roquero 2, Sacha Moreno 2, Gustavo Huertas 2, Giovani Lugioratto 3, Philipp Baumert 1, Daniel C Turner 1, Dongsun Lee 1, Peter Dickinson 1, K Amber Lyon 1, Bahare Sheikhsaraf 1, Betül Biyik 1, Andrew O’Boyle 1,4, Ryland Morgans 5, Andrew Massey 6, Barry Drust 1, Robert M Erskine 1,7,*
Editor: Luca Paolo Ardigò8
PMCID: PMC7307776  PMID: 32569264

Abstract

We investigated the association of multiple single nucleotide polymorphisms (SNPs) with athlete status and power/speed performance in elite male youth soccer players (ESP) and control participants (CON) at different stages of maturity. ESP (n = 535; aged 8–23 years) and CON (n = 151; aged 9–26 years) were genotyped for 10 SNPs and grouped according to years from predicted peak-height-velocity (PHV), i.e. pre- or post-PHV, to determine maturity status. Participants performed bilateral vertical countermovement jumps, bilateral horizontal-forward countermovement jumps, 20m sprints and modified 505-agility tests. Compared to CON, pre-PHV ESP demonstrated a higher ACTN3 (rs1815739) XX (‘endurance’) genotype frequency distribution, while post-PHV ESP revealed a higher frequency distribution of the PPARA (rs4253778) C-allele, AGT (rs699) GG genotype and NOS3 (rs2070744) T-allele (‘power’ genotypes/alleles). BDNF (rs6265) CC, COL5A1 (rs12722) CC and NOS3 TT homozygotes sprinted quicker than A-allele carriers, CT heterozygotes and CC homozygotes, respectively. COL2A1 (rs2070739) CC and AMPD1 (rs17602729) GG homozygotes sprinted faster than their respective minor allele carrier counterparts in CON and pre-PHV ESP, respectively. BDNF CC homozygotes jumped further than T-allele carriers, while ESP COL5A1 CC homozygotes jumped higher than TT homozygotes. To conclude, we have shown for the first time that pre- and post-PHV ESP have distinct genetic profiles, with pre-PHV ESP more suited for endurance, and post-PHV ESP for power and speed (the latter phenotypes being crucial attributes for post-PHV ESP). We have also demonstrated that power, acceleration and sprint performance were associated with five SNPs, both individually and in combination, possibly by influencing muscle size and neuromuscular activation.

Introduction

Human physical performance is influenced by a number of environmental and genetic factors [1]. The physical determinants of elite youth soccer playing status are specific to the stage of maturity [2], with power being more important in physically mature- vs. immature elite youth soccer players (ESP) [2]. However, it is not known if genetic variation can account for the differences in speed and power between ESP and recreationally active controls (CON) at different stages of maturity [2]. Such information could help inform genetic screening criteria in ESP, which could help practitioners identify talented pre-pubescent players, who are more likely to continue to succeed at an elite level when physically mature. Furthermore, talented pre-pubescent players found to have a less favourable genetic profile for post-pubescent physical performance could undergo personalised training to improve certain performance characteristics.

The majority of previous genetics research in soccer comprises case control studies in mainly senior players [3,4]. However, such studies are of limited use to the applied practitioner, as unlike cross-sectional genotype-phenotype association studies, they do not reveal the association between specific genetic polymorphisms and physical performance phenotypes. There is a paucity of cross-sectional genotype-phenotype studies in soccer [57] and, importantly, the contribution of genetic variation to power, speed and agility performance in ESP remains unknown.

Over 69 genetic markers have been associated with power athlete status [8] and the genetic association with power and speed is likely due to specific gene variants influencing protein abundance/expression, which will in turn affect the physiological determinants of speed and power. Considering this, patellar tendon compliance is a key physiological determinant of horizontal-forward jump performance in under 18 year-old (U18) and U21 ESP [9], and has previously been associated with sprint performance [10]. The COL5A1 (rs12722) single nucleotide polymorphism (SNP) has been associated with the extensibility of the tendon-aponeurosis structures of the knee extensors in some [11], but not all [12] studies. Other collagen SNPs, such as COL1A1 (rs2249492) and COL2A1 (rs2070739), which are variants of the genes encoding the alpha 1 chain of procollagen type I and II, respectively, may also influence tendon properties and therefore power performance, but this has yet to be investigated.

In contrast to horizontal-forward jump performance, one of the main physiological determinants of vertical jump performance in soccer players is quadriceps femoris muscle volume [13]. The NOS3 (rs2070744) T [14] and AGT (rs699) G [15,16] alleles have both been associated with elite power athlete status and are thought to exert their favourable effect on power performance by promoting skeletal muscle hypertrophy [14,16]. However, these SNPs have not been investigated in association with power/speed performance in ESP.

In addition to muscle volume, muscle fibre-type composition can also influence peak power output, with type II fibres being larger (therefore producing more force) and able to shorten quicker, thus able to generate more power, than type I fibres [17]. The ACTN3 (rs1815739) R-allele and PPARA (rs4253778) C-allele have previously been associated with ESP status [3,4] and a greater composition of type II skeletal muscle fibres [18] but evidence linking the ACTN3 SNP with power/speed in professional soccer players is equivocal [5,6]. Therefore, it is still unclear whether these SNPs influence power/speed performance in ESP at different stages of maturity.

Maximal power during sprinting and jumping is not only governed by muscle-tendon properties, but also neuromuscular activation [13,19]. Brain derived neurotrophic factor (BDNF) is a neurotrophin that regulates neuronal survival, growth, maintenance, neurogenesis and synaptic plasticity, and the BDNF (rs6265) C>T SNP has been associated with BDNF serum concentration in response to exercise [20]. It is possible that this SNP might influence neural adaptation and, therefore, the ability to activate the relevant muscles and generate more power during jump and sprint performance. However, this has yet to be investigated in ESP.

Furthermore, the AMPD1 (rs17602729) GG genotype has been associated with elite power athlete status [21] but has never been investigated in ESP. The VDR (rs2228570) GG genotype, on the other hand, has been associated with average- to high-level youth soccer playing status but was not associated with vertical jump, 10 m acceleration or 20 m sprint performance in adolescent soccer players [7]. However, this study was limited by a relatively low sample size (n = 125) and did not account for the confounding factor of maturity status. It therefore remains unknown if the VDR (rs2228570) A>G SNP is associated with power/sprint performance in ESP at different stages of maturity.

We aimed to investigate the association of 10 candidate SNPs with ESP playing status at different stages of maturity, and the individual and combined association of those SNPs with power/speed performance in ESP and CON according to maturity status. We hypothesised that the physically mature players would have a more power-orientated genetic profile than the younger players, and that individuals possessing multiple ‘power’ genotypes would perform better in the power/speed assessments within their respective maturity groups.

Materials and methods

All procedures performed in this study involving human participants were approved by Liverpool John Moores University Research Ethics Committee (protocol number: P13/SPS/033) and were in accordance with the 1964 Helsinki Declaration and its later amendments. Written informed consent was obtained from all individual study participants (and parents/guardians if participants <18 years old).

Participants

Six hundred and eighty-six males volunteered to take part in this study and formed two cohorts: 535 ESP and 151 CON. The ESP (U9 to U23 level) were considered elite if they had competed regularly (played over 12 competitive games in the previous 6 months) for a Category One (highest national category) soccer academy in England, or one of two Category A (highest national category) soccer academies in Uruguay. The age-matched CON were young, healthy males, who had not previously played soccer at academy or professional level. All participants were free from any lower-limb injury to the lower body in the three months prior to the study and had not previously sustained a serious knee or ankle injury that may affect performance during the study. Ethnic background was reported for all ESP (white, 79%; black, 5%; black/white, 16%; and Asian, 1%) and a sub-sample (n = 58) of CON (white, 93%; black, 2%; mixed black/white, 3%; and Asian, 2%). Participant characteristics are displayed in Table 1.

Table 1. Participant characteristics for the elite soccer player (ESP) and control (CON) cohorts, according to maturity status, i.e. pre-, mid- or post-peak height velocity (PHV).

Cohort Maturity group (n) Age (years) Height (m) Body mass (kg)
ESP Pre-PHV (124) 10.6 ± 1.4 1.44 ± 0.09 35.7 ± 6.0
Mid-PHV (43) 14.1 ± 0.8 1.64 ± 0.06 51.8 ± 7.5
Post-PHV (368) 16.8 ± 2.3 1.75 ± 0.12 69.1 ± 10.7
CON Pre-PHV (44) 11.2 ± 1.3 1.45 ± 0.08 37.5 ± 5.8
Mid-PHV (30) 13.9 ± 0.5 1.62 ± 0.06 52.0 ± 7.3
Post-PHV (77) 17.5 ± 3.2 1.75 ± 0.06 68.7 ± 9.4

Experimental design

Allele/genotype frequency distribution for 10 SNPs was compared between all ESP and all CON at different stages of maturity. A list of the SNPs, their associated genes, proteins and functions are presented in Table 2. Of the 686 participants, 303–602 performed power/speed assessments (depending on the assessment, exact sample sizes are stipulated in the relevant sections, below). Power assessments included a bilateral horizontal countermovement jump (BH CMJ) and a bilateral vertical countermovement jump (BV CMJ). Speed assessments included acceleration (10 m straight line sprint), sprint (20 m straight line sprint) and agility (modified 505 agility test) performance tests. We investigated associations between the 10 different SNPs and power/speed performance in ESP and CON, according to maturity status. In a sub-sample of 42 post-PHV participants (ESP, n = 22; CON, n = 20), quadriceps femoris (QF) muscle physiological characteristics were determined. These included knee extension maximum voluntary contraction (MVC) moment, muscle volume, muscle physiological cross-sectional area (PCSA) and vastus lateralis muscle fascicle pennation angle. All tests were performed during the in-season period and testing sessions were scheduled >48 h after competition or a high intensity training session to minimize the influence of prior exercise.

Table 2. Information pertaining to the 10 single nucleotide polymorphisms (SNPs) investigated in the current study.

The location of each SNP is identified according to Genome Reference Consortium Human Build 38 patch release 12 (GRCh38.p12) and alleles for each SNP are reported according to the forward DNA strand.

Gene Encoded protein Protein function Alleles and rs number Location MAF SNP Type SNP function
ACTN3 α-actinin-3 Binds actin to the Z-line in type II SkM fibres [22]; blocks calcineurin, inhibiting slow myogenic programme [23] C>T rs1815739 chr11:66560624 T: 0.434 Transition substitution, nonsense mutation, intragenic T-allele leads to a stop codon at amino acid 577, preventing protein production [24]
AGT Angiotensinogen Precursor of angiotensin II, which modulates SkM hypertrophy in response to mechanical loading [25] A>G rs699 chr1:230710048 G: 0.412 Transition substitution, missense mutation, intragenic G-allele associated with higher plasma AGT levels and higher blood pressure [26]
AMPD1 Adenosine Monophosphate Deaminase 1 Enzyme catalysing the deamination of AMP to IMP in SkM G>A rs17602729 chr1:114693436 A: 0.123 Missense mutation, intron, transition substitution, intragenic A-allele results in a premature stop codon and non-functional enzyme, causing impaired AMP metabolism, which can lead to muscle fatigue, weakness and cramping [27]
BDNF Brain-derived neurotrophic factor Promotes neurone growth, differentiation, and maintenance [28] C>T rs6265 chr11:27658369 T: 0.197 Transition substitution, missense mutation, intragenic C-allele results in valine rather than methionine (T-allele) at amino acid 66, leading to greater abundance of exercise-induced serum BDNF concentration [20]
COL1A1 Pro-α1 (I) chain Major component of type I collagen, a structural protein found in most connective tissues, including tendon [29] C>T rs2249492 chr17:50185660 C: 0.379 Transition substitution, intron, intragenic Not yet known but intronic SNPs have the potential to influence gene expression and mRNA stability [30]
COL2A1 Pro-α1 (II) chain Major component of type II collagen, providing structure and strength to connective tissue, e.g. at the enthesis [31] C>T rs2070739 chr12:47974193 T: 0.096 Transition substitution, missense mutation, intragenic Not yet known
COL5A1 Pro-α1 (V) chain Major component of type V collagen, which regulates the diameter of collagen fibrils [32] C>T rs12722 chr9:134842570 C: 0.415 Transition substitution, 3’-UTR, intragenic Not yet known but 3’-UTR SNPs have the potential to alter the level, location, or timing of gene expression [30]
NOS3 Nitric oxide synthase-3 Potentially stimulating muscle hypertrophy through NO-mediated vasodilatation[14] C>T rs2070744 chr7:150992991 C: 0.438 Transition substitution, intron, intragenic T-allele increases gene promoter activity, thus increasing eNOS and NO synthesis [33]
PPARA Peroxisome proliferator-activated receptor-α Present in SkM and promotes uptake, utilization, and catabolism of fatty acids (FAs) by upregulation of genes involved in FA transport, FA binding activation, and peroxisomal and mitochondrial FA β-oxidation [34] G>C rs4253778 chr22:46234737 C: 0.192 Intron, transversion substitution, intragenic Not yet known but intronic SNPs have the potential to influence gene expression and mRNA stability [30]
VDR Vitamin D receptor Downstream targets of VDR are involved in SkM regeneration, recovery and hypertrophy following strenuous exercise [35] A>G rs2228570 chr12:47879112 A: 0.378 Missense mutation, transition substitution, intragenic G-allele leads to 244 amino acid-long VDR, while A-allele produces 247 amino acid-long protein. The shorter VDR has enhanced transactivation capacity as a transcription factor [3638]

SNP, single nucleotide polymorphism; MAF, minor allele frequency (according to European population [39]); chr, chromosome; SkM, skeletal muscle.

Anthropometry

Height, sitting height and body mass were assessed as previously described [2]. Pubertal timing was calculated according to the estimated biological age of each individual, using calculations described elsewhere [40]. The age at which peak linear growth in stature occurs [i.e. peak height velocity (PHV)] is an indicator of somatic maturity, and biological age was calculated by subtracting the chronological age at the time of testing from the estimated chronological age at PHV. Participants were segregated into two maturity groups based on biological age, i.e. pre-PHV (≤ -1.0 years), mid-PHV (-0.99 to 0.5 years) and post-PHV (> 0.5 years). The number of participants in each maturity group is presented in Table 1.

Warm up protocol, jump and speed assessments

The warm up, jump and speed assessments were performed as detailed elsewhere [2] and are described briefly, below. Participants wore a t-shirt, shorts and football boots for all tests except the BV CMJ, for which no footwear was permitted.

Warm up protocol

The standardized 10 min warm up comprised 5 min dynamic movements (e.g. high knees, skips, lunges), followed by at least 3 practice trials at each assessment (after a demonstration of each).

Jump assessments

Participants performed a minimum of 3 trials of the BH CMJ (ESP n = 471; CON n = 131) and BV CMJ (ESP n = 211; CON n = 92) with approximately 30 seconds of recovery between trials and 5 min between jump types. If the third jump measurement (height or distance) was higher than the first or second, the participant performed a fourth trial. The highest or longest jump was selected for subsequent analysis.

Acceleration/Sprint assessments

A photocell timing system (Brower Timing System, Salt Lake City, UT, USA) was used to assess sprints to the nearest 0.001 s. Participants (ESP n = 458; CON n = 111) were required to perform three maximal sprints in which they were instructed to run 24 m as quickly as possible. The first, second and third timing gates were positioned 1 m, 11 m and 21 m from the start line, respectively. Thus, acceleration was defined as the time taken to run 0–10 m, and sprint was defined as the time taken to run 0–20 m.

Agility (change of direction) assessment

A timing gate (Brower) was positioned 1 m in front of the start line. After assuming a split stance crouch position, with their front foot behind the start line, participants (ESP n = 439; CON n = 38) were instructed to sprint to a line 5 m further from the start line, plant their right foot on this line, turn and sprint back in a straight line past the original start line. The time taken for the participants to run the full 5 m plus 5 m distance, i.e. breaking the infrared beam of the timing gate twice, was recorded using a hand-held wireless controller. This procedure was performed three times using the right leg, and three times using the left leg as a pivot. The fastest time of the three sprints was recorded and represented agility using that particular leg as a pivot. Participants performed this “modified 505 agility test” on a 4G artificial grass surface.

Quadriceps femoris muscle physiological characteristics

The protocols for measuring QF muscle strength, volume, PCSA and architecture has been described in detail previously [41]. A summary of these procedures is outlined below.

Muscle strength. Knee extension isometric MVCs of the dominant leg were assessed on an isokinetic dynamometer (Biodex 3, Medical Systems, Shirley, USA) and analysed using AcqKnowledge data acquisition software (Biopac Systems Inc., Goleta, CA, USA). Participants sat on the rigid chair with their hip angle set to 85° (supine position was equivalent to 180°) and their knee angle set to 90° (0° = full extension), before being secured at the hip, chest and distal thigh with inextensible straps to minimise movement. Participants then performed at least three MVCs lasting 2–3 s, with 60 s rest in between attempts. If the highest of the three MVC swas >5% higher than the next highest, more attempts were made to ensure true MVC was achieved.

Muscle volume (Vm). With the participant in a relaxed seated position on the isokinetic dynamometer (knee joint angle at 90°), B-mode ultrasonography (MyLab 30 CV, Esoate Biomedica, Genoa, Italy) was used to locate the distal (lateral femoral condyle) and proximal (base of greater trochanter) ends of the femur, with the distance between both points providing the femur length. The anatomical cross-sectional area (ACSA) of the quadriceps was then measured at 40% of femur length (from the distal end) using ultrasound [42]. Vm was calculated from a combination of this ACSA, femur length, and a series of regression equations detailed elsewhere [43].

Vastus lateralis (VL) muscle fascicle length (Lf) and pennation angle (θp). VL Lf and θp were measured at rest using ultrasonography with the participant in a relaxed seated position on the isokinetic dynamometer (knee joint angle at 90°). Once the origin and insertion of the vastus lateralis were identified, this enabled the lateral and medial boundaries of the muscle to be located at 50% of its length. The centre of the muscle was then identified and marked on the skin with a permanent marker pen, and this location was used to measure Lf and θp in three fascicles, and the mean of those three was used for subsequent analysis.

Physiological cross sectional area (PCSA). QF muscle PCSA was calculated by dividing QF Vm by VL Lf [40].

Blood and saliva sampling

Participants’ DNA was obtained from either whole blood (n = 83) or saliva (n = 603) samples. For blood sampling, 10-mL was drawn from a superficial forearm vein into an EDTA vacutainer (BD Vacutainer Systems, Plymouth, UK). The whole blood was then aliquoted into 2-mL cryotubes (Eppendorf AG, Hamburg, Germany) and stored at -80°C until DNA extraction. For saliva samples, the participants dribbled 2 mL saliva into a Genefix saliva collection tube (Isohelix Ltd., Harrietsham, UK), having not eaten or drunk anything (or chewed gum) for at least 30 min prior to saliva sampling. After collection, the saliva tubes were closed and gently shaken to mix the saliva with 2 mL non-toxic stabilization buffer contained within the tube. Samples were then aliquoted into 2 mL cryotubes (Eppendorf AG) and stored at -80°C until DNA extraction.

DNA extraction and genotyping

DNA purification from whole blood and saliva samples was performed manually using a QIAamp® DNA Blood Mini Kit (Qiagen Ltd., Manchester, UK), following the manufacturer’s guidelines. DNA samples were then stored at 4°C until subsequent genotyping.

Real-time polymerase chain reaction was performed (Rotor-Gene Q, Qiagen) to establish the genotypes of each SNP for each participant. Each 10 μL reaction volume contained 5 μL Genotyping Master Mix (Applied Biosystems, Foster City, USA), 3.5 μL nuclease-free H2O (Qiagen), 0.5 μL genotyping assay (Applied Biosystems), plus 1 μL DNA sample. Both negative [1 μL nuclease-free H2O (Qiagen) replaced the DNA template] and positive controls were included in each RT-PCR run, which used the following protocol: denaturation at 95°C for 10 min, followed by 50 cycles of incubation at 92°C for 15 s, then annealing and extension at 60°C for 1 min. Genotypes were determined using Rotor-Gene Q Pure Detection 2.1.0 software (Qiagen). All samples were analysed in duplicate and there was 100% agreement between genotype calls for samples from the same participant. Genotyping was performed in accordance with published genotyping and quality control recommendations [44] and all SNPs were genotyped according to the forward DNA strand.

Statistical analysis

Genotype frequency distributions for all ESP, pre-PHV ESP, post-PHV ESP and CON were tested for compatibility with Hardy-Weinberg equilibrium (HWE) using χ2 goodness of fit tests. Allele and genotype frequency distribution for each SNP were compared between all ESP and CON, and between pre- and post-PHV ESP and CON, using χ2 tests of homogeneity (Table 3). The mid-PHV ESP group was omitted from the latter analyses due to its relatively small sample size. To control for multiple SNP comparisons, a false discovery rate (FDR) of 0.2 [45] was applied to both the allele and genotype frequency distribution models. For both allele and genotype, two FDR models (each including 10 P-values, one for each SNP) were calculated for each population comparison, i.e. (i) all ESP vs. all CON; and (ii) pre-PHV ESP vs. post-PHV ESP vs. all CON. The original P-values in each model were sorted separately in ascending order, and then each original P-value was compared with its respective Benjamini-Hochberg (B-H) critical value [45]. For each FDR model, the largest original P-value that was less than its B-H critical value was deemed significant, together with any other original P-values smaller than that [45].

Table 3. Allele and genotype frequency distributions for the 10 investigated single nucleotide polymorphisms (SNPs) in all elite youth soccer players (ESP) and control participants (CON), and in pre- and post-peak height velocity (PHV) ESP.

All SNPs were analysed using a co-dominant model (e.g. AA vs. Aa vs. aa) except when the minor allele frequency (MAF) for a European population [39] was <0.25, in which case a dominant model (e.g. AA vs Aa+aa) was used instead. P-values are presented for allele and genotype frequency distribution χ2 models (model 1: CON vs. ESP; and model 2: CON vs. pre-PHV ESP vs. post-PHV ESP) and post-hoc between group differences are identified (P < 0.05), i.e. * difference between ESP and CON; ** difference between post-PHV ESP and CON; *** difference between pre-PHV ESP and CON; difference between post- and pre-PHV ESP.

Cohort ESP CON χ2 model 1 χ2 model 2
Sub groups All Pre-PHV Post-PHV All CON vs. ESP CON vs. pre-PHV ESP vs. post-PHV ESP
n 535 124 372 151 P-values P-values
PPARA G>C (rs4253778)
Allele (%) C 24.1 19.0 25.8 18.5 0.042 0.011
G 75.9 81.0 74.2 81.5
Genotype (%) CC+GC 42.2* 34.7 44.9** 31.8 0.021 0.009
GG 57.8* 65.3 55.1** 68.2
NOS3 C>T (rs2070744)
Allele (%) C 35.3* 38.7 34.8 42.7 0.019 0.057
T 64.7* 61.3 65.2 57.3
Genotype (%) CC 13.3 16.1 12.4 18.5 0.069 0.218
CT 44.1 45.2 45.2 48.3
TT 42.6 38.7 42.5 33.1
ACTN3 C>T (rs1815739)
Allele (%) C (R) 56.7 52.0 58.3 57.9 0.706 0.206
T (X) 43.3 48.0 41.7 42.1
Genotype (%) CC (RR) 35.7 33.1 36.6 31.8 0.061 0.038
CT (RX) 42.1 37.9*** 43.5 52.3
TT (XX) 22.2 29.0*** 19.9 15.9
AGT A>G (rs699)
Allele (%) G 50.5* 44.8 53.5** 42.7 0.017 0.002
A 49.5* 55.2 46.5** 57.3
Genotype (%) GG 25.0 21.8 28.0** 16.6 0.051 0.008
GA 50.8 46.0 51.1 52.3
AA 24.1 32.3 21.0** 31.1
AMPD1 G>A (rs17602729)
Allele (%) G 90.0 89.9 90.6 90.1 0.973 0.937
A 10.0 10.1 9.4 9.9
Genotype (%) GG 81.5 81.5 82.8 82.1 0.861 0.939
GA+AA 18.5 18.5 17.2 17.9
BDNF C>T (rs6265)
Allele (%) C 83.2 79.8 84.5 82.1 0.666 0.204
T 16.8 20.2 15.5 17.9
Genotype (%) CC 69.0 63.7 71.2 66.9 0.626 0.249
CT+TT 31.0 36.3 28.8 33.1
COL1A1 C>T (rs2249492)
Allele (%) T 58.6 60.1 58.3 59.3 0.834 0.879
C 41.4 39.9 41.7 40.7
Genotype (%) TT 34.8 37.1 34.9 37.1 0.773 0.978
TC 47.7 46.0 46.8 44.4
CC 17.6 16.9 18.3 18.5
COL2A1 C>T (rs2070739)
Allele (%) C 89.4 89.5 89.2 89.7 0.882 0.972
T 10.6 10.5 10.8 10.3
Genotype (%) CC 80.0 79.8 79.6 80.1 0.971 0.989
CT+TT 20.0 20.2 20.4 19.9
COL5A1 C>T (rs12722)
Allele (%) C 49.6 51.6 49.2 43.7 0.069 0.144
T 50.4 48.4 50.8 56.3
Genotype (%) CC 23.0 27.4 22.6 21.2 0.043 0.144
CT 53.3 48.4 53.2 45.0
TT 23.7 24.2 24.2 33.8
VDR A>G (rs2228570)
Allele (%) G 64.5 64.9 63.8 68.5 0.191 0.351
A 35.5 35.1 36.2 31.5
Genotype (%) GG 40.7 40.3 40.1 45.7 0.400 0.650
GA 47.5 49.2 47.6 45.7
AA 11.8 10.5 12.4 8.6

Those SNPs that demonstrated differences were combined to create a total genotype score (TGS) [46], with the ‘power/speed’ homozygote genotype for each SNP given a score of 2, the heterozygote a score of 1, and the ‘non-power/speed’ homozygote a score of 0. This ‘ESP’ TGS was then compared between CON, pre-PHV ESP and post-PHV ESP with a one-way ANOVA.

For each of the 10 SNPs, a three-way between-subjects ANOVA was used to investigate the association between genotype (either 3 or 2 genotype groups, depending on the minor allele frequency, see Tables 2 and 3), player status (ESP vs. CON), maturity status (pre- vs. mid- vs. post-PHV) and physical performance (horizontal/vertical power, acceleration, sprint and agility). Thus, we focussed on the three-way interaction (genotype x athlete status x maturity status), two-way interactions (genotype x athlete status and genotype x maturity status), and main effect of genotype. To control for multiple comparisons (10 SNPs), an FDR of 0.2 [45] was applied to the SNP main effect and interaction effect P-values for each performance variable. Thus, four FDR models (each including 10 P-values, one for each SNP) were calculated for each phenotype, i.e. (i) the main effect of SNP; (ii) SNP x athlete status two-way interaction; (iii) SNP x maturity status two-way interaction; (iv) SNP x athlete status x maturity status three-way interaction. As above, P-values in each model were sorted separately in ascending order, and compared with their respective B-H critical values [45]. If a significant three-way interaction occurred, simple simple main effects and simple simple pairwise comparisons with Bonferroni adjustment revealed significant differences. If a significant two-way interaction occurred, simple main effects and pairwise comparisons with Bonferroni adjustment were performed to reveal differences. If a significant genotype main effect existed, Bonferroni post-hoc tests were used to locate the genotype difference in test performance scores. We accounted for the larger ESP vs. CON cohort size by ensuring that all data were checked for violations of specific three-way ANOVA assumptions that may invalidate the interpretation of results. More specifically, Mauchly's test of sphericity, which assesses whether the variance of the differences between levels of the within-subject factors are equal, was applied to establish if the homogeneity of variance principle had been violated. If Mauchly's test of sphericity was violated, the Greenhouse-Geisser correction method was applied [47]. Box plots were also used to check for outliers (none were found); and all data sets were normally distributed according to the Kolmogorov-Smirnov test.

A separate ‘performance’ TGS [46] to the one described above was used to estimate the combined influence of only those SNPs that were individually associated with power/speed performance after FDR correction. This was based on a similar approach used by our group to investigate the polygenic association with exercise-induced muscle damage [48]. Spearman correlations were used to determine relationships between this TGS and speed/power performance according to PHV group.

Genotype differences in quadriceps femoris muscle strength, size and architecture were assessed via one-way ANOVAs. To control for multiple comparisons (10 SNPs and four physiological variables), an FDR of 0.2 [45] was applied to the SNP main effect P-values for each variable. As above, P-values in each model were sorted separately in ascending order, and compared with their respective B-H critical values [45]. If a significant main effect existed, Bonferroni pairwise comparisons were used to locate differences between genotypes.

Statistical analyses were completed using IBM SPSS Statistics 26 (IBM, New York, USA), and the significance level was set at P < 0.05.

Results

Allele and genotype frequency distributions

Hardy-Weinberg equilibrium (HWE)

Genotype frequency distributions of all 10 SNPs were in HWE in all CON and ESP except for the ACTN3 SNP, which was not in HWE in all ESP (χ2 = 11.430, P = 0.003). When ESP were segregated according to maturity status, pre- and post-PHV genotype distributions for all 10 SNPs were in HWE except for the ACTN3 SNP, which was not in HWE in pre-PHV elite soccer players (χ2 = 6.267, P = 0.044).

Genotype and allele frequency distributions in ESP vs. CON

After FDR correction, four SNPs [PPARA G>C (rs4253778), NOS3 C>T (rs2070744), ACTN3 C>T (rs1815739), and AGT A>G (rs699)] demonstrated differences in allele/genotype frequency distribution between all ESP and CON, pre-PHV ESP and CON, post-PHV ESP and CON, or pre-PHV ESP and post-PHV ESP (Table 3).

‘ESP’ TGS based on differences in allele/genotype frequency distributions between pre- and post-PHV ESP, and between pre-/post-PHV ESP and CON. The ‘ESP’ TGS differed between groups (F = 9.064; P<0.001), with post-PHV ESP (50.7 ± 18.2) being higher than pre-PHV ESP (44.1 ± 19.0; P = 0.001) and CON (44.5 ± 18.5; P = 0.001), but no difference between pre-PHV ESP and CON (P = 0.848). Regarding muscle physiological variables in post-PHV ESP and CON (n = 42), the ‘ESP’ TGS correlated with quadriceps femoris (QF) muscle volume (r = 0.400; P = 0.010), QF PCSA (r = 0.334; P = 0.033), VL θp (r = 0.416; P = 0.007) and knee extension MVC (r = 0.408; P = 0.008). In post-PHV ESP only (n = 22), the ‘ESP’ TGS correlated with QF muscle volume (r = 0.430; P = 0.046) and QF PCSA (r = 0.438; P = 0.041), vastus lateralis θp (r = 0.441; P = 0.040) and knee extension MVC (r = 0.474; P = 0.026).

Associations between genotype and power/speed performance

Bilateral horizontal jump performance

There was a main effect of BDNF genotype (F = 7.367, P = 0.007; Fig 1A), with CC homozygotes achieving greater bilateral horizontal jump distance than T-allele carriers (regardless of athlete or maturity status).

Fig 1. The association between BDNF (rs6265) and horizontal power, acceleration and sprint capabilities.

Fig 1

The association between BDNF (rs6265) C>T genotype and: (a) bilateral horizontal forward countermovement jump (BH CMJ), (b) 10 m sprint and (c) 20 m sprint performance in pre-, mid- and post-peak height velocity (PHV) elite youth soccer players (ESP, pre-PHV: n = 121; mid-PHV: n = 37; post-PHV: n = 313) and controls (CON, pre-PHV: n = 44; mid-PHV: n = 30; post-PHV: n = 57). * Difference between BDNF CC homozygotes and T-allele carriers (P < 0.05).

Bilateral vertical countermovement jump

Prior to FDR correction, there was a main effect of AGT genotype (F = 3.624, P = 0.028), with GG genotype jumping higher than both AG (P<0.001) and AA (P<0.001) genotypes but no difference between AG and AA (P = 0.150). However, following FDR correction, this main effect was no longer significant. There was a two-way interaction between COL5A1 genotype and athlete status (F = 5.129, P = 0.004; Fig 3C), with ESP CC homozygotes achieving greater jump height than TT homozygotes (P = 0.002). There was also a non-significant tendency for CC homozygotes to achieve greater jump height than CT genotypes (P = 0.090).

Fig 3. The association between COL5A1 (rs12722) and acceleration, sprint and vertical power capabilities.

Fig 3

The association between COL5A1 (rs12722) C>T genotype and: (a) 10 m sprint, (b) 20 m sprint and (c) bilateral vertical countermovement jump (BV CMJ) performance in pre-, mid- and post-peak height velocity (PHV) elite youth soccer players (ESP, pre-PHV: n = 114; mid-PHV: n = 35; post-PHV: n = 309) and controls (CON, pre-PHV: n = 44; mid-PHV: n = 30; post-PHV: n = 56). * Main effect of COL5A1 (P < 0.05), with a difference between CC and CT genotypes (P < 0.05); # difference between ESP CC and TT homozygotes (P < 0.05).

10 m acceleration performance

There was a main effect of BDNF genotype (F = 6.697, P = 0.010; Fig 1B), with CC homozygotes achieving faster 10 m acceleration times than T-allele carriers. There was a main effect of COL2A1 genotype (F = 5.365, P = 0.021; Fig 3A), with CC homozygotes achieving faster 10 m acceleration times than T allele carriers. There was also a two-way interaction between COL2A1 genotype and athlete status (F = 12.346, P<0.0001; Fig 3A), with CON CC homozygotes achieving faster 10 m acceleration times than T-allele carriers (P < 0.0001). There was a main effect of COL5A1 genotype (F = 3.532, P = 0.030; Fig 2A), with CC homozygotes achieving faster 10 m acceleration times than CT genotype (P = 0.025). There was a main effect of NOS3 genotype (F = 4.308, P = 0.014; Fig 4), with TT homozygotes achieving faster 10 m acceleration times than CC homozygotes (P = 0.011). There was a three-way interaction between AMPD1 genotype, athlete status and maturity status (F = 4.061; P = 0.018), with pre-PHV ESP GG homozygotes achieving faster 10 m sprint times than pre-PHV ESP A-allele carriers (P = 0.003).

Fig 2. The association between COL2A1 (rs2070739) and acceleration and sprint capabilities.

Fig 2

The association between COL2A1 (rs2070739) C>T genotype and: (a) 10 m sprint and (b) 20 m sprint performance in pre-, mid- and post-peak height velocity (PHV) elite youth soccer players (ESP, pre-PHV: n = 121; mid-PHV: n = 37; post-PHV: n = 314) and controls (CON, pre-PHV: n = 44; mid-PHV: n = 29; post-PHV: n = 57). * Difference between COL2A1 CC homozygotes and T-allele carriers (P < 0.05); # difference between CON CC homozygotes and T-allele carriers (P < 0.05).

Fig 4. The association between NOS3 (rs2070744) and acceleration capability.

Fig 4

The association between NOS3 (rs2070744) C>T genotype and 10 m acceleration in pre-, mid- and post-peak height velocity (PHV) elite youth soccer players (ESP, pre-PHV: n = 121; mid-PHV: n = 36; post-PHV: n = 313) and controls (CON, pre-PHV: n = 44; mid-PHV: n = 30; post-PHV: n = 57). * Main effect of NOS3 genotype (P < 0.05), with a difference between TT and CC homozygotes (P < 0.05).

20 m sprint performance

There was a main effect of COL2A1 genotype (F = 6.188, P = 0.013; Fig 2B), with CC homozygotes achieving faster 20 m sprint times than T-allele carriers. However, there was also a two-way interaction between COL2A1 genotype and athlete status (F = 8.735, P = 0.003; Fig 2B), with CON CC homozygotes achieving faster 20 m sprint times than T-allele carriers (P<0.001). There was an initial two-way interaction between COL2A1 genotype and PHV status (F = 3.049, P = 0.048) but this did not remain significant following FDR correction. Likewise, there was an initial three-way interaction between PPARA genotype, ESP status and maturity status (F = 3.034, P = 0.049), with pre-PHV CON C-allele carriers being faster than pre-PHV CON GG genotypes (F = 4.511, P = 0.034) but this was not significant after FDR correction. There was a main effect of BDNF genotype (F = 7.224, P = 0.007; Fig 1C), with GG homozygotes achieving faster 20 m sprint times than T-allele carriers. There was a main effect of COL5A1 genotype (F = 6.188, P = 0.029; Fig 3C), with CC homozygotes achieving faster 20 m sprint times than individuals of CT genotype (P = 0.024). There was a non-significant tendency for a main effect of NOS3 genotype (F = 2.584, P = 0.076), with post hoc analyses revealing a tendency for TT homozygotes to achieve faster 20 m sprint times compared to CC genotypes (P = 0.079). There was a three-way interaction between AMPD1 genotype, athlete status and maturity status (F = 4.292; P = 0.014), with pre-PHV ESP GG homozygotes achieving faster 20 m sprint times than pre-PHV ESP A-allele carriers (P = 0.019).

Agility (change of direction) performance

After FDR correction, there were no significant associations between any SNP and agility 505 performance.

Correlations between ‘performance’ total genotype score (TGS) and power/speed performance

Determination of ‘performance’ TGS

After FDR correction, four SNPs [AMPD1 G>A (rs17602729), BDNF C>T (rs6265), COL5A1 C>T (rs12722), COL2A1 C>T (rs2070739)] were associated with various power/sprint performance variables and were, therefore, included in our ‘performance’ TGS model. The TGS for post-PHV ESP (75.6 ± 13.0) was higher than for CON (73.0 ± 12.2; P = 0.048) but did not differ from pre-PHV ESP (73.9 ± 15.1; P = 0.227), which in turn did not differ from CON (P = 0.609).

‘Performance’ TGS correlations with power/speed performance variables

In pre-PHV ESP, TGS correlated with BV CMJ (r = 0.334, P<0.001), BH CMJ (r = 0.272, P = 0.004), 10 m acceleration (r = -0.238, P = 0.012), 20 m sprint (r = -0.286, P = 0.002) and 505 agility score (r = -0.258, P = 0.017). In post-PHV ESP, TGS correlated solely with BH CMJ (r = 0.140, P = 0.014). In post-PHV CON, TGS correlated with 10 m sprint (r = -0.465, P<0.001), 20 m sprint (r = -0.298, P = 0.029) and 505 agility (r = -0.418, P = 0.047). When ESP and CON were analysed together, pre-PHV TGS correlated with BV CMJ (r = 0.271, P = 0.001), BH CMJ (r = 0.195, P = 0.015), 10 m sprint (r = -0.210, P = 0.014), 20 m sprint (r = -0.242, P = 0.004) and 505 agility score (r = -0.258, P = 0.017). In all post-PHV participants, TGS correlated with BH CMJ (r = 0.198, P = 0.018) and BV CMJ (r = 0.134, P = 0.010). Regarding muscle physiological variables in post-PHV ESP and CON (n = 42), the ‘performance’ TGS correlated with quadriceps femoris (QF) muscle volume (r = 0.422; P = 0.006), QF PCSA (r = 0.425; P = 0.006) and knee extension MVC (r = 0.388; P = 0.012). In post-PHV ESP only (n = 22), the ‘performance’ TGS correlated with QF muscle volume (r = 0.573; P = 0.005) and QF PCSA (r = 0.566; P = 0.006).

SNPs associated with skeletal muscle physiological variables

In a sub-sample of 42 post-PHV participants (ESP, n = 22; and CON, n = 20), we found associations between three SNPs [AGT A>G (rs699), COL5A1 C>T (rs12722) and PPARA G>C (rs4253778)] and quadriceps femoris muscle volume, PCSA, architecture and maximum strength (Table 4).

Table 4. Quadriceps femoris muscle strength, size and architecture in 42 post-PHV soccer players (ESP, n = 22; and CON, n = 20).

Mean ± SD data are presented according to genotype for three different SNPs.

SNP Quadriceps femoris muscle physiological characteristic
KE MVC (N∙m) QF Vm (L) QF PCSA (cm2) VL θp (°)
AGT A>G (rs699)
AA (n = 6) 252 ± 40 2.5 ± 0.2 203 ± 15 12.8 ± 1.3
AG(n = 23) 306 ± 61 2.5 ± 0.3 197 ± 31 14.9 ± 1.8
GG (n = 13) 349 ± 71** 3.0 ± 0.6*** 235 ± 48** 15.3 ± 2.5*
COL5A1 C>T (rs12722)
CC (n = 17) 345 ± 70** 2.8 ± 0.6* 225 ± 43* 14.8 ± 2.3
CT (n = 18) 297 ± 61 2.6 ± 0.3 207 ± 32 14.7 ± 2.1
TT (n = 7) 268 ± 46 2.4 ± 0.2 180 ± 26 14.6 ± 2.0
PPARA G>C (rs4253778)
GG (n = 25) 293 ± 61 2.6 ± 0.4 208 ± 34 14.4 ± 1.9
GC (n = 13) 343 ± 76 2.8 ± 0.5 204 ± 43 14.3 ± 1.9
CC (n = 4) 329 ± 59 2.9 ± 0.7 241 ± 47 17.9 ± 1.8***

KE MVC, knee extension maximum voluntary contraction; QF Vm, quadriceps femoris muscle volume; PCSA, physiological cross-sectional area; VL θp, vastus lateralis muscle fascicle pennation angle

*significantly greater than other genotypes for that respective SNP (P < 0.05)

**significantly greater than other genotypes for that respective SNP (P < 0.02)

***significantly greater than other genotypes for that respective SNP (P < 0.01).

Discussion

The aims of this study were to investigate the association of 10 candidate SNPs with elite youth soccer player (ESP) status at different stages of maturity, and to investigate the individual and combined associations of those 10 SNPs with power/speed performance in ESP and recreationally active control (CON) participants, according to maturity status. In a cohort of 686 healthy, young males, including 535 ESP and 151 CON, we have shown for the first time that pre- and post-PHV ESP have distinct polygenic profiles. Post-PHV ESP had a genetic profile that favours superior power/speed characteristics, while pre-PHV ESP had a more endurance-associated polygenic profile. Furthermore, we found that five SNPs were associated with power/speed in ESP at different stages of maturity. The combination of these five SNPs explained a proportion of the variability in all performance tasks in pre-PHV ESP, but only the variability in bilateral horizontal-forward countermovement jump (BH CMJ) performance in post-PHV ESP. Together, these genetic differences suggest the soccer talent identification procedures (i.e. the process of recognising and recruiting individuals with the perceived potential to become elite players [49,50]) employed by the multiple academies in this study recruit pre- and post-PHV ESP based on different physiological criteria, i.e. endurance (pre-PHV) vs. power/speed (post-PHV) capability. Given the importance of power/speed in determining post-PHV ESP status [2,9,13,51,52], these findings have significant implications for future youth soccer talent identification and development programmes (i.e. intervention programmes prescribed to provide the most optimal environment to allow the player to realise their full potential [49,50]).

We have previously shown that horizontal and vertical power are indicators of ESP playing status in post-PHV ESP but not in pre-PHV ESP [2] but it was not known whether genetic variation could explain these disparities between maturity groups. In the present study, post-PHV ESP demonstrated a greater frequency distribution of the PPARA (rs4253778) C-allele and the AGT (rs699) G-allele compared to pre-PHV ESP and CON. When maturity groups were combined, we found that ESP had a greater frequency distribution of PPARA (rs4253778) C-allele carriers, the AGT (rs699) G-allele and the NOS3 (rs2070744) T-allele compared to CON. Crucially, all three of these alleles have previously been associated with elite power athlete status [15,16,53] and superior power performance [15,16,53,54]. Thus, the higher frequency of the PPARA (rs4253778) C-allele and the AGT (rs699) G-allele in post-PHV ESP compared to pre-PHV ESP and CON could help explain these previously reported maturation-dependent differences in power/speed between ESP and CON.

The NOS3 T-allele may exert its influence on ESP and power athlete status via increased gene promoter activity and higher endothelial nitric oxide synthesis [14]. This would stimulate muscle hypertrophy through nitric oxide-mediated vasodilatation [14], thus increasing strength and power. The AGT (rs699) G-allele, on the other hand, is thought to be associated with elite power athlete status [15,16,53] via its ability to increase serum angiotensinogen (AGT) concentration [16]. Increased AGT upregulates angiotensin II, which acts as a growth factor in skeletal muscle, and can modulate skeletal muscle hypertrophy in response to mechanical loading, thus enhancing power/speed [14,16,53]. The PPARA (rs4253778) C-allele has previously been associated with ESP status [3,4] and a greater composition of type II skeletal muscle fibres [18], which are larger (therefore producing more force) and able to shorten quicker (thus able to generate more power) than type I fibres [17]. These findings therefore support our original hypothesis that post-PHV ESP would demonstrate a polygenic profile more advantageous for power/speed performance.

In line with the above maturity/athlete status-dependent differences in allele/genotype frequency distribution, we found that the frequency distribution of ACTN3 (rs1815739) TT (XX) genotype was greater in pre-PHV ESP compared to CON (29.0% vs. 15.9%, respectively). ACTN3 XX homozygotes cannot produce alpha-actinin-3 [24], a structural sarcomeric protein found in type II skeletal muscle fibres that inhibits the transition of larger, stronger, more powerful type II fibres towards a smaller, weaker more oxidative phenotype [23]. This probably explains the greater muscle strength, power and volume observed in R-allele carriers compared to XX homozygotes [55], as well as the higher frequency distribution of the R-allele in elite power athletes compared to control participants and endurance athletes [56]. Thus, the PPARA (rs4253778), AGT (rs699) and ACTN3 (rs1815739) allele/genotype frequency distributions within our ESP and CON cohorts suggest that the polygenic profile of pre-PHV ESP favours fatigue resistance, rather than power/speed.

Indeed, when all four SNPs associated with ESP status [PPARA (rs4253778), AGT (rs699), ACTN3 (rs1815739) and NOS3 (rs2070744)] were combined to create a ‘ESP’ total genotype score (TGS) [46], post-PHV ESP had a higher TGS than both pre-PHV and CON, with no difference between pre-PHV and CON, thus supporting the individual SNP differences in allele/genotype frequency distribution. In English Category 1 soccer academies, players do not compete 11 vs. 11 on a full-sized pitch until they reach ~13 years of age (approximately when PHV occurs [40]). While 11 vs. 11 match-play requires a greater percentage of time spent at low speeds and more frequent explosive sprint actions, match-play on smaller pitches with fewer players requires a greater distance per minute, more time running at moderate speeds, and a higher work load per minute [57], thus placing a greater physiological demand on aerobic rather than anaerobic capacity. The playing demands of elite soccer therefore differ according to maturity status and, at pre-PHV level, aerobic capacity is more important than power/speed, which would explain the endurance-favourable genetic profile of our pre-PHV ESP. However, at post-PHV level, when powerful actions are performed more frequently [51] and often determine the outcome of games [58], academy recruitment staff may choose to release players with suboptimal power capabilities (and therefore a suboptimal power genetic profile) from their contracts in preference for players with superior power capabilities (and therefore a more favourable power genetic profile). This probably explains the superior power previously observed in ESP vs. CON at post-PHV [2,9,13,51,52] but not at pre-PHV [2]. Overall, this novel information suggests that current elite soccer pre-PHV talent identification procedures may benefit from the addition of genetic profiling to increase the chance of developing powerful players from a young age through to first team.

As well as investigating differences in genetic profile between maturity status within ESP, we sought to examine the genetic association with power/speed performance in ESP at different stages of maturity. Four of the five SNPs associated with power/speed in this study [NOS3 C>T (rs2070744), COL5A1 C>T (rs12722), BDNF C>T (rs6265), AMPD1 G>A (rs17602729)] have previously been investigated within a sport/exercise context. Interestingly, with the exception of NOS3 C>T, these were different SNPs to the four that demonstrated different allele/genotype frequency distributions between pre- and post-PHV ESP and CON. Although we found no association between NOS3 C>T and power per se, we are the first to show that the TT genotype was associated with greater acceleration/sprint speed, which is a component of power. The NOS3 TT genotype may therefore stimulate an increase in the number of sarcomeres arranged in series rather than in parallel, thus increasing muscle fibre length, which is the main determinant of muscle fibre contraction velocity [59].

While muscle fibre length is a determinant of power output, the extensibility of the tendon is related to speed and horizontal-forward power capabilities in sprinters [10] and post-PHV ESP [9], respectively. Our data show that, regardless of maturity and athlete status, individuals of COL5A1 (rs12722) CC genotype achieved quicker acceleration and sprint times compared to CT heterozygotes. As the CC genotype has previously been associated with more extensible knee extensor tendons [11], it is possible that the tendons of CC homozygotes can store and release more energy, thus amplifying power during sprint and acceleration. Our data also show that ESP CC homozygotes achieved greater BV CMJ height compared to TT homozygotes. Assuming that CC homozygotes have more elastic tendons compared to TT homozygotes [11], our findings are somewhat in agreement with Kubo and colleagues [60], who reported that greater compliance of the VL tendon–aponeurosis complex facilitates improved countermovement jump performance. This relationship may only be significant in ESP, as they could probably co-ordinate the vertical jump actions better, thus gaining greater use of the stretch shortening cycle and elastic tendon properties. Nevertheless, we are the first to show that COL5A1 (rs12722) CC genotype is associated with superior acceleration/sprint and vertical jump capabilities, presumably due to the regulatory role that procollagen type V has in controlling fibril diameter and patellar tendon elasticity [11]. Furthermore, this is the first time the COL2A1 C>T (rs2070739) SNP has been investigated in a sport/exercise context and our data show that CC homozygotes sprinted quicker than their T-allele carriers, possibly due to having a greater capacity to store and release mechanical energy in the tendon, similarly to COL5A1 CC homozygotes.

In addition to having more compliant patellar tendons [10], sprinters producing more horizontal force during sprinting are able to highly activate their hamstring muscles just before ground contact [19]. Neuromuscular activation is therefore a major factor in determining sprint performance, and neural function is largely determined by neurotrophins, of which BDNF is one of the most active [28]. In this study, we have shown for the first time that BDNF (rs6265) CC homozygotes demonstrated greater horizontal power, acceleration and sprint performance. As the C-allele is associated with a greater abundance of exercise-induced serum BDNF concentration [20], our findings suggest that individuals with potentially enhanced neuromuscular characteristics perform better in the performance tasks that require the body to be propelled in the horizontal-forward direction.

We have also shown that AMPD1 (rs17602729) GG homozygotes achieved quicker acceleration and sprint times in pre-PHV ESP only. As reduced muscle AMPD1 activity can compromise the capacity to deplete ATP pools and accumulate adenosine monophosphate during high intensity exercise [61], it is possible that pre-PHV ESP AMPD1 GG homozygotes can repeat high intensity acceleration and sprint efforts more frequently during soccer match-play. As speed performance prior to the onset of maturation is underpinned by inter- and intra-muscular co-ordination [62], performing a greater volume of such actions may lead to preferential adaptations in acceleration and sprint performance in GG homozygotes compared to their A-allele counterparts. The AMPD1 G>A SNP is the only one of our panel of 10 SNPs that appears to influence acceleration and speed in pre-PHV ESP on an individual SNP basis.

Individual SNP associations with power/speed in ESP can potentially elucidate novel mechanisms linking genotype with tissue phenotype with physical performance. However, these complex power/speed tasks are underpinned by a combination of different physiological factors [9,13], which in turn are thought to be influenced by a combination of several genetic variants [53]. Therefore, we developed a novel ‘performance’ polygenic profile based on only those five key SNPs shown to be associated with acceleration, speed, vertical and/or horizontal power capabilities on an individual SNP basis. In pre-PHV ESP, there were five significant but weak correlations between the ‘performance’ TGS and acceleration, sprint, vertical power, horizontal power and agility performance, while in post-PHV ESP, there was only one positive, but very weak, correlation between the ‘performance’ TGS and horizontal power performance. The fact that these correlations were weak was not unexpected, as the contribution of single gene variants to relatively complex powerful tasks was always likely to be small. The higher number of correlations in pre- vs. post-PHV ESP could be due to the ESP environment increasingly impacting on their physical performance as they mature (e.g. training exposure in ESP increases and includes more athletic development and resistance training as well as soccer activities [63]). Furthermore, practitioners may attempt to improve the power/speed capabilities of less powerful post-PHV ESP through personalised training, which may help compensate to some degree for an unfavourable power/speed genetic profile, thus reducing the likelihood of finding relationships between this particular TGS and power/speed performance in post-PHV ESP.

We have also shown that, while not one single SNP was associated with agility performance, a combination of favourable power/speed genotypes may predispose pre-PHV ESP to a greater ability to change direction quickly. As agility performance has been associated with co-ordination and unilateral reactive strength capabilities [64], the main physiological factors underpinning agility capacity appear to be a combination of neural (inter-muscular co-ordination, neural firing frequency and motor unit synchronization) and muscle-tendon unit mechanical properties. It therefore seems logical that agility is determined by a combination of SNPs, rather than one individual genetic variant. Overall, our novel polygenic analysis shows that, while one SNP may not have a significant impact on explosive performance, a combination of SNPs could have a favourable effect, especially in pre-PHV individuals.

We acknowledge some limitations with our study. For example, we did not measure the physiological determinants of power/speed (e.g. lower limb muscle-tendon properties) in all of our participants. However, these SNPs were investigated on the basis of our previous findings that power [52], muscle morphology [13] and tendon properties [9] differed between post-PHV ESP and CON, and that muscle [13] and tendon [9] properties are related to power generated during CMJs in different directions. Measuring muscle-tendon properties at the cellular and/or molecular level would have required harvesting muscle/tendon biopsies, which would have been highly invasive and impractical in such a large cohort. It is possible, however, to measure muscle-tendon properties non-invasively in vivo using ultrasonography and isokinetic dynamometry [9,13]. Although it was not feasible to perform these additional assessments in all 686 participants, we completed them on a sub-sample of n = 42 (22 post-PHV ESP and 20 post-PHV CON; Table 4). We found significant main effects of COL5A1, AGT, and PPARA genotypes, i.e. those genotypes associated with superior power/speed phenotypes from the main study data were also associated with greater quadriceps muscle volume and physiological cross-sectional area (PCSA), muscle fascicle pennation angle (an indirect indicator of muscle fibre CSA) and/or greater maximum knee extensor strength, all physiological variables that are related to power [9,13]. Furthermore, both our ‘ESP’ and ‘performance’ TGSs correlated with quadriceps volume, which is the product of muscle PCSA (the main determinant of force) multiplied by muscle fascicle length (the main determinant of muscle contraction velocity). As power is the product of force x velocity, these correlations suggest that the combination of these nine SNPs influence power/speed, at least in part, via their combined influence on muscle volume and/or PCSA. Thus, although the muscle physiological phenotype-genotype associations were completed on a very limited sample size (n = 42), and should therefore be treated with caution, they do provide physiological evidence for the genotype-power/speed performance associations observed in our much larger cohort of ESP and CON (n = 686). Secondly, our estimation of PHV was not a direct measure of pubertal status, so we cannot make direct assertions regarding genetic associations with pubescent phase. However, previous studies have shown that PHV estimated via the methods employed in this study does compare well with direct measures of pubertal growth [40]. Moreover, measuring pubertal status directly involves invasive procedures that would not be practically feasible to undertake in 686 pre-, mid- and post-PHV males. Finally, both ESP and CON comprised participants of different ethnic origin, which may be considered a limitation due to evidence that genetic variation can differ between ethnic groups [65]. However, we sought to include as many ESP as possible, regardless of ethnicity, so that our results would more accurately reflect all ESP in England and Uruguay, rather than only those of a particular ethnic group. Nevertheless, despite ethnicity information only being available in 38% of CON, sub-sample analysis in participants of known Caucasian origin (ESP, n = 422; CON, n = 54) revealed that ACTN3 XX genotype frequency distribution was higher (P = 0.004) in pre-PHV ESP (32.0%) compared to both CON (13.0%) and post-PHV ESP (20.4%). This therefore supports our total cohort data.

To conclude, in a cohort of 535 ESP (plus 151 CON) from multiple academies across two continents (thus raising the external validity of our study), we have shown for the first time that pre- and post-PHV ESP have distinct genetic profiles thought to favour endurance and power/speed capabilities, respectively. We have also demonstrated that power, acceleration and sprint performance were associated with five SNPs, both individually and in combination, possibly by influencing muscle size and neuromuscular activation. Therefore, our results not only identify potentially novel physiological mechanisms underpinning the influence of muscle-tendon properties on power/speed performance, but they suggest that pre-PHV ESP are currently recruited with a genetic profile more suited for endurance rather than power and speed, with the latter phenotypes being crucial attributes for post-PHV ESP. Elite soccer academies may wish to consider this information when recruiting talented pre-PHV players, with a view of developing those players for the different demands of the post-PHV game, rather than relying on the recruitment of new players at that stage of maturity. Furthermore, before genetic testing can be considered a potentially useful tool to assist in talent identification and development, replication of our results by independent groups and the identification of more genetic markers of performance in youth soccer is necessary.

Supporting information

S1 File

(XLSX)

Acknowledgments

We are extremely grateful to the following individuals for their assistance with data collection: Ignacio Turrens, Ignacio Berriel, Maximiliano Bizzio, Gabriel González, Guillermo Gil, Santiago Ferro, Nicolas Silva, Cesar Santos, Fernando Fadeuille, Lucia Alza, Barbara Buriani, Florencia Fraque, Luciana Tripodi, Melissa Nuñez, Valeria Acosta, Federico Vanni, Cecilia Mariño, Carolina Mutilva, Carolina Pintos, Robert Naughton, Matthew Girven, Stephen McQuilliam, and Raffaella Rodighiero.

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

We declare that seven co-authors are employed by commercial companies: Liverpool Football Club (AM), Club Atlético Peñarol (ER, SR, SM, GH), Defensor Sporting Club (GL) and the Premier League (AOB). However, these companies only provided support in the form of salaries for authors AM, ER, SR, GL, SM, GH and AOB, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

References

  • 1.De Moor MHM, Spector TD, Cherkas LF, Falchi M, Hottenga JJ, Boomsma DI, et al. Genome-wide linkage scan for athlete status in 700 British female DZ twin pairs. Twin Res Hum Genet. 2007;10: 812–820. 10.1375/twin.10.6.812 [DOI] [PubMed] [Google Scholar]
  • 2.Murtagh CF, Brownlee TE, O’Boyle A, Morgans R, Drust B, Erskine RM. Importance of Speed and Power in Elite Youth Soccer Depends on Maturation Status. J strength Cond Res. 2018;32: 297–303. 10.1519/JSC.0000000000002367 [DOI] [PubMed] [Google Scholar]
  • 3.Egorova ES, Borisova A V, Mustafina LJ, Arkhipova AA, Gabbasov RT, Druzhevskaya AM, et al. The polygenic profile of Russian football players. J Sports Sci. 2014;32: 1286–1293. 10.1080/02640414.2014.898853 [DOI] [PubMed] [Google Scholar]
  • 4.Gineviciene V, Jakaitiene A, Tubelis L, Kucinskas V. Variation in the ACE, PPARGC1A and PPARA genes in Lithuanian football players. Eur J Sport Sci. 2014;14: S289–S295. 10.1080/17461391.2012.691117 [DOI] [PubMed] [Google Scholar]
  • 5.Pimenta EM, Coelho DB, Veneroso CE, Coelho EJB, Cruz IR, Morandi RF, et al. Effect of ACTN3 gene on strength and endurance in soccer players. J Strength Cond Res. 2013;27: 3286–3292. 10.1519/JSC.0b013e3182915e66 [DOI] [PubMed] [Google Scholar]
  • 6.Coelho DB, Pimenta E, Rosse IC, Veneroso C, Becker LK, Carvalho MR, et al. The alpha-actinin-3 r577x polymorphism and physical performance in soccer players. J Sport Med Phys Fit. 2016;56: 241–248. [PubMed] [Google Scholar]
  • 7.Micheli ML, Gulisano M, Morucci G, Punzi T, Ruggiero M, Ceroti M, et al. Angiotensin-converting enzyme/vitamin D receptor gene polymorphisms and bioelectrical impedance analysis in predicting athletic performances of Italian young soccer players. J Strength Cond Res. 2011;25: 2084–2091. 10.1519/JSC.0b013e31820238aa [DOI] [PubMed] [Google Scholar]
  • 8.Maciejewska-Skrendo A, Cięszczyk P, Chycki J, Sawczuk M, Smółka W. Genetic Markers Associated with Power Athlete Status. J Hum Kinet. 2019;68: 17–36. 10.2478/hukin-2019-0053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Murtagh CF, Stubbs M, Vanrenterghem J, O’Boyle A, Morgans R, Drust B, et al. Patellar tendon properties distinguish elite from non-elite soccer players and are related to peak horizontal but not vertical power. Eur J Appl Physiol. 2018;118: 1737–1749. 10.1007/s00421-018-3905-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Stafilidis S, Arampatzis A. Muscle–tendon unit mechanical and morphological properties and sprint performance. J Sports Sci. 2007;25: 1035–1046. 10.1080/02640410600951589 [DOI] [PubMed] [Google Scholar]
  • 11.Kubo K, Yata H, Tsunoda N. Effect of gene polymorphisms on the mechanical properties of human tendon structures. Springerplus. 2013;2: 343 10.1186/2193-1801-2-343 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Foster BP, Morse CI, Onambele GL, Williams AG. Human COL5A1 rs12722 gene polymorphism and tendon properties in vivo in an asymptomatic population. Eur J Appl Physiol. 2014;114: 1393–1402. 10.1007/s00421-014-2868-z [DOI] [PubMed] [Google Scholar]
  • 13.Murtagh CF, Nulty C, Vanrenterghem J, O’Boyle A, Morgans R, Drust B, et al. The Neuromuscular Determinants of Unilateral Jump Performance in Soccer Players Are Direction-Specific. Int J Sports Physiol Perform. 2018;13: 604–611. 10.1123/ijspp.2017-0589 [DOI] [PubMed] [Google Scholar]
  • 14.Gómez-Gallego F, Ruiz JR, Buxens A, Artieda M, Arteta D, Santiago C, et al. The −786 T/C polymorphism of the NOS3 gene is associated with elite performance in power sports. Eur J Appl Physiol. 2009;107: 565–569. 10.1007/s00421-009-1166-7 [DOI] [PubMed] [Google Scholar]
  • 15.Gomez-Gallego F, Santiago C, González-Freire M, Yvert T, Muniesa CA, Serratosa L, et al. The C allele of the AGT Met235Thr polymorphism is associated with power sports performance. Appl Physiol Nutr Metab. 2009;34: 1108–1111. 10.1139/H09-108 [DOI] [PubMed] [Google Scholar]
  • 16.Zarebska A, Sawczyn S, Kaczmarczyk M, Ficek K, Maciejewska-Karlowska A, Sawczuk M, et al. Association of rs699 (M235T) polymorphism in the AGT gene with power but not endurance athlete status. J Strength Cond Res. 2013;27: 2898–2903. 10.1519/JSC.0b013e31828155b5 [DOI] [PubMed] [Google Scholar]
  • 17.Gilliver SF, Degens H, Rittweger J, Sargeant AJ, Jones DA. Variation in the determinants of power of chemically skinned human muscle fibres. Exp Physiol. 2009;94: 1070–8. 10.1113/expphysiol.2009.048314 [DOI] [PubMed] [Google Scholar]
  • 18.Ahmetov II, Druzhevskaya AM, Lyubaeva E V, Popov D V, Vinogradova OL, Williams AG. The dependence of preferred competitive racing distance on muscle fibre type composition and ACTN3 genotype in speed skaters. Exp Physiol. 2011;96: 1302–1310. 10.1113/expphysiol.2011.060293 [DOI] [PubMed] [Google Scholar]
  • 19.Morin J-B, Gimenez P, Edouard P, Arnal P, Jiménez-Reyes P, Samozino P, et al. Sprint acceleration mechanics: the major role of hamstrings in horizontal force production. Front Physiol. 2015;6: 404 10.3389/fphys.2015.00404 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Egan MF, Kojima M, Callicott JH, Goldberg TE, Kolachana BS, Bertolino A, et al. The BDNF val66met polymorphism affects activity-dependent secretion of BDNF and human memory and hippocampal function. Cell. 2003;112: 257–269. 10.1016/s0092-8674(03)00035-7 [DOI] [PubMed] [Google Scholar]
  • 21.Cieszczyk P, Ostanek M, Leońska-Duniec A, Sawczuk M, Maciejewska A, Eider J, et al. Distribution of the AMPD1 C34T polymorphism in Polish power-oriented athletes. J Sports Sci. 2012;30: 31–35. 10.1080/02640414.2011.623710 [DOI] [PubMed] [Google Scholar]
  • 22.Beggs AH, Byers TJ, Knoll JH, Boyce FM, Bruns GA, Kunkel LM. Cloning and characterization of two human skeletal muscle alpha-actinin genes located on chromosomes 1 and 11. J Biol Chem. 1992;267: 9281–9288. [PubMed] [Google Scholar]
  • 23.Seto JT, Quinlan KGR, Lek M, Zheng XF, Garton F, MacArthur DG, et al. ACTN3 genotype influences muscle performance through the regulation of calcineurin signaling. J Clin Invest. 2013;123: 4255–4263. 10.1172/JCI67691 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.North KN, Beggs AH. Deficiency of a skeletal muscle isoform of α-actinin (α-actinin-3) in merosin-positive congenital muscular dystrophy. Neuromuscul Disord. 1996;6: 229–235. 10.1016/0960-8966(96)00361-6 [DOI] [PubMed] [Google Scholar]
  • 25.Gordon SE, Davis BS, Carlson CJ, Booth FW. ANG II is required for optimal overload-induced skeletal muscle hypertrophy. Am J Physiol Metab. 2001;280: E150–E159. [DOI] [PubMed] [Google Scholar]
  • 26.Sethi AA, Nordestgaard BG, Tybjærg-Hansen A. Angiotensinogen gene polymorphism, plasma angiotensinogen, and risk of hypertension and ischemic heart disease: a meta-analysis. Arterioscler Thromb Vasc Biol. 2003;23: 1269–1275. 10.1161/01.ATV.0000079007.40884.5C [DOI] [PubMed] [Google Scholar]
  • 27.Morisaki T, Gross M, Morisaki H, Pongratz D, Zöllner N, Holmes EW. Molecular basis of AMP deaminase deficiency in skeletal muscle. Proc Natl Acad Sci. 1992;89: 6457–6461. 10.1073/pnas.89.14.6457 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zigova T, Pencea V, Wiegand SJ, Luskin MB. Intraventricular administration of BDNF increases the number of newly generated neurons in the adult olfactory bulb. Mol Cell Neurosci. 1998;11: 234–245. 10.1006/mcne.1998.0684 [DOI] [PubMed] [Google Scholar]
  • 29.McKee TJ, Perlman G, Morris M, Komarova S V. Extracellular matrix composition of connective tissues: a systematic review and meta-analysis. Sci Rep. 2019;9: 1–15. 10.1038/s41598-018-37186-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Tabor HK, Risch NJ, Myers RM. Candidate-gene approaches for studying complex genetic traits: practical considerations. Nat Rev Genet. 2002;3: 391 10.1038/nrg796 [DOI] [PubMed] [Google Scholar]
  • 31.Kuntz LA, Rossetti L, Kunold E, Schmitt A, von Eisenhart-Rothe R, Bausch AR, et al. Biomarkers for tissue engineering of the tendon-bone interface. PLoS One. 2018;13: e0189668 10.1371/journal.pone.0189668 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Birk DE, Fitch JM, Babiarz JP, Doane KJ, Linsenmayer TF. Collagen fibrillogenesis in vitro: interaction of types I and V collagen regulates fibril diameter. J Cell Sci. 1990;95: 649–657. [DOI] [PubMed] [Google Scholar]
  • 33.Nakayama M, Yasue H, Yoshimura M, Shimasaki Y, Ogawa H, Kugiyama K, et al. T-786 → C mutation in the 5’-flanking region of the endothelial nitric oxide synthase gene is associated with myocardial infarction, especially without coronary organic stenosis. Am J Cardiol. 2000;86: 628–634. 10.1016/s0002-9149(00)01041-9 [DOI] [PubMed] [Google Scholar]
  • 34.Kersten S. Integrated physiology and systems biology of PPARα. Mol Metab. 2014;3: 354–371. 10.1016/j.molmet.2014.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Owens DJ, Sharples AP, Polydorou I, Alwan N, Donovan T, Tang J, et al. A systems-based investigation into vitamin D and skeletal muscle repair, regeneration, and hypertrophy. Am J Physiol Metab. 2015;309: E1019–E1031. [DOI] [PubMed] [Google Scholar]
  • 36.Colin EM, Weel AEAM, Uitterlinden AG, Buurman CJ, Birkenhäger JC, Pols HAP, et al. Consequences of vitamin D receptor gene polymorphisms for growth inhibition of cultured human peripheral blood mononuclear cells by 1, 25‐dihydroxyvitamin D3. Clin Endocrinol (Oxf). 2000;52: 211–216. 10.1046/j.1365-2265.2000.00909.x [DOI] [PubMed] [Google Scholar]
  • 37.Arai H, Miyamoto K, Taketani Y, Yamamoto H, Iemori Y, Morita K, et al. A vitamin D receptor gene polymorphism in the translation initiation codon: effect on protein activity and relation to bone mineral density in Japanese women. J Bone Miner Res. 1997;12: 915–921. 10.1359/jbmr.1997.12.6.915 [DOI] [PubMed] [Google Scholar]
  • 38.Whitfield GK, Remus LS, Jurutka PW, Zitzer H, Oza AK, Dang HTL, et al. Functionally relevant polymorphisms in the human nuclear vitamin D receptor gene. Mol Cell Endocrinol. 2001;177: 145–159. 10.1016/s0303-7207(01)00406-3 [DOI] [PubMed] [Google Scholar]
  • 39.Auton A, Abecasis GR, Altshuler DM, Durbin RM, Abecasis GR, Bentley DR, et al. A global reference for human genetic variation. Nature. 2015;526: 68–74. 10.1038/nature15393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Mirwald RL, Baxter-Jones ADG, Bailey DA, Beunen GP. An assessment of maturity from anthropometric measurements. Med Sci Sport Exerc. 2002;34: 689–694. [DOI] [PubMed] [Google Scholar]
  • 41.Murtagh CF, Nulty C, Vanrenterghem J, O’Boyle A, Morgans R, Drust B, et al. The neuromuscular determinants of unilateral jump performance in soccer players are direction-specific. Int J Sports Physiol Perform. 2018;13 10.1123/ijspp.2017-0589 [DOI] [PubMed] [Google Scholar]
  • 42.Reeves ND, Maganaris CN, Narici M V. Ultrasonographic assessment of human skeletal muscle size. Eur J Appl Physiol. 2004;91: 116–118. 10.1007/s00421-003-0961-9 [DOI] [PubMed] [Google Scholar]
  • 43.Morse CI, Degens H, Jones DA. The validity of estimating quadriceps volume from single MRI cross-sections in young men. Eur J Appl Physiol. 2007;100: 267–274. 10.1007/s00421-007-0429-4 [DOI] [PubMed] [Google Scholar]
  • 44.Chanock SJ, Manolio T, Boehnke M, Boerwinkle E, Hunter DJ, Thomas G, et al. Replicating genotype–phenotype associations. Nature. 2007;447: 655 10.1038/447655a [DOI] [PubMed] [Google Scholar]
  • 45.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995; 289–300. [Google Scholar]
  • 46.Williams AG, Folland JP. Similarity of polygenic profiles limits the potential for elite human physical performance. J Physiol. 2008;586: 113–121. 10.1113/jphysiol.2007.141887 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Maxwell SE, Delaney HD, Kelley K. Designing experiments and analyzing data: A model comparison perspective. 3rd ed New York: Routledge; 2018. [Google Scholar]
  • 48.Baumert P. The physiological and genetic factors underpinning the response to muscle damaging exercise. Ph.D. Thesis, Liverpool John Moores University. 2019. Available from: http://researchonline.ljmu.ac.uk/id/eprint/10325/
  • 49.Williams AM, Reilly T. Talent identification and development in soccer. J Sports Sci. 2000;18: 657–667. 10.1080/02640410050120041 [DOI] [PubMed] [Google Scholar]
  • 50.Sarmento H, Anguera MT, Pereira A, Araujo D. Talent identification and development in male football: a systematic review. Sport Med. 2018;48: 907–931. [DOI] [PubMed] [Google Scholar]
  • 51.Murtagh CF, Naughton RJ, McRobert AP, O’Boyle A, Morgans R, Drust B, et al. A Coding System to Quantify Powerful Actions in Soccer Match Play: A Pilot Study. Res Q Exerc Sport. 2019; 1–10. [DOI] [PubMed] [Google Scholar]
  • 52.Murtagh CF, Vanrenterghem J, O’Boyle A, Morgans R, Drust B, Erskine RM. Unilateral jumps in different directions: a novel assessment of soccer-associated power? J Sci Med Sport. 2017;20: 1018–1023. 10.1016/j.jsams.2017.03.016 [DOI] [PubMed] [Google Scholar]
  • 53.Ruiz JR, Arteta D, Buxens A, Artieda M, Gómez-Gallego F, Santiago C, et al. Can we identify a power-oriented polygenic profile? J Appl Physiol. 2010; 108, 561–566. 10.1152/japplphysiol.01242.2009 [DOI] [PubMed] [Google Scholar]
  • 54.Ginevičienė V, Pranckevičienė E, Milašius K, Kučinskas V. Relating fitness phenotypes to genotypes in Lithuanian elite athletes. Acta medica Litu. 2010;17: 1–10. 10.2478/v10140-010-0001-0 [DOI] [Google Scholar]
  • 55.Erskine RM, Williams AG, Jones DA, Stewart CE, Degens H. The individual and combined influence of ACE and ACTN3 genotypes on muscle phenotypes before and after strength training. Scand J Med Sci Sports. 2014;24(4): 642–648. 10.1111/sms.12055 [DOI] [PubMed] [Google Scholar]
  • 56.Yang N, MacArthur DG, Gulbin JP, Hahn AG, Beggs AH, Easteal S, et al. ACTN3 genotype is associated with human elite athletic performance. Am J Hum Genet 2003;73: 627–631. 10.1086/377590 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Casamichana D, Castellano J, Castagna C. Comparing the physical demands of friendly matches and small-sided games in semiprofessional soccer players. J Strength Cond Res. 2012;26: 837–843. 10.1519/JSC.0b013e31822a61cf [DOI] [PubMed] [Google Scholar]
  • 58.Faude O, Koch T, Meyer T. Straight sprinting is the most frequent action in goal situations in professional football. J Sports Sci. 2012;30: 625–631. 10.1080/02640414.2012.665940 [DOI] [PubMed] [Google Scholar]
  • 59.Kumagai K, Abe T, Brechue WF, Ryushi T, Takano S, Mizuno M. Sprint performance is related to muscle fascicle length in male 100-m sprinters. J Appl Physiol. 2000;88: 811–816. 10.1152/jappl.2000.88.3.811 [DOI] [PubMed] [Google Scholar]
  • 60.Kubo K, Kawakami Y, Fukunaga T. Influence of elastic properties of tendon structures on jump performance in humans. J Appl Physiol. 1999;87: 2090–2096. 10.1152/jappl.1999.87.6.2090 [DOI] [PubMed] [Google Scholar]
  • 61.Norman B, Glenmark B, Jansson E. Muscle AMP deaminase deficiency in 2% of a healthy population. Muscle Nerve Off J Am Assoc Electrodiagn Med. 1995;18: 239–241. [DOI] [PubMed] [Google Scholar]
  • 62.Venturelli M, Bishop D, Pettene L. Sprint training in preadolescent soccer players. Int J Sports Physiol Perform. 2008;3: 558–562. 10.1123/ijspp.3.4.558 [DOI] [PubMed] [Google Scholar]
  • 63.Brownlee TE, Murtagh CF, Naughton RJ, Whitworth-Turner CM, O’Boyle A, Morgans R, et al. Isometric maximal voluntary force evaluated using an isometric mid-thigh pull differentiates English Premier League youth soccer players from a maturity-matched control group. Sci Med Footb. 2018; 1–7. [Google Scholar]
  • 64.Sheppard JM, Young WB. Agility literature review: Classifications, training and testing. J Sports Sci. 2006;24: 919–932. 10.1080/02640410500457109 [DOI] [PubMed] [Google Scholar]
  • 65.Mills M, Yang N, Weinberger R, Vander Woude DL, Beggs AH, Easteal S, et al. Differential expression of the actin-binding proteins, α-actinin-2 and-3, in different species: implications for the evolution of functional redundancy. Hum Mol Genet. 2001;10: 1335–1346. 10.1093/hmg/10.13.1335 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Luca Paolo Ardigò

19 Nov 2019

PONE-D-19-26674

THE GENETIC PROFILE OF ELITE YOUTH SOCCER PLAYERS AND ITS ASSOCIATION WITH POWER AND SPEED DEPENDS ON MATURATION STATUS

PLOS ONE

Dear Dr Erskine,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

Please, pay particular (but not limitedly to) attention to Reviewer 1's issues.

==============================

We would appreciate receiving your revised manuscript by Jan 03 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Luca Paolo Ardigò, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

1. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

Additional Editor Comments (if provided):

Please, pay particular (but not limitedly to) attention to Reviewer 1's issues.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The author studied the association of 10 SNPs and elite athlete status and power/speed performance in 536 youth soccer players and 151 age-matched controls. The main issue of this study is founding by population stratification.

There are more Caucasians in the ESP group vs control group and many of the SNPs being investigated showed very different allele frequencies in Caucasian and African American population (e.g., MAF of rs4253778 in PPARA is 0.19 in Caucasian and 0.95 in African American). This will cause inflated Type I error rate (false positives). The analysis should either adjust for race or stratify on race. Without race information or a genomewide genotype data (to cluster the sample into different ethnic groups) the results are questionable.

Table 2. Please double check all the statistical tests. Some do not seem to be right. For example, rs4253778 in PPARA, neither the comparison of 18.8% vs 25.5% nor 25.5% vs 18.9% had p-value <0.05. Some SNPs a dominant model was assumed but some were not. Is that purely based on an allele frequency cutoff?

Table 2. When there are only two categories of a variable only one category needs to be shown in the table since the second categories carries identical information. For example, for C vs G alleles or CC/GG vs GG genotypes of PPARA rs4253778, only one allele and one genotype group need to be shown.

Table 2. Please show actual p-values. Please incorporate rs number in the table

The authors mentioned FDR to correct for multiple testing but I don’t find FDR results in the paper. It is unclear how FDR were used since there are many statistical tests in this paper, over all p-values?

It is mentioned that “three-way between-subjects ANOVA was used to investigate the association between genotype…”. This is will unbalance design. It is not clear what terms were included in the anova, two-way interaction? three-way interaction?

Page 17, line 295, please double check the p-value.

Reviewer #2: The study is well designed and presented. My only caution are the statements suggesting that the genetic information could be used for talent identification purposes. This is a controversial issue as a result of the claims by may DTC companies. I suggest that the authors focus on the genetic contribution to the studied phenotype.

Reviewer #3: 1. In the abstract, please confirm that the subjects were all male.

2. INTRODUCTION:

• In the context of athlete’s genetic studies more comprehensive review of literature is required.

• There is a lack of information about the function of 10 genes and their polymorphisms. Given that this is the premise of the research, this should be included. For example, a table could be provided describing the functions of genes and SNPs.

3. What do the authors mean with " talent identification and development programmes " ? This point should be clarified.

4. In the Results section, lines 256 and 280, please check the numbering of the tables.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Malcolm Collins

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Jun 22;15(6):e0234458. doi: 10.1371/journal.pone.0234458.r002

Author response to Decision Letter 0


6 Feb 2020

Dear Editor and Reviewers,

We are very grateful to you all for you taking the time to review our manuscript and for providing very constructive comments. We believe the changes we have made to our revised manuscript in response to these comments have significantly improved the quality of the manuscript, so thank you. We have considered all comments and responded to each one in turn.

Best wishes,

Rob Erskine on behalf of all co-authors

Editor Comments:

1. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

AUTHORS’ RESPONSE: Thank you for raising this point. We feel that these data are important for providing further confidence in our main results. Therefore, we have included descriptions of the relevant physiological measurements in the Materials and Methods section (P 12-13), and the data in the results section (Table 3; P 24), as well as correlation data between the two TGSs and muscle physiological variables on P20 and P24:

Results section, P24:

SNPs associated with skeletal muscle physiological variables

In a sub-sample of 42 post-PHV participants (ESP, n = 22; and CON, n = 20), we found associations between three SNPs [AGT A>G (rs699), COL5A1 C>T (rs12722) and PPARA G>C (rs4253778)] and quadriceps femoris muscle volume, PCSA, architecture and maximum strength (Table 3).

Table 3. Quadriceps femoris muscle strength, size and architecture in 42 post-PHV soccer players (ESP, n = 22; and CON, n = 20). Mean ± SD data are presented according to genotype for three different SNPs.

SNP Quadriceps femoris muscle physiological property

KE MVC (N∙m) QF Vm (L) QF PCSA (cm2) VL θp (�)

AGT A>G (rs699)

TT (n = 6) 252 ± 40 2.5 ± 0.2 203 ± 15 12.8 ± 1.3

TC (n = 23) 306 ± 61 2.5 ± 0.3 197 ± 31 14.9 ± 1.8

CC (n = 13) 349 ± 71** 3.0 ± 0.6*** 235 ± 48** 15.3 ± 2.5*

COL5A1 C>T (rs12722)

CC (n = 17) 345 ± 70** 2.8 ± 0.6* 225 ± 43* 14.8 ± 2.3

CT (n = 18) 297 ± 61 2.6 ± 0.3 207 ± 32 14.7 ± 2.1

TT (n = 7) 268 ± 46 2.4 ± 0.2 180 ± 26 14.6 ± 2.0

PPARA G>C (rs4253778)

GG (n = 25) 293 ± 61 2.6 ± 0.4 208 ± 34 14.4 ± 1.9

GC (n = 13) 343 ± 76 2.8 ± 0.5 204 ± 43 14.3 ± 1.9

CC (n = 4) 329 ± 59 2.9 ± 0.7 241 ± 47 17.9 ± 1.8***

KE MVC, knee extension maximum voluntary contraction; QF Vm, quadriceps femoris muscle volume; PCSA, physiological cross-sectional area; VL θp, vastus lateralis muscle fascicle pennation angle; *significantly greater than other genotypes for that respective SNP (P < 0.05); **significantly greater than other genotypes for that respective SNP (P < 0.02); ***significantly greater than other genotypes for that respective SNP (P < 0.01).

The ‘ESP’ TGS correlation data on P20:

… “Regarding muscle physiological variables in post-PHV ESP and CON (n = 42), the ‘ESP’ TGS correlated with quadriceps femoris (QF) muscle volume (r = 0.400; P = 0.010), QF PCSA (r = 0.334; P = 0.033), VL θp (r = 0.416; P = 0.007) and knee extension MVC (r = 0.408; P = 0.008). In post-PHV ESP only (n = 22), the ‘ESP’ TGS correlated with QF muscle volume (r = 0.430; P = 0.046) and QF PCSA (r = 0.438; P = 0.041), VL θp (r = 0.441; P = 0.040) and knee extension MVC (r = 0.474; P = 0.026).”

The ‘performance’ TGS correlation data on P24:

Regarding muscle physiological variables in post-PHV ESP and CON (n = 42), the ‘performance’ TGS correlated with quadriceps femoris (QF) muscle volume (r = 0.422; P = 0.006), QF PCSA (r = 0.425; P = 0.006) and knee extension MVC (r = 0.388; P = 0.012). In post-PHV ESP only (n = 22), TGS correlated with QF muscle volume (r = 0.573; P = 0.005) and QF PCSA (r = 0.566; P = 0.006).

2. Please, pay particular (but not limitedly to) attention to Reviewer 1's issues.

AUTHORS’ RESPONSE: Thank you. Once again, we are very grateful to have received such constructive comments from all three reviewers, and we have responded to all comments raised.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

________________________________________

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

AUTHORS’ RESPONSE: We have responded more comprehensively to Reviewer #1’s comments regarding our statistical approach in section 5, below. We believe our statistical analyses have been performed appropriately and extremely rigorously. This may not have been apparent in our original submission due to our desire to be succinct but, in response to Reviewer #1’s requests, we have included comprehensive information surrounding our statistical approaches in our revised manuscript, which should demonstrate the appropriateness and rigour of our statistical analyses.

Reviewer #2: Yes

Reviewer #3: Yes

________________________________________

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

________________________________________

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

________________________________________

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The author studied the association of 10 SNPs and elite athlete status and power/speed performance in 536 youth soccer players and 151 age-matched controls. The main issue of this study is founding by population stratification. There are more Caucasians in the ESP group vs control group and many of the SNPs being investigated showed very different allele frequencies in Caucasian and African American population (e.g., MAF of rs4253778 in PPARA is 0.19 in Caucasian and 0.95 in African American). This will cause inflated Type I error rate (false positives). The analysis should either adjust for race or stratify on race. Without race information or a genomewide genotype data (to cluster the sample into different ethnic groups) the results are questionable.

AUTHORS’ RESPONSE: Thank you for raising this point. In the discussion of our originally submitted manuscript, we acknowledged that the inclusion of different ethnic groups may be regarded as a limitation of our study due to these points mentioned by Reviewer #1 (although the data we have seen from the 1000 genome project demonstrates a slightly smaller difference in rs4253778 MAF, e.g. 0.192 vs. 0.745 for EUR vs. AFR).

Although the absolute number of Caucasians was higher in ESP compared to CON (the overall group size of ESP was larger than CON, regardless of ethnic group), the relative proportions of Caucasians (and other ethnic groups) within ESP and the sub-sample of CON were similar. Thus, a comparison between two groups (ESP and CON) with similar relative ethnic group composition should enable a fair comparison. As the ANOVA model compares means between groups, it is unclear to us how our current analysis would have led to type I errors.

We accounted for the imbalance in the cohort sizes between ESP and CON by ensuring that all data were checked for violations of specific three-way ANOVA assumptions that may invalidate the interpretation of results. More specifically, Mauchly's test of sphericity, which assesses whether the variance of the differences between levels of the within-subject factors are equal, was applied to establish if the homogeneity of variance principle had been violated. If Mauchly's test of sphericity was violated and the variance of the differences between levels of the within-subject factors were not equal, the Greenhouse-Geisser correction method was applied [1]. This correction method therefore accounted for the imbalance in cohort sizes when analysing three-way ANOVAs. In addition, box plots were used to check if there were any outliers (none were found); and all data sets were tested for normality using the Kolmogorov-Smirnov test. Testing for such assumptions allowed us to be confident that the interpretation of the three-way ANOVAs was robust despite different sample sizes in CON and ESP.

We have added the following to the statistical analysis section of the manuscript on P16:

We accounted for the imbalance in the cohort sizes between ESP and CON by ensuring that all data was checked for violations of specific three-way ANOVA assumptions that may invalidate the interpretation of results. More specifically, Mauchly's test of sphericity, which assesses whether the variance of the differences between levels of the within-subject factors are equal, was applied to establish if the homogeneity of variance principle had been violated. If Mauchly's test of sphericity was violated, the Greenhouse-Geisser correction method was applied [1]. To test for other assumptions, box plots were used to check if there were any outliers (none were found); and all data sets were tested for normality using the Kolmogorov-Smirnov test.

Finally, as discussed in our originally submitted manuscript, elite soccer academies in the countries included in our study do not comprise athletes from solely one ethnic group (as demonstrated from our ethnicity data), and we felt it was important that our results were representative of actual soccer academies rather than one ethnic group. Thus, we aimed to recruit our CON group with a similar composition of ethnic groups to our ESP cohort. Further, if we were able to stratify according to ethnic group, our sample sizes (which are further broken down into different maturation and genotype groups), particularly for non-Caucasian participants, would become significantly smaller, thus reducing the statistical power of our study and increasing the risk of making type II errors, i.e. finding false negatives. This would also make the study design extremely complicated. Critically, however, we do not have ethnic information for all of our control group (as originally stated in Materials and Methods), so stratifying according to ethnic group with a CON group sample size of n=59 (which would be further broken down into different maturation and genotype groups) would reduce statistical power even further. Therefore, although the reviewer raises a valid point (which we have acknowledged in the Discussion), with respect, we maintain that including ethnic group as a co-variate in a three-way between groups ANCOVA model, or analysing our data separately, according to ethnic group, would undoubtedly lead to type II errors.

Table 2. Please double check all the statistical tests. Some do not seem to be right. For example, rs4253778 in PPARA, neither the comparison of 18.8% vs 25.5% nor 25.5% vs 18.9% had p-value <0.05. Some SNPs a dominant model was assumed but some were not. Is that purely based on an allele frequency cutoff?

AUTHORS’ RESPONSE: Thank you for raising this point. We had actually reported significant differences between the allele frequency distributions you have mentioned in our original submission – please see the footnote beneath the original table. In this case, the asterisk (*) represented a significant (P < 0.05) difference between post-PHV ESP and CON, while the ampersand symbol (&) represented a significant (P < 0.05) difference between post-PHV ESP and pre-PHV ESP. However, we apologise that this was not clear. We have replaced the ampersand symbol with a dagger symbol (�) and highlighted the between group differences within each significant �2 model (please see the newly inserted text in the Table 3 header). We hope this now removes any confusion.

The reason why a dominant model was assumed was indeed based on a low MAF, for which our threshold was 0.25 in Caucasian populations (as this group was the predominant ethnic group in our study). Thus, for any SNP with a MAF <0.25 in a Caucasian population, the relatively low number of participants with the minor allele homozygote genotype was combined with the heterozygotes for that SNP. The following information has now been added to the table header:

“All SNPs were analysed using a co-dominant model (e.g. AA vs. Aa vs. aa) except when the minor allele frequency (MAF) for a Caucasian population [2] was <0.25, in which case a dominant model (e.g. AA vs Aa+aa) was used instead. P-values are presented for significant �2 tests of homogeneity and between group differences are identified (P < 0.05), i.e. * difference between All ESP and CON; ** difference between post-PHV ESP and CON; *** difference between pre-PHV ESP and CON; � difference between post- and pre-PHV ESP.”

We thank Reviewer #1 for raising these points. The addition of this information certainly helps to clarify between group differences in allele/genotype frequency distribution and why we have pooled some genotypes but not others.

Table 2. When there are only two categories of a variable only one category needs to be shown in the table since the second categories carries identical information. For example, for C vs G alleles or CC/GG vs GG genotypes of PPARA rs4253778, only one allele and one genotype group need to be shown.

AUTHORS’ RESPONSE: We are unsure of what Reviewer #1 is referring to with this point, as C vs G is not the same as CC/GG vs GG (or even CC/CG vs GG). This can be demonstrated by visually comparing allele frequency with genotype frequency distribution within the same group for any given SNP. Allele frequency distribution was analysed separately to genotype frequency distribution for each SNP, hence why we reported both the allele and genotype frequency distributions, i.e. we did not compare allele frequency with genotype frequency, so we are unsure why we would present only one allele and one genotype group. The way we have chosen to present these data is a common method of reporting allele and genotype frequency distributions for multiple SNPs.

Table 2. Please show actual p-values. Please incorporate rs number in the table

The authors mentioned FDR to correct for multiple testing but I don’t find FDR results in the paper. It is unclear how FDR were used since there are many statistical tests in this paper, over all p-values?

It is mentioned that “three-way between-subjects ANOVA was used to investigate the association between genotype…”. This is will unbalance design. It is not clear what terms were included in the anova, two-way interaction? three-way interaction?

Page 17, line 295, please double check the p-value.

AUTHORS’ RESPONSE: Thank you for raising these points. The reason we did not originally include P values in the table was to avoid including too much information, which may confuse the reader. However, we have now reformatted the table to accommodate the reviewer’s request and we agree that the addition of P values to the table is more informative.

The rs numbers were originally reported in the table heading but we have moved these to the appropriate places within the actual table, as the reviewer has requested. We agree that this avoids duplication of unnecessary information and makes the table clearer.

An earlier draft of this manuscript included more detail surrounding our FDR methods (both for our �2 tests of homogeneity and three-way between subjects ANOVAs) but we decided to remove this in an attempt to make the manuscript more concise. However, we agree that the addition of this information would make the manuscript more informative, so we have now re-instated the relevant information in the Statistics section on P15. This now explains the FDR for the �2 tests of homogeneity, the 3-way ANOVAs (associations between SNP and performance phenotypes) and the one-way ANOVAs for the newly introduced SNP associations with quadriceps femoris muscle physiological characteristics. The FDR thresholds for each SNP P value were calculated according to Benjamini and Hochberg [3].

Regarding the three-way ANOVA interactions and main effects investigated in this study, we focussed on the three-way interaction (genotype x athlete status x maturation), two-way interaction (genotype x athlete status and genotype x maturation status), and main effect of genotype. If a significant three-way interaction occurred, simple simple main effects and simple simple pairwise comparisons with Bonferroni adjustment revealed significant differences. If a significant two-way interaction occurred, simple main effects and pairwise comparisons with Bonferroni adjustment were performed to reveal differences. If a significant genotype main effect existed, Bonferroni post-hoc tests were used to locate the genotype difference in test performance scores. We have included the following in P16:

Thus, we focussed on the three-way interaction (genotype x athlete status x maturation), two-way interactions (genotype x athlete status and genotype x maturation status), and main effect of genotype. If a significant three-way interaction occurred, simple simple main effects and simple simple pairwise comparisons with Bonferroni adjustment revealed significant differences. If a significant two-way interaction occurred, simple main effects and pairwise comparisons with Bonferroni adjustment were performed to reveal differences. If a significant genotype main effect existed, Bonferroni post-hoc tests were used to locate the genotype difference in test performance scores.

Furthermore regarding the three-way ANOVAs, we accounted for the imbalance in the cohort sizes between ESP and CON by ensuring that all data was checked for violations of specific three-way ANOVA assumptions that may invalidate the interpretation of results. More specifically, Mauchly's test of sphericity, which assesses whether the variance of the differences between levels of the within-subject factors are equal, was applied to establish if the homogeneity of variance principle had been violated. If Mauchly's test of sphericity was violated and the variance of the differences between levels of the within-subject factors were not equal, the Greenhouse-Geisser correction method was applied [1]. This correction method therefore accounted for the imbalance in cohort sizes when analysing three-way ANOVAs. Box plots were used to check for outliers (none were found), and data sets were normally distributed according to the Kolmogorov-Smirnov test. Testing for such assumptions allowed us to be confident that the interpretation of the three-way ANOVA was robust despite different sample sizes in each group.

We have added the following to the statistical analysis section of the manuscript on P16:

We accounted for the larger cohort size in ESP vs. CON by ensuring that all data was checked for violations of specific three-way ANOVA assumptions that may invalidate the interpretation of results. More specifically, Mauchly's test of sphericity, which assesses whether the variance of the differences between levels of the within-subject factors are equal, was applied to establish if the homogeneity of variance principle had been violated. If Mauchly's test of sphericity was violated, the Greenhouse-Geisser correction method was applied [4]. Box plots were used to check for outliers (none were found); and all data sets were normally distributed according to the Kolmogorov-Smirnov test.

Finally, thank you for pointing out the typo in the P-value concerning lack of difference between pre- and post-PHV ESP in ‘performance’ TGS. This P-value was actually 0.227, not 0.022, which now agrees with the statement that there was no difference between pre-PHV ESP and CON (P = 0.227). We have also highlighted the lack of difference between pre-PHV ESP and CON (P = 0.609).

Once again, we are grateful to Reviewer #1 for raising all of these important points, and we feel that the changes we have made as a consequence have improved the quality of the manuscript.

Reviewer #2: The study is well designed and presented. My only caution are the statements suggesting that the genetic information could be used for talent identification purposes. This is a controversial issue as a result of the claims by may DTC companies. I suggest that the authors focus on the genetic contribution to the studied phenotype.

AUTHORS’ RESPONSE: Thank you for raising this important point. As this is the first study of its kind to investigate the genetic association with power and speed in elite youth soccer players (accounting for the potentially confounding factor of maturation), we wanted to highlight that our results provide preliminary evidence for genetic analyses being a potentially useful tool for future talent ID and development (i.e. after our results have been replicated by independent groups), and only in conjunction with traditional methods for talent ID and development. However, we completely agree with the reviewer’s concern about DTCs currently making false claims, e.g. that we are already at the point where genetic testing can be used for talent ID, injury risk, etc. Hence, our use of the words “assist” and “future” in our original concluding sentence to emphasise this point. On reflection and following the reviewer’s comment, we think this could still be misused by DTC companies as ‘current evidence’ that validates their ‘product’. Therefore, to prevent our findings being mis-represented by DTC companies, we have changed our concluding statement to the following:

“Elite soccer academies may wish to consider this information when recruiting talented pre-PHV players, with a view of developing those players for the different demands of the post-PHV game, rather than relying on the recruitment of new players at that stage of maturation. Furthermore, before genetic testing can be considered a potentially useful tool to assist in talent identification and development, replication of our results by independent groups and the identification of more genetic markers of performance is necessary.”

We have also revised our concluding statement in the abstract accordingly:

“To conclude, we have shown for the first time that pre- and post-PHV ESP have distinct genetic profiles, with pre-PHV ESP more suited for endurance, and post-PHV ESP for power and speed (the latter phenotypes being crucial attributes for post-PHV ESP). We have also demonstrated that power, acceleration and sprint performance were associated with five SNPs, both individually and in combination, possibly by influencing muscle size and neuromuscular activation.”

We hope Reviewer #2 agrees that our take home message has been satisfactorily changed to ensure it is unequivocal in terms of our study not supporting the false claims made by DTC companies.

Reviewer #3:

1. In the abstract, please confirm that the subjects were all male.

AUTHORS’ RESPONSE: Thank you for highlighting this omission. We have now included the word “male” in the Abstract.

2. INTRODUCTION:

• In the context of athlete’s genetic studies more comprehensive review of literature is required.

AUTHORS’ RESPONSE: Thank you for raising this point. As the reviewer is aware, the challenge with writing a good introduction is to include a comprehensive review of the literature to provide a solid background to the study, as well as a convincing rationale for the study, all as succinctly as possible. To do this, we decided to focus on the literature pertaining to the genetic association with elite soccer player status/performance. We have also attempted to review the current literature when introducing the SNPs of interest in the Introduction, e.g. regarding the (potential) mechanistic influence of each SNP. However, we agree with Reviewer #3’s comment that more comprehensive information could be provided regarding the SNPs investigated in the current study. Therefore, in response to this and Reviewer #3’s subsequent comment, we have included a NEW Table 2, which provides functional information on each of the 10 SNPs (plus their respective genes and proteins), supported by both classic and recent literature. We have also added the following part of a sentence and reference pertaining to a recent review of the genetic association with power and speed phenotypes [5] on P3: “Over 69 genetic markers have been associated with power athlete status [5]” and the genetic association with power and speed is likely due to specific gene variants influencing protein abundance/expression, which will in turn affect the physiological determinants of speed and power.

• There is a lack of information about the function of 10 genes and their polymorphisms. Given that this is the premise of the research, this should be included. For example, a table could be provided describing the functions of genes and SNPs.

AUTHORS’ RESPONSE: Thank you for raising this point. In our originally submitted manuscript, we introduced each of our investigated SNPs in the Introduction but we agree that more information about the individual SNPs would be beneficial. Consequently, and in response to the reviewer’s excellent suggestion, we have added this information in our NEW Table 2 on P7. In this table, we have included the gene name, the protein product, the alleles and rs number, location, MAF, and SNP type and function (supported by both classic and recent literature). We feel this provides significantly more information about each SNP, so thank you for raising this.

3. What do the authors mean with " talent identification and development programmes " ? This point should be clarified.

AUTHORS’ RESPONSE: Thank you for raising this point. Talent identification refers to the process of recognising and recruiting individuals with the perceived potential to become elite players. Talent development is a process whereby the player is provided with the optimal learning environment to achieve their full potential. Consequently, a talent development programme is the intervention programme prescribed to provide the most optimal environment to allow the player to realise their full potential [6,7]. We have now clarified “talent identification and development programmes” in the Discussion on P25:

The combination of these five SNPs explained a proportion of the variability in all performance tasks in pre-PHV ESP, but only the variability in bilateral horizontal-forward countermovement jump (BH CMJ) performance in post-PHV ESP. Together, these genetic differences suggest the soccer talent identification procedures (i.e. the process of recognising and recruiting individuals with the perceived potential to become elite players [6,7]) employed by the multiple academies in this study recruit pre- and post-PHV ESP based on different physiological criteria, i.e. endurance (pre-PHV) vs. power/speed (post-PHV) capability. Given the importance of power/speed in determining post-PHV ESP status [8–12], these findings have significant implications for future youth soccer talent identification and development programmes (i.e. intervention programmes prescribed to provide the most optimal environment to allow the player to realise their full potential [6,7]).

4. In the Results section, lines 256 and 280, please check the numbering of the tables.

AUTHORS’ RESPONSE: Thank you for pointing this out. We noticed that in some instances the COL2A1 and COL5A1 figure references were mixed up and we have now corrected the as COL2A1 is Figure 2 and COL5A1 is figure 3. We have also removed the asterisk from Figure 4 (NOS3 association with 20 m sprint), as this was a non-significant tendency for a main effect.

________________________________________

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Malcolm Collins

Reviewer #3: No

References

1. Maxwell SE, Delaney HD, Kelley K. Designing experiments and analyzing data: A model comparison perspective. 3rd ed. New York: Routledge; 2018.

2. Auton A, Abecasis GR, Altshuler DM, Durbin RM, Abecasis GR, Bentley DR, et al. A global reference for human genetic variation. Nature. 2015;526: 68–74. doi:10.1038/nature15393

3. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995; 289–300.

4. Maxwell SE, Delaney HD. Designing experiments and analyzing. Lawrence Erlbaum Associates, Hillsdale. Chapters; 2004.

5. Maciejewska-Skrendo A, Cięszczyk P, Chycki J, Sawczuk M, Smółka W. Genetic Markers Associated with Power Athlete Status. J Hum Kinet. 2019;68: 17–36. doi:10.2478/hukin-2019-0053

6. Williams AM, Reilly T. Talent identification and development in soccer. J Sports Sci. 2000;18: 657–667.

7. Sarmento H, Anguera MT, Pereira A, Araujo D. Talent identification and development in male football: a systematic review. Sport Med. 2018;48: 907–931.

8. Murtagh CF, Brownlee TE, O’Boyle A, Morgans R, Drust B, Erskine RM. Importance of Speed and Power in Elite Youth Soccer Depends on Maturation Status. J strength Cond Res. 2018;32: 297–303. doi:10.1519/JSC.0000000000002367

9. Murtagh CF, Stubbs M, Vanrenterghem J, O’Boyle A, Morgans R, Drust B, et al. Patellar tendon properties distinguish elite from non-elite soccer players and are related to peak horizontal but not vertical power. Eur J Appl Physiol. 2018;118: 1737–1749. doi:10.1007/s00421-018-3905-0

10. Murtagh CF, Nulty C, Vanrenterghem J, O’Boyle A, Morgans R, Drust B, et al. The Neuromuscular Determinants of Unilateral Jump Performance in Soccer Players Are Direction-Specific. Int J Sports Physiol Perform. 2018;13: 604–611. doi:10.1123/ijspp.2017-0589

11. Murtagh CF, Naughton RJ, McRobert AP, O’Boyle A, Morgans R, Drust B, et al. A Coding System to Quantify Powerful Actions in Soccer Match Play: A Pilot Study. Res Q Exerc Sport. 2019; 1–10.

12. Murtagh CF, Vanrenterghem J, O’Boyle A, Morgans R, Drust B, Erskine RM. Unilateral jumps in different directions: a novel assessment of soccer-associated power? J Sci Med Sport. 2017;20: 1018–1023. doi:10.1016/j.jsams.2017.03.016

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Luca Paolo Ardigò

9 Mar 2020

PONE-D-19-26674R1

THE GENETIC PROFILE OF ELITE YOUTH SOCCER PLAYERS AND ITS ASSOCIATION WITH POWER AND SPEED DEPENDS ON MATURATION STATUS

PLOS ONE

Dear Dr Erskine,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

Reviewer 1 still raises major issues: please, put all your effort to address them.

==============================

We would appreciate receiving your revised manuscript by Apr 23 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Luca Paolo Ardigò, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (if provided):

Reviewer 1 still raises major issues: please, put all your effort to address them.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Population stratification is still a big concern in this revision as commented before. The proportions of white vs black is not as similar as stated by the authors in the responses. In the paper it says “all ESP (white, 79%; black, 5%; black/white, 16%; and Asian, 1%) and a sub-sample (n = 59) of CON (white, 93%; black, 2%;mixed black/white, 3%; and Asian, 2%).” To me, 79% is pretty difference from 93%. Given that the majority of the samples are white, at least a subset analysis using only white is warranted.

Many hypotheses were tested in this paper, 10 SNPs x 5 outcome variables x 2 comparisons =100 already even before considering the two-way, three-way interactions. Adjusting for 10 tests is not enough. In addition, it is better to focus the paper on a smaller number of hypothesis. The sample size is limited in detection interaction effects, especially the three-way interaction.

Total genotype score based on SNPs that demonstrating differences is biased. Creating scores using genotypes from all 10 SNPs is not susceptible to such bias and it also reduces the number of comparisons.

The comment in the previous review: “Table 2. When there are only two categories of a variable only one category needs to be shown in the table since the second categories carries identical information. For example, for C vs G alleles or CC/GG vs GG genotypes of PPARA rs4253778, only one allele and one genotype group need to be shown.” What I meant is that if frequency of C allele is listed frequency of the G allele is just one minus that. But it is fine to leave both in. I would recommend putting actual counts in addition to frequencies in the table as well.

Reviewer #2: The revised manuscript is improved since the authors have addressed all the comments and concerns raised by the reviewers.

Reviewer #3: The authors have made good efforts to address the issues raised. The manuscript has scientific value and is suitable for publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Malcolm Collins

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Jun 22;15(6):e0234458. doi: 10.1371/journal.pone.0234458.r004

Author response to Decision Letter 1


24 Apr 2020

Dear Editor and Reviewers,

Once again, we are very grateful to you all for taking the time to review our revised manuscript and for providing feedback for a second time, so thank you.

Following your initial review of our original manuscript (submitted to PLoS ONE on 27th September 2019), we made significant changes in response to your welcomed comments, and we felt that we addressed all of them in a very comprehensive and fair manner.

We are very pleased that Reviewer #2 and Reviewer #3 have confirmed their satisfaction with our changes and that they feel that our manuscript has scientific value and is suitable for publication. We have responded to Reviewer #1’s additional comments, below.

As our manuscript will have been under review for seven months at the time of submitting our second revised manuscript, we would greatly appreciate it if this review could be expedited. Thank you for your understanding.

Best wishes,

Rob Erskine, on behalf of all co-authors

Reviewer #1:

1. Population stratification is still a big concern in this revision as commented before. The proportions of white vs black is not as similar as stated by the authors in the responses. In the paper it says “all ESP (white, 79%; black, 5%; black/white, 16%; and Asian, 1%) and a sub-sample (n = 59) of CON (white, 93%; black, 2%;mixed black/white, 3%; and Asian, 2%).” To me, 79% is pretty difference from 93%. Given that the majority of the samples are white, at least a subset analysis using only white is warranted.

AUTHORS’ RESPONSE: As we stated in our original response to Reviewer #1’s comment on population stratification, we only have ethnic origin data in 38% of the control group. Therefore, we felt that making a comparison between Caucasian only ESP (n=422) and Caucasian only CON (n=54) would be unhelpful due to significantly reduced statistical power and the inevitability of making type II errors. Given the similarities between ESP and CON in terms of anthropometry, age and maturity status, and the similar percentages of various ethnic groups within ESP and the sub-sample of CON, we maintain that our genetic comparisons are still valid and our results compelling. However, in response to Reviewer #1’s request, we have investigated genotype/allele frequency distribution in participants of known Caucasian origin (ESP, n = 422; CON, n = 54), and found that ACTN3 XX genotype frequency distribution was higher (P = 0.004) in pre-PHV ESP (32.0%) compared to both CON (13.0%) and post-PHV ESP (20.4%), which supports our total cohort data.

Consequently, we have included the following in the Discussion on P33 of the revised manuscript:

“Nevertheless, despite ethnicity information only being available in 38% of CON, sub-sample analysis in participants of known Caucasian origin (ESP, n = 422; CON, n = 54) revealed that ACTN3 XX genotype frequency distribution was higher (P = 0.004) in pre-PHV ESP (32.0%) compared to both CON (13.0%) and post-PHV ESP (20.4%). This therefore supports our total cohort data.”

2. Many hypotheses were tested in this paper, 10 SNPs x 5 outcome variables x 2 comparisons =100 already even before considering the two-way, three-way interactions. Adjusting for 10 tests is not enough. In addition, it is better to focus the paper on a smaller number of hypothesis. The sample size is limited in detection interaction effects, especially the three-way interaction.

AUTHORS’ RESPONSE: Each ANOVA model accounts for the multiple P values derived from the main effects and interaction effects within that model, i.e. main effect of genotype, 2-way interactions between genotype and maturation, 2-way interaction between genotype and athlete group, and three-way interaction between genotype, maturation and athlete group.

It is also perfectly reasonable to perform a three-way ANOVA for each dependent variable (vertical CMJ, horizontal CMJ, agility, acceleration and sprint), as each of these variables are assessments of different aspects of physical performance. Thus, it is reasonable to hypothesise that some SNPs may be associated with just one or several of these performance variables, depending on the function of the SNP(s), which is not known in all of our 10 SNPs.

We investigated the association 10 SNPs with these five performance variables. Therefore, we have performed 10 multiple comparisons within the same data set. This is why each of our FDR models (for the main effect and each of the interaction effects) included 10 P values. We feel that we have satisfactorily addressed this in our response to Reviewer #1’s initial comment regarding our statistical approach, i.e. as requested, we detailed our robust and rigorous statistical approach in our revised manuscript.

3. Total genotype score based on SNPs that demonstrating differences is biased. Creating scores using genotypes from all 10 SNPs is not susceptible to such bias and it also reduces the number of comparisons.

AUTHORS’ RESPONSE: We respectfully disagree with this comment. Firstly, the ‘performance’ TGS was calculated on the basis of a SNP being associated with any performance variable, and it could be associated solely with ESP or CON, or a particular maturation group within one or both groups. Therefore, we do not see how this TGS could be biased. Secondly, the ‘ESP’ TGS includes SNPs that had a higher/lower genotype/allele frequency distribution in one or multiple maturation groups in ESP vs. CON. We included only the associated SNPs, because they were the only ones of interest (we did not see any scientific justification for increasing the noise by including non-associated SNPs). Further, if a SNP is not associated with a dependent variable, we cannot be confident which genotype/allele is advantageous/disadvantageous. Referring to previous literature for this is not a viable solution, because ours is a completely novel population. Moreover, some of these SNPs have not been investigated in a sporting context before, making it impossible to know which is the advantageous genotype/allele. Finally, just because one SNP has a higher/lower genotype/allele frequency distribution does not mean that, when you include other SNPs in a combined TGS model, that the TGS will be higher in one cohort vs. another. This is because all ESP participants may not have all five beneficial genotypes. Indeed they did not but what was interesting (and the reason we performed the analysis this way) was that the ‘ESP’ TGS was higher in ESP than in CON, therefore providing evidence that a larger proportion of ESP had more of the beneficial genotypes than CON. Moreover (and another reason our TGS models were not biased), not all SNPs were associated with the same maturation group in ESP or CON but the TGS comparisons were total ESP TGS vs. total CON TGS.

4. The comment in the previous review: “Table 2. When there are only two categories of a variable only one category needs to be shown in the table since the second categories carries identical information. For example, for C vs G alleles or CC/GG vs GG genotypes of PPARA rs4253778, only one allele and one genotype group need to be shown.” What I meant is that if frequency of C allele is listed frequency of the G allele is just one minus that. But it is fine to leave both in. I would recommend putting actual counts in addition to frequencies in the table as well.

AUTHORS’ RESPONSE: We feel that including counts as well as percentages would clutter an already busy table. The sample sizes for each group are stated at the top of the table, and readers can calculate actual counts themselves using the percentages, if necessary.

Reviewer #2:

The revised manuscript is improved since the authors have addressed all the comments and concerns raised by the reviewers.

AUTHORS’ RESPONSE: Thank you for confirming your satisfaction with our revised manuscript. We are very grateful to you for your time and feedback, which has helped improve our manuscript.

Reviewer #3:

The authors have made good efforts to address the issues raised. The manuscript has scientific value and is suitable for publication.

AUTHORS’ RESPONSE: Thank you also for confirming your satisfaction with the extensive changes we made to our manuscript following the first revision. We are very grateful for your time and feedback too, and we have no doubt that this has helped improve our manuscript.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 2

Luca Paolo Ardigò

29 Apr 2020

PONE-D-19-26674R2

THE GENETIC PROFILE OF ELITE YOUTH SOCCER PLAYERS AND ITS ASSOCIATION WITH POWER AND SPEED DEPENDS ON MATURATION STATUS

PLOS ONE

Dear Dr Erskine,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

Please, try once more to address Reviewer 1's issues.

==============================

We would appreciate receiving your revised manuscript by Jun 13 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Luca Paolo Ardigò, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (if provided):

Please, try once more to address Reviewer 1's issues.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: (No Response)

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Regarding response to #1, not trying to be picky on words, however performing tests using Caucasians only does not “inevitably” introduce type II error since for type II error to occur the alternative hypothesis has to be true and we do not know if that is the case.

Only result of ACTN3 XX genotype is mentioned what about the other SNPs? It will be reassuring if the results of the Caucasian-analyses show similar trend, though not necessarily reaching statistical significance.

Regarding correction for multiple testing, I maintain my point of view that too many tests have been performed and the number of tests to be adjusted for should be more than 10. Having said that, the FDR adjusted p-values should be reported along with the unadjusted p-values so that the readers can weigh the evidence themselves. On line 247-248 “To control for multiple comparisons (10 SNPs and five performance variables), ….” , “and five performance variables” should be removed since they were not being controlled by the current FDR adjustment. Also a better place for this sentence is on line 241 after “main effects of genotypes”. This way it describes how overall association is evaluated before describing post-hoc comparison.

Regarding bias in “TGS”, even though the SNPs were selected based on different performance outcomes it is conceivable that these outcomes are also associated with ESP and CON group status. Therefore this preselection process can be biased.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Jun 22;15(6):e0234458. doi: 10.1371/journal.pone.0234458.r006

Author response to Decision Letter 2


29 Apr 2020

Dear Editor,

We have responded to Reviewer #1’s additional comments, below.

As our manuscript will have been under review for over seven months at the time of submitting the fourth version of our manuscript, we would greatly appreciate it if this final review could be expedited. Thank you for your understanding.

Best wishes,

Rob Erskine, on behalf of all co-authors

Reviewer #1:

1. Regarding response to #1, not trying to be picky on words, however performing tests using Caucasians only does not “inevitably” introduce type II error since for type II error to occur the alternative hypothesis has to be true and we do not know if that is the case. Only result of ACTN3 XX genotype is mentioned what about the other SNPs? It will be reassuring if the results of the Caucasian-analyses show similar trend, though not necessarily reaching statistical significance.

AUTHORS’ RESPONSE: If the sample size is small, it increases the chance of making type II errors. Only using 38% of the CON group (not because they were the only Caucasian CON participants but because they were the only ones we had ethnicity information for) severely reduces the sample size, and therefore increases the chance of making type II errors. We were willing to report the ACTN3 genotype frequency data in Caucasian only participants in response to Reviewer #1’s request, because it did not confuse the main message of the manuscript. Due to only including 38% CON, the other genotype/allele data showed a similar trend to the overall genotype/allele data but did not reach statistical significance (PPARA GG: post-PHV ESP 59.2%, pre-PHV ESP 68.9%, CON 72.2%, P = 0.069; PPARA G-allele: post-PHV ESP 77.2%, pre-PHV ESP 83.0%, CON 85.2%, P = 0.060; AGT AA: post-PHV ESP 24.6%, pre-PHV ESP 35.0%, CON 27.8%, P = 0.133; AGT A-allele: post-PHV ESP 49.3%, pre-PHV ESP 56.8%, CON 56.5%, P = 0.109; and NOS3 T-allele: ESP 63.5%, CON 57.4%, P = 0.217). However, including those under-powered data would confuse the manuscript considerably but we trust this similar trend satisfies the reviewer. We believe including the Caucasian only ACTN3 genotype frequency information in the revised manuscript is a fair compromise.

2. Regarding correction for multiple testing, I maintain my point of view that too many tests have been performed and the number of tests to be adjusted for should be more than 10. Having said that, the FDR adjusted p-values should be reported along with the unadjusted p-values so that the readers can weigh the evidence themselves. On line 247-248 “To control for multiple comparisons (10 SNPs and five performance variables), ….” , “and five performance variables” should be removed since they were not being controlled by the current FDR adjustment. Also a better place for this sentence is on line 241 after “main effects of genotypes”. This way it describes how overall association is evaluated before describing post-hoc comparison.

AUTHORS’ RESPONSE: As explained in our initial Response to reviewers’ comments document, and in the first revised manuscript, the original (“unadjusted”) P values were compared with the respective B-H critical values (line 225-229). Reporting “adjusted” B-H P values can be confusing in itself but adding this information to the original (“unadjusted”) P values would complicate and confuse the results section, and the overall manuscript. Comparing the original (unadjusted) P values with their respective B-H critical values is normal practice and we see no reason to change this. However, as per Reviewer #1’s request. we have removed “and five variables” from line 247 and moved this sentence to line 241.

3. Regarding bias in “TGS”, even though the SNPs were selected based on different performance outcomes it is conceivable that these outcomes are also associated with ESP and CON group status. Therefore this preselection process can be biased.

AUTHORS’ RESPONSE: For a full rebuttal to this comment, please refer to our previous Response to reviewers’ comments document. In brief, the ‘performance’ TGS comprised SNPs that were associated with different performance variables in CON and/or ESP at different stages of maturity. Therefore, we maintain there was no bias.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 3

Luca Paolo Ardigò

28 May 2020

THE GENETIC PROFILE OF ELITE YOUTH SOCCER PLAYERS AND ITS ASSOCIATION WITH POWER AND SPEED DEPENDS ON MATURITY STATUS

PONE-D-19-26674R3

Dear Dr. Erskine,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Luca Paolo Ardigò, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

In my opinion, manuscript deserves publication in spite of not unanimous acceptance recommendation by all three expert reviewers. Congratulations to authors.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: (No Response)

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Acceptance letter

Luca Paolo Ardigò

4 Jun 2020

PONE-D-19-26674R3

THE GENETIC PROFILE OF ELITE YOUTH SOCCER PLAYERS AND ITS ASSOCIATION WITH POWER AND SPEED DEPENDS ON MATURITY STATUS

Dear Dr. Erskine:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Luca Paolo Ardigò

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    (XLSX)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Attachment

    Submitted filename: Response to Reviewers.docx

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES