Abstract
The present study was designed to evaluate the test-retest consistency of repetition maximum tests at standardized relative loads and determine the robustness of strength-endurance profiles across test-retest trials. Twenty-four resistance-trained males and females (age, 27.4 ± 4.0 y; body mass, 77.2 ± 12.6 kg; relative bench press one-repetition maximum [1-RM], 1.19 ± 0.23 kg•kg-1) were assessed for their 1-RM in the free-weight bench press. After 48 to 72 hours, they were tested for the maximum number of achievable repetitions at 90%, 80% and 70% of their 1-RM. A retest was completed for all assessments one week later. Gathered data were used to model the relationship between relative load and repetitions to failure with respect to individual trends using Bayesian multilevel modeling and applying four recently proposed model types. The maximum number of repetitions showed slightly better reliability at lower relative loads (ICC at 70% 1-RM = 0.86, 90% highest density interval: [0.71, 0.93]) compared to higher relative loads (ICC at 90% 1-RM = 0.65 [0.39, 0.83]), whereas the absolute agreement was slightly better at higher loads (SEM at 90% 1-RM = 0.7 repetitions [0.5, 0.9]; SEM at 70% 1-RM = 1.1 repetitions [0.8, 1.4]). The linear regression model and the 2-parameters exponential regression model revealed the most robust parameter estimates across test-retest trials. Results testify to good reproducibility of repetition maximum tests at standardized relative loads obtained over short periods of time. A complementary free-to-use web application was developed to help practitioners calculate strength-endurance profiles and build individual repetition maximum tables based on robust statistical models.
Introduction
Dynamic strength endurance has previously been defined as the amount of concentric work an individual can produce in a cyclic or repetitive movement [1]. Assuming that the range of motion is approximately constant for each repetition of a given resistance training exercise, strength endurance can therefore be described by the number of repetitions performed to momentary failure (RTF) at a given load for a single sustained trial [1,2]. The evaluation of strength endurance by means of a repetition maximum test (occasionally also called repetition endurance test) usually involves an exercise being performed to momentary failure at either a fixed absolute load, expressed in a unit of mass like kg or lbs, or a fixed relative load that has been normalized to the exercise-specific one-repetition maximum (1-RM). The concept is widely applied by coaches to guide resistance training programming [1,3,4]. However, given the fact that resistance training is usually carried out across a wider spectrum of loads, assessing the RTF an individual can execute at a single load only provides limited insight into a person’s fatigue resistance. More meaningful insights into strength endurance could be obtained by studying the relationship between load and RTF (i.e., the individual “strength-endurance profile”). Additionally, knowledge of the mathematical relationship between the two variables could be used by practitioners to predict the load associated with a certain repetition maximum. This may be of particular interest for individuals seeking to control intensity of effort within a set [5] by prescribing a certain percentage of the maximum load that can be used for a given number of repetitions [6]. While other methods have been proposed to evaluate or control intensity of effort based on perceived effort or movement velocity [7], an approach using strength-endurance profiles might overcome certain limitations of these methods. Such limitations include inappropriate anchoring of perception [5], inaccurate subjective estimates of repetitions in reserve at lower intensity of effort [8] and dependency on technology to provide reliable feedback on movement velocity [9].
The relationship between load and RTF can be expressed through simple bivariate models. Thus far, research has proposed models that describe either a linear [10–12] or an exponential relationship [11–14]; usually, the respective model equations are then rearranged to predict the 1-RM from a repetition maximum test. However, studies conducted to test the validity of these equations often showed poor predictive accuracy, especially when the applied repetition maximum test was executed at loads allowing for 10 repetitions or more [3,12,13,15–17]. The poor validity may be related to substantial inter-individual differences in strength-endurance relationships that models not incorporating the responsible confounding factors fail to account for. Indeed, there is evidence that the amount of repetitions that can be performed at a given relative load, and hence the strength-endurance relationship, may depend on various factors, such as qualitative and quantitative training background [11,18–20], fiber type composition and the capillary density of involved muscles [21,22], exercise [12,19,23] and movement cadence [14,24]. A possible solution to overcome these challenges in modeling strength-endurance relationships has been proposed by Morton and colleagues [25] who introduced the idea of creating subject-specific models, thereby treating the individual person as the population of interest. For this purpose, the authors reformulated the critical power model originally proposed by Monod and Scherrer [26] such that it may be applied to isoinertial resistance training exercises. The resulting model has recently been referred to as critical load model and was originally presented as a non-linear function featuring three parameters [25,27].
While the individualized modeling approach may reduce variance resulting from uncontrolled confounding variables, such models are typically estimated from a limited number of available data due to the exhaustive nature of sets performed to failure [25,27]. Hence, the estimation of model parameters can be strongly affected by variability in test results, as single data points tend to have a larger influence on parameter estimates in small samples compared to large samples. Therefore, the robustness of individual strength-endurance models over short periods is crucial for their application in practice. The present study was designed to target two objectives: 1) to evaluate the consistency of the RTF at standardized relative loads and 2) to compare the reproducibility of four recently proposed models describing the individual strength-endurance relationship. A complementary, freely available web application will be provided to allow practitioners to easily calculate strength-endurance profiles based on a model that can be considered sufficiently robust to help with the design and regulation of resistance training programs.
Materials and methods
Subjects
Fifteen resistance-trained men (age = 27.2 ± 3.3 yrs, body mass = 85.4 ± 7.9 kg, bench press 1-RM/body mass = 1.33 ± 0.11 kg•kg-1) and nine resistance-trained women (age = 27.7 ± 5.2 y, body mass = 63.6 ± 3.3 kg, bench press 1-RM/body mass = 0.96 ± 0.17 kg•kg-1) volunteered to be tested for the present study. In order to participate, subjects had to be between 18 and 40 years of age, free of illness and injury and have at least one year of training experience in the bench press exercise as well as a 1-RM corresponding to at least 1x body mass for men and 0.75x body mass for women, respectively. Prior to physical testing, participants were informed about the possible risks, had to complete a modified physical activity readiness questionnaire (PAR-Q) and sign an informed consent form. The study was designed in fulfillment of the ethical guidelines communicated in the Declaration of Helsinki and approved by the host institution’s local ethical committee (no. 00461).
Experimental approach
A test-retest design was used to determine the participants’ maximum strength and strength-endurance at high loads in the free-weight bench press exercise on two occasions (T1 & T2) separated by one week (Fig 1). Maximum strength was assessed according to a progressive 1-RM test. Strength-endurance was assessed using repetition maximum tests at 90%, 80% and 70% of the 1-RM, respectively. In order to provide some rest, the 1-RM test and the repetition maximum tests were executed on two different days, separated by 48 to 72 hours, resulting in a total of four visits to the laboratory within 11 days. Importantly, the relative loads applied for the repetition maximum tests during the fourth visit were adjusted to the 1-RM achieved during the third visit. Consequently, a change in the 1-RM between T1 and T2 also implied a change in the absolute load used for the repetition maximum tests at T2, in order to ensure that trials were performed at 90%, 80% and 70% of the current 1-RM. All tests were completed at the same time of day.
Fig 1. Experimental design.
1-RM, one-repetition maximum; RTF, repetitions performed to momentary failure.
Procedures
On the first day, subjects completed preliminary health screening and filled in a physical activity form to evaluate their experience with the tested exercise. Body height and body mass were assessed using a stadiometer (Seca Model 217; SECA GmbH & Co. KG., Hamburg, Germany) and scale (Seca Model 877; SECA GmbH & Co. KG., Hamburg, Germany). Participants then completed a standardized warm-up consisting of cycling for 5 min at a constant power output of 1 W per kg body mass and a rotational velocity of 80 rpm on an ergometer (Kettler X1, Trisport, Huenenberg, Switzerland), followed by a brief dynamic upper body mobilization routine. Subsequently, they were familiarized with the standardized movement technique for the bench press: each subject had to lower the barbell onto two safety pins, which were individually adjusted to a height that would allow for up to 3 cm of vertical distance between the bottom barbell position and the subject’s chest. An experienced staff member visually ensured that the barbell was placed on the safety pins without rebound, before giving the verbal command “Press!”, signaling the subject to execute the concentric phase of the bench press at maximum voluntary velocity. Participants were required to maintain their hip, shoulders and head positioned on the bench and their feet placed on the floor during each set.
Upon completion of the familiarization, participants were requested to estimate their 1-RM based on self-evaluation of their recent training performance. The subsequent 1-RM test featured a progressive loading pattern with the first five loads being fixed at 25%, 50%, 75%, 85% and 95% of the estimated 1-RM, while mean concentric barbell velocity was recorded with a linear position transducer (GymAware Power Tool, Kinetic Performance Technologies, Canberra, Australia). In the initial set, three repetitions were performed, followed by a 3-min break. Two repetitions were performed once the highest achieved velocity of the preceding set fell below 1.0 m/s, followed by a 4-min break and single repetitions were performed once it fell below 0.65 m/s, followed by a 5-min break. After successfully completing 95% of the estimated 1-RM, loads were increased individually to approximate the true 1-RM. The 1-RM was considered to be determined once a load increment of 2.5 kg from the preceding set would no longer allow the subject to complete the exercise across the full range of motion.
Repetition maximum tests were completed at 90%, 80% and 70% of the identified 1-RM, respectively, in the form of a single-visit protocol. Barbell loads were not randomized, but prescribed in a descending scheme, to minimize systematic effects of accumulated fatigue on the performance during subsequent sets [28]. In order to provide extended time for recovery in between repetition maximum tests, yet sustain warm-up effects during these periods, participants underwent the same general warm-up procedure that was used for the 1-RM test prior to each set to failure. Additionally, they performed a specific warm-up including three repetitions at 25%, three repetitions at 50% and two repetitions at 75% 1-RM prior to each set to failure. A passive rest of 3 min was provided between warm-up sets and additional 5 min before and immediately after each set to failure. Due to this methodological structure (i.e., the standardized warm-up; the standardized passive rest before and after each set to failure), the repetition maximum tests were separated by approximately 22 min each. Criteria for movement execution were kept identical to those communicated for the 1-RM test. Participants were instructed to lower the barbell in a controlled fashion on each repetition, albeit not being prescribed a fixed movement cadence. Similar to the 1-RM test, participants had to await the verbal command of the staff member before initiating the concentric phase of a repetition, in order to avoid any rebound from the safety pins. The concentric phase of each repetition had to be performed at maximum intended velocity. A repetition maximum test was terminated once the participant was unable to complete another repetition across the full range of motion despite using maximal effort, suggesting that the point of momentary failure had been reached [5].
Statistical analysis
Reproducibility of performance measures
Statistics were calculated following a Bayesian approach using weakly informative priors. To assess test-retest reliability of the 1-RM and RTF at 90%, 80% and 70% 1-RM, respectively, the following random-intercept mixed effects model was used:
[1] |
In this model, Pij describes the analyzed performance measure as a dependent variable, μ describes the mean performance for T1, si the random deviation from μ for subject i, Δt the fixed effect of time (i.e., the systematic difference in performance between T2 and T1), Dj a binary dummy variable for trial j, and σe2 the variance of model residuals. The random effect parameter si was considered to be sampled from a normal distribution with a mean of 0 and a variance of σs2, as suggested by Baumgartner and colleagues [29]. Posterior distributions for each model parameter were sampled using the Hamiltonian Monte Carlo algorithm of the probabilistic programming language Stan [30] controlled through an R interface (rstan R package, version 2.21.2). Based on the resulting random-intercept models, relative consistency (reliability) of each performance measure was evaluated using the Intraclass Correlation Coefficient (ICC), which was estimated and interpreted as the proportion of total variance (σs2 + σe2) attributed to the variance among subjects (σs2) [29]. Furthermore, absolute consistency (agreement) of performance was quantified using the Standard Error of Measurement (SEM = σe), Within-Subject Coefficient of Variation (WSCV = SEM / μ) and Standard Error of Prediction (SEP = SD(1-ICC2)(1/2)) [29,31,32]. Posterior distributions of the statistics were summarized and interpreted according to the Maximum a Posteriori point estimate (MAP) and 90% Highest Density Interval (HDI) [33]. Effect directions supported by at least 90% of posterior probability were considered “clear” or “likely”.
Reproducibility of strength-endurance models. To describe the relationship between relative load and RTF with respect to individual trends, four previously proposed model types were expressed according to a multilevel (mixed effects) structure:
[2] |
[3] |
[4] |
[5] |
Eq 2 (Lin) models the relationship as a linear regression. Eqs 3 (Ex2) and 4 (Ex3) both describe exponential regression models, where Ex3 follows the structure of a commonly proposed 3-parameters model [11,12,14] and Ex2 constitutes a simplified 2-parameters version without the additive parameter ci [13]. Eq 5 (Crit) presents the previously described critical load model adapted for relative load as dependent variable, using the original parameter labels L’, k and CL [25,27]. To evaluate how much parameter estimates for Eqs 2 to 5 differ between T1 and T2, a change effect was added for each of the abovementioned subject-level parameters. For example, the parameter expression ai was extended to (ai + Δai Dj), where ai reflects the target parameter at T1, Δai reflects the change effect (difference) of the target parameter between T2 and T1 and Dj reflects a binary dummy variable for trial j. Importantly, all of the abovementioned parameters were modeled as random effects that were free to vary across subjects. The multilevel structure was realized by sampling subject-level parameters and change effects from multivariate normal distributions, applying covariance matrices to account for possible correlations among subject-level parameters and change effects, respectively. Further details on models and prior selection are provided online (Supporting information 1 in S1 Appendix).
A posterior predictive distribution was calculated by drawing random samples from the respective group-level (fixed effects) distribution of each change effect and the draws were standardized to the scale of the associated model parameter at T1. The resulting posterior predictive distributions were summarized and compared to a threshold for acceptable differences that was set at ±0.6, reflecting a small or trivial standardized change of the parameter [34]. Change effects were also expressed as a percentage of the group-level mean of the associated model parameter at T1 to facilitate the interpretation of parameters that are exceptionally homogeneous across subjects.
Results
The variability of 1-RM performance as well as the RTF performed at 90%, 80% and 70% 1-RM is shown in Fig 2. On average, there was an increase in performance between T1 and T2 (Δt), the 90% HDI suggesting a small systematic increase of the 1-RM, the RTF at 80% 1-RM and the RTF at 70% 1-RM. Regarding relative consistency of performance, the 1-RM yielded nearly perfect reliability, with the ICC being close to 1. The RTF, on the other hand, showed a trend for higher relative consistency at lower loads, although the difference between load conditions was not statistically clear at the 90% credibility level. Analysis of absolute consistency revealed the SEM for the 1-RM to be likely less than 2.2 kg (90th percentile). Concerning the RTF performed at submaximal loads, the SEM was likely less than 1 repetition at 90% and 80% 1-RM, and likely less than 1.5 repetitions at 70% 1-RM. Subject performance and consistency statistics are summarized in Table 1.
Fig 2. Variability of strength performance in the bench press.
A, one-repetition maximum (1-RM); B, repetitions performed to momentary failure (RTF) at 90% 1-RM; C, RTF at 80% 1-RM; D, RTF at 70% 1-RM; grey circles, data points (jittered illustration); black circles, group means; solid grey lines, individual performance changes; dashed black lines, systematic performance changes (Δt).
Table 1. Consistency statistics for strength performance in the bench press.
1-RM (kg) | RTF at 90% 1-RM (n) | RTF at 80% 1-RM (n) | RTF at 70% 1-RM (n) | |
---|---|---|---|---|
Performance | ||||
T1 | 93.5 ± 28.9 | 4.2 ± 1.2 | 7.8 ± 1.7 | 12.2 ± 2.6 |
T2 | 95.4 ± 29.9 | 4.5 ± 1.1 | 8.5 ± 1.4 | 12.9 ± 2.6 |
Δt | 1.9 [1.0, 2.7] | 0.2 [-0.1, 0.6] | 0.7 [0.3, 1.0] | 0.7 [0.2, 1.2] |
Absolute consistency | ||||
SEM | 1.7 [1.4, 2.3] | 0.7 [0.5, 0.9] | 0.7 [0.6, 1.4] | 1.1 [0.8, 1.4] |
WSCV (%) | 1.8 [1.4, 2.5] | 15.9 [12.3, 21.3] | 9.2 [7.2, 12.2] | 8.8 [6.9, 11.8] |
SEP | 2.3 [1.6, 3.3] | 0.9 [0.7, 1.1] | 0.9 [0.7, 1.9] | 1.4 [1.0, 1.9] |
Relative consistency | ||||
ICC | 1.00 [0.99, 1.00] | 0.65 [0.39, 0.83] | 0.82 [0.64, 0.93] | 0.86 [0.71, 0.93] |
Sample data are presented as mean ± standard deviation.
Statistics are presented as Maximum a Posteriori estimate [90% Highest Density Interval].
1-RM, one-repetition maximum; Δt, fixed effect of time; ICC, interclass correlation coefficient; RTF, repetitions performed to momentary failure; SEM, standard error of measurement; SEP, standard error of prediction; T1, baseline test; T2, retest; WSCV, within-subject coefficient of variation.
Posterior predictive distributions of subject-level model parameters at T1 and T2 are summarized in Table 2. Moreover, posterior predictive distributions for standardized change effects are shown in Fig 3. The critical load model revealed a systematic positive change effect for L’ [p (ΔL’i > 0 | data) > 99.9%] and systematic negative change effects for k [p (Δki < 0 | data) > 99.9%] and CL [p (ΔCLi < 0 | data) = 97.3%]. Similarly, the 3-parameters exponential model showed a systematic positive change effect for c [p (Δci > 0 | data) = 99.7%] and systematic negative change effects for a [p (Δai < 0 | data) = 99.6%] and b [p (Δbi < 0 | data) = 96.4%]. None of the remaining models’ parameters resulted in a clear positive or negative change at the 90% credibility level. No model parameter resulted in a clearly small or trivial change at the chosen credibility level and threshold for acceptable differences. However, the slope parameter of the linear model (b) and the curvature parameter of the 2-parameters exponential model (b) indicated a probability of >80% for the change effect to be small or trivial. Furthermore, both intercept parameters (a) of the linear model and the 2-parameters exponential model indicated relative change effects close to 0 (Table 3).
Table 2. Summary of posterior predictive distributions of absolute parameter values during test (T1) and retest (T2).
Model | Parameter | T1 | T2 | Δxi (T2 –T1) |
---|---|---|---|---|
Lin | a | 101.5 [100.3, 102.5] | 101.9 [100.4, 103.4] | 0.4 [-0.8, 1.6] |
b | -2.73 [-3.77, -1.68] | -2.56 [-3.64, -1.47] | 0.09 [-0.23, 0.47] | |
Ex2 | a | 102.6 [101.6, 103.8] | 102.9 [101.5, 104.4] | 0.3 [-0.9, 1.5] |
b | -0.031 [-0.044, -0.020] | -0.030 [-0.043, -0.017] | 0.002 [-0.002, 0.006] | |
Ex3 | a | 76.3 [65.1, 95.3] | 63.4 [55.0, 75.2] | -12.7 [-25.1, -4.5] |
b | -0.045 [-0.068, -0.022] | -0.054 [-0.080, -0.032] | -0.010 [-0.021, -0.001] | |
c | 27.3 [7.2, 38.1] | 40.7 [27.9, 48.8] | 13.7 [5.2, 25.9] | |
Crit | L’ | 3638.8 [2062.5, 6422.1] | 4583.0 [2637.9, 7271.6] | 613.7 [287.6, 1129.9] |
k | -32.0 [-47.5, -20.5] | -36.5 [-52.0, -24.3] | -4.1 [-6.6, -2.0] | |
CL | -17.4 [-43.4, 3.6] | -23.5 [-48.5, -2.5] | -3.3 [-8.8, 0.0] |
Posterior predictive distributions are summarized using the Maximum a Posteriori estimate and 90% Highest Density Interval.
Crit, critical load model; Ex2, exponential model (2 parameters); Ex3, exponential model (3 parameters); Lin, linear model; Δxi, change effect between T1 and T2.
Fig 3. Posterior predictive distributions for standardized subject-level change effects (smoothed illustration).
Dashed black lines, threshold for acceptable differences set to [-0.6, 0.6] indicating small or trivial changes; *, change effects Δa and Δc of the exponential 3-parameters model are not visibly displayed due to very large scales.
Table 3. Summary of posterior predictive distributions of relative and standardized change effects.
Model | Change effect | Relative magnitude (%) * | Standardized magnitude ** | p (Δxi ∈ [-0.6, 0.6] | data) ** |
---|---|---|---|---|
Lin | Δa | 0.3 [-0.8, 1.6] | 0.38 [-4.46, 8.98] | 27.6% |
Δb | 4.8 [-8.9, 17.2] | 0.17 [-0.38, 0.78] | 86.9% | |
Ex2 | Δa | 0.4 [-1, 1.4] | 0.13 [-4.43, 9.35] | 29.5% |
Δb | 4.6 [-7.9, 18.1] | 0.22 [-0.36, 0.8] | 84.8% | |
Ex3 | Δa | -19.1 [-28.5, -7.8] | -15.26 [-184.85, 0.71] | 0.2% |
Δb | -20.4 [-48.5, -0.7] | -0.83 [-2, -0.04] | 26.1% | |
Δc | 42.1 [0.4, 217.4] | 18.34 [0.05, 232.88] | 0.1% | |
Crit | ΔL’ | 14.6 [6.7, 29.6] | 0.76 [0.24, 1.83] | 18.8% |
Δk | -12.1 [-22.1, -6] | -0.63 [-1.26, -0.24] | 36.2% | |
ΔCL | -10.9 [-129.6, 7.8] | -1.2 [-6.4, 0.52] | 11.5% |
Posterior predictive distributions are summarized using the Maximum a Posteriori estimate and 90% Highest Density Interval.
*, change effects are expressed relative to the group-level mean of the associated model parameter at T1.
**, change effects are standardized to the group-level standard deviation of the associated model parameter at T1. Crit, critical load model; Ex2, exponential model (2 parameters); Ex3, exponential model (3 parameters); Lin, linear model; p (Δxi ∈ [-0.6, 0.6] | data), probability of the standardized change effect falling within the threshold for acceptable differences given the data.
Discussion
The present study was designed to address two objectives: first, we evaluated the reliability and agreement of RTF performed at 90, 80 and 70% 1-RM in the bench press exercise. Second, we aimed to analyze the reproducibility of four different models representing the individual strength-endurance relationship to identify which ones provide the most robust parameter estimates. Test-retest analysis of performance indicated very good reproducibility of the 1-RM and the RTF at high relative loads in the bench press exercise. The linear regression and the 2-parameters exponential regression yielded the most robust parameter estimates across the investigated models of the strength-endurance relationship.
The 1-RM revealed both very high relative and absolute consistency. In particular, the SEM for the 1-RM was found to be likely less than the smallest load increment applied during the 1-RM assessment in the present study (2.5 kg). These findings correspond to previous research reporting excellent reliability of 1-RM performance in the bench press exercise [4,35,36]. Similarly, the RTF at 90, 80 and 70% 1-RM revealed high absolute consistency, the SEM likely being less than 1.5 repetitions at 70% 1-RM, and less than 1 repetition at 90% and 80% 1-RM. Posterior distribution analysis revealed no systematic differences of SEM between RTF performed at 70%, 80% and 90% 1-RM. However, a slight shift of SEM posterior distributions to lower values could be observed for RTF at higher relative loads. In particular, the difference of SEM between RTF at 70% and 90% 1-RM could have exceeded the predefined threshold for systematic differences at a larger sample size. Interestingly, the ICC showed an opposing non-systematic shift of posterior distributions, with lower relative loads resulting in slightly larger ICC values. These seemingly contradictory trends arising from absolute and relative consistency might be related to the computation of the respective statistics: in the present study, the ICC was calculated as the proportion of total variance attributed to the variance among subjects. Therefore, it tends to be smaller when between-subject variance is low and SEM is large. Indeed, our data suggest a higher between-subject variance of the RTF at 70% 1-RM compared to 90% 1-RM. A similar trend for heteroscedasticity in the relationship between relative load and RTF across individuals (i.e. a mean-variance “tradeoff”) has been reported on numerous occasions [3,11,12,14,18,19,37,38]. This phenomenon could be the result of normalizing the load to the 1-RM, which homogenizes the upper end of the load spectrum. However, it could also be partially explained by inter-individual differences in the strength-endurance relationship.
Conforming trends for the reliability of the RTF performed at given relative loads can be observed from other sources. For example, Anders and colleagues reported an ICC of 0.90 (95% CI: [0.58, 0.97]) for RTF completed at 70% 1-RM in the bench press [4], indicating a similar magnitude compared to the present study (ICC [90% HDI] = 0.86 [0.71, 0.93]). While the reported SEM of 0.68 repetitions was noticeably lower compared to the present study, the authors also described a lower between-subject standard deviation of ±1.5 repetitions. Similarly, Pereira and colleagues reported an ICC of 0.90 for the RTF achieved at 75% 1-RM in the bench press, when performing repetitions at a joint velocity of 100°/s. While no information on subject heterogeneity was provided, the authors also reported an ICC of 0.70 when the exercise was completed at a joint velocity of 25°/s. It could be hypothesized that the reduced movement cadence might have negatively affected the number of repetitions performed [14,24], possibly due to an increased duration of the concentric phase of each repetition and associated increases in metabolic demand [39]. Hence, a reduced movement cadence at lower loads could result in a distribution of RTF that is similar to the RTF at higher loads when repetitions are performed at maximal voluntary velocity, as was the case in the present study.
Other studies investigated the reproducibility of RTF in absolute loads. For example, Mann et al. analyzed the test-retest reliability of NCAA Division I football players in the NFL-225 test, which is a repetition maximum test using a fixed load of 225 lbs or 102.3 kg in the bench press exercise [40]. The authors reported an ICC of 0.98 to 0.99 and a typical error of 1.0 to 1.3 repetitions across three trials, the typical error corresponding to what has been calculated as SEM in the present study. While it is difficult to evaluate at what percentage of the 1-RM each participant performed the NFL-225 test in the absence of a 1-RM test, the authors estimated it to be around 67.9% 1-RM for athletes with a body mass below 100.5 kg and around 44.6% 1-RM for heavier athletes. Therefore, the majority of participants performed the NFL-225 test at lower relative loads compared to the present study. Given this fact, the reports of Mann et al. [40] correspond well to the results of the present study (SEM for RTF at 70% 1-RM [90% HDI] = 1.1 [0.8, 1.4] repetitions), especially when considering the large between-subject variance reported by the authors, which may have contributed to the large ICC, as discussed before. Finally, Rose and Ball analyzed the reliability of the RTF that could be achieved against 15.9 kg and 20.4 kg, reporting an ICC of 0.97 in both cases [36]. In their sample of 21 moderately trained women the two tested loads corresponded roughly to a mean relative load of 42% and 54% 1-RM, which supports the hypothesis of RTF tests showing higher relative consistency at low loads.
A systematic increase in the 1-RM between test and retest has previously been described on numerous occasions for various exercises [41]. Interestingly, Ribeiro and colleagues reported that this time effect did not interact significantly with participants’ experience in resistance training [42]. While the magnitude of the systematic change (Δt [90% HDI] = 1.9 kg [1.0, 2.7]) could be considered trivial in the present study, given the smallest load increment was 2.5 kg, previous research suggested that the effect may occur over the course of multiple consecutive retest trials as a result of practicing the test [42–44]. Similarly, the time effect of RTF performed at 90%, 80% and 70%-1RM showed a high probability for being less than 1 repetition. Despite the RTF at 80% and 70% 1-RM indicating a systematic difference between T1 and T2, the magnitude of this effect is likely trivial.
To the best of our knowledge, this is the first study to evaluate and compare the reproducibility of different strength-endurance models with respect to individual trends. Not all of the investigated models resulted in robust parameter estimates over time. Most notably, the 3-parameters exponential model and the critical load model exhibited systematic changes for all parameters. These findings suggest that naturally occurring variability in strength performance likely causes parameter estimates to systematically change, even over short periods, and that the magnitude of these changes is unacceptably high in relation to the respective parameter’s group-level standard deviation. Therefore, the two models may not provide sufficient reproducibility for application in the practical field. In comparison, the linear model and the 2-parameters exponential model both resulted in a high probability for Δb (i.e., the change in slope and curvature parameters, respectively) to fall within the threshold for acceptable differences, although the effects were not clear at the selected credibility level. No clear change effect could be identified for the intercept parameter a in both cases due to low between-subject variability. However, findings suggest a negligible relative magnitude for Δa in both models (Table 3). Therefore, both the linear model and 2-parameters exponential model yield the most robust parameter estimates across test-retest trials among the investigated models. To decide which of the two models to apply in a practical setting, practitioners should also consider statistical qualities other than the robustness of models. For example, both the model fit und predictive validity can be considered essential characteristics of a valuable strength-endurance profile. While previous research provided some evidence that the relationship may be considered approximately linear at high loads [3,10–12], it has been suggested that the relationship actually follows a curvilinear trend when considering the full spectrum of loads [11,13,14]. Therefore, practitioners might want to resort to applying the 2-parameters exponential regression rather than the linear regression to model strength-endurance profiles, as research has not proposed any explicit disadvantages reasoning against its use.
Based on the findings of the present study, a freely available web application was developed using the R package shiny (version 1.7.1). The application provides practitioners with a user-friendly interface to enter data from repetition maximum tests and offers different algorithms to compute the individual and exercise-specific strength-endurance profile. Upon computation, it offers a graphical display of the profile, a model equation and an adjusted R2 estimate to evaluate model fit. Furthermore, it produces an individual repetition-maximum table based on the estimated model parameters that predicts loads for a wider spectrum of RTF. A link to the web application is provided at the end of this article.
It should be pointed out that the order of repetition maximum tests was not randomized in the present study. Hence, a possible systematic effect of the earlier sets performed to momentary failure on subsequent sets and, thus, the presence of systematic bias in the RTF performed cannot be excluded. Future research should strive to compare different test protocols and identify a valid, yet practically applicable approach to acquiring the necessary data for model computation. However, the results of the present study may help practitioners understand the consistency of strength performance under standardized conditions and can assist with the selection of a reliable statistical model to calculate individual strength-endurance profiles.
Conclusions and practical applications
In conclusion, both the 1-RM and RTF at 90%, 80% and 70% 1-RM showed good reproducibility over test-retest trials in the bench press exercise for trained subjects. When modeling the relationship between load and RTF using a multilevel structure, the linear regression and 2-parameters exponential regression provide more stable parameter estimates than the 3-parameters exponential regression or critical load model.
To calculate a strength-endurance profile for a given individual and specific exercise, it is recommended to acquire the maximum number of repetitions that can be performed to momentary failure against three different loads. While the loads should be chosen according to a range of interest, practitioners should expect to experience higher absolute day-to-day variability of RTF at lower loads. For loads in the range of 70% - 100% 1-RM, a linear regression or a 2-parameters exponential regression should be applied to reliably model the relationship between tested loads and the number of achieved repetitions. To derive a robust strength-endurance profile, practitioners can access a free-to-use web application using the following link: https://strength-and-conditioning-toolbox.shinyapps.io/Strength-Endurance_Profile/.
Supporting information
This file contains detailed information on priors and models.
(PDF)
This file contains the data used for the statistical analyses.
(XLSX)
Acknowledgments
The authors would like to thank all participants for contributing to the realization of the present study. The authors further want to express their gratitude to the Centre for Sport Science and University Sports, University of Vienna for providing the equipment and facilities.
Data Availability
All relevant data are within the manuscript and its Supporting Information files. Scripts used for the statistical analyses can be accessed using the DOI: https://doi.org/10.5281/zenodo.5840363.
Funding Statement
Open access funding provided by University of Vienna.
References
- 1.Lawton TW, Cronin JB, McGuigan MR. Strength testing and training of rowers: a review. Sports Med 2011; 41(5):413–32. doi: 10.2165/11588540-000000000-00000 [DOI] [PubMed] [Google Scholar]
- 2.American College of Sports Medicine. American College of Sports Medicine position stand. Progression models in resistance training for healthy adults. Med Sci Sports Exerc 2009; 41(3):687–708. doi: 10.1249/MSS.0b013e3181915670 [DOI] [PubMed] [Google Scholar]
- 3.Brechue WF, Mayhew JL. Upper-body work capacity and 1RM prediction are unaltered by increasing muscular strength in college football players. J Strength Cond Res 2009; 23(9):2477–86. doi: 10.1519/JSC.0b013e3181b1ae5f [DOI] [PubMed] [Google Scholar]
- 4.Anders JPV, Keller JL, Smith CM, Hill EC, Housh TJ, Schmidt RJ et al. The Effects of Asparagus Racemosus Supplementation Plus 8 Weeks of Resistance Training on Muscular Strength and Endurance. J Funct Morphol Kinesiol 2020; 5(1):4. doi: 10.3390/jfmk5010004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Steele J, Fisher J, Giessing J, Gentil P. Clarity in reporting terminology and definitions of set endpoints in resistance training. Muscle Nerve 2017; 56(3):368–74. doi: 10.1002/mus.25557 [DOI] [PubMed] [Google Scholar]
- 6.DeWeese BH, Hornsby G, Stone M, Stone MH. The training process: Planning for strength–power training in track and field. Part 2: Practical and applied aspects. J Sport Health Sci 2015; 4(4):318–24. [Google Scholar]
- 7.Suchomel TJ, Nimphius S, Bellon CR, Hornsby WG, Stone MH. Training for Muscular Strength: Methods for Monitoring and Adjusting Training Intensity. Sports Med 2021; 51(10):2051–66. doi: 10.1007/s40279-021-01488-9 [DOI] [PubMed] [Google Scholar]
- 8.Hackett DA, Cobley SP, Davies TB, Michael SW, Halaki M. Accuracy in Estimating Repetitions to Failure During Resistance Exercise. J Strength Cond Res 2017; 31(8):2162–8. doi: 10.1519/JSC.0000000000001683 [DOI] [PubMed] [Google Scholar]
- 9.García-Ramos A, Torrejón A, Feriche B, Morales-Artacho AJ, Pérez-Castilla A, Padial P et al. Prediction of the Maximum Number of Repetitions and Repetitions in Reserve From Barbell Velocity. Int J Sports Physiol Perform 2018; 13(3):353–9. doi: 10.1123/ijspp.2017-0302 [DOI] [PubMed] [Google Scholar]
- 10.Brzycki M. Strength Testing—Predicting a One-Rep Max from Reps-to-Fatigue. JOPERD 1993; 64(1):88–90. [Google Scholar]
- 11.Desgorces FD, Berthelot G, Dietrich G, Testa MSA. Local muscular endurance and prediction of 1 repetition maximum for bench in 4 athletic populations. J Strength Cond Res 2010; 24(2):394–400. doi: 10.1519/JSC.0b013e3181c7c72d [DOI] [PubMed] [Google Scholar]
- 12.Reynolds JM, Gordon TJ, Robergs RA. Prediction of One Repetition Maximum Strength from Multiple Repetition Maximum Testing and Anthropometry. J Strength Cond Res 2006; 20(3):584–92. doi: 10.1519/R-15304.1 [DOI] [PubMed] [Google Scholar]
- 13.Mayhew JL, Johnson BD, Lamonte MJ, Lauber D, Kemmler W. Accuracy of prediction equations for determining one repetition maximum bench press in women before and after resistance training. J Strength Cond Res 2008; 22(5):1570–7. doi: 10.1519/JSC.0b013e31817b02ad [DOI] [PubMed] [Google Scholar]
- 14.Sakamoto A, Sinclair PJ. Effect of movement velocity on the relationship between training load and the number of repetitions of bench press. J Strength Cond Res 2006; 20(3):523–7. doi: 10.1519/16794.1 [DOI] [PubMed] [Google Scholar]
- 15.LeSuer DA, McCormick JH, Mayhew JL, Wasserstein RL, Arnold MD. The Accuracy of Prediction Equations for Estimating 1-RM Performance in the Bench Press, Squat, and Deadlift. J Strength Cond Res 1997; 11(4):211–3. [Google Scholar]
- 16.Ware JS, Clemens CT, Mayhew JL, Johnston TJ. Muscular endurance repetitions to predict bench press and squat strength in college football players. J Strength Cond Res 1995; 9(2):99–103. [Google Scholar]
- 17.Wood TM, Maddalozzo GF, Harter RA. Accuracy of Seven Equations for Predicting 1-RM Performance of Apparently Healthy, Sedentary Older Adults. Meas Phys Educ Exerc Sci 2002; 6(2):67–94. [Google Scholar]
- 18.Richens B, Cleather DJ. The relationship between the number of repetitions performed at given intensities is different in endurance and strength trained athletes. Biol Sport 2014; 31(2):157–61. doi: 10.5604/20831862.1099047 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hoeger WW, Hopkins DR, Barette SL, Hale DF. Relationship between repetitions and selected percentages of one repetition maximum: A comparison between un-trained and trained males and females. J Strength Cond Res 1990; 4(2):47–54. [Google Scholar]
- 20.Pick J, Becque MD. The Relationship Between Training Status and Intensity on Muscle Activation and Relative Submaximal Lifting Capacity During the Back Squat. J Strength Cond Res 2000; 14(2):175–81. [Google Scholar]
- 21.Douris PC, White BP, Cullen RR, Keltz WE, Meli J, Mondiello DM et al. The relationship between maximal repetition performance and muscle fiber type as estimated by noninvasive technique in the quadriceps of untrained women. J Strength Cond Res 2006; 20(3):699–703. doi: 10.1519/17204.1 [DOI] [PubMed] [Google Scholar]
- 22.Terzis G, Spengos K, Manta P, Sarris N, Georgiadis G. Fiber type composition and capillary density in relation to submaximal number of repetitions in resistance exercise. J Strength Cond Res 2008; 22(3):845–50. doi: 10.1519/JSC.0b013e31816a5ee4 [DOI] [PubMed] [Google Scholar]
- 23.Shimano T, Kraemer WJ, Spiering BA, Volek JS, Hatfield DL, Silvestre R et al. Relationship between the number of repetitions and selected percentages of one repetition maximum in free weight exercises in trained and untrained men. J Strength Cond Res 2006; 20(4):819–23. doi: 10.1519/R-18195.1 [DOI] [PubMed] [Google Scholar]
- 24.LaChance PF, Hortobagyi T. Influence of Cadence on Muscular Performance During Push-up and Pull-up Exercise. J Strength Cond Res 1994; 8(2):76–9. [Google Scholar]
- 25.Morton RH, Redstone MD, Laing DJ. The Critical Power Concept and Bench Press: Modeling 1RM and Repetitions to Failure. Int J Exerc Sci 2014; 7(2):152–60. [Google Scholar]
- 26.Monod H, Scherrer J. The Work Capacity of a Synergic Muscular Group. Ergonomics 1965; 8(3):329–38. [Google Scholar]
- 27.Bergstrom HC, Dinyer TK, Succi PJ, Voskuil CC, Housh TJ. Applications of the Critical Power Model to Dynamic Constant External Resistance Exercise: A Brief Review of the Critical Load Test. Sports (Basel) 2021; 9(2):15. doi: 10.3390/sports9020015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Sánchez-Medina L, González-Badillo JJ. Velocity loss as an indicator of neuromuscular fatigue during resistance training. Med Sci Sports Exerc 2011; 43(9):1725–34. doi: 10.1249/MSS.0b013e318213f880 [DOI] [PubMed] [Google Scholar]
- 29.Baumgartner R, Joshi A, Feng D, Zanderigo F, Ogden RT. Statistical evaluation of test-retest studies in PET brain imaging. EJNMMI Res 2018; 8(1):13. doi: 10.1186/s13550-018-0366-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M et al. Stan: A Probabilistic Programming Language. J. Stat. Soft. 2017; 76(1):32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Vet HCW de Terwee CB, Knol DL Bouter LM. When to use agreement versus reliability measures. J Clin Epidemiol 2006; 59(10):1033–9. doi: 10.1016/j.jclinepi.2005.10.015 [DOI] [PubMed] [Google Scholar]
- 32.Weir JP. Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM. J Strength Cond Res 2005; 19(1):231–40. doi: 10.1519/15184.1 [DOI] [PubMed] [Google Scholar]
- 33.Makowski D, Ben-Shachar M, Lüdecke D. bayestestR: Describing Effects and their Uncertainty, Existence and Significance within the Bayesian Framework. JOSS 2019; 4(40):1541. [Google Scholar]
- 34.Hopkins WG, Marshall SW, Batterham AM, Hanin J. Progressive statistics for studies in sports medicine and exercise science. Med Sci Sports Exerc 2009; 41(1):3–13. doi: 10.1249/MSS.0b013e31818cb278 [DOI] [PubMed] [Google Scholar]
- 35.Pereira MIR, Gomes PSC. Muscular strength and endurance tests: reliability and prediction of one repetition maximum–Review and new evidences. Rev Bras Med Esporte 2003; 9(5):336–46. [Google Scholar]
- 36.Rose K, Ball TE. A Field Test for Predicting Maximum Bench Press Lift of College Women. J Strength Cond Res 1992; 6(2):103–6. [Google Scholar]
- 37.Moss AC, Dinyer TK, Abel MG, Bergstrom HC. Methodological Considerations for the Determination of the Critical Load for the Deadlift. J Strength Cond Res 2021; 35(Suppl 1):S31–S37. doi: 10.1519/JSC.0000000000003795 [DOI] [PubMed] [Google Scholar]
- 38.Dinyer TK, Byrd MT, Vesotsky AN, Succi PJ, Bergstrom HC. Applying the Critical Power Model to a Full-Body Resistance-Training Movement. Int J Sports Physiol Perform 2019; 14(10):1364–70. doi: 10.1123/ijspp.2018-0981 [DOI] [PubMed] [Google Scholar]
- 39.Fountain WA, Valenti ZJ, Lynch CE, Guarnera SR, Meister BM, Carlini NA et al. Order of concentric and eccentric muscle actions affects metabolic responses. J Sports Med Phys Fitness 2021; 61(12):1587–95. doi: 10.23736/S0022-4707.21.12010-9 [DOI] [PubMed] [Google Scholar]
- 40.Mann JB, Ivey PJ, Brechue WF, Mayhew JL. Reliability and smallest worthwhile difference of the NFL-225 test in NCAA Division I football players. J Strength Cond Res 2014; 28(5):1427–32. doi: 10.1519/JSC.0000000000000411 [DOI] [PubMed] [Google Scholar]
- 41.Grgic J, Lazinica B, Schoenfeld BJ, Pedisic Z. Test-Retest Reliability of the One-Repetition Maximum (1RM) Strength Assessment: a Systematic Review. Sports Med Open 2020; 6(1):31. doi: 10.1186/s40798-020-00260-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ribeiro AS, do Nascimento MA, Mayhew JL, Ritti-Dias RM, Avelar A, Okano AH et al. Reliability of 1RM test in detrained men with previous resistance training experience. IES 2014; 22(2):137–43. [Google Scholar]
- 43.Mattocks KT, Buckner SL, Jessee MB, Dankel SJ, Mouser JG, Loenneke JP. Practicing the Test Produces Strength Equivalent to Higher Volume Training. Med Sci Sports Exerc 2017; 49(9):1945–54. doi: 10.1249/MSS.0000000000001300 [DOI] [PubMed] [Google Scholar]
- 44.Dankel SJ, Counts BR, Barnett BE, Buckner SL, Abe T, Loenneke JP. Muscle adaptations following 21 consecutive days of strength test familiarization compared with traditional training. Muscle Nerve 2017; 56(2):307–14. doi: 10.1002/mus.25488 [DOI] [PubMed] [Google Scholar]