Skip to main content
PLOS One logoLink to PLOS One
. 2021 Apr 14;16(4):e0249564. doi: 10.1371/journal.pone.0249564

Determination of reference intervals for knee motor functions specific to patients undergoing knee arthroplasty

Hideyuki Ito 1, Kiyoshi Ichihara 2,*, Kotaro Tamari 3, Tetsuya Amano 4, Shigeharu Tanaka 5, Shigehiro Uchida 6, Shinya Morikawa 7
Editor: Alison Rushton8
PMCID: PMC8046200  PMID: 33852606

Abstract

Background

In patients with knee osteoarthritis (KOA) undergoing knee arthroplasty (KA), lower-limb motor function tests are commonly measured during peri-surgical rehabilitation. To clarify their sources of variation and determine reference intervals (RIs), a multicenter study was performed in Japan.

Methods

We enrolled 545 KOA patients (127 men; 418 women; mean age 74.2 years) who underwent KA and followed a normal recovery course. The surgical modes included total KA (TKA), minimally invasive TKA (MIS-TKA), and unicompartmental KA (UKA). Motor functions measured twice before and two weeks after surgery included timed up-and-go (TUG), maximum walking speed (MWS), extensor and flexor muscle strength (MS), and knee range of motion (ROM). Multiple regression analysis was performed to evaluate their sources of variation including sex, age, BMI, and surgical mode. Magnitude of between-subgroup differences was expressed as SD ratio (SDR) based on 3-level nested ANOVA. SDR≥0.4 was set as the threshold for requiring RIs specific for each subgroup.

Results

Before surgery, age-related changes exceeding the threshold were observed for TUG and MWS. Between-sex difference was noted for extensor and flexor MS, but extension and flexion ROMs were not influenced by sex or age. After surgery, in addition to similar influences of sex and age on test results, surgical modes of UKA and MIS-TKA generally had a favorable influence on MWS, extensor MS, and flexion ROM. All motor function test results showed a variable degree of skewness in distribution, and thus RIs were basically derived by the parametric method after Gaussian transformation of test results.

Conclusions

This is the first study to determine RIs for knee motor functions specific to KOA patients after careful consideration of their sources of variation and distribution shapes. These RIs facilitate objective implementation of peri-surgical rehabilitation and allow detection of patients who deviate from the normal course of recovery.

Introduction

Yoshimura et al. [29] reported in a large cohort study across Japan that the prevalence of knee osteoarthritis (KOA) is as high as 42.6% in men and 62.4% in women over 40 years old. Their report also disclosed that among the 25.3 million patients with KOA, 8 million have symptomatic disease, which poses a large socio-economic problem in Japan [1]. KOA features degeneration and attrition of joint structure that result in ossification of the cartilage and surrounding tissue. This restricts knee range of movement causing pain and gait disturbance. Conservative therapy, such as exercise, ultrasonic therapy, electrotherapy, and orthosis therapy, is the primary treatment choice. However, with the development of multiple osteophytes, sclerosis, and narrowing of joint space, knee arthroplasty eventually becomes necessary. Either unicompartmental knee arthroplasty (UKA) or total knee arthroplasty (TKA) has been the therapeutic regimen of choice. The former is a less invasive technique that only replaces a single granular area and preserves the anterior and posterior cruciate ligaments [2, 3]. Recently, as an alternative to conventional TKA (C-TKA), a minimally invasive TKA surgery (MIS-TKA) has become popular, which features a shorter skin incision than that with C-TKA.

In the clinical management of KOA patients undergoing knee arthroplasty, physical therapists take an important role in providing peri-surgical rehabilitation in close collaboration with orthopedic surgeons. The scheme of rehabilitation according to a common clinical pathway for knee arthroplasty is shown in Fig 1. The functional status of patients is usually assessed by performing knee motor function tests of a safe variety, such as the timed-up-and-go (TUG) test, maximum walking speed (MWS), knee muscle strength, and knee range of motion (ROM) [4]. These test results are often referred to in deciding a patient’s discharge date. To read the results objectively, it is essential to understand their sources of variation (factors that may alter test results without pathological conditions) and to have a reliable reference interval (RI) for each parameter. To the best of our knowledge, there have been no published studies that evaluated motor function tests of patients under the same situation despite the high demand for knee replacement surgery.

Fig 1. Clinical pathway for knee replacement surgery.

Fig 1

Although several reports have presented so-called “healthy” normative values for knee motor function tests [59], these normative values were reported only in the form of mean ± standard deviation (SD) of test results without consideration of the shapes of their distribution. Therefore, a RI that corresponds to the true central 95% range of test results is not certain for practical use. In recent years, in the field of clinical chemistry and laboratory medicine, RIs are determined as a central 95% range of healthy values either nonparametrically as a 2.5 to 97.5 percentile range of values or parametrically in three steps: (1) transformation of the distribution of test results into Gaussian shape, (2) calculation of the central 95% range as mean ± 1.96 SD, and (3) reverse-transformation of the range back to the original scale [10, 11]. Another problem with using the reported healthy normative values was that they were generally not matched to KOA patients, who have compromised knee motor functions due to the interference of the operative-side knee or the presence of KOA on the non-operative side as well.

On the basis of this background, we conducted a multicenter study to evaluate sources of variation (SVs) of knee motor functions and to establish well-stratified RIs in consideration of the SVs for KOA patients undergoing knee arthroplasty. Age matched healthy subjects were also recruited to ascertain the need for disease specific RIs. We aimed to use these quantitative information for objective implementation of peri-surgical rehabilitation and for screening of patients who deviate from the normal course of recovery.

Subjects and methods

Study subjects

Patients

In total, 624 KOA patients undergoing elective knee arthroplasty were recruited between July 2013 and February 2018 from 13 institutions across Western Japan that specialize in knee arthroplasty. The inclusion criteria were 1) diagnosed as having osteoarthritis and 2) having a clinical indication for TKA or UKA. The exclusion criteria were 1) presence of motor paralysis or other neurological dysfunctions such as stroke, lumbar disc herniation, spinal canal stenosis, or peripheral neuropathy due to diabetes mellitus unrelated to KOA, 2) obvious motion pain or restricted joint movements in locations other than the knee, which would hinder walking or the sit-to-stand motion, 3) cognitive impairment, 4) missing test results for motor functions tests as planned, 5) failure to be discharged because of some deviation from the clinical pathway, such as walking instability, persistent inflammation, surgery-related fracture, deep venous thrombosis, or post-surgical infection. After exclusion of 38 patients with missing motor function test results and 41 patients with delayed discharge, 545 patients were considered eligible for the subsequent analyses.

The breakdown of cases by surgical mode was C-TKA, 99 cases; MIS-TKA, 342 cases; and UKA, 104 cases, where MIS-TKA was defined as a surgical mode involving a skin incision width of <5 cm [12]. The surgical mode of UKA features less bone resection and less injury to the quadricep muscles compared to TKA [13, 14].

The patients were managed following a standardized clinical pathway based on a rehabilitation program implemented in common in the 13 institutions (Fig 1). The planned date of discharge was set at 3–4 weeks after the surgery. The common program was composed of the following for the first post-surgical week: exercise for knee joint ROM, muscle training for gluteus maximus and gluteus medius, sit-up and stand-up training; for the second week: muscle training for quadriceps and T-cane walking; and for the third week: staircase training, daily activity practice, and instructions toward discharge. These motor function tests were performed twice, just before and two weeks after the surgery. The general guideline of allowing discharge at 3–4 weeks after knee arthroplasty was based on attainment of stable walking with one cane and the capability of performing daily living activities safely.

Healthy volunteers

As described in the Introduction, we recruited 120 apparently healthy volunteers with comparable ages to make a contrast of their knee motor functions with those of KOA patients undergoing knee arthroplasty. They comprised 36 men and 84 women mostly from elderly clubs registered in a municipal health promotion office. The inclusion criteria adopted from Bohannon [15] were 1) self-awareness of physical and mental health, 2) self-efficacy for leading independent daily life, 3) not currently under medical care for motor or neurological diseases or cardiac diseases, 4) no previous history of knee or hip join replacement surgery, and 5) capable of walking 30 meters without any assistance or device.

Ethical considerations, explanation, and consent

This study was conducted strictly following the Declaration of Helsinki and “Ethical Guidelines on Clinical Studies”. The objective and protocol of this study were evaluated and approved by the Tokoha University Ethics Committee (approval no.: 2018-501H). We provided a written explanation of this study in language that was understandable to the participants and obtained written consent from each subject after confirming their understanding and acceptance.

Research design

This study corresponded to a prospective cohort study for establishing RIs of knee motor functions specific to KOA patients.

Measurements

Basic physical and medical attributes

We recorded sex, age, body mass index (BMI), Kellgren-Lawrence classification [16, 17] for the severity of knee joint degeneration (K-L classification), affected sides (unilateral or bilateral), and surgical mode (C-TKA, MIS-TKA, or UKA). Regular exercise was defined as exercise performed a minimum of twice weekly for 30 minutes or more.

Motor functions

The measurements of the knee motor functions were performed according to the following procedures.

Timed up-and-go. The examinee was asked to sit comfortably on an armless chair with seat-height set between 40 and 45-cm high with their back leaning against the backrest and both hands on the thighs.

On the vocal “go” signal, the examinee got up from the chair, walked 3 meters, turned around, and returned to sit on the chair. The time required for the entire movement was measured by stopwatch. This test was done in two ways: at a comfortable speed (TUGcom) of usual walking [18] and at the patient’s maximum walking speed (TUGmax) [19].

Maximum walking speed. The test distance for MWS measurement was set to 5 meters. The examinee was allowed 3 meters each for acceleration and deceleration before and after the test distance. The examinee was asked to safely walk as fast as possible without running as recommended by Dobson [4] and Fransen [20] for increased reproducibility. On the vocal “go” signal, the examinee started to walk from the line set 3 meters before the test distance. A stopwatch was used to record walking time on reaching both ends of the test distance. The MWS (m/sec) was calculated as 5 m ÷ walking time (sec) [9].

Knee extensor muscle strength. The measurement was performed by use of a hand-held dynamometer (HHD) (μTas F-1; Anima Corp., Tokyo, Japan) according to the following procedures (Fig 2). The examinee was placed in a sitting position with both arms crossed in front of the chest. The examiner stood in front of the patient’s lower leg being tested and attached the HHD-sensor belt around the front of the ankle with its belt fixed to the stool leg (Fig 2B). The examinee was then requested to apply maximum force to extend the lower leg against the ankle belt. The isometric force/torque (Nm) was recorded as a product of the HHD output (N) and lower leg length (m). Muscle strength (Nm/kg) was expressed as torque divided by body weight (kg). The lower leg length was measured as a distance between the distal end of the lower leg (where the center of the HHD sensor was placed) and the lateral cleft of the knee joint [2123]. The muscle strength was measured twice, and the average value was used because variability of the measurement was higher in elderly women as reported by Katoh et al [24].

Fig 2. Measurement of knee extensor/flexor muscle strength.

Fig 2

The measurement was performed by use of a hand-held dynamometer (HHD) [A]. For measuring extension muscle strength [B], the examinee sitting on chair with the knee joint bent at 90-degree angle was asked to extend the lower leg at maximum force against the black belt around the ankle, which was fixed behind to the chair with the HHD attached to its front. The length of the lower leg (Len in meter) was measured [C] as a distance between the lateral cleft of the knee joint and the center of the HHD sensor at the ankle. The muscle strength in N‧m was computed as a product of HHD output (N) and leg length (m). Similarly, the flexion muscle strength was measured [D] by forcibly flexing the lower leg with the belt fixed to the leg of the examiner and the HHD attached at the back side of the belt.

Knee flexor muscle strength. The strength of the knee flexor muscles was measured in a similar manner. The only difference was that the HHD sensor was placed around the back side of the ankle with its belt fixed to the examiner’s leg (Fig 2D). The examinee was asked to apply maximum force to flex the knee to an acute angle against the belt. The strength was recorded as muscle strength of the knee flexor [2123].

Measurement of knee flexion and extension range of motion. The ROM of the knee joint during flexion and extension was measured by a joint angle meter according to the guideline issued jointly by the Japan Orthopedic Surgery Society and the Japan Rehabilitation Medical Society [25]. The angle of the long axes of the tibia (on a line connecting the fibula head and lateral malleolus) and the femur (on a line connecting the greater trochanter and lateral epicondyle of the femur) was measured in units of 5 degrees.

Statistical methods

Sources of variation of motor function parameters

To identify factors associated with motor function parameters, multiple regression analysis (MRA) was performed. Values of each of the motor function tests were set as a dependent variable. The following parameters that we could obtain from all subjects were evaluated as explanatory variables: sex, age, BMI, habit of regular exercise (binary: no = 0, yes = 1), and K-L classification (ranking scale of 0 to 4). In the analysis of motor function tests performed after the surgery, the additional explanatory variable of surgical mode (dummy variables were created for MIS-TKA and UKA by setting C-TKA as the reference category) was introduced in the regression model.

All explanatory variables were fixed for all objective variables for ease of comparability across motor function parameters. In addition to expressing the statistical significance of each standardized partial regression coefficient (rp) as a P value, we marked its effect size (practical significance) by use of a threshold of |rp|≥0.2, which corresponds to a midpoint effect size of Cohen’s d of small (0.1) and medium (0.3) for correlation coefficients [26].

For motor function parameters before the surgery, three-level nested ANOVA was performed by setting the three factors to sex, age, and BMI, whereas for the parameters after the surgery, the ANOVA was performed by setting the factors to sex, age, and surgical mode. The choice of these three factors was determined based on the MRA results (see Results). In the analyses, age and BMI were converted into a ranking scale by partitioning values at 70 and 80 years for age (3 categories), and at 18.5, 25, and 30 kg/m2 for BMI (4 categories according to the WHO criteria of <18.5: underweight; 18.5≤–<25: normal range; 25≤–<30: pre-obese; and 30≤–<35: obese class I).

The analysis provided the magnitude of each source of variation in terms of the SD: SD for sex (SDsex), SD for age (SDage), SD for BMI (SDBMI), and SD for surgical mode (SDsur) together with the SD for between-individual variations (SDindiv). The relative magnitude of each SD was expressed by determining its ratio to SDindiv. We designated it as SD ratio or SDR. Namely SDRsex = SDsex/SDindiv; SDRage = SDage/SDindiv; SDRBMI = SDBMI/SDindiv; and SDRsur = SDsur/SDindiv. The threshold value for SDR at which we chose to partition test results by the factor was set to 0.4 as reported elsewhere [10, 11].

Before performing MRA and ANOVA, the values of TUG were converted logarithmically, and the values of extensor/flexor muscle strength were transformed to the power of 0.5 (square root transformation) to reduce the skewness of their distributions.

Method for statistical derivation of RIs

RIs for motor function parameters were basically derived by use of a parametric method that features power transformation of a value (test result) to make its distribution Gaussian by use of the following modified Box-Cox formula [27]:

X=(x-a)p-1p

where X represents the transformed value of test result x, and p and a represent power and an origin of transformation to be estimated by maximum likelihood method, respectively.

The central 95% interval under the transformed scale (LLT, ULT) can be calculated by use of the mean and SD of the transformed test results (mT, SDT) as

LLT=mT1.96SDT
ULT=mT+1.96SDT

Then, the lower and upper limits under the original scale (LL, UL) can be calculated by reverse transformation using the following formulae:

LL=(p×LLT+1)1/p+a
UL=(p×ULT+1)1/p+a

The success of the Gaussian transformation was confirmed by the linearity of the cumulative frequency curve on the probability plot as well as by the Kolmogorov-Smirnov test of normality.

When test results of a given parameter failed in the Gaussian transformation by the Box-Cox method or when test results took discrete values, we applied a nonparametric method for determining the central 95% range of test results after sorting the values and took the range of 2.5 and 97.5 percentiles as the limits of the RI.

Results

Profiles of KOA patients

The patients with KOA comprised 127 men and 418 women with respective mean ages ± SD of 74.4 ± 8.0 and 74.2 ± 7.6 years and BMIs of 25.3 ± 3.6 and 25.3 ± 3.6 kg/m2 as summarized in Table 1. The habit of regular exercise was self-reported as “yes” by 176 and “no” by 369 patients. The affected knee joints were unilateral in 205 and bilateral in 340 patients. The severity of the knee joint deformity by K-L classification was Grade 2, 3, and 4 in 29, 252, and 264 patients, respectively. The mode of surgery was C-TKA in 99 patients, MIS-TKA in 342, and UKA in 104.

Table 1. Demography of patients and healthy volunteers.

KOA patients healthy volunteers P-value
N 545 120
Sex (n) men:127 women: 418 men:36 women: 84 0.12
Age (year) 74.2 ± 7.7 71.3 ± 5.9 0.0001
BMI (kg/m2) 25.3 ± 3.7 22.5 ± 2.9 0.0000
Exercise (n) Yes:176 no:369 Yes:99 no:21 0.0000
K-L class (n) Grade 2:29 Grade 3:252 Grade 4:264
Surgical mode (n) C-TKA: 99 MIS-TKA: 342 UKA: 104
Affected knee (n) unilateral:205 bilateral:340
Days of admission 25.7 ± 6.3

Demography of healthy controls

The healthy controls comprised 36 men and 84 women with respective mean ages ± SD of 72.3 ± 5.9 and 70.9 ± 6.0 years and BMIs of 23.2 ± 2.4 and 22.1 ± 3.1 kg/m2. The habit of regular exercise was self-reported as “yes” in 99 and “no” in 21 subjects.

Differences between healthy subjects and KOA patients were tested by Mann-Whitney test for numerical variables and chi-square tests for categorical variables. Statistically significant differences were found for higher age and BMI, and less frequent habit of exercise observed among KOA patients.

Sources of variation of motor functions among osteoarthritis patients

Regarding the motor function tests before the surgery, as shown in the upper half of Table 2, MRA revealed that age was a significant source of variation with |rp|>0.2 for TUGcom, TUGmax, and MWS with respective rp values of 0.32, 0.34, and −0.31. Sex was also significant for extensor and flexor muscle strength of the knee. In general, there were greater between-sex differences in extensor muscle (rp = −0.32) than in flexor muscle (rp = −0.24) strength. None of the factors were associated with knee extension ROM, whereas BMI was negatively associated with knee flexion ROM (rp = −0.22), i.e., the higher the BMI, the smaller was the flexion ROM.

Table 2. Multiple regression analyses for sources of variation of motor function parameters.

Before surgery R Sex Age BMI Exercise K-L class Bilateral
TUG comfort 0.402 0.11 * 0.32 *** -0.01 -0.11 * 0.14 ** 0.01
TUG maximum 0.431 0.15 *** 0.34 *** 0.03 -0.08 0.15 ** 0.01
Maximum walking speed 0.436 -0.16 *** -0.31 *** 0.01 0.12 * -0.17 *** -0.04
Extensor muscle strength 0.355 -0.32 *** -0.12 -0.06 0.06 -0.01 0.03
Flexor muscle strength 0.302 -0.24 *** -0.10 -0.11 0.01 -0.07 -0.04
Extension ROM 0.198 -0.03 0.05 -0.08 0.05 -0.15 * 0.04
Flexion ROM 0.296 -0.11 * -0.02 ** -0.22 *** 0.05 -0.09 -0.01
After surgery R Sex Age BMI Exercise K-L class Bilateral MIS-TKA UKA
TUG comfort 0.437 0.08 0.35 *** 0.01 -0.04 0.07 0.04 -0.16 * -0.24 ***
TUG maximum 0.468 0.12 ** 0.37 *** 0.00 -0.05 0.10 0.03 -0.15 * -0.24 ***
Maximum walking speed 0.484 -0.16 ** -0.34 *** -0.02 0.08 -0.07 -0.03 0.22 *** 0.31 ***
Extensor muscle strength 0.367 -0.29 *** -0.10 -0.12 ** 0.00 0.02 0.06 0.19 ** 0.22 **
Flexor muscle strength 0.353 -0.24 *** -0.19 *** -0.15 ** -0.06 -0.03 -0.01 0.15 * 0.10
Extension ROM 0.205 0.11 * -0.04 -0.09 0.00 0.08 -0.02 0.09 0.11
Flexion ROM 0.242 -0.07 -0.08 -0.04 -0.01 0.01 0.01 0.12 0.26 ***

R = multiple correlation coefficient; Exercise (binary) indicating habit of regular exercise; K-L class (ordinal) from 2 to 4; Bilateral (binary) indicating bilateral involvement of osteoarthritis of the knees.

Multiple regression analysis (MRA) was performed by setting each motor function parameter as a dependent variable and a fixed list of explanatory variables: Sex, age, BMI, exercise, K-L class, bilateral. Listed are standardized partial regression coefficients (rp), which take values between -1.0 and 1.0. The values above the effect size of |rp|≥0.2 are highlighted by bold letter with light orange background. Statistical significance is also indicated as

* = P<0.01;

** = P<0.001, and

*** = P<0.0001.

For the motor function tests after surgery, as shown in the lower half of Table 2, age was positively associated with TUGcom and TUGmax (rp = 0.35 and 0.37) and negatively associated with MWS (rp = −0.34). The surgical mode of UKA was negatively associated with TUGcom and TUGmax (rp = −0.24 and −0.24) and positively associated with MWS (rp = 0.31), i.e., the surgical mode of UKA resulted in shorter TUG and faster MWS compared to C-TKA (the reference category). The surgical mode of MIS-TKA was also positively associated with MWS (rp = 0.22) compared to C-TKA, i.e., MWS was faster with MIS-TKA than with C-TKA. Furthermore, the women showed weaker extensor and flexor muscle strength (rp = −0.29 and −0.24), and UKA had a positive influence on the extensor muscle strength (rp = 0.22).

None of the factors were associated with the ROM of knee extension. For the ROM of knee flexion, however, the surgical mode of UKA showed a larger ROM compared to C-TKA, with a rp of 0.26. These findings revealed by MRA can be confirmed graphically as follows.

In Fig 3, the measured values of TUGcom (Fig 3A) and TUGmax (Fig 3B) were compared among healthy controls and patients before and after surgery subgrouped by age (≤69, 70–79, ≥80 years) and surgical mode. The prolongation of TUG with age is obvious across all groups. TUG was increased in ascending order of UKA, MIS-TKA, and C-TKA. The figure also shows how much difference in TUG occurred in KOA patients before and after the surgery compared to the healthy elderly of comparable ages.

Fig 3. Comparison of TUGs and MWS partitioned by age and surgical mode.

Fig 3

Measured results of timed up and go test (TUG) at comfortable (A) or at maximum (B) walking speed in seconds, and maximum walking speed (MWS) (C) were compared between three groups: Healthy controls and patients before and after surgery. Because of prominent age-related changes in the values, they were partitioned by age at 70 and 80 years, and for postsurgical values, also by surgical modes. The x-axis of TUG was transformed logarithmically and that of MWS was transformed to the power of 0.8 to make their distributions near Gaussian. The box and center line respectively represent the mid-50% range and median of each subgroup.

A similar comparison was made for MWS in Fig 3C. Both the healthy controls and the patients before surgery showed an age-related decrease in MWS. The reduction in MWS in the patients is prominent regardless of age compared to the healthy controls.

MWS after surgery was further subgrouped by surgical mode. Both age and surgical mode were associated with the levels of MWS both by MRA and ANOVA. However, surgical mode was relatively more intense (SDRage = 0.363 vs SDRsur = 0.442). Therefore, in this figure, postsurgical MWS was partitioned by the mode of surgery.

In Fig 4A and 4B, extensor and flexor muscle strengths were respectively compared after partitioning by sex between three groups: healthy controls and patients in presurgical and postsurgical states. Extensor muscle strength postoperatively is reduced in females.

Fig 4. Comparison of knee muscle strength and ROM between three groups.

Fig 4

The measured strengths of extensor (A) and flexor (B) muscles were compared between three groups: Healthy controls and patients before and after the surgery. The values were partitioned by gender. The x-axis was transformed to the power of 0.5 to make their distribution near Gaussian. The box and center line represent the mid-50% range and median of each subgroup, respectively. Similarly, the range of motion (ROM) of extension (C) and flexion (D) was compared between the three groups. As the test results were recorded discretely for every 5°, the distribution is shown by histogram.

In Fig 4C and 4D, extension and flexion ROM of the knee was respectively compared between healthy controls and KOA patients before and after surgery. The values of extension and flexor ROM of the patients were greatly decreased before the surgery compared to those of the healthy controls. However, after surgery, the values of extension ROM improved appreciably, whereas the values of flexion ROM decreased further after the surgery.

Assessment of factors to derive RIs specific to subgroups

We considered sex, age, BMI, and surgical mode as possible factors for the need of subgrouping test results when deriving RIs. Three-level nested ANOVA was performed separately for pre- and post-surgical results by use of the following three factors of sex, age, and BMI for the former, and sex, age, and surgical mode for the latter. As shown in Table 3 by setting SDR ≥0.4 as a threshold, for values before the surgery, partitioning by age was required for TUGcom, TUGmax, and MWS with respective SDRage values of 0.445, 0.462, and 0.407. However, partitioning by sex was required for extensor and flexor muscle strength with respective SDRsex values of 0.573 and 0.418, implying a more prominent sex difference in extensor than flexor muscle strength.

Table 3. SDRs calculated to assess the need for partitioning values.

Before surgery After surgery
SDRsex SDRage SDRBMI SDRsex SDRage SDRsur
TUG comfort 0.000 0.445 0.134 0.000 0.416 0.282
TUG maximum 0.000 0.462 0.201 0.000 0.456 0.298
Maximum walking speed 0.168 0.407 0.138 0.112 0.363 0.442
Extensor muscle strength 0.573 0.099 0.144 0.484 0.000 0.303
Flexor muscle strength 0.418 0.000 0.171 0.379 0.187 0.034
Extension ROM 0.000 0.000 0.159 0.173 0.000 0.159
Flexion ROM 0.174 0.000 0.252 0.000 0.000 0.263

Three-level nested ANOVA was performed in two ways. One for motor function tests before the surgery was performed by setting sex, age, and BMI as factors of variation, and the other for motor function tests after the surgery was performed by setting sex, age, and surgical mode as the factors. In the analyses, age and BMI were respectively partitioned at 70 and 80 years and at 18.5, 25, and 30 kg/m2. SDs attributable to sex, age, BMI, and surgical modes (SDsex, SDage, SDBMI, and SDsur) were divided by SD between-individuals (SDindiv) to obtain SDRsex, SDRage, SDRBMI, and SDRsur: i.e., SDindiv represents the residual SD after subtracting the influence of other sources of variation. We regard SDR ≥0.4 as a guide to judge the need for partitioning test results into subgroups when deriving the RIs [11].

In contrast, for values after the surgery, partitioning by age was required again for TUGcom and TUGmax. However, for MWS, SDR for surgical mode (SDRsur) was greater than SDRage. Therefore, we chose to partition the postsurgical MWS values into three categories by the surgical modes of UKA, MIS-TKA, and C-TKA. For post-surgical extensor and flexor muscle strength, the SDRsex values (0.484 and 0.379, respectively) were mixed relative to the threshold of 0.4. However, we chose to partition by sex both extensor and flexor muscle values for the sake of consistency with the pre-surgical values. As the SDRs for extension and flexion ROM were all below the threshold, no attempt to partition by sex and age was made for the values of these ROMs.

Determination of RIs

RIs for motor function test results were derived according to the scheme shown above guided by SDR values. The RIs derived are listed in seven blocks in Table 4. The parametric method was basically used in the derivation. The success of the Gaussian transformation of test results was confirmed by Kolmogorov-Smirnov test as shown in Table 4 and in S1 Fig. The nonparametric method of computing the 2.5–97.5 percentile range was used in determining RIs for knee extension and flexion ROMs because their angle values were recorded discretely at every 5° and were highly skewed to the lower sides as shown in Fig 4C and 4D.

Table 4. Reference intervals derived for motor function parameters.

TUG comfort Extensor muscle strength
Parametric RIs Parametric RIs
Group Sex Age n LL Me UL K-S test Sex n LL Me UL K-S test
Healthy All All 118 5.1 6.5 8.6 0.330 Healthy All 120 0.37 1.16 2.00 1.000
PreSurg *1 All -69 118 7.6 11.2 20.6 0.635 PreSurg *1 M 124 0.25 0.91 1.88 1.000
All 70–79 281 8.0 12.9 24.8 0.940 F 404 0.20 0.65 1.36 0.993
All 80- 140 8.9 15.5 33.6 0.730 PostSurg M 127 0.20 0.57 1.46 1.000
PostSurg All -69 119 8.1 12.3 27.4 0.150 F 417 0.14 0.40 0.82 0.530
All 70–79 279 9.2 15.2 37.2 0.090
All 80- 140 10.3 18.7 38.6 0.970 Flexor muscle strength
Parametric RIs
TUG maximum Sex n LL Me UL K-S test
Parametric RIs Healthy All 120 0.21 0.49 0.93 1.000
Group Sex Age n LL Me UL K-S test PreSurg *1 M 121 0.14 0.45 0.88 1.000
Healthy All All 119 3.9 5.0 7.2 0.160 F 403 0.11 0.34 0.79 0.332
PreSurg *1 All -69 118 5.9 8.7 17.0 0.585 PostSurg M 126 0.13 0.34 0.79 0.580
All 70–79 279 6.4 10.3 18.3 0.191 F 416 0.09 0.26 0.61 0.900
All 80- 138 6.7 12.6 23.4 0.810
PostSurg All -69 118 6.4 9.9 20.1 0.070 Extension ROM
All 70–79 279 7.7 12.2 29.1 0.060 Nonparametric RIs
All 80- 139 8.0 15.0 28.1 1.000 Sex n LL Me UL
Healthy All 120 0 0 −10
Maximum Walking Speed (MWS) PreSurg All 545 0 −5 −30
Parametric RIs PostSurg All 545 0 −5 −20
Group Sex Age n LL Me UL K-S test
Healthy All All 120 1.50 2.20 3.13 0.630 Flexion ROM
PreSurg *1 All -69 120 0.60 1.22 2.03 0.420 Nonparametric RIs
All 70–79 285 0.45 1.07 1.73 0.850 Sex n LL Me UL
All 80- 140 0.43 0.90 1.59 0.856 Healthy All 120 120 150 160
PostSurg UKA All All 104 0.55 1.00 2.01 1.000 PreSurg All 545 90 125 150
MIS-TKA All All 342 0.37 0.89 1.61 0.740 PostSurg All 545 85 115 140
C-TKA All All 99 0.27 0.74 1.47 0.230

RIs were derived for motor function parameters for healthy controls, patients before and after surgery (PreSurg and PostSurg). Partitioning by age was done for TUGs and MWS, and by sex for extensor and flexor muscles strength. No partitioning by sex and age was done for values of healthy controls due to the small data size. RIs were determined by nonparametric method for test results of ROMs. For all other motor function tests, RIs were derive by parametric method through Gaussian transformation of test results (see main texts). The appropriateness of the transformation was checked by use of Kolmogorov-Smirnov (K-S) test: i.e., P>0.05 in the rightmost column was regarded as successful transformation. Refer to S1 Fig for the parameters marked as (*1) for actual outputs for the transformation and K-S test.

The RIs determined from healthy controls were not partitioned by sex or by age because of the small data size of 120 samples. The data size (n) shown in the table fluctuates because the parametric method performed truncation once at mean ± 2.81 SD under the transformed scale. The derivation process of the motor function tests before the surgery (marked as *1) are documented in S1 Fig to show the success of the Gaussian transformation by use of the probability plot and Kolmogorov-Smirnov test (N.S. implies no significant deviation from the Gaussian shape).

Discussion

Physical therapists perform peri-surgical rehabilitation of KOA patients undergoing knee arthroplasty in collaboration with orthopedic surgeons. For optimal management and promotion of recovery, it is important to monitor knee motor functions such as TUG, MWS, knee muscle strength, and knee ROM. For objective use of these parameters, it is essential to have reliable RIs for them and to analyze their sources of variation.

Regarding RIs, there are several studies reporting ‘normative’ values for knee motor functions derived from healthy individuals. Tsubaki et al. [28] measured TUG in 172 healthy Japanese elderly (80 men, 92 women; 50–79 years of age) and showed age-specific reference values. Yoshimura et al. [29] reported reference values for MWS from the analyses of 2468 healthy Japanese elderly (826 men, 1642 women; mean age 71.8 years). Seino et al. [9] also reported healthy values for MWS. However, all the past studied reported values in mean ± SD format without consideration of the distribution pattern. Therefore, it is not possible to predict the RI or central 95% range of value from the reported values.

In addition to these methodological issues of the past studies targeting healthy subjects, as demonstrated in Figs 3 and 4, healthy ranges of knee motor function tests were quite different from those of KOA patients both before and after the surgery. Therefore, our team of physical therapists conducted this study with a main objective of exploring the sources of variation and determining RIs for knee motor function tests before and after the surgery by use of up-to-date statistical methods currently in use in the field of laboratory medicine [10, 11].

Analysis of the sources of variation of the motor function tests before surgery clearly showed that both TUG comfort and TUG maximum in our KOA patients increased with age. Steffen et al. reported that TUG maximum of healthy elderly subjects increased proportionately with age [30]. Adegoke et al. [31] studied TUG maximum and MWS in KOA patients and reported a similar age dependency and attributed it to unbalanced walking with age. Bohannon [15] reported in his study of healthy elderly subjects that compared to comfortable walking speed, the maximum walking speed decreased prominently with age. Knee muscle strength before the surgery in our patients was lower in the women. Logerstedt et al. [32] reported the same finding in their KOA patients before surgery.

In contrast, in the motor function tests performed 14 days after the surgery, the post-surgical increase in both TUG comfort and TUG maximum was more pronounced with age, but the increase was suppressed in patients who received UKA. Jones et al. [33] reported that UKA led to reduced TUG maximum because of limited surgical involvement of the quadricep muscles. Similarly, post-surgical MWS was decreased with age and influenced by the surgical mode. Meanwhile, extensor and flexor muscle strengths after surgery were lower in the women, as was already reported by Gustavson et al. [34] in KOA patients. Meanwhile, we found that the extensor muscle strength was also influenced by surgical mode with UKA resulting in higher strength than C-TKA.

We relied on the SDR to judge the need for partitioning test results in determining RIs specific for certain subgroups. It was not difficult to judge SDR for pre-surgical motor functions because either sex or age was the only factor to be considered. However, for post-surgical parameters, we had three factors to consider: sex, age, and surgical mode. For example, TUG comfort and TUG maximum after surgery were both associated with age and surgical mode. However, because of the insufficient sample size, we had to adopt just one factor with a higher SDR to avoid too many partitions. Therefore, we partitioned post-surgical TUG values only by age into three groups to compute the RIs.

Post-surgical MWS also showed significant association with both age and surgical mode, but we made the opposite decision of partitioning MWS values by surgical mode rather than by age based on the values of SDR for age = 0.363 versus SDR for surgical mode = 0.442. As the rationale for this decision, we found a report by Iijima et al. [35] showing that in KOA patients, turn-around speed during the TUG test depended on hip abductor muscles that are minimally affected by the surgery, whereas straight-walking speed depended mostly on the strength of the quadriceps. Therefore, selection among the three surgical modes with different degrees of quadriceps injury had more influence on MWS that represents straight-walking speed.

For partitioning of the RIs for extensor and flexor muscle strength, sex with high SDR was the only factor requiring consideration. This finding of sex-difference was supported in a report by Logerstedt et al. [32]. Regarding the source of variation of the ROM, we found the levels of flexion ROM were negatively associated with BMI before surgery, and their post-surgical values were better with UKA. However, the actual differences in terms of SDR were less than the threshold of 0.4, and thus, their RIs were determined without any partitioning.

In this study we showed that both pre- and post-surgical knee motor functions were all conspicuously different from those of the healthy elderly subjects as shown in Fig 3 for TUG and MWS and in Fig 4 for muscle strength and knee ROM. The obvious gaps we observed in all parameters from healthy elderly subjects clearly point to the importance of disease specific RIs for knee arthroplasty patients.

In summary, the clinical implication of this study is as follows. As we only enrolled patients who followed a normal course of recovery, the RIs stratified according to their factors of variation will facilitate peri-surgical rehabilitation customized to patients according to sex, age, and surgical mode. Furthermore, the subgroup-specific RIs will be useful in detecting patients who deviate from the normal course of recovery.

Limitations

This study has three limitations. 1) The sample size was not large enough to make a finer subgrouping of test results by sex, age, and surgical mode. We were obliged to partition tests results only by a single factor from these three major factors. 2) For the long-term management of knee arthroplasty patients, knee motor function tests should be monitored for a longer period. However, this was beyond the scope of our study. Even in routine clinical practice, the long-term follow-up of patients’ motor functions is difficult on the physical therapy side because most patients return to local clinics after their surgery. 3) We only evaluated the knee motor function tests that can be safely performed in the early post-surgical period. For long-term evaluation of knee arthroplasty patients, the 6-minute walk test and stair climb test are available. However, these tests were not included in our evaluation because of difficulty in performing them during the immediate post-surgical period.

Conclusion

With the increasing demand of knee arthroplasty for KOA patients, there is a dire need for objective reference values for knee motor functions to ensure efficient peri-surgical management of patients. Through the collaboration of 13 medical institutions in Japan that specialize in knee arthroplasty, we enrolled 545 KOA patients undergoing knee arthroplasty, who followed a normal course of recovery, and performed knee motor functions tests twice before and 14 days after the surgery. RIs specific to each time point were determined in consideration of the distribution pattern of each parameter. The need for partitioning of RIs was objectively judged by use of an SDR that represents the effect size of between-subgroup differences. To our knowledge, this is the first report in the field of orthopedics and rehabilitation to determine RIs specific to KOA patients by use of up-to-date statistical methods. The quantitative information on RIs and their factors of variation will facilitate objective implementation of peri-surgical rehabilitation and screening of patients who deviate from the normal course of recovery.

Supporting information

S1 Fig. Determination of reference intervals by parametric method.

(PDF)

S1 File

(XLSX)

Data Availability

All relevant data are within the manuscript and its Supporting information files.

Funding Statement

The authors received no specific funding for this work.

References

  • 1.Yoshimura N, Muraki S, Oka H, Mabuchi A, En-Yo Y, Yoshida M, et al. Prevalence of knee osteoarthritis, lumbar spondylosis, and osteoporosis in Japanese men and women: The research on osteoarthritis/osteoporosis against disability study. J Bone Miner Metab. 2009; 27(5):620–628. 10.1007/s00774-009-0080-8 [DOI] [PubMed] [Google Scholar]
  • 2.Bonutti PM, Mont MA, Kester MA. Minimally invasive total knee arthroplasty: a 10-feature evolutionary approach. Orthop Clin North Am. 2004; 35(2):217–226. 10.1016/j.ocl.2004.02.001 [DOI] [PubMed] [Google Scholar]
  • 3.Khanna A, Gougoulias N, Longo UG, Maffulli N. Minimally invasive total knee arthroplasty: a systematic review. Orthop Clin North Am. 2009; 40(4):479–489. 10.1016/j.ocl.2009.05.003 [DOI] [PubMed] [Google Scholar]
  • 4.Dobson F, Hinman RS, Roos EM, Abbott JH, Stratford P, Davis AM, et al. OARSI recommended performance-based tests to assess physical function in people diagnosed with hip or knee osteoarthritis. Osteoarthritis and Cartilage. 2013; 21(8):1042–1052. 10.1016/j.joca.2013.05.002 [DOI] [PubMed] [Google Scholar]
  • 5.Bohannon RW. Reference values for the timed up and go test: a descriptive meta-analysis. J Geriatr Phys Ther. 2006; 29(2):64–68. 10.1519/00139143-200608000-00004 [DOI] [PubMed] [Google Scholar]
  • 6.Shimada H, Furuna T, Obuchi S, Sugiura M, Yoshida H, Kim H, et al. Timed Up & Go Test is a useful assessment tool for community health in elderly people. Physical Therapy Japan. 2006; 33(3):105–111. [Google Scholar]
  • 7.Pondal M, del Ser T. Normative data and determinants for the Timed “Up and Go” Test in a population-based sample of elderly individuals without gait disturbances. J Geriatr Phys Ther. 2008; 31(2):57–63. 10.1519/00139143-200831020-00004 [DOI] [PubMed] [Google Scholar]
  • 8.Ibrahim A, Singh DKA, Shahar S. ‘Timed Up and Go’ test: Age, gender and cognitive impairment stratified normative values of older adults. PLoS One. 2017;12(10): e0185641. 10.1371/journal.pone.0185641 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Seino S, Shinkai S, Fujiwara Y, Obuchi S, Yoshida H, Hirano H, et al. Reference values and age and sex differences in physical performance measures for community-dwelling older Japanese: a pooled analysis of six cohort studies. PLoS One. 2014; 9(6):e99487. 10.1371/journal.pone.0099487 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ichihara K, Ozarda Y, Barth JH, Klee G, Qiu L, Erasmus R, et al. A global multicenter study on reference values: 1. Assessment of methods for derivation and comparison of reference intervals. Clin Chim Acta. 2017; 467:70–82. 10.1016/j.cca.2016.09.016 [DOI] [PubMed] [Google Scholar]
  • 11.Ichihara K, Statistical considerations for harmonization of the global multicenter study on reference values. Clin Chim Acta. 2014; 432:108–118. 10.1016/j.cca.2014.01.025 [DOI] [PubMed] [Google Scholar]
  • 12.Tria AJ Jr. Minimally invasive total knee arthroplasty: The importance of instrumentation. Orthop Clin North Am. 2004; 35(2):227–234. 10.1016/S0030-5898(03)00118-4 [DOI] [PubMed] [Google Scholar]
  • 13.Lombardi AV Jr, Berend KR, Walter CA, Aziz-Jacobo J, Cheney NA. Is recovery faster for mobile-bearing unicompartmental than total knee arthroplasty? Clin Orthop Relat Res. 2009; 467(6):1450–1457. 10.1007/s11999-009-0731-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Wilson HA, Middleton R, Abram SGF, Smith S, Alvand A, Jackson WF, et al. Patient relevant outcomes of unicompartmental versus total knee replacement: systematic review and meta-analysis. BMJ. 2019; 364:l352. 10.1136/bmj.l352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bohannon RW. Comfortable and maximum walking speed of adults aged 20–79 years: reference values and determinants. Age Ageing. 1997; 26:15–19. 10.1093/ageing/26.1.15 [DOI] [PubMed] [Google Scholar]
  • 16.Kellgren JH, Lawrence JS. Radiological assessment of osteo-arthrosis. Am Rheum Dis 1957; 16(4):494–502. 10.1136/ard.16.4.494 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kessler S, Guenther KP, Puhl W. Scoring prevalence and severity in gonarthritis: the suitability of the Kellgren & Lawrence scale. Clin Rheumatol 1998; 17(3):205–209. 10.1007/BF01451048 [DOI] [PubMed] [Google Scholar]
  • 18.Podsiadlo D, Richardson S. The timed "Up & Go": a test of basic functional mobility for frail elderly persons. J Am Geriatr Soc. 1991;39(2):142–148. 10.1111/j.1532-5415.1991.tb01616.x [DOI] [PubMed] [Google Scholar]
  • 19.Shumway-Cook A, Brauer S, Woollacott M. Predicting the probability for falls in community-dwelling older adults using the Timed Up & Go Test. Phys Ther. 2000; 80(9):896–903. [PubMed] [Google Scholar]
  • 20.Fransen M, Crosbie J, Edmonds J. Reliability of gait measurements in people with osteoarthritis of the knee. Phys Ther. 1997; 77(9):944–953. 10.1093/ptj/77.9.944 [DOI] [PubMed] [Google Scholar]
  • 21.Katoh M, Yamasaki H. Test-retest reliability of isometric leg muscle strength measurements made using a hand-held dynamometer restrained by a belt: comparisons during and between sessions. J Phys Ther Sci. 2009; 21(3):239–243. [Google Scholar]
  • 22.Katoh M, Yamasaki H. Comparison of reliability of isometric leg muscle strength measurements made using a hand-held dynamometer with and without a restraining belt. J Phys Ther Sci. 2009; 21(1):37–42. [Google Scholar]
  • 23.Katoh M, Yamasaki H, Uchida M. Validity of isometric muscle strength measurements of the lower limbs using a handheld dynamometer and belt: a comparison with an isokinetic dynamometer. J Phys Ther Sci. 2011; 23(4):553–557. [Google Scholar]
  • 24.Katoh M, Isozaki K, Sakanoue N, Miyahara T. Reliability of isometric knee extension muscle strength measurement using a handheld dynamometer with a belt: a study of test-retest reliability in healthy elderly subjects. J Phys Ther Sci. 2010; 22(4):359–363. [Google Scholar]
  • 25.Yonemoto K, Kondo T, Shigenobu I. Joint range of motion display and measurement method. The Japanese Journal of Rehabilitation Medicine. 1995; 32(4):207–217. [Google Scholar]
  • 26.Cohen J. A power primer. Psychol Bull. 1992; 112:155–159. 10.1037//0033-2909.112.1.155 [DOI] [PubMed] [Google Scholar]
  • 27.Ichihara K, Boyd JC; IFCC Committee on Reference Intervals and Decision Limits (C-RIDL). An appraisal of statistical procedures used in derivation of reference intervals. Clin Chem Lab Med. 2010; 48(11):1537–1551. 10.1515/CCLM.2010.319 [DOI] [PubMed] [Google Scholar]
  • 28.Tsubaki A, Kubo M, Kobayashi R, Jigami H, Takahashi H. Age-related changes in physical function in community-dwelling people aged 50–79 years. J Phys Ther Sci. 2010; 22:23–27. [Google Scholar]
  • 29.Yoshimura N, Oka H, Muraki S, Akune T, Hirabayashi N, Matsuda S, et al. Reference values for hand grip strength, muscle mass, walking time, and one-leg standing time as indices for locomotive syndrome and associated disability: the second survey of the ROAD study. J Orthop Sci. 2011; 16(6):768–777. 10.1007/s00776-011-0160-1 [DOI] [PubMed] [Google Scholar]
  • 30.Steffen TM, Hacker TA, Mollinger L. Age- and gender-related test performance in community-dwelling elderly people: Six-Minute Walk Test, Berg Balance Scale, Timed Up & Go Test, and Gait Speeds. Phys Ther. 2002; 82(2):128–137. 10.1093/ptj/82.2.128 [DOI] [PubMed] [Google Scholar]
  • 31.Adegoke BO, Babatunde FO, Oyeyemi AL. Pain, balance, self-reported function and physical function in individuals with knee osteoarthritis. Physiother Theory Pract. 2012; 28(1):32–40. 10.3109/09593985.2011.570858 [DOI] [PubMed] [Google Scholar]
  • 32.Logerstedt DS, Zeni J Jr, Snyder-Mackler L. Sex differences in patients with different stages of knee osteoarthritis. Arch Phys Med Rehabi. 2014; 95(12):2376–2381. 10.1016/j.apmr.2014.07.414 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jones GG, Kotti M, Wiik AV, Collins R, Brevadt MJ, Strachan RK, et al. Gait comparison of unicompartmental and total knee arthroplasties with healthy controls. Bone Joint J. 2016; 98(10):16–21. 10.1302/0301-620X.98B10.BJJ.2016.0473.R1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gustavson AM, Wolfe P, Falvey JR, Eckhoff DG, Toth MJ, Stevens-Lapsley JE. Men and women demonstrate differences in early functional recovery after total knee arthroplasty. Arch Phys Med Rehabil. 2016; 97(7):1154–1162. 10.1016/j.apmr.2016.03.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Iijima H, Yorozu A, Suzuki Y, Eguchi R, Aoyama T, Takahashi M. Hip abductor muscle weakness and slowed turning motion in people with knee osteoarthritis. J Biomech. 2020; 101:109652. 10.1016/j.jbiomech.2020.109652 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Alison Rushton

18 Dec 2020

PONE-D-20-18248

Determination of reference intervals for lower-limb motor functions specific to patients undergoing knee arthroplasty

PLOS ONE

Dear Dr. Ichihara,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

Please address the insightful points identified by the reviewers below.

==============================

Please submit your revised manuscript by Feb 01 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Alison Rushton

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

3. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

Reviewer #3: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: 

The manuscript is generally well structured and written in precise and understandable manner, but some points still need clarifications:

1. Please include information on timing of patients' examinations in the method section. It is for the first time given in the Discussion.

2. Line 181: “objective variable” is an unusual name. “Dependent variable” is more typical.

3. What is the meaning of the phrase “large data size” in line 188? Do you mean the number of variables or the number of cases? If the number of cases, then it somewhat contradicts limitations of the study. Similarly, what is meant by “small data size” in caption to Table 3?

4. “Probability paper plot” is typically called just “probability plot” or “Q-Q plot” (line 227 and others).

5. Have the KOA patients’ and healthy controls’ baseline characteristics been compared by statical tests? If so, such comparisons should be shown.

6. Why p-values are not used as a measure of significance of effects either in multiple regression or ANOVA? The should be reported in supplementary material at least to show if observed effects were significant.

7. Please reconsider graphical presentation in Fig. 1. Although presentation of raw data point is a good practice, in some cases they make your box plots difficult to read. I would recommend presentation that emphasizes the boxes more and points less (the one in Fig. 2 is better). Besides, panels are referenced by letters that are absent in the figure (also in Fig. 2).

8. Line 306 states that “Although the data are not shown, the level of decrease in muscle strength did not depend on the surgical mode but seemed to be simply due to immobility after surgery”. Ideally such data should be shown, at least in supplementary file.

9. Fig. 2, lower panels: vertical axis scale and legend is missing.

10. Caption to table 2: symbol for between-individual variation is different that one used in the methodology section. “SDsex” should probably be replaced by “SDRsex”.

11. Suppl. Fig. 2: test results (but not necessarily figures) showing successful transformation of distributions should be shown for all variables, ideally after partitioning as this is crucial factor for the choice of reference interval calculation method. If groups resulting from partitioning are not numerous enough for Kolmogorov-Smirnov test, then Shapiro-Wilk test can be used.

12. Fig. 3: what is the value of parameter p in left panel? The value of 0, that is given in the figure, would result in division by 0.

13: Line 363: description of non-parametric method of reference interval derivation should be moved to methods.

14. Table 3: tilde (~) symbol is quite confusing in the table. Usage of lower than, greater than and hyphen signs would be clearer.

Reviewer #2: 

The study presents the results of an original research, however some corrections must be taken into account.

- In the abstract, make the results clearer and in the conclusion specify how the information can contribute to the management of surgical management of KOA patients.

- It is necessary that the introduction is more robust, so that the clinical importance of the study is clear.

- There is no need to have a discussion of statistics in the introduction.

-Leave the objective of the study, highlighting the clinical relevance of the study

-Rows 64-66: I suggest deleting the information or placing a reference.

-Line 85: Delete the phrase "we found ir necessary ..." all information should be based only on the literature.

- In the first paragraph of the discussion, present the main results of this study.

-The authors in every manuscript, describe a lot of information and discussions about statist tests, however it is important to facilitate the reading so that clinicians can use these information to carry out evidence-based practice. I suggest making changes in the discussion, clarifying the importance of these findings, and clinical significance.

-Review English

Reviewer #3: 

1. Is the manuscript technically sound, and do the data support the conclusions?

The rationale could be clearer, and while overall the statistical techniques are sound I am unconvinced the data have been adequately discussed. I have some specific criticisms outlined below.

2. Has the statistical analysis been performed appropriately and rigorously?

Overall, the statistical techniques are sound. But I have some specific criticisms outlined below

*3. Have the authors made all data underlying the findings in their manuscript fully available?

No, not all data is available but is upon request to the corresponding author.

*4. Is the manuscript presented in an intelligible fashion and written in standard English?

There are suggested edits which are outlined in the comments to authors below. I commend the authors in producing the manuscript if English is not their first language. Overall the manuscript would benefit from some simplifying and accuracy in language and layout including a final sense-check from someone who writes in English as a first language. I provide some typos and edits below.

*5. Review Comments to the Author

The authors provide a technical investigation of normal recovery profiles in surgical interventions for KOA with respect to function (TUG) and impairment (strength and ROM) measurements that are feasible to measure in the clinic, and the associated effect of factors (age, gender etc) pre operatively and 14-days post operatively. Overall, I found the manuscript difficult to read and I think this is because the rationale was not entirely clear, and there is no clear mapping of objectives to the results and discussion sections. Based on this, I think the manuscript requires re-writing. I offer more detailed feedback below.

Please see attached document for more detailed comments for the authors.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Gareth D. Jones

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: Review of Study_PLoSOne_D-20-18248_for upload.docx

PLoS One. 2021 Apr 14;16(4):e0249564. doi: 10.1371/journal.pone.0249564.r002

Author response to Decision Letter 0


2 Jan 2021

Reviewer #1

Our response ⇒ We appreciate a great deal for your time and effort to review our manuscript and for providing us with invaluable comments to improve our manuscript. We carefully addressed each issue and revised the manuscript accordingly as describe one by one below.

1. Please include information on timing of patients' examinations in the method section. It is for the first time given in the Discussion. Line426

Our response ⇒ We described the timing for the post-surgical measurements of the knee motor functions in the Methods as follows: “The measurements of the following knee motor functions were performed twice in each KOA patients just before and at two weeks after knee arthroplasty.”.

2. Line 181: “objective variable” is an unusual name. “Dependent variable” is more typical.

Our response ⇒ We changed “objective variable” to “dependent variable” as pointed out.

3. What is the meaning of the phrase “large data size” in line 188? Do you mean the number of variables or the number of cases? If the number of cases, then it somewhat contradicts limitations of the study. Similarly, what is meant by “small data size” in caption to Table 3?

Our response ⇒ We meant that testing significance of correlations (like rp) with a large sample size (like n>200), even small rp can be statistically significant. However, we chose to show P values for each rp (as may be expected by the readers). Therefore, we changed the original description “Because of the large data size, the “effect size” of the 189 standardized partial regression coefficient (rp) was set to |rp|≥0.2” to “In addition to expressing the statistical significance of each standardized partial regression coefficient (rp) in P value, we marked its effect size (practical significance) by use of a threshold of |rp|≥0.2”

4. “Probability paper plot” is typically called just “probability plot” or “Q-Q plot” (line 227 and others).

Our response ⇒ We changed the term as kindly advised.

5. Have the KOA patients’ and healthy controls’ baseline characteristics been compared by statical tests? If so, such comparisons should be shown.

Our response ⇒ Following the advice, we newly made Table 1 that listed the demographic information comparing KOA patients and healthy subjects. The statistical testing results were also included in the table.

6. Why p-values are not used as a measure of significance of effects either in multiple regression or ANOVA? The should be reported in supplementary material at least to show if observed effects were significant.

Our response ⇒ We followed the advice and revised the table of multiple regression analysis (MRA) by adding judgment based on P-value for each rp (standardized partial regression coefficient, which corresponds “effect size” the correlatoin).

7. Please reconsider graphical presentation in Fig. 1. Although presentation of raw data point is a good practice, in some cases they make your box plots difficult to read. I would recommend presentation that emphasizes the boxes more and points less (the one in Fig. 2 is better). Besides, panels are referenced by letters that are absent in the figure (also in Fig. 2).

Our response ⇒ We redrew the 1D scattergrams with the central 50% range (box) and vertical median line drawn last so that they won’t be hidden behind the dots. Please note that we added another figure as Fig. 1 and thus original Fig.1 and 2 are now Fig. 2 and 3.

8. Line 306 states that “Although the data are not shown, the level of decrease in muscle strength did not depend on the surgical mode but seemed to be simply due to immobility after surgery”. Ideally such data should be shown, at least in supplementary file.

Our response ⇒ We realized that the above expression was not appropriate as was also pointed out by Reviewr-3, we simply deleted the confusing/speculative remark.

9. Fig. 2, lower panels: vertical axis scale and legend is missing.

Our response ⇒ We thankfully amended Fig. 2 (now Fig. 3) accordingly.

10. Caption to table 2: symbol for between-individual variation is different that one used in the methodology section. “SDsex” should probably be replaced by “SDRsex”.

Our response ⇒ We corrected “SDbi” to “SDindiv”, and “SDsex” to “SDRsex” as pointed.

11. Suppl. Fig. 2: test results (but not necessarily figures) showing successful transformation of distributions should be shown for all variables, ideally after partitioning as this is crucial factor for the choice of reference interval calculation method. If groups resulting from partitioning are not numerous enough for Kolmogorov-Smirnov test, then Shapiro-Wilk test can be used.

Our response ⇒ We followed the suggestion and we added one rightmost column to Table 4 (old Table 3, listing derived RIs) to show Kolmogorov-Smirnov (K-S) test results as P-value.

12. Fig. 3: what is the value of parameter p in left panel? The value of 0, that is given in the figure, would result in division by 0.

Our response ⇒ We are sorry about the confusing Fig. 3. We chose to delete it because of the following reason: The value of ‘p’ (lower-case p) represented “power” used in the Box-Cox power transformation. If p=0.0, it indicates logarithmic transformation, and if p=0.5, square-root transformation, etc. However, the figure was made as an illustrative example and the value of p (0.0, 0.5, and 0.3) was set approximately and true values of p was in Suppl Fig. 1. Therefore, we found it confusing and not worth showing in this manuscript.

13: Line 363: description of non-parametric method of reference interval derivation should be moved to methods.

Our response ⇒ We are sorry for the omission of defining nonparametric method. We added following explanation in the Methods: “When test results of a given parameter failed in the Gaussian transformation by the Box-Cox method or when test results took discrete values, we applied nonparametric method for determining the central 95% range of test results after sorting them, and took the range of 2.5 and 97.5 percentiles as the limits of the RI.”

14. Table 3: tilde (~) symbol is quite confusing in the table. Usage of lower than, greater than and hyphen signs would be clearer.

Our response ⇒ As suggested, we replaced (~) to (-)

Reviewer #2

- In the abstract, make the results clearer and in the conclusion specify how the information can contribute to the management of surgical management of KOA patients.

- It is necessary that the introduction is more robust, so that the clinical importance of the study is clear.

Our response ⇒ We are grateful for your time and kind effort to review our manuscript and to give us invaluable comments. For the first advice, we were pointed out the same problems and thus extensively amended our manuscript especially for the Introduction and Discussion. We made our best to clarify the background and objective of this study and make clear of the clinical implication of our results.

- There is no need to have a discussion of statistics in the introduction.

Our response ⇒ As mentioned above, in the process of rewriting the Introduction, we deleted the statistical issues in the past normative value studies and move them to the Discussion.

-Leave the objective of the study, highlighting the clinical relevance of the study

Our response ⇒ We believe after rewriting, clinical relevance of this study from the perspective of rehabilitation medicine has become clearer by revising the descriptions at the end of both the Introduction and the Discussion.

-Rows 64-66: I suggest deleting the information or placing a reference.

Our response ⇒ We deleted that part of the Introduction as suggested.

-Line 85: Delete the phrase "we found it necessary ..." all information should be based only on the literature.

Our response ⇒ We deleted that description in the process of rewriting the Introduction.

- In the first paragraph of the discussion, present the main results of this study.

Our response ⇒ We are sorry about the inappropriate description at the beginning of the Discussion. We rewrote the first paragraph by recalling the objectives of this study and then briefly wrote about the main results.

-The authors in every manuscript, describe a lot of information and discussions about statist tests, however it is important to facilitate the reading so that clinicians can use these information to carry out evidence-based practice. I suggest making changes in the discussion, clarifying the importance of these findings, and clinical significance.

Our response ⇒ Again, we apologize for poor writing of the manuscript without giving clear focus on the clinical implication of this study. After rewriting, we made it clear that this study originated from physical therapists who are in charge of rehabilitation of KOA patients undergoing knee arthroplasty. We then emphasized the importance of reference intervals of knee motor function tests for improved/objective implementation of peri-surgical rehabilitation, that are supposed to contribute to and predict the long-term success of the surgery.

Reviewer #3

Reference intervals of knee motor function tests for patients undergoing knee arthroplasty

The authors provide a technical investigation of normal recovery profiles in surgical interventions for KOA with respect to function (TUG) and impairment (strength and ROM) measurements that are feasible to measure in the clinic, and the associated effect of factors (age, gender etc) pre operatively and 14-days post operatively. Overall, I found the manuscript difficult to read and I think this is because the rationale was not entirely clear, and there is no clear mapping of objectives to the results and discussion sections. Based on this, I think the manuscript requires re-writing. I offer more detailed feedback below.

Please see attached document for more detailed comments for the authors.

Further Review Comments to the Author The authors provide a technical investigation of normal recovery profiles in surgical interventions for KOA with respect to function (TUG) and impairment (strength and ROM) measurements that are feasible to measure in the clinic, and the associated effect of factors (age, gender etc) pre operatively and 14-days post operatively. Overall, I found the manuscript difficult to read and I think this is because the rationale was not entirely clear, and there is no clear mapping of objectives to the results and discussion sections. Based on this, I think the manuscript requires re-writing. I offer more detailed feedback below.

Our response ⇒ We are indeed grateful for your time and effort to critically review our manuscript and for providing us with invaluable comments to improve our manuscript. We carefully addressed each issue and revised the manuscript accordingly as describe one by one below.

Introduction & Rationale

I sense the focus of the paper is in its utilisation of deploying statistical techniques to derive RIs that addressed the problem of non-normal data distributions. What I think is lacking in the introduction and discussion is a clear clinical rationale that this will help clinical practice. I agree that clinicians would welcome predicted recovery trajectories of typical KOA patients undergoing surgery based on the variables, but fear that these data alone might not affect practice due to the many other factors under consideration that the authors do not acknowledge e.g. fear of movement post-surgery, smoking and lifestyle choices. I could not see a clear statement in the introduction nor discussion that defends why this (albeit large) sample is definitively representative of the patient population in Japan. There is no information detailing the recruitment process for either group or justification for the discrepancy between the two groups sample size. The data do offer a basis on which studies could now be developed to explore why patients might deviate from a “normative” presurgical state and a “normative” recovery, and for that this paper is welcome. But I think the paper could benefit from a specific allusion to this and to how this relates or interacts with “healthy” age-matched adults.

Our response ⇒ We admit the problem of unclear description on our objectives of this study. We rewrote the section at the bottom of the Introduction to describe the background and objective more clearly, with suggestion of practical/clinical implications of this study.

To provide a scope of our subsequent responses, let us briefly list up the backgrounds as follow:

1) This is a research conducted by a team of physical therapists who are routinely involved in peri-surgical rehabilitation of KOA patients undergoing knee arthroplasty (KA) in close collaboration with orthopedic surgeons. [Please note that this study was not originated from orthopedicians].

2) For proper provision of rehabilitation to the patients, it was essential to have objective evaluations of knee-related motor functions before and after KA.

3) However, the normative values (Means and SDs) for the motor functions reported in the past have been determined from test results obtained in healthy individuals by use of out-dated methods. Therefore, it was necessary to determine reference intervals (RIs) using up-to-date statistical methods from test results measured in KOA patients undergoing KA.

4) There are studied that reported the importance of knee motor functions short-term after KA in predicting the long-term success of KA, but no referrable RIs for that specific period exit in the literature.

With these backgrounds, a team of physical therapists conducted this study with the following objectives:

1) For objective use of knee motor function tests in the rehabilitation, to explore their sources of variation (SVs) and to determine their reference interval (RIs), which are specific to KOA patients undergoing KA.

2) To apply up-to-date statistical methods in analyzing the SVs and in determining RIs, which have not been used in the field of rehabilitation medicine in the past,

(3) To compare knee motor functions before and two-weeks after knee arthroplasty, which is of relevance in predicting a long-term success of the surgery.

From these perspectives, we responded to the subsequent comments one by one and revised the section of Introduction extensively.

With respect to the inquiry regarding the representativeness of 583 KOA patients, we recruited them consecutively from 13 major institutions in western Japan by use of a harmonized protocol, and thus we can regard them as reflective of current clinical practice of KA in Japn. Please refer to our response to the “Method” section below in Page 5 of this document.

The introduction states that conservative therapy of KOA is the treatment of choice – I would like to read what conservative therapy is in this context, and what exactly defines epicondylar deformity to inform local surgical decisions so international readers can make a judgement on their own practice. A more explicit description of the clinical indicators for TKA surgery would strengthen the manuscript.

Our response ⇒ We gave explanatory description on the nature of “conservative therapy of KOA” as “Conservative therapy, such as exercise, ultrasonic therapy, electrotherapy, orthosis therapy, is the primary choice”. For the TKA surgeries, we added explicit description of indications as follows: “with development of multiple osteophytes, sclerosis, or narrowing of joint space, knee arthroplasty becomes necessary”

The authors cite 2 previous pieces of work (Bade et al 2014 & Mizner et al 2005) to defend the use of knee ROM, knee flexion/extension isometric strength and TUG by stating “The prognosis after these surgeries depends on the pre-operative and short-term post-operative capability of physical and motor functions” (L62-63). These two references do not in my opinion constitute an adequate defence of this statement by themselves and I would expect a much more compelling argument to be put forward regarding these particular measures seeing as they are the main basis for the paper.

Our response ⇒ We realized that we needed to include more papers to argue the importance of short-term postsurgical motor functions. However, in the process of rewriting the Introduction, we chose not to give detailed description on those papers. Rather we wrote briefly as follows and placed the descriptions near the end of the Discussions:

“Although we could only determine the RIs of knee motor function tests just before and at two weeks after the surgery, those peri-surgical values are of relevance in predicting the long-term outcome of KOA patients as reported by Dobson et al. [4]. The importance of pre-surgical knee motor functions in predicting the long-term recovery was also reported by Bade et al. [12] and Mizner et al. [13]. Furthermore, Bade et al. [12] and Zeni et al. [14] showed that values of TUG, knee muscle strength and knee flexion/extension ROM at short-term after surgery were important in predicting later recovery in daily activity. Therefore, we believe that the RIs derived in this study are of direct relevance in identifying patients whose motor functions deviate from those of others during the peri-surgical rehabilitation. Prompt attention and corrective measures would lead to improved speed of recovery from the surgery as suggested by Bade et al. [39] and Stevens et al. [40].”

References are missing for the section (L54-62) “Among the 25.3 million patients with KOA, 8 million have symptomatic disease, which poses a large socio-economic problem in Japan. KOA features degeneration and attrition of joint structure that result in ossification of the cartilage and surrounding tissue, which restricts knee movements and eventually leads to pain and gait disturbance. Conservative therapy is the primary choice, but with the development of deformity in one or both epicondyles, either unicompartmental knee arthroplasty (UKA) or total knee arthroplasty (TKA) has been the therapeutic regimen of choice. Recently, a less invasive regimen called minimally invasive surgery TKA (MIS-TKA) has become popular as an alternative to conventional TKA (C-TKA)”.

Our response ⇒ We are sorry about the inappropriate citation of the paper on epidemiology of KOA in Japan at the first paragraph of the Introduction. The descriptions were all based on a single paper [1]. We rephrased the parts as follow: “The prevalence of knee osteoarthritis (KOA) is as high as 42.6% in men and 62.4% in women above 40 years of age as reported in a large cohort study conducted in Japan in 2011 by Yoshimura et al. [1]. The report also disclosed that among the 25.3 million patients with KOA, 8 million have symptomatic disease, which poses a large socio-economic problem in Japan”.

The authors allow an entire paragraph (L67-78) listing previous TUG normative work based on healthy adults citing mean and SD values. Further explanation as to why this is specifically relevant to this study would be beneficial. I note though that the normative TUG meta-analysis work by Bohannan is missing from these data, and indeed from the whole paper (Bohannon, 2006). This is strange as not only did this work take various human ethnicities into account, it also included data from disparate TUG instructions – mainly to perform TUG at “comfortable” or “self-selected” pace or “quickly”. Despite these disparities in method the observed summary data was homogenous in Bohannon’s work.

Our response ⇒ We thanks for pointing us out of the important paper. However, in the process of rewriting the Introduction to clarify our background and objectives in a concise manner, we deleted most of the past papers on normative values of knee functions. We rather moved them to the Discussion and thankfully cited the key paper of meta-analysis by Bohannon.

L79 – if a sample distribution is Gaussian I agree that a sample 95%RI could be expressed as mean ± 1.96SD, but would a population estimate be more useful (i.e. mean ± 1.96SE) yielding a mean and 95%CI?

Our response ⇒ We understand that, if we are interested in comparing means between subgroups, the estimation of mean ±1.96SE is useful for the purpose, because SE represents standard error of mean: i.e., by-chance variability of mean. However, to determine the RI, it is necessary to know the variability of individual points (SD), not the variability of means (SE). If the distribution of values is Gaussian, we could calculate the RI as mean ± 1.96SD. If not, we first need to convert values to make the distribution Gaussian before calculating the RI.

As an aside comment - I was interested to note that TUG undertaken at a comfortable and maximal tempo was included in the current study. The authors did not as far as I could see justify why this would be helpful and did not explain if the two variants in instructions were undertaken randomly. I think dealing with these two criticisms would benefit the manuscript. I am guessing the debate about maximum versus comfortable comes from the disparity in instruction language. According to some literature, for example (Kamide et al., 2011), the original TUG paper (Podsiadlo and Richardson, 1991) instructed participants to perform the TUG at a usual or comfortable pace, whereas a later protocol (Shumway-Cook et al., 2000) asked participants to perform TUG with maximum effort. This is an understandable but, in my opinion, incorrect interpretation of there being disparate protocols and I think alluding to this disparity is unhelpful.

This is because first, the Shumway-Cook paper asked participants to perform TUG “quickly and safely”, not “with maximal effort” nor “as fast as possible”. Second, I think Shumway-Cook and colleagues included a temporal qualifier (“quickly”) simply because participants were instructed to perform TUG under multiple dual-task conditions in their experiment. It is of course possible that within-subject TUG performance can be influenced by instructions to perform the task either comfortably or maximally, so there is a practical reason to include both protocol phenotypes in determining reference values as did the authors in the paper under review but only if there is a debate in the community about how to instruct the TUG. My interpretation is there is no such debate among clinicians. Even so, if a strong case could be constructed in this manuscript, then I would expect the two protocols to have been undertaken by an independent group or sufficiently apart and randomly by the same individuals to avoid any interaction effects. In contrast, it is interesting that forward average walking velocity was only measured at maximal tempo in the authors’ paper. Why not comfortable pace as well?? I would also like to see the authors defend the use of maximal walking velocity within the context of average self-selected walking velocity parameters (step width and length) being observed to be less variable than velocities under maximal volition (Sekiya et al., 1997).

Our response ⇒ We understood the importance of distinguishing “comfortable TUG” from “maximum TUG”, which depends on the nature of instructions to be given to the examinees, which tend to differ depending on the researchers. Accordingly, we clarified in the Methods which type of TUG test was used in each paper we cited. In our study, we performed TUG both at comfortable speed as originally proposed by Podsiadlo and at the maximum speed as recommended by Shumway-Cook. On the other hand, for measuring walking speed, we just adopted the maximum speed because Dbson et al. recommended it due to higher precision of measurement. We cited those references in describing TUG and MWS.

Overall, I think the introduction lacks some clarity and would benefit from a more compelling argument for why this work is needed, and what it aimed to add for clinical practitioners or researchers.

Our response ⇒ As we described above, we rewrote the Introduction according to the helpful comments.

Method

Patients – the sample size is large (n>500), which is an advantage, across 13 institutions which is a disadvantage without some description of differences in patients and clinical practice between them so readers can make an informed decision about the validity of the study. For example, while the inclusion criterion of indication for surgery is sensible it would be good to know if the different institutions deployed similar clinical reasoning in their decision making and whether patients were included on that decision or not.

Our response ⇒ As we responded above, we regard the 583 KOA patients [Note: the number was originally 545 which was after applying exclusion criteria] undergoing knee arthroplasty (KA) are representative of the current practice for KA in Japan because we recruited them consecutively from 13 major institutions located in a wide area of western Japan. The protocol of the study was elaborated in harmonized ways. We modified our description on the patients as follows:

“A total of 583 KOA patients undergoing elective knee arthroplasty were recruited consecutively between July 2013 and February 2018 by use of harmonized study protocol from 13 institutions specialized in knee arthroplasty, which are scattered widely in western Japan.”

The allusion from L107 that a standard clinical rehabilitation is shared across the institutions is welcome, but could be moved to be more prominent within the paragraph. Additionally, I would like to see a better breakdown of what the specification of the rehabilitation programme is here in addition to L109-112, either tabulated or in supplementary material, or signposted for the reader if it exists later in the manuscript. It needs to include dosing parameters and specification of any rehabilitation beyond simply practicing functional tasks.

Our response ⇒ According to the suggestion, we gave a little more detailed account of the common clinical pathway as follows:

“The common three-week program composed of the followings: The first post-surgical week : exercise for knee joint range of motion (ROM), muscle training for gluteus maximus and gluteus medius, sit-up and stand-up training; the second week: muscle training for quadriceps T-cane walking; the third week: staircase training, daily activity practice, and instructions toward discharge.”

There is no indication to when recruitment or data collection started, how long data were collected for, or how much missing data there were. I would expect these to be included for completeness. Why was the long-term follow-up of patient difficult? Were patients recruited affected by the COVID pandemic, did they have to self-isolate / were they deconditioned prior to surgery?

Our response ⇒ We described the period of recruitment as “between July 2013 and February 2018”. The long-term follow-up of patients after surgery was generally difficult because most cases tended to be followed at a local orthopedic clinic. COVID pandemic had nothing to do with the situation.

I would to see how many eligible participants did not consent to be included in the study for completeness. L99 – the exclusion of co-morbid neurological sensorimotor dysfunction – this I think needs to be more specific e.g. acquired focal neurologic, or chronic deteriorating neurologic condition affecting individuals’ ability to walk.

L100 – exclusion criterion 2) I think could be made clearer, and criterion 3) would be simpler described as cognitive impairment.

Our response ⇒ According to the suggestion we modified the description of the exclusion criteria as “The exclusion criteria were 1) presence of motor paralysis or other neurological dysfunctions such as stroke, DM peripheral neuropathy, unrelated to KOA, 2) obvious motion pain or restricted joint movements in locations other than the knee, which would hinder walking or the sit-to-stand motion, and 3) cognitive impairment”.

L115 – how were the healthy volunteers recruited? I would like to see more detail about the healthy participant inclusion criteria 1) and 2) for completeness.

Our response ⇒ We added the following descriptions “We recruited 120 apparently healthy volunteers ….. mostly from elderly clubs registered in a municipal health promotion office”

L123 – the research design sentence could be made more succinct.

Our response ⇒ Following the suggestion we made the description succinct as “This study corresponded to a prospective cohort study for establishing RIs of knee motor functions specific to KOA patients”

Measurements

I would expect to see the K-L classification referenced please.

L136 – while the same chair was used for TUG (in all institutions?), its height and whether it had arms would be useful information here for completeness. What was the go signal - Audible? Visual?

Our response ⇒ We thanks for pointing out the problem. We added the following two references regarding the K-L classification: Kellgren JH, Lawrence JS. Radiological assessment of osteo-arthrosis. Am Rheum Dis 1957; 16:494-502. and Kessler S, Guenther KP, Puhl W. Scoring prevalence and severity in gonarthritis: the suitability of the Kellgren & Lawrence scale. Clin Rheumatol 1998; 17:205-209.

For the method of TUG test performed using a common protocol, we added the following descriptions in the Methods: “As a common specification, the height of the chair seat without arm-rest was set to 40−45cm. On the vocal “go” signal, the examinee got up from the chair,….”

L141 – as per above, I am not convinced that Shumway-Cook specified “maximum” tempo in TUG. The MWS excluded acceleration and deceleration phases which is well explained. Like TUG, the go signal for MWS specifics are unclear and should be included, I think.

Our response ⇒ As mentioned above, we adopted TUG at the maximum tempo by citing the paper of Shumway-Cook in addition to TUG at comfortable pace. This is because both methods are clinically in use in Japan. However, we feel that test results of TUG at the maximum tempo (TUGmax) are more reproducible. For the same reason, in measuring walking speed, we regard that MWS has a higher reproducibility, and thus we adopted MWS in this study.

There is lack of detail within the functional measurement methods section to both justify the approach, allow repeatability, and defend any validity risks. An inclusion of this detail would be beneficial and strengthen the paper. For example, there is no allusion to the reliability or validity of the hand-held dynamometer technique and simply referencing the previous paper (Katoh and Yamasaki, 2009) is not enough. Can the authors account for the quality of the data they used? how many different researchers undertook the measurements? For the muscle testing it was unclear to me whether the ankle was fixed passively to a wall as shown in the Katoh paper, or did the researcher fix the tibial segment manually? A figure might help here. What verbal encouragement was offered to the participants? Was it standardised? What is the justification of taking the mean of 2 (and not more) knee extension muscle contraction attempts? Can the authors account for a first trial response in their data (Allum et al., 2011)? L169 – knee flexion strength is measured in the same manner as knee extension but there is not an allusion to the mean of two attempts for knee flexion strength. I would encourage consistency and clarity in the document. I am unconvinced that the torque calculations could be repeated as it is unclear where the measurement of the dynamometer was taken with reference to the knee joint – i.e. was it the top, bottom, centre-line, other part of the dynamometer?

Our response ⇒ According to the suggestion, we added references on the reproducibility of muscle strength measurement reported by Katoh and Yamasaki (2009), and its validity by Katoh and Yamasaki (2011). With respect to between-institution variability, we believe it is negligible due to generally small between-examiner variability of measurements as reported in those papers. The reason for having measured twice was a higher variability in elderly females as reported by Katoh (2010).

As per advice, we made a new figure with photos (Figure 1) to demonstrate actual procedures to perform the measurements. The position in attaching the HHD sensor was described as follows: “The lower leg length was measured as a distance between the distal end of the lower leg (where the center of the HHD sensor was placed) and the lateral cleft of the knee joint”. More detailed descriptions were also given as legends to the Figure.

The method for measuring knee ROM is extremely thin and should not rely on a reader having to read the reference supplied. It needs to be completely rewritten and expanded for completeness. There is no allusion to what landmarks the femur and tibial segments are designated by nor where the approximation of the knee joint centre is. There is also no allusion in the manuscript to reference positions i.e. is a straight leg 180 degrees extension or 0 degrees flexion? I would expect this detail to be at least included in the supplementary material.

Our response ⇒ We are sorry for the insufficient descriptions of the measurement procedure for ROM. We added description as follows for reproducibility of the measurements: “The angle of the long axes of the tibia (on a line connecting greater trochanter and lateral epicondyle of femur) and the femur (on a line connecting fibula head and lateral malleolus) was measured in units of 5 degrees.”

Statistical Methods

I would like to see some justification about why the candidate factors were selected either by previous work, the authors’ experience, a logical argument, or other in the regression model. The factors selected seem sensible, but why these and not others? As it stands, I am not convinced why e.g. chronicity of OA, smoking, educational level, or living alone were not included as factors. The justification for selecting a rp≥0.2 simply because it is midway between the subjective small and medium correlation effect sizes offered by Cohen (Cohen, 1992) is by itself a weak justification in my opinion; I would prefer to see more justification of effect based on the context of the functional measures undertaken in this paper not just on a reported, yet essentially arbitrary, selection of numbers.

Our response ⇒ Regarding the selection of explanatory variables, we included all available variables in the regression model. However, we did not record other variables mentions. We wrote about this situation as follows: “The following parameters that we could obtain from all subjects were evaluated as explanatory variables: sex, age, BMI,…”.

For the use of |rp|≥0.2 as our threshold of “effect size”, we simply regarded Cohen’s 0.1 as too weak, and 0.3 as too conservative, and thus arbitrarily took the value in between. However, because most readers are used to P-values in judging significance, and because of the request of Reviewer-1, we added P-value in the regression table to demonstrate the level significance of rp values. At the same time, we modified our descriptions on the effect size as follows:

“In addition to expressing the statistical significance of each standardized partial regression coefficient (rp) in P value, we marked its effect size (practical significance) by use of a threshold of |rp|≥0.2, which corresponds to a midpoint effect size of Cohen’s small (0.1) and medium (0.3) for correlation coefficient [17]”

The rest of the statistical processing reported appears to be sound and referenced appropriately. I have one query as to why the 2 specific methods of transformation for MRA and SDR calculations were deployed for the TUG and strength data? In calculating the RIs, why was a Box-Cox transformation undertaken for all data without reporting on its normality distribution first? If there are any weaknesses or controversies in the statistical approaches used, then I would like to see them discussed, or at least acknowledged, in the discussion section.

Our response ⇒ We thank you for raising the important issue of when to transform response and/or explanatory variables before applying multiple regression analysis (MRA) or ANOVA (for computing SDR). There is no consensus criterion on “when to” and “how strictly to” transform values to Gaussian. General practice is to use Box-Cox transformation by selecting appropriate power (‘p’-value). When values are highly skewed, logarithmic transformation (p=0) is commonly used, and when moderately skewed, square-root (p=0.5) or cubic root (p=0.33) transformation are commonly used. However, a minor difference in the selection of power does not cause much difference in MRA results [Please refer to the Appendix at the last two pages of this document].

On the other, when we need to determine the central 95% interval strictly by use of parametric method, we must strictly transform value into Gaussian to use mean±1.96SD as the RI after reverse-transformation. This is the reason why the power-value (p) strictly predicted for deriving RIs as shown in Suppl Fig. 1 differs from the power (p=0.0 or 0.5) empirically used in performing MRA and ANOVA.

If strict prediction of p to attain Gaussian transformation is required for each variable, it should be impossible to perform MRA or ANOVA for most researchers.

The ethics paragraph is clearly written and welcome, it could be moved to the top of the methods section. Suggest change “We provided participants with a comprehensible document explaining this study…” (L234) to “We provided a written explanation of this study in language that was understandable to participants… “.

Our response ⇒ We followed the advice and moved the section upwards and rephrasing the description.

Results

L248 – I would like to know if the healthy control group were statistically different to the patient participants based on the observed demographics.

Our response ⇒ We followed the suggestion and prepared a new Table (Table 1) to show demographic comparison between patients and healthy subjects, including results of statistical testing for between-group differences. We added the following description in the Results: “Differences between healthy subjects and KOA patients were tested by Mann-Whiney test for numerical variables and chi-square test for categorical variables. Although actual differences in sex and age distributions were small, BMI was notably higher and proportions of those who exercise regularly were much smaller among KOA patients.”

Notwithstanding the criticisms already noted, Table 1 and 2 are very clear and their captions thorough.

Our response ⇒ We are pleased to know that the tables are accepted in the current form.

Figure 1 & 2 are a nice visual representation of these data. I note the data are presented transformed and wonder if the non-transformed data might be more meaningful to the reader and justify the use of the median and the inter-quartile range. Not sure why the ROM data are in histograms, the x-axis is already accounting for discreteness in the data.

Our response ⇒ Values for TUGs and MWS are highly skewed with more tailing of values to the higher side. Therefore, without transformation, the data points in the lower side are packed together to identify individual points. In showing median and IQR, we found there was mistake in drawing the statistics, and corrected them in the revised figure.

As for our selection of histogram for showing the distribution of ROM data, the values are discrete in nature (measured in every 5 degree angle) and thus it was not appropriate to use a 1D scatter plot that resulted in vertical stacking of dots at each discrete point of observation.

L370 – “The RIs derived are listed in seven blocks in Table 3. Please note that RIs were derived by a parametric method for motor function tests except for extension and flexor ROMs, for which nonparametric methods were used.” I may have missed it, but the non-parametric version is opaque in the methods section.

Our response ⇒We are sorry about the omission of the method used in deriving the RI non-parametrically. We added the following explanation: “When test results of a given parameter failed in the Gaussian transformation by the Box-Cox method or when test results took discrete values, we applied nonparametric method for determining the central 95% range of test results after sorting them, and took the range of 2.5 and 97.5 percentiles as the limits of the RI.”

Overall, I found the results section too long and suggest some editing to lighten the load on the reader.

Our response ⇒ Following the suggestion, we reduced texts in the Results by 19 lines, although we added 9 lines of new texts for improved legibility

Discussion

The first 2 paragraphs outlining the background is too long. I think it should simply remind the reader of the objectives, a brief justification of why this is important and/or unique, and perhaps the main findings.

L413 – “Factors associated with presurgical motor functions” section. This alludes to TUG but not which phenotype. Overall this section merely repeats the results which are not particularly compelling and offers no discussion point. As it is, I would remove.

Our response ⇒ We realized that distinction of TUG was omitted. We amended the problem in the subsection.

In contrast L425 section offers more discussion, but I would remove repeating results with no discussion point.

Our response ⇒ Unfortunately, we found it difficult to remove the discussion on “Factors associated with post-surgical motor functions” with need of citing related studies. We rather removed all the subheading within the Discussion to reduce the text volume, and also trimmed some of texts to make the paragraphs a little more concise.

L437 section alluding to healthy data – this should be a major discussion point, but as outlined above should reference the background where a defence of the need to present data of typical patient populations peri-operatively is required in order to quickly recognise outliers who might require more tailored management following their surgery.

Our response ⇒ Thanks for highlighting the important message of our study written in that paragraph of the Discussion. We modified our description to emphasize our point of “the irrelevance of healthy subjects’ knee function data for our purpose, and the need for the RIs specific for KOA patients in the management of peri-surgical rehabilitation”.

L446 – “Final decision on partitioning”, and L475 “Reliability of the parametric method ..”section, again these sections could do with some editing and more discussion. It is difficult to clearly map the authors’ objectives for the paper with the results and discussion sections –there were too many surprises in these two sections I was not prepared for making it difficult to read.

Our response ⇒ We recognized that two sections were difficult to read through, and thus amended the descriptions and trimmed some texts to make our point clearer.

L497 – “Clinical utility of RIs …” section. Sadly, this is the last, smallest, and rather meagre discussion section. Here I was desperate to see the authors discuss the RIs data. Instead this section lacks anything except that the RIs have been reported. This is a major omission in my opinion and completely under sells the main point of your work.

Our response ⇒ We admit this problem clearly as we mentioned earlier (the 2nd page of this document), we fully rewrote this block to made it clear that this study originated from physical therapists who are in charge of rehabilitation of KOA patients undergoing knee arthroplasty. We also emphasized the importance of reference intervals of knee motor function tests for improved/objective implementation of peri-surgical rehabilitation, that are supposed to contribute to and predict the long-term success of the surgery, by citing relevant references.

I would like to see much more discussion of the data and the fact that the authors could only follow-up patients 14 days post-surgery due to either lack of resources or because follow up care was too complex. This is alluded to in the limitations section, but why was this design undertaken if this limitation was known a priori?? The conclusion lacks any examples of how the RIs, despite their limitations, could be used objectively in practice by clinicians and researchers.

Our response ⇒ As we wrote at the beginning of our responses, this study was conducted by a team of physical therapists in charge of peri-surgical rehabilitation of KOA patients, who cover the period of before and up to three weeks after the surgery. Therefore, it is difficult to monitor knee motor functions long enough to determine the effectiveness of knee arthroplasty, which was out of our control.

Finally, there were some trivial edits outlined below to take or leave.

Our response ⇒ We appreciate a great deal for all the kind editing below of our texts throughout the manuscript for improvement. We amended all of them except for the ones that were deleted in the process of revision.

L27-28 The description of the ‘objective peri-surgical management’ in the abstract would be better written as ‘crucial for determining the effectiveness of KA surgery’ or ‘crucial part of objective measures’

L50 ‘objective peri-surgical management of KOA’ might be better as ‘determining the effectiveness of KA surgery in patients with KOA.’

L55 ‘KOA features degeneration and attrition of joint structure that result in ossification of the cartilage and surrounding tissue, which restricts knee movements and eventually leads to pain and gait disturbance.’ might be better ‘KOA features degeneration and attrition of joint structure that result in ossification of the cartilage and surrounding tissue. This restricts knee range of movement causing pain and gait disturbance (expect a reference here).

L58-60 ‘Conservative therapy is the primary choice, but with the development of deformity in one or both epicondyles, either unicompartmental knee arthroplasty (UKA) or total knee arthroplasty (TKA) has been the therapeutic regimen of choice.’ Might be better ‘Conservative management is the primary treatment for KOA. However, a TKA / UKA is suggested if the patient fails conservative management’ - plus other clinical indictors previously discussed.

L63 ‘pre-operative’ would be better ‘pre-operative status’

L85 ‘With this background’ could be changed to ‘considering this’

L130 ‘A query was made on the status of regular exercise, and we regarded it positive when 30 minutes or more exercise was performed twice weekly or more.’ Would be better written as ‘Regular exercise was defined as exercise performed a minimum of twice weekly for 30 minutes or more’

L135 * L143 probably not use the acronyms in these titles and use full measurement names

L146 might read better as ‘signal’ rather than ‘sign’

L179 I think ‘a’ is missing before ‘multiple regression’, or alternatively plularise

L190 ‘As a factor for partitioning values in deriving RIs, we considered sex, age, BMI, and surgical mode as candidates.’ Might read better as ‘factors which may influence the RI’s include sex, age, BMI, and surgical mode.’

L241 regular exercise description is in the methods and is duplicated here, consider removing

L305-306 ‘It is obvious that muscle strength is generally lower in the women’ suggest change to ‘extensor muscle strength post-operatively is reduced in females’

L345 suggest change from ‘larger’ to ‘greater’

L350 suggest change ‘Because’ to ‘As’

L370 suggested change from ‘Please note that RIs were derived by a parametric method for motor function tests except for extension and flexor ROMs, for which nonparametric methods were used. To ‘It is important to note that RIs were derived by a parametric method for motor function tests except for extension and flexor ROMs where nonparametric methods were used instead.’

L394 suggested change from ‘and thus they take a long time to reach a favorable level’ to ‘and thus they take a long time to reach their optimum’

L394-395 change from ‘Therefore, it is important to detect patients who are lagging behind in the recovery process.’ to ‘Therefore, it is important to detect when patients are not aligning with expected recovering profiles in a timely manner.’

L398-402 is a long sentence to parse. Suggest breaking up.

L404 suggest change ‘ideal’ to ‘important’

L430 ‘was’ change to ‘were’

L438 and 441 change ‘suppressed’ to ‘reduced’

L447 suggested change from ‘In judging the need for partitioning of the test results, we relied on SDR.’ ‘we relied on SDR to judge the need for partitioning of the test results.’

L455 suggest change ‘but’ to new sentence ‘However,’

L460 suggested change from ‘There was a similar issue for’ to ‘There were’ and pluarise

L487 suggest change ‘obviously’ to ‘clearly’

L494 suggested change from ‘again’ to ‘furthermore or nevertheless’

L507 ‘data size’ suggested change to ‘the sample size’

L509-510 ‘For the long-term management of KOA patients who undergo knee arthroplasty, lower-limb motor function should be monitored for many months’ suggested change ‘As part of the long-term management of patients who undergo knee arthroplasty, lower-limb motor function scores should be monitored.’ Also suggest adding in why this would be beneficial would strengthen the manuscript.

L500-501 ‘To the best of our knowledge, there has been no such a report in the past.’ Suggest to remove as repeated in the conclusion

*6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

Yes

*Confidential to Editor

I have no competing interests.

References

Allum, J.H., Tang, K.S., Carpenter, M.G., Oude Nijhuis, L.B., and Bloem, B.R. (2011). Review of first trial responses in balance control: influence of vestibular loss and Parkinson's disease. Hum Mov Sci 30(2), 279-295.

Bohannon, R.W. (2006). Reference values for the timed up and go test: a descriptive meta-analysis. J. Geriatr. Phys. Ther. 29(2), 64-68.

Cohen, J. (1992). A power primer. Psychol. Bull. 112(1), 155-159.

Kamide, N., Takahashi, K., and Shiba, Y. (2011). Reference values for the Timed Up and Go test in healthy Japanese elderly people: determination using the methodology of meta-analysis. Geriatr Gerontol Int 11(4), 445-451.

Katoh, M., and Yamasaki, H. (2009). Comparison of Reliability of Isometric Leg Muscle Strength Measurements Made Using a Hand-Held Dynamometer with and without a Restraining Belt. J Phys Ther Sci 21(1), 37-42.

Podsiadlo, D., and Richardson, S. (1991). The timed "Up & Go": a test of basic functional mobility for frail elderly persons. J. Am. Geriatr. Soc. 39(2), 142-148.

Sekiya, N., Nagasaki, H., Ito, H., and Furuna, T. (1997). Optimal walking in terms of variability in step length. J. Orthop. Sports Phys. Ther. 26(5), 266-272.

Shumway-Cook, A., Brauer, S., and Woollacott, M. (2000). Predicting the probability for falls in community-dwelling older adults using the Timed Up & Go Test. Phys. Ther. 80(9), 896-903.

Attachment

Submitted filename: Responses to Reviewers comments.pdf

Decision Letter 1

Alison Rushton

27 Jan 2021

PONE-D-20-18248R1

Determination of reference intervals for knee motor functions specific to patients undergoing knee arthroplasty

PLOS ONE

Dear Dr. Ichihara,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

ACADEMIC EDITOR:

The reviewers feed back positively on the changes that have been made to improve the manuscript but still raise a number of issues. Please address the reviewers further comments to enable greater clarity within the manuscript.

==============================

Please submit your revised manuscript by Mar 13 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Alison Rushton

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

Reviewer #3: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: 

The comment from previous review have been addressed, but some minor issues have arisen in newly added sections.

They are listed below with line numbers refering to text with tracked changes.

Line 226: misspelled name „Dbson” instead of Dobson

Lines 258-260: description of tibia and femur axis seem to be swapped; description for tibia fits to femur and reversely description for femur fits to tibia.

Table 1, row about sex: plural “men” should be written instead of singular “man”.

Lines 396-400: First, an information about extensor strength reduction in women is given. Next, it is repeated for both sexes, which is true, but there is no need to state it twice.

Line 516: strictly speaking, no data from Table 4 directly prove that distribution was not Gaussian before transform.

Reviewer #2: 

Although the authors made most of the suggestions, the objectives, results presented and discussions are still not clear for the reader to follow a clinical reason.

I suggest that the authors write the clinical meaning for each result found in the discussion.

Reviewer #3: 

The authors have improved the manuscript based on all reviewers’ comments. The rationale is clearer and the statistical techniques are better described now

Overall, I still find the manuscript difficult to read and I contend it could be made clearer and simpler with better mapping of the stated objectives to the results.

I think the authors want to say that previous reporting of RIs in predominantly healthy subjects are flawed statistically and offer methods as an alternative approach which is a valid purpose. Reporting RIs in healthy subjects using the alternative statistical technique might be the sole purpose of a separate paper. They then suggest that the reporting of specific motor functions (TUG, max walking velocity, leg ROM, & specified lower limb strength) is relevant. I am still to be convinced by the authors’ manuscript that they recognise that these measures are examples of proxies of physical functional constructs and that there exist other measures – so why these specific ones and not others, and why measured this way?? Again, this debate might provide a separate manuscript worthy of publication.

Regardless of this criticism, they go on to provide RI data for these measures which is in itself sensible and useful for discriminative use in clinical practice or research given the statistical flaws described in previous work. They content that these data are useful in clinical Japanese practice as a means to determine if a patient is starting from a deranged functional profile pre-surgery, or is deviating from a normal recovery trajectory after surgery based on the functional measures selected.

But, this reader is not convinced the authors have justified that their sample represents a typical patient sample. This is mainly because they openly admit they were limited in their design to determine whether their sample reached a pre-determined threshold of functional recovery longitudinally i.e. whether the longer-term outcome of their surgery was successful. We simply do not know if this is the case, and therefore there is doubt their RIs represent a compelling acceptable range. This needs to be better acknowledged in my opinion.

All in all, I think the authors have a made a decent attempt effort to incorporate and/or rebut the 3 reviewers’ comments. While the manuscript is improved, it still requires some revision. It could benefit from breaking into clear sections or even be incorporated into two manuscripts in my opinion.

Specific points and minor Edits (line numbers refer to tracked-changed manuscript version):

L82 onwards - Explain what the differences between the different surgical types in the introduction after L82.

L43 -Change ‘as controls’ to ‘a control group of 120 healthy elderly volunteers’

L72 - Change ‘by Yoshimura et al.’ to ‘Yoshimura et al. (2011) reported in a large cohort study the prevalence of knee osteoarthritis (KOA) is as high as 42.6% in men and 62.4% in women over 40 years old across Japan.

L85-88 - Change ‘For proper management and smooth recovery through rehabilitation, it is important to monitor knee motor functions [4] such as timed-up-and-go (TUG) test, maximum walking speed (MWS), knee muscle strength, and knee range of motion (ROM).’ To ‘It is important to use objective tests such as timed-up-and-go (TUG) test, maximum walking speed (MWS), knee muscle strength, and knee range of motion (ROM) to monitoring performance post operatively [4].’

L113 - Change objective 1 to ‘to explore sources of variation (SVs) and determine their reference interval (RIs) of objective tests used to measure knee motor function within KOA patients undergoing knee arthroplasty’

L116 - Change objective 2 ‘To be the first study to explore statistical methods for analysing the SVs and determining the RIs.’

L118 -Change objective 3 to ‘To compare knee motor functions before and two-weeks after knee arthroplasty as a predictor of long-term success of KA surgery.’

L124 - Change ‘A total of 583 KOA patients undergoing elective knee arthroplasty were recruited consecutively between July 2013 and February 2018 by use of harmonized study protocol from 13 institutions specialized in knee arthroplasty, which are scattered widely in western Japan.’ To ‘A total of 583 KOA patients undergoing elective knee arthroplasty were recruited from 13 institutions across Western Japan that specialized in knee arthroplasty between July 2013 and February 2018’.

L140-149 – The objectives are clearer now. But similar to previous criticisms of a lack of a clear mapping of objectives to the data collected - I do not see why a group of healthy subjects was recruited with these objectives in mind – the justification was not made clear to me in the methods section either. I had to read ahead to L395 in the results for a first mention of how the healthy data was used and why.

L161 – Define DM in full if using; but surely peripheral neuropathy of any aetiology would be the exclusion criteria not only diabetes?

L165 – “… enrolled were 545 subjects.” Suggest change to “…545 subjects were enrolled.”

L173 – Change “followings” to “following”

L211 – Change “… KOA patients …” to “… KOA patient …”

L214- Suggest change “… on a chair …” to “… on an armless chair with seat-height set between 40 and 40cm high …” and delete the next sentence.

L258-260 – I think the descriptors of the tibial and femoral segments need changing around.

L275 – “… in P value …” is better as “ … as a P value …”

L277 – pluralise “… coefficient …” I think

L317 – suggest change “… we applied nonparametric …” to “… we applied a nonparametric …”

L339 – “… Mann-Whiney…” should be “… Mann-Whitney …”

L340 – I think pluralise the word “test” when it appears in this sentence, and these analyses should be in the methods section.

L340-342 - Allusion to, or interpretations of, “small” or any statistical differences should be included in the discussion section – the results should merely report the results I think

L408 – Table 3, there is no adjacent mention of Table 3 in the text here, I had to look to L424 in the next section to find it, Table 3 needs better placement in the text

L506 – suggest change “… proper …” to “ … optimal …” or similar

L580 – I am not sure you need “on the other hand ….” At the start of the sentence here

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Gareth D. Jones

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Apr 14;16(4):e0249564. doi: 10.1371/journal.pone.0249564.r004

Author response to Decision Letter 1


16 Feb 2021

Reviewer #1

Our response ⇒ We greatly appreciate your review of our manuscript and the invaluable comments provided to improve it. We addressed each issue and revised the manuscript accordingly as described below.

1. Line 226: misspelled name „Dbson” instead of Dobson

2. Lines 258-260: description of tibia and femur axis seem to be swapped; description for tibia fits to femur and reversely description for femur fits to tibia.

3. Table 1, row about sex: plural “men” should be written instead of singular “man”.

4. Lines 396-400: First, an information about extensor strength reduction in women is given. Next, it is repeated for both sexes, which is true, but there is no need to state it twice.

Our response ⇒We corrected the errors as suggested.

5. Line 516: strictly speaking, no data from Table 4 directly prove that distribution was not Gaussian before transform.

Our response ⇒ We thank you for pointing out this mistake. We removed the following phrase from Table 4: “the distribution of TUG values is not Gaussian as is evident from our Table. 4 and Suppl Fig. 1”

Reviewer #2

- Although the authors made most of the suggestions, the objectives, results presented and discussions are still not clear for the reader to follow a clinical reason.

I suggest that the authors write the clinical meaning for each result found in the discussion.

Our response ⇒ We are grateful for your review of our manuscript again. We realized the problem of describing our objectives of this study comprehensibly in the Introduction and the lack of our statement of clinical implications of this study in the Discussion. We concisely described the objectives as follow:

“With these backgrounds, we conducted a multicenter study to evaluate sources of variation of knee motor functions and to establish their RIs specific for KOA patients undergoing KA.

We aimed at using the quantitative information for objective implementation of peri-surgical rehabilitation and for screening patients who are deviated from the normal course of recovery.”

We added the clinical implication of this study as follows at the end of the Discussion:

“In summary, a clinical implication of this study is as follows. Since we only enrolled patients who followed the normal course of recovery, the quantitative information of RIs and their factors of variations obtained in this study will facilitate peri-surgical rehabilitation customized to patients according to sex, age, and surgical mode. Furthermore, the subgroup specific RIs will be useful in detecting patients who are deviated from the normal course of recovery.”

Reviewer #3

The authors have improved the manuscript based on all reviewers’ comments. The rationale is clearer and the statistical techniques are better described now

Overall, I still find the manuscript difficult to read and I contend it could be made clearer and simpler with better mapping of the stated objectives to the results.

Our response ⇒ We are grateful for your time and efforts to review our manuscript again in great detail to give us a chance to improve it.

First of all, we realized that the following three descriptions of our objectives in the Introduction were still incomprehensible and misleading and thus caused further confusion.

“With these backgrounds, we conducted this study as a team of physical therapists to provide optimal peri-surgical rehabilitation to KOA patients undergoing knee arthroplasty in collaboration with orthopedicians. Our specific objectives were:

(1) For objective use of knee motor function tests in the rehabilitation, to explore their sources of variation (SVs) and to determine their reference interval (RIs), which are specific to KOA patients undergoing knee arthroplasty.

(2) To apply up-to-date statistical methods in analyzing the SVs and in determining RIs, which have not been used in the field of rehabilitation medicine in the past,

(3) To compare knee motor functions before and two-weeks after knee arthroplasty, which is of relevance in predicting a long-term success of the surgery. “

We think that this problem occurred due to an insufficient description of the role of physical therapists in the post-surgical rehabilitation of patients undergoing knee arthroplasty (KA). Therefore, we added the following description of how the peri-surgical rehabilitation is generally performed and how the knee function tests are being used by adding a new Figure 1.

“In the clinical management of KOA patients undergoing knee arthroplasty, physical therapists take an important role in providing peri-surgical rehabilitation in close collaboration with orthopedic surgeons. The scheme of rehabilitation according to a common clinical pathway for knee arthroplasty is as shown in Fig. 1. The functional status of patients is usually assessed by performing knee motor function tests of a safe variety, such as timed-up-and-go (TUG) test, maximum walking speed (MWS), knee muscle strength, and knee range of motion (ROM) [4].

These test results are often referred to in deciding a patient’s discharge date. To read the results objectively, it is essential to understand their sources of variation and to have a reliable reference interval (RI) for each parameter.”

Therefore, we adopted only the first objective and deleted other two because of their misleading nature and irrelevancy for this study.

The first objective is now re-written as follows at the end of the Introduction:

“On the basis of this background, we conducted a multicenter study to evaluate sources of variation of knee motor functions and to establish specific RIs for KOA patients undergoing KA. We aimed to use quantitative information for objective implementation of peri-surgical rehabilitation and for screening of patients who deviate from the normal course of recovery.”

As for objective (2) of applying up-to-date statistical methods, we now regard it not so important and thus we toned down this point and substantially reduced discussion on the past “normative value” studies.

For objective (3) of comparing knee functions before and after KA, we found the description was misleading because immediate post-surgical test results from our study cannot be used in predicting long-term patient outcome, although there are some studies that claimed the importance of immediate postsurgical knee functions in relation to long term outcome.

I think the authors want to say that previous reporting of RIs in predominantly healthy subjects are flawed statistically and offer methods as an alternative approach which is a valid purpose. Reporting RIs in healthy subjects using the alternative statistical technique might be the sole purpose of a separate paper.

Our response ⇒ As we just mentioned above, we found it irrelevant to cite past studies of knee motor functions based on healthy elderly subjects, regardless of the statistical methods used. Therefore, we deleted some of those description in the Introduction and the Discussion. Thus, we have no plans to write a companion paper to this one because the two populations (healthy vs. KOA) are too mismatched to make any worthy comparison.

They then suggest that the reporting of specific motor functions (TUG, max walking velocity, leg ROM, & specified lower limb strength) is relevant.

I am still to be convinced by the authors’ manuscript that they recognise that these measures are examples of proxies of physical functional constructs and that there exist other measures – so why these specific ones and not others, and why measured this way??

Again, this debate might provide a separate manuscript worthy of publication.

Our response ⇒ We understand that there are many other measures of knee motor functions, such as the 6-minute walking distance and a stair climb test. However, during the immediate post-surgical period that physical therapists are involved in, these tests are generally not performed for the sake of safety. Unfortunately, a second study targeting other physical function tests is beyond the scope of our routine clinical practice.

Regardless of this criticism, they go on to provide RI data for these measures which is in itself sensible and useful for discriminative use in clinical practice or research given the statistical flaws described in previous work.

Our response ⇒ We toned down the issue of statistical flaws in the past normative value studies in the Discussion because of the irrelevance of the issue based on the main objectives of this study.

They content that these data are useful in clinical Japanese practice as a means to determine if a patient is starting from a deranged functional profile pre-surgery, or is deviating from a normal recovery trajectory after surgery based on the functional measures selected.

But, this reader is not convinced the authors have justified that their sample represents a typical patient sample.

This is mainly because they openly admit they were limited in their design to determine whether their sample reached a pre-determined threshold of functional recovery longitudinally i.e. whether the longer-term outcome of their surgery was successful.

We simply do not know if this is the case, and therefore there is doubt their RIs represent a compelling acceptable range. This needs to be better acknowledged in my opinion.

Our response ⇒ We again apologize for having cited misleading papers in the Discussion, which reported the importance of short-term post-surgical motor functions in predicting the long-term recovery of patients. We admit that we cannot say anything about the long-term success of the surgery.

As for the implication of the RIs we derived, we had missed important descriptions on how we enrolled KOA patients in the Methods. We just included patients who experienced a normal recovery course and were discharged within 3 to 4 weeks after KA. Therefore, our RIs can be referred to in judging whether a given patient is in a normal recovery course for scheduling discharge. We also need to emphasize that the clinical pathway we adopted in this study is widely used in Japan, and thus the results of this study are relevant in the clinical practise of peri-surgical rehabilitation of KA patients in Japan.

Since we clarified the sources of variation of the test results, an additional clinical implication of this study is that it was made possible to interpret test results more appropriately in considerations of sex, age, and surgical mode.

In the revised manuscript, we added these points as a clinical implication of this study in the last section of the Discussion:

“In summary, the clinical implication of this study is as follows. As we only enrolled patients who followed a normal course of recovery, the quantitative information of RIs and their factors of variation obtained in this study will facilitate peri-surgical rehabilitation customized to patients according to sex, age, and surgical mode. Furthermore, the subgroup-specific RIs will be useful in detecting patients who deviate from the normal course of recovery.”

All in all, I think the authors have a made a decent attempt effort to incorporate and/or rebut the 3 reviewers’ comments. While the manuscript is improved, it still requires some revision.

It could benefit from breaking into clear sections or even be incorporated into two manuscripts in my opinion.

Our response ⇒ We hope we have clarified our objective and clinical implications of this study as described above in responding to your invaluable suggestions one by one. We also hope that it will not be necessary to write other related manuscripts, which seems not possible from the objective of this study.

Specific points and minor Edits (line numbers refer to tracked-changed manuscript version):

L82 onwards - Explain what the differences between the different surgical types in the introduction after L82.

Our response ⇒ We added the following description of the surgical modes:

“Either unicompartmental knee arthroplasty (UKA) or total knee arthroplasty (TKA) has been the therapeutic regimen of choice. The former is a less invasive technique that only replaces a single granular area and preserves the anterior and posterior cruciate ligaments [2,3]. Recently, as an alternative to conventional TKA (C-TKA), a minimally invasive TKA surgery (MIS-TKA) has become popular, which features a shorter skin incision than that with C-TKA.”

L43 -Change ‘as controls’ to ‘a control group of 120 healthy elderly volunteers’

L72 - Change ‘by Yoshimura et al.’ to ‘Yoshimura et al. (2011) reported in a large cohort study the prevalence of knee osteoarthritis (KOA) is as high as 42.6% in men and 62.4% in women over 40 years old across Japan.

L85-88 - Change ‘For proper management and smooth recovery through rehabilitation, it is important to monitor knee motor functions [4] such as timed-up-and-go (TUG) test, maximum walking speed (MWS), knee muscle strength, and knee range of motion (ROM).’ To ‘It is important to use objective tests such as timed-up-and-go (TUG) test, maximum walking speed (MWS), knee muscle strength, and knee range of motion (ROM) to monitoring performance post operatively [4].’

L113 - Change objective 1 to ‘to explore sources of variation (SVs) and determine their reference interval (RIs) of objective tests used to measure knee motor function within KOA patients undergoing knee arthroplasty’

L116 - Change objective 2 ‘To be the first study to explore statistical methods for analysing the SVs and determining the RIs.’

Our response ⇒We revised the wording as suggested.

L118 -Change objective 3 to ‘To compare knee motor functions before and two-weeks after knee arthroplasty as a predictor of long-term success of KA surgery.’

Our response ⇒We deleted the misleading objective 3 as explained above.

L124 - Change ‘A total of 583 KOA patients undergoing elective knee arthroplasty were recruited consecutively between July 2013 and February 2018 by use of harmonized study protocol from 13 institutions specialized in knee arthroplasty, which are scattered widely in western Japan.’ To ‘A total of 583 KOA patients undergoing elective knee arthroplasty were recruited from 13 institutions across Western Japan that specialized in knee arthroplasty between July 2013 and February 2018’.

Our response ⇒We corrected the description as suggested. Please note that in the revised manuscript, we stated that we originally recruited 623 KOA patients, but we excluded 41 subjects who did not follow a normal course of recovery as well as 38 subjects with missing test results. Therefore, after the above revised description, we added the following facts about the enrolled cases.

“After exclusion of 38 patients with missing motor function test results and 41 patients who had a delay in discharge, a total of 545 patients were regarded as eligible for the subsequent analyses.”

L140-149 – The objectives are clearer now. But similar to previous criticisms of a lack of a clear mapping of objectives to the data collected - I do not see why a group of healthy subjects was recruited with these objectives in mind – the justification was not made clear to me in the methods section either. I had to read ahead to L395 in the results for a first mention of how the healthy data was used and why.

Our response ⇒ As mentioned above, we admitted the irrelevance of knee motor functions of healthy subjects, and we just described past normative value studies briefly. Nevertheless, to show the needs of the RIs specific for KOA patients, we think it necessary to show the test results of age-matched healthy subjects in this study. Therefore, we hope the healthy subjects’ results can be retained in the Results to emphasize the importance of deriving “disease-specific” RIs.

L161 – Define DM in full if using; but surely peripheral neuropathy of any aetiology would be the exclusion criteria not only diabetes?

Our response ⇒ We revised the description of the excluded conditions as follows:

“1) presence of motor paralysis or other neurological dysfunctions such as stroke, lumbar disc herniation, spinal canal stenosis, peripheral neuropathy of diabetes mellitus unrelated to KOA,…”

L165 – “… enrolled were 545 subjects.” Suggest change to “…545 subjects were enrolled.”

L173 – Change “followings” to “following”

L211 – Change “… KOA patients …” to “… KOA patient …”

L214- Suggest change “… on a chair …” to “… on an armless chair with seat-height set between 40 and 40cm high …” and delete the next sentence.

L258-260 – I think the descriptors of the tibial and femoral segments need changing around.

L275 – “… in P value …” is better as “ … as a P value …”

L277 – pluralise “… coefficient …” I think

L317 – suggest change “… we applied nonparametric …” to “… we applied a nonparametric …”

L339 – “… Mann-Whiney…” should be “… Mann-Whitney …”

L340 – I think pluralise the word “test” when it appears in this sentence, and these analyses should be in the methods section.

Our response ⇒We thank you for these suggestions, which we applied to the manuscript.

L340-342 - Allusion to, or interpretations of, “small” or any statistical differences should be included in the discussion section – the results should merely report the results I think

Our response ⇒According to the suggestion, we rephrased the results as follows removing the interpretations: “Statistically significant differences were found for higher age and BMI, and less frequent habit of exercise observed among KOA patients.”

L408 – Table 3, there is no adjacent mention of Table 3 in the text here, I had to look to L424 in the next section to find it, Table 3 needs better placement in the text

L506 – suggest change “… proper …” to “ … optimal …” or similar

L580 – I am not sure you need “on the other hand ….” At the start of the sentence here

Our response ⇒ We thank you for these suggestions, which we accordingly applied to the manuscript.

Attachment

Submitted filename: Responses to Reviewers comments towards R2 20210215.docx

Decision Letter 2

Alison Rushton

4 Mar 2021

PONE-D-20-18248R2

Determination of reference intervals for knee motor functions specific to patients undergoing knee arthroplasty

PLOS ONE

Dear Dr. Ichihara,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

ACADEMIC EDITOR:

Thank you for your diligence in addressing the reviewers' comments that I hope you agree has contributed to a stronger manuscript. Please address the few remaining issues raised by the reviewers that are detailed below.

==============================

Please submit your revised manuscript by Apr 18 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Alison Rushton

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: 

Previous comments has been addressed and manuscript is in general sound.

Some minor changes, listed below, are still needed, but they do not affect the content substantially. Line numbers below refer to the text with highlighted changes.

Lines 355-363: The authors probably meant Fig 3, not 2, here.

Line 435: There should be Fig 4, not 3.

Reviewer #3: 

The authors have made an excellent attempt to consider all the reviewer’s comments and there is no doubt this is an improved manuscript. The manuscript provides RI data for the functional measures in a real set of patients which is sensible and of use to the community for discriminative use in clinical practice or research given the statistical flaws described in previous work. They contend that these data are useful in clinical Japanese practice as a means to determine if a patient is starting from a deranged functional profile pre-surgery, or is deviating from a normal recovery trajectory after surgery based on the functional measures selected.

Overall, I still find the manuscript difficult to read. Suggest to include in line 88 in the introduction what the sources of variation are/might be to prime the reader to the factors explored in the paper. I still think it could be made clearer and simpler with better mapping of the stated objectives to the results – e.g. crafting the manuscript so the objective of identifying factors associated with motor function parameters is clearly stated at the end of the introduction and in the methods, and a clear rationale why healthy subjects were included (again at the end of the intro and in the methods section from line 144), and some allusion to discussing the healthy data in the discussion section.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: Yes: Gareth D Jones

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Apr 14;16(4):e0249564. doi: 10.1371/journal.pone.0249564.r006

Author response to Decision Letter 2


14 Mar 2021

Reviewer #1:

Previous comments has been addressed and manuscript is in general sound.

Some minor changes, listed below, are still needed, but they do not affect the content substantially. Line numbers below refer to the text with highlighted changes.

Lines 355-363: The authors probably meant Fig 3, not 2, here.

Line 435: There should be Fig 4, not 3.

Our response → We thanks again for the review of our revised manuscript and the provision of the comment on the problem of incorrect numbering of Figures. We corrected them accordingly.

Reviewer #3:

The authors have made an excellent attempt to consider all the reviewer’s comments and there is no doubt this is an improved manuscript. The manuscript provides RI data for the functional measures in a real set of patients which is sensible and of use to the community for discriminative use in clinical practice or research given the statistical flaws described in previous work. They contend that these data are useful in clinical Japanese practice as a means to determine if a patient is starting from a deranged functional profile pre-surgery, or is deviating from a normal recovery trajectory after surgery based on the functional measures selected.

Our response → We are grateful for your time and efforts to review our manuscript again and the kind offering to us of additional comments to further improve the manuscript.

Overall, I still find the manuscript difficult to read. Suggest to include in line 88 in the introduction what the sources of variation are/might be to prime the reader to the factors explored in the paper.

Our response → According to your suggestion, we added the following explanation for the term of “sources of variation” at line 88: “To read the results objectively, it is essential to understand their sources of variation (factors that may alter test results without pathological conditions) and to have a reliable reference interval (RI) for each parameter.”

I still think it could be made clearer and simpler with better mapping of the stated objectives to the results – e.g. crafting the manuscript so the objective of identifying factors associated with motor function parameters is clearly stated at the end of the introduction and in the methods, and a clear rationale why healthy subjects were included (again at the end of the intro and in the methods section from line 144), and some allusion to discussing the healthy data in the discussion section. 

Our response → According to your advice, we also added the meaning of analyzing the sources of variation (SV) as follow near the end of the Introduction: “On the basis of this background, we conducted a multicenter study to evaluate sources of variation (SVs) of knee motor functions and to establish well-stratified RIs in consideration of the SVs for KOA patients undergoing knee arthroplasty.”.

Furthermore, the objective of stratifying the RIs in consideration of SVs had been mapped in the Results under the subheading of “Assessment of factors to derive RIs specific to subgroups” (just before the subheading of “Determination of RIs”) in describing the SV-based stratification of the RIs for the KOA patients.

Regarding the implication of recruiting healthy subjects and determining their RIs for the knee motor functions tests, we also added the following description at the very end of the Introduction to clarify the point according to your suggestion: “Age matched healthy subjects were also recruited to ascertain the need for disease -specific RIs. We aimed to use these quantitative information for objective implementation of peri-surgical rehabilitation and for screening of patients who deviate from the normal course of recovery.”

For this issue of our deliberate inclusion of age-matched healthy elderly, we also gave additional explanation of including them in the Methods under the sub-heading of “Healthy volunteers” as follows: “As described in the Introduction, we recruited 120 apparently healthy volunteers with comparable ages to make a contrast of their knee motor functions with those of KOA patients undergoing knee arthroplasty. They comprised 36 men and 84 women ……”

As for the need in the Discussion of additional description for the implication of deriving RIs from the healthy individuals, we had already described it as follows at the third paragraph of the Discussion: “In addition to these methodological issues of the past studies targeting healthy subjects, as demonstrated in Fig. 3 and 4, healthy ranges of knee motor function tests were quite different from those of KOA patients both before and after the surgery.…”

Attachment

Submitted filename: Responses to the Reviewers comments towards R3.docx

Decision Letter 3

Alison Rushton

22 Mar 2021

Determination of reference intervals for knee motor functions specific to patients undergoing knee arthroplasty

PONE-D-20-18248R3

Dear Dr. Ichihara,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Alison Rushton

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Thank you for your diligence in addressing the reviewers' comments at each stage. I hope that you agree that it has strengthened your manuscript.

Acceptance letter

Alison Rushton

5 Apr 2021

PONE-D-20-18248R3

Determination of reference intervals for knee motor functions specific to patients undergoing knee arthroplasty

Dear Dr. Ichihara:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Alison Rushton

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Determination of reference intervals by parametric method.

    (PDF)

    S1 File

    (XLSX)

    Attachment

    Submitted filename: Review of Study_PLoSOne_D-20-18248_for upload.docx

    Attachment

    Submitted filename: Responses to Reviewers comments.pdf

    Attachment

    Submitted filename: Responses to Reviewers comments towards R2 20210215.docx

    Attachment

    Submitted filename: Responses to the Reviewers comments towards R3.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES