Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2024 Mar 7;19(3):e0299486. doi: 10.1371/journal.pone.0299486

Inter- and intra-observer reliability and agreement of O2Pulse inflection during cardiopulmonary exercise testing: A comparison of subjective and novel objective methodology

Thomas Nickolay 1,2,*, Gordon McGregor 3,4,5, Richard Powell 3,4, Brian Begg 6,7, Stefan Birkett 8, Simon Nichols 9, Stuart Ennis 3,5, Prithwish Banerjee 4,5,10, Rob Shave 11, James Metcalfe 2, Angela Hoye 1, Lee Ingle 2
Editor: Zulkarnain Jaafar12
PMCID: PMC10919635  PMID: 38452129

Abstract

Cardiopulmonary exercise testing (CPET) is the ‘gold standard’ method for evaluating functional capacity, with oxygen pulse (O2Pulse) inflections serving as a potential indicator of myocardial ischaemia. However, the reliability and agreement of identifying these inflections have not been thoroughly investigated. This study aimed to assess the inter- and intra-observer reliability and agreement of a subjective quantification method for identifying O2Pulse inflections during CPET, and to propose a more robust and objective novel algorithm as an alternative methodology. A retrospective analysis was conducted using baseline data from the HIIT or MISS UK trial. The O2Pulse curves were visually inspected by two independent examiners, and compared against an objective algorithm. Fleiss’ Kappa was used to determine the reliability of agreement between the three groups of observations. The results showed almost perfect agreement between the algorithm and both examiners, with a Fleiss’ Kappa statistic of 0.89. The algorithm also demonstrated excellent inter-rater reliability (ICC) when compared to both examiners (0.92–0.98). However, a significant level (P ≤0.05) of systematic bias was observed in Bland-Altman analysis for comparisons involving the novice examiner. In conclusion, this study provides evidence for the reliability of both subjective and novel objective methods for identifying inflections in O2Pulse during CPET. These findings suggest that further research into the clinical significance of O2Pulse inflections is warranted, and that the adoption of a novel objective means of quantification may be preferable to ensure equality of outcome for patients.

Introduction

Cardiopulmonary exercise testing (CPET) allows for the non-invasive, objective quantification of cardiopulmonary fitness, and is thus held as the ‘gold standard’ methodology for evaluating functional capacity [1, 2]. In contrast to more traditional assessments, such as ECG stress testing and the 6-minute walk test, CPET makes it possible to determine the potential pathophysiological mechanisms underlying exercise intolerance [24].

The utility of CPET as a diagnostic and prognostic tool in the evaluation of patients with coronary artery disease (CAD) and heart failure has received some attention [15]. In particular, an early plateau or inflection in the normal linear progression of oxygen pulse (O2Pulse) and oxygen consumption (VO2) despite an increasing work-rate (WR) are suggested to be indicative of inducible and reversable ischaemia [2, 4, 69]. In principle, O2Pulse reflects left ventricular stroke volume (SV) (and arteriovenous oxygen difference) [1, 4]. Consequently, a plateau or inflection in O2Pulse, despite increasing WR suggests a pathological impairment of stroke volume, possibly caused by myocardial ischaemia [1, 2, 4].

To the best of our knowledge the reliability and agreement of O2Pulse inflections have not been previously investigated. However, we have shown in a healthy cohort, the minimal detectable change (MDC) for 15-second time-averaged and filtered O2Pulse, measured between 50 and 100% of peak work rate is 2.2 mL.beat, and 1.6 mL.beat respectively.

If this level of agreement remains consistent for O2Pulse at the point of inflection, and this morphology represents stable pathophysiological limitations in CAD patients, it may be very useful to clinicians. For example, it could provide a marker with which to track the progression and severity of dysfunction, without the need for repeated exposure to radiation or invasive procedures. Moreover, in rehabilitation settings it may provide a threshold value from which personalised exercise prescriptions could be developed.

However, to date the literature surrounding inflections in O2Pulse typically classifies them categorically, as ’normal’ or ’abnormal’. This system of classification does not quantify the position of inflection, for example the work rate or heart rate at which O2Pulse deviates from normality [1, 1015]. Categorisation of O2Pulse morphology is usually performed in one of two ways, which we refer to as ‘visual categorisation’, and secondly, ‘categorisation by regression’. In visual categorisation, one or more observers scrutinise the O2Pulse curve to identify where the curve begins to deviate from a linear increase, usually referred to as a plateau or inflection. This point can sometimes be the sole focus of the investigation. However, in other instances, observers may be required to further categorise the curve as ’normal’, ’probably normal’, ’probably abnormal’, or ’abnormal’, depending on its characteristics [10]. Alternatively, ‘categorisation by regression’ involves a quantified mathematical approach. Investigators select a specific point along the curve, such as two minutes before the cessation of exercise, and calculate the slope of the curve (slope A) from exercise beginning to this point using linear regression. This slope is then compared to the regression slope of the curve for the final two minutes (slope B) to quantify proportional change [16]. Based on this comparison, the curve may be further categorised as ‘normal augmentation’, ‘flat throughout’, plateau in late exercise’, and ‘decline in late exercise’ (inflection) [16].

The inter-rater agreement when categorising O2Pulse curves as ’normal’, ’probably normal’, ’probably abnormal’, ’ definitely abnormal’ has been reported by De Lorenzo and colleagues [10] to be κC = 0.65 (95% CI = 0.39–0.66). Efforts have been made by Chuang et al [15] to remove the subjectivity from categorisation by comparing an algorithmic approach to the consensus of two examiners. The resulting Kappa values were κC = 0.86 and 0.69 respectively for the conditions (normal) plateau and decrease.

To date there appears to have been no effort made to subjectively quantify the position of O2Pulse inflections. This is perhaps due to their identification being influenced by a multitude of factors, such as the experience and opinion of the individual interpreting the data, the method of data processing (time averaged versus data point averaging), and data presentation (axis size and aspect ratio). However, this form of data interpretation is not without precedent, the first ventilatory threshold (VT1) during CPET is often identified in much the same way, through the modified V-slope method [17]. Harwood et al. [18] investigated the agreement of CPET parameters in patients with abdominal aortic aneurysms, utilising modified V-slope method to identify VT1. The intraclass correlation coefficient (ICC) (two-way mixed) was used to measure reliability. For intra-rater reliability, the ICC was 0.834 (95% CI 0.215,0.975; P = 0.010) on the motorised treadmill and r = 0.959 (95% CI 0.741,0.994; P = 0.000) on the cycle ergometer. For inter-rater reliability, the ICC was r = 0.983 (95% CI 0.785,0.999; P = 0.002) on the motorised treadmill and r = 0.905 (95% CI 0.508,0.986; P = 0.003) on the cycle ergometer.

O2Pulse inflections may have the potential to be used to prescribe exercise intensity and monitor progression in much the same way VT1 is currently used. However, we must first establish the inter- and intra-rater variability of identification, and potentially provide a robust objective means of identification.

To the best of our knowledge there are no published data relating to the inter-, or intra- observer reliability and agreement of the subjective quantification of O2Pulse inflections. Therefore, the primary aim of this research is to determine the inter- and intra-observer reliability and agreement of the subjective quantification method. The secondary aim is to establish a suitable objective alternative methodology that provides zero intra- and inter- observer variability.

Methods

This was a retrospective baseline analysis of the HIIT or MISS UK trial [18]. The HIIT or MISS trial was a multicentre randomised controlled trial recruiting (1st September 2016 to 13th March 2020) CAD patients referred for exercise-based cardiac rehabilitation (CR) in the UK. Ethical approval for the protocol was provided by the NHS Health Research Authority, East Midlands–Leicester South Research Ethics Committee (16/EM/0079), with patients providing written informed consent prior to enrollment. Detailed methodology of the trial procedures are available elsewhere [19], in short, patients performed a baseline CPET on cycle ergometer following a standard ramp incremental protocol [20]. Fully anonymised data were accessed for the purpose of this analysis between 29th January 2021 and 28th September 2023. Raw ventilatory gas exchange data were exported as 15-second averaged.csv files and used to generate O2Pulse curves (x-axis = work rate; y-axis = O2Pulse).

Curves were then visually inspected by two independent examiners, each blinded to the interpretation of the other. Both examiners were clinical exercise physiologists with experience interpreting CPET. However, one examiner had substantially more experience with O2Pulse morphology (>6 years) and inflections. This examiner is subsequently referred to as ‘experienced’ whilst the other is termed ‘novice’ (<1 year). Each examiner viewed all available O2Pulse curves, categorising each as ‘yes’, to indicate the perceived presence of inflection or plateau, or ‘no’ to indicate the normal linear progression of O2Pulse. For each curve categorised as ‘yes’ the examiner would then quantify the threshold for inflection, identifying the exact point in the plot they believed represented a departure from normality. The ‘experienced’ examiner revisited the ‘yes’ curves at a later date (7–14 days) to re-quantify the inflection threshold. All subjective observations were then compared against an objective algorithm to compare categorisation and quantified threshold placement.

The algorithm was developed around the principle that a linear, or curvilinear increase in O2Pulse was the expected normal response. The algorithm functions on 15-second time averaged data. To further reduce data noise without having to identify and correct for individual out-lying data points, we applied a 9-point moving average filter. This process involved replacing each data point with the average of the 9 data points centred around it, which included the 4 preceding points, the point itself, and the 4 subsequent points. From this processed data the algorithm simply identified the first occurrence of the series peak value using conditional formatting. If this peak value occurred ≥ 6 data points prior to the end of the test, the row was highlighted as a departure from normal linear or curvilinear increase, and the point was plotted on the embedded figure for visual inspection (Fig 1). Using the algorithm template, this requires the 15-second averaged work rate and O2Pulse data to be copied and pasted into columns A and B.

Fig 1. Example of the algorithm function as an excel template.

Fig 1

The additional criteria specified for accepting inflections in O2Pulse, either via subjective observer or algorithm were as follows:

The point of inflection should occur ≥ 6 data points (90 seconds) prior to the end of the test.

The point of inflection in O2Pulse should coincide with a reduction in the ΔVO2/ΔWR slope of ≥ 10%.

The patient must not have achieved ≥ 90% predicted VO2peak.

These criteria are derived from the original findings of Belardinelli and colleagues [7, 8].

Statistical analysis

Statistical analysis was performed in RStudio version 4.2.2 using the R programming language and packages “readxl”, “irr”, “epiR”, and “BlandAltmanLeh” (Integrated Development for R. PBC, Boston, MA, USA). The categorisations of each observer (coded as "Yes" or "No" to indicate the perceived presence or absence of an inflection or plateau) were compared against that of the objective algorithm to establish whether the algorithm could adequately categorise inflections. Fleiss’ Kappa (κF) was used to determine the reliability of agreement between the three groups of observations. Kappa statistics were interpreted in accordance with the suggestions of Landis and Koch [21] with values <0.00, 0.00–0.20, 0.21–0.40. 0.41–0.60, 0.61–0.80 and 0.81–1.00 indicating poor, slight, fair, moderate, substantial and almost perfect respectively. In order to provide 95% confidence intervals around the Kappa value we performed 1000 bootstrap resamples with replacement from the original dataset. The algorithm was also compared against the consensus of both subjective examiners to determine its sensitivity and specificity as well as both positive and negative predictive values.

If all three observations were in agreement that an inflection had occurred, the threshold for inflection, expressed as heart rate and work rate were visually compared with Bland-Altman plots. In these instances, we compared experienced to novice, experienced to experienced (time), experienced to algorithm, and novice to algorithm. The intra-rater reliability was compared with a two-way random effects (2,1) ICC for absolute agreement and reported with standard error of measure (SEM) and minimal detectable change (MDC) values. The inter-rater reliability were compared using two-way mixed effects (3,1) model ICCs for absolute agreement [22]. ICC outputs were interpreted based upon the recommendations of Koo and Li [22] with values <0.5, 0.5–0.75, 0.75–0.9 and >0.9 indicating poor, moderate, good and excellent reliability respectively. Statistical significance was accepted P ≤ 0.05.

Results

In total 272 baseline CPET data in patients with CAD were analysed. The results of the analyses are presented in two parts: first, the inter-observer agreement of the subjective categorisation method versus the objective algorithm, and second, the evaluation of the proposed objective algorithms for quantifying thresholds in O2Pulse.

Inter-observer agreement

The computed Fleiss’ Kappa statistic for all raters was κF = 0.89 with a bootstrapped 95% confidence interval of 0.83–0.93. The corresponding z-score was 25.5 with a P < 0.0001. At least two raters were in agreement across all 272 files, with all three raters in agreement on 260 occasions (95.6%) The comparison of each interpreters’ analysis is summarised in Table 1.

Table 1. Comparison of sensitivity, specificity, positive predictive value, and negative predictive value across different rater comparisons.

Comparison Sensitivity (95% CI) Specificity (95% CI) Positive Predictive Value (95% CI) Negative Predictive Value (95% CI)
Algorithm Vs. Experienced 0.93 (0.82–0.99) 0.98 (0.96–1.00) 0.91 (0.79–0.98) 0.99 (0.96–1.00)
Algorithm Vs. Novice 0.89 (0.76–0.96) 0.99 (0.96–1.00) 0.93 (0.81–0.99) 0.98 (0.95–0.99)
Experienced Vs. Novice 0.93 (0.81–0.99) 0.97 (0.94–0.99) 0.87 (0.74–0.95) 0.99 (0.96–1.00)
Algorithm Vs. Consensus 0.82 (0.68–0.92) 0.99 (0.96–1.00) 0.92 (0.80–0.98) 0.97 (0.93–0.98)

95% CI = 95% Confidence Interval

Evaluation of the objective algorithm

In instances where all three observations agreed that an inflection had occurred (n = 37; 13.6%), the threshold for inflection, expressed as heart rate and work rate, were compared using Bland Altman plots (Figs 2 and 3). Values derived from or associated with Bland-Altman analysis, along with ICC values, are reported in Table 2.

Fig 2. Bland-Altman plots comparing agreement across subjective and objective inflection identification for heart rate.

Fig 2

Fig 3. Bland-Altman plots comparing agreement across subjective and objective inflection identification for work rate.

Fig 3

Table 2. Reliability and agreement analysis for inflection thresholds in heart rate and work rate.

Work Rate
Statistic Algorithm Vs. Experienced Algorithm Vs. Novice Experienced Vs. Novice Experienced Vs. Experienced (Time)
ICC 0.97 (0.95–0.99) 0.97 (0.86–0.99) 0.95 (0.87–0.98) 0.95 (0.90–0.97)
Lower LOA (95% CI) -16.57 (-21.42 - -11.73) -19.06 (-23.05 - -15.07) -13.74 (-19.37 - -8.11) -15.49 (-20.07 - -10.92)
Upper LOA (95% CI) 16.30 (11.46–21.15) 8.03 (4.04–12.02) 24.50 (18.86–30.13) 15.55 (10.98–20.13)
Mean Bias (95% CI) -0.14 (-2.93–2.66) -5.51 (-7.82 - -3.21) 5.38 (2.13–8.63) 0.03 (-2.61–2.69)
Systematic bias P = 0.92 P < 0.0001**** P = 0.002** P = 0.98
Proportional bias P = 0.11 P = 0.002** P = 0.48 P = 0.15
Heart Rate
Statistic Algorithm Vs. Experienced Algorithm Vs. Novice Experienced Vs. Novice Experienced Vs. Experienced (Time)
ICC 0.98 (0.96–0.99) 0.92 (0.82–0.96) 0.91 (0.81–0.95) 0.95 (0.91–0.98)
Lower LOA (95% CI) -10.68 (-13.68 - -7.67) -21.10 (-26.13 - -16.07) -14.77 (-20.17 - -9.38) -10.34 (-13.25 - -7.43)
Upper LOA (95% CI) 9.70 (6.70–12.71) 13.04 (8.01–18.07) 21.85 (16.46–27.25) 9.42 (6.51–12.33)
Mean Bias (95% CI) -0.49 (-2.22–1.25) -4.03 (-6.93 - -1.12) 3.54 (0.43–6.66) 0.46 (-2.14–1.22)
Systematic bias P = 0.57 P = 0.008*** P = 0.027* P = 0.58
Proportional bias P = 0.41 P = 0.83 P = 0.78 P = 0.77

ICC = Intraclass Correlation Coefficient; LOA = limits of agreement; CI = confidence interval

* = P ≤ 0.05

** = P ≤ 0.01

*** = P ≤ 0.001

**** = P ≤ 0.0001

Excellent reliability was recorded for all ICC, with the highest heart rate values (0.97) occurring in both the experienced and novice versus algorithm comparisons. The highest work rate ICC occurred in the algorithm versus experienced comparison (0.98). The intra-rater reliability for work rate (0.95) was accompanied by SEM (%SEM) and MDC (%MDC) values of 11.2 (11.1%) and 15.53 (15.4%) respectively. Whilst the intra-rater heart rate (0.95) SEM, and MDC values or 7.13 (6.8%) and 9.88 (9.4%).

Although ICC for the novice comparisons involving the novice examiner were excellent, the 95% CI for all of these readings were consistently broader than those involving the experienced examiner. Furthermore, the mean bias when comparing the novice against the algorithm and experienced examiner was consistently different from zero. Indeed, in all comparisons involving the novice examiner there was a statistically significant level of systematic bias (Figs 2 and 3). This systematic bias was compounded by significant proportional bias for comparisons versus the algorithm for work rate representing a statistically significant degree of both systematic and proportional bias. Indeed, all other comparison involving the novice examiner yielded statistically significant systematic bias.

Discussion

The primary aim of the study was to determine the inter- and intra-observer reliability and agreement of a subjective quantification method for identifying inflections in O2Pulse during cardiopulmonary exercise testing (CPET). Secondly, we sought to establish a suitable objective algorithm as an alternative methodology.

Before attempting to quantify the threshold of inflection using the proposed algorithm it was necessary to determine whether it could differentiate between normal and abnormal data. The results of the present study indicate that the algorithm can differentiate between data, providing excellent agreement when compared with both experienced and novice examiners. Previous research by de Lorenzo et al and Chuang et al have reported levels of inter-rater reliability of between κC = 0.65 and κC = 0.69 [10, 15] when categorising O2Pulse files, the value reported in the present study, however, are substantially higher at κF = 0.89 (0.83–0.93). There may be several reasons for this disparity. Firstly, the aforementioned studies applied Cohens’ Kappa, as they were interested in the agreement of two examiners, whilst we applied Fleiss’ Kappa to account for three ‘examiners’. Although the additional ‘examiner’ in this analysis introduces the possibility of greater variability, examiners were only required to score across two categories, that is "Yes" or "No" to indicate the perceived presence or absence of an inflection or plateau. In contrast, the study by de Lorenzo and colleagues [10] required two experienced examiners to place files into one of four categories (’normal’, ’probably normal’, ’probably abnormal’, ’ definitely abnormal’), resulting in double the variation in choice afforded in the present study. Similarly, Chuang and co-workers [14] placed an algorithm against the consensus of two human examiners, providing three choices for categorisation (‘increasing’, ‘plateau’, and ‘decreasing’).

The intra- and inter-observer reliability for subjective threshold quantification was assessed by two formats of ICC (2,1; 3,1). The analysis showed excellent reliability in both the intra- (r = 0.95) and inter-rater (r = 0.91–0.95) comparisons, irrespective of the unit of measurement (watts; bpm). As this is a novel methodology, there is no prior data with which to make comparison. However, the technique itself is reminiscent of the modified V-slope method [17], and thus comparisons with its reliability are perhaps justified. In this context, the subjective threshold quantification performs comparably well, as the modified V-slope reported intra-rater reliability of r = 0.83 when measured using treadmill, and r = 0.96 on cycle ergometry. Similarly, the inter-rater reliability is reported to be r = 0.98 (treadmill) and r = 0.91 (cycle ergometer). However, the mean bias for inter-rater comparisons of both work rate and heart rate was consistently different, as is evident from the significant levels of systematic bias (work rate P = 0.002; heart rate P = 0.027). This presents a substantial hurdle if inflection thresholds for O2Pulse are to be used in a similar way to ventilatory thresholds, for example, to quantify health status and prescribe exercise. For example, the same participant, given the same CPET could be prescribed wholly different exercise intensities by two investigators. This difference appears to be mitigated somewhat if the same examiner were to receive the same CPET, as is reflected by the MDC (15 watts; 10 bpm) and consistent mean bias values (0.03 watts; 0.46 bpm) recorded for the experienced examiner.

The normal progression of O2Pulse during CPET is linear or slightly curvilinear in nature, as stroke volume increase to peak exercise [23]. In such cases, the filtered and smoothed O2Pulse should peak in the latter stages of incremental exercise testing, especially when ≤ 90% predicted VO2peak has been achieved. Based on these logical assumptions the proposed algorithm identifies when peak values occur ≥ 90 seconds prior to the end of exercise and labels them as points of inflection. The proposed algorithm would inherently have zero intra rater reliability and zero MDC, assuming it were executed as intended. The inter-rater reliability of the algorithm when compared to both experienced and novice examiners was excellent (r = 0.92–0.98). However, as was seen with the experienced and novice examiner comparison there was a significant level of systematic bias when the algorithm and novice operator was compared. As bias was not present in the experienced versus algorithm comparison, it is perhaps more suggestive of a difference in interpretation that stems from level of experience. Furthermore, the limits of agreement and mean bias for algorithm and experienced examiner comparisons were almost identical to those observed in intra-examiner comparisons. Thus, the algorithm could theoretically replace the experienced examiner and eradicate intra-observer variability.

In real-world applications, the experience of clinicians and rehabilitators is wide ranging, thus, the adoption of an objective means of quantification is likely preferable to ensure equality of outcome for patients. For example, guidelines presented by the American College of Sports Medicine (ACSM) [24] suggest exercise intensities for CR to be below the ischaemic threshold (<10 beats), or a threshold that elicits the onset of angina symptoms. When following this guidance it would be preferable to know that, given the same baseline CPET, patients would be receive the same intensity recommendations irrespective of the site they test at or the examiner who reviews their results.

Limitations

The study is limited by both the small sample of examiners and the accompanying heterogeneous level of experience. Moreover, as there was no invasive ischaemic assessment, inflections in O2Pulse are not guaranteed to align with the onset of myocardial ischaemia. There are two avenues of enquiry for future research to pursue, firstly the algorithm could be used in conjunction with myocardial scintigraphy in an effort to corroborate the ischaemic threshold. Secondly, a larger sample of examiners with diverse levels of training and experience could be used to further establish the agreement of subjective threshold quantification and algorithm performance.

In conclusion, this study provides evidence for the reliability of both subjective and novel objective methods for identifying inflections in O2Pulse during CPET. These findings have important implications for the use of CPET in clinical populations, and suggest that further research into the clinical significance of O2Pulse inflections is warranted.

Data Availability

All Master files are available from the Open Science Framework database https://osf.io/d4pv3/.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.de Assumpção C, Prado DML, Camila P. Dourado Jordão, Luciana O.C, Vieira Marcelo L.C., Montenegro Carla G. de S.P., et al. Cardiopulmonary exercise test in patients with refractory angina: functional and ischemic evaluation. Clinics. 2022;77: 100003–100003. doi: 10.1016/j.clinsp.2021.100003 [DOI] [PubMed] [Google Scholar]
  • 2.Ganesananthan S, Christopher A. Foley Rajkumar, Michael, Thompson D, Nowbar Alexandra N, Seligman H, et al. Cardiopulmonary exercise testing and efficacy of percutaneous coronary intervention: a substudy of the ORBITA trial. Eur Heart J. 2022. doi: 10.1093/eurheartj/ehac260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Liu Wanjun, Liu Xiaolei, Liu Tao, Xie Yang, He Xingwei, Zuo Houjuan, et al. The Value of Cardiopulmonary Exercise Testing in Predicting the Severity of Coronary Artery Disease. J Clin Med. 2022;11: 4170–4170. doi: 10.3390/jcm11144170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.de Almeida Vitor Rossi, di Paschoale Ostolin Thatiane Lopes Valentim, Gonze Bárbara de Barros, de Almeida Flávio Rossi, Romiti Marcello, Arantes Rodolfo Leite, et al. Early flattening of the oxygen pulse during the cardiopulmonary exercise test in asymptomatic adults and its association with cardiovascular risk factors. Int J Cardiol. 2022. doi: 10.1016/j.ijcard.2022.08.011 [DOI] [PubMed] [Google Scholar]
  • 5.Ingle L. Prognostic value and diagnostic potential of cardiopulmonary exercise testing in patients with chronic heart failure. Eur J Heart Fail. 2008;10: 112–118. doi: 10.1016/j.ejheart.2007.12.011 [DOI] [PubMed] [Google Scholar]
  • 6.Chaudhry S, Arena R, Wasserman K, Wasserman Karlman, Hansen JE, Lewis GD, et al. Exercise-Induced Myocardial Ischemia Detected by Cardiopulmonary Exercise Testing. Am J Cardiol. 2009;103: 615–619. doi: 10.1016/j.amjcard.2008.10.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Chaudhry S, Arena R, Hansen JE, Lewis GD, Myers J, Myers J, et al. The Utility of Cardiopulmonary Exercise Testing to Detect and Track Early-Stage Ischemic Heart Disease. 2010;85: 928–932. doi: 10.4065/mcp.2010.0183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Belardinelli R, Lacalaprice F, Carle F, Minnucci A, Cianci G, Perna G, et al. Exercise-induced myocardial ischaemia detected by cardiopulmonary exercise testing. Eur Heart J. 2003;24: 1304–1313. doi: 10.1016/s0195-668x(03)00210-0 [DOI] [PubMed] [Google Scholar]
  • 9.Belardinelli R, Lacalaprice F, Tiano L, Muçai A, Perna GP. Cardiopulmonary exercise testing is more accurate than ECG-stress testing in diagnosing myocardial ischemia in subjects with chest pain. Int J Cardiol. 2014;174: 337–342. doi: 10.1016/j.ijcard.2014.04.102 [DOI] [PubMed] [Google Scholar]
  • 10.De Lorenzo A, Da Silva C. L., Da Silva CL, Souza FCC, De Souza Leao Lima R. Value of the oxygen pulse curve for the diagnosis of coronary artery disease. Physiol Res. 2018;67: 679–686. doi: 10.33549/physiolres.933788 [DOI] [PubMed] [Google Scholar]
  • 11.De Lorenzo A, De Lorenzo Andrea, da Silva CL, de Castro e Souza FC, Serra S, Marino P, et al. Clinical, scintigraphic, and angiographic predictors of oxygen pulse abnormality in patients undergoing cardiopulmonary exercise testing. Clin Cardiol. 2017;40: 914–918. doi: 10.1002/clc.22747 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Klainman E, Fink G, Lebzelter J, Krelbaumm T, Kramer MR. The Relationship Between Left Ventricular Function Assessed by Multigated Radionuclide Test and Cardiopulmonary Exercise Test in Patients With Ischemic Heart Disease. Chest. 2002;121: 841–845. doi: 10.1378/chest.121.3.841 [DOI] [PubMed] [Google Scholar]
  • 13.Munhoz EC, Hollanda R, Vargas JP, Silveira CW, Lemos AL, Hollanda RMK, et al. Flattening of oxygen pulse during exercise may detect extensive myocardial ischemia. Med Sci Sports Exerc. 2007;39: 1221–1226. doi: 10.1249/mss.0b013e3180601136 [DOI] [PubMed] [Google Scholar]
  • 14.Ming Lung Chuang, Chuang M-L, Lin I-F, I.-Feng Lin, I.-Feng Lin, Huang Shih Feng, et al. Patterns of Oxygen Pulse Curve in Response to Incremental Exercise in Patients with Chronic Obstructive Pulmonary Disease—An Observational Study. Sci Rep. 2017;7: 10929–10929. doi: 10.1038/s41598-017-11189-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Chuang Ming-Lung, Tsai Chin-Feng, Ueng Kwo-Chang, Weng Jui-Hung, Tsai Ming-Fong, Lo Chien-Hsien, et al. The Impact of Oxygen Pulse and Its Curve Patterns on Male Patients with Heart Failure, Chronic Obstructive Pulmonary Disease, and Healthy Controls—Ejection Fractions, Related Factors and Outcomes. J Pers Med. 2022;12: 703–703. doi: 10.3390/jpm12050703 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Petek BJ, Churchill TW, Guseh JS, Loomer G, Gustus SK, Lewis GD, et al. Utility of the oxygen pulse in the diagnosis of obstructive coronary artery disease in physically fit patients. Physiol Rep. 2021;9. doi: 10.14814/phy2.15105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sue DY, Wasserman K, Moricca RB, Casaburi R. Metabolic Acidosis during Exercise in Patients with Chronic Obstructive Pulmonary Disease. Chest. 1988;94: 931–938. doi: 10.1378/chest.94.5.931 [DOI] [PubMed] [Google Scholar]
  • 18.Harwood AE, Pymer S, Hitchman L, Totty J, Wallace T, Smith GE, et al. The Intrarater and Interrater Reliability of Measures Derived from Cardiopulmonary Exercise Testing in Patients with Abdominal Aortic Aneurysms. Ann Vasc Surg. 2019;56: 175–182. doi: 10.1016/j.avsg.2018.08.087 [DOI] [PubMed] [Google Scholar]
  • 19.McGregor G, Powell R, Begg B, Birkett ST, Nichols S, Ennis S, et al. High-intensity interval training in cardiac rehabilitation: a multi-centre randomized controlled trial. Eur J Prev Cardiol. 2023; zwad039. doi: 10.1093/eurjpc/zwad039 [DOI] [PubMed] [Google Scholar]
  • 20.McGregor G, Nichols S, Hamborg T, Bryning L, Tudor-Edwards R, Markland D, et al. High-intensity interval training versus moderate-intensity steady-state training in UK cardiac rehabilitation programmes (HIIT or MISS UK): study protocol for a multicentre randomised controlled trial and economic evaluation. BMJ Open. 2016;6: e012843. doi: 10.1136/bmjopen-2016-012843 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Landis JR, Koch GG. The Measurement of Observer Agreement for Categorical Data. Biometrics. 1977;33: 159–174. doi: 10.2307/2529310 [DOI] [PubMed] [Google Scholar]
  • 22.Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15: 155–163. doi: 10.1016/j.jcm.2016.02.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sietsema KE, Stringer WW, Sue DY, Ward S. Wasserman & Whipp’s: Principles of Exercise Testing and Interpretation: Including Pathophysiology and Clinical Applications. Lippincott Williams & Wilkins; 2020. [Google Scholar]
  • 24.American College of Sports Medicine (ACSM). ACSM’s Guidelines for Exercise Testing and Prescription. Lippincott Williams & Wilkins; 2020. [Google Scholar]

Decision Letter 0

Zulkarnain Jaafar

2 Feb 2024

PONE-D-23-36999Inter- and Intra-Observer Reliability and Agreement of O2Pulse Inflection during Cardiopulmonary Exercise Testing: A Comparison of Subjective and Novel Objective MethodologyPLOS ONE

Dear Dr. Nickolay,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

ACADEMIC EDITOR: Dear Author,Please attend to the comments provided by the reviewer/s and make the necessary changes. The decision of this manuscript is justified based on PLOS ONE’s publication criteria and not on its novelty or perceived impact.

==============================

Please submit your revised manuscript by Mar 18 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Zulkarnain Jaafar

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please include your full ethics statement in the ‘Methods’ section of your manuscript file. In your statement, please include the full name of the IRB or ethics committee who approved or waived your study, as well as whether or not you obtained informed written or verbal consent. If consent was waived for your study, please include this information in your statement as well.

3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Generally, this article is clear, well-written and concise. The introduction is relevant and theory based. The authors provided enough and sufficient information about the previous studies' results for readers to understand the rationale and aim of this study. The methods are generally appropriate. Overall, the results are clear and compelling. Moreover, this is a high quality manuscript that has implications for the theoretical basis, diagnosis and identification of inflections in O2Pulse during CPET

Reviewer #2: In my opinion, the article is complete and does not need corrections. This article can be one of the good articles of the magazine.

The only suggestion I have is that they prepare a schematic figure for the methodology..

Reviewer #3: In the present study, the authors investigated the inter-and intra-observer reliability and agreement of O2 pulse inflection and compared subjective measure of O2 pulse inflection with a novel objective method. The study is interesting in nature and the manuscript is well organized. I put some comments on the manuscripts for authors’ consideration.

1. Line 86-87; according to the Fick principle, O2 pulse depends on both SV and Ca-vO2.

2. Line 151-153; Did the authors used any criteria for including the HIIT or MISS trial data? I recommend the author to provide more information in this regard. For instance, why just the tests performed on cycle ergometer were included?

3. Line 164; I recommend the author to provide clear definition or specific criteria for defining “experienced” and “novice” examiner in this study.

4. Line 169; What was the exact time between the first and second observation of the experienced examiner?

5. From the methodological point of view, despite the fact that it was mentioned in the limitation section, when the main goal of this study was to establish a novel objective method by comparing it with a common subjective method, using a novice examiner and basing the conclusion on that might raise some concerns. Please elaborate more on that.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Ehsan Amiri

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Mar 7;19(3):e0299486. doi: 10.1371/journal.pone.0299486.r002

Author response to Decision Letter 0


8 Feb 2024

Response to Reviewers

“Reviewer #1: Generally, this article is clear, well-written and concise. The introduction is relevant and theory based. The authors provided enough and sufficient information about the previous studies' results for readers to understand the rationale and aim of this study. The methods are generally appropriate. Overall, the results are clear and compelling. Moreover, this is a high quality manuscript that has implications for the theoretical basis, diagnosis and identification of inflections in O2Pulse during CPET”

Thank you for your comments.

“Reviewer #2: In my opinion, the article is complete and does not need corrections. This article can be one of the good articles of the magazine.

The only suggestion I have is that they prepare a schematic figure for the methodology..”

Thank you for the comments, we are unclear what was meant by a ‘schematic figure”, if the reviewer was indicating a flowchart indicating either the research design or recruitment through the study we would disagree given the simple nature of the study design and relatively small sample size. Each step is covered in detail in the methods and the manuscript already contains three figures and two tables.

“Reviewer #3: In the present study, the authors investigated the inter-and intra-observer reliability and agreement of O2 pulse inflection and compared subjective measure of O2 pulse inflection with a novel objective method. The study is interesting in nature and the manuscript is well organized. I put some comments on the manuscripts for authors’ consideration.

1. Line 86-87; according to the Fick principle, O2 pulse depends on both SV and Ca-vO2.

Thank you for the comments, this terminology is widely accepted in the field and appears in multiple sources cited throughout the manuscript. We have altered the sentence to read – “In principle, O2Pulse reflects left ventricular stroke volume (SV) (and arteriovenous oxygen difference) (1,4).”

2. Line 151-153; Did the authors used any criteria for including the HIIT or MISS trial data? I recommend the author to provide more information in this regard. For instance, why just the tests performed on cycle ergometer were included?

All available data were included, only those missing data relating to oxygen consumption or heart rate were not considered. The protocol dictated exercise be performed on cycle ergometer and therefore there is no treadmill data available. Cycle ergometry was principally included as workload/power output could be controlled and used to fine-tune progressive overload during the training programme. This is explained in the HITT or MISS UK protocol paper published in 2016.

3. Line 164; I recommend the author to provide clear definition or specific criteria for defining “experienced” and “novice” examiner in this study.

We have adjusted the manuscript to read “However, one examiner had substantially more experience with O2Pulse morphology (>6 years) and inflections. This examiner is subsequently referred to as ‘experienced’ whilst the other is termed ‘novice’ (<1 year).”

4. Line 169; What was the exact time between the first and second observation of the experienced examiner?

The times varied due to the volume of data and the time commitment involved. We have altered the manuscript to read “The ‘experienced’ examiner revisited the ‘yes’ curves at a later date (7- 14 days) to re-quantify the inflection threshold.”

5. From the methodological point of view, despite the fact that it was mentioned in the limitation section, when the main goal of this study was to establish a novel objective method by comparing it with a common subjective method, using a novice examiner and basing the conclusion on that might raise some concerns. Please elaborate more on that.”

In practice novice examiners are involved in data interpretation alongside more experienced colleagues. Thus, including the novice examiners allows for more real-world practical interpretations to be made. Furthermore, the experienced examiner is the one for whom intra-rater comparisons were made.

Attachment

Submitted filename: Response to Reviewers.docx

pone.0299486.s001.docx (9.8KB, docx)

Decision Letter 1

Zulkarnain Jaafar

12 Feb 2024

Inter- and Intra-Observer Reliability and Agreement of O2Pulse Inflection during Cardiopulmonary Exercise Testing: A Comparison of Subjective and Novel Objective Methodology

PONE-D-23-36999R1

Dear Dr. Nickolay,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Zulkarnain Jaafar

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #3: The authors have addressed all my comments, and the manuscript is now appropriate for publication. I have no further comments.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #3: Yes: Ehsan Amiri

**********

Acceptance letter

Zulkarnain Jaafar

28 Feb 2024

PONE-D-23-36999R1

PLOS ONE

Dear Dr. Nickolay,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Zulkarnain Jaafar

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0299486.s001.docx (9.8KB, docx)

    Data Availability Statement

    All Master files are available from the Open Science Framework database https://osf.io/d4pv3/.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES