Skip to main content
PLOS One logoLink to PLOS One
. 2021 Jul 15;16(7):e0253926. doi: 10.1371/journal.pone.0253926

OCRbayes: A Bayesian hierarchical modeling framework for Seahorse extracellular flux oxygen consumption rate data analysis

Xiang Zhang 1,2, Taolin Yuan 2, Jaap Keijer 2, Vincent C J de Boer 2,*
Editor: Ravirajsinh Jadeja3
PMCID: PMC8282019  PMID: 34265000

Abstract

Background

Mitochondrial dysfunction is involved in many complex diseases. Efficient and accurate evaluation of mitochondrial functionality is crucial for understanding pathology as well as facilitating novel therapeutic developments. As a popular platform, Seahorse extracellular flux (XF) analyzer is widely used for measuring mitochondrial oxygen consumption rate (OCR) in living cells. A hidden feature of Seahorse XF OCR data is that it has a complex data structure, caused by nesting and crossing between measurement cycles, wells and plates. Surprisingly, statistical analysis of Seahorse XF data has not received sufficient attention, and current methods completely ignore the complex data structure, impairing the robustness of statistical inference.

Results

To rigorously incorporate the complex structure into data analysis, here we developed a Bayesian hierarchical modeling framework, OCRbayes, and demonstrated its applicability based on analysis of published data sets.

Conclusions

We showed that OCRbayes can analyze Seahorse XF OCR experimental data derived from either single or multiple plates. Moreover, OCRbayes has potential to be used for diagnosing patients with mitochondrial diseases.

Introduction

Mitochondria are double-membrane organelles that are central hubs in regulating energy generation and partitioning. Patients with genetic defects in mitochondrial function are often affected by severe and progressive disease in early life [1]. Furthermore, mitochondrial disorders have also been found to be involved in cardiovascular diseases [2], type II diabetes [3], neurodegenerative disease [4] and cancer [5]. Thus, restoring mitochondrial function is emerging to be a therapeutic target for both common diseases [6] as well as genetic mitochondrial diseases [7].

Mitochondria produce energy in oxidative phosphorylation (OXPHOS) primarily by transferring electrons along the electron transport chain (ETC) on the inner membrane. Along the ETC, there are four complexes (complex I, II, III and IV), together building up a proton gradient that is ultimately used by ATP synthase to generate ATP. Since electrons are primarily accepted by O2 to produce H2O, the OXPHOS activity can be assessed by measuring oxygen consumption rate (OCR). As a reliable and efficient platform, Seahorse XF analyzer provides a multiwell plate based respirometry assay that is widely used to quantify OCR in living cells [8]. Typically, the Seahorse XF analyzer measures the OCR of cells in a 96-well plate under different ETC/OXPHOS perturbation scenarios, used for assessing mitochondrial functionality such as maximal respiration, leak respiration and ATP-linked respiration [9].

A typical Seahorse assay includes three measurement cycles for each phase. Every measurement cycle starts by lowering the cartridge and creating a temporary semi-closed ~2 μL chamber [10]. During a measurement cycle, fluorescent oxygen sensors capture oxygen concentration changes in the chamber and outputs OCR. The Seahorse XF analyzer measures OCR in tens to hundreds of thousands of cells per well and typically requires 4–5 replicate wells per experimental group, allowing the analysis of multiple experimental groups in one plate [11]. Since measurement cycles are nested within phases, phases are crossed with wells, and wells are nested in experimental groups, a very complex structure is inherently embedded in the Seahorse XF OCR data. Surprisingly, current Seahorse XF data analyses most often ignore this complex structure, and by default, data sets are often chopped into subgroups followed by performing ANOVA-like statistical tests. Although there are advanced tools developed for Seahorse data analysis [12, 13], none of them comprehensively take the complexity of the data structure into account. As a result, OCR variation between measurement cycles, replicate wells and replicate plates is overlooked, impairing the robustness of the interpretation of Seahorse XF OCR outcomes, and eventually the development of mitochondrial targeted therapies as well as our understanding of mitochondrial biology.

A natural way to incorporate the complex structure into data analysis is to use hierarchical modeling, which has been extensively developed for analyzing gene expression data [14, 15]. Here we developed a Bayesian hierarchical modeling framework, OCRbayes, for the Seahorse XF OCR data analysis. Compared to the currently most advanced Seahorse data analysis tool, OCR-stats which is a frequentist approach, OCRbayes is based on a fully Bayesian approach due to its flexibility and convenience for constructing the hierarchical models. To demonstrate the applicability of our approach as well as its potential implication for mitochondrial disease diagnostics, we applied OCRbayes to a publicly available OCR data set [13], which contains over 200 Seahorse experiments performed on human fibroblasts derived from patients with mitochondrial diseases and controls.

Method

OCRbayes: A Bayesian hierarchical modeling framework for Seahorse OCR data analysis

In order to incorporate the complex data structure into the analysis of Seahorse XF OCR measurements, we developed a Bayesian hierarchical modeling framework, OCRbayes. In this study, we focused on experimental data containing two groups, such as patient and control group. From our perspective, the Seahorse OCR data include three levels, including 1) measurement cycle, 2) well and 3) plate.

During a measurement cycle, the Seahorse XF analyzer uses fluorescent oxygen sensors to track OCR. For every interval, multiple measurement cycles are performed in order to accurately measure the OCR. A typical Seahorse assay contains four intervals. The first interval refers to initial phase, and the second, third, fourth interval refer to phase after injecting oligomycin (blocking proton translocation through ATP synthase), FCCP (allowing protons to move into the mitochondrial matrix independent of the ATP synthase) and antimycin/rotenone (inhibiting complex I and III, shutting down mitochondrial respiration).

The variation between the OCR values within an interval is the between measurement cycle variation. Since OCR values must be positive, we used a lognormal distribution with the true OCR value at the log scale (logOCRtrue) and the between measurement cycle standard deviation (also at the log scale, logOCRsd) to model each observed OCR value (OCRobs).

OCRobs[i]Lognormal(logOCRtrue[P[i],W[i],I[i]],logOCRse[P[i],I[i]]) (1)

OCRobs is a vector of length Nplate×Nwell×Ninterval×Nmeasurement, where Nplate, Nwell, Ninterval and Nmeasurement are number of plates, wells, intervals and measurement cycles, respectively. The logOCRtrue refers to Nplate layers of matrices. Each layer is a matrix with Nwell rows and Ninterval columns. logOCRsd is a matrix of Nplate rows and Ninterval columns.

In a typical Seahorse XF assay, one cell line undergoing the same experimental treatment is seeded in multiple wells. However, the OCRs in these replicate wells will not be the same, and the difference is called between well variation. One obvious reason causing the between well variation is that the number of cells in these replicate wells are not identical. To adjust for the effect of cell number difference, we modeled the true OCR value as a function of the cell number, and from there estimated OCRper 1k cells, which represents OCR value per 1000 cells that received the same treatment (or from the same group) and injection. In addition to cell number difference, technical, procedural or instrumental noise can also contribute to between well variation. We captured this well-to-well variation after accounting for cell number difference with the residual parameter σwell.

logOCRtrue[P[i],W[i],I[i]]Normal(OCRper1kcells[P[i],G[i],I[i]]×Ncells[W[i]],σwell[P[i],G[i],I[i]]) (2)

OCRper 1k cells is a three-dimensional matrix with Nplate layers, and each layer has Ngroup rows and NInterval columns. Ncells is a vector of length Nwell, and every entry represents the cell number in that well.

Apart from the technical replicates in one plate, the biological insight for a (patient) cell line or a specific condition is generally validated by repeating the Seahorse assay on different days. As a result, OCR data are distributed on more than one plate. Due to batch effects such as plating, culturing or environmental differences between time and laboratories, OCR measurements will differ between plates. To take into account the between plate variation, we used another lognormal distribution with the logarithm transformed OCR value per 1000 cells, (μOCRper1kcells) and the between plate standard deviation (σplate) to model the OCR value per 1000 cells OCRper 1k cells.

OCRper1kcells[P[i],G[i],I[i]]Lognormal(log(μOCRper1kcells[G[i],I[i]]),σplate[G[i],I[i]]) (3)

Bayesian inference

OCRbayes focuses on calculating posterior distributions for OCR per 1000 cells in various experimental conditions (P(μOCRper1kcells|OCRobs)). The posterior distributions were combinations of prior distributions (P(logOCRse), P(σwell), P(σplate) and P(μOCRper1kcells)), and the likelihood function (P(OCRobs|Ncell,μOCRper1kcells,logOCRsd,σwell,σplate))

In this study, since our case studies focused on human fibroblast cells, we used informative prior distributions for logOCRsd, σwell, σplate and μOCRper1kcells. Since values of all these parameters must be positive, we used four lognormal distributions. We applied maximum likelihood estimation to calculate the lognormal distribution parameters based on the Seahorse XF OCR data in OCR-stats [13] by running the fitdistr function built in the MASS r package. In this study, our prior distributions are

logOCRsdLognormal(3.23,0.79)σwellLognormal(1.62,0.53)σplateLognormal(1.18,0.05)μOCRper1kcellsLognormal(0.3,0.79)

The Bayesian multi-level model of OCRbayes was implemented in Stan (version 2.19.3) [16]. We fitted the model by running Hamiltonian Markov Chain Monte Carlo. We ran four Markov chains with 2000 iterations in each chain. The code can be found at https://github.com/XiangZhangSC/seahorse. We also provided the R code of OCRbayes in the supplementary file.

Calculation of bioenergetic measures

Based on OCR (per 1000 cells), we calculated various bioenergetic measures, such as basal respiration, proton leak, ATP-linked respiration, spare respiratory capacity and maximal respiration. These bioenergetic measures are defined as below.

Basalrespiration=OCRper1kcells,initialOCRper1kcells,antimycin/rotenone (4)
ATPlinkedrespiration=OCRper1kcells,initialOCRper1kcells,oligomycin (5)
Protonleak=OCRper1kcells,oligomycinOCRper1kcells,antimycin/rotenone (6)
Sparerespiratorycapacity=OCRper1kcells,FCCPOCRper1kcells,initial (7)
Maximalrespiration=OCRper1kcells,FCCPOCRper1kcells,antimycin/rotenone (8)

Human fibroblast OCR data

To benchmark OCRbayes as well as illustrate how OCRbayes can be used for analyzing OCR in patients with mitochondrial diseases, we used the OCR data set provided in Yepez et al. [13]. This data set contains Seahorse OCR measurements from 203 human fibroblast cell lines that have been assayed in 126 plates. Normal human dermal fibroblast (NHDF) reference cell lines were used as controls in all plates. The other 202 cell lines were derived from patients suspected to suffer from rare mitochondrial diseases. Among these 202 cell lines, 26 fibroblast cell lines were measured in multiple plates. We used the 176 patient fibroblast cell lines that were assayed in a single plate as well as the control cell line in the same plates for estimating prior distributions. The other 26 fibroblast cell lines were used for benchmarking OCRbayes. Among the 202 cell lines that were analyzed, Yepez et al. [13] labeled 6 patient cell lines as positive controls that have shown statistically significant reduction in maximum respiration. Meanwhile, Yepez et al. also labeled another two patient cell lines as negative controls, since these cell lines did not show changes in OCR in earlier experiments [13].

We processed the original data by removing wells in which single or more OCR measurements were missing. After filtering, we used 176 patient cell lines together with the NHDF control cell line on 78 plates for estimating the prior distributions for between measurement cycle variation (logOCRsd) and between well variation (σwell). To estimate the prior distribution for between plate variation (σplate) and mean OCR per 1000 cells (μOCRper1kcells), we used the OCR values from the NHDF cell lines that were plated in all 78 plates.

Statistical analysis

For benchmarking OCRbayes, we compared the patient cell lines to the control cell line (NHDF). Since the maximal respiration was reported as the primary mitochondrial dysfunction outcome in the Yepez et al. study [13], we reported here the mean log2 fold change of maximal respiration together with the False Discovery Rate (FDR). FDR was calculated based on the posterior error probability (1P(log2(patientcontrol|OCRobs)<0)), where P(log2(patientcontrol|OCRobs) represented the posterior distribution of fold change after fitting the Bayesian model to the experimental OCR data. If the FDR was below 0.05, we considered that the difference between patient and control cell line was statistically significant.

Results

OCRbayes: From OCR to respiration metric difference

To demonstrate the applicability, we applied OCRbayes to analyze OCR data derived from two patient cell lines with known mutations in either BOLA3 or PET100 gene. These two genes encode proteins that are essential for biogenesis or assembly of mitochondrial complexes [1719]. We compared the patient cell line to the control cell lines within their respective plates. These two cell lines were both assayed on two plates on two different days. For both cell lines, we observed a clear decrease in maximal respiration compared to the control cell lines in both plates (Fig 1A and 1E). Meanwhile, we noticed that the range of OCR values in the two plates used for profiling BOLA3 patient cell line differed considerably (Fig 1A). In particular, maximal OCR value in the first plate was around 200 pmol/min, whereas in the second plate the maximal OCR value was around 100 pmol/min. In contrast, the range of OCR values in the two plates used for profiling PET100 were similar to each other. In addition, it was obvious that there was considerable variation between the replicate wells in both plates for both cell lines.

Fig 1.

Fig 1

(A) and (E) are OCR profiles for patient cell lines that have a genetic mutation in either BOLA3 (A) or PET100 (E), compared to control cell lines in two plates. (B) and (F) are posterior distributions for OCR per 1000 cells in BOLA3 (B) and PET100 (F) mutated patient cell lines during initial phase (Int1), oligomycin phase (Int2), FCCP phase (Int3) and antimycin/rotenone phase (Int4). (C) and (G) are posterior distributions for respiration metrics in BOLA3 (C) and PET100 (G) mutated patient cell lines and control cell lines. (D) and (H) are posterior distributions of log2 fold change in the respiration metrics between patient and control cell line.

By applying OCRbayes, we combined the two plates for each cell line, calculated the posterior distributions for OCR per 1000 cells during the four intervals of a typical Seahorse assay, including initial phase without injection (Int1), oligomycin phase (Int2), FCCP phase (Int3) and antimycin/rotenone phase (Int4) (Fig 1B and 1F). Next, the calculated posterior OCR per 1000 cells for the four intervals were transformed into the various respiration metrics, including ATP-linked respiration, basal respiration, maximal respiration, proton leak and spare respiratory capacity (Fig 1C and 1G). In the last step, we compared the respiration metrics in patient cell line to the control cell line, and calculated the posterior distribution for log2(fold change) (Fig 1D and 1H). We observed that BOLA3 patient cell line showed reduction in basal respiration (posterior mean log2(fold change) -0.974, 95% credible interval [-1.93, -0.0970]) and maximal respiration (-1.06 [-1.87,-0.228]), compared to the control cell line (Fig 1D). On the other hand, we observed no difference in ATP-linked respiration (-1.08 [-2.38, 0.117]), proton leak (-0.630 [-3.87, 2.32]) and spare respiratory capacity (-1.26 [-4.41,1.62]) between BOLA3 patient cell line and the control cell line (Fig 1D). Meanwhile we observed lower ATP-basal respiration (-1.75 [-3.18, -0.546]), basal respiration (-1.58 [-2.52, -0.667]) and maximal respiration (-1.28 [-2.10, -0.436]) in the PET100 patient cell line than the control cell line (Fig 1H). We observed no difference in proton leak (-0.906 [-4.14,2.54]) and spare respiratory capacity (-0.826 [-3.29, 2.34]) in the PET100 patient cell line compared to the control cell line (Fig 1H).

OCRbayes accounts for various technical variations during Seahorse XF OCR data analysis

OCR measurements generated by the Seahorse XF analyzer were affected by various technical variations, including between measurement cycle variation, between well variation and between plate variation. To visualize the technical variations, we used the cell line with a genetic mutation in NSUN3 gene (patient cell line 76065). This cell line was measured on five different plates, allowing us to visualize all three technical variations.

OCRbayes calculated the between measurement cycle variation for each plate during each interval. For each cell line in each plate, OCRbayes calculated the between well variation during each interval. Regarding the between plate variation, OCRbayes estimated it for each cell line during each interval. All the variation values were on the log scale. We observed that for both patient and control cell line in the initial phase (Int1) and FCCP phase (Int3), the between plate variation was larger than the between well variation, which itself was larger than the between measurement cycle variations (Fig 2). However, in the oligomycin phase (Int2) and antimycin/rotenone phase (Int4), the between plate variation was not always larger than the other two technical variations (Fig 2). In particular, we observed that the between well variation was larger than the between measurement cycle and the between plate variation in three plates during the antimycin/rotenone phase (Int4) in the patient cell line.

Fig 2. Technical variations in Seahorse XF OCR data of experiments based on the patient cell line with mutation in NSUN3 gene and the control cell line.

Fig 2

Int1, Int2, Int3 and Int4 represent initial, oligomycin, FCCP and antimycin/rotenone phase. All the variations were on log scale. The dots are the posterior mean and the line segment represent the 95% credible interval.

Benchmark OCRbayes

To demonstrate that OCRbayes works properly, we applied it to analyze the published Seahorse XF OCR data set containing 26 patient cell lines as well as a control cell line reported by Yepez et al. [13]. This data set contains 6 cell lines that were labeled as positive controls and 2 cell lines that were labeled as negative controls (as explained in the material and methods section).

Based on our analysis, we found 6 patient cell lines that had lower maximal respiration compared to the control cell line with False Discovery Rate (FDR) below 0.05 (Fig 3A and 3B). Among the 6 patient cell lines that showed statistically significant reduction in maximal respiration compared to the control cell line, the patient cell line 73387 (mutation in PET100 gene) showed the largest effect whereas the patient cell line 76065 (mutation in NUSN3 gene) showed the smallest effect (Fig 3A). Five of these 6 patient cell lines were labeled as positive controls in the original study [13]. Meanwhile, the two negative controls (patient cell lines 73901 and 91410) had FDR above 0.05 in our analysis.

Fig 3. Maximal respiration change patient vs. control on multiple plates.

Fig 3

(A) average log2 fold change (y-axis) of maximal respiration of all cell lines repeated across plates (x-axis) and their respective controls, sorted by the False Discovery Rate (FDR). Red and blue dots represent positive and negative controls, respectively. (B) similar to (A), but depicting FDR. Red dashed line represents FDR = 0.05.

Our analysis successfully recalled both negative controls and five out of six positive controls. Interestingly, our analysis showed that there was no significant reduction of maximal respiration in the cell line with a genetic mutation in SFXN4 gene (patient 61818) that was previously labeled as positive control [13]. This patient cell line was measured on three independent plates on three different days. The range of OCR values of these three plates differed, indicating considerable between plate variation (Fig 4A). The OCR profiles of the patient and control cell line were overlapping in the first plate (Fig 4A). In contrast, the OCR profiles derived from the second and third plate showed OCR decreasing in the patient cell line compared to the control cell line, especially during the FCCP phase (Fig 4A). Meanwhile, we observed considerable variation between the replicate wells as well as measurement cycles in these Seahorse assays (Fig 4A).

Fig 4.

Fig 4

(A) Raw OCR values for the cell line with genetic mutation in SFXN4 gene (patient cell line 61818) in three repeated plates. (B) Posterior distributions of maximal respiration of the cell line with genetic mutation in SFXN4 gene (patient cell line 61818) and the control cell line in the same plate. (C) Posterior distributions of maximal respiration of the cell line after combing all three repeated experiments.

In addition, the between measurement cycle variation in the third plate seemed to be larger than the other two plates (Fig 4A). We analyzed these three repeated experiments separately as well as combined them together. Our separate analysis showed that none of the three experiments showed significant reduction in maximal respiration in the SFXN4 patient cell line (Fig 4B). The posterior mean log2(fold change) and the corresponding 95% credible intervals derived from the first, second and third Seahorse assay were 0.122 [-0.834, 1.04], -0.431 [-1.40, 0.499] and -0.0977 [-1.04, 0.827], respectively. When we used OCRbayes to analyze all the three plates together, we found that the posterior mean log2(fold change) of maximal respiration was -0.200 [-0.877, 0.500] compared to the control cell line (Fig 4C). However, this tendency of reduction in maximal respiration in SFXN4 patient cell line was not statistically significant.

OCRbayes can be used to evaluate the probability that a patient fibroblast cell line has an abnormality in mitochondrial respiration

A feature that makes OCRbayes unique from other methods is that OCRbayes can evaluate what is the probability that a patient has abnormality in his or her fibroblast mitochondrial respiration based on a single Seahorse assay or multiple Seahorse assays. To demonstrate this feature, we used two patient cell lines that showed significant reduction in maximal respiration in our analysis. One patient cell line was the PET100 gene mutation fibroblast (patient cell line 73387) and the other patient cell line was NSUN3 gene mutation fibroblast (patient cell line 76065). Among the 6 patient cell lines that showed statistically significant reduction in maximal respiration compared to the control cell line, the PET100 and NSUN3 cell line showed the largest and smallest effect, respectively (Fig 3B).

The PET100 patient cell line was measured on two plates. Based on the first assay, our analysis showed that the posterior probability of this PET100 mutation carrier having lower maximal respiration than the control was 97.8%. By repeating the experiment once more, the posterior probability increased to 99.8% (Fig 5A). In contrast, the cell line with mutation in NSUN3 gene (patient cell line 76065) was measured on five different plates. Our analysis showed that based on the first assay, the posterior probability of this NSUN3 mutation carrier having lower maximal respiration than the control was 41.3%. Repeating the assay once increased the posterior probability from 41.3% to 73.4%. When the assay was repeated for the third, fourth and fifth time, the corresponding posterior probabilities increased to 78.7%, 87.7% and 90.0%, respectively (Fig 5B). In summary, we demonstrate that OCRbayes can be used to evaluate the probability that a patient fibroblast cell line has an abnormality in mitochondrial respiration based on a single Seahorse assay or multiple Seahorse assays.

Fig 5.

Fig 5

(A) Posterior probability of decreasing maximal respiration in patient cell line with mutation in PET100 gene compared to the control cell line in the scenario with 1 or 2 plates. (B) Posterior probability of decreasing maximal respiration in patient cell line with mutation in NUSN3 gene compared to the control cell line in the scenario with 1, 2, 3, 4 or 5 plates.

Discussion

Although Seahorse XF analyzer is widely used in bioenergetic profiling, its data analysis has not received sufficient attention. A hidden feature of Seahorse XF OCR data is that it contains a complex data structure. The complex data structure is due to the fact that measurement cycles are nested within injections, injections are crossed with wells, and wells are nested in plates. As far as we know, currently there is no data analysis protocol that takes into account this complex data structure, impairing the robustness of Seahorse XF OCR data analysis outcomes. This is because when one ignores the data structure, one also ignores the variations between measurement cycles, between wells and between plates. In order to make the Seahorse data analysis more robust, in this study we developed a Bayesian hierarchical modeling approach, OCRbayes, which accounts for all these technical variations during the data analysis.

Seahorse XF OCR measurements are noisy

An OCR value is determined not only by mitochondrial activity, but also by technical noise including 1) between measurement cycle variation, 2) between well variation and 3) between plate variation.

Every phase typically contains three measurement cycles, resulting in three OCR values. Since every measurement cycle starts with a “mix and wait” step to ensure the same baseline of cell values, cell physiology should not substantially change within a phase. Existing tools such as OCR-stats [13] and SHORE [12] use different strategies to select a single data point to represent an injection phase. When different data points were chosen, one can get different outcomes, making the current Seahorse XF data analysis less robust. To avoid this ambiguity, our approach did not select any particular data point, instead modeled all three OCR data. By doing so, we incorporated the uncertainty about “which data point should I choose?” into the data analysis and focused on the average behavior.

The between well variation refers to a common observation that OCR values differ among the replicate wells. One important factor leading to the variation between replicate wells is that cell numbers are not identical in these wells. The wells with more cells would have higher OCR than the wells containing fewer cells. Fortunately, cell number in each well can be quantified experimentally and used for the data analysis [20, 21]. In addition to cell number difference, initial conditions, treatment concentration, or fluorophore sleeve calibration can also contribute to variation between wells. OCRbayes also takes into account the between well variation caused by these unobserved factors.

Between plate variation takes place when the same Seahorse experiment is repeated on different days and on more than one plate. Due to differences in temperature, seeding time, growth time, growth medium, sensor cartridge as well as treatment efficiency, the OCR outcomes will differ between plates [13, 22]. Often the between plate variation is assumed to be the dominant technical variation involved in Seahorse OCR data. Based on our analysis, we showed that this assumption may be appropriate for OCR measurements derived from the initial and FCCP phase, but may not work for the OCR values derived from oligomycin and antimycin/rotenone phase. OCR values in the oligomycin and antimycin/rotenone phase were very small and possibly close to the detection limit of the Seahorse XF in a well. Thus, it is more challenging to accurately measure the OCR in these phases.

Comparison with other statistical tool for Seahorse XF OCR data analysis

In this work, we compared the maximal respiration in 26 patient cell lines to the control cell line individually as what was done in OCR-stats [13]. Overall our analysis recalled successfully all negative controls and five out of six positive controls. Patient cell line 61818 was labeled as a positive control since this patient was found having a mutation in SFXN4 gene. A recent study based on erythroleukemic cell line showed that SFXN4 knockout resulted in significant decrease in all parameters of respiration, including baseline respiration, respiratory ATP synthesis, maximal respiration, and spare respiratory capacity [23]. However, our analysis showed that the maximal respiration of this patient was not significantly different from the control cell line. Our further analysis showed that the patient cell line only showed a tendency of having lower maximal respiration than control in one of the three repeated experiments.

We also noticed that other outcomes were also not identical as what was presented in the OCR-Stats publication [13]. Strikingly, in our analysis PET100 mutation fibroblast (patient cell line 73387) showed a significant decrease in maximal respiration compared to the control cell line. However, in the OCR-stats [13], the difference in maximal respiration was not statistically significant in this patient. This patient was diagnosed carrying a homozygous loss of function mutation in the PET100 gene, which encodes a mitochondrial complex IV biogenesis factor [13, 17, 18]. A homozygous truncating variant (c.142C>T, p.(Gln48*)) in the PET100 gene was found to lead to a complete loss of enzyme activity, and caused deficiency in complex IV [24]. Therefore, our observation of significant reduction in maximal respiration in PET100 mutation fibroblast based on the OCRbayes is in line with the observed loss of enzymatic activity in patients carrying the genetic mutations in PET100. This difference in analysis outcome highlighted the advantage of OCRbayes which used hierarchical models to incorporate the complex data structure into Seahorse OCR data analysis, helping separate the technical variations from the OCR measurements and identify difference in the biological OCR.

OCRbayes is potentially used for screening patients with mitochondrial diseases

Since an increase in proton leak or a decrease in basal or maximal respiration are indicators of mitochondrial dysfunction [25], Seahorse XF is potentially to be used for screening mitochondrial disease patients. Besides providing solely statistical significance information as other methods do, OCRbayes allows us to calculate posterior probability that the maximal respiration (or any other respiration metrics) was abnormal in a patient even based on a single Seahorse assay. This feature is also helpful for deciding whether we need to run a single or multiple Seahorse assays for a patient.

For example, the patient cell line with mutation in PET100 gene showed the largest decrease in maximal respiration in our analysis. The posterior probability for this patient having abnormal mitochondrial respiration after observing a single Seahorse assay was already about 98%. It is reasonable to run just a single assay in this case. In contrast, the cell line with mutation in NSUN3 gene had the smallest effect size among all the patient cell lines that showed significant reduction in maximal respiration. The posterior probability of this patient having impaired maximum respiration after running a single Seahorse assay was modest (about 40%). Repeating the experiment once more increased the posterior probability from 40% to 70%. Thus in this case, it is beneficial to run multiple Seahorse assays. We think that the posterior probability is a useful metric to help scientists to decide whether to perform extra Seahorse assays on the patient cell lines. The ability of posterior probability calculation by OCRbayes not only allows to make better conclusions about the significance of the effect of the experimental perturbation, it can also prevent from repeating unnecessary, often expensive, experiments. Furthermore, when the availability of patient material or the number of target cells (specific isolated immune cell subsets) is limited, OCRbayes is valuable to exploit the limited data and facilitate proper validation the experiments.

Strengths and limitations

OCRbayes has several advantages. The first advantage of our approach is that it incorporates various technical variations including between measurement cycle variation, between well variation as well as between plate variation during the estimation of bioenergetic measures. All current methods need to choose a single data point from the three measurement cycles to represent the OCR during a particular phase. This procedure ignores the uncertainty and makes analysis less robust because different choice of data points may lead to different results. The second advantage is that OCRbayes can calculate posterior probability for difference in various bioenergetic measures based on Seahorse OCR data consisting of a single plate or multiple plates. This is a useful feature for screening samples derived from patients with mitochondrial diseases. A third advantage of OCRbayes is that it can be used for guiding improvement of experimental protocols for running Seahorse assays, because the hierarchical model can explicitly quantify the changes in technical variations resulted from different protocols.

One limitation of this study is that OCRbayes is extensively tested with experimental data derived from human fibroblasts cell lines. This is because our model development was restricted to the limited publicly available Seahorse OCR data sets. To generalize our statistical method to other cell lines than human skin fibroblasts, the prior distributions of those cell lines should be estimated from previously collected Seahorse OCR data. This is because each cell line or cell system has its own growth conditions and metabolic characteristics, making the prior distributions based on the human fibroblasts data not suitable for other cell lines. Furthermore, the prior distributions for plate-to-plate or well-to-well variation are likely dependent on factors such as laboratory and machine. Although many Seahorse studies have been published, few of them provided the raw OCR measurements together with the cell number quantification information. Generalization of our method would benefit from the open availability of raw Seahorse OCR data. This would also facilitate making OCRbayes applicable for other cell lines than human skin fibroblasts.

Data Availability

The datasets analysed during the current study are available in the Github repository (https://github.com/XiangZhangSC/seahorse).

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Koene S, Wortmann SB, Vries MC de, Jonckheere AI, Morava E, Groot IJM de, et al. Developing outcome measures for pediatric mitochondrial disorders: which complaints and limitations are most burdensome to patients and their parents? Mitochondrion. 2013;13: 15–24. doi: 10.1016/j.mito.2012.11.002 [DOI] [PubMed] [Google Scholar]
  • 2.Ballinger SW. Mitochondrial dysfunction in cardiovascular disease. Free Radic Biol Med. 2005;38: 1278–95. doi: 10.1016/j.freeradbiomed.2005.02.014 [DOI] [PubMed] [Google Scholar]
  • 3.Szendroedi J, Phielix E, Roden M. The role of mitochondria in insulin resistance and type 2 diabetes mellitus. Nat Rev Endocrinol. 2011;8: 92–103. doi: 10.1038/nrendo.2011.138 [DOI] [PubMed] [Google Scholar]
  • 4.Johri A, Beal MF. Mitochondrial dysfunction in neurodegenerative diseases. J Pharmacol Exp Ther. 2012;342: 619–30. doi: 10.1124/jpet.112.192138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Erez A, DeBerardinis RJ. Metabolic dysregulation in monogenic disorders and cancer—finding method in madness. Nat Rev Cancer. 2015;15: 440–8. doi: 10.1038/nrc3949 [DOI] [PubMed] [Google Scholar]
  • 6.Murphy MP, Hartley RC. Mitochondria as a therapeutic target for common pathologies. Nat Rev Drug Discov. 2018;17: 865–886. doi: 10.1038/nrd.2018.174 [DOI] [PubMed] [Google Scholar]
  • 7.Weissig V. Drug Development for the Therapy of Mitochondrial Diseases. Trends Mol Med. 2020;26: 40–57. doi: 10.1016/j.molmed.2019.09.002 [DOI] [PubMed] [Google Scholar]
  • 8.Wu M, Neilson A, Swift AL, Moran R, Tamagnine J, Parslow D, et al. Multiparameter metabolic analysis reveals a close link between attenuated mitochondrial bioenergetic function and enhanced glycolysis dependency in human tumor cells. Am J Physiol Cell Physiol. 2007;292: C125–36. doi: 10.1152/ajpcell.00247.2006 [DOI] [PubMed] [Google Scholar]
  • 9.Chacko BK, Kramer PA, Ravi S, Benavides GA, Mitchell T, Dranka BP, et al. The Bioenergetic Health Index: a new concept in mitochondrial translational research. Clin Sci (Lond). 2014;127: 367–73. doi: 10.1042/CS20140101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gerencser AA, Neilson A, Choi SW, Edman U, Yadava N, Oh RJ, et al. Quantitative microplate-based respirometry with correction for oxygen diffusion. Anal Chem. 2009;81: 6868–78. doi: 10.1021/ac900881z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Artyomov MN, Van den Bossche J. Immunometabolism in the Single-Cell Era. Cell Metab. 2020;0. doi: 10.1016/j.cmet.2020.09.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Nicholas D, Proctor EA, Raval FM, Ip BC, Habib C, Ritou E, et al. Advances in the quantification of mitochondrial function in primary human immune cells through extracellular flux analysis. PLoS One. 2017;12: e0170975. doi: 10.1371/journal.pone.0170975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Yépez VA, Kremer LS, Iuso A, Gusic M, Kopajtich R, Koňaříková E, et al. OCR-Stats: Robust estimation and statistical testing of mitochondrial respiration activities using Seahorse XF Analyzer. PLoS One. 2018;13: e0199938. doi: 10.1371/journal.pone.0199938 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26: 139–40. doi: 10.1093/bioinformatics/btp616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15: 550. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al. Stan: A Probabilistic Programming Language. J Stat Softw. 2017;76. doi: 10.18637/jss.v076.i01 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Oláhová M, Hardy SA, Hall J, Yarham JW, Haack TB, Wilson WC, et al. LRPPRC mutations cause early-onset multisystem mitochondrial disease outside of the French-Canadian population. Brain. 2015;138: 3503–19. doi: 10.1093/brain/awv291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lim SC, Smith KR, Stroud DA, Compton AG, Tucker EJ, Dasvarma A, et al. A founder mutation in PET100 causes isolated complex IV deficiency in Lebanese individuals with Leigh syndrome. Am J Hum Genet. 2014;94: 209–22. doi: 10.1016/j.ajhg.2013.12.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Haack TB, Rolinski B, Haberberger B, Zimmermann F, Schum J, Strecker V, et al. Homozygous missense mutation in BOLA3 causes multiple mitochondrial dysfunctions syndrome in two siblings. J Inherit Metab Dis. 2013;36: 55–62. doi: 10.1007/s10545-012-9489-7 [DOI] [PubMed] [Google Scholar]
  • 20.Little AC, Kovalenko I, Goo LE, Hong HS, Kerk SA, Yates JA, et al. High-content fluorescence imaging with the metabolic flux assay reveals insights into mitochondrial properties and functions. Commun Biol. 2020;3: 1–10. doi: 10.1038/s42003-019-0734-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Janssen JJE, Lagerwaard B, Bunschoten A, Savelkoul HFJ, Neerven RJJ van, Keijer J, et al. Novel standardized method for extracellular flux analysis of oxidative and glycolytic metabolism in peripheral blood mononuclear cells. Sci Rep. 2021;11: 1662. doi: 10.1038/s41598-021-81217-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Koopman M, Michels H, Dancy BM, Kamble R, Mouchiroud L, Auwerx J, et al. A screening-based platform for the assessment of cellular respiration in Caenorhabditis elegans. Nat Protoc. 2016;11: 1798–816. doi: 10.1038/nprot.2016.106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Paul BT, Tesfay L, Winkler CR, Torti FM, Torti SV. Sideroflexin 4 affects Fe-S cluster biogenesis, iron metabolism, mitochondrial respiration and heme biosynthetic enzymes. Sci Rep. 2019;9: 19634. doi: 10.1038/s41598-019-55907-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Oláhová M, Haack TB, Alston CL, Houghton JA, He L, Morris AA, et al. A truncating PET100 variant causing fatal infantile lactic acidosis and isolated cytochrome c oxidase deficiency. Eur J Hum Genet. 2015;23: 935–9. doi: 10.1038/ejhg.2014.214 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Brand MD, Nicholls DG. Assessing mitochondrial dysfunction in cells. Biochem J. 2011;435: 297–312. doi: 10.1042/BJ20110162 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Ravirajsinh Jadeja

25 May 2021

PONE-D-21-10491

OCRbayes: A Bayesian hierarchical modeling framework for Seahorse extracellular flux oxygen consumption rate data analysis

PLOS ONE

Dear Dr. Zhang,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

 Overall, the manuscript has been very well received by the reviewers and could be accepted for publication following addressing all the comments.

Please submit your revised manuscript by Jul 03 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Ravirajsinh Jadeja, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is an interesting paper which addresses major limitations with the data analysis related to Seahorse oxygen consumption rate assays, especially where the conclusions are drawn from multiple repeats of experiments. The current available analysis methods are limited which consider only a single set of data for each measurement from each plate that doesn’t account for the variability due to other measurements or between the plates assayed on different days. The authors used the Bayesian hierarchical modeling framework as a statistical tool to analyze the OCR data to account for the variables such as between plate, between well and between the measurement cycles. Overall the manuscript is very well written and the experiments are organized to support the hypothesis.

Please address the following

1. Please explain how OCR-stats are different from OCRbayes other than using single measurement in consideration.

2. Selecting a single data point has some rationale. Many times it’s important to choose a single data point as compared to multiple points. Like some times oligomycin takes time to act and its better to take the last measurement. How can you explain that for using OCRbayes?

3. Line 170 states that mitochondrial dysfunction is often reflected in the maximal respiration is not completely true as all other parameters are equally important like proton leak.

4. Is OCRbayes applicable for lab based experiments which are repeated 2-3 times? If not please mention in the limitations.

5. The robustness of the statistical method need to be proved in more than one assay/cells which at least need to be mentioned in limitations.

Reviewer #2: The Seahorse XF-96 analyzer evaluates mitochondrial function in real-time within live cells and would be of great value to test hypotheses on the relevance of mitochondria in various diseases, including cancer, cardiovascular, and neurodegenerative disease pathogenesis.

In this manuscript, Zhang et al. have reported an improved technique called OCRbayes to analyze the Seahorse XF OCR experimental data, which can be used for the diagnosis of patients suffering from mitochondrial diseases. The authors claim that this is a novel protocol that has never been reported elsewhere before.

Here, they have developed a statistically robust method by adopting the Bayesian hierarchical model approach. This approach is adequate to consider all the technical variations which arise due to complex data structures. The OCRbayes is advantageous and more suitable than any other method adopted so far. Also, this approach is ideal for application in the analysis of a patient with mitochondrial abnormalities.

Overall, the methods reported in this manuscript is important for the scientific community studying the mitochondrial function as it is more robust and will allow the investigators to measure mitochondrial function in live cell populations over time, eliminating the need for timed experiments with multiple samples at different time points; thus expediting the research and facilitating explorations of new scientific directions. Moreover, the manuscript is presented in a well-organized fashion, and the methods are well described systematically.

However, the current approach suffers certain limitations as the authors have validated their protocol on only a particular type of cell line and thus requires extended investigation on another type of cell line as mitochondrial dysfunction is linked to various organs, including the heart, brain, kidney, etc. The following comments need to be addressed to improve the current manuscript.

Comments:

Major: None

Minor

1.The author should clarify whether this protocol is applicable to analyze the data generated from other tools such as the Oroboros high-resolution oxygraphy or other similar devices.

.

2a.In their experiments, the authors considered the cell number variations as one of the factors responsible for the in-between well variations in the data within a cell line in a plate. Does minimize this variation by regulating the cell number variations by plating the cells with a multichannel pipette in high accuracy reduces these variations? Considering the following steps may be helpful (a) to mix the cell well before putting it to the reservoir to prevent settling down. (b) Mixing the cells once in every other column/row while plating. (c) Avoiding the outer walls in the experiments since the growth of the cells in these wells are significantly affected by the condensation or air. (d) Further, the slow-growing cells may increase the variability if keeping them in 96 well plates for more than two days.

2b. Is there any variation in their data between adherent and non-adherent cells?.

3. The authors should clarify whether their protocol will apply to the in-vivo settings as in the case of the OCR data generated from the zebrafish embryos in real-time from Simon T bond et al. Method Mol Bio 2018.

4. In this study, the authors have considered data generated from only cell type. The data generated from various sources, as published in Martin P Horan et al. 2012 in the Journal of Gerontology, summarizes multiple data generated from different sources by using the Seahorse XP analyzer. The author should consider discussed data if accessible.

5. Line 47 and 79 are repeated in the introduction/method section and can be rephrased.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Jul 15;16(7):e0253926. doi: 10.1371/journal.pone.0253926.r002

Author response to Decision Letter 0


28 May 2021

Dear Dr. Ravirajsinh Jadeja,

Thank you for considering our manuscript for publication in PLOS ONE. We were pleased to read your comments and those of the reviewers. We have addressed the comments of the reviewers point-by-point and revised the manuscript accordingly. They helped us to improve our manuscript and we thank the reviewers for their constructive suggestions.

Reviewer 1

This is an interesting paper which addresses major limitations with the data analysis related to Seahorse oxygen consumption rate assays, especially where the conclusions are drawn from multiple repeats of experiments. The current available analysis methods are limited which consider only a single set of data for each measurement from each plate that does not account for the variability due to other measurements or between the plates assayed on different days. The authors used the Bayesian hierarchical modeling framework as a statistical tool to analyze the OCR data to account for the variables such as between plate, between well and between the measurement cycles. Overall the manuscript is very well written and the experiments are organized to support the hypothesis.

Please address the following

1.Please explain how OCR-stats are different from OCRbayes other than using single measurement in consideration.

Answer: Statistically, OCRbayes is a Bayesian method which is fundamentally different from OCR-stats which is a frequentist tool. In contrast to OCR-stats which builds upon linear regression, OCRbayes applies hierarchical modeling approach and works with the full data set. We explained this now in the introduction section.

“Compared to the currently most advanced Seahorse data analysis tool, OCR-stats, which is a frequentist approach, OCRbayes is based on a fully Bayesian approach due to its flexibility and convenience for constructing the hierachical models.”

2.Selecting a single data point has some rationale. Many times it’s important to choose a single data point as compared to multiple points. Like some times oligomycin takes time to act and its better to take the last measurement. How can you explain that for using OCRbayes?

Answer: We agree with the reviewer that sometimes selecting a single data point can have some rationale, even though that means discarding part of valuable data that could also contain information. However, we also noticed that different labs and tools may use different strategies (one may choose the last data point or others may choose the minimum value) for data selection. In contrast, OCRbayes uses all data as well as prior knowledge about the OCR for a given model system in a particular injection phase, making the Seahorse data analysis more robust. In the case where a drug injection takes time to act, we recommend increasing the “wait” time after injection, so that a stable state is reached before starting a measurement cycle. The dataset we used did have stable states following injections.

3.Line 170 states that mitochondrial dysfunction is often reflected in the maximal respiration is not completely true as all other parameters are equally important like proton leak.

Answer: We agree with the reviewer that besides the maximal respiration, other respiration parameters are also important. We have changed the line 170 as:

“Since the maximal respiration was reported as the primary mitochondrial dysfunction outcome in the Yepez et al. study [13], we reported here the mean log2 fold change of maximal respiration together with the False Discovery Rate (FDR)”

4.Is OCRbayes applicable for lab based experiments which are repeated 2-3 times? If not please mention in the limitations.

Answer: Yes, OCRbayes is applicable for experiments repeated 2-3 times. In fact, in figure 5A we demonstrated that two plates are sufficient to reach statistical significance for the PET100 deficient cell line.

5.The robustness of the statistical method need to be proved in more than one assay/cells which at least need to be mentioned in limitations.

Answer: We agree with the reviewer and we have changed the limitation section as follow:

“One limitation of this study is that OCRbayes is extensively tested with experimental data derived from human fibroblasts cell lines. This is because our model development was restricted to the limited publicly available Seahorse OCR data sets. To generalize our statistical method to other cell lines than human skin fibroblasts, the prior distributions of those cell lines should be estimated from previously collected Seahorse OCR data. This is because each cell line or cell system has its own growth conditions and metabolic characteristics, making the prior distributions based on the human fibroblasts data not suitable for other cell lines. Furthermore, the prior distributions for plate-to-plate or well-to-well variation are likely dependent on factors such as laboratory and machine. Although many Seahorse studies have been published, few of them provided the raw OCR measurements together with the cell number quantification information. Generalization of our method would benefit from the open availability of raw Seahorse OCR data. This would also facilitate making OCRbayes applicable for other cell lines than human skin fibroblasts.”

Reviewer 2

The Seahorse XF-96 analyzer evaluates mitochondrial function in real-time within live cells and would be of great value to test hypotheses on the relevance of mitochondria in various diseases, including cancer, cardiovascular, and neurodegenerative disease pathogenesis.

In this manuscript, Zhang et al. have reported an improved technique called OCRbayes to analyze the Seahorse XF OCR experimental data, which can be used for the diagnosis of patients suffering from mitochondrial diseases. The authors claim that this is a novel protocol that has never been reported elsewhere before.

Here, they have developed a statistically robust method by adopting the Bayesian hierarchical model approach. This approach is adequate to consider all the technical variations which arise due to complex data structures. The OCRbayes is advantageous and more suitable than any other method adopted so far. Also, this approach is ideal for application in the analysis of a patient with mitochondrial abnormalities.

Overall, the methods reported in this manuscript is important for the scientific community studying the mitochondrial function as it is more robust and will allow the investigators to measure mitochondrial function in live cell populations over time, eliminating the need for timed experiments with multiple samples at different time points; thus expediting the research and facilitating explorations of new scientific directions. Moreover, the manuscript is presented in a well-organized fashion, and the methods are well described systematically.

However, the current approach suffers certain limitations as the authors have validated their protocol on only a particular type of cell line and thus requires extended investigation on another type of cell line as mitochondrial dysfunction is linked to various organs, including the heart, brain, kidney, etc. The following comments need to be addressed to improve the current manuscript.

Comments:

Major: None

Minor

1.The author should clarify whether this protocol is applicable to analyze the data generated from other tools such as the Oroboros high-resolution oxygraphy or other similar devices.

Answer: OCRbayes was designed for OCR measurements generated by Seahorse XF analyzer. At current stage, OCRbayes cannot be directly used to analyze OCR data derived from other technologies such as Oroboros. It would be interesting to follow a similar statistical approach for Oroboros data, but that is beyond the scope of this manuscript.

2a.In their experiments, the authors considered the cell number variations as one of the factors responsible for the in-between well variations in the data within a cell line in a plate. Does minimize this variation by regulating the cell number variations by plating the cells with a multichannel pipette in high accuracy reduces these variations? Considering the following steps may be helpful (a) to mix the cell well before putting it to the reservoir to prevent settling down. (b) Mixing the cells once in every other column/row while plating. (c) Avoiding the outer walls in the experiments since the growth of the cells in these wells are significantly affected by the condensation or air. (d) Further, the slow-growing cells may increase the variability if keeping them in 96 well plates for more than two days.

Answer: We appreciate the input from the reviewer. In a recent study, we have shown that by accurately quantifying cell numbers the variation in the OCR measurements can be reduced [1], other factors like in-plate position could indeed also contribute to well-to-well variation, which is taken into account in the OCRbayes. Since OCRbayes can estimate the variations from the experimental OCR data, we can also use the OCRbayes to optimize the experimental protocol for Seahorse assays. We now included this in the manuscript in line 408:

“A third advantage of OCRbayes is that it can be used for guiding improvement of experimental protocols for running Seahorse assays, because the hierarchical model can explicitly quantify the changes in technical variations resulted from different protocols.”

2b. Is there any variation in their data between adherent and non-adherent cells?.

Answer: we thank the reviewer for the excellent comment. OCRbayes requires prior distributions for all unknown parameters. For adherent and non-adherent cells, we need to estimate their own prior distributions.

3.The authors should clarify whether their protocol will apply to the in-vivo settings as in the case of the OCR data generated from the zebrafish embryos in real-time from Simon T bond et al. Method Mol Bio 2018.

Answer: OCRbayes can be applied to other in vitro or in vivo systems. For each model system, one needs to provide prior distributions for the required parameters. This can be achieved when more experimental data are generated and become accessible.

4.In this study, the authors have considered data generated from only cell type. The data generated from various sources, as published in Martin P Horan et al. 2012 in the Journal of Gerontology, summarizes multiple data generated from different sources by using the Seahorse XP analyzer. The author should consider discussed data if accessible.

Answer: We completely agree with the reviewer on this point. However, the reality is that few publications provided the raw Seahorse data as well as cell number data, required for analysis by OCRbayes. Currently, we are working on setting up a platform for sharing Seahorse raw data. Nevertheless, we have changed the text of our limitation section as follow:

“One limitation of this study is that OCRbayes is extensively tested with experimental data derived from human fibroblasts cell lines. This is because our model development was restricted to the limited publicly available Seahorse OCR data sets. To generalize our statistical method to other cell lines than human skin fibroblasts, the prior distributions of those cell lines should be estimated from previously collected Seahorse OCR data. This is because each cell line or cell system has its own growth conditions and metabolic characteristics, making the prior distributions based on the human fibroblasts data not suitable for other cell lines. Furthermore, the prior distributions for plate-to-plate or well-to-well variation are likely dependent on factors such as laboratory and machine. Although many Seahorse studies have been published, few of them provided the raw OCR measurements together with the cell number quantification information. Generalization of our method would benefit from the open availability of raw Seahorse OCR data. This would also facilitate making OCRbayes applicable for other cell lines than human skin fibroblasts.”

5.Line 47 and 79 are repeated in the introduction/method section and can be rephrased.

Answer: We thank the reviewer for this comment. We have rephrased the line 79 as:

“During a measurement cycle, the Seahorse XF analyzer uses fluorescent oxygen sensors to track OCR.”

Reference

1. Janssen JJE, Lagerwaard B, Bunschoten A, Savelkoul HFJ, Neerven RJJ van, Keijer J, et al. Novel standardized method for extracellular flux analysis of oxidative and glycolytic metabolism in peripheral blood mononuclear cells. Sci Rep. 2021;11: 1662. doi:10.1038/s41598-021-81217-4

Attachment

Submitted filename: rebuttal.docx

Decision Letter 1

Ravirajsinh Jadeja

16 Jun 2021

OCRbayes: A Bayesian hierarchical modeling framework for Seahorse extracellular flux oxygen consumption rate data analysis

PONE-D-21-10491R1

Dear Dr. Zhang,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Ravirajsinh Jadeja, Ph.D

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Ravirajsinh Jadeja

2 Jul 2021

PONE-D-21-10491R1

OCRbayes: A Bayesian hierarchical modeling framework for Seahorse extracellular flux oxygen consumption rate data analysis

Dear Dr. Zhang:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Ravirajsinh Jadeja

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: rebuttal.docx

    Data Availability Statement

    The datasets analysed during the current study are available in the Github repository (https://github.com/XiangZhangSC/seahorse).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES