Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Oct 5;15(10):e0240228. doi: 10.1371/journal.pone.0240228

Comparative test-retest variability of outcome parameters derived from brain [18F]FDG PET studies in non-human primates

Sébastien Goutal 1, Nicolas Tournier 2,#, Martine Guillermier 1,#, Nadja Van Camp 1, Olivier Barret 1, Mylène Gaudin 1, Michel Bottlaender 2,3, Philippe Hantraye 1, Sonia Lavisse 1,*
Editor: Matteo Bauckneht4
PMCID: PMC7535063  PMID: 33017429

Abstract

Introduction

Knowledge of the repeatability of quantitative parameters derived from [18F]FDG PET images is essential to define the group size and allow correct interpretation. Here we tested repeatability and accuracy of different [18F]FDG absolute and relative quantification parameters in a standardized preclinical setup in nonhuman primates (NHP).

Material and methods

Repeated brain [18F]FDG scans were performed in 6 healthy NHP under controlled experimental factors likely to account for variability. Regional cerebral metabolic rate of glucose (CMRglu) was calculated using a Patlak plot with blood input function Semi-quantitative approaches measuring standard uptake values (SUV, SUV×glycemia and SUVR (SUV Ratio) using the pons or cerebellum as a reference region) were considered. Test-retest variability of all quantification parameters were compared in different brain regions in terms of absolute variability and intra-and-inter-subject variabilities. In an independent [18F]FDG PET experiment, robustness of these parameters was evaluated in 4 naive NHP.

Results

Experimental conditions (injected dose, body weight, animal temperature) were the same at both imaging sessions (p >0.4). No significant difference in the [18F]FDG quantification parameters was found between test and retest sessions. Absolute variability of CMRglu, SUV, SUV×glycemia and normalized SUV ranged from 25 to 43%, 16 to 21%, 23 to 28%, and 7 to 14%, respectively. Intra-subject variability largely explained the absolute variability of all quantitative parameters. They were all significantly correlated to each other and they were all robust. Arterial and venous glycemia were highly correlated (r = 0.9691; p<0.0001).

Conclusion

[18F]FDG test-retest studies in NHP protocols need to be conducted under well-standardized experimental conditions to assess and select the most reliable and reproducible quantification approach. Furthermore, the choice of the quantification parameter has to account for the transversal or follow-up study design. If pons and cerebellum regions are not affected, non-invasive SUVR is the most favorable approach for both designs.

Introduction

Positron emission tomography (PET) imaging using 2-deoxy-2-[18F]fluoro-D-glucose (FDG) remains so far the most accurate in vivo method for the investigation of regional glucose metabolism in the living brain. In the brain, glucose metabolism is tightly linked to neuronal and synaptic functions. [18F]FDG PET therefore provides a convenient tool to non-invasively monitor changes in neuronal activity in pathophysiological states. [18F]FDG PET is a translational imaging method, which can be performed either in humans or in animals going from rodents to large animal species such as nonhuman primates (NHP) [1]. Though invasive in rodents, quantitative [18F]FDG PET is only minimally invasive in NHPs, which facilitates its use as a close-human animal model. The high-resolution of preclinical PET scanners in particular allows determination of the level of glucose metabolism in discrete brain regions. Over the last four decades, [18F]FDG brain PET has mostly been used in the context of neurodegenerative diseases and brain tumor research [2]. As non-invasive technique, [18F]FDG PET has also been used to monitor the regional effects of potential CNS drugs and/or pharmacological challenges with repeated scans [36]. More recently, [18F]FDG brain PET was also applied to unravel brain networks in healthy and diseased brains [7]. Many quantification approaches exist to determine brain glucose metabolism, hence questioning the accuracy of the different [18F]FDG PET quantification methods.

In the nuclear medicine practice, [18F]FDG PET images are often analyzed qualitatively, using visual comparison of static images to identify regions with abnormal glucose uptake compared with normal surrounding brain tissue. PET data can also be interpreted from the time-activity curves (TAC) in selected brain regions which are usually reported as percentage of injected dose (%ID/mL) or standardized uptake value (SUV) versus time. SUV (g/mL) is defined as the tissue concentration (MBq/mL) divided by injected activity per body weight or lean body mass (MBq/g) [8] and represents a normalization approach reflecting the degree of metabolic activity in healthy or pathological tissues.

Peripheral glucose metabolism drastically impacts the plasma concentration of FDG and its brain kinetics, hence influencing the estimation of brain glucose metabolism. Many authors have discussed the multiple factors that affect the SUV index, such as kinetics of tracer uptake in the brain [9]. Methods exist to approach absolute quantification of brain glucose utilization or, at least, reduce the effect of unintended and non-specific variability of brain [18F]FDG PET data between conditions. Estimation of the cerebral metabolic rate of glucose (CMRglu) can be achieved using full kinetic modelling of [18F]FDG PET data using the irreversible two-compartment model or the graphical method of Patlak [10]. The outcome parameter integrates glycemia and [18F]FDG plasma kinetics (i.e the arterial input function), and is technically demanding as it requires arterial blood sampling at multiple time-points. This is invasive, uncomfortable and can even be sometimes considered unethical for the patient. Further simplification or normalization methods for [18F]FDG PET have therefore been proposed to overcome this limitation and avoid arterial catheterization in patients [1113]. However, many factors of variability still exist and full kinetic modelling remains the gold-standard method for brain [18F]FDG PET quantification, notably in preclinical settings.

Knowledge of the repeatability of estimated outcome parameters derived from [18F]FDG PET is particularly relevant for longitudinal follow-up studies assessing brain disease progression and/or therapeutic efficacy. The determination of group sizes is essential to [18F]FDG PET studies design for correct interpretation of changes measured during follow-up. Also, it is essential to control for experimental factors so as to minimize additional sources of variability on data measurement, analysis or interpretation and their respective impact on overall test-retest variability during repeated brain [18F]FDG PET experiments.

Standardization of [18F]FDG PET imaging has previously been discussed in the clinical context [14]. Preclinical test-retest studies of cerebral [18F]FDG PET (with arterial blood sampling) have been performed by various groups but in rodents only [15, 16]. To the best of our knowledge, no preclinical study in NHP has yet reported the evaluation of the repeatability and accuracy of the different [18F]FDG PET quantification approaches in a standardized preclinical setup. In the present study, we performed repeated brain [18F]FDG PET acquisition in six healthy NHP using a standardized procedure to control for experimental factors. First, the test-retest variability of PET data metabolism outcome parameters was assessed, and compared for accurate interpretation of brain [18F]FDG images. Finally, we evaluated the robustness of the outcome parameters by considering their respective confidence interval on an independent group of naive NHP.

Material and methods

Animals and housing

All primate experiments performed in this study were approved by the local ethics committee Comité d’Ethique en Experimentation Animale, CETEA n°44 (authorization #12_074) and the Ministry of Higher Education, Research and Innovation (MESRI) ((APAFIS#389)2015032716213569.02.), were conducted according to European regulations (EU Directive 2010/63) and in compliance with Standards for Humane Care and Use of Laboratory Animals of the Office of Laboratory Animal Welfare (OLAW–n°#A5826-01) in a facility authorized by local authorities (authorization n°#B92-032-02).

Animals were originally obtained from approved breeding facilities and vendors (Noveprim; Ferney S.E.; Mahebourg Mauritius Island). The main study used six adult male cynomolgus macaques (Macaca fascicularis) of about 4 years of age (4.2 ± 0.8 yrs) and 5.8 ± 1.1 kg average weight. The robustness study included four adult cynomolgus male primates of about 4 years of age (4.2 ± 0.7 yrs) and 4.5 ± 0.9 kg average weight. Animals were housed in pairs or social groups depending on compatibility in a caging system with multiple compartments that were cleaned at least once every two weeks while sawdust was changed twice a week.

Animals were housed under a 12-hour light-dark cycle, temperature was maintained at 22±1°C and humidity at 50% in the animal quarters. Animals were fed with a commercial monkey chow, supplemented daily with fruits and vegetables changed on a daily basis as part of the enrichment program and drinking water was changed daily and was available ad libitum. Throughout the study, animals were checked daily by the technicians to evaluate their physical and clinical condition, food and water consumption. Environmental enrichment activities included grooming contact, perches, opaque and colored cage panels, and access to additional environment enrichment devices such as mirrors, swings, foraging devices, and manipulation and feeding toys. All imaging procedures were performed under ketamine/xylazine anesthesic induction (10 and 1mg/kg, respectively, intramuscular injection) and animals were then maintained anesthetized using an intravenous (i.v.) infusion of propofol (1mL/kg/h) to minimize pain. Animals were not euthanized at the end of these experiments. None of the animals became severely ill during the course of the study or none required euthanasia prior to their experimental endpoint.

Magnetic resonance imaging

Magnetic resonance imaging (MRI) acquisition procedure was previously described [1]. Briefly, MRI was performed on each NHP in order to delineate cerebral regions on co-registered PET images. They were placed in the magnet in a sphinx position with the head fixed in a stereotactic MRI-compatible frame (M2E, France) and heated by a hot air flux with their temperature and respiration parameters monitored remotely.

Acquisitions were performed on a horizontal 7T Agilent scanner (Palo Alto, CA, USA) equipped with a gradient coil reaching 100mT/m (300μs rise time). A surface coil (RAPID Biomedical GmbH, Rimpar, Germany) was used for transmission and reception. T2-weighted images were acquired using a high-resolution 2D fast spin-echo sequence (469×469 μm2 in-plane resolution, 1 mm slice thickness, 70 slices), with echo time TE/ Repetition time TR = 20/8000 ms, 5 echoes, effective TE = 52.5 ms and acquisition time Tacq = 43 min.

PET imaging

PET acquisition

[18F]FDG PET imaging was performed in all NHPs after overnight fasting with free access to water. Animals were initially sedated with the ketamine/xylazine mixture and anesthetized under propofol starting with an i.v. bolus followed by infusion in a saphenous vein. The other saphenous vein was catheterized for venous sampling. The femoral artery opposite to the tracer/propofol injection site was cannulated for arterial blood sampling. Primates were wrapped in a heat blanket to maintain body temperature throughout the protocol. Blood pressure, heart rate, respiration rate and oxygen saturation were remotely monitored and registered continuously before and during the acquisition.

Animals were placed in a PET dedicated stereotactic-like animal holder to immobilize the head by ear bars. PET imaging was performed on a Concorde Focus220 microPET scanner providing high sensitivity and high spatial resolution (FHWM = 2 mm) (Siemens, TN, USA). A 17-min-transmission scan (using a 57Co point source) was performed prior to PET acquisition to allow for attenuation correction, followed by a 60 min dynamic emission scan acquired in 2D mode. Data acquisition started simultaneously with the i.v. bolus injection of [18]FDG (into the same vein as for propofol injection) using an automated injection pump at a rate of 4 mL/min during 30 seconds. PET images were corrected for radioactive decay, scatter, attenuation and detector inhomogeneity, were rebinned in 27 frames (4×15, 4×30, 2×60, 5×120, 3×300, 9×600 sec.) and reconstructed as previously described (OSEM-2D) [17].

To estimate the test-retest variability, a second scan was performed under exact same conditions after an interval of 27 ± 6 days. The time interval was selected to reduce stress induced by close, repeated food deprivation, anesthesia and animal handling. Overall average of [18F]FDG injected dose (ID) was of 169.2 ± 9.0 MBq.

Arterial blood samples were manually drawn from the femoral artery just before and during each PET acquisition to establish the kinetics in arterial plasma of [18F]FDG (arterial input function, AIF) and glucose concentration. After [18F]FDG injection, 2 ml of arterial blood was sampled every 15 s in the first 2 min and then at 2.5, 3, 5, 10, 20, 40 and 60 min after the injection. Blood samples were centrifuged and 0.2 mL plasma samples were counted in a gamma counter (Wizard2, PerkinElmer) that was cross-calibrated with the PET scanner. Area under the curve of AIF was calculated from 0 to 60 minutes post injection.

Arterial glycemia (Gly) and venous blood samples were withdrawn simultaneously just before [18F]FDG injection. Both measurements were compared at test and retest sessions. All glycemia samples were measured in duplicate with a GM9 glucose analyser (Analox, Stourbridge, UK), which was calibrated at 5 mmol/L using standard solutions. The bias and repeatability of the GM9 analyzer was determined from 10 measures of a glucose standard.

The four additional and independent animals were scanned once under the same conditions so as to address the robustness of the quantification methods.

PET image analysis

Time frames were summed to create an integrated image for automatic co-registration with the T2-weighted MRI using a dedicated PMOD software (PMOD Technologies, version 3.6, Zurich, Switzerland). Anatomical regions of interest (ROI) were automatically delineated on individual MRI images using the CIVM atlas from the Primatologist segmentation pipeline for the macaque brain [18] and were as follows: thalamus, globus pallidus, caudate, putamen, global cortex, midbrain, pons, cerebellum, frontal and occipital cortices. Kinetics of the radioactivity of each ROI was then extracted from the dynamic PET data as a time-activity curves (TACs). For each animal, the same MRI segmentation was applied to extract TACs from test and retest PET images.

The Patlak graphical analysis was applied to the regional TACs in PMOD to quantitatively estimate the cerebral metabolic rate of glucose (CMRglu) under the hypothesis that the tissue includes a compartment in which the tracer is irreversibly entrapped [10, 19]. Linearization started at 30 min and the lumped constant representing the ratio of FDG utilization to actual glucose utilization in the brain was set to 0.34 [20, 21].

In each ROI, mean SUVs (g/mL) were calculated from the plateau of [18F]FDG brain kinetics, between 30 and 60 min post-injection. Additional SUV values were calculated accounting for glycemia (SUVxGly; mmol/L) [22]. Normalization to a “reference” brain region was performed using SUV uptake value in the pons (SUVRpons), or in the cerebellum (SUVRcerebellum) as already reported [23, 24].

In total, five quantification parameters were computed and used for statistical analyses: CMRglu, SUV, SUVxGly, SUVRpons and SUVRcerebellum.

Statistical analyses

Data are presented as mean ± SD unless otherwise indicated. Statistical analysis was performed using the R project free software version 3.3.1 (https://cran.r-project.org/).

The homogeneity of experimental conditions (body weight, temperature and injected dose at test and retest) were analyzed using Wilcoxon signed-rank test. Plasma exposure (AUCplasma) across animals were compared using a one-way analysis of variance (the distribution of residues was normal and homogeneous). Comparison of plasma exposure between test and retest was performed using a paired Wilcoxon rank sum test. The stability of the brain concentration of [18F]FDG was assessed using a two-way ANOVA where macaques and times were defined as explanatory variables (normal and homogeneous distribution of residues), followed by Tukey post hoc testing in TACs of all anatomical regions of interest. Comparison of the brain concentration of [18F]FDG was performed using a paired Wilcoxon rank sum test. Test-retest variability was estimated for each brain region and quantification parameter using the absolute variability across animals, defined as the absolute difference between values from the first and the second scan divided by the mean of both values (“AbsVar”, × 100 when expressed in %). The total variability was evaluated and explained by the contribution of intra-subject and inter-subject variabilities. Intra-subject variability was expressed 1)- through the coefficient of variation (CV%) and 2)- in relative proportion of the total variability. Inter-subject variability (expressed through the CV% and in relative proportion) was deduced from the total variability, as proposed by Mermet [25]. Exceptional negative inter-subject variability refered to random fluctuations only and was then set to 0. Finally, a 95% confidence interval (95% CI; absolute and relative value) was built. In 3 (out of 68) cases, lower limits of the CI were negative and were thus arbitrarily set to 0. All correlations were performed using Spearman’s rank correlation coefficient.

The sample size (n) to estimate the cerebral metabolic rate of glucose consumption with a desired precision level, arbitrary set at 20%, was determined for each quantification parameter:

n=(t×sa)2

with n being the sample size, t the coefficient of confidence (= 1.96 for a 5% alpha risk), a the acceptable imprecision of the mean (precision level of 20% of mean value) and s, the estimated standard deviation of the population.

Results

Homogeneity of experimental conditions

Body weight (5.8 ± 1.4 kg and 6.3 ± 1.3 kg) and injected dose (171.8 ± 9.7 MBq and 166.6 ± 8.7 MBq) were not significantly different between the test and retest scans, respectively (p = 0.4633; p = 0.6905). During PET acquisition, body temperature remained stable for both test- and the retest scan (37.5 ± 0.4 and 37.3 ± 0.5, respectively, p = 0.53).

Arterial input function

Bolus injection of [18F]FDG resulted in a sharp peak at 0.73 ± 0.07 min followed by a slow distribution phase. Plasma exposures (AUCplasma = 94.1 ± 13.7 SUV.min) were not significantly different accross animals (p = 0.07), nor scan sessions (p = 1), with mean variability at test and re-test sessions of 14%CV and 15%CV, respectively.

Glycemia

The standard calibration analysis showed that determination of glucose levels in plasma obtained with GM9 was highly accurate (bias < 1%) and reproducible (CV% < 1%). Glycemia measures from arterial samples were similar between test and retest conditions (4.5 ± 0.7 mmol/L and 5.1 ± 1.5 mmol/L; p = 0.75). Glycemia measures from arterial and venous samples at both sessions were highly correlated (4.6 ± 1.3 mmol/L and 4.7 ± 1.4 mmol/L, respectively. Spearman r = 0.9691; p < 0.0001, slope = 0.972).

[18F]FDG PET kinetics and quantification

Visual inspection of the [18F]FDG brain PET images (in Bq/cc and in SUV) revealed qualitative difference between test and retest scans, which was confirmed in the TACs (S1 Fig). This difference appears to be global rather than regional as it was greatly reduced in SUVr images. TACs showed rapid uptake of the tracer in all regions with a maximal uptake 3 minutes after injection and a plateau starting at 30 minutes post-injection, with no significant difference between radioactivity values at 30, 40, 50 and 60 min, whatever brain regions (p > 0.1) (S1 Fig).

No significant difference of regional [18F]FDG PET quantitative measures was found between the test and the retest sessions (p > 0.1). All quantification approaches resulted in similar regional rank order, measures being the highest in the caudate, putamen and frontal regions and the lowest in the pons. All regional mean SUVs, SUVR, CMRglu values are reported in Table 1.

Table 1. Mean, confidence interval, total, intra- and inter-subject variabilities of outcome parameters between [18F]FDG test and retest scans.

A) CMRglu (μmol/min/100g) B) SUV (g/mL)
Mean ± sd 95% CI Abs var IntraSV InterSV Mean ± sd 95% CI Abs var IntraSV InterSV
(n = 12) (relative value) (%; ) (%; n = 6) (%; n = 6) (n = 12) (relative value) (%;) (%; n = 6) (%; n = 6)
[EV,%] [EV,%] [EV, %] [EV, %]
Thalamus 20.6±10.1 [0.0;42.8] 42.8±28.5 45.8 [86] 18.6 [14] 2.5±0.5 [1.4; 3.5] 18.9±16.4 18.5 [93] 5.2 [7]
Globus pallidum 19.5±9.3 [0.0;40.0] 34.2±27.3 39.5 [67] 27.7 [33] 2.2±0.4 [1.3; 3.1] 20.2±15.3 18.7 [90] 6.3 [10]
Caudate nucleus 24.9±10.8 [1.2;48.6] 41.8±25.7 41.1 [89] 14.3 [11] 2.7±0.5 [1.5; 3.8] 16.4±16.4 17.4 [76] 9.7 [24]
Putamen 26.2±10.9 [2.2;50.3] 42.3±22.0 38.1 [82] 17.8 [18] 2.8±0.5 [1.7; 3.8] 16.1±15.7 16.5 [90] 5.4 [10]
Cortex 19.4±8.2 [1.5;37.5] 39.9±24.1 38.0 [80] 18.9 [20] 2.2±0.3 [1.5; 2.9] 16.3±12.5 14.8 [94] 3.8 [6]
Midbrain 20.4±9.5 [0.0;41.3] 37.1±29.2 43.5 [86] 17.6 [14] 2.3±0.4 [1.4; 3.3] 18.5±16.7 17.9 [100] 0.0
Pons 15.8±6.9 [0.7;30.9] 25.4±29.2 38.1 [75] 22.0 [25] 1.9±0.3 [1.3; 2.6] 19.0±15.4 16.6 [100] 0.0
Cerebellum 21.1±9.2 [0.8;41.5] 33.2±25.6 36.0 [65] 26.1 [35] 2.3±0.4 [1.6; 3.1] 19.4±10.9 15.8 [100] 0.0
Frontal cortex 26.3±11.1 [2.0;50.6] 40.4±26.8 40.1 [90] 13.1 [10] 2.8±0.5 [1.5; 4.0] 16.3±16.6 17.6 [76] 9.9 [24]
Occipital cortex 17.0±9.0 [0.0;36.8] 38.3±29.6 46.2 [75] 26.6 [25] 2.1±0.4 [1.2; 3.0] 20.5±18.9 21.4 [100] 0.0
C) SUVRpons (unitless) D) SUVRCerebellum (unitless)
Mean±sd 95% CI Abs var IntraSV InterSV Mean±sd 95% CI Abs var IntraSV InterSV
(n = 12) (relative value) (%; ) (%; n = 6) (%; n = 6) (n = 12) (relative value) (%;) (%; n = 6) (%; n = 6)
[EV,%] [EV, %] [EV, %] [EV, %]
Thalamus 1.3±0.1 [1.1 ;1.5] 11.4±8.4 10.2 [100] 0.0 1.0±0.1 [0.8 ;1.2] 8.7±4.5 6.8 [96] 1.3 [4]
Globus pallidum 1.1±0.1 [0.9 ;1.3] 9.5±9.9 9.9 [99] 0.9 [0.8] 0.9±0.1 [0.7 ;1.1] 6.8±4.9 5.9 [80] 3.0 [20]
Caudate nucleus 1.4±0.2 [1.0 ;1.8] 13.0±12.8 13.3 [100] 0.0 1.1±0.1 [0.9 ;1.4] 9.8±7.5 8.5 [84] 3.7 [16]
Putamen 1.4±0.1 [1.2 ;1.7] 11.8±11.6 11.8 [100] 0.0 1.2±0.1 [1.0 ;1.4] 8.6±6.8 7.5 [98] 0.9 [2]
Cortex 1.1±0.1 [0.9 ;1.4] 10.3±11.3 10.9 [100] 0.0 0.9±0.1 [0.7 ;1.1] 7.1±5.9 6.3 [100] 0.0
Midbrain 1.2±0.1 [1.0 ;1.4] 9.1±6.7 7.9 [100] 0.0 1.0±0.1 [0.8 ;1.2] 6.4±5.0 5.7 [92] 1.6 [8]
Pons 0.8±0.1 [0.6 ;1.0] 6.8±6.7 6.4 [96] 1.2 [4]
Cerebellum 1.2±0.1 [1.0 ;1.4] 6.8±6.7 6.5 [99] 0.3 [0.1]
Frontal cortex 1.4±0.2 [1.0 ;1.8] 13.9±14.6 15.2 [100] 0.0 1.2±0.1 [1.0 ;1.4] 10.7±8.2 9.5 [78] 5.1 [22]
Occipital cortex 1.1±0.1 [0.9 ;1.3] 10.5±8.8 9.7 [100] 0.0 0.9±0.1 [0.7 ;1.1] 9.8±7.2 8.5 [100] 0.0
E) SUV*Gly (mmol/L)
Mean±sd 95% CI Abs var IntraSV InterSV
(n = 12) (relative value) (%; ) (%; n = 6) (%; n = 6)
[EV, %] [EV, %]
Thalamus 12.2±4.6 [2.1;22.4] 27.8±31.4 39.4 [100] 0.0
Globus pallidum 10.9±4.1 [1.9;19.8] 27.3±30.9 38.8 [100] 0.0
Caudate nucleus 13.3±4.6 [3.2;23.5] 27.0±31.0 36.8 [100] 0.0
Putamen 13.8±4.7 [3.5;24.2] 26.2±30.8 36.2 [100] 0.0
Cortex 10.9±3.5 [3.2;18.6] 24.9±29.1 33.8 [100] 0.0
Midbrain 11.7±4.2 [2.4;21.0] 26.9±30.7 38.2 [100] 0.0
Pons 9.6±3.1 [2.8;16.3] 24.1±29.2 33.8 [100] 0.0
Cerebellum 11.6±3.8 [3.2;20.0] 22.8±27.6 33.0 [100] 0.0
Frontal cortex 13.7±4.6 [3.5;23.8] 27.9±30.3 36.2 [100] 0.0
Occipital cortex 10.5±4.3 [1.0;19.9] 26.7±35.1 43.0 [100] 0.0

[.] refers to explained variance (EV). Estimates and absolute variability expressed as mean ± SD. InterSV = inter-subject variability; IntraSV = intra-subject variability.

CMRglu estimate

The statistical values of the regional CMRglu are reported in Table 1A. Overall, absolute variability was high and the range of CMRglu variability across regions was from 25.4 to 43%. Lowest variability (25.4%) was found in the pons region. No significant CMRglu difference was reported across regions and intra-subject variability explained between 65 and 90% of the total variability (see Fig 1).

Fig 1. Mean intra- (n = 6) and inter-variabilities (n = 6) (expressed in CV%) of all [18F]FDG outcome parameters in the putaminal region.

Fig 1

SUVs and SUVR

All statistical measures of SUV and SUVR parameters are reported in Table 1B–1E.

SUV. Absolute variability of SUV data ranged from 16 to 21% with no regional distinction (global mean = 18.1 ± 1.7 g/mL) and was lower compared to CMRglu absolute variability. The intra-subject CV% (global mean = 17.5 ± 1.8%) was 2.6-fold higher than the inter-subject coefficient of variation in all anatomical ROIs (global mean = 6.7 ± 2.5%) and 2-fold lower than for CMRglu intra-subject variability). In other terms, intra-subject variability of SUV explained between 76 and 100% of the total variability while it explained 65 to 90% of the total variability for CMRglu (Fig 1).

SUVR. Normalization of SUV by a reference region decreased the absolute variability down to 7 to 14% for SUVRpons and SUVRcerebellum. Overall variability of SUVR was essentially explained by intra-subject variability ranging from 78 to 100% for SUVRcerebellum and almost 100% for SUVRpons across regions, whereas much lower inter-subject variability was estimated (lower than 5% for SUVRpons and SUVRcerebellum) (Fig 1).

SUV×Gly. Correction for arterial glycemia did not improve the absolute variability as compared to SUV: absolute variability ranged between 23 to 28% across brain regions. In all brain regions, intra-subject variability was higher while inter-subject variability was lower in comparison to SUV. Finally, the absolute variations of SUV×Gly (~ 26%) was lower than those obtained for CMRglu (~ 37%) but inter-subject variability was negligible in comparison to CMRGlu.

Correlations between the quantification parameters

All quantification parameters were significantly correlated to each other: CMRglu correlated similarly to each of the other [18F]FDG PET measures but correlation was weaker with SUV (r = 0.35). All correlations and statistical significances are provided in Table 2.

Table 2. Spearman's rank correlations between outcome parameters from the [18F]FDG test-retest study.
CMRglu SUV SUVRpons SUVRCerebellum
SUV 0.35**
SUVRpons 0.66**** 0.67****
SUVRCerebellum 0.53**** 0.69**** 0.91****
SUV*Gly 0.66**** 0.67**** 0.68**** 0.60***

****p-value < 0.0001

*** p-value < 0.001 and

**p-value < 0.01 (n = 6 at test and at retest sessions).

Sample size

Precise estimation (mean±20%) of outcome parameters derived from brain [18F]FDG PET requires sample size of n = 22, n = 5 and n = 6 NHPs for CMRglu, SUV and SUV×Gly, respectively. On the other hand, SUVR had a low variability which allowed to reach this precision with only 2 NHP.

Robustness of the outcome parameters

The 95% confidence interval was determined on the test-retest dataset for each outcome parameter (Table 1). All SUV×Gly and CMRglu values of the robustness group lied within the confidence interval. In addition, 95% of the regional values of SUV, SUVRpons and SUVRcerebellum were found within the confidence interval (Fig 2).

Fig 2. Robustness study.

Fig 2

Grey boxes represent the 95%IC of each parameter. Four naive animals were considered for this study and regional estimates are represented for each animal and region.

Discussion

[18F]FDG has been widely used in clinical and preclinical studies for few decades but this study provides for the first time test-retest variabilities of different quantification parameters in NHP under standardized conditions. Many experimental settings can affect the estimation of glucose consumption in the brain using [18F]FDG PET [22, 26, 27]. In this test-retest study, we aimed at controlling most of these settings to study their impact on various outcome parameters derived from [18F]FDG PET images in NHP. [18F]FDG PET scans were performed on six healthy NHP under the following conditions: experiments were performed in male macaques only [16] with constant and monitored temperature throughout the experiments; the same anesthesia protocol and experimental procedures were applied [28]; all animals were fasted for the same time period before each imaging session [22]; the time interval between PET sessions was constant; the same reconstruction method was applied for all scans [9] and finally, automatic tracer injection and automatic image segmentation were performed to limit the investigator-induced variability. Under these highly standardized conditions, quantification of outcome parameters were expected to be largely reproducible and indeed no significant differences were observed in any of these parameters between test- and retest PET sessions.

Various groups have reported test-retest [18F]FDG variability in preclinical studies but in rodents only. Sijbesma et al. [16] have focused on the rat gender difference in cerebral metabolism and reported non negligible absolute variations in different brain regions for CMRglu (mean of 13.4 and 19.6%), SUV (1.61 and 9.12%) and SUV×Gly (9.3 andd 11.9%) in anesthesized male and female rats, respectively. Huang et al. estimated the CMRglu parameter and associated variability in two anesthesized rats with 6 repeated [18F]FDG PET scans and arterial input function measurements [15]. They concluded that reapeated PET procedure could be feasible in small anesthetized animals when implementing the surgical methods they developed.

In NHPs, arterial catheterization is relatively feasible and repeated blood sampling is well tolerated after a recovery period [29]. Therefore, NHPs are excellent models for full kinetic modeling of radioligands, allowing the comparison of alternative simplified measures with the gold-standard CMRglu calculated by the Patlak method. The CMRglu equation involves the influx rate of [18F]FDG (Ki), the blood glucose concentration and the lumped constant, accounting for the differences in transport and phosphorylation between glucose and deoxyglucose [30]. Similar glucose levels in male cynomolgus macaques (of the same age as in our study) have been previously reported in few studies: Wu et al. [31] reported a glycemia range of 2 to 4 mmol/L and Guo et al. found glycemia values around 4.7 mmol/L [32]. Brain CMRglu estimates in the present study were also found in the expected and previously reported range [33] but encompassed non negligible variability. Inter and intra-subject variabilities could be partly due to anesthesia injected to the animals. Indeed, the effects of different anesthetic drugs on glucose metabolism in the human brain have been recently reported [34] showing that propofol decreased glucose metabolism compared to saline placebo. In preclinics, studies using isoflurane anesthesia reported a higher glucose level post-scan compared to pre-scan in rats, suggesting a different impact of isoflurane on glycemia [22, 35]. Furthermore, some studies have reported that ketamine combined with xylazine induces transient acute hyperglycemia lasting up to 25 minutes in rats [36, 37]. This was confirmed in the rhesus monkey where glucose levels were increased up to 60 minutes after ketamine/dexmedetomidine anesthesia as compared to ketamine only that did not elevate glucose level [38].

Ideally, metabolic brain studies should be performed on unanesthetized awake animals. But routine imaging of a conscious NHP is out of reach and the vast majority of PET studies in NHP are conducted in anesthesized animals. Therefore a standardized sedation protocol requires to maintain stable glycemia and hence reliable measurement of glucose metabolism in primate research. The test-retest procedure would have to be reiterated for each anesthesia condition to determine the intrinsic variability of each quantification parameter and select the most reliable and robust one(s) under experimental standardized conditions and for a given experimental design.

The choice of the processing and quantification methods has also to be considered to explain variability. In our study, the normalized SUV (SUVR) and especially the SUVRcerebellum showed low test-retest variability, indicating a global rather than regional difference between PET sessions. The residual variability might be related in part to our processing pipeline, particularly to the potential PET/MRI misalignment remaining between the two PET sessions of each animal. However, we would expect this local variability to be small compared to other factors affecting more globally the test/retest variabililty of SUV and CMRGlu parameters.

Efforts to decrease the intra- and inter-subject variability in [18F]FDG brain PET imaging may help gaining statistical power to detect therapeutic effects or disease progression patterns over time within the same subject panel or between groups with a limited number of subjects. In our experimental setup, we calculated that the sample size required to accurately determine brain glucose consumption was very different depending on the quantification method applied. Normalization of SUV by a reference region (such as pons or cerebellum) reduced the sample size down to 2 animals because of their much lower inter- and intra-individual variabilities. We noticed here that accounting for glycemia in SUV (SUV×Gly) did not reduce variability, in agreement with previous report [39].

We found that [18F]FDG PET outcome measures were significantly inter-correlated across regions and animals. Within-animal correlations (as its own control) were higher than global correlations, favouring individual follow-up experimental design. Nevertheless, significant global correlations (across all animals) between SUV/SUVR and CMRglu parameters underline the possible use of non-invasive procedures instead of arterial sampling to estimate change in brain glucose metabolism over time or treatments. Furthermore, we compared glycemia measurements from arterial and venous blood to avoid arterial collection in animals and found that both measurements were highly correlated. This means that glycemia may be measured in NHP either from venous or arterial blood as long as samples are collected similarly throughout the entire study.

Outcome quantification parameters of [18F]FDG images have to be selected upon the design of the imaging protocol. For cross-sectional studies, it is preferable to use a quantification parameter with a low inter-subject variability and for that, SUV, SUVRpons and SUVRcerebellum are excellent candidates. CMRglu showed a reasonable inter-subject variability as well (~20%). On the other hand, SUVRpons and SUVRcerebellum can only be considered in patients that do not suffer from a pathology affecting the cerebellum or the pons region. For longitudinal studies, a low intra-subject variability is needed to accurately follow-up each individual. SUVRpons and SUVRcerebellum fulfill also these criteria, but alternatively SUV can be considered if the cerebellum or the pons are involved in the patient’s pathology.

Conclusion

Altogether, this study suggests that preclinical [18F]FDG PET imaging requires highly controlled and standardized experimental conditions to minimize variability and reduce the size of animal cohorts. Definition of standardized protocols enables consideration of preclinical multicentric studies to further reduce animal cohorts in each research site. Appropriate choice of anesthetic drugs and outcome parameters is crucial and should be selected with respect to the study design with the aim to reduce variability and improve the sensitivity and accuracy of [18F]FDG PET images to detect changes in glucose brain metabolism.

Supporting information

S1 Fig. Representative time activity curves and SUV PET images.

A)- Time activity curves of a representative animal in the pons and thalamus regions; B)- SUV PET images of the same animal at test and retest sessions.

(TIFF)

S1 File

(XLSX)

Acknowledgments

Authors are thankful to Leopold Eymin and Sophie Lecourtois for animal handling and PET acquisition. They also thank Romina Aron-Badin for animal care support and MRI acquisitions.

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

This work was partially funded by ANR-11-INBS-0011 - NeurATRIS: A Translational Research Infrastructure for Biotherapies in Neurosciences. There was no additional external funding received for this study.

References

  • 1.Lavisse S, Williams S, Lecourtois S, van Camp N, Guillermier M, et al. (2019) Longitudinal characterization of cognitive and motor deficits in an excitotoxic lesion model of striatal dysfunction in non-human primates. Neurobiol Dis 130: 104484 10.1016/j.nbd.2019.104484 [DOI] [PubMed] [Google Scholar]
  • 2.Varrone A, Asenbaum S, Vander Borght T, Booij J, Nobili F, et al. (2009) EANM procedure guidelines for PET brain imaging using [18F]FDG, version 2. Eur J Nucl Med Mol Imaging 36: 2103–2110. 10.1007/s00259-009-1264-0 [DOI] [PubMed] [Google Scholar]
  • 3.Cleeren E, Casteels C, Goffin K, Koole M, Van Laere K, et al. (2018) Positron emission tomography imaging of cerebral glucose metabolism and type 1 cannabinoid receptor availability during temporal lobe epileptogenesis in the amygdala kindling model in rhesus monkeys. Epilepsia 59: 959–970. 10.1111/epi.14059 [DOI] [PubMed] [Google Scholar]
  • 4.Peng S, Ma Y, Flores J, Cornfeldt M, Mitrovic B, et al. (2016) Modulation of Abnormal Metabolic Brain Networks by Experimental Therapies in a Nonhuman Primate Model of Parkinson Disease: An Application to Human Retinal Pigment Epithelial Cell Implantation. J Nucl Med 57: 1591–1598. 10.2967/jnumed.115.161513 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Tournier N, Saba W, Goutal S, Gervais P, Valette H, et al. (2015) Influence of P-Glycoprotein Inhibition or Deficiency at the Blood-Brain Barrier on (18)F-2-Fluoro-2-Deoxy-D-glucose ((18)F-FDG) Brain Kinetics. AAPS J 17: 652–659. 10.1208/s12248-015-9739-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhang X, Newport GD, Callicott R, Liu S, Thompson J, et al. (2016) MicroPET/CT assessment of FDG uptake in brain after long-term methylphenidate treatment in nonhuman primates. Neurotoxicol Teratol 56: 68–74. 10.1016/j.ntt.2016.06.005 [DOI] [PubMed] [Google Scholar]
  • 7.Sala A, Perani D (2019) Brain Molecular Connectivity in Neurodegenerative Diseases: Recent Advances and New Perspectives Using Positron Emission Tomography. Front Neurosci 13: 617 10.3389/fnins.2019.00617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sarikaya I, Albatineh A, Sarikaya A (2019) Re-visiting SUV-weight and SUV-lean body mass in FDG PET studies. J Nucl Med Technol. [DOI] [PubMed] [Google Scholar]
  • 9.Buvat I (2007) Understanding the limitations of SUV. Médecine Nucléaire 31: 165–172. [Google Scholar]
  • 10.Patlak CS, Blasberg RG, Fenstermacher JD (1983) Graphical evaluation of blood-to-brain transfer constants from multiple-time uptake data. J Cereb Blood Flow Metab 3: 1–7. 10.1038/jcbfm.1983.1 [DOI] [PubMed] [Google Scholar]
  • 11.Sundar LK, Muzik O, Rischka L, Hahn A, Rausch I, et al. (2019) Towards quantitative [18F]FDG-PET/MRI of the brain: Automated MR-driven calculation of an image-derived input function for the non-invasive determination of cerebral glucose metabolic rates. J Cereb Blood Flow Metab 39: 1516–1530. 10.1177/0271678X18776820 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Vanzi E, Berti V, Polito C, Freddi I, Comis G, et al. (2016) Cerebral metabolic rate of glucose quantification with the aortic image-derived input function and Patlak method: numerical and patient data evaluation. Nucl Med Commun 37: 849–859. 10.1097/MNM.0000000000000512 [DOI] [PubMed] [Google Scholar]
  • 13.Jansen MH, Kloet RW, van Vuurden DG, Veldhuijzen van Zanten SE, Witte BI, et al. (2014) 18 F-FDG PET standard uptake values of the normal pons in children: establishing a reference value for diffuse intrinsic pontine glioma. EJNMMI Res 4: 8 10.1186/2191-219X-4-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Antunovic L, Rodari M, Rossi P, Chiti A (2014) Standardization and quantification in PET/CT imaging: tracers beyond FDG. PET Clin 9: 259–266. 10.1016/j.cpet.2014.03.002 [DOI] [PubMed] [Google Scholar]
  • 15.Huang CC, Wu CH, Huang YY, Tzen KY, Chen SF, et al. (2017) Performing Repeated Quantitative Small-Animal PET with an Arterial Input Function Is Routinely Feasible in Rats. J Nucl Med 58: 611–616. 10.2967/jnumed.116.182402 [DOI] [PubMed] [Google Scholar]
  • 16.Sijbesma JWA, van Waarde A, Vallez Garcia D, Boersma HH, Slart R, et al. (2019) Test-Retest Stability of Cerebral 2-Deoxy-2-[(18)F]Fluoro-D-Glucose ([(18)F]FDG) Positron Emission Tomography (PET) in Male and Female Rats. Mol Imaging Biol 21: 240–248. 10.1007/s11307-018-1245-4 [DOI] [PubMed] [Google Scholar]
  • 17.Lavisse S, Inoue K, Jan C, Peyronneau MA, Petit F, et al. (2015) [18F]DPA-714 PET imaging of translocator protein TSPO (18 kDa) in the normal and excitotoxically-lesioned nonhuman primate brain. Eur J Nucl Med Mol Imaging 42: 478–494. 10.1007/s00259-014-2962-9 [DOI] [PubMed] [Google Scholar]
  • 18.Balbastre Y, Riviere D, Souedet N, Fischer C, Herard AS, et al. (2017) Primatologist: A modular segmentation pipeline for macaque brain morphometry. Neuroimage 162: 306–321. 10.1016/j.neuroimage.2017.09.007 [DOI] [PubMed] [Google Scholar]
  • 19.Patlak CS, Blasberg RG (1985) Graphical evaluation of blood-to-brain transfer constants from multiple-time uptake data. Generalizations. J Cereb Blood Flow Metab 5: 584–590. 10.1038/jcbfm.1985.87 [DOI] [PubMed] [Google Scholar]
  • 20.Brooks RA, Hatazawa J, Di Chiro G, Larson SM, Fishbein DS (1987) Human cerebral glucose metabolism determined by positron emission tomography: a revisit. J Cereb Blood Flow Metab 7: 427–432. 10.1038/jcbfm.1987.86 [DOI] [PubMed] [Google Scholar]
  • 21.Kennedy C, Sakurada O, Shinohara M, Jehle J, Sokoloff L (1978) Local cerebral glucose utilization in the normal conscious macaque monkey. Ann Neurol 4: 293–301. 10.1002/ana.410040402 [DOI] [PubMed] [Google Scholar]
  • 22.Deleye S, Verhaeghe J, wyffels L, Dedeurwaerdere S, Stroobants S, et al. (2014) Towards a reproducible protocol for repetitive and semi-quantitative rat brain imaging with (18) F-FDG: exemplified in a memantine pharmacological challenge. Neuroimage 96: 276–287. 10.1016/j.neuroimage.2014.04.004 [DOI] [PubMed] [Google Scholar]
  • 23.Buchert R, Wilke F, Chakrabarti B, Martin B, Brenner W, et al. (2005) Adjusted scaling of FDG positron emission tomography images for statistical evaluation in patients with suspected Alzheimer's disease. J Neuroimaging 15: 348–355. 10.1177/1051228405280169 [DOI] [PubMed] [Google Scholar]
  • 24.Son YD, Cho ZH, Choi EJ, Kim JH, Kim HK, et al. (2014) Individually differentiated serotonergic raphe nuclei measured with brain PET/MR imaging. Radiology 272: 541–548. 10.1148/radiol.14131547 [DOI] [PubMed] [Google Scholar]
  • 25.Mermet JM (2012) Potential of accuracy profile for method validation in inductively coupled plasma spectrochemistry. Spectrochimica Acta (Part B: Atomic Spectroscopy) 76. [Google Scholar]
  • 26.Boellaard R (2009) Standards for PET image acquisition and quantitative data analysis. J Nucl Med 50 Suppl 1: 11S–20S. [DOI] [PubMed] [Google Scholar]
  • 27.Schiffer WK, Mirrione MM, Biegon A, Alexoff DL, Patel V, et al. (2006) Serial microPET measures of the metabolic reaction to a microdialysis probe implant. J Neurosci Methods 155: 272–284. 10.1016/j.jneumeth.2006.01.027 [DOI] [PubMed] [Google Scholar]
  • 28.Itoh T, Wakahara S, Nakano T, Suzuki K, Kobayashi K, et al. (2005) Effects of anesthesia upon 18F-FDG uptake in rhesus monkey brains. Ann Nucl Med 19: 373–377. 10.1007/BF03027401 [DOI] [PubMed] [Google Scholar]
  • 29.Diehl KH, Hull R, Morton D, Pfister R, Rabemampianina Y, et al. (2001) A good practice guide to the administration of substances and removal of blood, including routes and volumes. J Appl Toxicol 21: 15–23. 10.1002/jat.727 [DOI] [PubMed] [Google Scholar]
  • 30.Wienhard K (2002) Measurement of glucose consumption using [(18)F]fluorodeoxyglucose. Methods 27: 218–225. 10.1016/s1046-2023(02)00077-4 [DOI] [PubMed] [Google Scholar]
  • 31.Wu D, Yue F, Zou C, Chan P, Alex Zhang Y (2012) Analysis of glucose metabolism in cynomolgus monkeys during aging. Biogerontology 13: 147–155. 10.1007/s10522-011-9364-1 [DOI] [PubMed] [Google Scholar]
  • 32.Guo S, Qian W, Du F, Bingdi W, Xiaoli W, et al. (2014) Proteinuria in Cynomolgus macaques (Macaca fascicularis) with Spontaneously Developed Metabolic Disorder and Diabetes: Transcriptome Analysis of Biopsy Kidney. Journal of Diabetes and Metabolism 5. [Google Scholar]
  • 33.Harper AM (1990) Cerebral Blood Flow and Metabolism; Alexander Murray Harper SJ, editor: Manchester Univ Pr; 110 p. [Google Scholar]
  • 34.Laaksonen L, Kallioinen M, Langsjo J, Laitio T, Scheinin A, et al. (2018) Comparative effects of dexmedetomidine, propofol, sevoflurane, and S-ketamine on regional cerebral glucose metabolism in humans: a positron emission tomography study. Br J Anaesth 121: 281–290. 10.1016/j.bja.2018.04.008 [DOI] [PubMed] [Google Scholar]
  • 35.Vodovar D, Duchene A, Wimberley C, Leroy C, Pottier G, et al. (2018) Cortico-Amygdala-Striatal Activation by Modafinil/Flecainide Combination. Int J Neuropsychopharmacol 21: 687–696. 10.1093/ijnp/pyy027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Saha JK, Xia J, Grondin JM, Engle SK, Jakubowski JA (2005) Acute hyperglycemia induced by ketamine/xylazine anesthesia in rats: mechanisms and implications for preclinical models. Exp Biol Med (Maywood) 230: 777–784. [DOI] [PubMed] [Google Scholar]
  • 37.Guarino MP, Santos AI, Mota-Carmo M, Costa PF (2013) Effects of anaesthesia on insulin sensitivity and metabolic parameters in Wistar rats. In Vivo 27: 127–132. [PubMed] [Google Scholar]
  • 38.Vaughan KL, Szarowicz MD, Herbert RL, Mattison JA (2014) Comparison of anesthesia protocols for intravenous glucose tolerance testing in rhesus monkeys. J Med Primatol 43: 162–168. 10.1111/jmp.12104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Prando S, Carneiro CG, Robilotta CC, Sapienza MT (2019) Comparison of different quantification methods for 18F-fluorodeoxyglucose-positron emission tomography studies in rat brains. Clinics (Sao Paulo) 74: e1273. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Matteo Bauckneht

10 Aug 2020

PONE-D-20-17186

Comparative test-retest variability of outcome parameters derived from brain [18F]FDG PET studies in non-human primates

PLOS ONE

Dear Dr. Lavisse,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Sep 24 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Matteo Bauckneht

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for including the following funding statement in the acknowledgements sectionn of your manuscript; "This work was partially funded by ANR-11-INBS-0011 - NeurATRIS: A Translational Research Infrastructure for Biotherapies in Neurosciences."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

"The authors received no specific funding for this work"

b.  Please provide an amended statement that declares *all* the funding or sources of support (whether external or internal to your organization) received during this study, as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now.  Please also include the statement “There was no additional external funding received for this study.” in your updated Funding Statement.

Please include your amended Funding Statement within your cover letter. We will change the online submission form on your behalf.

3.

We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors performed FDG PET brain scans on 6 cynomolgus macaques about one month apart. The parameters include CMRglu, SUVmean, SUVxGlu, SUVRpons and SUVRcerebellum. They also scanned 4 additional animals to see if the values of these parameters fall within the confidence interval defined by the previous 6 animals. The most stable parameter was SUVR with variability between 7-14%, while the variability of CMRglu reached 25-43%. It was concluded SUVR could be used for quantification if pons or crebellum was not affected by the experimental conditions.

This is a well-designed study with clear descriptions of the methodology to allow reproducibility. It provided absolute numbers to compare the variability of various quantification methods across the brain regions. Congratulations to the authors for their deeds and recommendation for publication to share their results.

Reviewer #2: The manuscript presented by Goutal and co-authors explore the repeatability of quantitative parameters from FDG-PET studies in nonhuman primates and also compare them. This is a well-written paper, with a clear and concise text where objectives are well defined, and introduction and discussion supported by a good argumentation. In general, methodology is adequate and experimental methods are described correctly.

However, there are some points that would need to be revised or discussed by the authors in order to improve the quality of the work.

- The authors have carried out a test-retest study controlling all the experimental conditions and have analyzed the reproducibility of the results by the study of intra-subject and inter-subject variability. However, another important measure which can be calculated from test-retest data is the reliability of the measurements by means intraclass correlation coefficient (ICC). Why authors don’t have calculated this coefficient?

- In “PET acquisition” section (page 9, line 130) authors should detail the times at which arterial blood samples were collected to obtain arterial input function.

- For each PET study, authors acquired a 60 min dynamic emission scan (Page 8, line 120). Which temporal sequence did the authors use to reconstruct the studies? Please, explain in “PET acquisition“ section.

- Which statistical test was performed to analyze the homogeneity of experimental conditions (body weight and injected dose)? Please, refer in “Statistical Analyses” section.

- The authors have used a segmentation pipeline to automatically delineate the anatomical regions of interest for each macaque. However, it is usual to spatially normalize brain images to a common stereotaxic space to apply a predetermined template of VOIs. Probably, the method used to quantify the different regions also influences the variability observed in the study. Although the authors discuss the influence of different types of anesthesia on the results of the study (page 19, lines 349-352), it would be also convenient to comment the importance of the quantification method.

- It would be desirable to show a graphical representation of FDG Time Activity Curves of test-retest studies, at least of one of the regions (representative). Likewise, it would be nice to show a representative test-retest PET images of one animal. All this information could be added as supplementary material.

Reviewer #3: Dear authors,

your study is very interesting and carried out very well. Nowadays it is very important to evaluate 18F-FDG PET images, from preclinical with animals to human studies, not just from a qualitative point of view. Some of the common quantitative parameters are still not robust enough to evaluate properly the true effects on the metabolism over time, for example when studying a new drug.

In this article the authors analyze the test-retest variance of many quantitative PET parameters (SUV, CMRglu, SUVxGly, SUV ratio on the pons or on the cerebellum) extracted from brain PET images performed twice on a small group of nonhuman primates (macaques) with a very controlled and reproducible protocol. They validated the results on an independent control group, too.

I think that the article is properly written down with a good depiction of the methods utilized and so, has to be accepted.

Major comment:

- Material and method: you describe in the “animals and housing” paragraph one anesthesia protocol for all procedures, but then it is slightly different for MRI and PET.

- I think that you can add a picture of the PET brain images and a figure of the curve obtained to show the quality of the scan.

- From a qualitative point of view there were any substantial difference between the scans? please add this information

- The mean glycemia level of this group of animals is comparable with other study with the same species? Please add this information

Minor comment:

- Line 63 “by the” is repeated twice

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Oct 5;15(10):e0240228. doi: 10.1371/journal.pone.0240228.r002

Author response to Decision Letter 0


2 Sep 2020

Reviewer #1: We thank Reviewer#1 for this comment.

Reviewer #2:

-The authors have carried out a test-retest study controlling all the experimental conditions and have analyzed the reproducibility of the results by the study of intra-subject and inter-subject variability. However, another important measure which can be calculated from test-retest data is the reliability of the measurements by means intraclass correlation coefficient (ICC). Why authors don’t have calculated this coefficient?

-> We initially calculated the intraclass correlation coefficient (ICC) which is indeed a widely used reliability index in test-retest, intrarater, and interrater reliability studies. However, some of the ICC values in our study were negative. Although the ICC value should theoretically range between 0 and 1 per se (as it represents the proportionality of the total variance), its estimation may be negative. In these cases, it has been suggested to reset the ICC to 0 but such cases are then uninterpretable (Chen, 2017, Human Brain Mapping, doi: 10.1002/hbm.23909). We therefore chose another option and used a different measure of reliability. We report the mean absolute difference between values from the first and the second scan divided by the mean of both values to reflect the reliability of the measurements.

- In “PET acquisition” section (page 9, line 130) authors should detail the times at which arterial blood samples were collected to obtain arterial input function.

-> This information has been added in the text. Please see page 9 line 131

- For each PET study, authors acquired a 60 min dynamic emission scan (Page 8, line 120). Which temporal sequence did the authors use to reconstruct the studies? Please, explain in “PET acquisition“ section.

->We provided the framing in the “PET acquisition” paragraph. Please see page 9 line 123

- Which statistical test was performed to analyze the homogeneity of experimental conditions (body weight and injected dose)? Please, refer in “Statistical Analyses” section.

-> The homogeneity of experimental conditions (body weight, temperature and injected dose at test and retest) were analyzed using Wilcoxon signed-rank test. We have inserted this information in the “Statistical analysis” paragraph. Please see page 10 line 170.

- The authors have used a segmentation pipeline to automatically delineate the anatomical regions of interest for each macaque. However, it is usual to spatially normalize brain images to a common stereotaxic space to apply a predetermined template of VOIs.

Probably, the method used to quantify the different regions also influences the variability observed in the study.

Although the authors discuss the influence of different types of anesthesia on the results of the study (page 19, lines 349-352), it would be also convenient to comment the importance of the quantification method.

-> We have established that a large part of the test/retest variability was due to global rather than local effects, as the normalized SUV (SUVR) and especially the SUVRcerebellum show low test/retest variability.

In addition, for each animal, the same MRI segmentation was used for the analysis of the test and retest PET scan. Therefore, we would not expect much variability to be introduced by the MRI segmentation. On the other hand, registration errors between the two PET sessions might indeed contribute to the test/retest regional variability, although we expect this effect to remain marginal compared to more global experimental factors affecting SUV and CMRGlu parameters.

We inserted a comment in the Discussion page 19 line 347.

- It would be desirable to show a graphical representation of FDG Time Activity Curves of test-retest studies, at least of one of the regions (representative). Likewise, it would be nice to show a representative test-retest PET images of one animal. All this information could be added as supplementary material.

-> We have added a supplementary Figure (Fig. S1) displaying a)- time activity curves of a representative animal in the thalamus and pons regions and b)- corresponding SUV-normalized PET images in test and retest condition

Reviewer #3:

- Material and method: you describe in the “animals and housing” paragraph one anesthesia protocol for all procedures, but then it is slightly different for MRI and PET.

-> We thank the reviewer for this relevant comment and have clarified this concern in the Methods section. Indeed, in both MRI and PET sessions, anesthesia was induced using a mixture of ketamine and xylazine (10 and 1mg/kg, respectively) in each animal. This information is now clearly reported p7 line 86.

- I think that you can add a picture of the PET brain images and a figure of the curve obtained to show the quality of the scan.

-> According to both reviewers suggestion, SUV values are now corrected and we have added a supplementary Figure (Fig. S1) displaying a)- time activity curves of a representative animal in the thalamus and pons regions and b)- corresponding SUV-normalized PET images in test and retest condition

- From a qualitative point of view there were any substantial difference between the scans? please add this information

-> Visual inspection of the raw and SUV PET images between test and retest scans revealed qualitative difference (see Example in Fig. S1). Actually, this difference was global and not regional. There is indeed a global factor difference between scan and rescan SUV images that is greatly decreased in the normalized SUVr images (especially in SUVr-cerebellum images).

We inserted this information in the Result section. Please see p 12 line 213.

- The mean glycemia level of this group of animals is comparable with other study with the same species? Please add this information

-> Similar glucose levels in cynomolgus macaques (about 5 years old) have previously been reported in few studies (most metabolic studies in primates use the SUV parameter): Wu et al. (2012) reported a glycemia range of 2 to 4 mmol/L in male primates and Guo et al. (2014) found glycemia values around 4.7 mmol/L for instance.

We inserted this information in the Discussion section. Please see p 18 lines 324

Minor comment:

- Line 63 “by the” is repeated twice

-> Thank you, the typo has been corrected.

Attachment

Submitted filename: Response-to-Reviewers_final.docx

Decision Letter 1

Matteo Bauckneht

23 Sep 2020

Comparative test-retest variability of outcome parameters derived from brain [18F]FDG PET studies in non-human primates

PONE-D-20-17186R1

Dear Dr. Lavisse,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Matteo Bauckneht

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Matteo Bauckneht

25 Sep 2020

PONE-D-20-17186R1

Comparative test-retest variability of outcome parameters derived from brain [18F]FDG PET studies in non-human primates

Dear Dr. Lavisse:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Matteo Bauckneht

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Representative time activity curves and SUV PET images.

    A)- Time activity curves of a representative animal in the pons and thalamus regions; B)- SUV PET images of the same animal at test and retest sessions.

    (TIFF)

    S1 File

    (XLSX)

    Attachment

    Submitted filename: Response-to-Reviewers_final.docx

    Data Availability Statement

    All relevant data are within the paper and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES