Skip to main content
eLife logoLink to eLife
. 2020 Aug 6;9:e57258. doi: 10.7554/eLife.57258

A machine-vision approach for automated pain measurement at millisecond timescales

Jessica M Jones 1,, William Foster 1,, Colin R Twomey 1,, Justin Burdge 1, Osama M Ahmed 2, Talmo D Pereira 2, Jessica A Wojick 3, Gregory Corder 3, Joshua B Plotkin 1, Ishmail Abdus-Saboor 1,
Editors: Kate M Wassum4, Rebecca Seal5
PMCID: PMC7434442  PMID: 32758355

Abstract

Objective and automatic measurement of pain in mice remains a barrier for discovery in neuroscience. Here, we capture paw kinematics during pain behavior in mice with high-speed videography and automated paw tracking with machine and deep learning approaches. Our statistical software platform, PAWS (Pain Assessment at Withdrawal Speeds), uses a univariate projection of paw position over time to automatically quantify seven behavioral features that are combined into a single, univariate pain score. Automated paw tracking combined with PAWS reveals a behaviorally divergent mouse strain that displays hypersensitivity to mechanical stimuli. To demonstrate the efficacy of PAWS for detecting spinally versus centrally mediated behavioral responses, we chemogenetically activated nociceptive neurons in the amygdala, which further separated the pain-related behavioral features and the resulting pain score. Taken together, this automated pain quantification approach will increase objectivity in collecting rigorous behavioral data, and it is compatible with other neural circuit dissection tools for determining the mouse pain state.

Research organism: Mouse

Introduction

Numerous genetic and environmental factors shape the subjective experience of pain. While humans can articulate the intensity and unpleasantness of their perceived pain in the form of pain scales and questionnaires (Attal et al., 2018; Melzack, 1987), determining pain states in non-verbal animals remains a significant challenge. Rodents are the predominant model organism to study pain and there is an urgent need to develop high-throughput approaches that accurately measure pain. The past 50 years of pain research have relied on the paw withdrawal reflex metric to measure pain-related behaviors in rodents, which has contributed to important discoveries about nociception (Basbaum et al., 2009; Deuis et al., 2017). However, the traditional approach of manually scoring paw lifting suffers from an inability to determine whether paw movement away from a stimulus is motivated by the experience of pain. Improving the resolution, and increasing the dimensionality, of the common paw withdrawal assay has the potential to increase the predictive validity of translational pain therapeutics and to increase the rate at which basic science findings are translated to the clinic.

Animals generate rapid motor responses to somatosensory stimuli at millisecond speeds that cannot be readily detected by eye (Severson et al., 2017; Douglass et al., 2008). Therefore, significantly increasing the recording rate of the motor actions, coupled with the sub-second mapping of behavioral signatures, will sharpen the resolution and confidence for assessing an animal’s internal pain state. For example, researchers recorded optogenetically-induced nociceptive behaviors at 240 frames per second (fps), which facilitated the precise mapping of nocifensive behaviors including paw withdrawal, paw guarding, jumping, and vocalization (Arcourt et al., 2017). Two additional studies recording between 500 and 1000 fps using both natural and optogenetic nociceptive stimuli, demonstrated that nociceptive withdrawal latencies were on the order of 20–130 milliseconds (ms) (Blivis et al., 2017; Browne et al., 2017). More recently, we recorded mouse somatosensory behaviors at 500–1000 fps, coupled with manual behavioral mapping, statistical modeling, and machine learning to create a more objective ‘pain scale’ (Abdus-Saboor et al., 2019). Although these studies provide a framework for using high-speed videography for fine-assessment of pain, a major limitation lies in the relatively low-throughput nature of manual scoring of the video frames, which adds potential human error, and limits the ease of platform adoption in other laboratories.

Recently, computational neuroethology platforms have introduced a suite of machine learning and deep neural networks to automatically track animal body parts during behavior for postural estimation (Datta et al., 2019). Platforms such as Motion Sequencing (MoSeq) use three-dimensional depth imaging, quantitative analyses, and fitting with unsupervised computational models to estimate animal posture within an open arena and can automatically reveal ~60 unique sub-second behavioral signatures (Wiltschko et al., 2015). DeepLabCut and LEAP, train deep neural networks (DNN) with relatively limited training datasets, allowing the computer to accurately track unlabeled body parts such as a mouse paw, ear, or even a single digit through many frames of videography data (Pereira et al., 2019; Mathis et al., 2018). Alternatively, the markerless automated tracking software ProAnalyst tracks moving objects across high frame rate videography data (Tiriac et al., 2012; Libby et al., 2012). This approach does not use deep learning but relies on built-in machine learning algorithms for automated tracking, which provides an easier point of entry for researchers with limited time for software development or computing power.

Here, we present an automated mouse pain scale that combines videography at 2000 fps, automated paw tracking with ProAnalyst and SLEAP, and new software called PAWS (Pain Assessment at Withdrawal Speeds), which automatically scores seven defined behavioral features and produces a resulting univariate pain score. Beginning with seven commonly used genetically inbred mouse strains we revealed stereotyped sub-second paw trajectory patterns, with simple up-down lifts typifying the response to innocuous stimuli and elaborate sinusoidal patterns typifying the responses to noxious stimuli. By projecting paw position onto the time-varying principal axis of paw movement, we identified shaking behavior as simple sequences of peaks and valleys in this univariate time series, and paw guarding as extended periods of stasis devoid of shaking before returning the paw to the ground.

After building an automated pain assessment platform, we confirmed that the seven movement features we automatically measured were sufficient to separate behavioral responses to innocuous touch from noxious pinprick stimuli. Moreover, this could be accomplished using a single univariate measure of pain identified as a linear transformation of the seven behavioral features, by ordinal logistic regression. This same univariate scale further segregated noxious intensity. Cross-validation of our univariate pain scale confirmed that PAWS performs well for decoding the stimulus type and intensity across mouse genetic lines based on the animal’s sub-second behavioral responses. Finally, using our recently described protocol to gain genetic access to basolateral amygdala (BLA) neurons that are responsive to pain (Corder et al., 2019), we chemogenetically activated the BLA pain ensemble to validate our platform’s ability to detect centrally driven pain responses. Thus, we can automatically measure increases in mechanical pain responsiveness to noxious stimuli while manipulating central pain circuits. Taken together, this work reveals that automating paw tracking and subsequent quantification of pain behaviors with high frame rate videography provides a reliable method to objectively determine the mouse pain state.

Results

High-speed videography and automated paw tracking during evoked behaviors

To capture sub-second behavioral ethograms during somatosensory behaviors in freely behaving mice, we recorded mice at 2000 fps, with a particular focus on the stimulated paw. We reasoned that we could develop a pipeline where we first performed behavior experiments, followed by automated paw tracking and automated pain scoring, and lastly statistical modeling to transform multidimensional datasets into a single dimension that separated touch from pain (Figure 1A). To begin, we used 10 mice from seven commonly used inbred lines and on separate days to avoid sensitization to the stimuli, we applied to one hind paw a static innocuous stimulus (cotton swab), a moving innocuous stimulus (dynamic brush), a weak noxious stimulus (light application of a pinprick), and an intense noxious stimulus (heavy application of a pinprick). We used pinprick instead of von Frey hair filaments (VFHs) based on our prior studies showing that pinprick stimuli activate nociceptors and consistently evoke painful responses, whereas VFHs were more variable depending upon the force applied (Abdus-Saboor et al., 2019). Using traditional pain scoring, we noticed that the paw withdrawal frequencies to these four stimuli varied widely across these seven strains. For example, Balb/cJ, DBA1/J, and CBA/J displayed high paw withdrawal rates to all mechanical stimuli, both the innocuous and the two noxious (Figure 1B). Conversely, C57BL/6J and AKR/J displayed high paw withdrawal rates to both the noxious stimuli and the innocuous dynamic brush, but not to a cotton swab (Figure 1B). Finally, A/J and 129S1 mice displayed high rates of paw lifting to both pinprick stimuli and low withdrawal rates to the two touch stimuli (Figure 1B). Since mice will move their paw to both innocuous and noxious stimuli, it is hard to determine if these differences in withdrawal frequencies are driven by genetic differences in susceptibility to pain. What these data likely reveal using the common traditional approach is that this test in isolation may be an inadequate measurement of pain at baseline states.

Figure 1. Automated pain assessment workflow in comparison to traditional unidimensional pain scoring.

Figure 1.

(A) Workflow pipeline in following order consisting of 1) high-speed videography of freely behaving mice, 2) machine/deep learning-based approaches for automatic tracking of the stimulated paw, 3) PAWS software for automatic quantification of defined pain behavioral features, 4) statistical modeling with ordinal logistic regression for separation of touch versus pain on a trial-by-trial basis. (B) Traditional scoring focused on paw withdrawal frequencies to four mechanical stimuli: cs = cotton swab, db = dynamic brush, lp = light pinprick, hp = heavy pinprick. N = 10 mice per strain. Images from Jackson laboratories.

Next, we turned to automated tracking with the seven mouse strains to determine the X, Y coordinates of the paw across approximately 5000 frames – recording at 2000 fps with total behavior time from stimulus application to paw lift and return to the floor being approximately 2–3 s. Using machine learning algorithms embedded within the ProAnalyst motion tracking software, we manually labeled the center of the stimulated paw in each video and the machine automatically tracked the paw throughout each additional frame (Figure 2A–B') (see Figure 2—video 1). While observing the automated paw trajectory patterns, we noticed that stereotyped motor sequences defined the movement away from the four stimuli, regardless of strain background (Figure 2A–B'). For example, the responses to the two innocuous stimuli were typically up-down C-shaped movements (Figure 2A–B'). Conversely, the responses to the two noxious stimuli were typically more elaborate movements, often accompanied by an orbital tightening of the eye, which is a known facial feature of intense pain (Figure 2A–B'Langford et al., 2010). Only the 129S1 strain showed responses to pinprick that were devoid of sinusoidal irregular paw trajectory patterns (Figure 2Y–B'). In the majority of tested strains, we also noticed that the paw trajectory pattern in response to a weakly painful stimulus (light pinprick) often resulted in a figure-eight like sequence that may point toward a shared sensorimotor neural circuit that governs how animals respond to weak and painful stimuli given their body posture and space constraints (Figure 2A–B').

Figure 2. Automated paw tracking with a high-speed recording of behavior.

(a-b′) ProAnalyst machine learning-based paw tracking. (y-z′), SLEAP deep neural network based paw or toe tracking. All still images represent a single frame of ~5000 total frames. Green and blue lines display paw trajectory patterns during the entire behavior. N = 10/mice per stimulus and images shown are representative of each strain.

Figure 2.

Figure 2—video 1. Automatic paw tracking with the delivery of noxious mechanical stimulus.
Download video file (1.7MB, mp4)
Video recording at 2000 fps showing the stimulated paw (received heavy pinprick) successfully tracked throughout the behavioral sequence. Green dots display paw trajectory patterns mapped using ProAnalyst.

Next, we randomly chose one strain (AKR/J) and we used a deep learning-based pose tracking algorithm called SLEAP (in preparation, based on Pereira et al., 2019), to predict mouse toe and mid-paw positions during somatosensory behaviors recorded at high speed (Figure 2C′-F′). Our mouse paw-tracking model was generated from a small training set of video frames collected from the four assays (~9.5% of video frames per assay). In general, the paw trajectory patterns with SLEAP resemble those of ProAnalyst, and the software package we describe below is compatible with automated tracking data from either tool or even others. Taken together, our ability to detect clear qualitative distinctions in paw movements with automated tracking approaches gave us confidence that we could use spatiotemporal data of paw position to automatically extract features that may be useful in determining the mouse pain state. Additionally, these data demonstrate that the nature of the response when an animal withdrawals its paw might be a more reliable indicator of pain state than the number of times the animal lifts its paws to a given stimulus. In other words, paw withdrawal frequencies revealed great variation across the seven strains (Figure 1), whereas paw trajectory patterns to a given stimulus showed quite stereotyped responses among these strains (Figure 2).

Development of software to automatically score pain behavioral features

We developed software to systematically quantify the seven pain-relevant features of the paw position time series based on existing measurements in the literature (Abdus-Saboor et al., 2019). Maximum paw height, lateral velocity, vertical velocity, and the total distance traveled by the paw, were all computed based on a polynomial smoothing (Savitsky-Golay filter of order 3) of the original paw position time series. These four features were computed for two different windows of the paw trajectory time series: the time leading up to the initial peak in paw height (time t* in Figure 3A), and the time following the initial paw peak, which we refer to as pre-peak and post-peak, respectively. The t* designation reflects a biological designation: behavior before t* corresponds to a reflexive rapid withdrawal of the paw, whereas behavior following t* includes lingering paw attending behaviors that require some degree of conscious perception. In the post-peak time window, we also identified periods of ‘shaking’ and ‘guarding’ based on a threshold displacement along the principal axis of the paw movement (Figure 3B and C). We used this delineation to quantify the total duration spent shaking or guarding as well as the total number of paw shakes across shaking periods. We refer to this software package as PAWS).

Figure 3. Quantification of behavioral features for mouse pain state.

Figure 3.

(A) Raw paw positions (x, y) measured in the camera reference frame (anterior/posterior displacement and vertical height, respectively) as a function of time, with the time of the first peak in paw height, t*, marked in red. (B) The principal axis of paw displacement in a moving time window (double-sided arrow), shown as a function of time. (C) Displacement in the principal axis moving reference frame. Local maxima and minima are indicated by blue circles – the first of which occurs at time t*. Periods of paw shaking were identified based on sequences of displacements between maxima and minima above a given threshold relative to the maximum paw height displacement. Guarding periods were defined simply as periods without shaking, in which the paw remained elevated above its final resting point.

Automated scoring of rapid paw dynamics and lingering pain behaviors

With software generated to automatically measure paw movement features related to the mouse pain state, we plotted and analyzed the data across the seven mouse strains with the four mechanical stimuli described above. For plotting the individual behavioral features, we separated the paw distance traveled measurement into pre-peak and post-peak distances. To standardize across varying units and to appreciate the individual deviation from the median, we transformed the raw output measurements into Z-scores (Figure 4). The first readily apparent feature we noticed was that all measurements across these seven strains were interspersed without clear separation among strains, despite the vast differences in paw withdrawal frequencies to these same stimuli (Figure 1B). These data suggest that these seven strains are phenotypically quite similar in their acute responses to mechanical stimuli despite the genetic variation that distinguishes these mouse lines. In regards to pre-paw peak behavioral features, we observed a statistical separation in the stimulated paw’s velocity on both the X and Y axes between touch (CS, DB) and pain stimuli (LP, HP) as the paw withdrew upwards to reach its first highest peak (Figure 4A–C). The mean max height that the paw reached in its first peak was statistically different in the innocuous versus painful stimuli (Figure 4A).

Figure 4. Automated measurement of pain behavioral features across the seven strains with new software PAWS.

Measurements are converted to Z-score to reveal deviation from the mean of individual measures and to standardize units across the seven measures. (A-C) Pain measurements of the stimulated paw from lift to max height. (D-G) Pain measurements of the stimulated paw from max height to paw return. N = 10/mice of each strain given each stimulus once. Statistical significance was computed by comparing CS+DB versus LP+HP with Wilcoxon matched-pairs signed-rank test. ** represents p-value ≤0.01. **** represents p-value ≤0.0001. On the violin plots, black or white horizontal lines represent quartiles and black or white dashed horizontal lines represent the median. (H) The univariate linear scale that best separates the four pain states studied: CS, DB, LP, and HP, for pre-peak features only. Kernel density estimates and the original data decomposed by pain state are shown projected onto the univariate measure of pain identified by ordinal logistic regression. Here, we use the threshold between no-pain and pain (DB and LP) inferred by ordinal logistic regression as the zero point, and we rescale by the threshold between low pain (LP) and high pain (HP), fixing this point at one. (I) Same as a but for post-peak paw features only.

Figure 4—source data 1. Raw data values automatically computed from PAWS showing each mouse and stimulus across all eight inbred lines tested.
p=pain, NP = no pain.
elife-57258-fig4-data1.xlsx (198.9KB, xlsx)
Figure 4—source data 2. Raw data values automatically computed from PAWS showing each mouse and stimulus across four inbred lines tested before and after unilateral CFA paw injection.
p=pain, NP = no pain.

Figure 4.

Figure 4—figure supplement 1. Relative feature importance for individual behavioral features that contribute to the univariate pain score.

Figure 4—figure supplement 1.

(A) Magnitude of the weights assigned by ordinal logistic regression to the standardized (mean subtracted, variance scaled) pre-peak paw behavioral features. (B) Same as a but for post-peak paw features.
Figure 4—figure supplement 2. The univariate linear scale that best separates the four pain states studied: CS, DB, LP, and HP.

Figure 4—figure supplement 2.

Related to panels 4 H, I. Here, we are asking the model to distinguish between three conditions: no pain (CS or DB), light pain (LP), or heavy pain (HP).
Figure 4—figure supplement 3. Univariate linear scale using post-paw peak features before and after CFA injection and application of dynamic brush or 4 g von Frey hair filament.

Figure 4—figure supplement 3.

Kernel density estimates and the original data decomposed by pain state are shown projected onto the univariate measure of pain identified by ordinal logistic regression. As in Figure 2, we use the threshold between no-pain and pain (DB and LP) inferred by ordinal logistic regression as the zero point, and we rescale by the threshold between low pain (LP) and high pain (HP), fixing this point at one. Please note that we are not plotting an outlier DB data point in the pre-CFA panel (pain score was >10).

Next, we plotted the pain behavioral features that occur after the paw has reached its highest first peak and before the animal places its paw back to the surface. Similar to our observations with the four pre-paw peak pain behaviors, the four post-paw peak behaviors also show statistical separation between innocuous and pain stimuli (Figure 4E–D). Although we are not the first research group to observe paw shaking and guarding behaviors in rodents, this is one of the first technologies to automatically score the number and duration of these paw movement dynamics. Taken together, these automated measurements extracted from paw time series data are sufficient to objectively separate touch from pain in genetically diverse mice. Additionally, each one of our seven behavioral features was sufficient to independently separate touch versus pain during baseline conditions (Figure 4). Therefore, researchers can opt to choose any of these measurements to score the mouse pain state, and it’s possible that some of these features will become more or less important depending upon the model used and the basal state of the mouse (Figure 4—source data 1).

A univariate pain scale for scoring mice behavior

The seven behavioral features we extracted from video each provide more useful information than paw withdrawal frequency alone. Nevertheless, the simplest way to score behavior is using a univariate pain scale derived from these behavioral features. To accomplish this, we used ordinal logistic regression to identify a univariate linear subspace of the seven behavioral features that best separates the four somatosensory stimuli: CS, DB, LP, and HP. We did this for two cases: first, restricting to just four features quantifiable in the pre-peak period (t < t*), and then for all seven features quantified during the post-peak period (Figure 4H and I, respectively). For both the pre- and post-peak paw features, the two no-pain stimuli (CS and DB) cover the same parts of the subspace and are largely indistinguishable. By contrast, the low and high pain conditions (LP and HP) clearly separate from the no-pain stimuli and also separate from each other in this univariate pain score.

The relative importance of each standardized (mean subtracted, variance scaled) behavioral feature was quantified as the loading of that feature on the univariate pain scale identified by ordinal logistic regression. The feature importance for both pre- and post-peak features (Figure 4—figure supplement 1) indicate that relatively simple properties of the paw trajectory, such as the maximum height of the paw and the total distance traveled before the paw returns to resting position, contribute substantially to separating pain from no-pain. The max y velocity and the number of paw shakes further contributes to the separation of low versus high pain categories in the post-peak paw features (low and high pain classification is slightly improved by post- rather than pre-peak features, Figure 4—figure supplement 2).

Next, we used the PAWS system to quantify pain-related paw movement in mice treated with a common inflammatory pain agent, the complete Freund’s adjuvant (CFA). To accomplish this, we used 20 mice (5 each of the C57BL/6, 129S1, Balb/c, and A/J strains chosen randomly) and applied the innocuous dynamic brush and the 4 g VFH filament before and 48 hr after unilateral hind paw injection of CFA. Since the dynamic brush is not painful and 4 g VFH lies close to the boundary separating painful and non-painful responses, we reasoned that we would observe a mechanical allodynia phenotype with increased PAWS values (Abdus-Saboor et al., 2019; Alhadeff et al., 2018). However, we did not observe a significant increase in the PAWS measurements following CFA (Figure 4—figure supplement 3). In fact, following CFA the responses to dynamic brush were slightly decreased, and the responses to 4 g VFH, although greater than to dynamic brush, nevertheless fell near the threshold separating touch and pain at baseline (Figure 4—figure supplement 3Figure 4—source data 2). While performing these assays, we noted that the CFA-injected paw was red, inflamed, and swollen and the mice tried not to move the injured paw whatsoever. Thus, for peripheral manipulations that cause animals to drag heavy and swollen limbs, PAWS may not be a suitable system to detect pain hypersensitivity. Measuring thermal hyperalgesia with the Hargreaves assay, or detecting changes in mechanical threshold with VFHs, may be the most suitable method of quantifying CFA-induced pain hypersensitivity (Petrus et al., 2007; Dhandapani et al., 2018).

The performance of the univariate pain score was evaluated by leave-one-out (LOO) cross-validation for individual mice (Figure 5A), and by leave-one-strain-out for strains (Figure 5B). We compared the pre- and post-peak paw univariate pain scale classifications to a null model that assigns pain classes according to their probability in the training data, without reference to measured behavioral features. For prediction, CS and DB were treated as a single ‘no-pain’ class, while LP and HP were treated as a single ‘pain’ class, resulting in a binary classification: no-pain or pain (Figure 5—figure supplement 1). For predicting the pain state of a given mouse, or all the mice in a strain (generalizing across strains), post-peak paw features consistently performed best. In particular, the univariate pain score based on post-peak features was able to correctly identify the stimulus as painful or non-painful with 83.5% accuracy (cross-validated by leave-out one mouse); and it provided 81.3% accuracy when predicting the stimulus for a mouse of a novel strain (cross-validated by leave-out one strain). Nonparametric bootstrap 95% confidence intervals were [79.1%, 87.8%] and [76.9%, 85.9%], respectively.

Figure 5. Validating the accuracy of the univariate pain scale.

(A) Leave-one-out (LOO) cross-validation performance (binary classification accuracy of not pain/pain) of the pre-peak and post-peak univariate pain scales, compared to a null model (random assignment), for individual mice. Error bars show nonparametric bootstrap 95% confidence intervals. (B) Same as a, but leaving out an entire strain (colors) or an equivalent number of mice chosen uniformly at random (gray).

Figure 5.

Figure 5—figure supplement 1. Validating the accuracy of the univariate pain scale.

Figure 5—figure supplement 1.

Related to Figure 5. Plots show ternary classification accuracy (no pain/LP/HP).

Applying automated tracking and PAWS to a new mouse line

After testing the seven inbred lines described above, we next tested an additional strain (SJL) that other labs have used in various experimental contexts. The SJL mice have not been studied in the context of pain, but some researchers use these mice to study aggression, as these mice display strong bouts of aggression and are prone to fight (Lumley et al., 2004; Tellegen and Horn, 1972). We reasoned that this strain would make a good test case for our new platform.

Using traditional scoring with the SJL strain, we noticed the response rate mirrored the canonical C57BL/6 strain (Figure 1), with low responsiveness to a cotton swab, and high responsiveness to the other stimuli (Figure 6A). Using this metric alone, we might have concluded that the SJL strain is no different than the most commonly used wildtype strain C57BL/6. However, when using high-speed videography and PAWS we find that SJL appears hypersensitive to mechanical pain – responding to the soft dynamic brush as if it were pinprick and having even more elaborate paw withdrawal stimuli (Figure 6B). The outlier pattern of this strain is observed when scoring the seven individual pain behavioral features. Consistent with this finding, our confidence in predicting the stimulus the animal received based on its response using our univariate pain scale is significantly reduced in the SJL strain (significant difference in mean LOO accuracy between DBA1 and SJL; one-sided Welch two-sample t-test, t = 2.9405, df = 36.894, p=0.003), reflecting the nature of the outlier phenotype in these mice (Figure 6C). These data reveal that our automated pain assessment platform is capable of detecting individual strain differences in susceptibility to mechanical pain and that this platform can provide new information that would not have been predicted based on simple paw withdrawal frequencies.

Figure 6. Automated pain assessment platform uncovers an outlier strain.

Figure 6.

(A) Traditional scoring focused on paw withdrawal frequencies to four mechanical stimuli: cs = cotton swab, db = dynamic brush, lp = light pinprick, hp = heavy pinprick. N = 7 mice for the SJL strain. (B) ProAnalyst tracking showing SJL mice have pain-like response to dynamic brush and heightened pinprick responses. (C) Leave-one-out (LOO) cross validation by strain shows stimulus prediction for SJL mice is poor, likely because their responses typically map outside of the normal range of the other seven strains. Error bars show nonparametric bootstrap 95% confidence intervals.

PAWS detects motivational changes in pain perception following chemogenetic brain circuit manipulation

Finally, we sought to directly test whether our separation of pre- versus post-paw peak features (denoted by t*) could be functionally separated or attributed to nociception in supraspinal circuits for affective-motivational behaviors rather than circuits that might influence the sensitivity of the pre-peak reflexives. To accomplish this reasoned that we could not use a standard inflammatory or neuropathic pain model, as those assays would alter the pain circuit from periphery to the brain, and thus the question we sought to address in relation to separating pre- versus post-paw peak behaviors would be obscured. Therefore, we focused our attention on manipulating pain circuits in the BLA (Corder et al., 2019). Thus, we hypothesized that the post-paw peak features could be increased by selectively driving hyperactivity in a recently identified basolateral amygdala nociceptive ensemble. Briefly, we used the activity-dependent transgenic TRAP2 mice (Fos-FOS-p2A-iCre-ERT2) to gain viral-genetic access to BLA neurons that are responsive to a noxious pinprick to the left hind paw. The transgenic mice in combination with an AAV expressing an excitatory DREADD (AAV5-hSyn-FLEx-hM3q-mCherry) allow the specific expression of hM3 only in neurons responsive to the noxious pinprick stimulus (painTRAP2hM3; Figure 7A). We confirmed bilateral hM3-mCherry expression that was restricted to only the BLA and not within the neighboring central nucleus (Figure 7—figure supplement 1). We first performed our analysis on Cre-negative control animals and observed no behavioral effects of hM3-agonist CNO (i.p., 3.0 mg/kg). Next, we performed our behavioral analysis on painTRAP2hM3 mice at baseline (-CNO) and after activation of the BLA pain ensemble (+CNO) (Figure 7). Because activation of the BLA pain ensemble resulted in unilateral spontaneous guarding pain behaviors, we waited until mice were calm, stood still , and had all four paws on the surface before applying our sensory stimuli. We applied a cotton swab and dynamic brush on one day, and a light and heavy pinprick on a second day. We observed that the PAWS measurements at baseline (-CNO) mirrored those that we observed with the seven strains described above, showing separation between innocuous and noxious stimuli (Figure 7B–H). Conversely, with activation of the BLA nociceptive ensemble in painTRAP2hM3 mice (+CNO), we noticed increased separation with the post-paw peak pain measurements when delivering the heavy pinprick stimuli, most noticeable in the combined shaking/guarding measurement (Figure 7H). When we used ordinal logistic regression for statistical separation that combined our seven behavioral features, we observed an emergent behavioral representation of the post-paw peak features that was highly separated from controls, indicating that selective activation of supraspinal amygdalar ensembles shifted the internal state of the animals to engage in heightened nociceptive behaviors (Figure 7I,J). Specifically, after CNO administration all pain scale values with heavy pinprick applications are greater than every other stimulus (Figure 7J). Thus, PAWS automatically measures increased mechanical hyperalgesia to noxious stimuli that was influenced by top-down processes in the absence of peripheral or spinal sensitization. These data suggest that our pre- versus post-peak designations have biological relevance because we can genetically increase the gain on the more temporally-delayed behavioral representations, which may reflect the aversiveness of the pain experience (Figure 7JFigure 7—source data 1).

Figure 7. Pain hypersensitivity with chemogenetic activation of the BLA pain ensemble automatically captured via PAWS.

(A) Schematic to permanently tag pain-active neurons in the BLA with an excitatory DREADD in transgenic painTRAP2hM3 mice. (B-D) Automatic measurement of pre-paw peak features comparing mice at baseline (-CNO) to mice administered CNO. (E-H) Automatic measurement of post-paw peak features comparing mice at baseline (-CNO) to mice administered CNO. Stimulus abbreviations are same as above. Statistical significance was computed with student’s t-test. *p-value ≤0.05. **p-value ≤0.01. Raw values were plotted instead of z-scores as shown in Figure 3 because we are plotting only six mice in this figure. (I-J) Ordinal logistic regression reveals separation of innocuous versus noxious stimuli at baseline (-CNO) and further segregation of the HP group after CNO, only when using all seven paw features. N = 6 mice tested in these experiments. The second panel in J omits one outlier mouse (HP treatment) that produced pain score >25.

Figure 7—source data 1. Raw data values automatically computed from PAWS showing each painTRAP2hM3 mouse combined with sensory stimuli before and after CNO administration.
p=pain, NP = no pain.

Figure 7.

Figure 7—figure supplement 1. Targeting excitatory DREADD in painTRAP2hM3 mice.

Figure 7—figure supplement 1.

(A) 4X image of the specific and robust expression of m-Cherry-labeled excitatory DREADD bilaterally in the BLA. (B) 20X image of BLA from the white box in A. Scale bars represent 500 μm.

Discussion

Here, we describe an automated approach to quantify the most salient behavioral features following mechanical stimulation of the mouse paw for separating responses according to stimulus intensity. Scoring the paw withdrawal reflex to a natural stimulus is the most commonly used assessment method in preclinical rodent models with yes/no responsiveness used as a proxy for inferring pain states. While this methodology for measuring pain in rodents is not fundamentally flawed, it lacks resolution – a limitation that can now be overcome with advances in videography and automated tracking. Here, by automatically tracking paw dynamics at sub-second speeds with high-speed videography coupled with machine learning approaches, we reveal stereotyped trajectory patterns in response to innocuous versus noxious stimuli spanning genetically diverse mice. With an accurate pinpoint of the paw at high spatiotemporal resolution, a freely available software package we term PAWS automatically quantifies seven behavioral features that are combined to determine the mouse pain state. Notably, the seven behavioral features we defined have the benefit of quantifying pain response in terms of intuitive behaviors (such as paw shakes, guarding, etc), in contrast to generic unsupervised machine learning approaches. Demonstrating the robustness of the platform, we identify an outlier mouse strain that displays heightened pain sensitivity, and we accurately measure a heightened pain state when we simultaneously activate pain in the periphery and brain using chemogenetic and natural stimuli.

Behavioral neuroscience in model organisms is undergoing a renaissance with the emergence of new tools to automatically track and measure behavior (Datta et al., 2019; Fried et al., 2020; von Ziegler et al., 2020; Günel et al., 2019; Graving et al., 2019; Itskovits et al., 2017; Kabra et al., 2013; Klibaite et al., 2017). This renaissance is coincident with many in the research community questioning the robustness of rodent models of pain, addiction, depression, anxiety, and other neuropsychiatric disorders. Many researchers are taking steps backward to first properly understand the components of a complex behavioral sequence before proceeding to identify the neuronal correlates that drive those motor patterns (Datta et al., 2019; Fried et al., 2020; Berman, 2018). Both supervised and unsupervised machine learning algorithms are now able to follow unlabeled individual limbs on an experimental animal and automatically define behaviors of interest (Wiltschko et al., 2015; Pereira et al., 2019; Mathis et al., 2018; Graving et al., 2019; Berman et al., 2014; Kabra et al., 2013). Although the majority of these tools have yet to be adopted en masse by the pain research community, some of this technology is already in use by pain researchers. For example, the automated grimace scale developed by the Mogil and Zylka labs uses a convolutional neural network trained with 6000 facial images of mice in ‘pain’ or ‘no-pain’, to make accurate predictions of the mouse pain state in novel datasets (Tuttle et al., 2018). The automated grimace scale still requires additional customization to assess rodents of different coat colors and to measure chronic pain. Tools like the automated grimace scale that focus on the face, could be combined with the automated pain assessment platform described here that focuses on the paw, for a comprehensive picture of both evoked and spontaneous behavioral responsiveness.

Here, with our platform, we observed both homogeneties in behavioral responses across seven genetically distinct mouse lines, as well as an outlier strain with responses that mapped outside the range of those seven. A wealth of prior literature demonstrated that individual differences in responsiveness to pain in both mice and humans are driven in part by allelic variation in genes important for pain processing (Mogil, 2012; Calvo et al., 2019). In regards to the mouse, pioneering studies carried out 20 years ago by the Mogil group testing pain sensitivity across 11 inbred lines using 12 behavioral read-outs, revealed that depending upon the sensory modality tested, and whether the test was performed before or after injury to the somatosensory system, genotype appeared to influence mouse pain behaviors (Mogil et al., 1999). A meta-analysis from 10 years ago described over 400 papers from mouse pain research that implicated ~350 genes in pain and analgesia (LaCroix-Fralish et al., 2007). However, with the relatively limited resolution of some conventional pain behavior assays, and displayed here with our traditional scoring of the data, it remains unclear how reliable some of these studies are and which potential target genes merit further development as novel analgesics. This assertion is underscored by the fact that only a handful of targets that have shown promise in rodents, have made it to the clinic as novel therapeutics, causing many to question the robustness of the animal models used in pain testing (Vardeh et al., 2016; Woolf, 2010).

We also observed a new hypersensitivity phenotype, where SJL mice respond to a soft brush as if it were a pinprick. To the best of our knowledge, this is the first report of a pain hyper-sensitivity phenotype for SJL mice at baseline. Of note, SJL mice are known to be an aggressor mouse line and even in our studies, three animals had to be removed from testing due to excessive fighting between cage mates (Miner et al., 1993). Therefore, future studies are necessary to determine if the genetic repertoire that makes these mice aggressive also contributes to their heightened pain responses. Additionally, SJL mice are homozygous for the retinal degeneration one mutation, Pde6brd1, causing poor eyesight and even blindness in some of these mice (Chang et al., 2002; Chang et al., 2013). It is therefore possible that what we perceive as pain hypersensitivity may reflect more of heightened startle response, especially if the mice perceive an intense stimulation yet cannot localize who is delivering this stimulus. Future studies are warranted to determine if the altered behavioral changes seen here are driven by somatosensory system changes or are rather the byproduct of alterations elsewhere in the animal. Together, these results demonstrate that we can begin to use this platform as a behavioral screening tool to identify outlier strains (some will be of more interest than others), followed by subsequent genetic mapping approaches to uncover new alleles that may directly contribute to pain perception.

Finally, we tested the ability of our platform to uniquely identify changes that were driven by selective manipulation of a central pain circuit. We demonstrate that we can chemogenetically activate the BLA pain ensemble and detect hypersensitivity to peripheral stimuli. In addition to confirming the precision of our technology, these experiments raise an intriguing biological question: what is the emotional and sensory experience of activating pain-responsive neurons without a peripheral injury? Would tonic activation of this BLA pain ensemble be comparable to human experiences of chronic pain? Would DREADD inhibition of this BLA circuit be sufficient to block the post-paw peak behavioral features such as paw shaking and guarding? Further studies to examine these questions are ongoing. Additionally, we noticed that only the heavy pinprick stimulus increased the aversive responses when we reactivated the amygdalar nociceptive circuit. Based on our single-neuron microendoscope calcium imaging data in the basolateral amygdala (Corder et al., 2019), the nociceptive neurons respond only to strongly-noxious sensory stimuli. Only after peripheral nerve damage to induce chronic neuropathic pain, did we observe that formerly-innocuous stimuli engage this BLA ensemble. This suggests that some up-circuit plasticity occurred that redirected touch information into the BLA ensemble (e.g. opening of the spinal ‘gate’, see Torsney and MacDermott, 2006; Cui et al., 2016; Petitjean et al., 2015; Braz et al., 2014; Cheng et al., 2017; Duan et al., 2014; Foster et al., 2015). Thus, without this central plasticity, the other stimuli likely do not activate the ensemble under normal, uninjured conditions. Therefore, chemogenetic activation of the nociceptive ensemble is not necessarily predicted to amplify or modulate neural processes related to non-noxious stimuli. We did observe some spontaneous lifting and licking of the hind paws upon CNO treatment, which could reflect general aversive perception, but how this might alter withdrawal reflexes that use distinct neural circuits is unclear. The fact that we observed a robust increase in paw shaking and guarding-related behaviors supports the conclusion that the BLA nociceptive ensemble is specifically tuned to encode and modulate nociception only.

In summary, we have developed a rapid and user-friendly automated pain assessment platform for measuring mechanical pain in mice. Since mechanical stimulation of the rodent hind paw remains the most common method to measure pain in mice, the tools described here, with the addition of a high-speed camera, are fully compatible with the setups that most labs currently use. As such, we do not foresee major hurdles in the wide adoption of this methodology. To increase our fundamental understanding of the neurobiology of pain and to translate our basic science findings in pain research to improved patient outcomes, the pain measurement tools in rodents must be robust. The platform described here should aid in that pursuit.

Materials and methods

Key resources table.

Reagent type
(species) or resource
Designation Source or reference Identifiers Additional information
Genetic reagent (M. musculus) C57BL/6J Jackson Laboratory RRID:IMSR_JAX:000664
Genetic reagent (M. musculus) A/J Jackson Laboratory RRID:IMSR_JAX:000646
Genetic reagent (M. musculus) 129S1/SvlmJ Jackson Laboratory Stock No: 002448
Genetic reagent (M. musculus) BALB/cJ Jackson Laboratory Stock No: 000651
Genetic reagent (M. musculus) DBA/1J Jackson Laboratory Stock No: 000670
Genetic reagent (M. musculus) AKR/J Jackson Laboratory RRID:IMSR_JAX:000648
Genetic reagent (M. musculus) CBA/J Jackson Laboratory Stock No: 000656
Genetic reagent (M. musculus) SJL/J Jackson Laboratory Stock No: 000686
Genetic reagent (M. musculus) Fos-FOS-2A-iCre-ERT2 Jackson Laboratory Stock no: 030323 PMID:28912243
Chemical compound, drug Clozapine N-oxide
Sigma C0832 3 mg/kg
Chemical compound, drug Complete Freund’s Adjuvant Sigma F5881 10 μL/paw
Software, algorithm PAWS this paper https://github.com/crtwomey/paws
Software, algorithm ProAnalyst https://www.xcitex.com/proanalyst-motion-analysis-software.php
Software, algorithm SLEAP https://sleap.ai/ PMID:30573820

Mouse strains

Mice for behavior testing were maintained in a barrier animal facility in the Carolyn Lynch or Translational Research Laboratory (TRL) buildings at the University of Pennsylvania. The Lynch and TRL vivariums are temperature controlled and maintained under a 12 hr light/dark cycle (7 am/7 pm) at 70 degrees Fahrenheit with ad lib access to food (Purina LabDiet 5001) and tap water. The feed compartment on the wire box lid of the cage was kept at a minimum of 1/3 full at all times. All cages were provided with nestlets to provide enrichment for mice. All procedures were conducted according to animal protocols approved by the university Institutional Animal Care and Use Committee (IACUC) and in accordance with the National Institutes of Health (NIH) guidelines. C57BL/6J, A/J, 129S1/SvlmJ, Balb/cJ, DBA/1J, AKR/J, CBA/J, SJL/J, and TRAP2 mice (Fos-FOS-2A-iCre-ERT2, stock no. 030323) mice were all purchased from Jackson Laboratories. All animals were habituated to our facility for 2 weeks after delivery before beginning behavioral experiments described below. All inbred strains of mice tested were male with 10 mice used per strain used unless otherwise noted in the figure or figure legend, while six female TRAP2 mice tested. The choice to use female mice in the TRAP2 experiments was random based on availability of mice. All mice were adults between 2 and 4 months. Animals were co-housed with 4–5 mice per cage in a large holding room containing approximately 500 cages of mice. For the SJL/J strain, some aggression was observed, and several mice had to be removed from testing and separated due to fighting.

High-speed imaging and video storage

Mouse behaviors were recorded at 2000 fps with a high-speed camera (Photron FastCAM Mini AX 50 170 K-M-32GB - Monochrome 170K with 32 GB memory) and attached lens (Zeiss 2/100M ZF.2-mount). Mice performed behavior in rectangular plexiglass chambers on an elevated mesh platform. The camera was placed at a ~45° angle at ~1–2 feet away from the Plexiglas holding chambers on a tripod with geared head for Photron AX 50. CMVision IP65 infrared lights that mice cannot detect were used to adequately illuminate the paw for subsequent tracking in ProAnalyst. All data were collected on a Dell laptop computer with Photron FastCAM Analysis software. Each cohort of mice (n = 40) used roughly 80 GB of video storage space, (average size of video file = ~2 GB), which amounts to a total of ~160 GB for all eight inbred strains. The format for tracking data = . csv and. txt output (which amounted to ~5.4 MB for the eight inbred lines).

Somatosensory behavior assays

In all behavioral experiments, we used a sample size of 6–10 mice per strain, as these numbers of consistent with studies of this kind in the literature to reach statistically significant conclusions. For experiments with inbred mice, animals were not ear-tagged, so we had no source of identification, and thus animals within a given strain were randomly tested. For experiments with fos-trap mice, both the experimenter and evaluator of the data were blind to which animals were in the -CNO versus +CNO experimental groups. In general, all mice were habituated for a minimum of 5 days, for one hour each day, in the Plexiglas holding chambers before testing commenced. During baseline, mice were tested in groups of five and chambers were placed in a row with barriers preventing mice from seeing each other. On testing day, mice were habituated for an additional ~10 min before stimulation and tested one at a time. Stimuli were applied through the mesh to the hind paw proximal to the camera. Testing only occurred when the camera’s view of the paw was unobstructed. Mice only received one stimulus on a given testing day (cs, db, lp, or hp) and were given at least 24 hr between each stimulus session. Stimuli were tested from least painful to most: cotton swab, dynamic brush, light pinprick, and heavy pinprick. Cotton swab tests consisted of contact between the cotton Q-tip and the hind paw until paw withdrawal. Dynamic brush tests were performed by wiping a concealer makeup brush (L'Oréal Paris Infallible Concealer Brush, item model number 3760228170158) across the hind paw from back to front. Light pinprick tests were performed by touching a pin (Austerlitz Insect Pins) to the hind paw of the mouse. The pin was withdrawn as soon as contact was observed. Heavy pinprick tests were performed by sharply pressing this pin into the paw so that it was pushed upward, without the breaking the skin barrier. The pin was withdrawn as soon as approximately 1/3 of the pin’s length had passed through the mesh. For the application of von Frey hairs (VFHs, Stoelting Company, 58011), we used 4 g of force. As previously described, the VFH was directed at the center of the plantar paw and pressed upward until the filament bent (Abdus-Saboor et al., 2019). For inducing inflammatory pain, approximately 10 μL of Complete Freund’s Adjuvant (CFA, Sigma, F5881) was injected into the plantar surface of 3% isoflurane-anesthetized mice as previously published (Abdus-Saboor et al., 2019).

Automated paw tracking

We used ProAnalyst software to automatically track hind paw movements following stimulus application. This software allowed us to integrate automated and manually scored data, possible through the ‘interpolation’ feature within ProAnalyst. We were able to define specific regions of interest (paw), track, and generate data containing ‘x’ and ‘y’ coordinates of the paw through time, as well as velocity, speed, and acceleration information. In a subset of videos, additional manual annotation was performed for increased accuracy. For deep learning-based paw tracking with the SLEAP algorithm, we pseudo-randomly chose a small set of training frames from each video and hand-labeled the paw and toe. We trained SLEAP to predict toe and paw positions in unlabeled video frames (>~90% of total video frames). To generate trajectories, we overlaid the inferred x, y positions of the toe and paw in each video frame on a single still image corresponding to the apex of the mouse’s paw during the assay.

Development of PAWS software to quantify pain behaviors

Behavioral features were extracted from raw paw position time series in an automated and standardized procedure. First, the start and end of paw movement (paw at rest on the ground) were identified, and analysis was restricted to this time window. Peaks in paw height were then determined based on Savitsky-Golay smoothed estimates of paw velocity, and the first peak identified. The time of the first peak (designated t*) was used to separate pre-peak behavioral feature calculations from post-peak calculations. To differentiate shaking from guarding in the post-peak period, we constructed a moving reference frame based on the principal axis of paw displacement across a sliding window (0.04 s in duration) for each time point, and identified periods of consecutive displacements above a specified threshold (35% of maximum paw height) as periods of shaking. Note that in the construction of the moving reference frame the principal axes of variation were recovered via principal component analyses, which is not invariant to the sign of the recovered axes. Since displacement is measured over time it is sensitive to reversals in sign along the axis we measure it. We therefore ensured consistency by using the axis direction minimizing the angular deviation from the axis recovered at the previous time step. PAWS is open source and freely available at https://github.com/crtwomey/paws.

Drugs

4-hydroxytamoxifen (Hello Bio, #HB2508) prepared in Kolliphor EL (Sigma, #27963), Clozapine-N-oxide (Hello Bio, #HB6149), and 0.9% sodium chloride (Sigma, #S3014).

Viral reagents

For chemogenetic manipulation of BLA pain-active neurons, we intracranially injected 200 nL of AAV5-hSyn-DIO-hM3D(Gq)-mCherry (Addgene, titer: 7 × 1012) into both the left and right BLA at coordinates AP: −1.4 mm, ML:±3.1 mm, DV: −4.2.

Stereotactic injections and surgical procedures

We conducted surgeries under aseptic conditions using a small stereotaxic instrument (World Precision Instruments). We anesthetized mice with isofluorane (5% induction, 1–2% maintenance) during the entire surgery and maintained body temperature using a heating block. We injected mice with a beveled 33G needle attached to a 10 µL syringe (Nanofil, WPI) for delivery of 200 nL of viral reagent at a rate of 40 nL/min. After viral injection, the needle remained at the injection depth for 10 min before slow withdrawal over 2 min. After surgery, we maintained the animal’s body temperature using a heating pad.

Targeted recombination in active populations (TRAP) of BLA pain ensemble

We utilized female Fos2A-iCreERT2 (TRAP2) mice, Jackson Laboratory, stock #030323 aged P46-73. We bilaterally transduced an AAV (AAV9-hSyn-DIO-hM3D(Gq)-mCherry) into the BLA. Two weeks after injection, mice were stimulated with a noxious pinprick on the left hind paw every 30 s for 10 min. One hour later, we injected mice with 4-hydroxytamoxifen (4-OHT) (20 mg/kg in 0.2 mL vehicle; subcutaneous) to induce genetic recombination. Eight weeks following 4-OHT administration, we examined mouse behavior. We examined behavior 30 min after injection of CNO (3 mg/kg in 0.2 mL vehicle; subcutaneous) or vehicle. Mice were euthanized via transcardial perfusion 4 months after viral injections. Formalin-fixed brains were collected and sectioned at 50 µm on a cryostat. Tissue was mounted and imaged on a fluorescent Keyence microscope.

Acknowledgements

We thank Dr. Mala Murthy and Nat Tabris for the development of the deep neural network SLEAP and freely sharing an early and unpublished version of this platform with us. We thank members of the Abdus-Saboor, Plotkin, Murthy, and Corder labs for helpful comments on this manuscript. IA-S, JJ, and WF are supported by startup funds from the University of Pennsylvania and by a grant from the National Institutes of Health (NIH/NIDCR, R00-DE026807). OMA and TDP are supported by NIH BRAIN Initiative R01 NS104899. OMA is also supported by the BWF PDEP. Members of the Corder lab are supported by NIH/NIDA grant R00-DA043609. JBP is supported by the Defense Advanced Research Projects Agency NGS2 program (grant D17AC00005), the Army Research Office (grant W911NF-17-1-0083), and the David and Lucile Packard Foundation. CRT is supported by the UPenn mindCORE fellowship.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Ishmail Abdus-Saboor, Email: ishmail@sas.upenn.edu.

Kate M Wassum, University of California, Los Angeles, United States.

Rebecca Seal, University of Pittsburgh School of Medicine, United States.

Funding Information

This paper was supported by the following grants:

  • National Institutes of Health R00-DE026807 to Jessica M Jones, Ishmail Abdus-Saboor.

  • National Institutes of Health R01 NS104899 to Osama Ahmed, Talmo D Pereira.

  • National Institutes of Health R00-DA043609 to Jessica A Wojick, Gregory Corder.

  • Army Research Office W911NF-17-1-0083 to Joshua B Plotkin.

  • Defense Advanced Research Projects Agency D17AC00005 to Joshua B Plotkin.

  • Burroughs Wellcome Fund PDEP to Osama M Ahmed.

  • mindCORE UPenn mindCORE fellowship to Colin Twomey.

  • David and Lucile Packard Foundation to Joshua B Plotkin.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Resources, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing - review and editing.

Conceptualization, Resources, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing - review and editing.

Conceptualization, Resources, Software, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Resources, Data curation, Formal analysis, Investigation, Visualization, Methodology, Writing - review and editing.

Resources, Investigation, Methodology, Writing - review and editing.

Investigation, Methodology, Writing - review and editing.

Data curation, Formal analysis, Investigation, Writing - review and editing.

Data curation, Formal analysis, Investigation, Methodology, Writing - review and editing.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Validation, Investigation, Visualization, Methodology, Writing - original draft, Project administration, Writing - review and editing.

Conceptualization, Resources, Data curation, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing - original draft, Project administration, Writing - review and editing.

Ethics

Animal experimentation: This study was performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. All of the animals were handled according to approved institutional animal care and use committee (IACUC) protocols (#806519) of the University of Pennsylvania.

Additional files

Transparent reporting form

Data availability

Raw data are associated with figures as source data.

References

  1. Abdus-Saboor I, Fried NT, Lay M, Burdge J, Swanson K, Fischer R, Jones J, Dong P, Cai W, Guo X, Tao YX, Bethea J, Ma M, Dong X, Ding L, Luo W. Development of a mouse pain scale using Sub-second behavioral mapping and statistical modeling. Cell Reports. 2019;28:1623–1634. doi: 10.1016/j.celrep.2019.07.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alhadeff AL, Su Z, Hernandez E, Klima ML, Phillips SZ, Holland RA, Guo C, Hantman AW, De Jonghe BC, Betley JN. A neural circuit for the suppression of pain by a competing need state. Cell. 2018;173:140–152. doi: 10.1016/j.cell.2018.02.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Arcourt A, Gorham L, Dhandapani R, Prato V, Taberner FJ, Wende H, Gangadharan V, Birchmeier C, Heppenstall PA, Lechner SG. Touch Receptor-Derived sensory information alleviates acute pain signaling and Fine-Tunes nociceptive reflex coordination. Neuron. 2017;93:179–193. doi: 10.1016/j.neuron.2016.11.027. [DOI] [PubMed] [Google Scholar]
  4. Attal N, Bouhassira D, Baron R. Diagnosis and assessment of neuropathic pain through questionnaires. The Lancet Neurology. 2018;17:456–466. doi: 10.1016/S1474-4422(18)30071-1. [DOI] [PubMed] [Google Scholar]
  5. Basbaum AI, Bautista DM, Scherrer G, Julius D. Cellular and molecular mechanisms of pain. Cell. 2009;139:267–284. doi: 10.1016/j.cell.2009.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Berman GJ, Choi DM, Bialek W, Shaevitz JW. Mapping the stereotyped behaviour of freely moving fruit flies. Journal of the Royal Society Interface. 2014;11:20140672. doi: 10.1098/rsif.2014.0672. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Berman GJ. Measuring behavior across scales. BMC Biology. 2018;16:23. doi: 10.1186/s12915-018-0494-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Blivis D, Haspel G, Mannes PZ, O'Donovan MJ, Iadarola MJ. Identification of a novel spinal nociceptive-motor gate control for aδ pain stimuli in rats. eLife. 2017;6:e23584. doi: 10.7554/eLife.23584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Braz J, Solorzano C, Wang X, Basbaum AI. Transmitting pain and itch messages: a contemporary view of the spinal cord circuits that generate gate control. Neuron. 2014;82:522–536. doi: 10.1016/j.neuron.2014.01.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Browne LE, Latremoliere A, Lehnert BP, Grantham A, Ward C, Alexandre C, Costigan M, Michoud F, Roberson DP, Ginty DD, Woolf CJ. Time-Resolved fast mammalian behavior reveals the complexity of protective pain responses. Cell Reports. 2017;20:89–98. doi: 10.1016/j.celrep.2017.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Calvo M, Davies AJ, Hébert HL, Weir GA, Chesler EJ, Finnerup NB, Levitt RC, Smith BH, Neely GG, Costigan M, Bennett DL. The genetics of neuropathic pain from model organisms to clinical application. Neuron. 2019;104:637–653. doi: 10.1016/j.neuron.2019.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chang B, Hawes NL, Hurd RE, Davisson MT, Nusinowitz S, Heckenlively JR. Retinal degeneration mutants in the mouse. Vision Research. 2002;42:517–525. doi: 10.1016/S0042-6989(01)00146-8. [DOI] [PubMed] [Google Scholar]
  13. Chang B, Hurd R, Wang J, Nishina P. Survey of common eye diseases in laboratory mouse strains. Investigative Opthalmology & Visual Science. 2013;54:4974–4981. doi: 10.1167/iovs.13-12289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cheng L, Duan B, Huang T, Zhang Y, Chen Y, Britz O, Garcia-Campmany L, Ren X, Vong L, Lowell BB, Goulding M, Wang Y, Ma Q. Identification of spinal circuits involved in touch-evoked dynamic mechanical pain. Nature Neuroscience. 2017;20:804–814. doi: 10.1038/nn.4549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Corder G, Ahanonu B, Grewe BF, Wang D, Schnitzer MJ, Scherrer G. An amygdalar neural ensemble that encodes the unpleasantness of pain. Science. 2019;363:276–281. doi: 10.1126/science.aap8586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cui L, Miao X, Liang L, Abdus-Saboor I, Olson W, Fleming MS, Ma M, Tao YX, Luo W. Identification of early RET+ deep dorsal spinal cord interneurons in gating pain. Neuron. 2016;91:1137–1153. doi: 10.1016/j.neuron.2016.07.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Datta SR, Anderson DJ, Branson K, Perona P, Leifer A. Computational neuroethology: a call to action. Neuron. 2019;104:11–24. doi: 10.1016/j.neuron.2019.09.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Deuis JR, Dvorakova LS, Vetter I. Methods used to evaluate pain behaviors in rodents. Frontiers in Molecular Neuroscience. 2017;10:284. doi: 10.3389/fnmol.2017.00284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Dhandapani R, Arokiaraj CM, Taberner FJ, Pacifico P, Raja S, Nocchi L, Portulano C, Franciosa F, Maffei M, Hussain AF, de Castro Reis F, Reymond L, Perlas E, Garcovich S, Barth S, Johnsson K, Lechner SG, Heppenstall PA. Control of mechanical pain hypersensitivity in mice through ligand-targeted photoablation of TrkB-positive sensory neurons. Nature Communications. 2018;9:1640. doi: 10.1038/s41467-018-04049-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Douglass AD, Kraves S, Deisseroth K, Schier AF, Engert F. Escape behavior elicited by single, channelrhodopsin-2-evoked spikes in zebrafish somatosensory neurons. Current Biology. 2008;18:1133–1137. doi: 10.1016/j.cub.2008.06.077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Duan B, Cheng L, Bourane S, Britz O, Padilla C, Garcia-Campmany L, Krashes M, Knowlton W, Velasquez T, Ren X, Ross S, Lowell BB, Wang Y, Goulding M, Ma Q. Identification of spinal circuits transmitting and gating mechanical pain. Cell. 2014;159:1417–1432. doi: 10.1016/j.cell.2014.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Foster E, Wildner H, Tudeau L, Haueter S, Ralvenius WT, Jegen M, Johannssen H, Hösli L, Haenraets K, Ghanem A, Conzelmann KK, Bösl M, Zeilhofer HU. Targeted ablation, silencing, and activation establish glycinergic dorsal horn neurons as key components of a spinal gate for pain and itch. Neuron. 2015;85:1289–1304. doi: 10.1016/j.neuron.2015.02.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fried NT, Chamessian A, Zylka MJ, Abdus-Saboor I. Improving pain assessment in mice and rats with advanced videography and computational approaches. Pain. 2020;161:1420–1424. doi: 10.1097/j.pain.0000000000001843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Graving JM, Chae D, Naik H, Li L, Koger B, Costelloe BR, Couzin ID. DeepPoseKit, a software toolkit for fast and robust animal pose estimation using deep learning. eLife. 2019;8:e47994. doi: 10.7554/eLife.47994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Günel S, Rhodin H, Morales D, Campagnolo J, Ramdya P, Fua P. DeepFly3D, a deep learning-based approach for 3D limb and appendage tracking in tethered, adult Drosophila. eLife. 2019;8:e48571. doi: 10.7554/eLife.48571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Itskovits E, Levine A, Cohen E, Zaslaver A. A multi-animal tracker for studying complex behaviors. BMC Biology. 2017;15:29. doi: 10.1186/s12915-017-0363-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kabra M, Robie AA, Rivera-Alba M, Branson S, Branson K. JAABA: interactive machine learning for automatic annotation of animal behavior. Nature Methods. 2013;10:64–67. doi: 10.1038/nmeth.2281. [DOI] [PubMed] [Google Scholar]
  28. Klibaite U, Berman GJ, Cande J, Stern DL, Shaevitz JW. An unsupervised method for quantifying the behavior of paired animals. Physical Biology. 2017;14:015006. doi: 10.1088/1478-3975/aa5c50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. LaCroix-Fralish ML, Ledoux JB, Mogil JS. The pain genes database: an interactive web browser of pain-related transgenic knockout studies. Pain. 2007;131:3e1–33. doi: 10.1016/j.pain.2007.04.041. [DOI] [PubMed] [Google Scholar]
  30. Langford DJ, Bailey AL, Chanda ML, Clarke SE, Drummond TE, Echols S, Glick S, Ingrao J, Klassen-Ross T, Lacroix-Fralish ML, Matsumiya L, Sorge RE, Sotocinal SG, Tabaka JM, Wong D, van den Maagdenberg AM, Ferrari MD, Craig KD, Mogil JS. Coding of facial expressions of pain in the laboratory mouse. Nature Methods. 2010;7:447–449. doi: 10.1038/nmeth.1455. [DOI] [PubMed] [Google Scholar]
  31. Libby T, Moore TY, Chang-Siu E, Li D, Cohen DJ, Jusufi A, Full RJ. Tail-assisted pitch control in lizards, robots and dinosaurs. Nature. 2012;481:181–184. doi: 10.1038/nature10710. [DOI] [PubMed] [Google Scholar]
  32. Lumley LA, Robison CL, Slusher BS, Wozniak K, Dawood M, Meyerhoff JL. Reduced isolation-induced aggressiveness in mice following NAALADase inhibition. Psychopharmacology. 2004;171:375–381. doi: 10.1007/s00213-003-1610-z. [DOI] [PubMed] [Google Scholar]
  33. Mathis A, Mamidanna P, Cury KM, Abe T, Murthy VN, Mathis MW, Bethge M. DeepLabCut: markerless pose estimation of user-defined body parts with deep learning. Nature Neuroscience. 2018;21:1281–1289. doi: 10.1038/s41593-018-0209-y. [DOI] [PubMed] [Google Scholar]
  34. Melzack R. The short-form McGill pain questionnaire. Pain. 1987;30:191–197. doi: 10.1016/0304-3959(87)91074-8. [DOI] [PubMed] [Google Scholar]
  35. Miner LL, Elmer GI, Pieper JO, Marley RJ. Aggression modulates genetic influences on morphine analgesia as assessed using a classical mendelian cross analysis. Psychopharmacology. 1993;111:17–22. doi: 10.1007/BF02257401. [DOI] [PubMed] [Google Scholar]
  36. Mogil JS, Wilson SG, Bon K, Lee SE, Chung K, Raber P, Pieper JO, Hain HS, Belknap JK, Hubert L, Elmer GI, Chung JM, Devor M. Heritability of nociception I: responses of 11 inbred mouse strains on 12 measures of nociception. Pain. 1999;80:67–82. doi: 10.1016/S0304-3959(98)00197-3. [DOI] [PubMed] [Google Scholar]
  37. Mogil JS. Pain genetics: past, present and future. Trends in Genetics. 2012;28:258–266. doi: 10.1016/j.tig.2012.02.004. [DOI] [PubMed] [Google Scholar]
  38. Pereira TD, Aldarondo DE, Willmore L, Kislin M, Wang SS, Murthy M, Shaevitz JW. Fast animal pose estimation using deep neural networks. Nature Methods. 2019;16:117–125. doi: 10.1038/s41592-018-0234-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Petitjean H, Pawlowski SA, Fraine SL, Sharif B, Hamad D, Fatima T, Berg J, Brown CM, Jan L-Y, Ribeiro-da-Silva A, Braz JM, Basbaum AI, Sharif-Naeini R. Dorsal horn parvalbumin neurons are Gate-Keepers of Touch-Evoked pain after nerve injury. Cell Reports. 2015;13:1246–1257. doi: 10.1016/j.celrep.2015.09.080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Petrus M, Peier AM, Bandell M, Hwang SW, Huynh T, Olney N, Jegla T, Patapoutian A. A role of TRPA1 in mechanical hyperalgesia is revealed by pharmacological inhibition. Molecular Pain. 2007;3:40. doi: 10.1186/1744-8069-3-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Severson KS, Xu D, Van de Loo M, Bai L, Ginty DD, O'Connor DH. Active touch and Self-Motion encoding by merkel Cell-Associated afferents. Neuron. 2017;94:666–676. doi: 10.1016/j.neuron.2017.03.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Tellegen A, Horn JM. Primary aggressive motivation in three inbred strains of mice. Journal of Comparative and Physiological Psychology. 1972;78:297–304. doi: 10.1037/h0032192. [DOI] [PubMed] [Google Scholar]
  43. Tiriac A, Uitermarkt BD, Fanning AS, Sokoloff G, Blumberg MS. Rapid whisker movements in sleeping newborn rats. Current Biology. 2012;22:2075–2080. doi: 10.1016/j.cub.2012.09.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Torsney C, MacDermott AB. Disinhibition opens the gate to pathological pain signaling in superficial neurokinin 1 receptor-expressing neurons in rat spinal cord. Journal of Neuroscience. 2006;26:1833–1843. doi: 10.1523/JNEUROSCI.4584-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Tuttle AH, Molinaro MJ, Jethwa JF, Sotocinal SG, Prieto JC, Styner MA, Mogil JS, Zylka MJ. A deep neural network to assess spontaneous pain from mouse facial expressions. Molecular Pain. 2018;14:174480691876365. doi: 10.1177/1744806918763658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Vardeh D, Mannion RJ, Woolf CJ. Toward a Mechanism-Based approach to pain diagnosis. The Journal of Pain. 2016;17:T50–T69. doi: 10.1016/j.jpain.2016.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. von Ziegler L, Sturman O, Bohacek J. Big behavior: challenges and opportunities in a new era of deep behavior profiling. Neuropsychopharmacology. 2020;475:7. doi: 10.1038/s41386-020-0751-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Wiltschko AB, Johnson MJ, Iurilli G, Peterson RE, Katon JM, Pashkovski SL, Abraira VE, Adams RP, Datta SR. Mapping Sub-Second structure in mouse behavior. Neuron. 2015;88:1121–1135. doi: 10.1016/j.neuron.2015.11.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Woolf CJ. Overcoming obstacles to developing new analgesics. Nature Medicine. 2010;16:1241–1247. doi: 10.1038/nm.2230. [DOI] [PubMed] [Google Scholar]

Decision letter

Editor: Rebecca Seal1
Reviewed by: Rebecca Seal2, Andrew Shepherd

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Decision letter after peer review:

[Editors’ note: the authors submitted for reconsideration following the decision after peer review. What follows is the decision letter after the first round of review.]

Thank you for submitting your work entitled "A machine-vision approach for automated pain measurement at millisecond timescales" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and a Senior Editor, one of whom is a member of our Board of Reviewing Editors. The following individuals involved in review of your submission have agreed to reveal their identity: Rebecca Seal (Reviewer #1); Andrew Shepherd (Reviewer #2).

Our decision has been reached after consultation between the reviewers. Based on these discussions and the individual reviews below, we regret to inform you that your work will not be considered further for publication in eLife.

Specifically, while an automated, objective approach to measure evoked mechanical pain behavior in rodents will be a highly significant contribution to the pain field and the authors have presented solid efforts toward this goal, several critical aspects of the PAWS system and its validation that would successfully elevate the approach to a level above the manual one published previously (Abdus-Saboor et al., t) remain underdeveloped or missing. (1) A clear, well-defined and easy to understand method for reporting the response using PAWS is lacking. (2) The point of measuring SJL and 129S1 strains as extremes using PAWS was weakened by the absence of measuring traditional von Frey and also motor function. (3) While the reviewers understand the appeal of the eDREADD approach to identify the role of specific brain circuits in pain, a classic inflammatory pain model to demonstrate the detection of hypersensitivity by PAWS would be more straightforward and have broader utility. (4) Many of the quantitative aspects of foundational measures used to interpret PAWS were lacking, for example, how non-overlapping response is defined in the LDA and statistical analysis for interpretations of significance. Please also see the full Comments from the Reviewers, included below. We do appreciate that you could make revisions that would address some of the reviewers' specific comments, but our determination is that addressing these concerns would be beyond the scope of a revision at eLife.

Reviewer #1:

This work described in the manuscript by Jones, Foster, et al., describes an automated protocol to quantify and interpret (based on 8 components of the paw trajectory: 4 pre and 4 post) the response of mice to noxious and innocuous mechanical stimuli applied to the plantar hind paw using high speed video recordings. The protocol is designed to objectively report whether the mouse interprets the stimuli as non-painful or painful as well as the severity or intensity of the pain. Mouse strains that show extremes in their response to the stimuli (hypo and hyper) and also eDREADD activation of the pain neuronal ensemble within the amygdala were used to validate whether the protocol is able to accurately quantify and interpret a range of stimuli.

Comments:

The development of an easily accessible, automated and objective way to measure evoked mechanical pain in rodents will be extremely valuable for the field, as will delineating affective versus discriminative components (and in the context of chronic pain) and this is a good first step.

1) Authors should discuss variability in PAWs in relation to the variability inherent in the methods used to simulate the paw.

2) PAWS data in Figure 4 are showing significant differences between CS+DB and LP+HP in the scoring but the LD1/LD2 analyses extract differences in the LP and HP for post-peak features. A deeper explanation of how data are to be reported be interpreted would be helpful.

3) Activation of the pain ensemble with the DREADD and CNO evokes a pain like behavior. I would have expected that the LP would have resulted in greater shaking and guarding behavior than the -CNO control. Do the authors have an explanation?

4) Authors should discuss the interpretation for how mice like the CBA strain, which seem hypersensitive in the traditional testing method but are apparently within range by PAWS, compare to the SJL mice, which were not tested by the traditional method but are deemed hypersensitive by PAWS.

5) The 0.7 probability correct shown in Figure 5 and Figure 6 as being sufficient should be interpreted for the reader a bit more than what is provided in subsection “Automated scoring of rapid paw dynamics and lingering pain behaviors”. Related to this: There are no stats for these comparisons.

6) The rationale for mentioning a potential relationship to aggressive behavior in the Discussion section is not well-developed. What about a relationship of PAWS outcomes to the magnitude of a startle reflex?

Reviewer #2:

In this article, Jones et al. describe an automated assessment of tactile sensitivity. They report development and validation of a novel combination of high-speed videography and automated paw tracking. With this resource, they demonstrate that paw withdrawal to innocuous versus noxious stimuli can be separated in six inbred mouse strains. Using this system, they also present evidence that activating an ensemble of basolateral amygdala neurons during noxious stimulation changes paw withdrawal metrics. This approach represents a significant advance in behavioral assessment of tactile sensitivity, with the potential to contribute to much-needed discoveries in this domain.

Major comments:

This manuscript builds upon prior machine vision-based approaches in other animal models. High-speed videography of such withdrawal responses has been attempted before, but the demonstration of the ability to detect chemogenetic activation of pain aversion neurons in the amygdala and the validation across multiple inbred strains are new developments. The importance of this approach stems from its potential to address previously challenging problems, i.e. high-throughput, objective assessment of sensory and affective components of innocuous touch and noxious stimuli. PAWS has the potential to drastically improve the resolution and dimensionality of rodent pain scoring when compared to subjective 'yes/no' withdrawal scoring and threshold calculation, which has been the standard in the field for several decades.

1) Chemogenetic activation of a BLA pain ensemble is a worthwhile experiment, but the rationale for choosing this particular experiment wasn't particularly clear in the manuscript. Many potential experiments could have been proposed that would target some aspect of the pain neuraxis, with the hypothesis that a shift in sensitivity would be detected by the PAWS system. Why did the authors opt for chemogenetic activation of a BLA ensemble? Furthermore, would an equivalent experiment using an inhibitory DREADD have been likely to tell us anything further? Can these experiments tell us anything about the affective component of a response to a noxious stimulus in terms of paw withdrawal responses?

2) It would benefit the readers to state more clearly that analgesic reversal of pain-related changes to paw withdrawal were detected in Abdus-Saboor et al., (2019), wherein a software-assisted, manual scoring system of similar indices was used. Reversal of changes in paw withdrawal-associated behaviors is a robust indicator that they are pain-related.

3) In Figure 6, the authors mention two outlier strains (129S1 and SJL), which exhibit unusually low and high degrees of sensitivity. I applaud the authors for the inclusion of these datasets; it benefits the research community to describe the limitations and caveats of a new assay as early and as comprehensively as possible. However, are these shifts in sensitivity in 129S1 and SJL seen with conventional von Frey hair testing, as reported for the other strains in Figure 1B?

Whether or not von Frey data 'match' the PAWS data for 129S1 and SJL mice, it is likely to be instructive. A match between von Frey and PAWS data would be further validation of the accuracy of the PAWS system, and a discrepancy would raise the possibility that PAWS is extracting information from a multidimensional dataset which cannot be achieved with conventional von Frey assessment.

4) Related to the previous point, it is not clear if prior reports characterizing pain sensitivity in 129S1 versus other strains have also seen relative hyposensitivity. The discussion does mention that such strain differences have been assessed, but no direct comparisons are between these reports and the data in this manuscript are made.

5) In subsection “Statistical modeling with linear discriminant analyses separates touch versus pain across six inbred mouse strains” the authors note that pre-peak paw movements are nocifensive in nature, whereas those behaviors seen post-peak (shaking/guarding) are supraspinal in origin. I agree that this distinction seems plausible, given the timescales involved, but are there data in the literature to support this? Unless this is founded upon findings from prior studies, this comment might be more at home in the Discussion section.

Reviewer #3:

While I appreciate the importance of what the authors are seeking to accomplish in this study, it is not obvious how PAWS (Pain Assessment at Withdrawal Speeds), which scores eight defined behavioral endpoints, can be easily used by researchers in the pain field to quantitatively evaluate the magnitude of pain or to accurately predict the pain state. The authors convincingly show that a painful pinprick evokes a withdrawal response that looks very different from a non-painful response in terms of trajectory and pattern. A more straightforward approach would thus be to develop an algorithm that captures and classifies these trajectory and pattern differences over time. This would likely generate a more accurate and simple classifier relative to what the authors did-breaking these distinct motions into component variables like X and Y velocity, paw height, etc. The authors should seek to capture this visual difference in paw withdrawal trajectories with a single metric, and then show that this metric is scalable based on pain intensity. Without such a measure, the current implementation of PAWS is of limited general use for those who are interested in studying pain in mice. For example, what endpoint(s) is someone in the field supposed to use when studying pain and responses to analgesics? Y-velocity? X-velocity? # paw shakes? Paw height? All of these variables? Two of these variables? Three? Four? It is confusing and not simplified.

The title of the paper implies the machine vision approach is automated; however, this does not appear to be the case. The first step in this process is to manually label the center of the stimulated paw. This is thus more akin to a "semi-automated" approach.

Time of the first withdrawal peak (t*) is a critical variable in their analytic pipeline. However, it is unclear precisely how t* is calculated or defined. In the paper, authors write that t* is the time leading up to the initial paw peak. And in Figure 3A, since this is a 3d graph in 2d, it is unclear where t* is relative to the trajectory data.

From the LDA analyses, the authors state that the low and high pain stimuli separate from the no-pain stimuli (Figure 5A,B). However, I do not see a clear separation between these groups in the figure. Instead there appears to be significant overlap, which raises the question as to how specific the LDA analysis is at discriminating, in a quantitative manner, the magnitude of a pain response.

Figure 1B. The paw withdrawal frequency data has no error bars and the number of mice used to generate these data is not indicated. The authors write that strains differ in some of these assays but provide no statistics to confirm that the differences shown are statistically significant. Moreover, the authors are encouraged to consult and cite work by Jeff Mogil's group who evaluated mechanical sensitivity in different mouse strains many years ago.

Figure 3B and C are difficult to interpret. Provide more details in legend and in the figure itself. Ex. in B, are those lines with two arrow heads? What does the length of the line and angle of the line mean? And in C, how was the shaking vs guarding bout determined? Was this done by a human or did the algorithm make these assessments in an unbiased manner?

Figure 6. Authors state 129 and SJL mice are outliers, but based on data presented in this figure, it is hard to appreciate how exactly they are outliers. In panel B, 129 mice show a similar probability correct relative to all other strains, and SJL error bounds largely overlap the other strains. An outlier is typically defined as being two or more standard deviations from the mean.

Moreover, the atypical withdrawal response may have nothing to do with a pain hyposensitivity phenotype, as the authors assert. Instead, these strains may simply have motor deficits that prevent them from performing more vigorous/elaborate paw withdraw responses.

The use of chemogenetic amygdala stimulation to demonstrate the efficacy of PAWS for detecting hypersensitivity to noxious stimuli seems out of place in this study. A simpler, more straightforward, and more broadly applicable (for pain field) approach would be to inflame hindpaw with complete freunds adjuvant and/or perform a nerve injury surgery. These are commonly used ways of inducing pain hypersensitivity in the field, and hence as a first test, it will be important to show that PAWS can detect this form of hypersensitivity.

Moreover, it will be important to show that PAWS can detect graded changes in pain hypersensitivity, such as in response to a known analgesic.

eLife. 2020 Aug 6;9:e57258. doi: 10.7554/eLife.57258.sa2

Author response


[Editors’ note: The authors appealed the original decision. What follows is the authors’ response to the first round of review.]

Specifically, while an automated, objective approach to measure evoked mechanical pain behavior in rodents will be a highly significant contribution to the pain field and the authors have presented solid efforts toward this goal, several critical aspects of the PAWS system and its validation that would successfully elevate the approach to a level above the manual one published previously (Abdus-Saboor et al., t) remain underdeveloped or missing.

(1) A clear, well-defined and easy to understand method for reporting the response using PAWS is lacking.

We want to thank the referees for this suggestion – which has led us to completely rework the statistical basis of the PAWS analysis and software, focused on producing a simple univariate pain scale.

To achieve this, we have completely removed the two-dimensional linear discriminant analysis (LD1/LD2). Instead, we develop a simpler, univariate measure of pain intensity that can be automatically scored from the motions tracked in the videography. The technical term for the underlying statistical method is an ordinal logistic regression. The regression is still based on underlying features of paw movement (X and Y velocity, max paw height etc) extracted automatically from the videography – but the upshot is a simple univariate measure of pain intensity.

Using a single, univariate measure has tremendously simplified our analyses and made the results easier to interpret. And we find that this simple univariate pain scale is even slightly better at discriminating between pain and no-pain stimuli as compared to the more complicated, two-dimensional analysis in our original submission.

Once again, we thank the referee for pushing us in this direction, which has led to a different implementation of PAWS that is easier to interpret and to generalize.

(2) The point of measuring SJL and 129S1 strains as extremes using PAWS was weakened by the absence of measuring traditional von Frey and also motor function.

This is a wonderful point and we have now added the traditional scoring method with the SJL strain (Figure 6A) and the 129S1 strain (Figure 1B). This clearly shows that using the PAWS system provides more information about pain sensitivity than the traditional method. Moreover, another referee pointed out, strain 129 was not statistically different than other strains in our original analyses. We have revisited this question in the revised analyses, using a univariate pain scale. And the referee is correct: 129 is not statistically different from other strains in our experiments and analyses, as reflected in the updated manuscript. (Whereas strain SJL remains a strong outlier in pain hypersensitivity).

(3) While the reviewers understand the appeal of the eDREADD approach to identify the role of specific brain circuits in pain, a classic inflammatory pain model to demonstrate the detection of hypersensitivity by PAWS would be more straightforward and have broader utility.

We appreciate this point and in the text we have clarified why we needed to use an excitatory DREADD approach with fos-trap2 mice to address our question. We were not simply trying to determine if PAWS can detect increasing pain levels, as we have already demonstrated this earlier in the paper. We were attempting to determine if PAWS could detect behaviors driven specifically by central input and causing a peripheral injury with a classic inflammatory model would obscure this goal.

Nonetheless, the reviewers comments about the utility of seeing how PAWS performs with a common inflammatory pain model are valid, and thus we performed these assays with 10 C57BL/6 mice. Admittedly, as described below and now included in the main text of the manuscript with accompanying supplemental figure, we did not observe the expected results. In an attempt to be fully transparent and not hide a negative result, we would like to publish these findings as a supplemental figure and we hope the reviewers agree that this is appropriate. We presume that a spontaneous pain detection using deep learning approaches could be valuable with the CFA model, but such a project is beyond the scope of this work. Nonetheless, the PAWS platform presented here, in our opinion, still represents a major leap forward in measuring peripherally and centrally driven mechanical pain in mice without inflamed paws.

New text:

“Next, we used the PAWS system to quantify pain-related paw movement in mice treated with a common inflammatory pain agent, the complete Freund’s adjuvant (CFA). To accomplish this, we used 20 mice (5 each of the C57BL/6, 129S1, Balb/c, and A/J strains chosen randomly) and applied the innocuous dynamic brush and the 4g von Frey Hair (VFH) filament before and 48 hours after unilateral hind paw injection of CFA. Since dynamic brush is not painful and 4g VFH lies close to the boundary separating painful and non-painful responses, we reasoned that we would observe a mechanical allodynia phenotype with increased PAWS values [10, 19]. However, we did not observe a significant increase in the PAWS measurements following CFA (Figure 4 —figure supplement 3). In fact, following CFA the responses to dynamic brush were slightly decreased, and responses to 4g VFH, although greater than to dynamic brush, nevertheless fell near the threshold separating touch and pain at baseline (Figure 4 —figure supplement 3) (Figure 4—source data 2). While performing these assays we noted that the CFA-injected paw was red, inflamed, and swollen and the mice tried not to move the injured paw whatsoever. Thus, for peripheral manipulations that cause animals to drag heavy and swollen limbs, PAWS may not be a suitable system to detect pain hypersensitivity. Measuring thermal hyperalgesia with the Hargreaves assay, or detecting changes in mechanical threshold with VFHs, may be the most suitable method of quantifying CFA-induced pain hypersensitivity [20, 21].”

(4) Many of the quantitative aspects of foundational measures used to interpret PAWS were lacking, for example, how non-overlapping response is defined in the LDA and statistical analysis for interpretations of significance. Please also see the full Comments from the Reviewers, included below. We do appreciate that you could make revisions that would address some of the reviewers' specific comments, but our determination is that addressing these concerns would be beyond the scope of a revision at eLife.

We agree: it was unclear, both because the ellipses on the original figures were overlapping, and because the two-dimensional (LD1/LD2) analysis was confusing. We have completely reworked the PAWS analysis pipelines using a univariate measure instead.

Using the revised, univariate measure we report the discriminatory power to distinguish between pain (HP and LP treatments) and no-pain (DB and CS) treatments, which is 85% on average. Additionally, we report Leave-One-Out (LOO) cross-validation of pain / nonpain discriminatory power, which was 84%, with [79%,88%] nonparametric bootstrap 95% CI. And we also separately report the power to determine whether the stimuli treatment was no-pain (DB and CS) versus low pain (LP) versus high pain (HP), which is 69% on average (LOO 66%, with [60%,72%] 95% CI).

The revised figures show the clear separation between painful and non-painful stimuli along a unidimensional axis.

Although the revised main text focuses on the binary discrimination task between pain and no-pain stimuli, the same univariate pain scale be used further to discriminate between light pain (light pinprick stimulus) and heavy pain (heavy pinprick treatment). To demonstrate this, we include a supplementary figure that shows the accuracy of this trinary discrimination task.

Reviewer #1:

This work described in the manuscript by Jones, Foster, et al., describes an automated protocol to quantify and interpret (based on 8 components of the paw trajectory: 4 pre and 4 post) the response of mice to noxious and innocuous mechanical stimuli applied to the plantar hind paw using high speed video recordings. The protocol is designed to objectively report whether the mouse interprets the stimuli as non-painful or painful as well as the severity or intensity of the pain. Mouse strains that show extremes in their response to the stimuli (hypo and hyper) and also eDREADD activation of the pain neuronal ensemble within the amygdala were used to validate whether the protocol is able to accurately quantify and interpret a range of stimuli.

Comments:

The development of an easily accessible, automated and objective way to measure evoked mechanical pain in rodents will be extremely valuable for the field, as will delineating affective versus discriminative components (and in the context of chronic pain) and this is a good first step.

1) Authors should discuss variability in PAWs in relation to the variability inherent in the methods used to simulate the paw.

This point is now added in the Discussion section.

2) PAWS data in Figure 4 are showing significant differences between CS+DB and LP+HP in the scoring but the LD1/LD2 analyses extract differences in the LP and HP for post-peak features. A deeper explanation of how data are to be reported be interpreted would be helpful.

We have completely re-worked the PAWS analysis and software to produce a simple, univariate measure of pain, derived from the features extracted from high-speed video. The paper now directly addresses the ability of this scale to detect pain vs no-pain, and also, separately, high pain (HP) vs low pain (LP).

3) Activation of the pain ensemble with the DREADD and CNO evokes a pain like behavior. I would have expected that the LP would have resulted in greater shaking and guarding behavior than the -CNO control. Do the authors have an explanation?

This is an astute observation made by the reviewer and we have now added this explanation in the Discussion section:

“Based on our single-neuron microendoscope calcium imaging data in the basolateral amygdala (Corder, 2019), the nociceptive neurons respond only to stronglynoxious sensory stimuli. Only after a peripheral nerve damage to induce chronic neuropathic pain, did we observe that formerly-innocuous stimuli engage this BLA ensemble. This suggests that some up-circuit plasticity occurred that re-directed touch information into the BLA ensemble (e.g. opening of the spinal “gate”, see Torsney and MacDermott, 2006). Thus, without this central plasticity the LP stimulus likely does not activate the ensemble under normal, uninjured conditions. Therefore, chemogenetic activation of the nociceptive ensemble is not necessarily predicted to amplify or modulate neural processes related to non-noxious stimuli. We did observe some spontaneous lifting and licking of the hindpaws upon CNO treatment, which could reflect general aversive perception, but how this might alter withdrawal reflexes that use distinct neural circuits is unclear. The fact that we observed a robust increase in paw shaking and guarding-related behaviors supports the conclusion that the BLA nociceptive ensemble is specifically tuned to encode and modulate nociception only.”

5) The 0.7 probability correct shown in Figure 5 and Figure 6 as being sufficient should be interpreted for the reader a bit more than what is provided in subsection “Automated scoring of rapid paw dynamics and lingering pain behaviors”. Related to this: There are no stats for these comparisons.

We have updated the text to clarify these reported results. In the revised version of PAWS, based on a univariate pain scale, we can discriminate no-pain vs pain treatments with 85% accuracy, in a model fitted to all the data.

Later in the manuscript, we also report the “leave-one-out cross-validated” accuracy, which involves omitting each mouse (or an entire strain) when fitting the model, and then attempting to predict the pain treatment of the mouse (or strain) that had been left out. In cross validation our accuracy is slightly reduced – to about 84% (when leaving out a single mouse) or 81% (when leaving out an entire strain), with nonparametric bootstrap 95% confidence intervals of [79%,88%] and [77%,86%], respectively. These results reflect the accuracy that researchers can expect to achieve when scoring pain treatments of mice (or strains) not included in the model fitting.

6) The rationale for mentioning a potential relationship to aggressive behavior in the Discussion section is not well-developed. What about a relationship of PAWS outcomes to the magnitude of a startle reflex?

This is good point raised by the reviewer and we have updated the discussion to further develop the aggression linkage, and we have included the reviewer’s idea about potential startle reflexes in the SJL strain driving increased measurements with PAWS.

Reviewer #2:

In this article, Jones et al. describe an automated assessment of tactile sensitivity. They report development and validation of a novel combination of high-speed videography and automated paw tracking. With this resource, they demonstrate that paw withdrawal to innocuous versus noxious stimuli can be separated in six inbred mouse strains. Using this system, they also present evidence that activating an ensemble of basolateral amygdala neurons during noxious stimulation changes paw withdrawal metrics. This approach represents a significant advance in behavioral assessment of tactile sensitivity, with the potential to contribute to much-needed discoveries in this domain.

Major comments:

2) It would benefit the readers to state more clearly that analgesic reversal of pain-related changes to paw withdrawal were detected in Abdus-Saboor et al., (2019), wherein a software-assisted, manual scoring system of similar indices was used. Reversal of changes in paw withdrawal-associated behaviors is a robust indicator that they are pain-related.

It is not clear to us what the reviewer is requesting here in referencing Abdus-Saboor et al., (2019). In that context, we used a CFA model and activated polymodal nociceptors, and while giving a cocktail of an opioid and anti-inflammatory agent, we observed that this analgesic cocktail reduced optogenetically induced pain behaviors. We don’t use a similar model in this work, and thus the instructions here are not clear. However, based on Abdus-Saboor et al., (2019)and work from many colleagues in the field, we believe it is clear that the features that we automatically measure with PAWS (paw shaking, paw guarding, paw height, paw velocity) are indeed pain-related.

5) In subsection “Statistical modeling with linear discriminant analyses separates touch versus pain across six inbred mouse strains” the authors note that pre-peak paw movements are nocifensive in nature, whereas those behaviors seen post-peak (shaking/guarding) are supraspinal in origin. I agree that this distinction seems plausible, given the timescales involved, but are there data in the literature to support this? Unless this is founded upon findings from prior studies, this comment might be more at home in the Discussion section.

This is a good point made by the reviewer and we have moved this to the Discussion section instead of the results.

Reviewer #3:

Without such a measure, the current implementation of PAWS is of limited general use for those who are interested in studying pain in mice. For example, what endpoint(s) is someone in the field supposed to use when studying pain and responses to analgesics? Y-velocity? X-velocity? # paw shakes? Paw height? All of these variables? Two of these variables? Three? Four? It is confusing and not simplified.

Yes, we agree: it was confusing. In the revised work there is a single, simple endpoint: a univariate pain scale, which is automatically computed for each mouse.

As we show in the revision, this univariate pain scale allows for accurate scoring of painful versus non-painful stimuli (85% accuracy).

It is still true that the computation of this univariate pain scale combines several behavioral features like x-velocity and #paw shakes, etc – but these features are automatically extracted by the software and the resulting endpoint is a simple, univariate pain scale.

The title of the paper implies the machine vision approach is automated; however, this does not appear to be the case. The first step in this process is to manually label the center of the stimulated paw. This is thus more akin to a "semi-automated" approach.

We respectfully disagree with the reviewer about the manuscript’s title, as the PAWS software automatically measures a pain scale using machine-vision approaches to track the paw. All such machine-vision approaches require some initial input to indicate the object being visualized, but there is no manual annotation of features or manual scoring of pain.

Time of the first withdrawal peak (t*) is a critical variable in their analytic pipeline. However, it is unclear precisely how t* is calculated or defined. In the paper, authors write that t* is the time leading up to the initial paw peak. And in Figure 3A, since this is a 3d graph in 2d, it is unclear where t* is relative to the trajectory data.

Thank you for asking us to clarify. The revised manuscript defines t* clearly: it is the time of the first peak in paw height (y-axis). All times following t* are called “post-peak”, and all times prior to t* are called “pre-peak”.

We agree that time t* is difficult to visualize directly, but the definition itself is clear (and it is also codified concretely in the software code that accompanies the paper, which will be made freely available to all researchers).

From the LDA analyses, the authors state that the low and high pain stimuli separate from the non-pain stimuli (Figure 5A,B). However, I do not see a clear separation between these groups in the figure. Instead there appears to be significant overlap, which raises the question as to how specific the LDA analysis is at discriminating, in a quantitative manner, the magnitude of a pain response.

Figure 1B. The paw withdrawal frequency data has no error bars and the number of mice used to generate these data is not indicated. The authors write that strains differ in some of these assays but provide no statistics to confirm that the differences shown are statistically significant.

Thank you, the revised work includes N numbers of mice used in the figure and figure legend and a direct statistical test that shows a significant difference in the accuracy of determining pain stimulus in the SJL strain versus other strains (significant difference in mean classification accuracy between SJL and DBA1; one-sided Welch two sample t-test t=2.9406, df=36.894, p=0.003).

Moreover, the authors are encouraged to consult and cite work by Jeff Mogil's group who evaluated mechanical sensitivity in different mouse strains many years ago.

We have consulted and cited Jeff Mogil’s work in relation to genetic strain differences (see Discussion section).

Figure 3B and C are difficult to interpret. Provide more details in legend and in the figure itself. Ex. in B, are those lines with two arrow heads? What does the length of the line and angle of the line mean? And in C, how was the shaking vs guarding bout determined? Was this done by a human or did the algorithm make these assessments in an unbiased manner?

Periods of guarding and shaking (and also the number of shakes) are determined automatically by the software. Indeed, the entire PAWS platform is automatic with no human input required.

We have clarified the definition of shaking vs guarding periods in the caption to Figure 3. But the basic idea is simple: shaking is a series of back-and-forth movements of the paw in the principle direction of movement, whose distance exceeds 35% of the first paw peak height; and all other periods of time outside of these shakes are scored as guarding.

Yes, the arrows in Figure 3B have arrowheads on both sides: they represent the vector (in x-y plane) of principle movement for that window of time. The arrows have constant length. We have clarified caption.

Figure 6. Authors state 129 and SJL mice are outliers, but based on data presented in this figure, it is hard to appreciate how exactly they are outliers. In panel B, 129 mice show a similar probability correct relative to all other strains, and SJL error bounds largely overlap the other strains. An outlier is typically defined as being two or more standard deviations from the mean.

You are absolutely correct: strain 129 was not in fact a statistical outlier, and so we have removed any such claims or discussion from the manuscript altogether. Strain SJL is indeed an outlier (it falls outside the 95% CI for other strains, and a t-test comparing to strain DBA1 is significant; one-sided Welch two sample t-test t=2.9406, df=36.894, p=0.003).

Moreover, the atypical withdrawal response may have nothing to do with a pain hyposensitivity phenotype, as the authors assert. Instead, these strains may simply have motor deficits that prevent them from performing more vigorous/elaborate paw withdraw responses.

We agree, and thank for pointing this out. It is possible that abnormal pain responses result from primary differences outside of the pain neuroaxis. Further studies will need to be performed to determine if this is the case, as described in the revised manuscript.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 4—source data 1. Raw data values automatically computed from PAWS showing each mouse and stimulus across all eight inbred lines tested.

    p=pain, NP = no pain.

    elife-57258-fig4-data1.xlsx (198.9KB, xlsx)
    Figure 4—source data 2. Raw data values automatically computed from PAWS showing each mouse and stimulus across four inbred lines tested before and after unilateral CFA paw injection.

    p=pain, NP = no pain.

    Figure 7—source data 1. Raw data values automatically computed from PAWS showing each painTRAP2hM3 mouse combined with sensory stimuli before and after CNO administration.

    p=pain, NP = no pain.

    Transparent reporting form

    Data Availability Statement

    Raw data are associated with figures as source data.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES