Abstract
Background
Wearables have been gaining increasing momentum and have enormous potential to provide insights into daily life behaviors and longitudinal health monitoring. However, to date, there is still a lack of principled algorithmic framework to facilitate the analysis of actigraphy and objectively characterize day-by-day data patterns, particularly in cohorts with sleep problems.
Objective
This study aimed to propose a principled algorithmic framework for the assessment of activity, sleep, and circadian rhythm patterns in people with posttraumatic stress disorder (PTSD), a mental disorder with long-lasting distressing symptoms such as intrusive memories, avoidance behaviors, and sleep disturbance. In clinical practice, these symptoms are typically assessed using retrospective self-reports that are prone to recall bias. The aim of this study was to develop objective measures from patients’ everyday lives, which could potentially considerably enhance the understanding of symptoms, behaviors, and treatment effects.
Methods
Using a wrist-worn sensor, we recorded actigraphy, light, and temperature data over 7 consecutive days from three groups: 42 people diagnosed with PTSD, 43 traumatized controls, and 30 nontraumatized controls. The participants also completed a daily sleep diary over 7 days and the standardized Pittsburgh Sleep Quality Index questionnaire. We developed a novel approach to automatically determine sleep onset and offset, which can also capture awakenings that are crucial for assessing sleep quality. Moreover, we introduced a new intuitive methodology facilitating actigraphy exploration and characterize day-by-day data across 49 activity, sleep, and circadian rhythm patterns.
Results
We demonstrate that the new sleep detection algorithm closely matches the sleep onset and offset against the participants' sleep diaries consistently outperforming an existing open-access widely used approach. Participants with PTSD exhibited considerably more fragmented sleep patterns (as indicated by greater nocturnal activity, including awakenings) and greater intraday variability compared with traumatized and nontraumatized control groups, showing statistically significant (P<.05) and strong associations (|R|>0.3).
Conclusions
This study lays the foundation for objective assessment of activity, sleep, and circadian rhythm patterns using passively collected data from a wrist-worn sensor, facilitating large community studies to monitor longitudinally healthy and pathological cohorts under free-living conditions. These findings may be useful in clinical PTSD assessment and could inform therapy and monitoring of treatment effects.
Keywords: actigraphy, sleep, Geneactiv, posttraumatic stress disorder, wearable technology
Introduction
Background
Posttraumatic stress disorder (PTSD) is a mental disorder with lifetime prevalence ranging from 1.9% [1] to 8.8% [2]. However, these figures are considerably exacerbated in conflict, torture, and rape survivors [3,4]. PTSD may develop following exposure to a traumatic event, which is defined as exposure to actual or threatened death, serious injury, or sexual violence in the Diagnostic and Statistical Manual of Mental Disorders 5th edition (DSM-5) [5]. It is characterized by four symptom categories: (1) re-experiencing symptoms associated with the trauma, which include intrusive memories of the event, (2) avoidance of stimuli associated with the trauma, (3) negative alterations in cognition and mood, and (4) alterations in arousal.
Sleep disturbances have been termed the hallmark of PTSD [6] but are still poorly understood. They often have a chronic course and are among the most distressing symptoms [7]; these include difficulties falling and staying asleep [8] and nightmares [9]. Poor sleep has been associated with daytime PTSD symptom exacerbation and severity [10,11], indicating a potentially vicious maintenance cycle between sleep disturbances and PTSD symptoms [11], and highlights the importance of investigating sleep to potentially inform targeted treatment [12]. Crucially, from the perspective of this study, the monitoring environment for symptom assessment is critical: PTSD patients report sleeping better in lab settings than at home [13], where problems are more commonly detected [14,15]. This emphasizes the importance of conducting community studies under free-living conditions instead of lab-based assessments. Moreover, given that PTSD is a chronic condition, it would be desirable to develop low-cost monitoring systems that could provide insight into the patient’s daily behaviors longitudinally; ideally, such systems would require minimum input from participants.
Longitudinal monitoring of mental disorders is typically achieved using the patient-reported outcome measures (PROMs), where participants are prompted to complete standardized validated questionnaires capturing generic or disease-specific symptoms [16-18]. For example, sleep disturbances are common in mental disorders such as depression and PTSD and cause considerable distress and disability. Sleep diaries are recommended as the gold standard for subjective prospective sleep monitoring [19]. Sleep diaries are typically completed over 1 week, asking participants to report the following (among others): time to bed, how long it took to fall asleep, length and number of awakenings, wake-up time, and sleep quality [19]. However, sleep diaries are inherently limited by relying on an individual’s ability to estimate their own sleep times [20]; similar to PROMs, they may be subject to recall bias [21].
Therefore, although self-assessments have merit, they are by nature subjective and may not be easily comparable across individuals [22]. On the other hand, objective monitoring of daily behaviors may offer an additional dimension to understanding pathologies (here, PTSD) using passively collected data. In addition to overcoming the aforementioned problems, sensor data can be streamlined and facilitate direct comparison across participants. There have been considerable developments in the wearable sensor market over the last 5 years; in particular, wrist-worn sensors are widely available and are becoming increasingly affordable. Some have functionalities such as displaying time and communicating with smartphones to convey notifications; in short, these devices are becoming attractive beyond a research exercise, and they are generally embraced by the public. Although many companies have a business model where they employ proprietary algorithms and only offer access to processed data (eg steps, number of stairs, and sleep characteristics such as sleep onset and sleep offset), there are some devices that offer access to the raw actigraphy (3D acceleration) data, often complemented with additional modalities such as light, temperature, and heart rate. Overall, the raw signals from the wrist-worn sensor provide a rich source of information and researchers can develop algorithms tailored to the application at hand (often fusing different modalities). For example, actigraphy can be used to objectively quantify physical activity, which can be used as a biomarker for well-being [23]. Although it is currently not possible to obtain accurate estimates of sleep architecture and sleep stages using actigraphy, it provides a good measure of general sleep characteristics including sleep efficiency, total sleep time, sleep onset latency, and awakenings after sleep onset times [24]. Recent work emphasizes participant adherence and the potential of using wrist-worn accelerometers to collect actigraphy data in large-scale population studies [25]. Actigraphy has been used in people with PTSD [10,26] and has shown consistency with sleep diary reports in a PTSD population [27].
Unfortunately, the research literature discussing algorithmic tools to process the raw actigraphy data is limited and fragmented; there is no agreed data-processing protocol and frequently, actigraphy analysis is completed manually relying on visual inspection. This is because studies often have very different focus in terms of applications, for example, to assess general physical activity [25,28] or sleep timings [29], and it is still early days for trying to standardize the commonly extracted features to characterize actigraphy data. Most studies focus on developing and validating actigraphy algorithms using only healthy control cohorts. However, it has been shown across a range of applications that developed algorithms on healthy controls may fail to generalize sufficiently well in cohorts with pathologies [30-32].
Objectives
This study offers a principled framework summarizing many of the known algorithms and introducing new algorithmic approaches to extract potentially useful information from the actigraphy data. Although the application of this study is PTSD, the developed methodology is generic and, in principle, applicable to settings where actigraphy data are available. To the best of our knowledge, this is the first study to investigate a large set of activity, sleep, and circadian rhythm patterns using a relatively large number of both healthy controls and a mental disorders group where participants commonly exhibit severely fragmented sleep and sleep disturbances.
The aims of this study were to (1) develop an automated approach to accurately determine sleep onset, sleep offset, and awakenings from passively collected data using a wrist-worn sensor targeting a cohort in which sleep is often disturbed, validating findings against participants’ sleep diaries; (2) explore differences in these sleep characteristics under free-living conditions between PTSD participants, traumatized controls, and nontraumatized controls; and (3) methodologically contribute toward a large set of activity, sleep, and circadian rhythm patterns to objectively characterize daily behaviors and develop a simple to use package to facilitate actigraphy analysis in MATLAB (The MathWorks Inc).
Methods
Study Cohort
The study cohort comprised 115 participants: 30 nontraumatized controls, 43 trauma-exposed without PTSD, and 42 with PTSD. Inclusion criteria for all groups were as follows: aged between 18 and 65 years, could read and write in English, had no history of or current bipolar or psychosis, no current substance or alcohol dependence, and if they were taking psychotropic medication they had been on a stable dose for at least two months. Additional inclusion criteria for the PTSD group were a traumatic event experience as defined by Criterion A of the DSM-5 [5] and a current diagnosis of PTSD. Inclusion criteria for the control group were no experience of a Criterion A traumatic event and no current mental health problems. Inclusion criteria for the trauma control group were a Criterion A traumatic event experience but not meeting the criteria for a current diagnosis of PTSD. Table 1 summarizes the demographic characteristics of the study cohort. The groups were age and gender matched, and among trauma survivors (n=85), there was no difference in trauma type (interpersonal vs noninterpersonal) between PTSD and trauma controls.
Table 1.
Demographics | Nontraumatized controls (n=30) | Trauma exposed controls (n=43) | Posttraumatic stress disorder (n=42) | |
Age (years), mean (SD) | 31.17 (10.38) | 34.02 (14.01) | 32.51 (9.93) | |
Females, n (%) | 23 (76.7) | 31 (72.1) | 26 (61.9) | |
Trauma typea, n (%) | ||||
|
Interpersonal | N/Ab | 14 (32.6) | 20 (47.6) |
|
Not interpersonal | N/A | 29 (64.4) | 22 (52.4) |
Time since trauma (years)c, mean (SD) | N/A | 10.12 (10.18) | 8.23 (9.71) |
aTrauma type and characteristics are only for trauma survivors (n=85).
bN/A: not applicable.
cTime since trauma was calculated as the time (years) from trauma to study participation date.
The study received National Health Service (NHS) ethical approval from the South-Central Oxford C Research Ethics Committee (Ref 14/SC/0198). Recruitment began in June 2014 and ended in April 2016. Control and traumatized control participants were recruited via advertisements within the University of Oxford on the departmental website and online community forums in Oxford. The Clinical-Administered PTSD Scale [33], the gold standard for the assessment of PTSD symptom severity, was conducted by EW or JS to establish a PTSD diagnosis according to DSM-5 [5] criteria. All participants completed a sleep questionnaire at trial onset and, over the course of a week, wore a wrist-worn device and kept a detailed sleep diary (see details below).
Sleep Questionnaire: Pittsburgh Sleep Quality Index
Sleep quality was assessed using the standardized Pittsburgh Sleep Quality Index (PSQI) questionnaire [34]. The PSQI is a self-report assessment comprising 19 items that are mapped onto seven components (each scored in the range 0 to 3): (1) subjective sleep quality, (2) sleep latency, (3) sleep duration, (4) habitual sleep efficiency, (5) sleep disturbances, (6) use of sleeping medication, and (7) daytime dysfunction. The sum of the 7 subscale component scores generates the total score, known as total PSQI, which has a range of 0 to 21; scoring above 5 is used as a standard threshold to indicate poor sleep [34]. The study participants self-assessed the occurrence of sleep disturbances over the previous month on a scale from 0 (not during the last month) to 3 (3 or more times a week) for each item. PSQI was collected once in this study and serves as an overall indication of sleep quality.
Sleep Diary
A sleep diary was used to prospectively monitor participants’ sleep over 7 days. Participants completed the diary in the morning answering questions about their sleep the previous night. Specifically, they recorded time they got into bed, time they started trying to sleep, sleep onset duration and wake-up time, and number of awakenings (including approximate times), known as wake after sleep onset (WASO). We clarify that by sleep onset, we used the time that participants recorded as falling asleep (ie, time they switched the lights out and started trying to sleep plus reported sleep onset latency). The sleep diary followed recommendations for sleep research [20] and for the prospective self-monitoring of sleep [19].
Wrist-Worn Device
Participants wore a triaxial accelerometer (Geneactiv, ActivInsights Ltd) on their nondominant wrist for 7 days, coinciding with the sleep diary recording. A recent study endorsed recording at least six nights of standard actigraphy measurements for a reliable measure of self-reported sleep [35]. The device recorded 3D acceleration data at a sampling rate of 100 Hz with a dynamic range of ±8 g (g is a gravity unit, 1 g=9.81 m/second²) and 12-bit resolution. In addition to triaxial acceleration, light (Lux) and wrist temperature (°C) were recorded. We obtained actigraphy data from 113 participants (the missing data were from 2 PTSD participants).
Methods to Process Data
The following sections describe the methodology used to process the wrist-worn data and statistical tools to compare cohorts.
Calibration and Data Preprocessing
Accelerometers convert mechanical force into electrical signals; in practice, they require calibration to ensure that their outputs are directly comparable. Electrical signals are considered a linear function of the acceleration, involving an offset and a gain factor. In some of the older devices, researchers developed algorithms to compute the calibration values [36]; for the Geneactiv used in this study, the manufacturer provides the offset and the gain factor for the 3 axes for each device. We applied the standard calibration procedure using the manufacturer provided offset and gain factors.
Subsequently, the data were resampled at 10 Hz to simplify further processing. Before extracting data patterns, we automatically detected nonwear times, so that these segments could be excluded from the analysis. The nonwear times were determined following a similar process to Zhou et al [37], marking nonwear periods lasting at least 15 min. The argument is that shorter nonwear periods are unlikely to have a marked effect on the results and attempting to determine very short nonwear times would increase the number of segments mistakenly assigned to be nonwear times.
Data Visualization and Exploration
We produced three main plot types to facilitate visualization and exploration of the data: (1) Data summary plot, simultaneously presenting all the raw signal modalities collected (3D acceleration, temperature, and light); (2) Actogram, where stacked plots depict activity over 24 or 48 consecutive hours (in the latter case, there is a 24-hour overlap between successive plots) on successive days; (3) Colored actogram, which is like an actogram, but here we express the average level of activity on a 10-min window using a color scale to facilitate direct comparison of activity across days.
A critical intermediate step before further visualization and processing of the data requires summarizing the raw triaxial data. Previous research suggests that there are different approaches but no unique single best way to summarize the activity [38,39]. Here, we define two simple summary measures to achieve this, the movement and the xyz variation variables, which are both expressing activity in 1-min time windows for time-efficient processing and visualization. The movement variable is defined as the square root of the sum of the squared successive differences of the triaxial acceleration data. This is similar to the standard Euclidean distance using the 3D data, with the additional twist that successive differences of the raw acceleration entries are used for each of the 3 axes. We have found that empirically this led to more visually appealing results as we are interested more in the changes of position in the 3D space. The movement variable is used to efficiently summarize the data and is the key variable we use to succinctly present activity. We also summarize the raw actigraphy data by introducing the xyz variation, which we defined as the rolling 10-min median of successive 1-min acceleration differences with a 90% overlapping window.
We have overlaid the nonwear duration times and sleep duration time (see the following section for details) with transparent color in the data summary plot and the actogram. The data summary plots were further annotated using the sleep diary data provided by the participants. In all cases, the aim was to identify trends in the data and develop an intuitive understanding of the continuity and stability of the emerging patterns.
Automatically Detecting Sleep
Previous research has proposed methods to automatically detect sleep using actigraphy, but these are typically evaluated only on healthy controls. Intuitively, we can consider that sleep can be fundamentally determined using actigraphy data on the basis of sustained inactivity. Different algorithms proposed in the literature essentially differ on how inactivity is quantified and the use of empirical thresholds, for example, see [29,40,41]. We remark that the latter two approaches were developed using counts, a proprietary device-specific estimate of activity. Although there has been a relatively recent attempt to provide backward compatibility with count-based schemes [42], in principle, it would be better to develop approaches using the raw actigraphy data as suggested by van Hees et al [29]. van Hees et al [29] proposed quantifying angular arm movement and assigning time segments to denote sleep when the angle is lower than 5 degrees for 5 successive minutes or more.
The sleep detection algorithm proposed in this study is somewhat more sophisticated. First, we computed the time segments that are considered sleep candidates on the basis of the following empirical rules that must be jointly true:
The rolling 10-min median movement variable is lower than 0.07.
The rolling 5-min average xyz variation is lower than 0.1.
The rolling 5-min average light is lower than 30 Lux.
Subsequently, we used a postprocessing approach where sleep candidate segments of at least 30 min in duration were joined if they differed by up to 30 min (to form a continuous sleep candidate segment). The intervening period was recorded as an awakening and used to characterize sleep. Finally, all sleep candidate segments less than 2 hours were removed (the threshold might be shortened if we were interested in capturing accurately shorter sleep during the day, at the risk that sedentary activity might be mislabeled as sleep). The proposed sleep detection approach requires the availability of ambient light data (some older actigraphy devices do not collect ambient light), but we have found that including this additional modality overcomes problems with sedentary activity which might otherwise be mislabeled as sleep.
Evaluation of Sleep Detection Accuracy
Lacking polysomnography (PSG), which is the gold standard method for thorough assessment in sleep medicine to determine sleep and sleep architecture [32], we evaluated the accuracy of the proposed sleep detection algorithm using the participants’ sleep diaries. We aimed to demonstrate its competitiveness against the algorithm by van Hees et al [29] (which had been previously validated against PSG). For the algorithm of van Hees et al [29], we used their implementation in the GGIR package [43]. In all cases, we report the difference between the algorithm’s estimate and the ground truth for the purpose of validation (sleep diary).
Characterizing Activity, Sleep, and Circadian Rhythm
The detailed equations for the computation of the extracted patterns are provided in Multimedia Appendix 1. The patterns along with a short description are summarized in Table 2. We remark that the categorization of the features into three groups (activity, sleep, and circadian rhythm) is for reporting convenience, and different categorization approaches or feature membership into these categories are possible. For example, M10, L5, RA, IS, and IV characterize diurnal rhythm behaviors and could be assigned to the circadian rhythm category, as some previous studies have suggested [44,45]. The computation of the features can be achieved using any preprocessing approach that summarizes the raw actigraphy data into a vector. Here, we used the movement variable. Alternative approaches using other variables that summarize the 3D actigraphy data would also be possible; we defer further elaboration regarding preprocessing for the Discussion.
Table 2.
Categorya and pattern | Description | |
Activity | ||
|
M10 | Average activity for the 10 most active hours |
|
M10 time | Start time of 10 most active hours |
|
L5 | Average activity for the 5 least active hours |
|
L5 time | Start time of 5 least active hours |
|
RA | Relative amplitude of most and least active hours |
|
MDA | Mean diurnal activity (rise time to bed time) |
|
MNA | Mean nocturnal activity |
|
MA | Mean activity with diurnal and nocturnal components |
|
Diurnal skewness | Skewness of the probability distribution of diurnal activity |
|
Percentiles diurnal activity | 5, 25, 50, 75, 95 percentiles of diurnal activity |
|
% nocturnal activity (% NA) | Ratio of nocturnal activity over sum 24-hour activity |
|
IS1 | Interday stability using 1-hour windows |
|
IS2 | Interday stability using 1-hour windows with 30-min overlap |
|
IV1 | Intraday variability (24 hours) |
|
IV2 | Intraday variability (1440 min) |
|
IV3 | Intraday variability (24 hours with 30-min overlap) |
|
Activity TKEO diurnal | Computing the diurnal activity variability using the Teager-Kaiser Energy Operator (TKEO) |
|
Activity ratio TKEO | Ratio of diurnal activity variability against overall activity variability evaluated using TKEO |
|
Activity RMSSD | Computing the diurnal activity variability using the root mean squared successive differences (RMSSD) |
|
Activity ratio RMSSD | Ratio of diurnal activity variability against overall activity variability evaluated using RMSSD |
|
CMSE | Composite multiscale entropy, evaluating the complexity of the time series at 5, 30, 60, 120 min |
Sleep | ||
|
Sleep onset | Time starting sleep |
|
Sleep offset | Wake up time |
|
Sleep duration | Duration of main (nocturnal) sleep |
|
Number wake-up | Number of wake-up periods during sleep |
|
Wake after sleep onset (WASO) minutes | Minutes awake interrupting sleep |
|
Sleep entropy | Entropy of the activity during sleep (variability of activity during sleep) |
|
Percentiles sleep activity | 5, 25, 50, 75, 95 percentiles of activity during sleep |
|
Awakenings total minutes | Total number of minutes awakenings lasted for each automatically detected nocturnal sleep |
Circadian rhythm | ||
|
Sleep temperature zenith | Maximum temperature during sleep |
|
Sleep temperature zenith time | Time of maximum temperature during sleep |
|
Sleep temperature nadir | Minimum temperature during sleep |
|
Sleep temperature nadir time | Time of minimum temperature during sleep |
|
Sleep temperature range | Range of temperature during sleep |
|
Sleep onset phase | Successive differences in sleep onset timing |
|
Sleep offset phase | Successive differences in sleep offset timing |
|
Cosinor: MESOR | Cosinor model: average measure of rhythm |
|
Cosinor: Amplitude | Cosinor model: amplitude of fitted sinusoid |
|
Cosinor: Phase | Cosinor model: phase of fitted sinusoid |
aFor algorithmic details, see Multimedia Appendix 1. Overall, we have 49 extracted patterns (counting the percentiles and the CMSE entries separately). We remark that the categorization of the patterns into the three groups (activity, sleep, and circadian rhythm) is for reporting convenience.
Activity
Accelerometers have been traditionally used to quantify physical activity, with an increasing number of wearables and smartphone apps capitalizing on the acceleration signals. There is a growing body of research literature on characteristics that can be extracted from raw actigraphy to quantify activity. Further details regarding the algorithmic expressions for the activity patterns extracted in this study are found in Multimedia Appendix 1.
Sleep
We developed a new approach to detect sleep, which was presented in the section Automatically Detecting Sleep. Subsequently, we extracted patterns to quantify nocturnal sleep, including awakenings, disturbances, and periods of excessive activity during sleep, which are of clinical interest in PTSD. The algorithmic details for the sleep patterns are presented in Multimedia Appendix 1.
Circadian Rhythm
Given the reported chronic course of sleep problems and general long-term effects in daily life, we can use the actigraphy, light, and temperature data to extract circadian rhythm disturbances, that is, processes with 24-hour oscillations. The term circadian rhythm, strictly speaking, refers to endogenous, entrainable processes; chronobiologists prefer the use of the more general term diurnal rhythm to describe self-sustained, repeated processes with 24-hour oscillations when their endogenous nature cannot be confirmed. Given that we are extracting both temperature and activity patterns, we will use the former expression as an umbrella term for simplicity.
A standard approach is to measure core temperature and identify minimum and maximum values over consecutive days [46]. Here, we only have access to wrist temperature, but we wanted to test whether there were any intrinsic differences observed in PTSD. Moreover, we computed differences in terms of the activity phase and sleep phases over consecutive days. Algorithmic details of the extracted circadian rhythm patterns are presented in Multimedia Appendix 1.
Statistical Analysis
This section describes the statistical tools used to visualize and compare the findings across the three cohorts.
Density Plots and Statistical Hypothesis Testing
We computed the densities of the extracted patterns using histograms for discrete random variables and using kernel density estimation with Gaussian kernels for continuous random variables. For discrete random variables, we used the chi-square test to determine whether the distributions are statistically significantly different at P=.05 level. For continuous random variables, we used the 2-sample Kolmogorov-Smirnov goodness-of-fit statistical hypothesis test to determine whether the distributions are statistically significantly different, assessing statistical significance at the P=.05 level. The null hypothesis was that the samples are drawn from the same underlying distribution. In all cases, we aimed to assess whether there were statistically significant pairwise differences between the three groups.
Statistical Association Between Patterns and Groups
We computed pairwise statistical associations between the summarized patterns and groups to quantify differences. Specifically, we computed point-biserial correlation coefficients and used the standard empirical rule of thumb approach that correlations with a magnitude larger than 0.3 are statistically strong [47,48].
Data Accessibility
Requests for access to the data can be made to EW, but the data cannot be placed into a publicly accessible repository.
Source Code Availability
The MATLAB source code for the computation of the actigraphy patterns will be made available on the first author’s website [49].
Results
Self-Reported Sleep: Sleep Questionnaire and Sleep Diary
People with PTSD reported more severe subjective sleep disturbances (higher total PSQI scores) compared with both trauma controls and controls (Figure 1). The PSQI was statistically significantly different between PTSD participants and nontraumatized controls (P<.001), and also between PTSD participants and traumatized controls (P<.001). There was no statistically significant difference in terms of PSQI between traumatized controls and nontraumatized controls (P=.73). In the PTSD group, 91% (38/42) of the participants had clinically marked sleep problems at baseline (total PSQI>5).
Detection of Sleep Onset and Offset With Automatic Algorithms Compared With Sleep Diary Data
Visual inspection of the sleep diaries and comparison with the actigraphy data revealed that a few participants recorded sleep times that differed considerably compared with the standard sedentary activity typically observed in actigraphy data: this may highlight inherent limitations of sleep diaries [20]. Therefore, we wanted to exclude participants where the difference was markedly clear that it seemed possible that the sleep diary entry was not reliable. This was assessed by visual inspection of the actigraphy data and superimposing self-reported sleep entries. In total, 17 out of the 113 participants were excluded from this assessment: 7 due to lack of valid sleep diary entries for at least three days, 1 due to fewer than 3 days of valid actigraphy data, and 9 due to consistent major disagreement between actigraphy and self-reported sleep entries (also see Multimedia Appendix 1). We clarify that by sleep onset, we used the time that participants recorded as falling asleep (not the time they reported as going to bed or starting trying to sleep, in most cases, there was a couple of minutes difference known as sleep onset latency).
The agreement of the automatic algorithms against the participants’ sleep diaries in terms of sleep onset and sleep offset is summarized in Table 3. For convenience, we summarize the statistical distributions of the errors (differences in minutes between the algorithmic estimates and sleep diaries) in the form median (IQR). Furthermore, Figure 2 presents the error density plots comparing side-by-side the estimated sleep onset and sleep offset of the new proposed sleep detection algorithm and the sleep detection algorithm by van Hees et al [29], against the sleep diaries that are used as ground truth. We remark that the proposed sleep detection algorithm generalizes sufficiently well for both the nontraumatized controls and the PTSD participants. Overall, the new sleep detection algorithm appears to be very accurate and highly competitive against van Hees’s algorithm, which has been used in some previous studies. As expected, the results are, in general, more accurate for the nontraumatized control group; the PTSD group appears considerably more challenging, likely due to the nature of the disorder affecting sleep, resulting in more irregular patterns, which may complicate sleep detection. Both algorithms can detect sleep offset (wake-up) more accurately than sleep onset. This indicates that people exhibit greater activity in the morning compared with before bedtime, which intuitively verifies what we would have expected.
Table 3.
Cohort | van Hees et al [29] sleep detection algorithma | Proposed sleep detection algorithm in this study | ||
|
Sleep onset, median (IQR) | Sleep offset, median (IQR) | Sleep onset, median (IQR) | Sleep offset, median (IQR) |
Nontraumatized controls | −56 (112) | 22.5 (106) | −12.5 (51) | 2 (30.25) |
Traumatized controls | −81 (147) | 35.5 (95.5) | −18 (50) | 10 (46.75) |
PTSDb participants | −78 (131.25) | 41.5 (122.5) | −34 (78.25) | 10 (45.25) |
aFor the algorithm of van Hees et al [29], we used their implementation in the GGIR R package. The results indicate minutes of sleep onset difference and sleep offset difference between the actigraphy algorithm and the ground truth for the purpose of validation (sleep diary). For details on the distributions of the errors, see Figure 2.
bPTSD: posttraumatic stress disorder.
To facilitate direct comparison of the two competing sleep detection algorithms across all samples, we also provide scatter plot results in Figure 3. The algorithm proposed in this study is compared with the algorithm suggested by van Hees et al [29], which had been developed for the programming language R (R Core Team, R Foundation for Statistical Computing). We used standard default settings in the GGIR package developed by van Hees. For both the sleep detection algorithm by van Hees and the sleep detection algorithm in this study, the sleep diary data were used only to compare findings. Subsequently, we also present Bland-Altman plots to assess the agreement between the new proposed sleep detection algorithm and the sleep detection algorithm by van Hees [29] (see Figure 4).
Finally, we provide a visual illustration in Figure 5, comparing the algorithm developed by van Hees et al [29] and the algorithm in this study for a randomly selected participant. We observe that the proposed algorithm appears to match very well the participant’s self-reported onset and offset in the sleep diary. We remark that in 2 of the nights during the week this participant was monitored, the algorithm detected awakenings (illustrated with noncontinuous transparent green over the course of the night, indicating broken sleep). The participant had reported awakenings on those 2 nights, although the timings of those awakenings were not recorded. We provide further details about the subplots presented in Figure 5 in the next section.
Data Visualization and Objective Outcomes
We present indicative plots for a PTSD participant to illustrate how data can be presented to visualize patterns. Figure 6 shows the data summary plot presenting the raw data, Figure 7 presents the actogram, and Figure 8 illustrates the activity using a color scale. To protect participant anonymity, we have changed the dates in these illustrations.
There are several important insights to be gained by visualizing the raw data in Figure 5: (1) during sleep, the movement is considerably reduced compared with the rest of the day and there are a few large noted changes across 2 axes usually (indicating an occasional, relatively long arm movement during sleep), (2) during sleep the temperature is elevated compared with the rest of the day, (3) we can observe the continuity of activity and sedentary periods during the day, we observe light exposure, which is known to have a strong influence on circadian rhythms (in addition, light can be used to support monitoring wake-ups during the night, eg, to detect toilet visits). The automatically assessed sleep times coincide almost perfectly with the participant’s self-reported sleep diary across all 6 days in Figure 4 (this participant did not provide an entry for the 7th day in the diary). Further examples and detailed results on the comparison of the sleep detection algorithm against sleep diaries and the sleep detection algorithm of van Hees et al [29] are provided in Multimedia Appendix 1.
The actogram in Figure 7 is useful for visualizing the long-term regularity and potential shift changes in activity. Using the sleep annotation (here automatically deduced), we determined the regularity of sleep timings. We also observed movements during sleep, which can be used to quantify sleep quality.
The colored actogram in Figure 8 complements the standard actogram: visualizing the active times of subsequent days once again provides insight into pattern regularity, which can be useful for extracting circadian rhythm patterns. The extracted patterns in the study were motivated in part by visualizing multiple plots of the form presented in Figures 6-8 from the three cohorts.
Group Differences
Ultimately, each of the extracted patterns characterizes each day, that is, for each participant, we typically have a 49×6 (extracted 49 patterns for 6 days). We summarize these patterns in terms of the mean (SD) values for each pattern to characterize each participant. Table 4 summarizes the statistical distributions of the summarized patterns for the three cohorts, along with pairwise comparisons between cohorts. The direction of the effects can be inferred from the signs of the correlation coefficients: negative signs in the correlations indicate that the summarized pattern generally exhibits lower scores for the first group in the comparison. Overall, we observed that the PTSD cohort exhibited some statistically strong (|R|>0.3) differences compared with the nontraumatized control group. For example, the sleep entropy appears to be a good differentiator between PTSD and the nontraumatized control group. Descriptive statistics of the patterns shown in Table 4 for each of the three groups are presented in Multimedia Appendix 1. The direction of the effect can be inferred from the sign of the correlation coefficient.
Table 4.
Pattern | Statistical comparisons (P values) | Correlations (point biserial correlation coefficient) | ||||
|
Control vs trauma | Control vs PTSDa | Trauma vs PTSD | Control vs trauma | Control vs PTSD | Trauma vs PTSD |
IV2b | .06 | .005 c | .06 | 0.22 | 0.33 | 0.21 |
Sleep entropy | .002 | .005 | .88 | −0.34 | −0.32 | 0.016 |
Awakenings total minutes | .25 | .01 | .16 | 0.14 | 0.30 | 0.16 |
Number wake ups | .33 | .03 | .06 | 0.12 | 0.27 | 0.21 |
WASOd total minutes | .41 | .03 | .05 | 0.10 | 0.26 | 0.21 |
aPTSD: posttraumatic stress disorder.
bIV2: intraday variability (1440 min).
cStatistically significant associations (at the P=.05 level) are italicized. We present five indicative summarized patterns that exhibit the largest correlation magnitudes for the binary comparisons between groups. The negative signs in the correlations indicate that the summarized pattern generally exhibits lower scores for the first group in the comparison. Correlations with a magnitude over 0.3 are considered statistically strong. Further details are presented in Multimedia Appendices 1 and 2 (Multimedia Appendix 2 presents all the investigated variables).
dWASO: wake after sleep onset.
In Multimedia Appendix 1, we present additional plots to convey a visual impression on indicative differences between a PTSD participant, a traumatized control participant, and a nontraumatized control participant.
Discussion
Principal Findings
Sensor-based at-home monitoring is rapidly emerging with the proliferation of wearables. Wearables provide a convenient platform for chronic condition management, facilitating detailed longitudinal assessment across diverse metrics, including activity, sleep, and circadian rhythm patterns. This study demonstrated the potential of using raw triaxial acceleration, light, and temperature data to gain insight into weekly patterns of activity and sleep in the participants’ everyday life. We illustrated compact approaches to visualize annotated data modalities and proposed a novel algorithmic approach to estimate sleep onset and offset (including awakenings) that appears to be very accurate against sleep diaries.
We emphasize that the extent of our claim regarding the accuracy of the new sleep detection algorithm is that it replicates reasonably accurately the sleep diaries, which is a proxy for the true sleep onset and sleep offset. Any automated sleep detection algorithm validated on sleep diaries implicitly relies on having accurate labels to report performance and would ideally need to be benchmarked against PSG. We decided to exclude 9 participants from further analysis due to consistent major disagreement between actigraphy and self-reported sleep diary entries following visual inspection of the data summary plots (see Multimedia Appendix 1).
Importantly, some of the actigraphy algorithms were sensitive in showing distinguishable patterns for participants with PTSD compared with nontraumatized controls, which were in line with the self-reported sleep disturbances. These results are summarized in Table 4 and Multimedia Appendix 1. For example, sleep-based features (awakenings, WASO minutes) may provide a useful objective indicator of the degree of restlessness experienced by PTSD patients during sleep. Similarly, the intraday variability (IV2) differentiates well PTSD from nontraumatized controls, possibly indicating bursts of activity in PTSD throughout the day, whereas nontraumatized controls exhibited less activity variability within each day on average. Collectively, these findings are well reflected in the PTSD literature, where PTSD patients are known to have difficulty with sleep, nightmares, and variable sleep [7-10], attributes which are quantified here by the different sleep-related actigraphy metrics. The results provide further important insight into the quality of objective sleep in PTSD. The objective sleep differences are consistent with actigraphy studies that have found differences (such as increased wake after sleep) between people with PTSD and healthy controls [8,50] and between trauma survivors with and without PTSD [51]. However, objective sleep duration did not differ between groups, consistent with some [27], but not all previous actigraphy studies [50]. Therefore, this study adds to the tentative understanding of the nature of sleep disturbances in PTSD by means of the quantified disturbances in the reported patterns and highlights possible targets for potential intervention to further improve comorbid problems during therapy.
We remark that the results reported in Table 4 have not been corrected for multiple comparisons (eg, some researchers use the Bonferroni correction). These corrections are used to reduce type I errors (rejecting the null hypothesis when it is true) but have the important side effect of introducing type II errors (accepting the null hypothesis when it is false). This has urged researchers to suggest that effect sizes, correlation coefficients, and other metrics should be used to support research findings [51]. Here, we report the point biserial correlations for the extracted patterns, some of which appear to denote statistically strong associations (|R|>0.3).
The empirical rules and thresholds for the sleep detection algorithm were originally developed by the first author using his own Geneactiv data (n=1, collected for over 4 years) to correctly match onset, offset, and awakenings. From an algorithmic developer’s perspective, using one’s own data has the distinct advantage of cross-referencing activities, knowing the exact underlying ground truth across multiple days (including naps, awakenings, days of bad sleep, etc) and observing the recorded actigraphy signals. The thresholds of the algorithm were adjusted using a small subset (n<10) of the dataset described in this study, but we purposefully did not formally explore in detail optimizing thresholds to correctly match sleep diary entries to avoid overfitting. Ultimately, the sleep detection algorithm proposed in this study will need to be further validated in new cohorts to demonstrate how well it generalizes. The proposed sleep detection algorithm has functionality to detect awakenings, as illustrated in Figure 5. However, we do not have the self-reported onset and offset timings of awakenings; therefore, unfortunately, this is something we could not properly test and validate. Further work is required to demonstrate the potential of the proposed sleep detection algorithm in correctly detecting awakenings and comparing findings against self-reports or preferably PSG. Typically, algorithms developed to process actigraphy signals rely on hard thresholds (including the algorithm by van Hees et al [29]). It is possible that the thresholds chosen for the sleep detection algorithm proposed here might need to be refined in a different cohort (or with a different sensor recording actigraphy signals).
We believe that the colored actogram is a novel, convenient representation to understand longitudinal patterns of behavior. It serves as an intuitive summary of activity levels, potentially highlighting breaks in patterns. In principle, it could also be used as a composite for multiple participants, for example, to visually compare cohorts and identify whether at an aggregate level, there are some consistent patterns of behavior.
Comparison With Prior Work
The topic of automatically detecting sleep using wrist-worn devices has recently attracted attention [29]. However, the algorithms developed by van Hees et al [29] have not been validated in a group that exhibits considerable sleep problems [29], which is of particular importance in clinical assessment of mental disorders [12]. The new algorithmic approach proposed here to detect sleep has been developed with the aim of being sensitive to cohorts exhibiting perturbed sleep like PTSD and for extracting sleep patterns. In Multimedia Appendix 1, we provided indicative examples of this new approach, and in Table 3 and Figure 2, we demonstrate its competitiveness compared with the algorithm developed by van Hees et al [29]. Nevertheless, we remark that the sleep diaries used to assess sleep onset and wake-up times in this study are inherently limited and subject to participant self-report bias [20]. Therefore, although the current results are extremely promising, more rigorous validation is required to establish the validity of the new sleep detection algorithm against PSG and ideally in a larger sample, including diverse sleep pathologies. PSG provides the objective ground truth for sleep onset, sleep offset, and awakenings: the actigraphy-based patterns extracted computed by GGIR or any algorithmic package cannot be considered as PSG validated. Hence, the focus of using PSG is primarily to assess and compare the accuracy of competing sleep detection algorithms (ideally tested on both healthy controls and people with different sleep disorders). The subsequent computation of sleep-based patterns is heavily dependent on accurate sleep estimates.
High-end equipment has been previously used to provide objective assessment of activity and sleep in mental disorders assessment, including PTSD [52]. PSG is the gold standard for sleep assessment and remains the only accurate approach to gain insight into the actual sleep architecture [32]. However, PSG relies on the use of expensive specialized equipment, and is logistically costly as assessment requires multiple hours of dedicated time by a sleep-certified expert [24]. Standard PSG also requires the participant’s physical presence in the clinic; however, some recent studies have demonstrated the potential of home-based PSG recordings [53]. Similarly, the ActivPAL monitor is considered the gold standard in free-living conditions for step-counting, time spent in sitting/lying, standing, and walking postures [54]. However, it is designed to be worn at the thigh, fixed with a special adhesive pad, and hence is not very practical for long-term use. These difficulties motivate the use of alternative, easy-to-use, more affordable sensors, and wrist-worn watches are prime candidates in that respect. The data in this study were collected using the Geneactiv watch, which has attracted considerable interest across different research groups worldwide [36,42]. A similar popular wrist-worn device that was used, for example, in the UK Biobank study [25], Axivity AX3, has been shown to provide equivalent signal vector magnitude output with Geneactiv [55]; hence, the developed algorithms should, in principle, be directly applicable in studies where that device was used, crucially in the UK Biobank. Similar devices that provide access to the raw triaxial accelerometer data might require some cross-device calibration; otherwise, the developed framework should generalize well. Some older actigraphy devices used counts instead of raw acceleration signals and hence tools used to be device specific; although a relatively recent study provides for backward compatibility [42], the trend is moving toward tools that capitalize on the raw actigraphy data.
Summarizing the 3D accelerometry signals in a vector is crucial and is a required preprocessing step in advance of computing the actigraphy patterns (eg, IS, IV, etc). There are many different approaches reported in the research literature but no unique single best way to summarize the activity [38,39]. For example, the Euclidean Norm Minus One [38] is sensitive to calibration errors. Other approaches often rely on short-term windows [39] aiming to smoothen accelerometry fluctuations owing to internal accelerometer noise and hence might not effectively capture transient movements. The proposed approach in this study for the computation of the movement as an accelerometry summary, aims to address inherent accelerometer noise fluctuations by effectively operating on successive differences in the raw 3D accelerometer data before computing the Euclidean distance. We tentatively argue that this instantaneous-based approach rather than using local windows might have some advantages in terms of mitigating inherent accelerometer noise. Further work is required to assess whether there is any superior approach toward summarizing 3D accelerometry signals, for example, against a gold standard.
Alternatively, someone might use the subsequent computation of patterns (such as IS, IV, etc) working on each of the different accelerometry summaries (which are used as a preprocessing step) and assess how those patterns might be associated with a clinical outcome. This effectively draws parallels with the feature selection problem in data analytics, lacking ground truth of which are the true features and which are predictive of an outcome; researchers apply feature selection algorithms and feed different classifiers. On the basis of the classification performance, they can assess which feature selection algorithms perform best in a given problem [56,57]. We emphasize that for this approach to be valid and generalizable, researchers would need to perform comparisons across different datasets, ideally associating the extracted features with different outcomes. It is also possible that there are different combinations of accelerometry summaries and computation of patterns that work best; future work would be needed to investigate this in more detail.
Limitations
The primary limitations of this study are (1) the lack of PSG data to validate findings and (2) the study duration of 7 days. With long-term data, more detailed markers of weekly and monthly activity, sleep, and circadian rhythm variability could be developed and further explored. Participants adhered well and wearing the watch was not reported as disconcerting by any participant, which suggests that longer term monitoring may be viable in accordance with a recent study in the UK BioBank [25]. The sample size was sufficiently large for the exploratory aims of this study; nevertheless, larger cohorts might provide better insight into the nature of PTSD. Verification of the developed algorithm for sleep estimation (including awakenings) against PSG would be important in future studies to confirm sleep and wake detection, and further clarify the results. Similarly, we do not have detailed daily self-reported outcome measures (eg, daily mood self-reports as we had used longitudinally in related previous research [18,58,59]) other than the sleep diaries, which could have been associated with actigraphy-extracted patterns and hence further validate the developed algorithms.
The sleep detection algorithm proposed in this study capitalizes on the accelerometry and ambient light modalities; the latter is useful for differentiating sedentary activity and sleep. However, this suggests that the current version of the sleep detection algorithm would not be backwards compatible with devices that only record accelerometry signals. With the sophistication of wearables, additional modalities are becoming available (such as heart rate) and could be harnessed to potentially further improve sleep detection.
More generally, the light sensor modality should be used carefully in the analysis when developing algorithmic tools: lack of detected light does not necessarily indicate that someone is in a dark room. The wrist sensor may hide under a long sleeve, for example, in long-sleeved clothes or pajamas. There may also be abrupt changes in the detected light signal, if the sensor is temporarily visible (or vice-versa blocked).
The study participants wore the standard Geneactiv (Geneactiv Original). The temperature sensor is encased within the waterproof housing of the watch; hence, the recorded temperature depends on how tightly the participants wear the watch: the sensor actually measures a mixture of body and room temperatures. In addition, during the night, temperature changes may (at least partly) indicate that a person has moved their hand above or under the blanket. The wrist temperature recorded has been primarily useful to detect nonwear times and to assess within-person changes, but not for between-person comparison. Researchers wishing to compare recorded temperatures between participants may want to explore a device that provides direct skin temperature recordings such as the Geneactiv Sleep variant [60]. Therefore, the use of the raw temperature measurements in this study should be interpreted very tentatively. For example, the increase in the recorded temperature during sleep seen in Figure 5 is likely a reflection of environmental temperature increase rather than an increase in the wrist temperature of the participant.
Conclusions
We envisage the developed algorithmic framework laying the foundation for using actigraphy analysis in different settings where raw wrist-worn triaxial accelerometer data are available, aiming to monitor healthy and pathological cohorts longitudinally. We encourage research colleagues to use and expand on the user-friendly MATLAB source code provided in this study to facilitate actigraphy data visualization and analysis. Among mental health conditions, applications to depression are of interest given that low levels of activity and poor sleep are characteristics of the disorder. Future research could investigate extracting additional patterns from the raw signals, potentially complementing it with additional modalities such as heart rate and geolocation, which are embedded in some recent devices. We are currently exploring the potential of using and extending the objective measures provided in this study to monitor longitudinal PTSD behavior, therapy effects, and long-term recovery.
Acknowledgments
The authors are grateful to all study participants for their meticulous adherence to the research protocol. The authors thank Juliane Sachschal for assisting with recruitment, interviews, and testing of the participants. They also thank Jennifer Wild, Emma-Warnock Parkes, and the staff of the Centre for Anxiety Disorders and Trauma, South London, and Maudsley NHS Foundation Trust for their help with recruitment. The study was funded by a Medical Research Council DTA award to EW and by Wellcome Trust grant 069777/200796 to AE. This study was supported by the National Institute for Health Research Oxford Health Biomedical Research Centre. AT was supported at the early stages of this work by the Wellcome Trust through a Centre grant no. 098461/Z/12/Z, The University of Oxford Sleep and Circadian Neuroscience Institute (SCNi) during the early stages of this work. The study was also supported by the Health Data Research UK, which receives funding from HDR UK Ltd (HDR-5012), funded by the UK Medical Research Council, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), the British Heart Foundation, and the Wellcome Trust. The funders had no role in the study or the decision to submit this work to be considered for publication.
Abbreviations
- DSM-5
Diagnostic and Statistical Manual of Mental Disorders 5th edition
- NHS
National Health Service
- PROM
patient-reported outcome measure
- PSG
polysomnography
- PSQI
Pittsburgh Sleep Quality Index
- PTSD
posttraumatic stress disorder
- WASO
wake after sleep onset
Appendix
Further details on algorithmic background and additional results.
Statistical comparisons of features across cohorts.
Footnotes
Authors' Contributions: AT developed the algorithms, analyzed the data, and wrote the first draft of the manuscript. EW designed the study with AE, wrote the protocol and ethics application, collected and entered the data, analyzed self-report data, and provided clinical insight for the interpretation of the activity data. AE codesigned the study and protocol, supervised recruitment, diagnostic interviews, and the running of the study. All authors critically reviewed and commented on the text and gave final approval for submission of this manuscript for publication.
Conflicts of Interest: None declared.
References
- 1.Alonso J, Angermeyer M, Lépine JP, European Study of the Epidemiology of Mental Disorders (ESEMeD) Project The European study of the epidemiology of mental disorders (ESEMeD) project: an epidemiological basis for informing mental health policies in Europe. Acta Psychiatr Scand Suppl. 2004;109(420):5–7. doi: 10.1111/j.1600-0047.2004.00325.x. [DOI] [PubMed] [Google Scholar]
- 2.Bunting BP, Ferry FR, Murphy SD, O'Neill SM, Bolton D. Trauma associated with civil conflict and posttraumatic stress disorder: evidence from the Northern Ireland study of health and stress. J Trauma Stress. 2013 Feb;26(1):134–41. doi: 10.1002/jts.21766. [DOI] [PubMed] [Google Scholar]
- 3.Kessler RC, Sonnega A, Bromet E, Hughes M, Nelson CB. Posttraumatic stress disorder in the national comorbidity survey. Arch Gen Psychiatry. 1995 Dec;52(12):1048–60. doi: 10.1001/archpsyc.1995.03950240066012. [DOI] [PubMed] [Google Scholar]
- 4.Steel Z, Chey T, Silove D, Marnane C, Bryant RA, van Ommeren M. Association of torture and other potentially traumatic events with mental health outcomes among populations exposed to mass conflict and displacement: a systematic review and meta-analysis. J Am Med Assoc. 2009 Aug 5;302(5):537–49. doi: 10.1001/jama.2009.1132. [DOI] [PubMed] [Google Scholar]
- 5.American Psychiatric Association . Diagnostic and Statistical Manual of Mental Disorders (DSM-5). Fifth Edition. Washington, DC: American Psychiatric Association Publication; 2013. [Google Scholar]
- 6.Ross RJ, Ball WA, Sullivan KA, Caroff SN. Sleep disturbance as the hallmark of posttraumatic stress disorder. Am J Psychiatry. 1989 Jun;146(6):697–707. doi: 10.1176/ajp.146.6.697. [DOI] [PubMed] [Google Scholar]
- 7.Nappi C, Drummond SP, Hall JM. Treating nightmares and insomnia in posttraumatic stress disorder: a review of current evidence. Neuropharmacology. 2012 Feb;62(2):576–85. doi: 10.1016/j.neuropharm.2011.02.029. http://europepmc.org/abstract/MED/21396945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Straus L, Drummond SP, Nappi CM, Jenkins MM, Norman SB. Sleep variability in military-related PTSD: a comparison to primary insomnia and healthy controls. J Trauma Stress. 2015 Feb;28(1):8–16. doi: 10.1002/jts.21982. http://europepmc.org/abstract/MED/25630526. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Leskin G, Woodward SH, Young HE, Sheikh JI. Effects of comorbid diagnoses on sleep disturbance in PTSD. J Psychiatr Res. 2002;36(6):449–52. doi: 10.1016/s0022-3956(02)00025-0. [DOI] [PubMed] [Google Scholar]
- 10.Cox R, Olatunji BO. A systematic review of sleep disturbance in anxiety and related disorders. J Anxiety Disord. 2016 Jan;37:104–29. doi: 10.1016/j.janxdis.2015.12.001. [DOI] [PubMed] [Google Scholar]
- 11.van Liempt S. Sleep disturbances and PTSD: a perpetual circle? Eur J Psychotraumatol. 2012;3 doi: 10.3402/ejpt.v3i0.19142. http://europepmc.org/abstract/MED/23050070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sheaves B, Porcheret K, Tsanas A, Espie CA, Foster RG, Freeman D, Harrison PJ, Wulff K, Goodwin GM. Insomnia, nightmares, and chronotype as markers of risk for severe mental illness: results from a student population. Sleep. 2016 Jan 1;39(1):173–81. doi: 10.5665/sleep.5342. http://europepmc.org/abstract/MED/26350467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Woodward S, Bliwise DL, Friedman MJ, Gusman DF. Subjective versus objective sleep in Vietnam combat veterans hospitalized for PTSD. J Trauma Stress. 1996 Jan;9(1):137–43. doi: 10.1007/bf02116839. [DOI] [PubMed] [Google Scholar]
- 14.Germain A, Hall M, Shear MK, Nofzinger EA, Buysse DJ. Sleep disruption in PTSD: A pilot study with home-based polysomnography. Sleep Biol Rhythms. 2006 Oct;4(3):286–9. doi: 10.1111/j.1479-8425.2006.00230.x). doi: 10.1111/j.1479-8425.2006.00230.x). [DOI] [Google Scholar]
- 15.Herbst E, Metzler TJ, Lenoci M, McCaslin SE, Inslicht S, Marmar CR, Neylan TC. Adaptation effects to sleep studies in participants with and without chronic posttraumatic stress disorder. Psychophysiology. 2010 Nov;47(6):1127–33. doi: 10.1111/j.1469-8986.2010.01030.x. http://europepmc.org/abstract/MED/20456661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cella D, Riley W, Stone A, Rothrock N, Reeve B, Yount S, Amtmann D, Bode R, Buysse D, Choi S, Cook K, Devellis R, de Walt D, Fries JF, Gershon R, Hahn EA, Lai J, Pilkonis P, Revicki D, Rose M, Weinfurt K, Hays R, PROMIS Cooperative Group The patient-reported outcomes measurement information system (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005-2008. J Clin Epidemiol. 2010 Nov;63(11):1179–94. doi: 10.1016/j.jclinepi.2010.04.011. http://europepmc.org/abstract/MED/20685078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schwartz S, Schultz S, Reider A, Saunders E. Daily mood monitoring of symptoms using smartphones in bipolar disorder: a pilot study assessing the feasibility of ecological momentary assessment. J Affect Disord. 2016 Feb;191:88–93. doi: 10.1016/j.jad.2015.11.013. http://europepmc.org/abstract/MED/26655117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Tsanas A, Saunders K, Bilderbeck A, Palmius N, Osipov M, Clifford G, Goodwin G, de Vos M. Daily longitudinal self-monitoring of mood variability in bipolar disorder and borderline personality disorder. J Affect Disord. 2016 Nov 15;205:225–33. doi: 10.1016/j.jad.2016.06.065. https://linkinghub.elsevier.com/retrieve/pii/S0165-0327(16)30781-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Carney C, Buysse DJ, Ancoli-Israel S, Edinger JD, Krystal AD, Lichstein KL, Morin CM. The consensus sleep diary: standardizing prospective sleep self-monitoring. Sleep. 2012 Feb 1;35(2):287–302. doi: 10.5665/sleep.1642. http://europepmc.org/abstract/MED/22294820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Buysse D, Ancoli-Israel S, Edinger JD, Lichstein KL, Morin CM. Recommendations for a standard research assessment of insomnia. Sleep. 2006 Sep;29(9):1155–73. doi: 10.1093/sleep/29.9.1155. [DOI] [PubMed] [Google Scholar]
- 21.Proudfoot J, Parker G, Pavlovic DH, Manicavasagar V, Adler E, Whitton A. Community attitudes to the appropriation of mobile phones for monitoring and managing depression, anxiety, and stress. J Med Internet Res. 2010 Dec 19;12(5):e64. doi: 10.2196/jmir.1475. https://www.jmir.org/2010/5/e64/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Tsanas A, Saunders K, Bilderbeck A, Palmius N, Goodwin G, de Vos M. Clinical insight into latent variables of psychiatric questionnaires for mood symptom self-assessment. JMIR Ment Health. 2017 May 25;4(2):e15. doi: 10.2196/mental.6917. https://mental.jmir.org/2017/2/e15/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lee I, Shiroma EJ, Lobelo F, Puska P, Blair SN, Katzmarzyk PT, Lancet Physical Activity Series Working Group Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy. Lancet. 2012 Jul 21;380(9838):219–29. doi: 10.1016/S0140-6736(12)61031-9. http://europepmc.org/abstract/MED/22818936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ancoli-Israel S, Cole R, Alessi C, Chambers M, Moorcroft W, Pollak CP. The role of actigraphy in the study of sleep and circadian rhythms. Sleep. 2003 May 1;26(3):342–92. doi: 10.1093/sleep/26.3.342. [DOI] [PubMed] [Google Scholar]
- 25.Doherty A, Jackson D, Hammerla N, Plötz T, Olivier P, Granat MH, White T, van Hees VT, Trenell MI, Owen CG, Preece SJ, Gillions R, Sheard S, Peakman T, Brage S, Wareham NJ. Large scale population assessment of physical activity using wrist worn accelerometers: the UK biobank study. PLoS One. 2017;12(2):e0169649. doi: 10.1371/journal.pone.0169649. http://dx.plos.org/10.1371/journal.pone.0169649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kobayashi I, Lavela J, Mellman TA. Nocturnal autonomic balance and sleep in PTSD and resilience. J Trauma Stress. 2014 Dec;27(6):712–6. doi: 10.1002/jts.21973. [DOI] [PubMed] [Google Scholar]
- 27.Kobayashi I, Huntley E, Lavela J, Mellman TA. Subjectively and objectively measured sleep with and without posttraumatic stress disorder and trauma exposure. Sleep. 2012 Jul 1;35(7):957–65. doi: 10.5665/sleep.1960. http://europepmc.org/abstract/MED/22754042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Sabia S, van Hees VT, Shipley MJ, Trenell MI, Hagger-Johnson G, Elbaz A, Kivimaki M, Singh-Manoux A. Association between questionnaire- and accelerometer-assessed physical activity: the role of sociodemographic factors. Am J Epidemiol. 2014 Mar 15;179(6):781–90. doi: 10.1093/aje/kwt330. http://europepmc.org/abstract/MED/24500862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.van Hees VT, Sabia S, Anderson KN, Denton SJ, Oliver J, Catt M, Abell JG, Kivimäki M, Trenell MI, Singh-Manoux A. A novel, open access method to assess sleep duration using a wrist-worn accelerometer. PLoS One. 2015;10(11):e0142533. doi: 10.1371/journal.pone.0142533. http://dx.plos.org/10.1371/journal.pone.0142533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tsanas A. Oxford University Research Archive. University of Oxford, UK: 2012. [2020-03-31]. Accurate Telemonitoring of Parkinson's Disease Symptom Severity Using Nonlinear Speech Signal Processing and Statistical Machine Learning, PhD thesis https://ora.ox.ac.uk/objects/uuid:2a43b92a-9cd5-4646-8f0f-81dbe2ba9d74. [Google Scholar]
- 31.Tsanas A, Clifford GD. Stage-independent, single lead EEG sleep spindle detection using the continuous wavelet transform and local weighted smoothing. Front Hum Neurosci. 2015;9:181. doi: 10.3389/fnhum.2015.00181. doi: 10.3389/fnhum.2015.00181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kryger T, Roth T, Dement WC. Principles and Practice of Sleep Medicine. Fifth Edition. Canada: Elsevier; 2010. [Google Scholar]
- 33.Weathers FW, Bovin MJ, Lee DJ, Sloan DM, Schnurr PP, Kaloupek DG, Keane TM, Marx BP. The Clinician-Administered PTSD Scale for DSM-5 (CAPS-5): development and initial psychometric evaluation in military veterans. Psychol Assess. 2018 Mar;30(3):383–95. doi: 10.1037/pas0000486. http://europepmc.org/abstract/MED/28493729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Buysse DJ, Reynolds CF, Monk TH, Berman SR, Kupfer DJ. The Pittsburgh Sleep Quality Index: a new instrument for psychiatric practice and research. Psychiatry Res. 1989 May;28(2):193–213. doi: 10.1016/0165-1781(89)90047-4. [DOI] [PubMed] [Google Scholar]
- 35.Aili K, Åström-Paulsson S, Stoetzer U, Svartengren M, Hillert L. Reliability of actigraphy and subjective sleep measurements in adults: the design of sleep assessments. J Clin Sleep Med. 2017 Jan 15;13(1):39–47. doi: 10.5664/jcsm.6384. doi: 10.5664/jcsm.6384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.van Hees VT, Fang Z, Langford J, Assah F, Mohammad A, da Silva IC, Trenell MI, White T, Wareham NJ, Brage S. Autocalibration of accelerometer data for free-living physical activity assessment using local gravity and temperature: an evaluation on four continents. J Appl Physiol (1985) 2014 Oct 1;117(7):738–44. doi: 10.1152/japplphysiol.00421.2014. http://journals.physiology.org/doi/full/10.1152/japplphysiol.00421.2014?url_ver=Z39.88-2003&rfr_id=ori:rid:crossref.org&rfr_dat=cr_pub%3dpubmed. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhou S, Hill RA, Morgan K, Stratton G, Gravenor MB, Bijlsma G, Brophy S. Classification of accelerometer wear and non-wear events in seconds for monitoring free-living physical activity. BMJ Open. 2015 May 11;5(5):e007447. doi: 10.1136/bmjopen-2014-007447. http://bmjopen.bmj.com/cgi/pmidlookup?view=long&pmid=25968000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.van Hees VT, Gorzelniak L, León ED, Eder M, Pias M, Taherian S, Ekelund U, Renström F, Franks PW, Horsch A, Brage S. Separating movement and gravity components in an acceleration signal and implications for the assessment of human daily physical activity. PLoS One. 2013;8(4):e61691. doi: 10.1371/journal.pone.0061691. http://dx.plos.org/10.1371/journal.pone.0061691. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Karas M, Bai J, Strączkiewicz M, Harezlak J, Glynn NW, Harris T, Zipunnikov V, Crainiceanu C, Urbanek JK. Accelerometry data in health research: challenges and opportunities. Stat Biosci. 2019 Jul;11(2):210–37. doi: 10.1007/s12561-018-9227-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kushida CA, Chang A, Gadkary C, Guilleminault C, Carrillo O, Dement WC. Comparison of actigraphic, polysomnographic, and subjective assessment of sleep parameters in sleep-disordered patients. Sleep Med. 2001 Sep;2(5):389–96. doi: 10.1016/s1389-9457(00)00098-8. [DOI] [PubMed] [Google Scholar]
- 41.Sadeh A, Sharkey KM, Carskadon MA. Activity-based sleep-wake identification: an empirical test of methodological issues. Sleep. 1994 Apr;17(3):201–7. doi: 10.1093/sleep/17.3.201. [DOI] [PubMed] [Google Scholar]
- 42.te Lindert BH, van Someren EJ. Sleep estimates using microelectromechanical systems (MEMS) Sleep. 2013 May 1;36(5):781–9. doi: 10.5665/sleep.2648. http://europepmc.org/abstract/MED/23633761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.van Hees VT, Fang Z, Zhao JH, Heywood J, Mirkes E, Sabia S, Pujol JC, Migueles JH, Patterson MR. The Comprehensive R Archive Network. 2019. Dec 4, [2020-04-03]. GGIR: Raw Accelerometer Data Analysis https://cran.r-project.org/web/packages/GGIR/index.html.
- 44.van Someren EJ, Kessler A, Mirmiran M, Swaab DF. Indirect bright light improves circadian rest-activity rhythm disturbances in demented patients. Biol Psychiatry. 1997 May 1;41(9):955–63. doi: 10.1016/S0006-3223(97)89928-3. [DOI] [PubMed] [Google Scholar]
- 45.Di J, Spira A, Bai J, Urbanek J, Leroux A, Wu M, Resnick S, Simonsick E, Ferrucci L, Schrack J, Zipunnikov V. Joint and individual representation of domains of physical activity, sleep, and circadian rhythmicity. Stat Biosci. 2019 Apr 15;11(2):371–402. doi: 10.1007/s12561-019-09236-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Benloucif S, Guico MJ, Reid KJ, Wolfe LF, L'hermite-Balériaux M, Zee PC. Stability of melatonin and temperature as circadian phase markers and their relation to sleep times in humans. J Biol Rhythms. 2005 Apr;20(2):178–88. doi: 10.1177/0748730404273983. [DOI] [PubMed] [Google Scholar]
- 47.Meyer GJ, Finn SE, Eyde LD, Kay GG, Moreland KL, Dies RR, Eisman EJ, Kubiszyn TW, Reed GM. Psychological testing and psychological assessment. A review of evidence and issues. Am Psychol. 2001 Feb;56(2):128–65. doi: 10.1037/0003-066x.56.2.128. [DOI] [PubMed] [Google Scholar]
- 48.Tsanas A, Little MA, McSharry PE. A methodology for the analysis of medical data. In: Sturmberg JP, Martin C, editors. Handbook of Systems and Complexity in Health. New York, USA: Springer; 2013. pp. 113–25. [Google Scholar]
- 49.The Data Analytics Research and Technology in Healthcare (DARTH group) 2019. [2020-04-03]. Software https://www.darth-group.com/software.
- 50.Calhoun P, Wiley M, Dennis MF, Means MK, Edinger JD, Beckham JC. Objective evidence of sleep disturbance in women with posttraumatic stress disorder. J Trauma Stress. 2007 Dec;20(6):1009–18. doi: 10.1002/jts.20255. [DOI] [PubMed] [Google Scholar]
- 51.Nakagawa S. A farewell to Bonferroni: the problems of low statistical power and publication bias. Behav Ecol. 2004;15(6):1044–5. doi: 10.1093/beheco/arh107. [DOI] [Google Scholar]
- 52.Klein E, Koren D, Arnon I, Lavie P. Sleep complaints are not corroborated by objective sleep measures in post-traumatic stress disorder: a 1-year prospective study in survivors of motor vehicle crashes. J Sleep Res. 2003 Mar;12(1):35–41. doi: 10.1046/j.1365-2869.2003.00334.x. https://onlinelibrary.wiley.com/resolve/openurl?genre=article&sid=nlm:pubmed&issn=0962-1105&date=2003&volume=12&issue=1&spage=35. [DOI] [PubMed] [Google Scholar]
- 53.Heinzer R, Vat S, Marques-Vidal P, Marti-Soler H, Andries D, Tobback N, Mooser V, Preisig M, Malhotra A, Waeber G, Vollenweider P, Tafti M, Haba-Rubio J. Prevalence of sleep-disordered breathing in the general population: the HypnoLaus study. Lancet Respir Med. 2015 Apr;3(4):310–8. doi: 10.1016/S2213-2600(15)00043-0. http://europepmc.org/abstract/MED/25682233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Kooiman TJ, Dontje ML, Sprenger SR, Krijnen WP, van der Schans CP, de Groot M. Reliability and validity of ten consumer activity trackers. BMC Sports Sci Med Rehabil. 2015;7:24. doi: 10.1186/s13102-015-0018-5. https://bmcsportsscimedrehabil.biomedcentral.com/articles/10.1186/s13102-015-0018-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Ladha C, Ladha K, Jackson D, Olivier P. Shaker Table Validation of OpenMovement AX3 Accelerometer. Proceedings of the Third International Conference on Ambulatory Monitoring of Physical Activity and Movement; ISMPB'13; June 17-19, 2013; Amherst, MA. 2013. https://www.researchgate.net/publication/312500148_Shaker_table_validation_of_OpenMovement_accelerometer. [DOI] [Google Scholar]
- 56.Guyon I, Eliseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3:1157–82. http://www.jmlr.org/papers/volume3/guyon03a/guyon03a.pdf. [Google Scholar]
- 57.Tsanas A, Little MA, McSharry PE, Spielman J, Ramig LO. Novel speech signal processing algorithms for high-accuracy classification of Parkinson's disease. IEEE Trans Biomed Eng. 2012 May;59(5):1264–71. doi: 10.1109/TBME.2012.2183367. [DOI] [PubMed] [Google Scholar]
- 58.Palmius N, Tsanas A, Saunders KE, Bilderbeck AC, Geddes JR, Goodwin GM, de Vos M. Detecting bipolar depression from geographic location data. IEEE Trans Biomed Eng. 2017 Aug;64(8):1761–71. doi: 10.1109/TBME.2016.2611862. http://europepmc.org/abstract/MED/28113247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Carr O, Saunders KE, Bilderbeck AC, Tsanas A, Palmius N, Geddes JR, Foster R, de Vos M, Goodwin GM. Desynchronization of diurnal rhythms in bipolar disorder and borderline personality disorder. Transl Psychiatry. 2018 Apr 12;8(1):79. doi: 10.1038/s41398-018-0125-7. http://europepmc.org/abstract/MED/29643339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Activinsights. [2020-04-03]. GENEActiv Sleep https://www.activinsights.com/actigraphy/geneactiv-sleep/
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Further details on algorithmic background and additional results.
Statistical comparisons of features across cohorts.
Data Availability Statement
Requests for access to the data can be made to EW, but the data cannot be placed into a publicly accessible repository.