Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Apr 9.
Published in final edited form as: Proc Int Conf Automot User Interfaces Interact Veh Appl (2014). 2014 Sep 17;2014:2667335. doi: 10.1145/2667317.2667335

Estimating Drivers’ Stress from GPS Traces

Sudip Vhaduri 1, Amin Ali 1, Moushumi Sharmin 1, Karen Hovsepian 2, Santosh Kumar 1
PMCID: PMC4390997  NIHMSID: NIHMS671141  PMID: 25866847

Abstract

Driving is known to be a daily stressor. Measurement of driver’s stress in real-time can enable better stress management by increasing self-awareness. Recent advances in sensing technology has made it feasible to continuously assess driver’s stress in real-time, but it requires equipping the driver with these sensors and/or instrumenting the car. In this paper, we present “GStress”, a model to estimate driver’s stress using only smartphone GPS traces. The GStress model is developed and evaluated from data collected in a mobile health user study where 10 participants wore physiological sensors for 7 days ( for an average of 10.45 hours/day) in their natural environment. Each participant engaged in 10 or more driving episodes, resulting in a total of 37 hours of driving data. We find that major driving events such as stops, turns, and braking increase stress of the driver. We quantify their impact on stress and thus construct our GStress model by training a Generalized Linear Mixed Model (GLMM) on our data. We evaluate the applicability of GStress in predicting stress from GPS traces, and obtain a correlation of 0.72. By obviating any burden on the driver or the car, we believe, GStress can make driver’s stress assessment ubiquitous.

Author Keywords: Mobile Health, Driving, Stress, GPS

ACM Classification Keywords: H.1.2. Models and Principles: User/Machine Systems

INTRODUCTION

Daily stress experienced during driving [17] may increase the likelihood of adverse events (e.g., accidents and traffic fatalities [12, 20]) and contribute to adverse health. Repeated occurrences of stress can cause or worsen cancer [37], heart diseases [7], hypertension [2], aging [29], shrinking of brain [16], fatigue, depression, and rage [24, 25]. Consequently, there has been tremendous interest in both the scientific community as well as the technology and automotive industry [5, 15, 43, 45] in developing systems to measure stress in real-time in order to enhance stress-awareness of drivers and to deliver just-in-time stress interventions [22, 27].

Driver stress had traditionally been measured post-hoc via self-reports [17] or biofluid assays [11]. Emergence of physiological sensors such as electrocardigram (ECG) and galvanic response (GSR) and their instrumentation on drivers [9] or in the car [5, 15, 43, 45] has made continuous stress measurement feasible. Research on developing accurate measure of stress during driving from physiological sensors has been an active area of research for more than a decade now [14, 41]. More recently, these physiological measures have been supplemented with data collected from the car about driving events such as steering wheel movements and braking episodes [36], and information about driving conditions from the environment [35]. In addition, video has been used to capture driver’s facial expressions to detect stress during driving [30]. Recent research is exploring the role of extraneous cognitive load on the driver (e.g., texting, navigation) [42] that may further escalate a driver’s stress.

Stress monitoring of drivers is yet to be ready for wider adoption, though. This is because monitoring of driver’s stress today requires either having the drivers wear physiological sensors or instrumenting the car with sensors and cameras. For stress monitoring to be widely available, a method is needed that can infer stress from data collected passively by commonly used existing technology, such as mobile phones.

In this paper, we propose a model named GStress to estimate the stress level of drivers from GPS traces. GPS sensors are readily available in current navigation systems and more and more smart phones are also equipped with GPS – making attainment of GPS traces in real-time increasingly feasible. Drivers can use the GStress model to become more aware of their stress during driving, overlay the stress data on a map to identify road segments frequently associated with elevated stress to plan their route accordingly, and adapt their driving behaviors if needed. GStress model can also help inform the design and use of in-vehicle technologies. For example, calls or texts can be blocked or postponed if the driver is stressed. Wide adoption of such models can be used to annotate traffic maps with current stress levels being experienced by drivers on various routes, similar to real-time traffic update displayed by the navigation systems. Such data can also be used by city planners to identify pain points in a city’s road network (e.g., difficult intersections that cause stress in many drivers).

To develop the GStress model, we used physiological data collected from 30 human volunteers who wore AutoSense [9] sensors for at least 10 hours/day for a week in their natural environment. However, 19 of these 30 volunteers were living on the university campus and rarely used a vehicle for commute. The remaining 11 participants had at least 10 driving episodes and in this paper we only report data collected from these participants.

We use GPS and self-report data to identify driving episodes from the entire day’s data. For model development, we first analyze driving episodes to identify events that have been shown to be stressful, which include stops, braking, and turns. Next, considering the wide variability across individuals in stress reactivity, we develop a Generalized Linear Mixed Model (GLMM) that separates the effects of between-person variability. The GLMM model also permits exploiting nonlinear relationships while retaining the simplicity of a linear regression. By using three factors (stops, turns, and braking) from the GPS traces, our model obtains an r value of 0.72 for predicting stress from GPS traces. We then obtain a population estimate of the person-specific biases and obtain a person-independent model. Using leave-one-subject-out evaluation, our model provides a median (across all participants) r value of 0.687 while a person-dependent model improves this median correlation to 0.762. We find that stops have the highest impact on stress, confirming the widespread belief that impedance is the genesis of stress during commute [17, 28], with quantitative data from real-life driving.

RELATED WORK

Assessment of driver’s stress is an active area of research. However, most of the existing research focus on measuring stress from physiological data [14, 41], video [30], and acoustic data [21]. More recently, these measurements have been supplemented with driving and traffic information [36, 42]. For example, [36] performed real-time stress detection using physiological signals and measurements obtained from the vehicle’s CAN-bus (e.g., speed, RPM, and throttle) and combined the physiological stress response with driving behaviors (e.g., overtake, hard brake, etc.) to improve accuracy. They trained a stress model from self-reported data. However, self-reports are episodic, prone to bias, noise, and misreport, and is less reliable for training stress models [32].

Measurement of driver’s stress has traditionally been confined to simulators due to the difficulty, effort, and risk involved in collecting data in the natural environment [22, 42]. The limited number of studies that are conducted in the natural environment followed scripted routes under supervision for a limited duration [14, 38, 41]. For example, in [14], four drivers wore physiological sensors during highway and urban driving episodes and self-report showed that urban driving was more stressful than highway, which, in turn, was more stressful than when parked in a garage. A continuous rating of stress was also obtained, however, this stress model was not validated against behaviorally accepted measures of stress.

To the best of our knowledge, this is the first study that collected continuous stress data from uncontrolled and unscripted driving episodes where participants drove their own vehicles. In addition, while most existing research utilize driving event measurements together with physiological measurements [14, 41] to improve stress measurement accuracy, we present a model for stress estimation using GPS traces obtained via smart phone alone.

DATA COLLECTION PROCEDURE

To obtain continuous stress measures experienced by drivers during (unscripted) driving, we conducted a week-long mobile health user study on stress, where participants wore a physiological sensor suite and carried a smart phone with GPS for at least 10 hours/day in their natural environment. The physiological data thus collected is processed (by applying a validated stress model [32]) to obtain a continuous measure of stress experienced by these participants and to identify association between stress and various driving events. In this section, we describe the devices and measures used in this study.

Participants

A total of 30 participants (15 females) completed the study. Our participants were university students with an average age of 24 years(SD = 4.5 years). The goal of the study was to assess stress in the natural environment where commute is a part of daily life. However, 19 of these 30 participants lived in or around campus and did not drive frequently, resulting is very few driving episodes. We excluded all data collected from participants with less than 10 driving episodes from our analysis and model development.

Sensor Suite

Our participants wore a wireless sensor suite called AutoSense [9] underneath their clothes. AutoSense consists of two unobtrusive, flexible bands worn about the chest and upper arm, respectively. The chest band provided respiration data by measuring the expansion and contraction of the chest via inductive plethysmography (RIP), two-lead electrocardiograph (ECG), and 3-axis accelerometer. Measurements collected by the sensors were transmitted wirelessly to a Sony Ericsson Xperia X8 smart phone. The sampling rates for the chest band were 21.33 HZ for RIP, 64 HZ for ECG, and 10.67 HZ for each of the three axes of accelerometers, and 1 HZ for battery level. These samples were sent 28 times/second with each packet containing 8 bytes of data and consisting of 5 samples.

Mobile Phone

Participants carried a smart phone customized to communicate with the sensor suite. The smart phone had three roles. First, it received and stored data transmitted by the sensor suite. Second, it stored data from the phone sensors, including GPS and accelerometers and these measurements were synchronized to the measurements transmitted from AutoSense. Finally, participants used the phone to complete system-initiated self-reports in the field.

Self-report Measures

In addition to the sensor measurements, data was collected from participants using self-reports, up to 20 times each day, which was administered on the study phone. The 42-item survey asked participants to rate their subjective stress level on a 6-point Likert scale [19] as well as provided additional contextual data such as whether the participant was riding or driving a vehicle. Participants were allowed to postpone or not answer a prompt. We note that data obtained using self-reports was only used for identifying drivers and not for modeling stress.

Continuous Stress Measure

The entire day (including the driving episodes) was divided into 30 second segments and the stress state for each segment was computed as a continuous measure in terms of posterior probability of being stressed using the model presented in [32]. Stress estimation from ECG has traditionally relied on a single feature (e.g., heart rate or heart rate variability) [24, 25]. Machine learning models that identify a more specific fingerprint of physiological activation began emerging in the past decade [14]. We utilized the model developed in [32] as it has been validated in the lab and the natural field environment. Although the initial model was developed to provide a binary stress output for each 1 minute of physiological data, we adapted the model to provide a stress state for each 30 second segment. We reduced the granularity from 1 minute to 30 seconds to capture stress levels associated with events such as braking, stops, and turns, events which may not last a minute. As the stress model uses statistics such as heart rate and heart rate variability, using a window smaller than 30 second will reduce the accuracy of the stress model to below 80%. This model provides a continuous stress measure that is normalized between 0 and 1, where stress values can be interpreted as No, Low, Moderate, High, and Extremely High stress for the ranges of 0–0.2, 0.2–0.4, 0.4–0.6, 0.6–0.8, and 0.8–1 respectively.

DETECTION OF TRAFFIC EVENTS

We now describe methods that we use to computationally detect potentially stressful driving events from GPS traces. We first define each event and then describe the computational method we use to detect them. In most cases, our goal is to compute the duration of these events.

Driving Episode

Definition

Vehicular movements are usually sandwiched between walking segments. Start time of a vehicular movement episode occurs when the speed (obtained from GPS samples) is over the maximum gait speed of 2.53 meter/sec [4]. A vehicular movement episode is considered a driving episode if the participant is the driver. A driving episode consists of various driving events such as stops, turns, congestions, etc.

Computation

In order to determine whether the participant was driving or riding in a commuting episode, we used participant’s response to the self-report question (completed upon conclusion of the driving episode): If you commuted since the last prompt, what type? with possible answers — Driving, Biking, Walking, Riding as a Passenger, Riding Public Transportation, and Did not commute.

Duration of Stop Segments

Definition

Stop segments refer to parts of the driving episode when the vehicular speed obtained from GPS reaches zero. Usually, stops occur when a vehicle encounters a road intersection and the traffic signal is red, there is a stop sign, or when the car is maneuvering (e.g., moving in or out of a drive-way). A stop segment can consist of multiple consecutive stops or a single stop. For our stress analysis, we separate stop events in parking lots and garages from stop events during driving by plotting the location of stops in Google Maps. We discard stop events in parking lots and garages from analysis. The total stop time is defined as the time taken between driver initiating a deceleration and the start of the following acceleration event.

Computation

To compute the duration of stop segments, we search backwards from the end time of a complete stop. We search for the moment when the driver starts decelerating. We consider prior 5 seconds from the final stop mark and compute the speed difference

dv=vtstart-vtend, (1)

where tend and tstart (= tend – 5) are the end and start time of the 5 second window respectively, vtend and vtstart are the instantaneous speeds at time tend and tstart respectively. We continue this process with 4 second overlaps (i.e. 1 second sliding) until the speed difference dv at tend is less than 10% of the speed at tstart i.e., dv ≤ 0.1 × vtstart and both of the two speeds (i.e., vtstart and vtend ) are above the maximum gait speed. We mark tstart as the start time of deceleration.

Merging of Closely Spaced Stops

We merge multiple stops that appear closely with slow moving driving segments among them. To merge these intermittent stops, we consider 5 seconds prior to the start (tstart) of each stop and compute the area under the speed curve as follows.

Areai=tstart-5tstartdv(t)dt, (2)

where Areai is the area associated with the ith stop under the speed curve from time (tstart – 5) to tstart and dv is the change in speed during that time window. If Areai < ε, then we consider this slow moving driving segment as part of the ith stop and replace the speed values for this time period with zero. The backward propagation of the algorithm allows the detection and merging of a stop with the preceding slowly moving driving patterns until Areaiε. The value of ε can be derived by taking the average of areas of all prior windows across all stops. For our dataset, ε = 10.87. Area under the speed curve represents distance travelled in 5 seconds.

Turn Detection

Definition

A turn is associated with a change in the driving direction (of more than 30° [33]) and a decrease in the speed of the vehicle. The geometric properties [13] of the curve determines the amount of change in driving direction and speed.

Computation

We detect turn from changes in driving direction obtained from GPS bearing using a modified approach proposed in [33, 46]. For speeds < 3m/s, when GPS bearing data is less accurate, we consider the heading at current sample and fifth previous sample from the current sample. For speeds ≥ 3m/s, we consider the heading at current sample and third previous sample from the current sample. A turn is detected when the absolute value of direction change (i.e. difference between heading at the two samples) is > 30° [33]. Sign of the direction change defines the turn type (right or left).

Duration of Braking and Acceleration After Braking & Stop

Definition

Braking is a driving event that results in an immediate deceleration and sometimes can result in a complete stop. In our analysis, we consider only those braking segments that are followed by an acceleration segment. Braking that leads to a stop are considered to be part of the corresponding stop event. As a result, some braking events within a driving episode are subsumed in stop segments. Intensity of deceleration defines the category of braking such as negligible, moderate, and severe [8, 18]. After braking, a driver usually starts to increase the speed up to an almost constant speed. For our analysis, we consider this segment of speed up after braking as an acceleration after braking segment. We also consider the speed up segment after a stop as an acceleration after stop segment in our analysis.

Computation

For braking, we first find the local minima in the speed curve of each driving segment and then identify the start time of deceleration. We use “Imregionalmin” [34] to find Regional minima and “PeakFinder” [31] to find local minima. To identify the start time of deceleration, we examine the speed difference between prior seconds following a similar approach that we used to identify deceleration start time for stops. The end time of a braking segment occurs when acceleration begins. We determine the end of an acceleration after braking segment by reversing the search process we use to find the start of a braking event. Similarly, we determine an acceleration after stop segment starting from the end of stop segment.

Duration of Congestion

Definition

When the demand of a road network exceeds its capacity, the network imposes additional travel cost to all users of the network, a situation known as vehicular congestion. It can happen on a regular, cyclic basis reflecting the social and economic activities of an area or happen irregularly in certain points in the network due to irregular occurrence of road work, breakdown, and/or accidents [44].

Computation

We use Density-Based Spatial Clustering of Applications with Noise (DBSCAN) [10] to detect traffic congestion. We did not have access to surrounding traffic and geographical information [3, 6] that could improve the detection accuracy. To apply DBSCAN, we define our core points as those points/sample that have ≥ 3 points/sample within the neighborhood of radius 15 meter since a driver moving as slowly as 5 m/s can pass 15 meters in 3 seconds and meanwhile the GPS receiver can gather 3 samples at a sampling rate of 1 Hz. After we find the core points, we consider them together to make congestion segments. We consider only ≥ 5 minutes long segments. We found only a few congestion segments in our dataset, due to unscripted natural driving.

Duration of Driving Segments

We consider those portions of a driving episode as driving segment that remains after extracting the stops, turns, braking, acceleration, and congestion segments.

G-STRESS MODEL

We first present a summary of our dataset, observed stress patterns from physiological data, and finally present the development and evaluation of the GStress model.

Data Summary

Of the 30 participants, 25 reported at least one instance of driving and we obtained a total of 372 driving episodes. For stress measurement and model development, we selected only those participants who reported at least 10 driving episodes, resulting in 11 participants who contributed 295 driving episodes. We further excluded data from one participant (ID# 31) as s/he reported having an unusually stressful week. Of these 295 episodes considered initially, we excluded episodes with low quality physiological data, resulting in 181 (37.05 hours) driving episodes. We identified 637 stops, 1120 turns, 840 braking, and 1477 acceleration segments within these 181 driving episodes. Figure 1 shows distribution of these driving segments. The average durations of stop, turn, braking, and acceleration after braking segments were 2.0389 minutes (SD = 1.72 minutes), 0.28 minutes (SD = 0.18 minutes), 0.20 minutes (SD = 0.32 minutes), and 0.44 minutes (SD = 0.51 minutes) respectively. We also observed five congestion segments with an average duration of 5.95 minutes (SD = 1.04 minutes).

Figure 1.

Figure 1

Distribution of driving episode counts and driving durations. We observe that a majority of driving episodes are ≤ 30 minutes.

Patterns of Stress from Physiological Data

The average stress levels (obtained by applying a stress model [32] on physiological data) during driving and rest of the day are 0.3992 and 0.2178 (with standard errors of 0.0041 and 0.0013) respectively. Thus, driving is 83% more stressful than rest of the day, which is consistent with existing literature [14, 36, 40, 41]. Figure 2 presents average stress level of 11 participants during driving and rest of the day.

Figure 2.

Figure 2

Average stress levels (obtained from physiological data) during driving and rest of the day for 11 selected participants.

To further examine the difference between stress during driving and rest of the day, we used a two sample t-Test with the null hypothesis, H0 : μD = μRD, where μD and μRD are average stress during driving and rest of the day, respectively. We performed for individual participants and for the entire population, i.e. on all participants together. At α = 0.05 significance level, we reject the null hypothesis for both individual and population level with a p-value < 0.001. Therefore, participants’ average stress levels during driving and rest of the day are not equal. It is interesting to point out that, while calculating stress values, we observed an anomaly for Participant ID#31. This participant had three exams and a couple of deadlines during the study week, resulting in unusually high levels of stress for the rest of the day. During this study week, this participant used driving as a stress reduction technique as s/he mentioned enjoying driving a lot. Hence, P#31’s data was excluded from the analysis.

Development of the GStress Model

The goal of the GStress model is to estimate drivers’ stress level during a driving episode from GPS traces. For stress estimation, we used time duration of stops, turns, braking, and driving segments within a driving episode.

The simplest model is a linear regression model with the assumption that errors are normally distributed. This assumption does not hold when the response variable (Y ) refers to count, proportion, or positive continuous data. Our response variable (Y ) refers to average stress of a driving episode, a positive continuous data.

The next candidate is Generalized Linear Models (GLM) that does not assume normality. GLM assumes that a linear function (η = g(.)) of the mean (μ = E[Y ]) of the response variable is related to the predictors i.e. η = g(μ) = , where, X stands for predictor variables, and β are the fixed-effects regression coefficients. We use Gamma for the underlying distribution that is suitable for scenarios where the response variable are positive continuous data. We use identity as our transformation/link function (i.e. μ = (E[Y ]) = , where, Y = + ε). An advantage of using such a descriptive model is that, it enables us to determine the relative importance of each factor in measuring the response variable.

Driving in the natural environment, however, may involve unanticipated events such as phone calls/texts and adverse weather/road conditions in addition to the major events detected from GPS traces. These unaccounted factors and wide between-person variation in stress-reactivity limits the use of regression models such as GLM that rely on fixed effects only, i.e. effects that can be explained with known factors such as stops, turns, and braking.

To account for the random effects, we use Generalized Linear Mixed Model (GLMM), a widely used model in the health research domain. GLMM takes into account both the fixed effects and the random effects that cannot be estimated with fixed effects or errors. The general form of the model is Y = + + ε, where, Y, X, β, Z, γ, and ε stand for the response variable, predictor variables, fixed-effects regression coefficients, design matrix for random effects, random effects, and random errors or residuals, respectively.

Our predictor variables for the fixed effects are the amount of time (in terms of 30 second segments) affected by stop (x3), braking (x4), turn (x5), acceleration after braking (x6), acceleration after stop (x7), and congestion (x8). We consider these segments individually and independently. To determine which segment is affected by what type of driving event, we apply majority-voting on the duration of different driving events that occurs within or overlaps with an individual segment. We also include the stress level prior to driving (x1) and the amount of driving time (x2) not affected by these driving events. We scale all the predictor variables (except x1) on a 0 to 1 scale by dividing the xi’s with the duration of a driving episode. The output or response variable (y) is the average stress of a driving episode. We consider person-level variation as our random effect.

Our stress estimation model based on GPS traces, GStress is

yij=(β0+γj)+k=18βkxkij+ε (3)

where, yij is the average stress of ith driving episode of person j; for k = 1, . . . , 8, βk is the kth fixed effect, xkij is the kth predictor variable for fixed effect in ith episode of jth person; β0 is the fixed effect on intercept and γj is the random effect on intercept for the jth person.

While building the GStress model, we did not find sufficient instances of congestion. We tested the significance of both “Congestion” and “Acceleration” (both after braking and stop) i.e. H0 : β6 = β7 = β8 = 0. We built two models — with and without the congestion and acceleration effects referred to as “gm44” and “gm33”, respectively. We could not reject the H0 for χ2 = 1.5678 and p-value = 0.2105 (see Table 1) at α = 0.05 significance level. Also, model “gm33” had lower Akaike Information Criterion (AIC) [1] and Bayesian Information Criterion (BIC) [39] (see Table 1), so the effect of “Congestion” and “Acceleration” was not significant in our dataset. However, in a dataset where instances of congestion is not minimal, congestion may have a significant impact.

Table 1.

χ2 test to check the significance of “person random effect” and fixed effect “congestion” and “acceleration”.

Model AIC BIC Deviance χ2 p-value
gm11 −3367.1 −3341.4 −3383.1
gm22 −3447.3 −3418.4 −3465.3 82.245 < 0.001

gm33 −95.5 −69.9 −107.88
gm44 −95.074 −66.287 −109.35 1.5678 0.2105

We also examined whether the random effect of subject is significant i.e. H0 : γ = 0. We built two models — with and without person random effect referred to as “gm22” and “gm11”, respectively. We rejected the null hypothesis for χ2 = 3237.8 and p-value < 0.001 (see Table 1) at α = 0.001 significance level. Also, model “gm22” had lower AIC, and BIC (see Table 1). Hence, the person random effect is significant. Therefore, the final GStress model is yij=(β0+γj)+k=15βkxkij+ε Table 2 presents the proposed GStress model. Stress levels prior to driving (x1), driving time without events (x2), amount of driving time affected by stop (x3) and affected by turn (x5) were significant at α = 0.001, amount of time affected by braking(x4) was significant at α = 0.01, and intercept was significant at α = 0.05. We observe that all significant fixed effect factors have positive coefficients, i.e. they increase stress while driving. Among all the driving events, stop has the highest weight (β3 = 0.755).

Table 2.

G-Stress: Driving Stress Estimation Model. Here, SE stands for standard error.

Fixed effects β SE t-value p-value
Intercept −0.445 0.181 −2.454 0.014147
x1 0.681 0.077 8.795 <0.001000
x2 0.682 0.188 3.632 0.000281
x3 0.755 0.207 3.655 0.000257
x4 0.668 0.210 3.178 0.001480
x5 0.703 0.200 3.523 0.000427

Evaluation of GStress

We obtained a Pearson Correlation, r = 0.722 for predicting stress from GPS traces via the GStress model (see Table 2). The variance for person variability, residual, and fixed effects were 0.002, 0.188, and 0.061, respectively. Therefore, RGLMM2(c)=0.252 [26], i.e. 25% variability of the data can be explained with both fixed effect and random effect.

To evaluate the applicability of the GStress model on participants on whom no training data has been collected, we trained the model on nine participants and applied it on the remaining one participant with a population estimate of the random effect and fixed effects. Figure 3 shows the correlation for each subject. For this leave-one-subject-out validation, we obtained a median correlation of 0.687.

Figure 3.

Figure 3

Pearson Correlation, r between actual and estimated average stress for “leave one subject out” validation. Horizontal magenta line corresponds to median correlation (0.687) from all 10 test subjects.

We also evaluated the GStress model using leave-one-episode-out validation, where we trained the model on n – 1 driving episodes out of n episodes from the same participant and applied it on the remaining one driving episode to estimate stress. We continued this process to get n estimated stress values for all n episodes and obtained correlation between the estimated and actual stress of all these n episodes. Similarly, we calculated correlation for all other subjects via the leave-one-episode-out approach. With this person-dependent model, we obtained a median correlation of 0.762 (Figure 4). We observed that the person-dependent model (Figure 4) has a better correlation than the person-independent model (Figure 3). The minimum correlation improves from 0.49 to 0.64. It suggests that the prediction accuracy is likely to improve over time for any user.

Figure 4.

Figure 4

Pearson Correlation, r between actual and estimated average stress for “leave one driving episode out” validation. Horizontal magenta line corresponds to median correlation.

To further evaluate the performance of the GStress model, we analyzed the Bland-Altman plot [23](Figure 5). The plot shows moderate agreement between actual and predicted stress, however, lack of consistent agreement for values lower than 0.4 and variability above 0.4 were mainly observed within the 50% difference levels. From Figure 5, we cannot reject the null hypothesis, H0, that “the actual and estimated stress values have same mean i.e. the mean of paired differences of actual average stress and predicted average stress is zero (0)” with a p-value = 0.8893 at α = 0.1 significance level (using paired t-test). Therefore, the two distributions, i.e. estimated average stress values obtained from the GStress model and the actual average stress values, were not significantly different.

Figure 5.

Figure 5

Bland-Altman plot for the actual and estimated average stress for all driving episodes obtained from GStress. Green, cyan, magenta and red dashed lines are for 25%, 50%, 75% and 100% difference respectively.

Contribution of Driving Events to Stress

We now quantify the contribution of each driving event to the total stress. The contribution of the events depend on two factors — how frequent they occur in a typical driving episode and their weight (β) in the driving stress model (Table 2). For the first, we compute the amount of time (in terms of number of 30 second segments) that are classified to be affected by a particular event. Figure 6(a) shows the fraction of time in a driving episode that corresponds to each of the driving events — stops, turns, brakings, and driving and Figure 6(b) shows the contribution of these events to the overall stress level. We observe that stops were more stressful than driving as stops contribute 27% to the stress even though they only occur 24% of the time. On the other hand, driving accounts for 32% of the time, but only contributes 28% to the total stress level. This may imply that reducing number of stops in a driving episode may be an effective approach to reducing stress during driving. We note that high occurrence of stops, turns, and braking in our dataset may be due to participants driving in university neighborhoods.

Figure 6.

Figure 6

(a) shows the fraction of an average driving episode that is affected by various stressful factors i.e. frequency of events in an average driving episode. (b) shows the contribution of each factor to the total stress level.

CONCLUSIONS, LIMITATIONS, AND FUTURE WORK

Our work pointed out the feasibility of estimating drivers’ stress using only GPS traces collected from the driver’s smart phone. The proposed GStress model provides a correlation of over 0.7 by identifying major factors such as stops, turns, and braking associated with driving stress. This work has several limitations that may be addressed in future research. First, the accuracy of stress estimation can be improved by incorporating additional factors such as road information (e.g., complex intersections, number of lanes, type of road, speed limit, location of traffic light, curvature information, etc.) that can be obtained from a geographic database, real-time traffic conditions (e.g., level of congestion) and driving behaviors (e.g., patterns of acceleration and braking).

Second, dataset used in developing and evaluating GStress is limited to 10 drivers without any scripting of the roads on which they all drive. While it improves the ecological validity of the model, it prevents comparison with literature that mostly prescribed scripted routes to obtain uniformity of road conditions. A larger scale study that involves more participants for a much longer duration can generate data set that are large enough to compare the stress responses of several drivers when they happen to drive on the same road segments under similar conditions. That way comparison with existing works can be done while maintaining ecological validity. Third, even though GStress model is based on real-life dataset from 10 drivers, these are mostly college students. The stress response of young drivers may not accurately represent those experienced by other populations such as seniors, or those with a history with serious motor vehicle accidents. Fourth, stress experienced by drivers in a specific city (from where participants were selected for our study) may not accurately represent stress experienced during driving in other cities in the United States and more broadly in other parts of the world. Additional studies need to be conducted to evaluate the generalizability of GStress or to determine the adaptations necessary to improve its generalizability.

Finally, although GStress has a potential to improve stress awareness and reduce stress of drivers, this potential is yet to be demonstrated. Stress awareness applications or stress interventions that make use of the GStress model need to be developed and evaluated for its efficacy in diverse population in various driving environments to generate evidence of clinical utility of GStress in reducing drivers’ stress.

Acknowledgments

We would like to thank Emre Ertin from the Ohio State University, Vijay Mago from Troy University and J. Gayle Beck, Satish Kedia, Timothy Hnat, Syed Monowar Hossain, Nazir Saleheen, Md. Mahbubur Rahman, Lucas Salazar, Jeremy Luno, and Kenneth D. Ward from University of Memphis for their help with data collection and brainstorming. This work was supported in part by NSF grants CNS-0910878 (funded under the American Recovery and Reinvestment Act of 2009 (Public Law 111-5)), CNS-1212901, IIS-1231754, and by NIH Grants U01DA023812 and R01DA035502, from NIDA.

Contributor Information

Sudip Vhaduri, Email: svhaduri@memphis.edu.

Amin Ali, Email: aaali@memphis.edu.

Moushumi Sharmin, Email: msharmin@memphis.edu.

Karen Hovsepian, Email: khovsepian@troy.edu.

Santosh Kumar, Email: skumar4@memphis.edu.

References

  • 1.Akaike H. A new look at the statistical model identification. IEEE Transactions on Automatic Control. 1974;19(6):716–723. [Google Scholar]
  • 2.al’Absi M, Arnett D. Adrenocortical responses to psychological stress and risk for hypertension. Biomedecine & Pharmacotherapy. 2000;54(5):234–244. doi: 10.1016/S0753-3322(00)80065-7. [DOI] [PubMed] [Google Scholar]
  • 3.Bauza R, et al. Road traffic congestion detection through cooperative vehicle-to-vehicle communications. IEEE LCN. 2010:606–612. [Google Scholar]
  • 4.Bohannon RW. Comfortable and maximum walking speed of adults aged 20–79 years: reference values and determinants. Age and Ageing. 1997;26(1):15–19. doi: 10.1093/ageing/26.1.15. [DOI] [PubMed] [Google Scholar]
  • 5.Charette RN. [Accessed: September 2012];Your Car as Stress Monitor? http://bit.ly/1oVPLJd.
  • 6.Diker AC, Nasibov E. Estimation of traffic congestion level via fn-dbscan algorithm by using gps data. IEEE PCI. 2012:1–4. [Google Scholar]
  • 7.Dimsdale J. Psychological stress and cardiovascular disease. Journal of the American College of Cardiology. 2008;51(13):1237–1246. doi: 10.1016/j.jacc.2007.12.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ehlert PA, Rothkrantz LJ. Microscopic traffic simulation with reactive driving agents. IEEE ITS. 2001:860–865. [Google Scholar]
  • 9.Ertin E, et al. Autosense: Unobtrusively wearable sensor suite for inferencing of onset, causality, and consequences of stress in the field. ACM SenSys. 2011:274–287. [Google Scholar]
  • 10.Ester M, et al. A density-based algorithm for discovering clusters in large spatial databases with noise. KDD. 1996;96:226–231. [Google Scholar]
  • 11.Evans GW, Carrère S. Traffic congestion, perceived control, and psychophysiological stress among urban bus drivers. Journal of Applied Psychology. 1991;76(5):658–663. doi: 10.1037/0021-9010.76.5.658. [DOI] [PubMed] [Google Scholar]
  • 12.Global Status Report on Road Safety. World Health Organization; 2013. http://bit.ly/U9QmvZ. [Google Scholar]
  • 13.Fundamentals of Transportation/Horizontal Curves/Fundamental Horizontal Curve Properties. http://bit.ly/1noRx18.
  • 14.Healey J, Picard R. Detecting stress during real-world driving tasks using physiological sensors. IEEE ITS Transactions. 2005;6(2):156–166. [Google Scholar]
  • 15.Howard B. Ford smart car locks your phone when youre stressed or distracted, Accessed. 2012 Sep; http://bit.ly/1rbIl81.
  • 16.Kang H, et al. Decreased expression of synapse-related genes and loss of synapses in major depressive disorder. Nature Medicine. 2012;18(9):1413–1417. doi: 10.1038/nm.2886. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Koslowsky M, Kluger A, Reich M. Commuting Stress: Causes, Effects and Methods of Coping. Springer; 1995. [Google Scholar]
  • 18.Lee JD, et al. Collision warning design to mitigate driver distraction. ACM SIGCHI conference on Human factors in computing systems. 2004:65–72. [Google Scholar]
  • 19.Likert R. A technique for the measurement of attitudes. Archives of Psychology. 1932 [Google Scholar]
  • 20.Liu C, Ye TJ. Run-off-road crashes: An on-scene perspective, National Highway Traffic Safety Administration. Tech Rep DOT HS 811 500. 2011 [Google Scholar]
  • 21.Lu H, et al. Stresssense: Detecting stress in unconstrained acoustic environments using smartphones. ACM UbiComp. 2012 [Google Scholar]
  • 22.MacLean D, et al. Moodwings: a wearable biofeedback device for real-time stress intervention. Proceedings of the 6th International Conference on Pervasive Technologies Related to Assistive Environments. 2013;66 [Google Scholar]
  • 23.Martin Bland J, Altman D. Statistical methods for assessing agreement between two methods of clinical measurement. The lancet. 1986;327(8476):307–310. [PubMed] [Google Scholar]
  • 24.McEwen B. Stress, adaptation, and disease: Allostasis and allostatic load. Annals of the New York Academy of Sciences. 2006;840(1):33–44. doi: 10.1111/j.1749-6632.1998.tb09546.x. [DOI] [PubMed] [Google Scholar]
  • 25.McEwen B. Physiology and neurobiology of stress and adaptation: Central role of the brain. Physiological Reviews. 2007;87(3):873–904. doi: 10.1152/physrev.00041.2006. [DOI] [PubMed] [Google Scholar]
  • 26.Nakagawa S, et al. A general and simple method for obtaining r2 from generalized linear mixed-effects models. Methods in Ecology and Evolution. 2013;4(2):133–142. [Google Scholar]
  • 27.Nirjon S, et al. Musicalheart: A hearty way of listening to music. ACM SenSys. 2012:43–56. [Google Scholar]
  • 28.Novaco RW, Stokols D, Milanesi L. Objective and subjective dimensions of travel impedance as determinants of commuting stress. American journal of community psychology. 1990;18(2):231–257. doi: 10.1007/BF00931303. [DOI] [PubMed] [Google Scholar]
  • 29.O’Donovan A, et al. Stress appraisals and cellular aging: A key role for anticipatory threat in the relationship between psychological stress and telomere length. Brain, Behavior, and Immunity. 2012 doi: 10.1016/j.bbi.2012.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Paschero M, et al. A real time classifier for emotion and stress recognition in a vehicle driver. IEEE ISIE. 2012:1690–1695. [Google Scholar]
  • 31.How to find local peaks or valleys in a noisy vector? http://bit.ly/1oArFRk.
  • 32.Plarre K, et al. Continuous inference of psychological stress from sensory measurements collected in the natural environment. IEEE IPSN. 2011:97–108. [Google Scholar]
  • 33.Quddus MA, et al. Current map-matching algorithms for transport applications: State-of-the art and future research directions. Transportation Research Part C: Emerging Technologies. 2007;15(5):312–328. [Google Scholar]
  • 34.How to find regional minima of an image? http://bit.ly/1oVPxSk.
  • 35.Rigas G, et al. Towards driver’s state recognition on real driving conditions. International Journal of Vehicular Technology. 2011 [Google Scholar]
  • 36.Rigas G, et al. Real-time driver’s stress event detection. IEEE ITS Transactions. 2012;13(1):221–234. [Google Scholar]
  • 37.Sapolsky R. Why Zebras Don’t Get Ulcers. Owl Books; 2004. [Google Scholar]
  • 38.Schneegass S, et al. A data set of real world driving to assess driver workload. ACM International Conference on Automotive UI. 2013:150–157. [Google Scholar]
  • 39.Schwarz G, et al. Estimating the dimension of a model. The Annals of statistics. 1978;6(2):461–464. [Google Scholar]
  • 40.Singh M, Queyam AB. A novel method of stress detection using physiological measurements of automobile drivers. International Journal of Electronics Engineering. 2013;5(2) [Google Scholar]
  • 41.Singh RR, et al. Assessment of driver stress from physiological signals collected under real-time semi-urban driving scenarios. International Journal of Computational Intelligence Systems. 2013:1–15. ahead-of-print. [Google Scholar]
  • 42.Solovey ET, et al. Classifying driver workload using physiological and driving performance data: Two field studies. ACM CHI. 2014 [Google Scholar]
  • 43. [Accessed: September 2012];BMW Steering Wheel monitors your heart. http://bit.ly/1jJ2BMa.
  • 44.Taylor MA, et al. Integration of the global positioning system and geographical information systems for traffic congestion studies. Transportation Research Part C: Emerging Technologies. 2000;8(1):257–285. [Google Scholar]
  • 45.White JB. [Accessed: September 2012];A Car That Takes Your Pulse. http://on.wsj.com/1mbbWXj.
  • 46.You C-W, et al. Carsafe app: alerting drowsy and distracted drivers using dual cameras on smartphones. ACM MobiSys. 2013:13–26. [Google Scholar]

RESOURCES