Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 May 14.
Published in final edited form as: Alcohol Clin Exp Res. 2022 May 14;46(5):783–796. doi: 10.1111/acer.14804

Sensitivity, specificity, and tolerability of the BACTrack Skyn compared to other alcohol monitoring approaches among young adults in a field-based setting

Garrett I Ash 1,2, Ralitza Gueorguieva 3, Nancy P Barnett 4, Wuyi Wang 1, David S Robledo 1, Kelly S DeMartini 1, Brian Pittman 1, Nancy S Redeker 1,5, Stephanie S O’Malley 1, Lisa M Fucito 1,6,7
PMCID: PMC9179100  NIHMSID: NIHMS1786444  PMID: 35567595

Abstract

Background

There is a need for novel alcohol biosensors that are accurate, able to detect alcohol close in time to consumption, and feasible and acceptable for many clinical and research applications. We evaluated the field accuracy and tolerability of a novel (BACTrack Skyn) and established (Alcohol Monitoring Systems SCRAM CAM) alcohol biosensor.

Methods

The sensor and diary data were collected in a larger study of a biofeedback intervention but compared observationally in the present sub-study. Participants (high-risk drinkers, 40% female; median age 21) wore both Skyn and SCRAM CAM sensors for 1 to 6 days and were instructed to drink as usual. Data from the first cohort of participants (N=27; 101 person-days) were used to find threshold values of transdermal alcohol that classified each day as meeting or not meeting defined levels of drinking (heavy, above-moderate, any). These values were used to develop scoring metrics that were subsequently tested using the second cohort (N = 20; 57 person-days). Data from both biosensors were compared to mobile diary self-report to evaluate sensitivity and specificity in relation to a priori standards established in the literature.

Results

Skyn classification rules for Cohort #1 within 3 months of device shipment showed excellent sensitivity for heavy drinking (94%) and exceeded expectations for above-moderate and any drinking (78%/69%), while specificity met expectations (91%). However, classification worsened when Cohort #1 devices ≥3 months from shipment were tested (area under curve for receiver operator characteristic .87 vs .79) and the derived classification threshold when applied to Cohort #2 was inadequately specific (70%). Skyn tolerability metrics were excellent and exceeded the SCRAM CAM (p ≤.001).

Conclusions

Skyn tolerability was favorable and accuracy rules were internally derivable but did not yield useful scoring metrics going forward across device lots and months of usage.

Keywords: alcohol use, transdermal alcohol sensors, wearable alcohol sensors, biosensors, self-monitoring

Introduction

Alcohol use monitoring has many benefits, including identifying candidates for early intervention to promote moderation and abstinence, self-awareness, self-regulation (Dougherty et al., 2015; Barnett et al., 2017; Davis-Martin et al., 2021), and legally enforced sobriety to protect the public. Mobile alcohol self-management applications have ≥2.7 million downloads (Hoeppner et al., 2017) and judicial-ordered monitoring has enrolled ~1 million clients (Alcohol Monitoring Systems, 2021). Besides frequent breathalyzer operation, the best-validated monitoring is self-report, but it suffers from biases, especially underreporting and misestimation of drinking timing (Alessi et al., 2019; Hultgren et al., 2020; Merrill et al., 2020). These problems have been partially addressed by the Alcohol Monitoring Systems SCRAM CAM (Littleton, CO) ankle bracelet which measures transdermal alcohol content (TAC) but has limitations. Its size, 6-ounce weight, ankle location, and noise/vibration can create physical and social discomfort and disrupt daily activities like exercise and sleep. Another limitation of the SCRAM CAM is that data upload occurs by physical proximity to a home or office base station rather than in real-time. These issues challenge long-term feasibility and ecological validity of the use of the SCRAM CAM (Barnett et al., 2017) and limit its application for a range of clinical and research questions. Other limitations shared by other TAC sensors are that the TAC curve has a substantial and variable time delay from measured BAC (Leffingwell et al., 2013) and sensitivity to sub-heavy drinking levels (i.e., 1–3 standard drinks) is poor to fair: 60%−73% for 90% specificity (Karns-Wright et al., 2018; Roache et al., 2019).

Through recent advances in alcohol biosensor technology, a new generation of TAC sensors have emerged with the potential to expand the opportunities for real-time, objective monitoring of alcohol consumption. Among these new sensors, the most promising is the Skyn by BACTrack (San Francisco, CA) (Davis-Martin et al., 2021; Fairbairn and Bosch, 2021; Wang et al., 2019). Compared to the SCRAM CAM, the Skyn is wrist-worn, less invasive (Wang, et al., 2019; Wang et al., 2021), samples alcohol exposure more frequently, releases data immediately by Bluetooth, and detects drinking with less time delay (Fairbairn and Kang, 2019; Fairbairn et al., 2020). With these features, the Skyn is similar to commercially available health/fitness biosensors, which guide individuals to passively quantify, visualize and cogitate biometrics for the sake of motivating and supporting self-management decisions (Sim, 2019; Ash et al., 2020a; Ash et al., 2021). The data from TAC sensors like the Skyn could allow researchers/clinicians to devise behavior change strategies that leverage real-time, personalized feedback about alcohol-health associations (Davis-Martin et al., 2021; Wang et al., 2021). For instance, there has been an explosion in the importance of sleep (Worley, 2018) and technology options for improving it (Choi et al., 2018; Goldman, 2017) and we previously demonstrated that young adults are interested in feedback about alcohol’s effects on sleep and the utility of sleep for engaging them about their drinking (Fucito et al., 2015; Fucito et al., 2017; Ash et al., 2020b; Fucito et al., 2021). Older alcohol biosensor technology such as the SCRAM CAM can disrupt sleep for some participants due to discomfort (Barnett et al., 2017; Alessi et al., 2017), but the Skyn and new generation devices likely have minimal impact on sleep, thereby increasing the opportunity for these novel alcohol-health associations and personalized feedback applications. Besides sleep, the SCRAM CAM’s weight and appearance disrupt other aspects of daily living for some participants including exercise, clothing choice, social comfort, and physical comfort (Barnett et al., 2017), which could impact drinking behaviors and confound alcohol-health associations. Thus, an important question is whether new TAC technology like the Skyn avoids such disruptions. To date, prior research has focused on the user experience of viewing Skyn data on its mobile app (Wang et al., 2021). No studies have compared the Skyn or similar new biosensors with the SCRAM CAM or other alcohol self-monitoring tools (e.g., daily diaries) on measures of disruption to daily living.

The Skyn also requires further study for its accuracy. Previous investigators issued the Skyn to predominantly non-treatment-seeking, social drinkers who volunteered to drink at a standardized time, to test the time delay between drinking and TAC elevation (Fairbairn and Kang, 2019; Fairbairn et al., 2020; Wang et al., 2021). However, they did not extend this quantitative analysis to Skyn TAC results in a field setting where participants could freely choose to drink or not and the timing of their drinking. In the real world, these drinking choices typically vary day-to-day.

One study (Wang et al., 2021) conducted preliminary research in a field setting by issuing the Skyn to non-treatment-seeking social drinkers across the adult lifespan (21+ years old) living with and without HIV who were instructed to freely drink during daily living. The authors presented a subjective description of visual results from 1 (subjectively chosen) of the 12 participants. They noted that the Skyn produced 8 TAC elevations over 14 days that looked like those observed in their laboratory experiments, however 5 of these 8 elevations did not correspond to a mobile diary self-report (i.e., potential false positives). There were also 2 instances with no noticeable elevation despite positive diary self-report (i.e., potential false negatives). In summary, subjective visual inspection for drinking events yielded 60% sensitivity and 50% specificity for one selected participant. The other 11 participants were presented as supplemental data. We downloaded those files and found that compared to the one case example presented in the main text, many had even more visual anomalies (e.g., 1 or more TAC elevations daily from a sample of participants supposedly identified as moderate drinkers, and shape of TAC elevations often very different than results from controlled drinking), and furthermore no self-report data were included to tabulate false positives or negatives. Thus, visual inspection of these data was clearly inadequate to detect drinking, and furthermore visual inspection could not be translated into future automated scoring rules. The authors stressed these limitations and called for further studies objectively analyzing Skyn readings in a field setting with unknown drinking timing, to objectively calculate accuracy and rules going forward.

Furthermore, drinking classification accuracy may vary by drinking quantity: when such accuracy has been tested for the SCRAM CAM, sensitivity was 93% for heavy drinking but 60%−73% for lower drinking levels (Barnett et al., 2014; Roache et al., 2015; Karns-Wright et al., 2018; Roache et al., 2019). Even among these lower levels, 2–3 standard drinks were more easily detected than 1 standard drink (Roache et al., 2015). Finally, TAC accuracy might vary with field conditions such as temperature, contaminants, and physical perturbations. Thus, there is a need to test how accurately the Skyn can distinguish drinking from non-drinking without a priori knowledge of drinking timing, whether this accuracy varies by drinking quantity, and whether useful drinking detection metrics can be developed that are robust to varying field conditions.

To address these gaps and advance the science, we conducted a field-based study of the Skyn with heavy-drinking young adults. We sought to determine the broad acceptability (i.e., tolerability) of field-based Skyn use, test its accuracy to detect drinking, and determine whether automated scoring metrics to detect drinking could be used going forward. We hypothesized that the Skyn would compare favorably on tolerability metrics to the SCRAM CAM and a mobile diary report method, detect drinking with similar accuracy as the SCRAM CAM worn by the same individuals in the present study and reported from prior studies, and lead to the development of drinking detection rules that could be used across multiple cohorts. The sample was young adults who reported recent heavy drinking and tested positive on a validated alcohol screener (DeMartini and Carey, 2012) in a naturalistic setting.

Material and methods

Reporting follows STARD guidelines (Bossuyt et al., 2015; Supplement #1).

2.1. Study design

This investigation was a planned sub-study of a parent trial in which participants completed 14 days of monitoring with the SCRAM CAM and a sleep biosensor and daily smartphone diaries under instructions to drink as usual (NCT03658954 [Fucito et al., 2021]). They returned to the office weekly for psychoeducation and feedback from the devices on drinking and sleep behaviors (Figure 1). For the current sub-study, a subset of participants were fit for a Skyn Generation 2 watch on their dominant wrist as a third biosensor at the intake visit which was synced and recharged at the day 8 visit. To avoid bias, no field charging/syncing was permitted so one battery life duration per week was collected after which participants removed it.

Figure 1.

Figure 1.

Single participant timeline. In some cases, participants only completed one of the two Skyn-wearing periods due to inventory.

The first 65 self-reported drinking episodes almost always (k=62, 94%) started after 12:00pm and often (k=35, 54%) continued past 12:00am, so we demarcated the data by “social days” (12:00pm – 12:00pm) rather than calendar days. A total of 158 social days were recorded which were truncated on the front end for study start times after 12:00pm (latest 12:15am) or the back end if the Skyn battery died before 12:00pm the next day (earliest 2:50am). On average, these days contained 21.8 (SD=2.8) hours of Skyn data. Initially 171 days were collected but 5 (2.9%) were removed because the SCRAM CAM detected a manual tamper or major alcohol contaminant (defined in Sect. 2.5.1.3), 4 (2.3%) because the SCRAM CAM had a technical malfunction, 3 (1.8%) because research staff missed a Skyn download, and 1 (0.6%) because drinking self-report was not completed. This extremely low rate of self-report noncompletion was reflective of the same metric in the parent trial (<4%). In summary, the average participant in the 14-day study (total 658 person-days) provided 3.4 days of Skyn data (158 person-days), with 97.4% of the nonrecorded days attributable to battery life limitations.

The Yale University Institutional Review Board approved the study. All volunteers signed the informed consent document. Participants were compensated up to US $311 for completing procedures for the parent trial including up to $3 per day for adherence to sensors and diaries.

2.2. Participants

We recruited participants via paid social media advertising and flyers posted in the community (Ash et al., 2020b). Enrolled participants were heavy drinkers (≥3 heavy drinking occasions [≥4 standard drinks for women, ≥5 for men; Corbin et al., 2014] in the last 2 weeks and Alcohol Use Disorders Identification Test Consumption Questions scores ≥5 for women and ≥7 for men indicative of risk of harm from drinking [DeMartini and Carey, 2012]) aged 18–25 years who read and understood English, owned smartphones, and were willing/able to complete daily diaries and wear alcohol biosensors. Participants were excluded for: (1) current, severe alcohol use disorder (e.g., withdrawal symptoms), (2) substance use disorder or positive urine drug screen for substances other than marijuana, (3) sleep disorders, (4) night or rotating shift work, (5) current enrollment in alcohol or sleep treatment, or (6) medical conditions preventing SCRAM CAM use (circulation problems, neuropathy, deep vein thrombosis, leg ulcers, tendonitis, diabetes, pregnancy, history of swelling, metal allergies, pacemaker, or any other implanted medical device).

The parent trial enrolled 120 participants who started the intervention (Figure 2). Among them, 69 enrolled when we could access Skyn wristwatch devices (between August 2, 2019 and February 24, 2021). Among this subset, participants were excluded as follows: 3 declined sustained wearing of the SCRAM CAM, 1 had a SCRAM CAM technical malfunction during the entire wear period, and 1 did not comply with diary self-report. We initially excluded a further 18 participants because their randomization group did not include daily diary self-report but reinstated 3 of them because they completed the parent trial timeline followback (TLFB) follow-up assessment at the day 15 visit which we have found typically has excellent day-level agreement with the prior 14 days’ diaries. The other 15 / 18 participants in this randomization group were excluded because they did not complete the follow-up TLFB until the day 28 visit, at which point its day-level agreement with the day 1–14 diaries was not acceptable. After these exclusions, the remaining 47 participants were consecutively included in this sub-study.

Figure 2.

Figure 2.

Flow of participants.

We used data from the first sequential 27 participants (Cohort #1, 101 person-days collected August 2019–March 2020 before the COVID-19 pandemic) to explore and establish classification rules for the Skyn. Data from the remaining 20 participants (Cohort #2, 57 person-days collected September 2020–March 2021 during the COVID-19 pandemic) were used to test these Skyn rules and compare performance to the SCRAM CAM. COVID-19 pandemic restrictions limited the Cohort #2 sample size.

2.3. Instructions and screening

Participants were instructed to arrive at office visits with 0.00% blood alcohol levels, confirmed by breathalyzer (Cohort #1) or self-report (Cohort #2). They were also instructed to avoid alcohol-containing personal hygiene and household cleaning products (e.g., mouthwash, medicinal alcohol) throughout the study. Participants were told to contact research staff if having any intolerable discomfort with devices so staff could travel to adjust the device or offer the option of removal, though no such removals were made.

The Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition assessed current and past substance use disorders and other current psychiatric diagnoses (First et al., 2014). Other criteria were assessed by self-report.

2.4. Self-reported alcohol use

2.4.1. Timeline followback (TLFB)

At baseline, the 30-day Timeline Followback Interview (TLFB) (Sobell, L. C. et al., 2003) was administered by a research assistant to obtain alcohol use frequency and quantity data. Participants were taught standard drink definitions (14g ethanol) using visual aids and with personalized guidance based upon concentration and container size of typical drinks.

2.4.2. Daily diary

Sensor readings and self-report data assessed the same drinking events during which no clinical interventions occurred. Participants received a daily prompt via smartphone text to a mobile-friendly webform (Qualtrics, Provo, UT) at their expected wake time each morning, with up to two reminder prompts sent as needed (0.5hr and 2.5hr later). When participants indicated they drank yesterday, they were presented questions from the standardized and validated Drinking Self-Monitoring Log (Sobell, M. B. and Sobell, 1993): “1) Yesterday, what time did you START drinking alcohol? 2) What time did you STOP drinking alcohol? 3) How many TOTAL standard alcoholic drinks did you have?” Standard drink definitions from the TLFB interview were reiterated. Diary responses were exported from Qualtrics into an SPSS master database where each response represented one case. We inserted an additional variable indicating the social day (i.e., 12pm-12pm) referenced by the response, which was calculated as the date stamp of the response minus 1. This validated diary was chosen as reference standard based on literature precedent (Roache et al., 2019; Karns-Wright et al., 2018; Barnett et al., 2017) and its close temporal proximity to the sensor measurements.

2.4.3. Drinking classification

Using established national criteria, heavy drinking was defined as ≥4 standard drinks on 1 occasion for women, ≥5 standard drinks on 1 occasion for men (Corbin et al., 2014). Above-moderate drinking was considered >1 standard drink on 1 occasion for women, >2 standard drinks on 1 occasion for men (U.S. Department of Agriculture and U.S. Department of Health and Human Services., 2020). Any drinking was defined as ≥1 standard drink on 1 occasion.

2.5. Transdermal alcohol

2.5.1. SCRAM CAM

2.5.1.1. Inventory.

We purchased 10 SCRAM CAM devices in November 2018 for use with both cohorts, though some were replaced due to device failure. Two SCRAM CAMs were replaced before Cohort #1, two during Cohort #1, and three during Cohort #2.

2.5.1.2. Data collection.

The SCRAM CAM is worn just above the ankle. Its compartment (at the back of Figure 3) contains a fuel cell that produces an electric current when exposed to alcohol, as well as a pump that pushes air vapor from the skin to the fuel cell every 30min. These items are protected by a metal faceplate that rests on the participant’s skin and is replaced between participants.

Figure 3.

Figure 3.

SCRAM CAM (left), BACtrack Skyn (middle), and a US quarter (right, for size reference). Adapted with permission (Wang et al., 2019).

The SCRAM CAM device was attached and adjusted for fit in the laboratory by research staff. Participants were asked to not remove the device for any reason except a medical emergency. They were also instructed to avoid submerging the device in water, shower normally, use no products other than soap and water in the area around the bracelet, but clean around and under the device daily with mild soap and inspect for any redness, irritation, or bruising. The device was linked to the participant’s anonymous ID number on the manufacturer’s Scramnet cloud platform and uploaded weekly by magnet and USB-connected plastic receptacle. Participants were reminded their participation was not judicial-mandated, data would not be reported to any legal or other bodies, and there were no penalties for drinking.

2.5.1.3. Drinking classification.

For each participant, a csv file containing the individual readings and demarcation of drinking episodes was downloaded from Scramnet. Scramnet defines drinking events as ≥3 transdermal alcohol concentration (TAC) points above the minimum value of 0.02g/dL, rise from baseline to peak (i.e., absorption) <0.05g/dL, and fall from peak to baseline (i.e., elimination) <0.025g/dL (if peak <0.15g/dL) or <0.015 (if peak >0.15g/dL). Scramnet’s subsequent analysis of skin temperature and infrared light projection classified the event as (1) true drinking, (2) non-drinking events with minor environmental alcohol contamination, or (3) data invalidated by major environmental alcohol contamination, physical interferents such as clothing, device tampering, device non-wear, or technical malfunction. A column corresponding to the SCRAM CAM was added to SPSS database, in which social days containing a confirmed drinking episode were coded as a SCRAM CAM-registered drinking day, those containing any invalid readings were excluded (Sect. 2.1), and all others were coded as SCRAM CAM-registered non-drinking days.

2.5.2. Skyn

2.5.2.1. Inventory.

Cohort #1 used 6 Skyn Generation 2 devices shipped in August 2019. Cohort #2 used 5 Skyn Generation 2 devices shipped in July 2020 and another 5 shipped in January 2021.

2.5.2.2. Data collection.

The Skyn is worn on the wrist like a watch. Its small compartment (at the back of Figure 3) contains a fuel cell that produces an electric current when exposed to alcohol, similar to the SCRAM CAM but reliant on passive airflow through a thin semipermeable membrane that is recorded every 20 seconds, rather than active pumping across a metal faceplate. It reports μg of alcohol per L of air.

The Skyn was attached and adjusted for fit in the laboratory by research staff. The Skyn is not waterproof so participants were asked to remove it for tasks including showering or other activities that would expose it to water. They were asked to maintain and monitor the area of wrist device placement similarly to the SCRAM CAM: clean daily with mild soap, inspect for any adverse reactions, and notify research staff in these instances. At the next office visit, the device was synced to the manufacturer’s Skyn App from which we downloaded a csv file containing timestamped TAC readings. Unlike Scramnet, the Skyn App currently does not provide any automated scoring of TAC readings into drinking recognition.

2.5.2.3. Drinking classification.

We established field-based Skyn device rules in Cohort #1 (n=27 participants, k=101 person-days) and then validated them in Cohort #2 (n=20 participants, k=57 person-days). The parameters used by Scramnet to define drinking events (peak, absorption, elimination; Sect 2.5.1.3) are hallmarks of distinguishing physiological TAC from confounders (Barnett et al., 2014; Roache et al., 2015; Karns-Wright et al., 2018; Roache et al., 2019). Therefore, we initially proposed to develop a Skyn drinking detection algorithm incorporating these parameters. However, unlike the SCRAM CAM which has a constant baseline at zero, Skyn readings constantly fluctuate even in the absence of TAC. Thus, choosing baseline values upon which to calculate absorption and elimination was equivocal (Supplement #2). We also considered machine learning leveraging the Skyn’s higher sampling frequency, however breath alcohol data to train machine learning (Fairbairn et al., 2020) was unavailable in the field setting. Nonetheless, day-level analysis was possible from parameters including peak value (PEAKTAC) and area under the curve (AUCTAC), normalized to 24hr in the case of truncated days. PEAKTAC was subsequently excluded because AUCTAC had higher area under the curve of the ROC (AUCROC) than PEAKTAC and rendered PEAKTAC insignificant in exploratory multiple variable logistic regression models. A threshold AUCTAC value was chosen from the ROC to achieve conditional optimization of sensitivity, the condition being at least ~90% specificity as per literature standards for non-judicial settings where up to 10% false positives are acceptable (Barnett et al., 2014; Karns-Wright et al., 2018; Roache et al., 2019). AUCTAC exceeding this threshold was established as the rule for future prediction of drinking events. Therefore, a column corresponding to the Skyn was added to the SPSS database, in which social days exceeding the chosen AUCTAC threshold were coded as a Skyn-registered drinking day, and all others as a Skyn-registered non-drinking day. Although the number of available days varied by participant (range 1 to 6, median 3, mean 3.4), the rules were not adjusted for within-participant correlations because we sought rules to be transferable across participants.

We then repeated the analysis (conditional optimization of sensitivity) but set the condition to 100% specificity rather than 90%. This condition aligned with the Scramnet algorithm (Sect. 2.5.1.3) which targets 100% specificity as required in judicial contexts.

Finally, we plotted data from the Skyn’s temperature sensor in a time series and inspected for abrupt, sustained decreases that may have represented episodes of device removal (Supplement #3).

2.6. Blinding

Analysis of Cohort #1 was not blinded so that we could explore and establish classification rules. Analysis of Cohort #2 was automated, so blinding was not applicable.

2.7. Survey of tolerability

At the end of monitoring on day 15, participants returned to remove the devices and complete a survey about life disruption caused by monitoring activities on a 5-point Likert scale ranging from “strongly disagree” to “strongly agree” that it was a disruption. Device questions were modified from a prior study and focused on discomfort, interference with daily activities, opportunity costs, physical adverse effects, and future willingness to use (Barnett et al., 2017). Diary questions were designed for this study and pertained to ease of completion, ease of remembering to complete, burdensomeness, opportunity costs, and future willingness to complete. Qualitative information was also obtained by open-ended written and verbal questions but not analyzed since quantitative results were conclusive and consistent with our visual inspection of qualitative responses.

2.8. Statistical analysis

2.8.1. Descriptive statistics

Demographic and self-reported drinking behaviors were compared between the cohorts by the Mann-Whitney U, chi-squared, or Fisher’s exact test as appropriate.

2.8.2. Missing data

Within the scope of the sub-study design (Figure 1), the Skyn missed a very low percentage of the expected 3 readings per minute (≤1% for 154 days and ≤10% for the remaining 4 days) compared to the 20%−30% that experts allow for more established biosensors (Troiano et al., 2008; O’Brien et al., 2013; Battelino et al., 2019). The few missing readings were intrinsically handled in analysis by the trapezoidal linear interpolation of AUCTAC calculations. Days missing SCRAM CAM data (Sect. 2.5.1.3) were excluded from analysis since all but 4 of these days reflected environmental alcohol or participant noncompliance which likely confounded Skyn data as well.

2.8.3. Alcohol detection accuracy

Using self-report as the reference standard, sensitivity (true positives÷[true positives + false negatives]) for each drinking level and specificity (true negatives÷[true negatives + false positives]) for drinking vs non-drinking were calculated for each cohort. Their precision was estimated as a 95% confidence interval by the Clopper-Pearson method. Sensitivity and specificity results were judged in comparison to the best prior sensitivity and specificity demonstrated by the SCRAM CAM in the literature. These values were 93% sensitivity to heavy drinking, 73% sensitivity to above-moderate drinking, 65% sensitivity to light drinking, and 90% specificity for drinking versus non-drinking (Barnett et al., 2014; Karns-Wright et al., 2018; Roache et al., 2019).

2.8.4. Exploratory analysis of durability

Since devices were re-used and device performance could be a function of device age, data captured within versus outside 3 months of factory shipping (i.e., “durability”) were visually compared for sensitivity and specificity, as an exploratory analysis when permitted by sample size.

2.8.5. Survey of life disruption/acceptability

We compared the satisfaction survey items across SCRAM CAM, Skyn, and mobile diary, by a non-parametric approach (Friedman test with post hoc pairwise Wilcoxon Signed Rank Tests) due to observed skewness. Items corresponding to a specific discomfort or interference (e.g., interference with exercise) were tabulated for the proportion answering “agree” or “strongly agree” and compared by McNemar’s test. Significance was set at p <0.05.

2.8.6. Sample size

The sample size was determined by consecutive sampling around logistical constraints (section 2.2, paragraph #2).

2.8.7. Software

AUCTAC and PEAKTAC were calculated using R Studio v1.4.1106 (MESS package). Group-level analyses were conducted using SPSS v26 (Chicago, IL) except 95% confidence intervals of proportions were calculated by R Studio (binom.test function).

Results

3.1. Demographics

The sample had a slight majority of men and college enrollment, and a large majority of white race and non-Hispanic ethnicity (Table 1). Demographics and drinking behaviors were not statistically different by cohort (p > .20).

Table 1.

Descriptive Statistics

Cohort #1 (n=27) Cohort #2 (n=20) Difference P
Age 21 (20, 23) 21 (20, 22) U = 267.0 .95
Female gender 9 (33%) 9 (45%) χ2 = 0.7 .42
Race
 White 21 (78%) 19 (95%) Fisher’s Exact .21
 Asian 1 (4%) 0 (0%)
 Black 3 (11%) 0 (0%)
 Multiple 1 (4%) 0 (0%)
 Other 1 (4%) 1 (5%)
Ethnicity (non-Hispanic) 23 (85%) 15 (75%) χ2 = 0.8 .38
Marital status (single) 27 (100%) 20 (100%) Identical Identical
Employment status
 Student 16 (59%) 15 (75%) χ2 = 1.3 .26
 Employed full-time 5 (19%) 4 (20%)
 Employed part-time 3 (11%) 1 (5%)
 Unemployed 3 (11%) 0 (0%)
Current college enrollment 16 (59%) 15 (75%) χ2 = 1.3 .26
Drinking during 14 days before baseline
 Average drinks per drinking day 5 (4, 7) 5 (4, 6) U = 243.5 .57
 % of days with drinking 50 (43, 64) 64 (45, 71) U = 317.5 .30
 % of days with heavy drinking 29 (29, 43) 36 (29, 48) U = 287.0 .70

Given as median (interquartile range) or n (%).

3.2. Self-report

The 158 person-days contained 89 days with any self-reported drinking (56%), of which 69 were days with “above-moderate” drinking (44%) though only 45 of those days were “heavy” drinking days (29%).

3.3. Skyn

For cohort #1, the AUCROC was fair at .70 which resulted in poor sensitivity for the different drinking levels (39% for any drinking up to 47% for heavy drinking) for the condition of 90% specificity. However, visual inspection of the SPSS database revealed that AUCROC was higher when restricted to datapoints within 3 months of device shipment (.87) versus ≥3 months from shipment (.79). We therefore based AUCTAC thresholds upon the former subset only, resulting in fair to excellent sensitivity for the different drinking levels (69%−94%) for the required specificity (91%) among Cohort #1. Testing these thresholds in Cohort #2 also yielded fair to excellent sensitivity for the different drinking levels (63%−100%) but specificity (70%) was far below the 90% condition (Table 2). We were, however, able to calculate a separate AUCTAC threshold within Cohort #2 that had comparable specificity and sensitivity to above-moderate and heavy drinking as Cohort #1 (Table 2). In summary, internal classification thresholds meeting literature standards were derived within both cohorts but they varied widely between the two cohorts (56 vs 173 hour•μg/L).

Table 2.

Accuracy results

Skyn rules developed on Cohort #1 and tested on Cohort #1 Skyn rules developed on Cohort #1 and tested on Cohort #2 Skyn rules developed on Cohort #2 and tested on Cohort #2
Skyn for condition of 90% specificity*
AUCTAC Cutoff (hour•μg/L) 56 56 173
Sensitivity
 Any drinking 22/32, 69% [50%, 84%] 19/30, 63% [44%, 80%] 14/30, 47% [28%, 66%]
 Above moderate drinking 22/28, 78% [59%, 92%] 16/18, 89% [65%, 99%] 14/18, 78% [52%, 94%]
 Heavy drinking 17/18, 94% [73%, 100%] 11/11, 100% [72%, 100%] 10/11, 91% [59%, 100%]
Specificity for any drinking 20/22, 91% [71%, 99%] 19/27, 70% [50%, 86%] 24/27, 89% [71%, 98%]
Skyn for condition of 100% specificity*
AUCTAC Cutoff (hour•μg/L) 106 106 374
Sensitivity
 Any drinking 19/32, 59% [41%, 76%] 19/30, 63% [44%, 80%] 12/30, 40% [23%, 59%]
 Above moderate drinking 19/28, 78% [48%, 84%] 16/18, 89% [65%, 99%] 12/18, 67% [41%, 87%]
 Heavy drinking 16/18, 89% [65%, 99%] 11/11, 100% [72%, 100%] 9/11, 82% [48%, 98%]
Specificity for any drinking 22/22, 100% [85%, 100%] 21/27, 78% [58%, 91%] 27/27, 100% [87%, 100%]
SCRAM CAM rules from Alcohol Monitoring Systems
Sensitivity
 Any drinking 13/32, 41% [24%, 59%] 16/30, 53% [34%, 72%] 16/30, 53% [34%, 72%]§
 Above moderate drinking 13/28, 46% [28%, 66%] 13/18, 72% [47%, 90%] 13/18, 72% [47%, 90%]§
 Heavy drinking 9/18, 50% [26%, 74%] 10/11, 91% [59%, 100%] 10/11, 91% [59%, 100%]§
Specificity for any drinking 22/22, 100% [85%, 100%] 27/27, 100% [87%, 100%] 27/27, 100% [87%, 100%]§

Cohort #1 contained k=101 person-days over 6 months, but visual inspection revealed reduced accuracy after the first 3 months so only the first k=54 person-days were part of rule development.

Cohort #2 contained k=57 person-days over 6 months.

AUCTAC, area under the transdermal alcohol concentration curve.

Heavy drinking, ≥4 standard drinks for women, ≥5 standard drinks for men

Above-moderate drinking, >1 standard drink for women, >2 standard drinks for men

Any drinking, ≥1 standard drink

Bracketed numbers represent 95% confidence intervals.

*

Skyn rules classify drinking based on exceeding the AUCTAC cutoff developed from the indicated cohort (described in section 2.5.2.3)

SCRAM CAM rules classify drinking based on a combination of TAC peak value, absorption slope, and elimination slope (described in Section 2.5.1.3).

Both this column and the previous column represent Skyn rules developed on the same cohort (i.e., Cohort #1) so they have the same AUCTAC cutoff

§

Both this column and the previous column represent the SCRAM CAM rules applied to the same cohort (i.e., Cohort #2) so they have the same results

We then repeated the analysis with AUCTAC thresholds chosen to optimize sensitivity under the condition of 100% specificity, rather than 90% as in the first analysis. This second analysis produced a similar pattern as the first analysis: the threshold for Cohort #1 (106 hour•μg/L) yielded unacceptable specificity when tested in Cohort #2 (78%), but it was possible to achieve acceptable specificity in Cohort #2 by choosing a higher threshold (374 hour•μg/L) (Table 2). As expected, this second analysis that required higher specificity had lower sensitivity than the first analysis. Each cohort had 1 heavy drinking episode and 2–3 above-moderate drinking episodes that were detected in the first analysis but missed in the second analysis.

Inspecting the temperature time series in Cohort #1 (Supplement #2, Figure S2) revealed 12 potential non-wear episodes totaling 66.2 hours (2.8% of total monitored hours). Cohort #2 featured 1 potential non-wear episode totaling 2.3 hours (0.2% of total monitored hours).

3.4. SCRAM CAM

The SCRAM CAM had 100% specificity for both cohorts (Table 2). In Cohort #1 its sensitivity was poor for all drinking levels (41% for any drinking up to 50% for heavy drinking). In Cohort #2 its sensitivity was again poor for any drinking (53%), but comparable to literature standards for above-moderate (72%) and heavy drinking (91%).

3.5. Survey of tolerability

Declining to wear the SCRAM CAM caused exclusion of 5 otherwise eligible participants (2 pre-randomization, 3 post-randomization; Figure 2) while nobody declined to wear the Skyn. Among the 47 included participants, interference with exercise, sleep, and clothing choice were each noted by a large fraction (32%−51%) for the SCRAM CAM but virtually none (0%−2%) for the Skyn (Table 3). Participants’ average ratings for the burden, adverse effects, and aesthetic effects of wearing the biosensors were very low to low for the Skyn and low to moderate for the SCRAM CAM (Table 3). Participants perceived greater opportunity cost from wearing the SCRAM CAM than either wearing the Skyn or completing the mobile diary. Among all three alcohol use monitoring tools, the Skyn was rated better on overall burden and willingness to use in the future than the SCRAM CAM or diary.

Table 3.

Tolerability Results (N=47)

Skyn SCRAM CAM Mobile Diary Test Statistic P
Life disruptions*
Physical discomfort 2 (4%) 25 (53%) N/A§ McNemar’s test <.001
Social discomfort 0 (0%) 19 (40%) N/A§ McNemar’s test <.001
Interference with daily activities
 Work 0 (0%) 5 (11%) N/A§ McNemar’s test .06
 Exercise 1 (2%) 18 (38%) N/A§ McNemar’s test <.001
 Sleep 1 (2%) 15 (32%) N/A§ McNemar’s test .001
 Ability to concentrate 0 (0%) 4 (9%) N/A§ McNemar’s test .13
 Clothing choice 0 (0%) 24 (51%) N/A§ McNemar’s test <.001
 Noticeable in any way 6 (13%) 24 (51%) N/A§ McNemar’s test <.001
Opportunity cost (1–5 scale) 1.8 (1.1) 3.0 (1.4) 1.7 (1.0) χ2 (2) = 34.3 <.001
Overall burden (1–5 scale) 1.6 (0.5) 2.8 (0.9) 2.7 (0.3) χ2 (2) = 59.4 <.001
Adverse effects (1–10 scale)
Itching 1.6 (1.4) 2.9 (2.1) N/A W = 3.4 .001
Sweating 1.5 (1.1) 3.0 (2.0) N/A W = 4.3 <.001
Skin Irritation 1.6 (1.5) 3.6 (2.8) N/A W = 4.0 <.001
Marks on skin 7 (15%) 22 (47%) N/A McNemar’s test <.001
Willing to wear in the future (1–5 scale) 4.3 (0.9) 3.7 (1.2) 3.7 (1.0) χ2 (2) = 17.3 <.001

Values given as mean (SD)

*

N (%) answering “agree” or “strongly agree”

Different than SCRAM CAM (P < .01)

Different than diary (P < .01)

§

These items were previously (Barnett et al., 2017) designed for tracking devices and often unapplicable to the diaries. For example, a diary would not cause physical discomfort. Therefore, diaries were evaluated by a separate set of questions we designed (ease of completion, ease of remembering to complete, enjoyability, burdensomeness).

Calculated as average of 5-point responses to prior rows and diary-specific questions mentioned in previous footnote.

The possible inconvenience of removing the Skyn when interacting with water was not directly queried, but only 4 / 47 participants (9%) noted water as an inconvenience on free responses.

Discussion

4.1. Main Findings

The purpose of this study was to determine the tolerability and accuracy of the Skyn alcohol biosensor in relation to the current standard SCRAM CAM device. We operationalized these criteria and then evaluated them among treatment-seeking young adult high-risk drinkers who freely consumed alcohol in their natural environment, since prior studies had already completed validation among social drinkers with standardized drinking timing. The Skyn’s tolerability compared extremely favorably overall to the SCRAM CAM device and the alternative option of keeping a mobile daily diary. Furthermore, the Skyn was perceived as less disruptive than the SCRAM CAM upon specific behavioral variables that are useful to monitor concurrently with drinking and may confound drinking if they are disturbed, such as sleep (Fucito et al., 2015; Fucito et al., 2017; Ash et al., 2020b; Fucito et al., 2021), exercise (Weinstock et al., 2016), and social comfort (Book and Randall, 2002).

As for Skyn accuracy, we were able to identify a TAC threshold (quantified as AUCTAC) in each cohort that was typically exceeded on drinking days but not non-drinking days. The accuracy metrics achieved based upon this classification system were comparable to literature standards for TAC sensors: 91%−94% sensitivity to heavy drinking (literature standard 93%), 78% sensitivity to above-moderate drinking (literature standard 73%), 47%−69% sensitivity to any drinking (literature standard 65%), and 89%−91% specificity for drinking vs non-drinking (literature standard 90%). Even when we set a condition of 100% specificity to simulate judicial system requirements, sensitivity to heavy drinking (the typical level of drinking detectable by judicial system monitors) was still good at 82%−89%. Unfortunately, however, these findings did not translate into robust drinking detection rules because the TAC threshold differed widely between the two cohorts. Specifically, the threshold for cohort #1 was much lower than cohort #2, so application of the cohort #1 threshold to cohort #2 resulted in low specificity (i.e., frequent false positives). Furthermore, even within cohort #1 there was inconsistency that could only be resolved by discarding data collected more than 3 months after device shipment. Further refinement to the Skyn device could troubleshoot this wide variation of TAC thresholds, so that future studies can develop robust rules that can convert Skyn TAC data to automated detection of drinking versus non-drinking. In addition, we demonstrated that the Skyn’s high sampling frequency was robust to field conditions, which could be leveraged for more sophisticated analyses, such as TAC change over time after resolving fluctuation issues (Supplement #2) or machine-learning upon the rich time-series features (Fairbairn et al., 2020; Fairbairn and Bosch, 2021).

Such problems of inconsistent specificity did not arise with the SCRAM CAM. The AMS Scramnet scoring rules yielded 100% specificity in both cohorts. Although, the AMS Scramnet rules had in some cases poor sensitivity (e.g., just 50% to heavy drinking in cohort #1). Future studies applying more sophisticated analyses to Skyn data should also consider more sophisticated analyses that optimize SCRAM CAM sensitivity (Roache et al., 2019; Barnett et al., 2014). An incidental finding was that the SCRAM CAM sensitivity increased over the course of the study (i.e., higher for cohort #2 than cohort #1). We might speculate that the implementation of COVID-19 social distancing requirements (which occurred between cohort #1 and cohort #2) could have reduced potential confounders like physical movement and environmental alcohol at social gatherings. Physical movement could have been further reduced for cohort #2 versus cohort #1 since habitual physical activity is lower in colder seasons (Belanger et al., 2009). Regardless of its explanation, the improvement of SCRAM CAM accuracy over time reinforces that the deterioration of accuracy over time was unique to the Skyn.

Thus, the present study reveals important findings about the longevity of new generation alcohol sensors. Manufacturers may consider further development before incurring the burden of sophisticated machine-learning analyses and the sample sizes that must complete field-based protocols to have adequately-sized training and testing cohorts (~70 to 100 participants) (e.g., Fairbairn et al., 2020; Suchting et al., 2019).

4.2. Comparison to Previous Research

Previous researchers issued the Skyn to predominantly non-treatment-seeking, social drinkers in a laboratory (Fairbairn and Kang, 2019; Fairbairn et al., 2020) or research staff members in a naturalistic setting (Wang et al., 2021) and asked them to drink at a standard, preset time. These designs enabled the measurement of the time delay between drinking start and TAC rise from baseline and demonstrated that TAC rise from baseline was automatically and reliably detected by MATLAB Changepoint or machine learning (Fairbairn and Kang, 2019; Fairbairn et al., 2020). However, in these prior studies, there was no need for rules to classify whether TAC rise represented drinking or artefact, since there was a priori knowledge that drinking occurred. One prior study conducted a preliminary inspection of Skyn readings in the field from individuals outside their research team, but encountered many anomalies visually and did not attempt a quantitative system for detecting drinking and inform automated scoring metrics going forward. All of these prior authors have flagged a need for systematic and robust rules that can distinguish whether Skyn TAC rise represents drinking or artefact under field conditions with varying drinking timing and environmental contaminants (Fairbairn and Kang, 2019; Fairbairn et al., 2020; Wang et al., 2021).

Our study responded to this call by testing the Skyn’s accuracy to detect drinking by a systematic, objective method, then determining whether this method could lead to automated drinking classification rules going forward. In addition, our population was heavy-drinking young adults who have even wider day-to-day variation and drinking magnitude than the socially-drinking adults of various ages in the prior study (Wang et al., 2021). These objectives were previously pursued extensively for the SCRAM CAM (Barnett et al., 2014; Roache et al., 2015; Karns-Wright et al., 2018; Roache et al., 2019) and researchers derived rules that classify SCRAM CAM TAC departures from baseline as true versus false drinking. We sought to develop similar classification ability for the Skyn, which was achieved using an AUCTAC threshold. This threshold could distinguish true versus false drinking, but was unstable over device usage and across manufacturing lots.

This study was not designed to assess reasons for such instability, but it is worthwhile to consider that the Skyn’s thinner membrane than the SCRAM CAM may be more susceptible to 1) contaminants that produce acute artefacts, 2) wear and tear over time, and/or 3) physical alterations by the manufacturer between device lots. Acute artefact readings were minimal since the Skyn could accurately classify drinking within the same manufacturing lot used over a short time window (~3 months). However, wear and tear over time is a possibility since devices were worn much longer (115 wear hours/participant*52 participants÷16 devices = 374 wear hours/device) without direct observation of their handling compared to the largest prior study (6 wear hours/participant*110 participants÷5 devices = 132 wear hours/device) (Fairbairn and Kang, 2019; Fairbairn et al., 2020). Although the lack of acute artefact readings is consistent with participants generally following our instructions to avoid exposing the Skyn to water and alcohol-containing household products, single exposure events such as applying sunblock may impede the Skyn membrane’s permeability and make the sensor ineffective (Wang et al., 2019). Mechanical damage from dropping or hitting it is also possible. Lastly, even with proper handling, the Skyn membrane may wear down naturally and benefit from being replaced periodically. In our data, we see a decline in Skyn performance at 3 months which is the timepoint at which the SCRAM CAM requires users to replace its membrane. Data completeness from the Skyn was much improved from past reports. These reports were missing 30% of the data due to unknown reasons, human researcher errors related to the delicate prototype interface, or batteries dying within the 6-hour monitoring period (Fairbairn and Kang, 2019; Fairbairn et al., 2020). In contrast, the present study had almost no missing data until batteries died after 24–72 hours of field use. Nonetheless, battery life requires further improvement before meeting the 72 hours as advertised by the manufacturer.

Consistent with a prior study among heavy-drinking young adults (Barnett et al., 2017), our participants reported the SCRAM CAM was sometimes disruptive to physical and social comfort, sleep, exercise, clothing choice, and carried moderate overall opportunity cost and burden. In fact, it was reported as equally burdensome as active self-monitoring with a mobile daily diary. Likewise, in line with a prior study, the SCRAM CAM somewhat commonly caused adverse effects like itching (Barnett et al., 2017). Although prior studies indicate that not all participants experience these problems with the SCRAM CAM (Barnett et al., 2017; Alessi et al., 2017), we took the novel step of directly comparing the Skyn to the SCRAM CAM on these points and found the former was much less likely to cause life disruption in all these areas and adverse effects were much more minor and infrequent. Also, the proportions of our participants experiencing SCRAM CAM disruption to exercise, sleep, and clothing choice were substantial (32%−51%) and concerning when considering that monitoring multiple health behaviors drives engagement in our programs (Fucito et al., 2015; Fucito et al., 2017; Ash et al., 2020b; Fucito et al., 2021). The Skyn clearly offers potential to eliminate such confounding as it disrupted virtually nobody (0%−2%) in these areas.

4.3. Implications for practice

TAC sensors to date have been primarily used for judicial-ordered monitoring, but several research studies in the past decade have explored their use with other drinking populations to motivate and support alcohol self-management. For instance, in three studies, high-risk non-treatment-seeking adult drinkers wore the SCRAM CAM and received contingent financial reinforcement for reducing or abstaining from drinking; the results showed less frequent and severe drinking that persisted in some cases beyond removal of the reinforcements (Dougherty et al., 2014; Barnett et al., 2017). Similarly, we recently demonstrated that offering tracking of alcohol use in conjunction with sleep tracking via biosensors generated strong engagement among non-treatment-seeking heavy-drinking young adults relative to typical uptake rates for alcohol trials with this cohort (Ash et al., 2020b). TAC sensor technology also has potential to support mobile behavioral strategies such as delivering tailored support messages timed to moments of vulnerability to relapse (i.e., just-in-time adaptive interventions) (Wang and Miller, 2020) and evaluating and refining alcohol treatment interventions based upon resultant long-term drinking changes measured objectively rather than self-report (Dougherty et al., 2014; Barnett et al., 2017). However, these novel intervention strategies necessitate longer-term, consistent uptake of these devices, which our findings suggest may be strong for the Skyn, but only moderate for the SCRAM CAM. Moreover, the SCRAM CAM was more disruptive to health behaviors (i.e., exercise, sleep) and activities of daily living, which limits its potential for tailored and adaptive alcohol interventions and personalized feedback strategies on alcohol-health associations (Fucito et al., 2015; Fucito et al., 2017; Ash et al., 2020b; Fucito et al., 2021; Weinstock et al., 2016).

Nevertheless, our results in combination with prior research findings suggest that passive alcohol use self-monitoring tools are appealing (Wang et al., 2021), effective at promoting alcohol behavior change (Dougherty et al., 2014; Barnett et al., 2017), and may be preferred over more active self-monitoring methods such as daily diaries. Given the better implementation potential of the Skyn and next generation TAC sensors, more research is needed on these devices. To this aim, we took the novel step of testing Skyn accuracy to detect drinking at the day level. We developed a mathematical approach that could detect drinking, but unfortunately the derived rules were not robust over time and across device lots. If this limitation could be addressed by improved hardware and analytics (c.f. Sect 4.1), future research should explore the Skyn for detecting not only the presence but also the timing of drinking to test its momentary relationship to critical events like sleep and driving.

4.4. Limitations

Some study limitations should be noted. The Skyn’s availability and battery life limited the fraction of days in the parent trial that could be analyzed. However, there is no reason to suspect these factors introduced bias. The Skyn does not yet have a non-wear algorithm to verify compliance. Unlike the SCRAM CAM the Skyn does not detect infrared light projection onto the skin. However, the good sensitivity to detect drinking when using rules established internal to the cohort suggests that accuracy issues were due to transferability of classification rules rather than non-wear; furthermore, temperature aberrancies suggestive of non-wear were extremely uncommon. Although our sample size was larger than the prior field-based study of the Skyn (Wang et al., 2021) it was still relatively small when divided into training (Cohort #1) and testing (Cohort #2), which could have influenced the robustness of the derived thresholds and confidence interval widths. A larger sample would have also permitted evaluation of within-participant correlations and assess inter-participant variability in classification thresholds, although such findings would further preclude the objective of finding rules transferable across participants. In addition, although demographic factors (sex, age, race, ethnicity, drinking risk level) were ruled out as a confounder of our analysis since they were not statistically different between cohorts, a larger and more diverse sample is needed for better determination of generalizability.

4.5. Main conclusions

Skyn feasibility and tolerability results were excellent compared to the SCRAM CAM and an alternative procedure, keeping a mobile drinking diary. These results provide support for Skyn and similar new TAC sensor technology to the behavioral self-management space currently enjoyed by commercially available health/fitness biosensors, if the Skyn could develop accuracy competitive with the SCRAM CAM and mobile drinking diary. Accuracy results were encouraging along these lines in that we derived an approach to validly detect drinking that agreed with mobile diaries to the same or greater extent as previously reported for the SCRAM CAM across a range of drinking levels, but it did not translate into scoring metrics that could be usefully applied across timeframes >3 months among different device lots and individuals. We recommend that improved hardware and analytics resolve these issues, at which time future studies could address time delay from drinking. Most wearable technology studies use carefully maintained devices, but studies like this one can reveal durability concerns and guide next steps in transferring findings to clinical and commercial practice.

Supplementary Material

supinfo1
sup legends
supinfo2
supinfo3

Acknowledgments

The authors would like to thank Kathleen Tackos, Sydney Cannon, Jack LaVigne, Megan Benlock, Haneen Alani, Marisa Rogers, Madilyn Augustine, Allison McNeilly, Natalia Roberts, Anthony Cifre, Momoko Ishii, Jenna Burton, and Asiya Way for their contributions to the implementation of the study. The authors would like to thank Dr. Walter Roberts for expert advice on analysis and interpretation of Skyn data.

This work was supported by a grant (R34AA026021) and its supplement (R34AA026021-01A1S1) from the National Institute on Alcohol Abuse and Alcoholism, a component of the National Institutes of Health (NIH). GA was supported by a fellowship from the Office of Academic Affiliations at the United States Veterans Health Administration, a Robert E. Leet and Clara Guthrie Patterson Trust Mentored Research Award, Bank of America, N.A., Trustee, and American Heart American Grant #852679 (Garrett Ash, 2021-2024). RG was supported by Yale’s Clinical and Translational Science Award UL1TR000142 from the National Center for Advancing Translational Science, also a component of the NIH. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding sources. The funding sources had no further role in the study design, in the collection, analysis and interpretation of the data, in the writing of the report, or in the decision to submit the paper for publication.

Footnotes

Conflict of interest

The authors have no conflict of interest to declare.

References

  1. Alcohol Monitoring Systems. SCRAM CAM Continuous Alcohol Monitoring. 2021. Accessed May 20, 2021 Available from: https://www.scramsystems.com/monitoring/scram-continuous-alcohol-monitoring/.
  2. Alessi SM, Barnett NP, Petry NM (2017) Experiences with SCRAMx alcohol monitoring technology in 100 alcohol treatment outpatients. Drug Alcohol Depend 178:417–424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Alessi SM, Barnett NP, Petry NM (2019) Objective continuous monitoring of alcohol consumption for three months among alcohol use disorder treatment outpatients. Alcohol 81:131–138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ash GI, Stults-Kolehmainen M, Busa MA, Gregory R, Garber CE, Liu J, Gerstein M, Casajus JA, Gonzalez-Aguero A, Constantinou D, Geistlinger M, Guppy FM, Pigozzi F, Pitsiladis YP (2020a) Establishing a Global Standard for Wearable Devices in Sport and Fitness: Perspectives from the New England Chapter of the American College of Sports Medicine Members. Curr Sports Med Rep 19:45–49. [DOI] [PubMed] [Google Scholar]
  5. Ash GI, Robledo DS, Ishii M, Pittman B, DeMartini KS, O’Malley SS, Redeker NS, Fucito LM (2020b) Using Web-Based Social Media to Recruit Heavy-Drinking Young Adults for Sleep Intervention: Prospective Observation. J Med Internet Res 22(8):e17449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ash GI, Stults-Kolehmainen M, Busa MA, Gaffey AE, Angeloudis K, Muniz-Paros B, Gregory R, Huggins RA, Redeker NS, Weinzimer SA, Grieco LA, Lyden K, Megally E, Vogiatzis I, Schar L, Zhu X, Baker JS, Brandt C, Businelle MS, Fucito LM, Griggs S, Jarrin R, Mortazavi B, Prioleau T, Roberts W, Spanakis EK, Nally LM, Debruyne A, Bachl N, Pigozzi F, Halabchi F, Ramagole DA, Janse van Rensburg DC, Wolfarth B, Fossati C, Rozenstoka S, Tanisawa K, Borjesson M, Casajus JA, Gonzalez-Aguero A, Zelenkova I S J, Gursoy G, Meyerson W, Liu J, Greenbaum D, Pitsiladis YP, Gerstein MB (2021) Establishing a Global Standard for Wearable Devices in Sport and Exercise Medicine: Perspectives from Academic and Industry Stakeholders. Sports Med 51(11):2237–2250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Barnett NP, Celio MA, Tidey JW, Murphy JG, Colby SM, Swift RM (2017) A preliminary randomized controlled trial of contingency management for alcohol use reduction using a transdermal alcohol sensor. Addiction 112:1025–1035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Barnett NP, Meade EB, Glynn TR (2014) Predictors of detection of alcohol use episodes using a transdermal alcohol sensor. Exp Clin Psychopharmacol 22:86–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Battelino T, Danne T, Bergenstal RM, Amiel SA, Beck R, Biester T, Bosi E, Buckingham BA, Cefalu WT, Close KL, Cobelli C, Dassau E, DeVries JH, Donaghue KC, Dovc K, Doyle FJ 3rd, Garg S, Grunberger G, Heller S, Heinemann L, Hirsch IB, Hovorka R, Jia W, Kordonouri O, Kovatchev B, Kowalski A, Laffel L, Levine B, Mayorov A, Mathieu C, Murphy HR, Nimri R, Nørgaard K, Parkin CG, Renard E, Rodbard D, Saboo B, Schatz D, Stoner K, Urakami T, Weinzimer SA, Phillip M (2019) Clinical Targets for Continuous Glucose Monitoring Data Interpretation: Recommendations From the International Consensus on Time in Range. Diabetes Care 42:1593–1603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Belanger M, Gray-Donald K, O’Loughlin J, Paradis G, Hanley J (2009) Influence of weather conditions and season on physical activity in adolescents. Annals of epidemiology 19(3): 180–186. [DOI] [PubMed] [Google Scholar]
  11. Book SW, Randall CL (2002) Social Anxiety Disorder and Alcohol Use. Alcohol Res Health 26:130–135. [Google Scholar]
  12. Choi YK, Demiris G, Lin SY, Iribarren SJ, Landis CA, Thompson HJ, McCurry SM, Heitkemper MM, Ward TM (2018) Smartphone Applications to Support Sleep Self-Management: Review and Evaluation. J Clin Sleep Med 14:1783–1790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Corbin WR, Zalewski S, Leeman RF, Toll BA, Fucito LM, O’Malley SS (2014) In with the old and out with the new? A comparison of the old and new binge drinking standards. Alcohol Clin Exp Res 38:2657–2663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Davis-Martin RE, Alessi SM, Boudreaux ED (2021) Alcohol Use Disorder in the Age of Technology: A Review of Wearable Biosensors in Alcohol Use Disorder Treatment. Front Psychiatry 12:642813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Demartini KS, Carey KB (2012) Optimizing the use of the AUDIT for alcohol screening in college students. Psychol Assess 24:954–963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dougherty DM, Hill-Kapturczak N, Liang Y, Karns TE, Cates SE, Lake SL, Mullen J, Roache JD (2014) Use of continuous transdermal alcohol monitoring during a contingency management procedure to reduce excessive alcohol use. Drug Alcohol Depend 142:301–306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dougherty DM, Karns TE, Mullen J, Liang Y, Lake SL, Roache JD, Hill-Kapturczak N (2015) Transdermal alcohol concentration data collected during a contingency management program to reduce at-risk drinking. Drug Alcohol Depend 148:77–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fairbairn CE, Kang D (2019) Temporal Dynamics of Transdermal Alcohol Concentration Measured via New-Generation Wrist-Worn Biosensor. Alcohol Clin Exp Res 43:2060–2069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fairbairn CE, Kang D, Bosch N (2020) Using machine learning for real-time BAC estimation from a new-generation transdermal biosensor in the laboratory. Drug Alcohol Depend 216:108205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fairbairn CE, Bosch N (2021) A new generation of transdermal alcohol biosensing technology: practical applications, machine -learning analytics and questions for future research. Addiction (Abingdon, England), 116(10): 2912–2920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. First MB, Williams JBW, Karg RS, Spitzer RL (2014) Structured Clinical Interview for DSM-V Research Version SCID-5-RV. American Psychiatric Association Publishing, Washington, DC. [Google Scholar]
  22. Fucito LM, DeMartini KS, Hanrahan TH, Whittemore R, Yaggi HK, Redeker NS (2015) Perceptions of Heavy-Drinking College Students About a Sleep and Alcohol Health Intervention. Behav Sleep Med 13:395–411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fucito LM, DeMartini KS, Hanrahan TH, Yaggi HK, Heffern C, Redeker NS (2017) Using Sleep Interventions to Engage and Treat Heavy-Drinking College Students: A Randomized Pilot Study. Alcohol Clin Exp Res 41:798–809. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fucito LM, Ash GI, DeMartini KS, Pittman B, Barnett NP, Li CR, Redeker NS, O’Malley SS (2021) A Multimodal Mobile Sleep Intervention for Young Adults Engaged in Risky Drinking: Protocol for a Randomized Controlled Trial. JMIR Res Protoc 10:e26557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Goldman D (2017) Investing in the growing sleep health economy, in , M. Company, Editor. 2017: San Francisco, CA. p. 1–7. in Private Equity and Principal Investors Practice, (Anonymous), pp 1–7. McKinsey & Company, San Francisco, CA. [Google Scholar]
  26. Hoeppner BB, Schick MR, Kelly LM, Hoeppner SS, Bergman B, Kelly JF (2017) There is an app for that - Or is there? A content analysis of publicly available smartphone apps for managing alcohol use. J Subst Abuse Treat 82:67–73. [DOI] [PubMed] [Google Scholar]
  27. Hultgren BA, Scaglione NM, Buben A, Turrisi R (2020) Examining protocol compliance and self-report congruence between daily diaries and event-contingent ecological momentary assessments of college student drinking. Addict Behav 110:106471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Karns-Wright TE, Dougherty DM, Hill-Kapturczak N, Mathias CW, Roache JD (2018) The correspondence between transdermal alcohol monitoring and daily self-reported alcohol consumption. Addict Behav 85:147–152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Leffingwell TR, Cooney NJ, Murphy JG, Luczak S, Rosen G, Dougherty DM, Barnett NP (2013) Continuous objective monitoring of alcohol use: twenty-first century measurement using transdermal sensors. Alcohol Clin Exp Res 37:16–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Merrill JE, Fan P, Wray TB, Miranda R Jr (2020) Assessment of Alcohol Use and Consequences: Comparison of Data Collected Via Timeline Followback Interview and Daily Reports. J Stud Alcohol Drugs 81:212–219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. O’Brien E, Parati G, Stergiou G, Asmar R, Beilin L, Bilo G, Clement D, de la Sierra A, de Leeuw P, Dolan E, Fagard R, Graves J, Head GA, Imai Y, Kario K, Lurbe E, Mallion JM, Mancia G, Mengden T, Myers M, Ogedegbe G, Ohkubo T, Omboni S, Palatini P, Redon J, Ruilope LM, Shennan A, Staessen JA, vanMontfrans G, Verdecchia P, Waeber B, Wang J, Zanchetti A, Zhang Y, European Society of Hypertension Working Group on Blood Pressure Monitoring (2013) European society of hypertension position paper on ambulatory blood pressure monitoring. J Hypertens 31:1731–1768. [DOI] [PubMed] [Google Scholar]
  32. Roache JD, Karns TE, Hill-Kapturczak N, Mullen J, Liang Y, Lamb RJ, Dougherty DM (2015) Using Transdermal Alcohol Monitoring to Detect Low-Level Drinking. Alcohol Clin Exp Res 39:1120–1127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Roache JD, Karns-Wright TE, Goros M, Hill-Kapturczak N, Mathias CW, Dougherty DM (2019) Processing transdermal alcohol concentration (TAC) data to detect low-level drinking. Alcohol 81:101–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Sim I (2019) Mobile devices and health. N Engl J Med 956–968. [DOI] [PubMed] [Google Scholar]
  35. Sobell MB, Sobell LC (1993) Problem drinkers: guided self-change treatment. Guilford Press, New York. [Google Scholar]
  36. Sobell LC, Agrawal S, Sobell MB, Leo GI, Young LJ, Cunningham JA, Simco ER (2003) Comparison of a quick drinking screen with the timeline followback for individuals with alcohol problems. J Stud Alcohol 64:858–861. [DOI] [PubMed] [Google Scholar]
  37. Suchting R, Hébert ET, Ma P, Kendzor DE, Businelle MS (2019) Using Elastic Net Penalized Cox Proportional Hazards Regression to Identify Predictors of Imminent Smoking Lapse. Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco, 21(2):173–179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Troiano RP, Berrigan D, Dodd KW, Masse LC, Tilert T, McDowell M (2008) Physical activity in the United States measured by accelerometer. Med Sci Sports Exerc 40:181–188. [DOI] [PubMed] [Google Scholar]
  39. U.S. Department of AgricultureU.S. Department of Health and Human Services. Dietary Guidelines for Americans, 2020–2025. 9th Edition. December 2020. 2020. Accessed May 24, 2021. [Google Scholar]
  40. Wang L, Miller LC (2020) Just-in-the-Moment Adaptive Interventions (JITAI): A Meta-Analytical Review. Health Commun 35:1531–1544. [DOI] [PubMed] [Google Scholar]
  41. Wang Y, Fridberg DJ, Leeman RF, Cook RL, Porges EC (2019) Wrist-worn alcohol biosensors: Strengths, limitations, and future directions. Alcohol 81:83–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Wang Y, Fridberg DJ, Shortell DD, Leeman RF, Barnett NP, Cook RL, Porges EC (2021) Wrist-worn alcohol biosensors: Applications and usability in behavioral research. Alcohol 92:25–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Weinstock J, Petry NM, Pescatello LS, Henderson CE (2016) Sedentary College Student Drinkers Can Start Exercising and Reduce Drinking After Intervention. Psychol Addict Behav. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Worley SL (2018) The Extraordinary Importance of Sleep: The Detrimental Effects of Inadequate Sleep on Health and Public Safety Drive an Explosion of Sleep Research. P T 43:758–763. [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supinfo1
sup legends
supinfo2
supinfo3

RESOURCES