Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Jul 29.
Published in final edited form as: Clin Psychol Sci. 2016 Jul 29;4(6):939–956. doi: 10.1177/2167702616639532

Developing a Risk Model to Target High-risk Preventive Interventions for Sexual Assault Victimization among Female U.S. Army Soldiers

Amy E Street 1, Anthony J Rosellini 2, Robert J Ursano 3, Steven G Heeringa 4, Eric D Hill 2, John Monahan 5, James A Naifeh 3, Maria V Petukhova 2, Ben Y Reis 6, Nancy A Sampson 2, Paul D Bliese 7, Murray B Stein 8, Alan M Zaslavsky 2, Ronald C Kessler 2
PMCID: PMC5279920  NIHMSID: NIHMS764188  PMID: 28154788

Abstract

Sexual violence victimization is a significant problem among female U.S. military personnel. Preventive interventions for high-risk individuals might reduce prevalence, but would require accurate targeting. We attempted to develop a targeting model for female Regular U.S. Army soldiers based on theoretically-guided predictors abstracted from administrative data records. As administrative reports of sexual assault victimization are known to be incomplete, parallel machine learning models were developed to predict administratively-recorded (in the population) and self-reported (in a representative survey) victimization. Capture-recapture methods were used to combine predictions across models. Key predictors included low status, crime involvement, and treated mental disorders. Area under the Receiver Operating Characteristic curve was .83−.88. 33.7-63.2% of victimizations occurred among soldiers in the highest-risk ventile (5%). This high concentration of risk suggests that the models could be useful in targeting preventive interventions, although final determination would require careful weighing of intervention costs, effectiveness, and competing risks.

Keywords: Machine learning, military sexual trauma, prediction model, rape, risk model, sexual assault


Sexual assault (i.e., forcible or attempted rape, sodomy, fondling/touching) is a significant problem in the U.S. military (Turchik & Wilson, 2010). Recent survey data indicate that 4.9% of active duty women experience sexual assault in a given 12-month period, with just under half of those experiences being penetrative assaults (i.e., rapes) and the remainder involving physical sexual contacts that did not include penetration (Morral et al., 2014). Because of demographic differences across samples and methodological inconsistencies across studies, it is difficult to make direct comparisons on sexual assault prevalence in the military versus the civilian population. However, there is evidence that active duty women are more likely than demographically similar reserve component women (who spend less time in military settings) to experience sexual assault (Morral et al., 2015b). Sexual assault is known to be associated with significantly elevated subsequent levels of posttraumatic stress disorder, depression, and substance use disorders (Kimerling, Gima, Smith, Street, & Frayne, 2007; Street, Stafford, Mahan, & Hendricks, 2008).

The Department of Defense (DoD) has responded to the problem of military sexual assault by creating an extensive sexual assault prevention program (Secretary of Defense, 2014). Consistent with widely accepted public health perspectives on sexual assault prevention, much of DoD’s prevention program focuses on primary prevention efforts aimed at changing the behaviors of potential perpetrators (DeGue et al., 2012). However, evidence is mounting that there is also an important role for effective risk reduction programs aimed at the subset of women who have especially high risk of victimization (Senn et al., 2015). Although many sexual assault prevention programs involve relatively non-intensive universal interventions, successful large-scale prevention also requires more intensive selective interventions with high-risk individuals (Foster & Jones, 2006; Golubnitschaja & Costigliola, 2012). For such interventions to be cost-effective in preventing outcomes with low base rates, though, methods need to be developed to target high-risk individuals for program implementation. Although actuarial risk prediction tools exist to identify individuals at high risk of sexual assault perpetration (Harris, Phenix, Hanson, & Thornton, 2003; Quinsey, Harris, Rice, & Cormier, 2006; Thornton et al., 2003) and intimate partner violence victimization (Campbell, Webster, & Glass, 2009; Chan, 2012; Fengler & Winkel, 2014), none exists to identify military personnel at high risk of sexual assault victimization.

One method for developing a risk model for military sexual assault victimization would be to select the predictors from the extensive series of administrative databases available for all military personnel. These databases were recently used successfully to develop prediction models for post-hospitalization suicides (Kessler et al., 2015) and violent crime perpetration (Rosellini et al., 2016) in the U.S. Army. Models of this sort could have important practical applications because all variables identified in the risk profiles are available on a continuously updating basis for every military service member, allowing predictions of risk to be made at low cost and updated as necessary over time. The current report presents the results of an attempt to develop a prediction model of this sort for sexual assault victimization among female U.S. Army soldiers using predictors obtained from administrative databases.

A requirement for developing such a model is to impose some structure on the literally hundreds of thousands of administrative variables available for soldiers in order to make the task of analyzing the data tractable. We did this by reviewing the scientific literature on risk factors for sexual assault victimization and limiting the administrative variables considered in our analysis to those that operationalized the significant predictors in that literature. These predictors can be grouped into five categories, three of them having relevance to sexual assaults in any setting (i.e., socio-demographics, mental disorders, and prior experiences with crime) (Coxell, King, Mezey, & Gordon, 1999; Franklin, Franklin, Nobles, & Kercher, 2011; Harned, Ormerod, Palmieri, Collinsworth, & Reed, 2002; Jewkes, Sen, & Garcia-Moreno, 2002; Kimerling, et al., 2007; Merrill et al., 1999; Sadler, Booth, Cook, & Doebbeling, 2003; Tewksbury & Mustaine, 2001; Turchik & Wilson, 2010) and the other two being specific to the military (i.e., military career variables, military contextual-environmental variables) (Kimerling, et al., 2007; Sadler, et al., 2003; Turchik & Wilson, 2010).

The significant socio-demographic predictors of sexual assault victimization documented in the research literature include indicators of low sociocultural power, most notably young age, low socio-economic status, and being non-married (Harned, et al., 2002). These associations are consistent with the notion that sexual assault perpetrators often target women with low sociocultural power based on the assumption that these women will be both especially vulnerable to coercive behaviors and less likely than other women to report victimizations to authorities (Lisak & Miller, 2002). In addition, young, unmarried women are more frequently exposed than other women to high-risk social situations (e.g., dates with unfamiliar partners; large parties involving alcohol; (Krebs, Lindquist, Warner, Fisher, & Martin, 2007).

A number of different mental disorders have been found to predict sexual assault victimization, including post-traumatic stress disorder, social phobia, and substance use disorder. Although there has been no research investigating the causal pathways involved in these associations, prior PTSD has been interpreted as a risk marker for the wide range of risk factors that led to earlier victimization experiences resulting in PTSD (Orcutt, Erickson, & Wolfe, 2002). It has also been noted that some PTSD symptoms (e.g., numbing, dissociation) might interfere with normal protective mechanisms like accurately attending to and responding to threat cues or effectively engaging in resistance behaviors (Risser, Hetzel-Riggin, Thomsen, & McCanne, 2006). Symptoms of social phobia have also been hypothesized to increase risk of sexual assault victimization by reducing assertiveness in fending off sexual advances (Schry & White, 2013). The associations of maladaptive alcohol and drug use with increased risk of sexual assault victimization have been hypothesized to operate through mechanisms involving increased exposure to high-risk situations, impulsivity, and impaired judgment (H. Littleton & Ullman, 2013; Ullman, 2003).

There is also strong evidence in the literature that prior exposure to criminal victimization, particularly sexual victimization, is associated with increased risk of future sexual assault victimization. Although the preponderance of this evidence has focused on documenting associations between childhood sexual abuse and later sexual assault victimization in adolescence and adulthood (Lalor & McElvaney, 2010), there is also evidence that women who experience sexual assault as adults are at increased risk of subsequent sexual assault victimization in both civilian and military settings (Littleton, Axsom, & Grills-Taquechel, 2009; Sadler, et al., 2003). Explanations for these associations have focused on stable individual differences in risk factors that might lead to repeat victimization as well as to effects of earlier victimization on subsequent interpersonal or sexual behaviors that increase risk for future victimization, such as increased number of consensual sexual partners, decreased sexual assertiveness, and acceptance of rape myths and sex-role stereotypes (Lalor & McElvaney, 2010; Messman-Moore & Long, 2000; Schry & White, 2013).

Military career variables that have been found to predict female military sexual assault victimization include low power in the military hierarchy (e.g., low enlisted rank and few years in service) (Harned, et al., 2002; Sadler, et al., 2003) and deployment to a combat theatre (Leardmann et al., 2013). The latter association has been interpreted as reflecting increased exposure to perpetrators who are less concerned than they might otherwise be with the consequences of committing assaults because of being in high-stress war zone environments. Military contextual-environment variables, finally, that have been hypothesized to increase risk of female sexual assault victimization include a culture that tolerates sexual violence and encourages resolving interpersonal problems between peers rather than bringing them to the attention of superiors (Castro, Kintzle, Schuyler, Lucas, & Warner, 2015). Norms of hypermasculinity and acceptance of violence have also been hypothesized to predispose to high rates of female military sexual assault victimization (Turchik & Wilson, 2010). Although it has proven difficult to test these hypotheses empirically, it has been shown that risk of female sexual assault victimization is elevated in military units where officers engage in sexually harassing behaviors or tolerate the sexually harassing behaviors of others (Sadler, et al., 2003).

We took advantage of a unique data source to select administrative indicators of the above predictors to develop a prediction model of sexual assault victimization among women in the U.S. Army: the Historical Administrative Data System (HADS) of the Army Study to Asses Risk and Resilience in Servicemembers (Army STARRS; (Ursano et al., 2014)). The HADS is a compilation of 38 different Army and Department of Defense administrative data systems (Appendix Table 1) organized into a single consolidated person-month database for each of the 975,057 Regular U.S. Army soldiers who were on active duty at any time between January 1, 2004 and December 31, 2009 (Kessler, Colpe, et al., 2013). In addition to containing criminal justice records of reported sexual assault victimizations that we were able to use as our primary outcome, the HADS contains extensive data on socio-demographics, medical history, criminal justice history, military career experiences, and selected information on unit experiences for all active duty soldiers during the years 2004–2009.

Table 1.

Coefficients (odds-ratios) from the final penalized and un-penalized survival models predicting first administratively-reported non-familial major sexual crime victimization among women in the HADS (2004-2009)a

Penalized Un-penalized
% (se) OR OR (95% CI) VIFb


I. Socio-demographics
    Age - 17-19 8.1 (0.1) 2.1 2.0 (1.9-2.2) 1.6
    Age - 20-22 20.2 (0.2) 1.5 1.5 (1.4-1.7) 1.4
    Marital status - Not currently married 60.6 (0.2) 1.3 1.3 (1.3-1.5) 1.1
    Race/ethnicity - Non-Hispanic White 42.7 (0.2) 1.9 1.9 (1.8-2.0) 1.1
    Education – Less than high school 6.7 (0.1) 1.4 1.4 (1.3-1.5) 1.1
II. Prior crime
    Victim of any crime (past 24 months) 5.6 (0.1) 1.6 1.6 (1.5-1.8) 1.0
III. Clinical factors
    Injury and poisoning - Any outpatient visits (past 12 months) 18.0 (0.2) 1.3 1.3 (1.2-1.4) 1.1
    Any mental diagnosis - 3+ days with outpatient visits (past 12 months) 6.6 (0.1) 1.3 1.3 (1.1-1.4) 1.1
    Any mental health diagnosis - Any outpatient visits (past 12 months) 40.3 (0.2) 1.4 1.4 (1.3-1.5) 1.2
    Any physical-related diagnosis - Any outpatient visits (past 12 months) 67.5 (0.2) 1.3 1.3 (1.2-1.4) 1.1
    PTSD – Any inpatient hospitalizations (past 12 months) 0.1 (0.0) 3.9 3.9 (2.5-6.1) 1.0
IV. Military career
    Rank junior enlisted ( E1-E4) 47.4 (0.2) 5.7 6.6 (5.4-8.0) 2.3
    Rank intermediate enlisted ( E5-E6) 25.9 (0.2) 2.6 3.1 (2.5-3.8) 1.6
    Years-in-service – 1 or less 14.4 (0.2) 2.5 2.6 (2.3-2.8) 1.9
    Years-in-service – 1-2 11.8 (0.1) 1.6 1.6 (1.5-1.8) 1.4
V. Military context
    Number of different duty units over past 12 monthsc 1.7 (0.0) 1.3 1.3 (1.2-1.3) 1.2

Abbreviations: OR, odds ratio; 95% CI, 95% confidence interval; VIF, variance inflation factor; PTSD, posttraumatic stress disorder

a

All predictors shown here are significant at the .05 level (2-sided test) in the unpenalized model. Predictors were selected in the elastic net model based on internal cross-validation rather than significance tests, which is why no significance tests are presented for the penalized model ORs. The mixing model parameter (α) in the elastic net model was set to 0.5 based on the empirical finding that this was the value that maximized concentration of risk in the 5% of the sample with highest predicted risk. The analysis sample included all person-months with the outcome plus a probability sample of all other person-months in the population (total case-control sample of 113,592 person-month). All records in the control sample were weighted by the inverse of the probability of selection. One time control variable was selected by the elastic net model but is are not shown here. Two operational command variables stepped into the model but are not shown here.

b

Variance Inflation Factor (VIF) for the coefficient associated with predictor Xi in the unpenalized model equals 1/(1-R2i), where R2i is the coefficient of determination of a logistic regression equation in which Xi is the dependent variable and all the other predictors in the model are included as predictors of Xi. VIF ≥ 5.0 is typically considered an indicator of meaningful multicollinearity. The results reported here show that multicollinearity was not a problem in the unpenalized model despite the fact that the optimal mixing model parameter in the elastic net model used to select the predictors was α= 0.5

c

This was coded 1-4 (1=1 unit; 2=2 units; 3=3 units; 4=4+ units). The value reported in the percentage columns reflects the average number of different duty units in the total sample based on that truncated coding scheme.

Although administrative records of reported sexual assault victimization are the main outcome in our analysis, anonymous surveys of military personnel suggest that only a minority of military sexual assaults are reported to authorities (Mengeling, Booth, Torner, & Sadler, 2014). These same surveys, which ask respondents not only if they were sexually assaulted but also whether or not they reported the assault to military authorities, also show that reported and unreported sexual assaults differ in important ways. For example, reported assaults are more likely than unreported sexual assaults to be perpetrated by strangers and to involve victim injuries and offender use of weapons (Wolitzky-Taylor et al., 2011). Based on this evidence, we not only developed a model to predict administratively-reported sexual assaults in the total population of female soldiers in the HADS, but we also developed a parallel model using the same master set of HADS administrative variables to predict self-reported sexual assault victimization among the women who participated in the Army STARRS consolidated All-Army Study, a probability survey of all active duty U.S. Army soldiers exclusive of those in Basic Combat Training (Ursano, et al., 2014). As the predictors in the two models were drawn from the same administrative database, it was possible to use capture-recapture methods (Alho, 1990; Alho, Mulry, Wurdeman, & Kim, 1993; Hook, Hsia, & Regal, 2012; Sekar & Deming, 1949) to combine predictions across the two models to arrive at a single consolidated estimate of individual-level risk of sexual assault victimization whether administratively-recorded or self-reported.

METHODS

Samples

The model to predict administratively-recorded victimization was based on all female soldiers in the Regular Army (i.e., excluding the Army Reserve and National Guard) on active duty at any time between January 2004 and December 2009. We focused on first recorded instances of sexual assault victimization as the administratively-recorded outcome. A discrete-time survival analysis framework with person-month the unit of analysis was used to develop the model. Predictors that could change over time were coded as time-varying covariates (Willett & Singer, 1993). The 153,250 female soldiers in the population during the years of study were in service for a total of 5,181,659 person-months over the study period. Person-months with incident administratively-recorded sexual assault victimizations were coded 1 on the outcome variable and all other person-months were coded 0. Person-months were censored either at outcome occurrence, death, termination of active duty service, or end of the study period, whichever came first. We focused on non-familial sexual assaults because only a small proportion of administratively-recorded military sexual assaults involve perpetration by family members (Rock, 2013) and the predictors of non-familial and familial sexual victimization are known to be quite different from each other (Testa, VanZile-Tamsen, & Livingston, 2007). Rather than carry out the analysis in the full population, control person-months were sampled using the logic of case-control analysis (Schlesselman, 1982) and weighted by the inverse of their probability of selection in the final analysis sample.

The model to predict self-reported sexual assault victimization was based on the subsample of female respondents in the Army STARRS consolidated All-Army Study (AAS) who consented to having their survey responses linked to their Army and DoD administrative records. The predictors used to develop the model were the same administrative predictors abstracted from the HADS as those in the analysis to predict administratively-recorded victimization. However, these administrative variables were available to us only for the consolidated AAS sample, not the entire population of all soldiers, as of the time of the survey, which was administered in 2011–2012 to a representative sample of active duty Regular Army soldiers exclusive of those in Basic Combat Training.

While the AAS was administered throughout the world, we focused the model-building reported here on the subsample of 1,272 female AAS respondents consenting to record linkage who were stationed in the continental United States in large bases at the time of the survey. This restriction was due to the fact that these respondents were the only ones administered the computer-assisted version of the AAS, which allowed questions about the timing of recent exposure to traumatic events (one of which was sexual assault) to be asked with enough precision to match up with HADS administrative reports of victimization in the prior 12 months (or less) when not deployed as well as separately with administrative reports of victimization at any time during deployments that ended within the prior 12 months (even if they began more than 12 months ago). Other AAS respondents (i.e., those stationed either on small bases in the continental U.S. or outside of the continental U.S.) completed a paper and pencil version of the survey that collected less precise information on timing of traumatic event exposure.

Given that AAS respondents were asked to self-report sexual assault victimization over the past 12 months of service (or, in the case of those who were deployed at any time in the past 12 months, separately to report victimizations that occurred during the recent deployment and during the part of the past 12 months when they were not deployed), the model to predict self-reported victimization was a person-level logistic model (rather than, as in the model to predict administratively-recorded sexual assault victimization, a person-month model) in which the administrative predictors were time-invariant and were coded as of the month before the beginning of the recall period (i.e., either the month before the beginning of the most recent deployment to predict victimization during deployment, the last month of deployment to predict victimization after returning from deployment, or the 13th month before interview to predict victimization during the past 12 months among respondents who were not deployed at any time in the past 12 months).

AAS sampling began by selecting quarterly replicate samples of units or sub-units in the eight quarters of 2011–2012 (i.e., January-March 2011, April-June 2011, …, October-December 2012) with probabilities proportional to authorized unit strength excluding units of fewer than 30 soldiers (less than 2% of Army personnel). The sample was stratified by Army Command and location. An additional augmented AAS sample surveyed three Combat Brigades shortly before they deployed to Afghanistan and then again after their return from Afghanistan. Only the baseline survey in that panel sample is used here. On the day of the survey, all personnel in the selected unit attended an informed consent presentation explaining study purposes, confidentiality, and voluntary participation before being asked for written informed consent to complete a self-administered questionnaire, to link their administrative records to questionnaire responses, and to participate in future data collections. Identifying information needed to link administrative data to survey responses (e.g., name, SSN) was collected from respondents consenting to linkage and kept in a separate secure file. These recruitment, consent, and data protection procedures were approved by the Human Subjects Committees of the Uniformed Services University of the Health Sciences for the Henry M. Jackson Foundation (the primary grantee), the Institute for Social Research at the University of Michigan (the organization collecting the data), and all other collaborating organizations.

The computerized version of the AAS survey was self-administered using laptop computers in a group setting. The response rate in the target sample was 74.5%. The remaining unit members included 15.0% nonresponse, much of it due to conflicting duty assignments, and 10.5% incompletion after starting the survey, much of it due to logistical complications, such as units either arriving late or having to leave the sessions early, although some respondents needed more than the allotted time to complete the survey. Among AAS survey completers, 65.1% provided data for administrative record linkage. Based on the American Association of Public Opinion Research COOP1 calculation methods (American Association for Public Opinion Research, 2009), the overall survey completion-successful-linkage cooperation rate was 49.4% [.816x.931x.651]. Additional details about the AAS survey design, samples, and informed consent procedures are reported elsewhere (Heeringa et al., 2013; Kessler, Colpe, et al., 2013; Kessler, Heeringa, et al., 2013).

As survey data were available for the 34.9% of survey completers who did not agree to administrative data linkage, a weighting adjustment was used to make the remaining 65.1% of survey completers (the latter group reported a slightly higher rate of sexual assault than the 34.9% who did not agree to administrative data linkage) equivalent to the total sample on the cross-classification of survey variables that most strongly differentiated the two subsamples. And as a restricted set of anonymized administrative data were made available to us for the population of all soldiers during the years of the consolidated AAS, we were also able to post-stratify the weighted subsample of consolidated AAS respondents with linked administrative data to the cross-classification of those population distributions as well as to the distribution of Army Command, location, and the phase of the unit in the Army ARFORGEN (Army Force Generation; http://www.forscom.army.mil/) cycle. We used these doubly-weighted AAS data to build the model to predict self-reported sexual assault victimization. A detailed discussion of AAS weighting and post-stratification is presented elsewhere (Kessler, Heeringa, et al., 2013).

Measures

Administratively-recorded victimization

Data from 5 criminal justice data systems (Sexual Assault Data Management System; Criminal Investigation Division Information Management System (CIMS)/Automated Criminal Investigation/Criminal Intelligence System; CIMS/Automated System Crime Record Center; Army Court Martial Information System; Centralized Operations Police Suite/Military Police Reporting System) were combined to identify the date, type, and judicial outcome of unrestricted non-familial administratively-recorded sexual assault victimizations that occurred over the study period. Qualifying crimes of administratively-recorded sexual assault victimization included rape (i.e., forcible vaginal intercourse), forcible sodomy (i.e., attempted or forcible oral or anal sex), and “other” sexual assault (i.e., attempted rape, fondling, indecent assault), as coded according to the Bureau of Justice Statistics National Corrections Reporting Program classification system (U.S. Department of Justice, 2009). In other words, our administratively-recorded outcome did not include non-contact forms of unwanted sexual attention such as sexual harassment, exhibitionism, or Peeping Tom.

As mentioned above, unrestricted administrative records were used to define the outcome. However, sexual assault victims may also file restricted reports using the Victim Preference Reporting Statement (DD Form 2910). Restricted reports allow victims to receive medical treatment and counseling without triggering an official investigation of the assault, while unrestricted reports trigger an official investigation in addition to allowing the services available in restricted reporting. Victims may also decide to convert restricted reports to unrestricted reports at any time. As only unrestricted reporting data were made available to Army STARRS, it is important to note that a recent RAND survey found that among Army respondents who said they filed official reports, 51% were unrestricted reports, 20% were restricted reports that were eventually converted to unrestricted reports, and 23% were restricted reports (the remaining 6% of Army respondents were not sure what type of report they filed; (Morral, et al., 2015b).

Self-reported victimization

The survey outcome, in comparison, was defined based on a question that asked respondents how many times they had been “sexually assaulted or raped.” All respondents were asked this question as part of a longer checklist of traumatic experiences (TEs). It is important to note that use of this question may have failed to identify victims who would have qualified for our somewhat broader definition of administratively-recorded victimization. Along these lines, although a large literature has focused on how the prevalence of sexual assault victimization varies depending on the phrasing of the self-report questions asked, time constraints precluded the AAS from asking additional behaviorally-specific questions and/or using a two-stage approach for assessing sexual assault in the survey (Cook, Gidycz, Koss, & Murphy, 2011; Fisher, 2009; Kruttschnitt, Kalsbeek, & House, 2014).

Respondents who were not deployed in the 12 months before the AAS survey (n=1,056 of the female AAS respondents considered here) were presented with a single checklist of TEs that might have occurred to them at any time in the past 12 months of service. Respondents who were deployed at any time in the past 12 months (the other n=216 female AAS respondents), in comparison, were presented with two lists: the first asking about TEs that occurred at any time during that deployment; the second asking about TEs that occurred at any time in the past 12 months of service other than during deployment. As the vast majority of the recent deployments of these respondents began more than 12 months before the survey, we treated these deployments as if they represented observations for 216 separate respondents that we combined with observations of times not deployed among all 1,272 respondents into a sample with a total of 1,488 observations. Not all of these 1,488 observations had a full 12 months of time-at-risk, as deployments varied between 2 and 25 months (mean=10.1) and time in-service in the past 12 months other than during deployment varied between 2 and 12 months (mean=11.2 months). This variation was taken into consideration in the analysis by including months-at-risk as a control variable in the model. For purposes of the capture-recapture analysis (see below), we also constructed a yes-no variable for each of the 1,488 observations for whether there was an administrative record of the respondent having been sexually assaulted at any time during the months covered by the retrospective reporting period of that observation.

Administrative predictors

As reviewed earlier, the risk factors for military sexual assault victimization extracted from the 38 Army and Department of Defense administrative data systems (see Supplemental Table 1) in the larger HADS database were grouped into five categories. Given that our use of the HADS was opportunistic (i.e., we used administrative data collected for other purposes to operationalize constructs the measures were not originally designed to assess), we cast a wide net. Specifically, we selected 446 HADS variables to operationalize as many of the predictors as possible found in the literature, including 21 socio-demographic variables, 282 clinical variables (treated mental and physical disorders and medications), 66 variables for prior experience with crime (both victimization and perpetration), 38 variables defining military career experiences, and 39 military contextual variables (e.g., unit and leadership characteristics). We also included controls for year and season to adjust for the possibility of temporal variation. A complete description of predictors is available elsewhere (Supplemental Tables 2–5).

Analysis methods

Building the models

Data analysis was carried out remotely by Harvard Medical School analysts on a secure server at the University of Michigan Army STARRS Data Coordination Center. De-identified analysis was approved by the Human Subjects Committees of the Uniformed Services University of the Health Sciences for the Henry M. Jackson Foundation (the primary Army STARRS grantee), the University of Michigan, and Harvard Medical School. The governing Institutional Review Boards did not require informed consent from individual soldiers because HADS data were de-identified.

We followed a six step analysis plan to predict each of the two outcomes:

  1. We examined bivariate associations of predictors with victimization controlling for historical time either with a discrete-time person-month survival model having a logistic link function (Willett & Singer, 1993) for the administrative outcome or a person-level logistic model for the survey outcome, both using SAS PROC LOGISTIC (SAS Institute Inc., 2010);

  2. We transformed the functional forms of significant bivariate associations involving non-dichotomous predictors to capture substantively plausible nonlinearities;

  3. We estimated multivariate associations among significant bivariate predictors in logistic models;

  4. As multivariate coefficients were unstable, we used machine learning methods to build more stable models. We began this phase of the analysis by using 10-fold cross-validated stepwise regression to select the optimal minimum number of predictors to maximize concentration of risk (the proportion of observed occurrences of the outcome) in the top-ventile of risk (i.e., the 5% of cases with highest cross-validated predicted risk);

  5. We then searched for stable interactions among predictors in the optimal stepwise model using the R-package RandomForests (Breiman, Cutler, Liaw, & Wiener, 2014) and determined incremental improvement in fit by examining top-ventile concentration of risk in final 10-fold cross-validated stepwise regression models with and without the addition of the RandomForests predicted probability of the outcome;

  6. We then estimated elastic net penalized regression models specifying that the model select the optimal number of predictors determined in the stepwise model in order to obtain the most stable set among this optimal minimum number of predictors (Zou & Hastie, 2005) using the R-package glmnet (Friedman, Hastie, & Tibshirani, 2010).

Once the final penalized models were estimated, we generated predicted probabilities of the outcome for each person-month (administrative outcome) or person (survey outcome) in the dataset and computed area under the Receiver Operating Characteristic curve (AUC) to evaluate model prediction accuracy, collapsing individual-level predicted probabilities into 20 groups of equal frequency (ventiles) to calculate the distribution of within-ventile concentration of risk.

Combining individual-level predicted probabilities across models

We then used the logic of capture-recapture analysis (Alho, 1990; Alho, et al., 1993; Sekar & Deming, 1949) to illustrate how the predicted probabilities of victimization based on the two models could be combined into consolidated individual-level estimates for each female soldier in the Army for practical use of the models in risk targeting. Importantly, we used this approach to estimate the consolidated prevalence of sexual assault victimization, which included estimates of actual administratively-recorded victimization in the population, self-reported victimization in the AAS survey, and victimizations that were unreported in both the administrative data and the survey. It is important to note, though, that this consolidated calculation was made only as an illustration of the potential value of the approach in future coordinated applications because Army STARRS was not designed to develop precise capture-recapture estimates and our data were consequently limited in three ways that made it impossible to develop a definitive consolidated prediction model. First, sexual assault victimization was assessed in the AAS using a much less precise definition than the one used to define administratively-recorded victimization. Second, administratively-recorded victimization was based solely on unrestricted reports of victimization. Third, the retrospective time period over which self-reported victimization was assessed in the AAS (November 2009 through June 2012) was different from the time period for which we were granted access by the Army to the HADS data (January 2004 through December 2009). As noted above, while we had administrative data for AAS respondents who provided signed informed consent for us to have these data, we were not granted access to individual-level administrative data for the entire population of soldiers on active duty beyond 2009. Population-level data of that sort would have been required for the same time period as when the AAS was carried out to adjust the AAS sample data for discrepancies with the population and to construct a practically useful consolidated risk model.

Once appropriately linked administrative and survey data were available, we used a four-step process to combine individual-level predicted probabilities of sexual assault victimization based on separate models predicting administratively-recorded and self-reported victimization for each female soldier in the HADS database.

  1. We began by building conditional models for self-reported victimization in the presence of (p1), and separately in the absence of (p2), administratively-recorded victimization. In future applications, these models should be based on a much larger sample than the AAS to guarantee precise estimates and to ask about sexual assault victimization using the same definition as in the administrative records. In order to estimate these illustrative models, we needed to include in the models information about administratively-recorded victimization over the time period covered in the survey to determine if the predictors (i.e., administrative variables available at the time point prior to the reporting period in the survey) differed significantly depending on the presence versus absence of administratively-recorded victimization over the same reporting period as in the survey;

  2. We then built a model for administratively-recorded victimization (p3) in the total population. Note that future applications should do this for the same reporting period as in the survey used administrative predictors that were available as the time point prior to the beginning of the recall period. As noted above, this was not possible in our illustrative application due to the fact that the HADS data were available to us only for 2004–2009 while the AAS data were collected in 2011–2012;

  3. We then generated individual-level estimates of p1, p2, and p3 for each woman in the AAS database based on the coefficients in the above models and manipulated these estimates to generate predicted probabilities of having both administratively-recorded and self-reported victimization (a = p1p3), administratively-recorded but not self-reported victimization (b = [1-p1] p3), and self-reported but not administratively-recorded victimization (c = p2[1-p3]);

  4. We then estimated individual-level probabilities of being a true sexual assault victim by combining estimates of a, b, and c for each individual with an estimate of the probability of being a true sexual assault victim but both failing to report it in an unrestricted fashion to the authorities and failing to report it in the survey (d). We estimated d based on the conservative assumption that the probability of reporting the victimization in unrestricted fashion to authorities and the probability of reporting it in the survey are independent of each other, in which case the individual-level value of d is equal to the individual-level values of bc/a (Alho, 1990; Alho, et al., 1993; Sekar & Deming, 1949).

It is noteworthy that the assumption of independence in developing a consolidated estimate of individual-level probability of victimization by summing a, b, c, and d at the individual level is conservative given that (i) we considered it implausible that there would be a negative association (given observed covariates and experience of victimization) between reporting to authorities and reporting in the survey; and (ii) d increases as the positive association between the two forms of reporting increases (Alho, 1990; Hook, et al., 2012; Sekar & Deming, 1949). The estimate is also conservative in that it excludes soldiers who had a 0 probability of either reporting to authorities or reporting in the survey. A standard error of the consolidated prevalence estimate (the mean of the sum of the individual-level estimates of a, b, c, and d) was obtained by simulation using the method of jackknife repeated replications (Wolter, 1985) based on a SAS (SAS Institute Inc., 2010) macro.

It is noteworthy that the assumption in the capture-recapture model that there might be some soldiers who reported to authorities but not in the survey contradicts the assumption typically made implicitly in anonymous surveys that there are no such individuals, in which case survey reports are taken as accurate reflections of true prevalence (Morral et al., 2015a) and information about bias in administrative reports is assumed to be obtained without bias by asking survey-reported victims about whether or not they also reported the victimization to authorities (Morral, et al., 2015b). But this assumption could be incorrect. For example, some soldiers might believe that administrative reports would be confidential but that survey reports might not be confidential. Indeed, we document below that a substantial number of female sexual assault victims in the U.S. Army have exactly this type of reporting pattern, making it important to think through the logic of using linked administrative-survey data collection and capture-recapture data analysis in future studies of the prevalence and correlates of military sexual assault victimization.

RESULTS

Prevalence of sexual assault victimization

Administratively-recorded sexual assault victimization (i.e., rape, sodomy, attempted assault, fondling/touching) was reported without restriction by 4,252 women in the Regular Army over the years 2004–2009. Prevalence over that time period was 10.0/1,000 person-years. 85.2% of all administratively-recorded victims were women (Supplemental Table 6).

Prevalence of first administratively-recorded sexual assault victimization among AAS women, in comparison, was 15.9/1,000 person-years. The fact that this rate was higher than in the HADS presumably reflects the facts that the AAS was based on a sample rather than the population, that the AAS subsample considered here was limited to soldiers who had completed basic training and were stationed in the continental United States at the time of survey, and that the reporting period in the AAS was later than in the HADS.

Prevalence of self-reported sexual assault (i.e., based on reports of being “sexually assaulted or raped”) among AAS women over the reporting period in the survey was 18.6/1,000 person-years, While administratively-recorded and self-reported victimization were strongly related to each other among AAS women (OR=42.4; 95% CI =34.5–52.1), the association was far from perfect, with only 29.3% of the self-reported victimizations being recorded administratively and only 34.2% of administratively-recorded victimizations being reported in the survey. Prevalence of either was 29.0/1,000 person-years.

The finding that close to two-thirds of administratively-recorded cases were not reported in the AAS survey is especially striking in light of the fact that, as noted in the introduction, previous research has for the most part assumed that anonymous survey reports are comprehensive; that is, that there are no victims who report to authorities but do not report in surveys. One possible explanation for this survey non-reporting is that these women experienced fondling/touching rather than rape, sodomy, or attempted assault and consequently did not consider themselves to qualify for what the survey described as being “sexually assaulted or raped.” However, an investigation of more detailed HADS reports shows that this explanation is inadequate, as a substantial proportion of the women with administratively-recorded sexual assault victimization who failed to self-report in the survey that they had been sexually assaulted or raped were classified in the HADS as having been victims of rape.

It is also worth recalling in this regard that the subset of AAS respondents considered here represent those who agreed for their surveys to be de-identified (i.e., for their identities to be known for purposes of record linkage and follow-up) rather than anonymous (i.e., for their identities not to be known). It is conceivable that survey reporting was more complete in the segment of the AAS sample that required their survey responses to be anonymous. However, this possibility is indirectly inconsistent with the fact that comparison of these reports (results available on request) showed that rates of self-reported victimization was lower among anonymous than de-identified AAS respondents. It is possible, though, that this result occurred because survey respondents who agreed to administrative linkage gave self-reports more consistent with their prior administrative reports than did survey respondents who did not agree to administrative linkage due to the former respondents knowing that this consistency would be checked.

Building the separate models for administratively-recorded and self-reported victimization

Roughly two-thirds (69.9%) of the 446 administrative predictors had significant (.05 level, two-sided tests) univariate associations with administratively-recorded victimization in the HADS and 33.3% with self-reported victimization in the AAS. (Supplemental Tables 7–19) In the unrestricted stepwise models, 81 of these predictors entered at the .05 level in the HADS and 56 in the AAS, 19 (HADS) and 8 (AAS) of which remained in the cross-validated models. AUC and top-ventile concentration of risk did not improve when summary variables for RandomForests predictors were added to the predictor set in the HADS. (Supplemental Table 20) The sample size was too small for RandomForests to be estimated in the AAS.

Top-ventile concentration of risk was very similar in the optimal penalized and un-penalized models (33.6% in both models in the HADS ; 60.8–63.7% in the AAS). (Figure 1) AUC of the penalized and un-penalized models was also very similar (.83 in both models in the HADS; .88 in both models in the AAS). It is important to note, though, that we included a dichotomy indicating whether or not the respondent had administratively-recorded victimization as one of the predictors of self-reported victimization in the AAS model. We would expect AUC of the model to be inflated due to that predictor being included in the model. It is consequently noteworthy that the AUC of the AAS model was .84 when calculation of AUC was limited to the 1,458 cases out of the 1,488 in the total sample in which there was no administratively-recorded victimization over the recall period. Top-ventile concentration of risk in that subsample was 71.1% in the unpenalized model and 59.9% in the penalized model. Prevalence in the top-ventile of predicted risk (i.e., positive predictive value) was 67.3–67.4/1,000 person-years in the penalized and unpenalized models in the HADS (6.7 times the total-sample prevalence) and 226.1–237.0/1,000 person-years in the AAS (12.2–12.7 times the total-sample prevalence).

Figure 1. Concentration of Risk of Major Sexual Crime Victimization by Ventile of Predicted Risk Based on the Final Discrete-Time Survival Models in the HADS(19 predictors) and AAS (8 predictors)a.

Figure 1

aVentiles are 20 groups of person-months (HADS) or soldiers (AAS) of equal frequency dividing the total sample into equally sized groups defined by level of predicted victimization risk.

Model coefficients

Five socio-demographic characteristics were significant predictors of administratively-reported victimization in the HADS: young age (age 17–19 and 20–22), unmarried, Non-Hispanic White, and less than high school education. (Table 1) No socio-demographics were significant predictors of self-reported victimization in the AAS. One indicator of prior experience with crime in the HADS and two in the AAS were also significantly associated with elevated risk of victimization: previously being the victim of any crime in the past 24 months (HADS), previously being the perpetrator of a crime involving possession of an illegal substance (AAS), and the measure of administratively-recorded sexual crime victimization over the same recall period as in the AAS. (Table 2)

Table 2.

Coefficients (odds-ratios) from the final penalized and un-penalized logistic regression models predicting self-reported sexual assault victimization among women in the selected AAS subsample (2009-2012)a

Penalized Unpenalized
% (se) OR OR (95% CI) VIFb


I. Prior crime
  Administratively-recorded sexual crime victimization (not time-lagged) 1.5 (0.3) 32.0 130.9* (101.2-169.2) 1.0
  Perpetrator of illegal drug possession crime (past 12 months) 0.1 (0.1) 140.4 3226.1* (1102.0-9443.5) 1.0
II. Clinical factors
  Number of outpatient visits (past 12 months) for…c
    Traumatic stress 0.9 (0.3) 5.6 6.8 (5.2-8.9) 1.1
    Injury-poisoning 16.1 (1.3) 1.3 2.7 (2.3-3.2) 1.1
  Number of prescriptions (past 12 months) of…c
    Benzodiazepines 5.5 (1.0) 1.2 1.4 (1.2-1.6) 1.1
    Other sedatives-hypnotics 5.6 (0.8) 1.3 1.6 (1.4-1.8) 1.1
    Medications to treat migraine 1.4 (0.4) 1.8 2.2 (1.8-2.6) 1.0
  Any prescription for short-acting narcotic analgesic (past 12 months) 20.0 (1.5) 1.1 3.5 (2.8-4.4) 1.1

Abbreviations: OR, odds ratio; 95% CI, 95% confidence interval; VIF, variance inflation factor; PTSD,

a

All predictors shown here were significant at the .05 level (2-sided test) in univariate models but only two are significant in the unpenalized logistic model. Variable selection in the elastic net model was based on internal cross-validation rather than significance tests, which accounts for why no significance tests are presented for the penalized model ORs and why predictors were included in the final model that were judged to be insignificant based on logistic regression model confidence intervals. The mixing model parameter (α) in the elastic net model was set to 0.9 based on the empirical finding that this was the value that maximized concentration of risk in the 5% of the sample with highest predicted risk. The data were doubly-weighted to adjust for differences in the survey characteristics of AAS respondents who did versus did not agree to have their Army and DoD administrative records linked to their survey responses (Weight 1) and to post-stratify the weighted AAS respondents who agreed to record linkage to match the cross-classification of selected de-identified population-level administrative variables that were provided to us by the Army for this purpose. Confidence intervals in the unpenalized model were calculated using the design-based method of jackknife repeated replications to take these weights into consideration along with the effects of the clustering of observations.

b

Variance Inflation Factor (VIF) for the coefficient associated with predictor Xi in the above equation equals 1/(1-R2i), where R2i is the coefficient of determination of a regression equation in which Xi is the dependent variable and all the other predictors in the model are included as predictors of Xi. VIF ≥ 5.0 is typically considered an indicator of meaningful multicollinearity.32

c

These were coded 0-4 (0=0 visits; 1=1-2 visits; 2=3-5 visits; 3=6-10, 4=11+ visits). The value reported in the percentage column reflects the proportion of the sample with one or more visits or prescriptions in the past 12 months.

As one would expect from the very high bivariate OR reported above between administratively-recorded and self-reported victimization, the OR of the former “predicting” the latter in the AAS was very high (130.9–32.0 in the unpenalized and penalized models, respectively). No significant interactions were found between this predictor and any of the other administrative predictors of self-reported victimization despite the fact that all such interactions were included in the variable set used to build the optimal elastic net model. These interactions were included in the initial variable set to allow for the possibility that the predictors of self-reported victimization differ depending on the presence versus absence of administratively-recorded victimization. It is noteworthy that the small size of the AAS female sample resulted in low power to detect meaningful interactions of this type, though, which means that such interactions should not be assumed to be absent based on the AAS results.

Five clinical health factors in the HADS and 6 in the AAS were also significant predictors. Most of these focused on mental-health related care. In the HADS these were any outpatient visit for one or more visits for any mental disorder, three or more visits for any mental disorder, any inpatient hospitalizations for PTSD, and visit for an injury/poisoning, and any visit for a physical-related diagnosis. The AAS variables were all for a 12-month retrospective time period and included two counts of outpatient visits (for injury-poisoning and traumatic stress), three measures of number of prescriptions (for benzodiazepines, other sedatives-hypnotics, and medications used primarily for migraine therapy), and a dichotomy for any prescription for a short-acting narcotic analgesic. No other predictors were significant in the AAS, but a number of military career variables were significant in the HADS. Two involved operational commands and another six, all associated with elevated risk of victimization, pertained to early career stage (junior-intermediate enlisted rank; 0–2 years in service). Finally, one military contextual predictor, number of different duty units over the past 12 months, was associated with significantly elevated risk

Sensitivity of results

As the AAS sample was so small, no attempt was made to carry out internal validation of model results. This kind of validation was possible, though, for the HADS due its much larger size. As the HADS model was designed to predict victimization this month, prediction accuracy over longer time periods was evaluated by calculating average top-ventile concentration of risk for all possible 1-, 6- and 12-month follow-up periods from January 2004 through January 2009 and in 20-month and 30-month intervals. (We excluded February-December 2009 because we did not have 12-months of follow-up data after these months.) While average top-ventile estimates of concentration of risk were highest over 1-month periods (26.4–32.8%), they remained elevated over 6-month (23.7–27.8%) and 12-month (21.4–24.2%) periods and were consistent across the five 20-month and 30-month time-intervals. (Supplemental Table 21)

Although short time-in-service and early career stage were strong predictors of elevated victimization risk in the HADS, the failure of RandomForests to improve fit means that no interactions were found between time-in-service and other predictors. This might have been because we lacked adequate statistical power to detect these interactions, though, due to the high proportion of victimizations occurring in the first years of service. We evaluated this possibility by examining top-ventile concentration of risk in the HADS within sub-groups defined by time-in-service. (Table 3) Unsurprisingly, the proportion of soldiers in the top-ventile of predicted risk varied inversely with time-in-service (χ 25=2,884.5, p<.001). However, when we recalibrated cut-points to focus on the top-ventile of predicted risk within each time-in-service subsample, the association between time-in-service and top-ventile concentration of risk became much smaller among (χ 27=15.0, p=.040).

Table 3.

Prevalence/1,000 person-years and concentration of risk (CR) in the top-ventile (both overall and within time-in-service subsamples) of observations with highest predicted risk of first administratively-recorded non-familial major sexual assault victimization by time-in-service among female Regular Army soldiers in the Army STARRS 2004-2009 Historical Administrative Data System (HADS)a

Overall ventiles with highest predicted risk Within time-in-service ventiles with
highest predicted riskb


Prevalence/1,000
person-years
Within-row
CR
Proportion of
person-
months in top-
ventile
Prevalence/1,000
person-years
Within-row CR


Years-in- service Est (se) Est (se) % (se) Est (se) Est (se)





    0-1 73.5 2.8 57.1 1.1 23.8 0.5 126.9 8.9 20.5 0.9
    1-2 55.1 3.6 31.9 1.4 12.0 0.4 67.6 6.4 16.4 1.1
    2-3 50.5 12.7 4.5 0.9 1.0 0.1 37.8 4.7 17.6 1.7
    3-4 45.0 19.4 2.6 1.0 0.4 0.1 21.5 3.6 15.7 2.2
    4-5 19.8 15.3 1.1 0.8 0.3 0.1 18.6 3.8 15.5 2.7
    5-10 27.1 31.1 0.3 0.3 0.0 0.0 13.6 1.8 20.1 2.3
    10-20 --c --c --c --c --c --c 5.5 1.1 26.6 4.6
    20+ --c --c --c --c --c --c 1.2 1.2 12.5 11.7
    Total 67.3 2.2 33.6 0.7 5.0 0.1 37.6 1.5 18.7 0.6
      χ25-7 21.5* 1631.4* 2884.5* 435.3* 15.0*

*

Significant at the .05 level, two-sided test.

a

Estimates are based on the coefficients from the total sample penalized models

b

Concentration of risk ventiles were re-classified independently within each time-in-service group so the top-ventile of predicted risk includes 5% of the person-months within each time-in-service category

c

There were zero person-months in the overall top-ventile of predicted risk for women with 10+ years-in-service. A five degree of freedom χ2 test was used to examine variation in these rates by time-in-service.

While the association of time-in-service with top-ventile CR decreased when ventiles were defined within time-in-service subsamples, the association of time-in-service with top-ventile positive predictive value increased. When the top-ventile of predicted risk was defined without restriction, top-ventile positive predictive value ranged between 73.5/1,000 person-years in the first year of service (when 23.8% of female soldiers were included in the top-ventile and top-ventile concentration of risk was 57.1%) and 0.0/1,000 person-years after 10 years of service (when 0.0% of female soldiers were included in the top-ventile). When the top-ventile was defined within time-in-service sub-groups, in comparison, top-ventile positive predictive value ranged between 126.9/1,000 person-years in the first year of service (when top-ventile concentration of risk was 20.5%) and 1.2/1,000 person-years after 20+ years of service (when top-ventile concentration of risk was 12.5%).

Combining estimates across models

The four-step process described above in the section on analysis methods was used to generate consolidated individual-level estimates of victimization for the 1,488 observations in the AAS. We imputed individual-level predicted probabilities of administratively-recorded victimization in the AAS for that purpose by applying the coefficients from the HADS model to make initial estimates and then calibrating those estimates to make the sum of predicted probabilities equal the observed weighted proportion of respondents with administratively-recorded victimization over the reporting period of the survey. Given that the model for self-reported victimization included administratively-recorded victimization as one of the predictors, we were able to estimate individual-level predicted probabilities of self-reported victimization separately in the presence and absence of administratively-recorded victimization directly from the AAS model coefficients by applying the individual-level predicted probability of administratively-recorded victimization to the model.

We reported above that prevalence of sexual assault victimization among AAS women was 15.9/1,000 person-years administratively-recorded and 18.6/1,000 person-years self-reported. The combined prevalence rate (i.e., administratively-recorded and/or self-reported) was 29.0/1,000 person-years. Based on the above four-step process, we estimated conservatively that the prevalence of sexual assault unreported either to authorities or in the survey was 25.2/1,000 person-years, for an estimated total prevalence (standard error) of 54.2 (10.8)/1,000 person-years. To the extent that the independence assumption needed to generate this estimate was accurate, it means that no more than 29.3% of all sexual assaults experienced by these women were reported to authorities, no more than 34.2% were self-reported in the survey, and no more than 46.5% were reported either to authorities or in the survey.

DISCUSSION

Although considerable research has studied victim-focused predictors of sexual assault (Coxell, et al., 1999; Franklin, et al., 2011; Harned, et al., 2002; Jewkes, et al., 2002; Kimerling, et al., 2007; Merrill, et al., 1999; Sadler, et al., 2003; Tewksbury & Mustaine, 2001; Turchik & Wilson, 2010), the current report presents the results of the first attempt to develop an actuarial risk model that could be used to identify women for interventions to prevent sexual assault victimization. Our findings that indicators of lower sociocultural (e.g., young age, low education, unmarried) and organizational (e.g., lower rank, more recently entered service, short time in the duty unit) power are significant predictors of administratively-recorded military sexual assault victimization are consistent with previous studies (Harned, et al., 2002), but we failed to find similar predictors of self-reported victimization. This inconsistency raises the possibility that soldiers with low sociocultural and organizational power are more likely than other soldiers to make administrative reports when they are sexually assaulted. This possibility is consistent with independent survey data indicating that, among military personnel who report in a survey that they experienced sexual assault, junior enlisted personnel are most likely and senior officers least likely to say that they made an administrative report of this event (Morral, et al., 2015a). This might mean that seniority brings with it increased concerns about potential career-damaging retaliation of such reports, a commonly-described barrier to sexual assault reporting (Morral, et al., 2015b).

The finding in both of our models that prior involvement in crime is associated with elevated risk of sexual assault victimization is broadly consistent with previous research (Jewkes, et al., 2002), although the indicator in the model to predict administratively-recorded victimization was a measure of prior crime victimization while the indicator in the model to predict self-reported victimization was a measure of prior crime perpetration. The finding in both models that treated mental disorders were associated with elevated risk of subsequent victimization is also consistent with a good deal of previous research (H. Littleton & Ullman, 2013; Orcutt, et al., 2002; Risser, et al., 2006; Schry & White, 2013; Ullman, 2003). The finding that treatment for injuries-poisonings was a risk factor, in comparison, has never before been examined, although it is noteworthy that we found this variable to be a significant predictor in the models for both administratively-recorded and survey self-reported victimization. It is noteworthy that this injury-poisoning measure excluded most common military injuries (e.g., sprains, fractures) and focused on poisonings along with less common injuries (e.g., nerve injuries), raising the possibility that it reflects an influence of impulsive predispositions that lead both to injuries and to increased risk of sexual assault victimization.

The fit of our models was quite good (AUC = .83 for administratively-recorded victimization; AUC = .88 for self-reported victimization) compared to models developed to predict other types of violence (Fazel, Singh, Doll, & Grann, 2012; Whittington et al., 2013; Yang, Wong, & Coid, 2010). We also found that approximately one-third of administratively-reported victimizations and over 60% of self-reported victimizations occurred among the 5% of female soldiers with our highest predicted risk scores, although caution is needed in interpreting the very high value of the latter percentage because the AAS sample size was so small and this might have led to over-fitting.

Despite this caution, the strong performance of both the HADS and AAS models argues that it might be feasible to carry out more definitive research designed explicitly to develop a consolidated set of models to predict overall sexual assault victimizations for female soldiers and use these models to determine which of these women should receive a high-risk preventive intervention. This survey would have to be based on a much larger and broadly representative sample of female soldiers than the AAS and would have to ask much more carefully-worded questions about sexual assault victimization than the AAS to map closely to the official definition used in administrative records (Kruttschnitt, et al., 2014). Administrative predictor variables would have to be available for the beginning of the same reporting period of this survey and information on administratively-recorded victimization (in the ideal case, both unrestricted and restricted reports) would have to be available over this same reporting period. With these data in hand, machine learning models like those developed here could be estimated and the coefficients from those models could be combined using the capture-recapture methods illustrated in our analysis to generate consolidated individual-level predicted probabilities of sexual assault victimization in subsequent cohorts. It would be important to update this model on a periodic basis (i.e., to carry out new periodic surveys and re-estimate models) both to capitalize on the opportunities presented by new Army and DoD data systems (e.g., the new TAPAS personality assessment questionnaire (Niebuhr et al., 2013), which was not available during the years of the HADS but is now administered to all Army) and to adjust for the possibility that some predictors of sexual assault victimization might change over time.

Before embarking on such an undertaking, though, it is important to ask whether female sexual assault victimization is sufficiently common even in the highest-risk strata in our models to make selective intervention cost-effective. The answer depends in large part on the strength of the intervention. There is evidence that fairly intensive victim-focused interventions can be effective in preventing sexual assault (Brecklin, 2008; Senn, et al., 2015; Vladutiu, Martin, & Macy, 2011). But these programs would be prohibitively expensive to implement with all female soldiers, many of whom have extremely low risks of sexual assault victimization. However, our results suggest that interventions of this sort, if focused on women in the top-ventile of predicted risk, would reach 33–63% of all those who would otherwise experience administratively-reported and/or self-reported victimization along with some unknown proportion of those whose victimization currently goes unreported. Our models also suggest that substantial proportions of these high-risk women would go on to be victimized in the absence of a preventive intervention. These predicted victimization rates are sufficiently high (6.7% in the model for administratively-recorded victimization; 23.6% in the model for self-reported victimization) and the documented adverse psychological effects of victimization sufficiently strong (Turchik & Wilson, 2010) to justify formal cost-benefit analyses of such an intervention (Foster & Jones, 2006) taking competing risks (e.g., stigma, negative career consequences from identification as high-risk) into consideration. If the results are positive, it would then make sense to invest in the research needed to develop a definitive consolidated set of models to predict overall sexual assault victimization and then use those models to select female soldiers for a pilot implementation and then, if successful, ongoing system-wide implementation of the intervention with high-risk women.

In considering these future directions, it should also be noted that even though we focused our model-building efforts on women, the AAS found that nearly half of all self-reported sexual assault victimizations occurred to men. This finding is broadly consistent with other surveys of military sexual assault (Morral, et al., 2015b). We could have estimated a model for male administratively-recorded victimization, but we know from the AAS and other previous surveys (Morral, et al., 2015b) that only a very small proportion (probably no more than 5%) of the sexual assault victimizations reported by male soldiers in surveys are reported to authorities. This means that self-reports of victimization would play a much more important role in developing consolidated models to predict overall sexual assault victimization for male than female soldiers. This was not possible in the AAS because of the low proportion of male respondents who reported sexual assault victimization (less than 0.2%) even though these reports constitute a meaningful proportion of all self-reported sexual assaults due to the fact that 85% of soldiers are male. Given this low prevalence, it is unclear whether a consolidated prediction model for male sexual assault victimization could be developed that would have a sufficiently high concentration of risk to make it feasible to implement targeted high-risk preventive interventions cost-effectively. A much larger survey than the AAS would be needed to investigate this question with sufficient precision to draw a definitive conclusion.

Another important future direction should be to develop models to predict sexual assault perpetration. Although the AAS did not ask male soldiers about sexual assault perpetration, surveys of this sort have been carried out among college students (see reviews in (Koss, 1993; Schewe, 2002) and to a lesser extent more general community samples of adult males (Abbey, Parkhill, BeShears, Clinton-Sherrod, & Zawacki, 2006; Merrill, Thomsen, Gold, & Milner, 2001; Widman, Olson, & Bolen, 2013)), although we are aware of no such survey among male military personnel. It would be useful to experiment with the feasibility of carrying out such a survey among male military personnel and, along the lines of the surveys on victimization carried out among female military personnel, to ask these men not only about self-reported perpetration but also about whether their victim(s) reported them to the military authorities. In addition, it would be useful to explore the possibility of developing a model of the administrative predictors of administratively-recorded sexual assault perpetration. We are in the process of attempting to develop the latter type of model in our continued analyses of the HADS data. If successful, such a model could be useful in targeting high-risk preventive interventions, although evidence for the effectiveness of such interventions to reduce sexual assault perpetration is weaker than for interventions to reduce sexual assault victimization (Ellsberg et al., 2015).

Supplementary Material

Supplemental Tables

Acknowledgments

Funding/Support: The data analyzed in this report were collected as part of the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS). Army STARRS was sponsored by the Department of the Army and funded under cooperative agreement number U01MH087981 with the U.S. Department of Health and Human Services, National Institutes of Health, National Institute of Mental Health (NIH/NIMH). This research was conducted by Harvard Medical School and is funded by the Department of Defense, Office of the Assistant Secretary for Defense for Health Affairs, Defense Health Program (OASD/HA), awarded and administered by the U.S. Army Medical Research & Materiel Command (USAMRMC), at Fort Detrick, MD, under contract numbers W81XWH-12-2-0113 and W81XWH-13-1-0037. Dr Street’s effort was supported with resources and the use of facilities at VA Boston Healthcare System, Boston, MA.

Role of the Funder/Sponsor: The sponsors had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. Although a draft of this manuscript was submitted to the Department of Defense, the Department of the Army, and NIMH for review and comment prior to submission, this was with the understanding that comments would be no more than advisory.

Footnotes

Author Contributions:

Street, Zaslavsky, and Kessler developed the study concept. All authors contributed to the study design. Hill and Petukhova performed the data analyses under the supervision of Kessler, Sampson, Rosellini, and Zaslavsky. Street, Kessler, Rosellini, and Monahan drafted the paper, while the other coauthors provided several rounds of critical comments and helped prepare revisions. All authors approved the final version of the paper for submission.

Conflict of Interest Disclosures: Dr Stein has been a consultant for Care Management Technologies, received payment for his editorial work from UpToDate, Depression and Anxiety, and Biological Psychiatry, and has been a paid consultant for Pfizer, Tonix, and Janssen. Dr Monahan is co-owner of Classification of Violence Risk (COVR), Inc. Dr Kessler has been a consultant for Johnson & Johnson Wellness and Prevention and Sonofi-Aventis Groupe and has served on an advisory board for Lake Nona Institute. Dr Kessler is a co-owner of DataStat, Inc. The other authors report no financial relationships with commercial interests.

Disclaimer: The views, opinions and/or findings contained in this research are those of the authors and do not necessarily reflect the views of the funding agencies and should not be construed as an official DoD/Army position, policy or decision unless so designated by other documentation. No official endorsement should be made.

REFERENCES

  1. Abbey A, Parkhill MR, BeShears R, Clinton-Sherrod AM, Zawacki T. Cross-Sectional Predictors of Sexual Assault Perpetration in a Community Sample of Single African American and Caucasian Men. Aggressive Behavior. 2006;32:54–67. doi: 10.1002/ab.20107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alho JM. Logistic regression in capture-recapture models. Biometrics. 1990;46:623–635. [PubMed] [Google Scholar]
  3. Alho JM, Mulry MH, Wurdeman K, Kim J. Estimating heterogeneity in the probabilities of enumeration for dual-system estimation. Journal of the American Statistical Association. 1993;88(1):130–136. [PubMed] [Google Scholar]
  4. American Association for Public Opinion Research. Standard Definitions: Final Dispositions of Case Codes and Outcome Rates for Surveys. 2009 from http://www.aapor.org/AAPORKentico/Communications/AAPOR-Journals/Standard-Definitions.aspx.
  5. Brecklin LR. Evaluation outcomes of self-defense training for women: A review. Aggression and Violent Behavior. 2008;13:60–76. [Google Scholar]
  6. Breiman L, Cutler A, Liaw A, Wiener M. Breiman and Culter’s random forests for classification and regression (Version 4.6-10) 2014 [Google Scholar]
  7. Campbell JC, Webster DW, Glass N. The danger assessment: validation of a lethality risk assessment instrument for intimate partner femicide. Journal of Interpersonal Violence. 2009;24:653–674. doi: 10.1177/0886260508317180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Castro CA, Kintzle S, Schuyler AC, Lucas CL, Warner CH. Sexual assault in the military. Current Psychiatry Reports. 2015;17:54. doi: 10.1007/s11920-015-0596-7. [DOI] [PubMed] [Google Scholar]
  9. Chan KL. Predicting the risk of intimate partner violence: the chinese risk assessment tool for victims. Journal of Family Violence. 2012;27:157–164. doi: 10.1007/s10896-012-9418-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cook SL, Gidycz CA, Koss MP, Murphy M. Emerging issues in the measurement of rape victimization. [Review] Violence Against Women. 2011;17:201–218. doi: 10.1177/1077801210397741. [DOI] [PubMed] [Google Scholar]
  11. Coxell A, King M, Mezey G, Gordon D. Lifetime prevalence, characteristics, and associated problems of non-consensual sex in men: cross sectional survey. BMJ. 1999;318:846–850. doi: 10.1136/bmj.318.7187.846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. DeGue S, Simon TR, Basile KC, Yee SL, Lang K, Spivak H. Moving forward by looking back: reflecting on a decade of CDC’s work in sexual violence prevention, 2000-2010. [Review] Journal of Women’s Health. 2012;21:1211–1218. doi: 10.1089/jwh.2012.3973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Ellsberg M, Arango DJ, Morton M, Gennari F, Kiplesund S, Contreras M, Watts C. Prevention of violence against women and girls: what does the evidence say? [Research Support, Non-U.S. Gov’t Review] Lancet. 2015;385:1555–1566. doi: 10.1016/S0140-6736(14)61703-7. [DOI] [PubMed] [Google Scholar]
  14. Fazel S, Singh JP, Doll H, Grann M. Use of risk assessment instruments to predict violence and antisocial behaviour in 73 samples involving 24 827 people: systematic review and meta-analysis. BMJ. 2012;345:e4692. doi: 10.1136/bmj.e4692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fengler J, Winkel FW. The development of an (ODARA-based) actuarial screening version of the Danger Assessment Inventory (DAI): a traumatic diversity-approach and some follow up evidence. In: Schaefer P, Weitekamp E, editors. Establishing Victimology. Niederrhein, Germany: Niederrhein University Press; 2014. pp. 151–169. [Google Scholar]
  16. Fisher BS. The effects of survey question wording on rape estimates: evidence from a quasi-experimental design. [Research Support, U.S. Gov’t, Non-P.H.S.] Violence Against Women. 2009;15:133–147. doi: 10.1177/1077801208329391. [DOI] [PubMed] [Google Scholar]
  17. Foster EM, Jones D. Can a costly intervention be cost-effective?: an analysis of violence prevention. Archives of General Psychiatry. 2006;63:1284–1291. doi: 10.1001/archpsyc.63.11.1284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Franklin CA, Franklin TW, Nobles MR, Kercher G. Risk Factors Associated with Women’s Victimization. Huntsville, TX: Crime Victims’ Institute, Criminal Justice Center, Sam Houston State University; 2011. [Google Scholar]
  19. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software. 2010;33:1–22. [PMC free article] [PubMed] [Google Scholar]
  20. Golubnitschaja O, Costigliola V. General report & recommendations in predictive, preventive and personalised medicine 2012: white paper of the European Association for Predictive, Preventive and Personalised Medicine. EPMA Journal. 2012;3:14. doi: 10.1186/1878-5085-3-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Harned MS, Ormerod AJ, Palmieri PA, Collinsworth LL, Reed M. Sexual assault and other types of sexual harassment by workplace personnel: a comparison of antecedents and consequences. Journal of Occupational and Health Psychology. 2002;7:174–188. doi: 10.1037//1076-8998.7.2.174. [DOI] [PubMed] [Google Scholar]
  22. Harris A, Phenix A, Hanson RK, Thornton D. STATIC-99 Coding Rules Revised. Ottawa, Canada: 2003. [Google Scholar]
  23. Heeringa SG, Gebler N, Colpe LJ, Fullerton CS, Hwang I, Kessler RC, Ursano RJ. Field procedures in the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS) International Journal of Methods in Psychiatric Research. 2013;22:276–287. doi: 10.1002/mpr.1400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hook EB, Hsia MS, Regal RR. Accuracy of Capture-Recapture Estimates of Prevalence. Epidemiologic Methods. 2012;1:1–11. [Google Scholar]
  25. Jewkes R, Sen P, Garcia-Moreno C. Sexual violence. In: Krug EG, editor. World Report on Violence and Health. Vol. 1. Geneva, Switzerland: World Health Organization; 2002. pp. 213–239. [Google Scholar]
  26. Kessler RC, Colpe LJ, Fullerton CS, Gebler N, Naifeh JA, Nock MK, Heeringa SG. Design of the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS) International Journal of Methods in Pyschiatric Research. 2013;22:267–275. doi: 10.1002/mpr.1401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kessler RC, Heeringa SG, Colpe LJ, Fullerton CS, Gebler N, Hwang I, Ursano RJ. Response bias, weighting adjustments, and design effects in the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS) International Journal of Methods in Psychiatric Research. 2013;22:288–302. doi: 10.1002/mpr.1399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kessler RC, Warner CH, Ivany C, Petukhova MV, Rose S, Bromet EJ, Ursano RJ. Predicting suicides after psychiatric hospitalization in US Army soldiers: the Army Study To Assess Risk and Resilience in Servicemembers (Army STARRS) JAMA Psychiatry. 2015;72:49–57. doi: 10.1001/jamapsychiatry.2014.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kimerling R, Gima K, Smith MW, Street A, Frayne S. The Veterans Health Administration and military sexual trauma. American Journal of Public Health. 2007;97:2160–2166. doi: 10.2105/AJPH.2006.092999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Koss MP. Rape. Scope, impact, interventions, and public policy responses. [Research Support, U.S. Gov’t, P.H.S.] The American Psychologist. 1993;48:1062–1069. [PubMed] [Google Scholar]
  31. Krebs CP, Lindquist CH, Warner TD, Fisher BS, Martin SL. The Campus Sexual Assault (CSA) Study (221153) Washington, DC: National Institute of Justice, U.S. Department of Justice; 2007. [Google Scholar]
  32. Kruttschnitt C, Kalsbeek WD, House CC. Estimating the incidence of rape and sexual assault. 2014 from http://www.nap.edu/read/18605/chapter/1. [PubMed]
  33. Lalor K, McElvaney R. Child sexual abuse, links to later sexual exploitation/high-risk sexual behavior, and prevention/treatment programs. [Research Support, Non-U.S. Gov’t Review] Trauma, Violence & Abuse. 2010;11:159–177. doi: 10.1177/1524838010378299. [DOI] [PubMed] [Google Scholar]
  34. Leardmann CA, Pietrucha A, Magruder KM, Smith B, Murdoch M, Jacobson IG, Smith TC. Combat deployment is associated with sexual harassment or sexual assault in a large, female military cohort. [Research Support, U.S. Gov’t, Non-P.H.S.] Women’s Health Issues : Official Publication of the Jacobs Institute of Women’s Health. 2013;23:e215–e223. doi: 10.1016/j.whi.2013.05.002. [DOI] [PubMed] [Google Scholar]
  35. Lisak D, Miller PM. Repeat rape and multiple offending among undetected rapists. Violence and Victims. 2002;17:73–84. doi: 10.1891/vivi.17.1.73.33638. [DOI] [PubMed] [Google Scholar]
  36. Littleton H, Axsom D, Grills-Taquechel A. Sexual assault victims’ acknowledgement status and revictimization risk. Psychology of Women Quarterly. 2009;33:34–42. [Google Scholar]
  37. Littleton H, Ullman SE. PTSD symptomatology and hazardous drinking as risk factors for sexual assault revictimization: examination in European American and African American women. Journal of Traumatic Stress. 2013;26:345–353. doi: 10.1002/jts.21807. [DOI] [PubMed] [Google Scholar]
  38. Mengeling MA, Booth BM, Torner JC, Sadler AG. Reporting sexual assault in the military: who reports and why most servicewomen don’t. American Journal of Preventive Medicine. 2014;47:17–25. doi: 10.1016/j.amepre.2014.03.001. [DOI] [PubMed] [Google Scholar]
  39. Merrill LL, Newell CE, Thomsen CJ, Gold SR, Milner JS, Koss MP, Rosswork SG. Childhood abuse and sexual revictimization in a female Navy recruit sample. Journal of Traumatic Stress. 1999;12:211–225. doi: 10.1023/A:1024789723779. [DOI] [PubMed] [Google Scholar]
  40. Merrill LL, Thomsen CJ, Gold SR, Milner JS. Childhood abuse and premilitary sexual assault in male Navy recruits. [Research Support, U.S. Gov’t, Non-P.H.S.] Journal of Consulting and Clinical Psychology. 2001;69:252–261. doi: 10.1037//0022-006x.69.2.252. [DOI] [PubMed] [Google Scholar]
  41. Messman-Moore TL, Long PJ. Child sexual abuse and revictimization in the form of adult sexual abuse, adult physical abuse, and adult psychological maltreatment. Journal of Interpersonal Violence. 2000;15:489–502. [Google Scholar]
  42. Morral AR, Gore KL, Schell TL, Bicksler B, Farris C, Ghosh-Dastidar B, Williams KM. Sexual Assault and Sexual Harassment in the U.S. Military: Annex to Volume 2. Tabular Results from the 2014 RAND Military Workplace Study for Department of Defense Service Members. 2015a from http://www.rand.org/pubs/research_reports/RR870z3.html.
  43. Morral AR, Gore KL, Schell TL, Bicksler B, Farris C, Ghosh-Dastidar B, Williams KM. Sexual Assault and Sexual Harassment in the U.S. Military: Volume 2. Estimates for Department of Defense Service Members from the 2014 RAND Military Workplace Study. 2015b from http://www.rand.org/pubs/research_reports/RR870z2.html.
  44. Morral AR, Gore KL, Schell TL, Jaycox LH, Ghosh-Dastidar B, Farris CMWK. Sexual Assault and Sexual Harassment in the U.S. Military: Top-Line Estimates for Active-Duty Service Members from the 2014 RAND Military Workplace Study. Santa Monica, CA: National Defense Research Institute; 2014. [Google Scholar]
  45. Niebuhr DW, Gubata ME, Oetting AA, Weber NS, Feng X, Cowan DN. Personality Assessment Questionnaire as a pre-accession screen for risk of mental disorders and early attrition in U. S. Army recruits. [Research Support, U.S. Gov’t, Non-P.H.S.] Psychological Services. 2013;10:378–385. doi: 10.1037/a0032783. [DOI] [PubMed] [Google Scholar]
  46. Orcutt HK, Erickson DJ, Wolfe J. A prospective analysis of trauma exposure: the mediating role of PTSD symptomatology. [Research Support, U.S. Gov’t, Non-P.H.S.] Journal of Traumatic Stress. 2002;15:259–266. doi: 10.1023/A:1015215630493. [DOI] [PubMed] [Google Scholar]
  47. Quinsey VL, Harris GT, Rice ME, Cormier CA. Violent offenders: appraising and managing risk. 2nd. Washington, DC: American Psychological Association; 2006. [Google Scholar]
  48. Risser HJ, Hetzel-Riggin MD, Thomsen CJ, McCanne TR. PTSD as a mediator of sexual revictimization: the role of reexperiencing, avoidance, and arousal symptoms. Journal of Traumatic Stress. 2006;19:687–698. doi: 10.1002/jts.20156. [DOI] [PubMed] [Google Scholar]
  49. Rock LM. 2012 Workplace and Gender Relations Survey of Active Duty Members. Washington, D.C: Defense Technical Information Center Retrieved; 2013. from http://www.sapr.mil/public/docs/research/2012_Workplace_and_Gender_Relations_Survey_of_Active_Duty_Members-Survey_Note_and_Briefing.pdf. [Google Scholar]
  50. Rosellini AJ, Monahan J, Street AE, Heeringa SG, Hill ED, Petukhova M, Kessler RC. Predicting non-familial major physical violent crime perpetration in the US Army from administrative data. Psychological Medicine. 2016;46:303–316. doi: 10.1017/S0033291715001774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Sadler AG, Booth BM, Cook BL, Doebbeling BN. Factors associated with women’s risk of rape in the military environment. American Journal of Industrial Medicine. 2003;43:262–273. doi: 10.1002/ajim.10202. [DOI] [PubMed] [Google Scholar]
  52. SAS Institute Inc. SAS/STATR Software (Version Version 9.3 for Unix) Cary, NC: SAS Institute Inc; 2010. [Google Scholar]
  53. Schewe PA. Preventing violence in relationships : interventions across the life span. Washington, D.C: American Psychological Association; 2002. [Google Scholar]
  54. Schlesselman JJ. Case-control studies: design, conduct, analysis. New York: Oxford University Press; 1982. [Google Scholar]
  55. Schry AR, White SW. Sexual assertiveness mediates the effect of social interaction anxiety on sexual victimization risk among college women. Behavior therapy. 2013;44:125–136. doi: 10.1016/j.beth.2012.09.001. [DOI] [PubMed] [Google Scholar]
  56. Secretary of Defense. 2014-2016 Sexual Assault Prevention Strategy. 2014 from http://www.sapr.mil/index.php/prevention/prevention-strategy.
  57. Sekar CC, Deming WE. On a method of estimating birth and death rates and the extent of registration. Journal of the American Statistical Association. 1949;44:101–115. [Google Scholar]
  58. Senn CY, Eliasziw M, Barata PC, Thurston WE, Newby-Clark IR, Radtke HL, Hobden KL. Efficacy of a sexual assault resistance program for university women. The New England Journal of Medicine. 2015;372:2326–2335. doi: 10.1056/NEJMsa1411131. [DOI] [PubMed] [Google Scholar]
  59. Street AE, Stafford J, Mahan CM, Hendricks A. Sexual harassment and assault experienced by reservists during military service: prevalence and health correlates. [Historical Article Research Support, U.S. Gov’t, Non-P.H.S.] Journal of Rehabilitation Research and Development. 2008;45:409–419. doi: 10.1682/jrrd.2007.06.0088. [DOI] [PubMed] [Google Scholar]
  60. Testa M, VanZile-Tamsen C, Livingston JA. Prospective prediction of women’s sexual victimization by intimate and nonintimate male perpetrators. Journal of Consulting and Clinical Psychology. 2007;75:52–60. doi: 10.1037/0022-006X.75.1.52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Tewksbury R, Mustaine EE. Lifestyle factors associated with the sexual assault of men: a routine activity theory analysis. Journal of Men’s Studies. 2001;9:153–182. [Google Scholar]
  62. Thornton D, Mann R, Webster S, Blud L, Travers R, Friendship C, Erikson M. Distinguishing and combining risks for sexual and violent recidivism. [Validation Studies] Annals of the New York Academy of Sciences. 2003;989:225–235. doi: 10.1111/j.1749-6632.2003.tb07308.x. discussion 236–246. [DOI] [PubMed] [Google Scholar]
  63. Turchik JA, Wilson SM. Sexual assault in the U.S. military: a review of the literature and recommendations for the future. Aggression and Violent Behavior. 2010;15:267–277. [Google Scholar]
  64. U.S. Department of Justice. National Corrections Reporting Program, 2009 (ICPSR 30799) Ann Arbor, MI: National Achive of Criminal Justice Data; 2009. [Google Scholar]
  65. Ullman SE. A critical review of field studies on the link of alcohol and adult sexual assault in women. Aggression and Violent Behavior. 2003;8:471–486. doi: http://dx.doi.org/10.1016/S1359-1789(03)00032-6. [Google Scholar]
  66. Ursano RJ, Colpe LJ, Heeringa SG, Kessler RC, Schoenbaum M, Stein MB. The Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS) Psychiatry. 2014;77:107–119. doi: 10.1521/psyc.2014.77.2.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Vladutiu CJ, Martin SL, Macy RJ. College- or university-based sexual assault prevention programs: a review of program outcomes, characteristics, and recommendations. [Review] Trauma, Violence & Abuse. 2011;12:67–86. doi: 10.1177/1524838010390708. [DOI] [PubMed] [Google Scholar]
  68. Whittington R, Hockenhull JC, McGuire J, Leitner M, Barr W, Cherry MG, Dickson R. A systematic review of risk assessment strategies for populations at high risk of engaging in violent behaviour: update 2002-8. Health Technology Assessment. 2013;17:1–128. doi: 10.3310/hta17500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Widman L, Olson MA, Bolen RM. Self-reported sexual assault in convicted sex offenders and community men. [Comparative Study] Journal of Interpersonal Violence. 2013;28:1519–1536. doi: 10.1177/0886260512468237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Willett JB, Singer JD. Investigating onset, cessation, relapse, and recovery: why you should, and how you can, use discrete-time survival analysis to examine event occurrence. Journal of Consulting and Clinical Psychology. 1993;61:952–965. doi: 10.1037//0022-006x.61.6.952. [DOI] [PubMed] [Google Scholar]
  71. Wolitzky-Taylor KB, Resnick HS, McCauley JL, Amstadter AB, Kilpatrick DG, Ruggiero KJ. Is reporting of rape on the rise? A comparison of women with reported versus unreported rape experiences in the National Women’s Study-Replication. [Comparative Study Research Support, U.S. Gov’t, Non-P.H.S.] Journal of Interpersonal Violence. 2011;26:807–832. doi: 10.1177/0886260510365869. [DOI] [PubMed] [Google Scholar]
  72. Wolter KM. Introduction to variance estimation. New York: Springer-Verlag; 1985. [Google Scholar]
  73. Yang M, Wong SC, Coid J. The efficacy of violence prediction: a meta-analytic comparison of nine risk assessment tools. Psychological Bulletin. 2010;136:740–767. doi: 10.1037/a0020473. [DOI] [PubMed] [Google Scholar]
  74. Zou H, Hastie T. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B. 2005;67:301–320. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Tables

RESOURCES