Abstract
Background/Aims
Before diagnosing premenstrual dysphoric disorder (PMDD), 2 months of prospective assessment are required to confirm menstrual cyclicity in symptoms. For a diagnosis of premenstrual syndrome (PMS), this is not required. Women with PMDD and PMS often report that their symptoms interfere with mood and social functioning, and are said to show cyclical changes in interpersonal behaviour, but this has not been examined using a prospective approach. We sampled cyclicity in mood and interpersonal behaviour for 2 months in women with self- reported PMS.
Methods
Participants met the criteria for PMS on the Premenstrual Symptoms Screening Tool (PSST), a retrospective questionnaire. For 2 menstrual cycles, after each social interaction, they used the online software TEMPEST to record on their smartphones how they felt and behaved. We examined within-person variability in negative affect, positive affect, quarrelsomeness, and agreeableness.
Results
Participants evaluated TEMPEST as positive. However, we found no evidence for menstrual cyclicity in mood and interpersonal behaviour in any of the individual women (n = 9).
Conclusion
Retrospective questionnaires such as the PSST may lead to oversampling of PMS. The diagnosis of PMS, like that of PMDD, might require 2 months of prospective assessment.
Keywords: Daily ratings, Ecological momentary assessment, Experience sampling method, Menstrual cycle, Premenstrual syndrome
Introduction
Premenstrual dysphoric disorder (PMDD) is considered a depressive disorder [1]. The diagnosis requires at least 1 marked psychological symptom (e.g., affective lability, irritability, or depressed mood) and at least 4 additional symptoms. Two months of daily symptom ratings are required to confirm that the symptoms are cyclical (i.e., present during the premenstrual phase and otherwise absent) and interfere with daily life. A PMDD diagnosis is made prospectively because previous studies have shown that this helps to differentiate PMDD from other depressive disorders [2, 3]. Women with PMDD or premenstrual syndrome (PMS) frequently report that their symptoms interfere with social functioning [4] and are said to show cyclical changes in interpersonal behaviour, particularly quarrelsomeness [5]. While premenstrual mood symptoms might be associated with premenstrual increases in quarrelsomeness, this has not been examined prospectively in women with PMS.
Daily symptom ratings, or diaries, are less likely to overestimate symptom severity than measures considering the past month [5]. However, diaries remain subject to memory biases, as people are asked to reconstruct each day. Morning experiences may be remembered inaccurately at night [6] and symptoms may vary throughout the day [7]. The experience sampling method (ESM), also known as an ecological momentary assessment (EMA), has been shown to complement diaries [6, 8]. Like diary data, ESM/EMA data provide insight into both within- and between-person variability in momentary states. Unlike diaries, ESM/EMA can be used to detect within-day patterns [6]. Event-contingent recording (ECR) is a type of ESM/EMA that asks people to indicate their state following prespecified events. A validated ECR method is available for assessing mood and interpersonal behaviour (e.g., negative affect [NA] and quarrelsomeness) during social interactions [9]. This method allows the prospective assessment of how people feel and behave during interactions with others.
In the present ECR study, 9 women with self-reported PMS reported on their everyday social interactions for 2 months. The online software TEMPEST [10] was used to administer the ECR questionnaires via the participants' smartphones. PMS was studied because prospective ratings are not currently a requirement for diagnosing PMS, while they are a requirement for diagnosing PMDD. We postulated that our results might show that prospective ratings may also aid in diagnosing PMS. Additionally, as past studies employing ECR of social interactions have exclusively used paper questionnaires and lasted less than 1 month, we studied participant satisfaction with TEMPEST.
Method
Participants
Advertisements posted in public buildings around the city asked women with premenstrual complaints to participate in a smartphone diary study. Respondents (n = 22) completed the Premenstrual Symptoms Screening Tool (PSST) [11]. Inclusion criteria were: scoring “moderate” or “severe” on at least 1 of the 4 core symptoms and on at least 4 other symptoms listed in part A of the PSST and at least “mild” on at least 1 of the items in part B, age 18–40 years, a regular menstrual cycle (28 ± 3 days), owning a suitable smartphone, no past or present diagnosis of a psychiatric disorder, not currently using psychotropic medication, not using hormonal contraceptives for at least 3 months prior to study entry, no current pregnancy, and not breastfeeding for at least 9 months prior to study entry. Ten respondents met these criteria and provided written informed consent; 9 completed this study. The Ethics Committee of Psychology of the University of Groningen approved this study.
Measures
In addition to the PSST, the Premenstrual Assessment Form (PAF) [12] was used to assess the severity of premenstrual complaints. The Quick Inventory of Depressive Symptomatology (QIDS; range 0–27) [13] was used to measure depression severity in the past week. The System Usability Scale (SUS; range 0–100) [14] evaluated participants' experiences with TEMPEST. A formal, system-independent representation of the ECR questionnaire that was running in TEMPEST is available at https://osf.io/j7ngw/.
The ECR questionnaires asked about the context of each social interaction (e.g., gender and role of the interaction partner), included items from the Dutch language Social Behaviour Inventory [15] for assessing quarrelsomeness, agreeableness, dominance, and submissiveness, and sampled NA and positive affect (PA) using 5 and 4 adjectives, respectively [16].
A daily questionnaire was completed each morning for obtaining data on participants' menstruation (absent vs. present).
Data Analysis
The data were analysed with longitudinal Bayesian MCMC models. Daily mean levels of NA, PA, quarrelsomeness, and agreeableness were created so the data could be analysed with time series models for measurements equidistant in time. These mean levels were transformed to lie within the interval (0, 1) and were subsequently modelled through β-distributions (see https://osf.io/j7ngw/ for details).
Several increasingly complex group and individual models were fitted to the data. Analyses started with model 1, which included the number of days (Ni = 1, 2, …, n) for person i (i = A, …, I). In model 2 the spread was dependent on the number of social interactions per day. Model 3 included a moving average, thus taking the score of the previous day into account to predict the value of the following day. Dummy variables for the menstrual phases were included in model 4, at both a group level (all participants receiving the same estimates; model 4A) and an individual level (all participants receiving person-specific estimates; model 4B). Model selection was based on the deviance information criterion [17], a measure of model fit that penalises for complexity, with a lower value indicating a better fit. When the difference in deviance information criterion values exceeded 10, a model was discarded for a less parsimonious one [18].
The menstrual phase, based on the days on which a participant reported menstruation, varied in duration (range: 5–8 days). The premenstrual phase was defined as the 5 days preceding the menstrual phase (i.e., late luteal phase). The postmenstrual phase was defined as the 7 days following the menstrual phase (i.e., late follicular phase). The intermenstrual phase entailed the remaining cycle days; these were typically the early luteal days.
Results
Retrospective Measures
Table 1 provides individual responses on the PSST, the PAF, and the QIDS. According to the PSST, all of the participants except participant I had moderate-to-severe PMS. Participant I had the same PSST total score as participant H, but her symptoms interfered only mildly with her daily life. Nonetheless, her average PAF score across 3 consecutive months (PAF1–3) was higher than that of 3 other participants.
Table 1.
Participant | PSSTtotala | PSST core symptomsb | PSST other symptomsb | PSST PSST interferenceb | PAF1 | paf2 | paf3 | PAF1–3 (SD)c | QIDS1 | QIDS2 | QIDS3 | QIDS1–3 (SD)d |
---|---|---|---|---|---|---|---|---|---|---|---|---|
A | 55 | 3 | 7 | 3 | 47 | 48 | 43 | 46 (2.6) | 10 | 11 | 14 | 11.7 (2.1) |
B | 52 | 4 | 7 | 1 | 48 | 47 | 51 | 48.7 (2.1) | 4 | 10 | 5 | 6.3 (3.2) |
C | 49 | 4 | 6 | 2 | 50 | 50 | 42 | 47.3 (4.6) | 5 | 4 | 7 | 5.3 (1.5) |
D | 49 | 2 | 8 | 1 | 37 | 36 | 27 | 33.3 (5.5) | 18 | 18 | 14 | 16.7 (2.3) |
E | 47 | 2 | 5 | 2 | 33 | 37 | 46 | 38.7 (6.7) | 1 | 2 | 6 | 3 (2.6) |
F | 47 | 4 | 4 | 1 | 40 | 33 | 39 | 37.3 (3.8) | 2 | 5 | 2 | 3 (1.7) |
G | 47 | 2 | 7 | 1 | 42 | 24 | 27 | 31 (9.6) | 3 | 6 | 6 | 5 (1.7) |
H | 43 | 2 | 7 | 1 | 36 | 29 | 30 | 31.7 (3.8) | 6 | 9 | 8 | 7.7 (1.5) |
I | 43 | 2 | 4 | 0 | 35 | 37 | 32 | 34.7 (2.5) | 9 | 17 | 14 | 13.3 (4.0) |
Mean of all | 48.0 | 2.8 | 6.1 | 1.3 | 40.9 | 37.9 | 37.4 | 38.7 | 6.4 | 9.1 | 8.4 | 8.0 |
particpants (SD) | (3.9) | (0.97) | (1.5) | (0.87) | (6.2) | (8.9) | (8.8) | (6.9) | (5.3) | (5.6) | (4.5) | (4.8) |
The PAF and the QIDS were administered: (1) before, (2) halfway, and (3) after the social interaction data collection. PSST, Premenstrual Symptoms Screening Tool; PAF, Premenstrual Assessment Form; QIDS, Quick Inventory of Depressive Symptomatology.
Total score of all items combined.
Number of items with a score indicating moderate-to-severe symptoms or interference.
Values are based on the mean of PAF1, PAF2, and PAF3. d Values are based on the mean of QIDS1, QIDS2, and QIDS3.
The Spearman correlation r between the PSST and the PAF1–3 was 0.66 (n = 9, p = 0.054), confirming that both questionnaires tap into the same construct. The correlations between the PSST and the QIDS1–3 and between the PAF1–3 and the QIDS1–3 were r = 0.11 (p = 0.74) and r = −0.14 (p = 0.72), respectively, indicating that, in our sample, PMS symptoms were mostly unrelated to depressive symptoms.
ECR Data
Participants recorded their interactions for 61–77 days (mean = 65.56, SD = 6.62). The mean number of missing days was 3.56 (SD = 5.15, range 0–15). The mean total number of interactions was 248 (SD = 92.59, range 114–360). The mean daily number of interactions was 3.91 (SD = 1.10, range 0–16).
Mood and Interpersonal Behaviour across the Menstrual Cycle
Model 1 fitted the ECR data consistently much worse than the other models (Table 2). As model 3 had the best fit with the NA data, we found no evidence for menstrual cyclicity in NA. If there had been cyclicity in NA, then including the menstrual phases in model 4 should have provided a substantial improvement in model fit compared to model 3. We also found no evidence for cyclicity in PA, quarrelsomeness, and agreeableness. For all 3 variables, model 2 had the best fit. We checked whether the fit would improve with median, minimum, or maximum daily scores, rather than daily averages. Results (see https://osf.io/j7ngw/) provided no indications for altered conclusions.
Table 2.
Model | Entire sample (n = 9)a |
Highest PSST scores (n = 3)c |
||||||
---|---|---|---|---|---|---|---|---|
NA | PA | quarrelsomeness | agreeableness | NA | PA | quarrelsomeness | agreeableness | |
1 (number of days per person) | –2,222 | –759 | –1,091 | –1,058 | –727 | –363 | –436 | –432 |
2 (number of social interactions) | –2,311b | –846b | –1,932b | –1,704b | –771b | –429b | –738b | –694b |
3 (moving average) | –2,336b | –855 | –1,925 | –1,697 | –787b | –425 | –739 | –697 |
4A (group dummy variables for phases of the menstrual cycle) | –2,334 | –850 | –1,919 | –1,692 | –790 | –427 | –734 | –700 |
4B (individual dummy variables for phases of the menstrual cycle) | –2,335 | –857 | –1,896 | –1,690 | –790 | –423 | –726 | –700 |
DIC, deviance information criterion; NA, negative affect; PA, positive affect; PSST, Premenstrual Symptom Screening Tool.
DIC values can only be compared for models on the same data [17]. The DIC values of NA are lower than those of PA, and this is due to more variation in the NA values compared to the PA values and not necessarily to a better model fit.
The decrease from the previous model exceeds 10.
Participants A to C.
When the analyses were repeated in the 3 participants (A to C) who reported the most severe PMS symptoms and the most interference in daily life on the PSST, and the most severe cyclical change in symptoms on the PAF, and the pattern was similar to that of all 9 participants (Table 2). In sum, we found no evidence for menstrual phase differences in mood and interpersonal behaviour.
As the dependent variables were aggregated into daily means, it is possible that cyclicity of NA, PA, quarrelsomeness, and agreeableness was confounded by menstrual cyclicity in the occurrence of a specific social context. However, when we investigated whether this was the case, the additional analyses provided no indications for altered conclusions (also see https://osf.io/j7ngw/).
User Experiences with TEMPEST
Participants occasionally reported issues with the software not responding or responding slowly. Nonetheless, their mean SUS score was 84.17 (SD = 7.60), indicating an overall positive experience.
Discussion
In 9 women with self-reported PMS we found no evidence for variation in mood (NA and PA) and interpersonal behaviour (quarrelsomeness and agreeableness) across the menstrual cycle. Thus, according to our prospective ECR data, the premenstrual phase of our participants did not appear to be characterized by 3 core symptoms of PMS, i.e., depressed mood, anxiety/tension, and irritability. However, on the PSST, participants reported these symptoms retrospectively, thereby indicating that they had PMS. This inconsistency suggests that prospective methods may be needed for diagnosing PMS in individual women, similar to what is currently required for PMDD [1].
Discrepant retrospective and prospective mood assessments have been reported previously. Ainscough [19] used the Moos Menstrual Distress Questionnaire to assess NA daily for 8 weeks in 51 women. Most participants retrospectively reported having experienced premenstrual mood symptoms; however the Moos Menstrual Distress Questionnaire data provided no indication of menstrual cyclicity in NA. Our study adds that retrospectively reported PMS may also not be reflected in prospectively measured PA or in prospective measures of interpersonal behaviour rather than mood.
Our results suggests the PSST, a retrospective measure, may be invalid for diagnosing PMS. Other retrospective measures have also been found to result in symptom overestimation [5]. Nonetheless, it has also been reported that the PSST yields prevalence rates of PMDD and PMS that are comparable to rates reported in prospective studies [11], and that retrospective and prospective assessment of premenstrual symptoms are positively correlated [20]. Thus, also given our small sample size, more research on the validity of the PSST is warranted.
An additional aim of our study was to evaluate the TEMPEST software for data collection [10]. Offering the ECR questionnaires online had several advantages. First, while previous paper-based studies provided participants with 10 questionnaires per day, in the present study participants could complete as many questionnaires as they wished (the daily maximum was 16). Second, there was no need to return the completed questionnaires by post, which reduced the study burden and missing data. Third, we could monitor whether participants completed the ECR questionnaires regularly. This was communicated beforehand to prevent backfilling questionnaires, for example when participants forgot to carry or charge their phone, which occurred infrequently. Importantly, participants evaluated their overall experience with TEMPEST as positive.
In our study, the premenstrual phase was defined as the 5 days preceding menstruation. However, the number of days during which symptoms are reported can vary between women, as well as within women between menstrual cycles [21]. We also fitted models in which the length of the premenstrual and postmenstrual phases was 7 days, but again no cyclicity was found (model outcomes are available at https://osf.io/j7ngw/). Nonetheless, to gain more insight into cycle phases, daily measures of hormone levels or body temperature could have been included.
The small sample size may be considered another limitation. However, we were interested in menstrual cyclicity in mood and interpersonal behaviour in individual women. Clinically, these are more relevant than group effects. Nonetheless, replication in larger samples is recommended.
In conclusion, women who report having PMS on the PSST, a retrospective measure, may prospectively show no menstrual cyclicity in mood and interpersonal behaviour. This conclusion is preliminary in light of this study's limitations but supports the idea that a PMS diagnosis cannot be made using retrospective measures alone and, like for PMDD, should include prospective measures.
Funding Sources
The NWO had no role in the study design, data collection and analysis, the decision to publish, or in the preparation of this paper.
Acknowledgment
This study was funded by the Innovation Research Incentives Scheme Veni of the Netherlands Organization for Scientific Research (NWO) via a grant awarded to Dr. aan het Rot (No. 451-09-013).
References
- 1.American Psychiatric., Association . Washington: American Psychiatric Association; 2013. Diagnostic and Statistical Manual of Mental Disorders, ed 5. [Google Scholar]
- 2.Pincus SM, Schmidt PJ, Palladino-Negro P, Rubinow DR. Differentiation of women with premenstrual dysphoric disorder, recurrent brief depression, and healthy controls by daily mood rating dynamics. J Psychiatr Res. 2008;42:337–347. doi: 10.1016/j.jpsychires.2007.01.001. [DOI] [PubMed] [Google Scholar]
- 3.Gehlert S, Hartlage S. A design for studying the DSM-IV research criteria of premenstrual dysphoric disorder. J Psychosom Obstet Gynecol. 1997;18:36–44. doi: 10.3109/01674829709085567. [DOI] [PubMed] [Google Scholar]
- 4.Heinemann LA, Minh TD, Heinemann K, Lindemann M, Filonenko A. Intercountry assessment of the impact of severe premenstrual disorders on work and daily activities. Health Care Women Int. 2012;33:109–124. doi: 10.1080/07399332.2011.610530. [DOI] [PubMed] [Google Scholar]
- 5.Haywood A, Slade P, King H. Assessing the assessment measures for menstrual cycle symptoms: a guide for researchers and clinicians. J Psychosom Res. 2002;52:223–237. doi: 10.1016/s0022-3999(02)00297-0. [DOI] [PubMed] [Google Scholar]
- 6.Shiffman S, Stone AA, Hufford MR. Ecological momentary assessment. Annu Rev Clin Psychol. 2008;4:1–32. doi: 10.1146/annurev.clinpsy.3.022806.091415. [DOI] [PubMed] [Google Scholar]
- 7.Steinberg EM, Cardoso GMP, Martinez PE, Rubinow DR, Schmidt PJ. Rapid response to fluoxetine in women with premenstrual dysphoric disorder. Depress Anxiety. 2012;29:531–540. doi: 10.1002/da.21959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Csikszentmihalyi M, Larson R. Validity and reliability of the experience sampling method. J Nerv Ment Dis. 1987;175:526–536. doi: 10.1097/00005053-198709000-00004. [DOI] [PubMed] [Google Scholar]
- 9.Moskowitz DS, Sadikaj G. Event-contingent recording. In: Mehl MR, Conner TS, editors. Handbook of Research Methods for Studying Daily Life. New York: Guilford; 2013. pp. 160–175. [Google Scholar]
- 10.Batalas N, Markopoulos P. Introducing Tempest, a modular platform for in situ data collection. 7th Nord Conf Hum-Comput Interact: Making Sense Des. Copenhagen. 2012:781–782. [Google Scholar]
- 11.Steiner M, Macdougall M, Brown E. The Premenstrual Symptoms Screening Tool (PSST) for clinicians. Arch Womens Ment Health. 2003;6:203–209. doi: 10.1007/s00737-003-0018-4. [DOI] [PubMed] [Google Scholar]
- 12.Allen SS, McBride CM, Pirie PL. The shortened premenstrual assessment form. J Reprod Med. 1991;36:769–772. [PubMed] [Google Scholar]
- 13.Rush AJ, Trivedi MH, Ibrahim HM, Carmody TJ, Arnow B, Klein DN, Markowitz JC, Ninan PT, Kornstein S, Manber R, Thase ME, Kocsis JH, Keller MB. The 16-item Quick Inventory of Depressive Symptomatology (QIDS), Clinician Rating (QIDS-C), and Self-Report (QIDS-SR): a psychometric evaluation in patients with chronic major depression. Biol Psychiatry. 2003;54:573–583. doi: 10.1016/s0006-3223(02)01866-8. [DOI] [PubMed] [Google Scholar]
- 14.Brooke J. SUS: a quick and dirty usability scale. Usability Eval Ind. 1996;189:4–7. [Google Scholar]
- 15.aan het Rot M, Hogenelst K, Moskowitz DS. Comparability of the Social Behavior Inventory in English and Dutch. J Pers Assess. 2013;2013:5. doi: 10.1080/00223891.2013.775138. [DOI] [PubMed] [Google Scholar]
- 16.Diener E, Emmons RA. The independence of positive and negative affect. J Pers Soc Psychol. 1984;47:1105–1117. doi: 10.1037//0022-3514.47.5.1105. [DOI] [PubMed] [Google Scholar]
- 17.Spiegelhalter DJ, Best NG, Carlin BP, Van Der., Linde A Bayesian measures of model complexity and fit. J R Statist Soc Series B Statist Methodol. 2002;64:583–639. [Google Scholar]
- 18.Barendse MT, Albers CJ, Oort FJ, Timmerman ME. Measurement bias detection through Bayesian factor analysis. Front Psychol. 2014;5:1087. doi: 10.3389/fpsyg.2014.01087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ainscough CE. Premenstrual emotional changes a prospective study of symptomatology in normal women. J Psychosom Res. 1990;34:35–45. doi: 10.1016/0022-3999(90)90006-p. [DOI] [PubMed] [Google Scholar]
- 20.Steiner M, Streiner DL. Validation of a revised visual analog scale for premenstrual mood symptoms: results from prospective and retrospective trials. Can J Psychiatry. 2005;50:327–332. doi: 10.1177/070674370505000607. [DOI] [PubMed] [Google Scholar]
- 21.Pearlstein T, Yonkers KA, Fayyad R, Gillespie JA. Pretreatment pattern of symptom expression in premenstrual dysphoric disorder. J Affect Disord. 2005;85:275–282. doi: 10.1016/j.jad.2004.10.004. [DOI] [PubMed] [Google Scholar]