Skip to main content
Springer logoLink to Springer
. 2024 Oct 30;34(2):289–304. doi: 10.1007/s11136-024-03811-z

Psychometric properties of self-reported measures of psychological birth trauma in puerperae: A COSMIN systematic review

Pingping Chen 1,4, Chao Zhang 2, Guangjian Liu 3, Hongxia Zuo 2, Menghe Wang 1, Xiaoyan Shi 4, Longti Li 5,
PMCID: PMC11865166  PMID: 39476199

Abstract

Purpose

To identify and evaluate the psychometric properties of available patient-reported outcome measures (PROMs) of psychological birth trauma (PBT) in postpartum women.

Methods

A literature search was carried out across eight databases—PubMed, Embase, Web of Science, CINAHL, PsycINFO, China National Knowledge Infrastructure (CNKI), Wanfang Database, and VIP Database for Chinese Technical Periodicals—covering studies published from the inception of each database up to 21 May 2024. English and Chinese language studies employing any research design and reporting at least one psychometric property of PBT in puerperae were included. Independent reviewers extracted data and followed the Consensus-based Standards for the selection of health Measurement Instruments (COSMIN) guidelines to evaluate three aspects of the included instruments: methodological quality, psychometric properties, and level of evidence assessed using the modified Grading of Recommendations Assessment, Development, and Evaluation (GRADE) framework.

Results

Thirty-one studies with twelve PROMs were included, measurement error, cross-cultural validity, and responsiveness were not reported for most PROMs. Nine PROMs received a Class B recommendation, based on their measurement attribute ratings and overall evidence quality. In contrast, the CTI, IES-R, and PBTAS with high-quality evidence for insufficient measurement properties, so received a Class C recommendation.

Conclusions

This systematic review provisionally recommends the City BiTS as credible tool for assessing PBT in both clinical practice and research involving puerperae. However, further comprehensive studies are needed to conduct more comprehensive validations of the psychometric properties of existing PROMs.

Supplementary Information

The online version contains supplementary material available at 10.1007/s11136-024-03811-z.

Keywords: Psychological birth trauma, Puerperae, Patient-reported outcome measures, Psychometric propertie, COSMIN, Systematic review

Plain English summary

Psychological birth trauma (PBT) occurs frequently in maternal women all over the world, through a maternal-centered ripple effect, which has a wide and far-reaching negative impact on the health of the mother, her infant, her partner, family relationships, and future reproductive decision-making, as well as the increased utilization of health care resources. While numerous scales have been developed to measure PBT, there is currently no consensus on which instrument is most suitable to measure PBT in puerperae. We conducted a systematic review following the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) methodology to summarize and review the psychometric properties of available patient-reported outcome measures (PROMs) of PBT, aimed to guide healthcare providers about appropriate selection of a PROM for assessing maternal PBT in postpartum women. This systematic review recommends the City Birth Trauma Scale as the preferred assessment tool for measuring PBT in clinical practice and research involving puerperae. However, further comprehensive studies are needed to better understand and improve the psychometric properties of all scales.

Introduction

Maternal health is crucial to public health, with growing emphasis on psychological well-being alongside physical health. Although childbirth is a natural process for most women, it can be physically and psychologically traumatic for some [1]. Approximately 10% to 20% of women report traumatic childbirth, leading to significant emotional distress and long-term negative mental health outcomes [2, 3]. While Psychological Birth Trauma (PBT) lacks a universally accepted definition, Walker and Avant’s concept analysis identified four key elements: subjective emotions, distressing experiences, trauma during the birth process, and the persistence of these effects into the postnatal period [4]. Reported incidences of PBT range from 10 to 44% [5, 6]. Beck et al. [7] highlighted the "ripple effect" of childbirth trauma, which affects not only the mother but also the mother-infant bond [8], breastfeeding [9], marital relationships [10], future reproductive choices [6], partners' mental health [11], and healthcare service utilization [12]. Studies estimate that around 19% of women develop PTSD after traumatic childbirth [13], a rate significantly higher than the 4% in the general postpartum population [14]. Thus, recognising and understanding PBT is essential for ensuring appropriate support and interventions.

Patient-reported outcome measures (PROMs) are tools used to evaluate patients' health, health-related quality of life, and other relevant constructs. PROMs have proven effective in capturing patients' subjective health perceptions, especially when direct observations are challenging or time is limited in clinical settings [15]. By improving communication between patients and physicians and informing clinical decision-making, PROMs play a crucial role in improving care quality and health outcomes [16].

Currently, several PROMs are available for assessing maternal Psychological Birth Trauma (PBT), including the City Birth Trauma Scale (City BiTS) [17], Perinatal Posttraumatic Stress Disorder Questionnaire-Revised (PPQ-R) [18], Childbirth Trauma Index (CTI) [19], Childbirth Trauma Index-Revised (CTI-R) [20], the Post-Traumatic Stress Disorder Checklist for DSM-5 (PCL-5) [21], Posttraumatic Diagnostic Scale (PDS) [22], Birth Trauma Perception Scale for Women During Vaginal Delivery (BTPS-WVD)[23], Psychological Childbirth Trauma Assessment Scale (PCTAS) [24], Impact of Event Scale-Revised (IES-R) [25], Maternal Childbirth Trauma Scale (MCTS)[26], and Psychological Birth Trauma Assessment Scale (PBTAS) [27]. Despite the availability of these tools, prior studies have not rigorously evaluated their measurement properties or offered recommendations for their selection [28, 29]. The Consensus-based Standards for health Measurement instruments (COSMIN) methodology offers guidelines for systematically reviewing the methodological quality and psychometric properties of PROMs [30]. These guidelines assist researchers and clinicians in choosing the most appropriate PROMs for both research and clinical practice. This systematic review used the COSMIN methodology to synthesise studies on self-report instruments assessing PBT from the perspectives of postpartum women. It assessed the psychometric properties and methodological quality of these instruments to provide evidence-based, transparent recommendations for the use of PROMs.

Methods

Design

We conducted a systematic review in accordance with the COSMIN guidelines and the manual for systematic reviews of PROMs [30], following the PRISMA-COSMIN reporting guidelines for Outcome Measurement Instruments [31]. The protocol was registered with the International Prospective Register of Systematic Reviews (PROSPERO) (CRD42024425406).

Search strategy

A systematic search was performed across eight databases: PubMed, Web of Science, Embase, CINAHL, PsycINFO, China National Knowledge Infrastructure (CNKI), Wanfang, and the VIP Database for Chinese Technical Periodicals. The search included studies published from the inception of each database up to May 21, 2024, and the references of the included studies were traced. The search strategy combined MeSH terms, entry terms, and a filter developed by Terwee et al.[32] to identify studies related to measurement properties. Detailed search strategies for all databases are provided in Supplementary Information S1.

Eligibility criteria

The inclusion criteria were: (I) studies on the development or validation of self-administered instruments for assessing PBT; (II) studies involving puerperal or postpartum women; and (III) reporting at least one measurement property of the instrument. The exclusion criteria were: (I) studies using the PROM solely as an outcome measure (e.g., in randomized controlled trials or in the validation of another instrument); (II) studies not published in English or Chinese; (III) secondary literature, including books, conference papers, reviews, systematic reviews, or meta-analyses; and (IV) duplicate publications.

Study selection and data extraction

Two researchers, trained in evidence-based methodologies and familiar with COSMIN guidelines, independently performed the literature screening and data extraction. EndNote X9 was initially used to remove duplicates. Titles and abstracts were screened to identify relevant studies, followed by a full-text review for further assessment. Reasons for exclusion were documented. For eligible studies, data were independently extracted using a standardized form, and accuracy and completeness were verified. Discrepancies were resolved through discussion with a third researcher. Extracted data included the first author, publication year, PROM name, country and language of the research, target population age, sample size, number of items, scoring method, and total score.

Data analysis

According to the COSMIN guidelines (http://www.cosmin.nl), the evaluation included a methodological quality rating, assessment of measurement properties, evidence synthesis, and evidence grading.

Assessment of the methodological quality

The COSMIN Risk of Bias checklist [33] consists of ten sections covering standards for PROM development and nine measurement properties: content validity, structural validity, internal consistency, cross-cultural validity/measurement invariance, reliability, measurement error, criterion validity, hypotheses testing for construct validity, and responsiveness. The checklist comprises 116 items, with reliability and measurement error assessed using the 2021 updated version [34]. Each item was rated as "very good (V)", "adequate (A)", "doubtful (D)", "inadequate (I)", or "not applicable (NA)". The overall methodological quality rating for each study was determined by the lowest score within the box. For instance, if the lowest rating in the structural validity box was "inadequate," the overall structural validity rating for that study would be "inadequate."

Evaluation of psychometric properties

PROM measurement attributes were evaluated using the COSMIN criteria [35], which included content validity, construct validity, internal consistency, stability, measurement error, hypotheses testing for construct validity, cross-cultural validity or measurement invariance, criterion validity, and responsiveness. Content validity was assessed following the COSMIN methodology, focusing on three aspects: relevance, comprehensiveness, and comprehensibility [36, 37]. Each item was rated as "sufficient ( +)", "insufficient (−)", or "indeterminate (?)". The detailed criteria for good measurement properties are provided in Supplementary Information S2. For construct validity, the review team defined a priori hypotheses for sufficient measurement properties, which are presented in Supplementary Information S3.

Summarizing the evidence and grading the quality of the evidence

When summarizing evidence for each measurement property of a PROM, if all studies yield results of either " + " or "−", the overall rating will be either " + " or "−". In cases of inconsistent results, the review team must select the most appropriate strategy. Possible strategies include: explaining and summarizing results by subgroup; not summarizing and rating the results as " ± " without grading the evidence; or rating based on 75% consistency while downgrading for inconsistencies. The quality of evidence was assessed using the modified Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach for systematic reviews of clinical trials in this study [35]. The default assumption is that the overall result is of high quality; however, it can be downgraded by one or two levels based on risk of bias (study quality), inconsistency (variation in results), imprecision (due to small sample sizes), or indirectness (evidence from different populations, interventions, or outcomes). Details of the modified GRADE approach are provided in Supplementary Information S4, with quality levels defined as "high," "moderate," "low," and "very low" in Supplementary Information S5. PROM recommendations are categorized as follows: if the scale’s content validity is "sufficient ( +)" and the evidence for internal consistency is at least "low," it receives a Class A recommendation. A scale with "high quality" evidence of "insufficient (−)" content validity or other psychometric properties is not recommended and receives a Class C recommendation. PROMs not classified as A or C are designated as Class B. The COSMIN Recommended Criteria for Selecting the Most Suitable PROMs are detailed in Supplementary Information S6.

Results

Search results

A total of 1905 studies were initially identified, with an additional 4 studies found through reference and citation searches. A total of 321 duplicates were removed, resulting in 70 articles obtained after screening titles and abstracts. Of these, thirty-three studies [1727, 3859] were ultimately included in the review. The PRISMA flow diagram and reasons for exclusion are presented in Fig. 1.

Fig. 1.

Fig. 1

PRISMA flow diagram illustrating screening and selection of articles.

Characteristics of the included PROMs

A total of thirty-three studies examining twelve self-report scales measuring PBT from the perspectives of patients were included. The psychometric properties of the City BiTS were tested in sixteen studies, the PPQ-R was cross-culturally adapted in five studies, and the original scale was evaluated in one study; additionally, the reliability and validity of the PCL-5 were reported in three studies. Other measurement instruments, including the CTI, CTI-R, PDS, BTPS-WVD, IES-R, PCTAS, MCTS, and PBTAS, were validated in only a single study each. All scales included in this review specifically targeted mothers within 18 months postpartum. The studies included in this review spanned a long period, with all studies published between 1999 and 2024. The majority of PROMs were published in English. Detailed characteristics of the included PROMs are provided in Table 1.

Table 1.

Characteristics of the included patient-reported outcome measures (PROMs) (n = 33)

PROM Year Author Country/language Target population Age(years) Sample size Number of
items
Response options Score range
City BiTS 2018 Susan Ayers [17] UK/English mothers, within 1 year of childbirth 32.25 ± 4.62 950 29 liker4 (0 – 3) 0 – 60
2018 Joanthan E [40] Israel/Hebrew mothers, within 1 month of childbirth 31 ± 3.5 504 31 liker4 (0 – 3) 0 – 60
2020 Sandra Nakic´ Radoš [44] Croatia/Croatian mothers, within 1 year of childbirth 30.64 ± 4.68 603 29 liker4 (0 – 3) 0 – 60
2020 Bayrı Bingöl [46] Turkey/Turkish mothers, in the sixth month of childbirth 28.97 ± 5.03 315 29 liker4 (0 – 3) 0 – 60
2020 Zichen Shen [45] China/Chinese mothers, within 1 year of childbirth 20 – 42 328 29 liker4 (0 – 3) 0 – 60
2020 Mariana Fortunata Donadon [43] Brazil/Portuguese mothers, within 1 year of childbirth 30.81 ± 6.3 16 35 NR NR
2021 Flavia de Lima Osório [52] Brazil/Portuguese mothers, within 1 year of childbirth 30.8 ± 5.98 343 29 liker4 (0 – 3) 0 – 60
2021 Tobias Weigl [49] Germany/German mothers, within 1 year of childbirth 30.6 ± 4.7 1072 29 liker4 (0 – 3) 0 – 60
2021 Rafael A. Caparros-Gonzalez [47] Spain/Spanish mothers, within 1 month of childbirth 31.94 ± 5.32 207 29 liker4 (0 – 3) 0 – 60
2022 Alysha-leigh Fameli [54] Australia/English mothers, within 1 year of childbirth 31.06 ± 3.92 705 29 liker4 (0 – 3) 0 – 60
2022 Vania Sandoz [53] France/French mothers, within 1 year of childbirth 30.14 ± 4.25 541 29 liker4 (0 – 3) 0 – 60
2022 Hannah Dobson [50] Oceania /English mothers, within 1 year of childbirth 32.1 ± 4.32 195 29 liker4 (0 – 3) 0 – 60
2022 Nie Xiaofei [51] China/Chinese mothers, within 1 year of childbirth 29.83 ± 4.16 596 27 liker4 (0 – 3) NR
2023 Greta Stén [55] Sweden/Swedish woman, within 6 – 16 weeks of childbirth 32.77 ± 4.50 619 29 liker4(0 – 3) 0 – 60
2023 Amin Vatanparast [56] Iran/Persian mothers, within 1 year of childbirth 29.91 ± 4.58 300 29 liker4(0 – 3) 0 – 60
2024 Olga  Riklikiene [59] Lithuania/ Lithuanian mothers, within 1 – 12 months of childbirth 30.18 ± 4.86 794 22 liker4(0 – 3) NR
CTI 2011 Cheryl Anderson [19] USA/English, Spanish adolescents, within 72 h of childbirth 15 – 19 112 14 liker 4(0 – 3) 0 – 42
CTI-R 2020 Anderson, C. A [20] USA/English adolescents, within 3 days postpartum 13 – 19 160 8 liker 4(0 – 3) 0 – 24
PPQ 1999 Francella A [38] USA/English mothers, within 6 months of childbirth NR 142 14 liker 2 NR
PPQ-R 2006 JL Callahan [18] USA/English mothers of premature and other high-risk infants 25.22 ± 1.05 144 14 liker5(0 – 4) 0 – 56
2018 Kaboni Whitney Gondw [39] Chichewa/Chichewa mothers, within 6 weeks of childbirth 25.5 ± 6.31 30 14 liker5(0 – 4) 0 – 56
2018 Di Zhang [41] China/Wuhan/Chinese mothers, within 1 – 18 months of childbirth NR 243 14 liker5(0 – 4) 0 – 56
2021 Antonio Hernández-Martínez [48] Spain/Spanish mothers, within 6 months of childbirth 35.4 ± 4.22 432 14 liker5(0 – 4) 0 – 56
2024 Stephanie  Alves[57] European/Portuguese mothers, within 1 – 18 months of childbirth 32.84 ± 5.03 451 11 liker5(0 – 4) NR
PCL-5 2021 Eirini Orovou [21] Greece/Greek mothers, within 6 weeks of childbirth 32.58 ± 6.15 469 20 liker5(1 – 5) 20 – 80
2022 Rachel Buhagiar [42] Malta/Maltese mothers, within 6 months of childbirth 36.8 ± 3.45 175 12 liker5(1 – 5) NR
2024 Isha Hemant Arora, Btech [58] USA/English women who experienced traumatic childbirth 33.47 ± 4.63 59 20 liker5(1 – 5) 20 – 80
PDS 2017 Pelin Dikmen-Yildiz [22] Turkey/Turkish mothers, within 6 months of childbirth 27 ± 5.31 829 17 Liker4(0 – 3) 0 – 51
IES-R 2006 Eelco Olde [25] Netherlands/Dutch mothers, within 3 months of childbirth 30.9 ± 3.5 436 15 Liker4(0 – 5) NR
BTPS-WVD 2023 Zerong Lian [23] China/Chinese Women, vaginal delivery NR 225 31 NR NR
PCTAS 2023 Ke Zhang [24] China/Chinese mothers, within 72 h of childbirth 30 ± 3.7 712 15 NR NR
MCTS 2023 Hui Wang [26] China/Chinese mothers, within 2 weeks of childbirth NR 178 25 NR NR
PBTAS 2024 Xiaoqin Sun [27] China/Chinese mothers, within 1 year of childbirth NR 516 29 liker5(1 – 5) NR

PROM patient-reported outcome measure, City BiTS City Birth Trauma Scale, PPQ Perinatal Posttraumatic Stress Disorder Questionnaire, PPQ-R Perinatal Posttraumatic Stress Disorder Questionnaire-Revised, CTI Childbirth Trauma Index, CTI-R Childbirth Trauma Index-Revised, PCL-5 Post Traumatic Stress Disorder Checklist for DSM-5, PDS Posttraumatic Diagnostic Scale, BTPS-WVD Birth Trauma Perception Scale for Women During Vaginal Delivery; IES-R Impact of Event Scale—Revised, PCTAS Psychological Childbirth Trauma Assessment Scale, MCTS Maternal Childbirth Trauma Scale, PBTAS Psychological Birth Trauma Assessment Scale. NR not reported

Methodological quality assessment and psychometric properties rating of the included studies

The evaluations of the methodological quality and measurement properties of all included studies are presented in Table 2. None of the studies assessed measurement error or responsiveness.

Table 2.

Methodological Quality Assessment and psychometric properties rating of the included studies (n = 33)

PROM Author PROM development Content validity Structural validity Internal consistency Cross-Cultural validity/measurement invariance Reliability Criterion validity Hypothesis testing
comprehe-nsiveness relevance comprehe-nsibility meth qual results(rating) meth qual results(rating) meth qual results(rating) meth qual results(rating) meth qual results(rating) meth qual results(rating)
City BiTS Susan Ayers [17] D D(a,b)/ +  D(a,b)/ +  Db/ +  A ? V ? NR NR NR NR NR NR NR NR
Jonathan E [40] NR NR NR NR A ? V ? NR NR NR NR NR NR A  + 
Zichen Shen [45] D Da/ +  Da/ +  Db/ +  A ? V ? NR NR I ? NR NR NR NR
Sandra Nakic´ Radoš [44] NR NR NR NR V  +  V  +  NR NR NR NR NR NR A  + 
Fadime Bayrı Bingöl [46] D Da/? Da/ +  NR V  +  V NR NR D ? NR NR NR NR
Mariana Fortunata Donadon [43] D Dab/ +  Dab/ +  Db/ +  NR NR NR NR NR NR NR NR NR NR NR NR
Flavia de Lima Osório [52] NR NR NR NR A ? V ? NR NR A  +  V  +  A  + 
Tobias Weigl [49] D NR NR Db/ +  V  +  V  +  NR NR NR NR NR NR A  + 
Rafael A.  Caparros-Gonzalez [47] D Db/ +  Db/ +  Db/ +  V  +  V  +  D  +  NR NR NR NR NR NR
Alysha-leigh Fameli [54] NR NR NR NR V  +  V  +  NR NR NR NR NR NR A  + 
Vania Sandoz [53] NR NR NR NR V  +  V  +  NR NR NR NR NR NR A  + 
Hannah Dobson [50] NR NR NR NR A  +  V  +  NR NR I ? NR NR A  + 
Xiaofei Nie [51] NR NR Da/ +  Db/ +  V  +  V  +  NR NR D ? NR NR NR NR
Greta Stén [55] D NR Db/ +  Db/ +  V  +  V  +  NR NR D NR NR A  + 
Amin Vatanparast [56] NR NR NR NR V  +  V  +  NR NR A  +  NR NR A  + 
Olga Riklikiene [59] NR NR NR NR V  +  V  +  NR NR NR NR NR NR A  + 
PPQ Francella A. Quinnell [38] NR NR NR NR NR NR NR NR NR NR NR NR NR NR A  + 
PPQ-R JL Callahan [18] NR NR NR NR A ? V ? NR NR NR NR NR NR A  + 
Di Zhang [41] D NR Db/ +  Db/ +  V V  +  NR NR D ? NR NR NR NR
Kaboni Whitney Gondwe [39] D D ab/? Dab/? Db/ +  NR NR V  +  NR NR NR NR NR NR A  + 
Antonio Hernández-Martínez [48] NR NR NR NR A ? V ? NR NR NR NR NR NR I  + 
Stephanie Alves [57] NR NR NR NR V D ? NR NR NR NR NR NR A  + 
CTI Cheryl Anderson [19] I Db/? Db/? Db/? A ? V ? NR NR NR NR NR NR A -
CTI-R Cheryl Anderson [20] D Db/? Da/? Db/ +  V  +  I ? NR NR NR NR NR NR A  + 
PCL-5 Eirini Orovou [21] NR NR NR NR V  +  V  +  NR NR I ? NR NR A  + 
Rachel Buhagiar [42] D NR NR Db/ +  NR NR V ? NR NR I ? V  +  NR NR
Isha Hemant Arora [58] NR NR NR NR NR NR V ? NR NR I  +  V  +  A  + 
PDS Pelin Dikmen-Yildiz [22] NR NR NR NR V  +  V  +  NR NR I  +  V ? A  + 
PCTAS Ke Zhang [24] D NR Db/ +  Db/ +  V  +  V  +  NR NR NR NR NR NR NR NR
IES-R Eelco Olde [25] NR NR NR NR V V NR NR NR NR NR NR V  + 
BTPS-WVD Zerong Lian [23] Da D/ +  D(a,b)/ +  NR A ? V ? NR NR D ? NR NR NR NR
MCTS Hui Wang [26] D Da/ +  Da/? Db/ +  A ? V ? NR NR D  +  NR NR NR NR
PBTAS Xiaoqin Sun [27] D Da,b/ +  Da,b/ +  Ab/ +  V V ? NR NR NR NR NR NR V  + 

PROM patient-reported outcome measure, City BiTS City Birth Trauma Scale, PPQ Perinatal Posttraumatic Stress Disorder Questionnaire, PPQ-R revised Perinatal Posttraumatic Stress Disorder Questionnaire-Revised, CTI Childbirth Trauma Index, CTI-R Childbirth Trauma Index-Revised, PCL-5 Post Traumatic Stress Disorder Checklist for DSM-5, PDS Posttraumatic Diagnostic Scale, BTPS-WVD Birth Trauma Perception Scale for Women During Vaginal Delivery, IES-R Impact of Event Scale-Revised, PCTAS Psychological Childbirth Trauma Assessment Scale, MCTS Maternal Childbirth Trauma Scale, PBTAS Psychological Birth Trauma Assessment Scale

a: ask experts; b: Ask the Patient; meth qual, methodological quality; Methodological quality is rated as: V very good; A adequate; D doubtful, I inadequate. Results of PROMs are rated as follows: + , sufficient; −, insufficient, indeterminate; ± , inconsistent. NA Not Applicable, NR not reported

Development of PROMs

All scales except for the CTI, PCTAS, and PBTAS were developed with a clear theoretical foundation. The City BiTS, PDS, and PCL-5 were derived from the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5), published by the American Psychiatric Association [60]. The development of the PPQ and PPQ-R was based on the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV), also published by the American Psychiatric Association [61]. The BTPS-WVD was developed based on the cognitive-phenomenology-transactional model [62]. The MCTS was based on the theoretical foundation of psychological stress theory [63]. However, the items of the CTI [19] and CTI-R [20] were formulated through a literature review, while the PCTAS [24] and PBTAS [27] were developed through a combination of a literature review, semi-structured interviews with parturients, and a Delphi expert consultation.

Content validity

Sixteen studies in total reported on content validity. One study [46] assessed content validity through expert consultation, while nine studies [17, 20, 23, 26, 27, 39, 43, 45, 51] sought input from both experts and patients. However, none of these studies provided detailed descriptions of their statistical analysis methods, resulting in a "D" rating for methodological quality. Content validity was rated as " + " in five studies [17, 27, 43, 45, 47], while the remaining PROMs received a "?" rating.

Structural validity

Nineteen studies [2022, 24, 25, 27, 41, 44, 46, 47, 4951, 5357, 59] conducted confirmatory factor analysis (CFA) and were rated "V" for methodological quality. Of these, three studies [25, 41, 57] received a "−" rating due to a model fit index below 0.95, while the remaining sixteen studies[2022, 24, 27, 44, 46, 47, 4951, 5356, 59] met COSMIN construct validity requirements and were rated " + ". Additionally, nine studies [1719, 23, 26, 40, 45, 48, 52] used exploratory factor analysis (EFA) and were rated "A" for methodological quality, though their measurement properties were rated "?".

Internal consistency

All thirty-one studies [1727, 3942, 4459] reported internal consistency validity. Most studies calculated the Cronbach’ α coefficient, and their methodological quality was rated as "V". However, one study [20] received an "I" rating for not calculating the Cronbach’s α coefficient. Another study [57] only reported inter-item correlations and received a "D" rating. Among the measurement properties of these studies, fifteen[21, 22, 24, 39, 41, 44, 47, 4951, 5356, 59] were rated as " + ", meeting the COSMIN updated criteria which require at least low evidence for " + " structural validity, and Cronbach’s α coefficient ≥ 0.70 for each unidimensional scale or subscale. In contrast, two studies [25, 46] were rated "−", due to a Cronbach’s α < 0.70 despite meeting the COSMIN structural validity criteria. The remaining fourteen studies [1720, 23, 26, 27, 40, 42, 45, 48, 52, 57, 58] were rated "?" for not meeting COSMIN's criteria for at least low evidence of structural validity.

Cross-cultural validity/measurement invariance

Of the studies included in this systematic review, only one [47] conducted a cross-cultural validity test and performed differential item functioning (DIF) analysis. This study used the City BiTS to measure PBT and examined variables including maternal age, gestational age, delivery type, parity, pain relief, belief about serious injury or death, symptom duration, and impact on normal activities. The DIF analysis showed no significant differences across these variables. However, the study did not explicitly state whether other characteristics were comparable beyond the grouping variables.

Consequently, the methodological quality of the study was rated "D", while the measurement properties received a " + " rating.

Reliability

Fourteen studies [2123, 26, 41, 42, 45, 46, 5052, 55, 56, 58] assessed reliability. Six studies [23, 26, 41, 46, 51, 55] were rated "D" due to insufficient details about whether the constructs remained stable during the retest period or if conditions were consistent. One study [45] used a telephone follow-up for the retest, which differed from the initial face-to-face data collection. As a result, the study’s methodological quality was rated "I". Another six studies [21, 22, 42, 45, 50, 58] were rated "I" due to inappropriate retest intervals or failure to calculate the intraclass correlation coefficient (ICC) or correlation coefficient. In contrast, one study[52] used a random number table for selecting retest subjects, and another[56] specified a 2 week retest interval with consistent conditions, both receiving an "A" rating. Four studies [22, 52, 56, 58] reported ICC or Kappa values exceeding 0.7, earning a " + " rating for measurement attributes. However, one study [55] had an ICC below 0.7, earning a "−" rating. The remaining studies [21, 23, 26, 41, 42, 45, 46, 50, 51] did not calculate ICCs or relied only on Pearson correlation coefficients, leading to a "?" rating for measurement attributes.

Criterion validity

Criterion validity was assessed in four studies [22, 42, 52, 58]. Three studies [42, 52, 58] reported correlations ≥ 0.70 with the gold standard or AUC values ≥ 0.70, earning a "V" rating for methodological quality and a " + " rating for measurement properties. Conversely, one study [22] reported sensitivity, specificity, and a Kappa value of 0.64 between the PDS and the Structured Clinical Interview for DSM-5 Disorders-Clinical Version (SCID-5-CV) [64], but did not report correlation or AUC values. Therefore, while its methodological quality was rated as "V", its measurement property was rated as "?".

Hypotheses testing for construct validity

Twenty-two studies [1822, 25, 27, 3840, 43, 44, 4850, 5359] assessed convergent validity through comparisons with other widely used measurement tools and examined discriminative or known-groups validity via subgroup comparisons. Twenty-two studies[1822, 25, 27, 3840, 43, 44, 4850, 5359] assessed convergent validity through comparisons with other widely used measurement tools and examined discriminative or known-groups validity via subgroup comparisons. One study [48] was rated "I" due to an unclear structure of the control measurement tool and missing information on its measurement attributes. With the exception of one study[19], which did not align with the hypotheses testing and received a "-" rating for measurement properties, all other studies supported hypotheses testing and were rated as " + ".

The level of evidence evaluation and recommendations for the included measurement tools

In this study, the quality of the measurement attributes for each assessment tool was synthesized, and evidence quality was evaluated. Recommendations were made based on the risk of bias, inconsistency, imprecision, and indirectness of the included studies. Most PROMs were given a Class B recommendation, reflecting their measurement attribute ratings and the overall quality of evidence. In contrast, the CTI, IES-R, and PBTAS, which had high-quality evidence for insufficient measurement properties, received a Class C recommendation. The reasons for downgrading are provided in Table 3.

Table 3.

Evaluation of evidence level of the included scales and the formation of recommendations (n = 12)

PROM Content validity Structural validity Internal consistency Cross-Cultural validity/measurement invariance Reliability Criterion validity Hypothesis testing Class
Overall rating Quality of evidence Overall rating Quality of evidence Overall rating Quality of evidence Overall rating Quality of evidence Overall rating Quality of evidence Overall rating Quality of evidence Overall rating Quality of evidence
City BiTS  +  Ma  ±  NA  ±  NA  +  Mb  ±  NA  +  H  +  H B
PPQ NR NR NR NR NR NR NR NR NR NR NR NR  +  H B
PPQ-R ? VLc  ±  NA  ±  NA NR NR ? Md  +  H  +  H B
CTI ? Vle ? H ? H NR NR NR NR NR NR H C
CTI-R ? Mf  +  H ? Lg NR NR NR NR NR NR  +  H B
PCL-5 ? Lh  +  H  ±  Mi NR NR  +  VLj  +  Lk  +  Ml B
PDS NR NR  +  H  +  H NR NR  +  Lm ? H  +  H B
PCTAS ? Mn  +  H  +  H NR NR NR NR NR NR NR NR B
IES-R NR NR H H NR NR NR NR NR NR  +  H C
BTPS-WVD ? Mo ? H ? H NR NR ? Mp NR NR NR NR B
MCTS  +  Mq ? H ? H NR NR  +  Mr NR NR NR NR B
PBTAS  +  Ms H ? H NR NR NR NR NR NR  +  H C

PROM patient-reported outcome measure, City BiTS City Birth Trauma Scale, PPQ Perinatal Posttraumatic Stress Disorder Questionnaire, PPQ-R Perinatal Posttraumatic Stress Disorder Questionnaire-Revised, CTI Childbirth Trauma Index, CTI-R Childbirth Trauma Index-Revised, PCL-5 Post Traumatic Stress Disorder Checklist for DSM-5, PDS Posttraumatic Diagnostic Scale, BTPS-WVD Birth Trauma Perception Scale for Women During Vaginal Delivery, IES-R Impact of Event Scale—Revised, PCTAS Psychological Childbirth Trauma Assessment Scale, MCTS Maternal Childbirth Trauma Scale, PBTAS Psychological Birth Trauma Assessment Scale. Overall rating: + , sufficient; −, insufficient, indeterminate; ± , inconsistent. Quality of evidence is rated as H high, M moderate, L low; VL very low. NA not applicable, NR not reported

GRADE assessments: aEvidence downgraded − 1 due to Serious Risk of Bias (multiple studies of doubtful quality); bEvidence downgraded − 1 due to Serious Risk of Bias (multiple studies of doubtful quality or 1 study of adequate quality); cEvidence downgraded − 1 due to Serious Risk of Bias (multiple studies of doubtful quality) and − 2 due to Imprecision (sample size of one study < 50); dEvidence downgraded − 1 due to Serious Risk of Bias (multiple studies of doubtful quality); eEvidence downgraded − 3 due to extremely Serious Risk of Bias (PROM development study inadequate quality); fEvidence downgraded − 1 due to Serious Risk of Bias (only one study included of doubtful quality); gEvidence downgraded − 2 due to very Serious Risk of Bias (only one study included of inadequate quality); hEvidence downgraded − 1 due to Serious Risk of Bias(multiple studies of doubtful quality) and − 1 due to Serious Indirectness (the population of one study included 175 mothers, but only 7 (4%) were new mothers and the rest were multiparas in the third trimester); iEvidence downgraded − 1 due to Serious Indirectness (the population of one study included 175 mothers, but only 7 (4%) were new mothers and the rest were multiparas in the third trimester); jEvidence downgraded-2 due to very Serious Risk of Bias (multiple studies of inadequate quality) and − 1 due to Serious Indirectness (the population of one study included 175 mothers, but only 7 (4%) were new mothers and the rest were multiparas in the third trimester); kEvidence downgraded − 1 due to Serious Risk of Bias(one study of adequate quality) and − 1 due to Serious Indirectness (the population of one study included 175 mothers, but only 7 (4%) were new mothers and the rest were multiparas in the third trimester); lEvidence downgraded -1 due to Serious Indirectness (the population of one study included 175 mothers, but only 7 (4%) were new mothers and the rest were multiparas in the third trimester); mEvidence downgraded − 2 due to very Serious Risk of Bias (only one study included of inadequate quality); nEvidence downgraded − 1 due to Serious Risk of Bias (one study of doubtful quality); oEvidence downgraded −  due to Serious Risk of Bias (one study of doubtful quality); pEvidence downgraded − 1 due to Serious Risk of Bias (one study of doubtful quality); qEvidence downgraded − 1 due to Serious Risk of Bias (one study of doubtful quality); rEvidence downgraded − 1 due to Serious Risk of Bias (one study of doubtful quality); sEvidence downgraded − 1 due to Serious Risk of Bias (one study of doubtful quality);

Discussion

Principal findings

This systematic review analyzed thirty-three studies on the psychometric properties of twelve tools for assessing maternal PBT. None fully adhered to COSMIN guidelines, with gaps in the assessment of measurement error and responsiveness. Although no PROMs received Grade A recommendations, the City BiTS emerged as the most evaluated and comprehensive tool, with moderate-quality evidence supporting its content validity, making it a provisional recommendation. Eight other Class B tools showed potential but require further research to comprehensively evaluate their measurement properties. The PCL and PDS, initially developed for non-postpartum populations, present limitations in specificity. The PPQ and PPQ-R also require further research due to uncertain content validity. The CTI-R, BTPS-WVD, PCTAS, and MCTS were each tested in only one study, with the latter three validated solely in Chinese populations, highlighting the need for further research on global applicability. The CTI, IES-R, and PBTAS received Class C recommendations based on high-quality evidence of insufficient measurement properties. Overall, the City BiTS stands out as the most robust and reliable tool for clinicians to assess maternal PBT.

Psychometric properties

Although several scales exist for the explicit measurement of PBT, most PROMs included in this review lack comprehensive evidence regarding a range of psychometric properties. Content validity is widely considered the most important property of a scale, as it assesses how well a PROM reflects the construct it is designed to measure [65]. In this systematic review, most studies focused on expert consultation for correlation and evaluation, often neglecting patient perspectives. As Selby and Velikova [66] highlight, public involvement is essential to both the design and implementation of PROMs. Patient perspectives are crucial to understanding the impact of childbirth on postpartum mental health. Future studies should integrate rigorous qualitative interviews with quantitative surveys to more effectively incorporate both patient and expert input, thereby enhancing the content validity of PROMs.

The most commonly reported psychometric property was internal consistency. According to COSMIN guidelines, assessing internal consistency requires at least low evidence of "sufficient" structural validity, which can be derived from various studies [30]. One-third of the included studies primarily utilized exploratory factor analysis (EFA) rather than confirmatory factor analysis (CFA) to evaluate structural validity. The limited use of CFA resulted in inadequate structural validity, which consequently affected the assessment of internal consistency. Reliability reflects the stability of the PROM over time, it was assessed in nearly half of the studies included in this review. However, none of these studies provided a rationale for the chosen test–retest interval, introducing potential bias in the reliability validity. The COSMIN guidelines suggest a retest interval of 2 weeks, as shorter or longer intervals may lead to either an overestimation or underestimation of reliability [67]. Furthermore, when reporting quantitative reliability outcomes, it is recommended to present intraclass correlation coefficients (ICCs) instead of correlation coefficients.

A PROM developed in a specific context may not be suitable for application in a different setting. Therefore, the same PROM should be used when directly comparing populations from diverse regions and linguistic backgrounds. In this review, cross-cultural validity was evaluated using differential item functioning (DIF) based on item response theory (IRT) [68] in only one study[47], while other studies relied solely on simple translations of the PROMs, neglecting the critical process of cross-cultural adaptation. As a result, the measurement equivalence of these scales across varying cultures, regions, and socioeconomic groups remains uncertain.

Criterion validity, which refers to the extent to which a scale score reflects the gold standard, was assessed using various structured clinical interviews [69]. Two studies [22, 52] used the Structured Clinical Interview for DSM-5 Disorders-Clinical Version (SCID-5-CV) [64] as the gold standard, while two others [42, 58] employed the Clinician-Administered PTSD Scale (CAPS) [70]. In some studies, widely used scales were incorrectly treated as the gold standard, leading to confusion with hypotheses testing for construct validity. According to COSMIN guidelines [30], comparisons between newly developed scales and existing ones should follow the "hypotheses testing for construct validity" procedure. Therefore, to properly assess the criterion validity of a PROM, clinical interviews should be used instead of other commonly used scales.

None of the twelve assessment tools included in the review reported measurement error or responsiveness. None of the twelve assessment tools included in the review reported measurement error or responsiveness. Responsiveness, which reflects a scale's sensitivity to change, is essential for evaluating the effectiveness of clinical interventions aimed at improving health outcomes [71]. Therefore, future research should adopt longitudinal or experimental study designs to comprehensively assess PROM responsiveness.

Implications for clinical practice and future directions

PBT has significant negative effects on women’s mental health, maternal role adjustment, subsequent reproductive experiences, and healthcare utilization. Early identification and intervention are therefore essential. PROMs can improve outcome monitoring and provide valuable supplementary information alongside clinical data [72]. Additionally, PROMs have the potential to enhance care and health outcomes at individual, institutional, and population levels [73]. In light of this, this review adheres to COSMIN guidelines and provisionally recommends the City BiTS scale for use. Comprising 29 items across two dimensions, the City BiTS is noted for its short completion time, clinical practicality, and ease of use, which help reduce survey fatigue. Furthermore, the City BiTS demonstrates strong psychometric properties across 15 language versions, making it well-suited for global epidemiological studies on postpartum PBT.

The growing use of electronic questionnaires in medicine, driven by technological advancements, has streamlined data collection, reducing both labor and time costs. Electronic Patient-Reported Outcome Measures (ePROMs) minimize errors and facilitate complex survey management with user-friendly statistical displays. Future research should prioritize evaluating the equivalence between electronic and traditional paper questionnaires [74]. In clinical settings, integrating electronic questionnaires with electronic health records or developing dedicated applications can improve the routine assessment of postpartum trauma (PBT), providing timely and accessible information for healthcare providers and enabling prompt diagnosis and treatment.

Strengths and limitations

This systematic review is the first to apply the PRISMA-COSMIN reporting guidelines for Outcome Measurement Instruments [31] and the updated COSMIN guidelines to evaluate the methodological quality and psychometric properties of postpartum trauma (PBT) instruments in postpartum women. We employed a comprehensive search strategy, exploring eight Chinese and English databases and reviewing references to ensure a thorough identification of relevant studies. Quality assessment was conducted independently by two researchers trained in evidence-based methodology, with a third reviewer from the Chinese evidence-based field resolving any disagreements. This approach provides reliable and transparent evidence-based guidance for selecting measures and identifies areas for further research. However, several limitations exist. Language restrictions excluded studies on instruments developed or validated in non-English and non-Chinese languages. Additionally, full texts [75, 76] for two studies were inaccessible, and some PROMs lacked comprehensive reporting on measurement properties, which may affect the results of this review. Finally, to address the subjective nature of methodological quality and evidence grading, we involved at least two researchers in each review step to minimize variance and maintain objectivity and reliability.

Conclusion

This systematic review identified twelve patient-reported measurement instruments for assessing PBT. None of these tools received a Class A recommendation according to the COSMIN guidelines. Among the nine tools that received a Class B recommendation, the City BiTS is provisionally recommended for use and is considered credible in assessing maternal PBT. However, the methodological quality and reporting of these instruments varied across studies. Consequently, we encourage future researchers to conduct more comprehensive validations of the psychometric properties of existing PROMs or to develop new, higher-quality tools for more scientifically reliable assessments of PBT in postpartum women. Overall, these findings may provide valuable guidance to healthcare providers and researchers in selecting high-quality PBT PROMs for their work.

Supplementary Information

Below is the link to the electronic supplementary material.

Author contributions

Pingping Chen, Chao Zhang, and Longti Li designed the research protocol. The literature search and eligibility screening were finished by Xiaoyan Shi and Menghe Wang. Guangjian Liu and Hongxia Zuo contributed to literature screening, data extraction. Pingping Chen and Guangjian Liu as well as Chao Zhang conducted quality assessment, evidence synthesis, and grade recommendation. Pingping Chen wrote the manuscript. Longti Li critically reviewed the final drafting of the paper. The Corresponding Author (Longti Li) had full access to all the data in the study and had final responsibility for the decision to submit for publication. All authors approved the final version of the manuscript.

Funding

This research is funded by support from Innovative Research Program for Graduates of Hubei University Medicine (No. YC2023063) and the Key Program of Department of Education of Hubei Province (No. 22D092).

Data availability

The data that support the findings of this review are available on request from the corresponding author.

Declarations

Conflict of interest

No conflict of interest is declared in this article.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Shorey, S., & Wong, P. Z. E. (2022). Traumatic childbirth experiences of new parents: A meta-synthesis. Trauma, Violence & Abuse,23(3), 748–763. 10.1177/1524838020977161 [DOI] [PubMed] [Google Scholar]
  • 2.Ayers, S. (2017). Birth trauma and post-traumatic stress disorder: The importance of risk and resilience. Journal of Reproductive and Infant Psychology,35(5), 427–430. 10.1080/02646838.2017.1386874 [DOI] [PubMed] [Google Scholar]
  • 3.Leinweber, J., Fontein-Kuipers, Y., Thomson, G., Karlsdottir, S. I., Nilsson, C., Ekström-Bergström, A., Olza, I., Hadjigeorgiou, E., & Stramrood, C. (2022). Developing a woman-centered, inclusive definition of traumatic childbirth experiences: A discussion paper. Birth,49(4), 687–696. 10.1111/birt.12634 [DOI] [PubMed] [Google Scholar]
  • 4.Sun, X., Fan, X., Cong, S., Wang, R., Sha, L., Xie, H., Han, J., Zhu, Z., & Zhang, A. (2022). Psychological birth trauma: A concept analysis. Frontiers in Psychology,13, 1065612. 10.3389/fpsyg.2022.1065612 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Pidd, D., Newton, M., Wilson, I., & East, C. (2023). Optimising maternity care for a subsequent pregnancy after a psychologically traumatic birth: A scoping review. Women and Birth,36(5), e471–e480. 10.1016/j.wombi.2023.03.006 [DOI] [PubMed] [Google Scholar]
  • 6.Holopainen, A., Stramrood, D. C., Pampus, D. M. G. V., Hollander, D. M., & Schuengel, D. C. (2020). Subsequent childbirth after previous traumatic birth experience: Women’s choices and evaluations. British Journal of Midwfiery,28(8), 488–496. 10.12968/bjom.2020.28.8.488 [Google Scholar]
  • 7.Beck, C. T. (2015). Middle range theory of traumatic childbirth: The ever-widening ripple effect. Global Qualitative Nursing Research,2, 2333393615575313. 10.1177/2333393615575313 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Beck, C. T., & Watson, S. (2019). Mothers’ experiences interacting with infants after traumatic childbirth. MCN: The American Journal of Maternal-Child Nursing,44(6), 338–344. 10.1097/NMC.0000000000000565 [DOI] [PubMed] [Google Scholar]
  • 9.Beck, C. T. (2009). An adult survivor of child sexual abuse and her breastfeeding experience: A case study. MCN: The American Journal of Maternal-Child Nursing,34(2), 91–97. 10.1097/01.NMC.0000347302.85455.c8 [DOI] [PubMed] [Google Scholar]
  • 10.Delicate, A., & Ayers, S. (2023). The impact of birth trauma on the couple relationship and related support requirements: A framework analysis of parents’ perspectives. Midwifery,123, 103732. 10.1016/j.midw.2023.103732 [DOI] [PubMed] [Google Scholar]
  • 11.Daniels, E., Arden-Close, E., & Mayers, A. (2020). Be quiet and man up: A qualitative questionnaire study into fathers who witnessed their Partner’s birth trauma. BMC Pregnancy and Childbirth,20(1), 236. 10.1186/s12884-020-02902-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Turkstra, E., Creedy, D. K., Fenwick, J., Buist, A., Scuffham, P. A., & Gamble, J. (2015). Health services utilization of women following a traumatic birth. Archives of Women’s Mental Health,18(6), 829–832. 10.1007/s00737-014-0495-7 [DOI] [PubMed] [Google Scholar]
  • 13.Lai, X., Chen, J., Li, H., Zhou, L., Huang, Q., Liao, Y., Krewski, D., Wen, S. W., Zhang, L., & Xie, R. H. (2023). The incidence of post-traumatic stress disorder following traumatic childbirth: A systematic review and meta-analysis. International Journal of Gynecology & Obstetrics,162(1), 211–221. 10.1002/ijgo.14643 [DOI] [PubMed] [Google Scholar]
  • 14.Yildiz, P. D., Ayers, S., & Phillips, L. (2017). The prevalence of posttraumatic stress disorder in pregnancy and after birth: A systematic review and meta-analysis. Journal of Affective Disorders,208, 634–645. 10.1016/j.jad.2016.10.009 [DOI] [PubMed] [Google Scholar]
  • 15.DeSalvo, K. B., Jones, T. M., Peabody, J., McDonald, J., Fihn, S., Fan, V., He, J., & Muntner, P. (2009). Health care expenditure prediction with a single item, self-rated health measure. Medical Care,47(4), 440–447. 10.1097/MLR.0b013e318190b716 [DOI] [PubMed] [Google Scholar]
  • 16.Etkind, S. N., Daveson, B. A., Kwok, W., Witt, J., Bausewein, C., Higginson, I. J., & Murtagh, F. E. (2015). Capture, transfer, and feedback of patient-centered outcomes data in palliative care populations: Does it make a difference? A systematic review. Journal of Pain and Symptom Management,49(3), 611–624. 10.1016/j.jpainsymman.2014.07.010 [DOI] [PubMed] [Google Scholar]
  • 17.Ayers, S., Wright, D. B., & Thornton, A. (2018). Development of a measure of postpartum PTSD: The city birth trauma scale. Frontiers in Psychiatry. 10.3389/fpsyt.2018.00409 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Callahan, J. L., Borja, S. E., & Hynan, M. T. (2006). Modification of the perinatal PTSD questionnaire to enhance clinical utility. Journal of Perinatology,26(9), 533–539. 10.1038/sj.jp.7211562 [DOI] [PubMed] [Google Scholar]
  • 19.Anderson, C. (2011). Construct validity of the childbirth trauma index for adolescents. The Journal of perinatal education,20(2), 78–90. 10.1891/1058-1243.20.2.78 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Anderson, C. A., & Connolly, J. P. (2020). Measuring adolescent birth trauma: Development, adaptation, and evaluation of the psychometric properties for the childbirth trauma index. Journal of Nursing Measurement,28(3), 489–502. 10.1891/jnm-d-19-00012 [DOI] [PubMed] [Google Scholar]
  • 21.Orovou, E., Theodoropoulou, I. M., & Antoniou, E. (2021). Psychometric properties of the post traumatic stress disorder checklist for DSM-5 (PCL-5) in Greek women after cesarean section. PLOS ONE. 10.1371/journal.pone.0255689 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Dikmen-Yildiz, P., Ayers, S., & Phillips, L. (2017). Screening for birth-related PTSD: Psychometric properties of the Turkish version of the posttraumatic diagnostic scale in postpartum women in Turkey. European Journal of Psychotraumatology,8(1), 1306414–1306414. 10.1080/20008198.2017.1306414 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lian Zerong, Y. Z., Yuqing, C., Qingwen, G., Lihua, W., & Lilan, Z. (2023). Development and test the psychometric properties evaluation of the birth trauma perception scale for women during vaginal delivery. Chinese Nursing Management,23(01), 42–46. 10.3969/j.issn.1672-1756.2023.01.009 [Google Scholar]
  • 24.Zhang, K., Wu, M., Zeng, T., Yuan, M., Chen, Y., & Yang, L. (2023). Development and psychometric testing of a scale for assessing the psychological birth trauma. Frontiers in psychology,14, 1071336–1071336. 10.3389/fpsyg.2023.1071336 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Olde, E., Kleber, R. J., van der Hart, O., & Pop, V. J. M. (2006). Childbirth and posttraumatic stress responses: A validation study of the dutch impact of event scale-revised. European Journal of Psychological Assessment,22(4), 259–267. 10.1027/1015-5759.22.4.259 [Google Scholar]
  • 26.Hui, W. (2023). Development and psychometric test of the maternal childbirth trauma scale unpublished master’s thesis. Anhui Medical University. [Google Scholar]
  • 27.Xiaoqing, S., Aixia, Z., Zhu, Z., Xuemei, F., Shijuan, M., Xinxin, H., Shengnan, C., & Hongyan, X. (2024). Development and reliability and validity test of the psychological birth trauma assessment scale. Chinese Journal of Nursing,59(09), 1099–1105. 10.3761/j.issn.0254-1769.2024.09.011 [Google Scholar]
  • 28.Zhu, F. X. P. C. Z. A. Z. C. Z. (2022). Research progress of maternal psychological birth trauma. Chinese Journal of Nursing,57(10), 1256–1262. 10.3761/j.issn.0254-1769.2022.10.016 [Google Scholar]
  • 29.Williams, M. E., Strobino, D. M., & Holliday, C. N. (2023). Measuring post-traumatic stress after childbirth: A review and critical appraisal of instruments. Journal of Reproductive and Infant Psychology,41(5), 599–613. 10.1080/02646838.2022.2030052 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Prinsen, C. A. C., Mokkink, L. B., Bouter, L. M., Alonso, J., Patrick, D. L., de Vet, H. C. W., & Terwee, C. B. (2018). COSMIN guideline for systematic reviews of patient-reported outcome measures. Quality of Life Research,27(5), 1147–1157. 10.1007/s11136-018-1798-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Elsman, E. B. M., Mokkink, L. B., Terwee, C. B., Beaton, D., Gagnier, J. J., Tricco, A. C., Baba, A., Butcher, N. J., Smith, M., Hofstetter, C., Aiyegbusi, O. L., Berardi, A., Farmer, J., Haywood, K. L., Krause, K. R., Markham, S., Mayo-Wilson, E., Mehdipour, A., Ricketts, J., … Offringa, M. (2024). Guideline for reporting systematic reviews of outcome measurement instruments (OMIs): PRISMA-COSMIN for OMIs 2024. Quality of Life Research,33(8), 2029–2046. 10.1007/s11136-024-03634-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Terwee, C. B., Jansma, E. P., Riphagen, I. I., & de Vet, H. C. (2009). Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Quality of Life Research,18(8), 1115–1123. 10.1007/s11136-009-9528-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.COSMIN tools. (2018). COSMIN risk of bias checklist. (2018). Retrieved September 14, 2024, from https://www.cosmin.nl/wp-content/uploads/COSMIN-RoB-checklist-V2-0-v17_rev3.pdf
  • 34.COSMIN tools. (2021). COSMIN Risk of Bias tool to assess the quality of studies on reliability and measurement error of outcome measurement instrument. Retrieved March 10, 2024, from https://www.cosmin.nl/wp-content/uploads/user-manual-COSMIN-Risk-of-Bias-tool_v4_JAN_final.pdf [DOI] [PMC free article] [PubMed]
  • 35.Mokkink, L. B., Prinsen, C., Patrick, D. L., Alonso, J., Bouter, L., De Vet, H. C., Terwee, C. B. (2018). COSMIN methodology for systematic reviews of Patient‐Reported Outcome Measures (PROMs) user manual. Retrieved August 1, 2023, from https://cosmin.nl/wp-content/uploads/COSMIN-syst-review-for-PROMs-manual_version-1_feb-2018.pdf
  • 36.Terwee, C. B., Prinsen, C. A. C., Chiarotto, A., Westerman, M. J., Patrick, D. L., Alonso, J., Bouter, L. M., de Vet, H. C. W., & Mokkink, L. B. (2018). COSMIN methodology for evaluating the content validity of patient-reported outcome measures: A Delphi study. Quality of Life Research,27(5), 1159–1170. 10.1007/s11136-018-1829-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.COSMIN tools. (2018). COSMIN methodology for assessing the content validity of PROMs. Retrieved July 30, 2023, from https://cosmin.nl/wp-content/uploads/COSMIN-methodology-for-content-validity-user-manual-v1.pdf
  • 38.Quinnell, F. A., & Hynan, M. T. (1999). Convergent and discriminant validity of the perinatal PTSD questionnaire (PPQ): A preliminary study. Journal of Trauma & Stress,12(1), 193–199. 10.1023/a:1024714903950 [DOI] [PubMed] [Google Scholar]
  • 39.Gondwe, K. W., Yang, Q., Brandon, D., Chirwa, E., & Holditch-Davis, D. (2018). Validation of the chichewa perinatal PTSD questionnaire and chichewa child health worry scale. International Journal of Africa Nursing Sciences,9, 42–48. 10.1016/j.ijans.2018.07.003 [Google Scholar]
  • 40.Handelzalts, J. E., Hairston, I. S., & Matatyahu, A. (2018). Construct validity and psychometric properties of the hebrew version of the city birth trauma scale. Frontiers in Psychology,9, 1726–1726. 10.3389/fpsyg.2018.01726 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Zhang, D., Zhang, J., Gan, Q., Wang, Q., Fan, N., Zhang, R., & Song, Y. (2018). Validating the psychometric characteristics of the perinatal posttraumatic stress disorder questionnaire (PPQ) in a Chinese context. Archives of Psychiatric Nursing,32(1), 57–61. 10.1016/j.apnu.2017.09.016 [DOI] [PubMed] [Google Scholar]
  • 42.Buhagiar, R., Dimech, C., & Felice, E. (2019). The validation of the post-traumatic checklist for DSM-V (PCL-5) in the maltese perinatal population. Journal of Reproductive and Infant Psychology,37(5), 6. [Google Scholar]
  • 43.Donadon, M. F., Darwin, A. C. R., Bombonatti, E. A., Pereira-Lima, K., Santos, R. G., Machado-de-Sousa, J. P., Apolinario da Silva, T. D., Poli Neto, O. B., Gaspardo, C. M., Cantilino, A., de Mattos Souza, L. D., Ayres, S., & Osorio, F. L. (2020). Cross-cultural adaptation of the city birth trauma scale for the Brazilian context. Archives of Clinical Psychiatry,47(4), 110–118. 10.1590/0101-60830000000243 [Google Scholar]
  • 44.Nakić Radoš, S., Matijaš, M., Kuhar, L., Anđelinović, M., & Ayers, S. (2020). Measuring and conceptualizing PTSD following childbirth: Validation of the city birth trauma scale. Psychological Trauma: Theory, Research, Practice, and Policy,12(2), 147–155. 10.1037/tra0000501 [DOI] [PubMed] [Google Scholar]
  • 45.Zichen, S., Yang, Z., Dongmei, Z., Jun, Z., Jiajie, W., & Weiping, C. (2020). Translation of the Chinese version of the city birth trauma scale and the test of its reliability and validity. Chinese Journal of Nursing,55(11), 1757–1761. 10.3761/j.issn.0254-1769.2020.11.028 [Google Scholar]
  • 46.Bingol, F. B., Bal, M. D., Dissiz, M., Sormagec, M. T., & Yildiz, P. D. (2021). Validity and reliability of the Turkish version of the city birth trauma scale (CityBiTS). Journal of Obstetrics and Gynaecology,41(7), 1023–1031. 10.1080/01443615.2020.1821354 [DOI] [PubMed] [Google Scholar]
  • 47.Caparros-Gonzalez, R. A., Romero-Gonzalez, B., Peralta-Ramirez, M. I., Ayers, S., Galán-Paredes, A., & Caracuel-Romero, A. (2021). Assessment of posttraumatic stress disorder among women after childbirth using the city birth trauma scale in Spain. Psychological Trauma: Theory, Research, Practice, and Policy,13(5), 545–554. 10.1037/tra0001007 [DOI] [PubMed] [Google Scholar]
  • 48.Hernandez-Martinez, A., Martinez-Vazquez, S., Rodriguez-Almagro, J., Saeed Khan, K., Delgado-Rodriguez, M., & Miguel Martinez-Galiano, J. (2021). Validation of perinatal post-traumatic stress disorder questionnaire for Spanish women during the postpartum period. Scientific Reports. 10.1038/s41598-021-85144-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Weigl, T., Beck-Hiestermann, F. M. L., Stenzel, N. M., Benson, S., Schedlowski, M., & Garthus-Niegel, S. (2021). Assessment of childbirth-related PTSD: psychometric properties of the German version of the city birth trauma scale. Frontiers in Psychiatry. 10.3389/fpsyt.2021.731537 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Dobson, H., Malpas, C., & Kulkarni, J. (2022). Measuring posttraumatic stress disorder following childbirth. Australasian Psychiatry,30(4), 476–480. 10.1177/10398562221077900 [DOI] [PubMed] [Google Scholar]
  • 51.Nie, X.-F., Cheng, L., Huang, F., Zhang, T.-T., Yao, Y.-D., Tan, J., & Ke, Y.-Y. (2022). Construct validity and psychometric properties of the Chinese version of the city birth trauma scale. Journal of Obstetrics and Gynaecology Research,48(11), 2946–2955. 10.1111/jog.15391 [DOI] [PubMed] [Google Scholar]
  • 52.Osório, F. D. L., Rossini Darwin, A. C., Bombonetti, E. A., & Ayers, S. (2022). Posttraumatic stress following childbirth: Psychometric properties of the Brazilian version of the city birth trauma scale. Journal of Psychosomatic Obstetrics and Gynecology,43(3), 374–383. 10.1080/0167482X.2021.1977278 [DOI] [PubMed] [Google Scholar]
  • 53.Sandoz, V., Hingray, C., Stuijfzand, S., Lacroix, A., El Hage, W., & Horsch, A. (2022). Measurement and conceptualization of maternal PTSD following childbirth: Psychometric properties of the city birth trauma scale—French Version (City BiTS-F). Psychological Trauma: Theory, Research, Practice, and Policy,14(4), 696–704. 10.1037/tra0001068 [DOI] [PubMed] [Google Scholar]
  • 54.Fameli, A. L., Costa, D. S. J., Coddington, R., & Hawes, D. J. (2023). Assessment of childbirth-related post traumatic stress disorder in Australian mothers: Psychometric properties of the city birth trauma scale. Journal of Affective Disorders,324, 559–565. 10.1016/j.jad.2022.12.123 [DOI] [PubMed] [Google Scholar]
  • 55.Stén, G., Ayers, S., Malmquist, A., Nieminen, K., & Grundström, H. (2023). Assessment of maternal posttraumatic stress disorder following childbirth: Psychometric properties of the Swedish version of city birth trauma scale. Psychological Trauma: Theory, Research, Practice, and Policy,15(7), 1153–1163. 10.1037/tra0001465 [DOI] [PubMed] [Google Scholar]
  • 56.Vatanparast, A., Kamrani, A., Shakiba, S., Amouchie, R., Akbari, E., & Ayers, S. (2023). The latent factor structure and assessment of childbirth-related PTSD: Psychometric characteristics of the city birth trauma scale-Persian version (City-BiTS-P). Frontiers in Psychiatry. 10.3389/fpsyt.2023.1204392 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Alves, S., Claro, M., & Beato, A. (2024). Psychometric properties of the European Portuguese version of the modified perinatal PTSD questionnaire (PPQ-II). Current Psychology,43(13), 11865–11874. 10.1007/s12144-023-05277-7 [Google Scholar]
  • 58.Arora, I. H., Woscoboinik, G. G., Mokhtar, S., Quagliarini, B., Bartal, A., Jagodnik, K. M., Barry, R. L., Edlow, A. G., Orr, S. P., & Dekel, S. (2024). A diagnostic questionnaire for childbirth related posttraumatic stress disorder: A validation study. American Journal of Obstetrics and Gynecology,231(1), 134.e131-134.e113. 10.1016/j.ajog.2023.11.1229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Riklikiene, O., Jarasiunaite-Fedosejeva, G., Sakalauskiene, E., Luneckaite, Z., & Ayers, S. (2024). Translation and validation of the city birth trauma scale with Lithuanian postpartum women: Findings and initial results. Evaluation & the Health Professions. 10.1177/01632787241239339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Weathers, F. W. (2017). Redefining posttraumatic stress disorder for DSM-5. Current Opinion in Psychology,14, 122–126. 10.1016/j.copsyc.2017.01.002 [DOI] [PubMed] [Google Scholar]
  • 61.Guze, S. B. (1995). Diagnostic and statistical manual of mental disorders, (DSM-IV). American Journal of Psychiatry,152(8), 1228–1228. 10.1176/ajp.152.8.1228 [Google Scholar]
  • 62.Lazarus, R. S. (1993). From psychological stress to the emotions: A history of changing outlooks. Annual Review of Psychology,44, 1–21. 10.1146/annurev.ps.44.020193.000245 [DOI] [PubMed] [Google Scholar]
  • 63.Lazarus, R. S. (1966). Psychological Stress and the coping process. McGraw-Hill. [Google Scholar]
  • 64.Shabani, A., Masoumian, S., Zamirinejad, S., Hejri, M., Pirmorad, T., & Yaghmaeezadeh, H. (2021). Psychometric properties of structured clinical interview for DSM-5 disorders-clinician version (SCID-5-CV). Brain and Behavior: A Cognitive Neuroscience Perspective,11(5), e01894. 10.1002/brb3.1894 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Mokkink, L. B., Terwee, C. B., Patrick, D. L., Alonso, J., Stratford, P. W., Knol, D. L., Bouter, L. M., & de Vet, H. C. (2010). The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. Journal of Clinical Epidemiology,63(7), 737–745. 10.1016/j.jclinepi.2010.02.006 [DOI] [PubMed] [Google Scholar]
  • 66.Selby, P., & Velikova, G. (2018). Taking patient reported outcomes centre stage in cancer research—why has it taken so long? Research Involvement and Engagement,4, 25. 10.1186/s40900-018-0109-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Moreira, R. S., Bassolli, L., Coutinho, E., Ferrer, P., Braganca, E. O., Carvalho, A. C., de Paola, A. A., & Luna Filho, B. (2016). Reproducibility and reliability of the quality of life questionnaire in patients with atrial fibrillation. Arquivos Brasileiros de Cardiologia,106(3), 171–181. 10.5935/abc.20160026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Stark, S., Chernyshenko, O. S., & Drasgow, F. (2006). Detecting differential item functioning with confirmatory factor analysis and item response theory: Toward a unified strategy. Journal of Applied Psychology,91(6), 1292–1306. 10.1037/0021-9010.91.6.1292 [DOI] [PubMed] [Google Scholar]
  • 69.Steel, J. L., Dunlavy, A. C., Stillman, J., & Pape, H. C. (2011). Measuring depression and PTSD after trauma: Common scales and checklists. Injury,42(3), 288–300. 10.1016/j.injury.2010.11.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Weathers, F. W., Bovin, M. J., Lee, D. J., Sloan, D. M., Schnurr, P. P., Kaloupek, D. G., Keane, T. M., & Marx, B. P. (2018). The clinician-administered PTSD scale for DSM-5 (CAPS-5): Development and initial psychometric evaluation in military veterans. Psychological Assessment,30(3), 383–395. 10.1037/pas0000486 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Terwee, C. B., Dekker, F. W., Wiersinga, W. M., Prummel, M. F., & Bossuyt, P. M. (2003). On assessing responsiveness of health-related quality of life instruments: Guidelines for instrument evaluation. Quality of Life Research,12(4), 349–362. 10.1023/a:1023499322593 [DOI] [PubMed] [Google Scholar]
  • 72.Greenhalgh, J. (2009). The applications of PROs in clinical practice: What are they, do they work, and why? Quality of Life Research,18(1), 115–123. 10.1007/s11136-008-9430-6 [DOI] [PubMed] [Google Scholar]
  • 73.Black, N. (2013). Patient reported outcome measures could help transform healthcare. BMJ,346, f167. 10.1136/bmj.f167 [DOI] [PubMed] [Google Scholar]
  • 74.Byrom, B., Doll, H., Muehlhausen, W., Flood, E., Cassedy, C., McDowell, B., Sohn, J., Hogan, K., Belmont, R., Skerritt, B., & McCarthy, M. (2018). Measurement equivalence of patient-reported outcome measure response scale types collected using bring your own device compared to paper and a provisioned device: Results of a randomized equivalence trial. Value in Health,21(5), 581–589. 10.1016/j.jval.2017.10.008 [DOI] [PubMed] [Google Scholar]
  • 75.Dilcen, H. Y., Türkmen, H. Y., & Erdoğan, A. (2021). Validity and reliability study of the childbirth trauma index. Journal of Nursing Measurement. 10.1891/jnm-d-20-00103 [DOI] [PubMed] [Google Scholar]
  • 76.Kassongo, A. C. J. (2017). Validation studies of the traumatic event scale—childbirth, in the reality of angolan women. Universidade Autonoma de Lisboa. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The data that support the findings of this review are available on request from the corresponding author.


Articles from Quality of Life Research are provided here courtesy of Springer

RESOURCES